Given a string paragraph and a string array of the banned words banned, return the most frequent word that is not banned. It is guaranteed there is at least one word that is not banned, and that the answer is unique.

The words in paragraph are case-insensitive and the answer should be returned in lowercase.
Example 1:
Input: paragraph = "Bob hit a ball, the hit BALL flew far after it was hit.", banned = ["hit"] Output: "ball" Explanation: "hit" occurs 3 times, but it is a banned word. "ball" occurs twice (and no other word does), so it is the most frequent non-banned word in the paragraph. Note that words in the paragraph are not case sensitive, that punctuation is ignored (even if adjacent to words, such as "ball,"), and that "hit" isn't the answer even though it occurs more because it is banned.
Example 2:
Input: paragraph = "a.", banned = [] Output: "a"
Constraints:
1 <= paragraph.length <= 1000
paragraph consists of English letters, space ' ', or one of the symbols: "!?',;.".
0 <= banned.length <= 100
1 <= banned[i].length <= 10
banned[i] consists of only lowercase English letters.

This problem involves finding the most frequent word in a given paragraph, excluding a list of banned words. The solution involves several steps:
Preprocessing: Clean the input paragraph by converting it to lowercase and replacing punctuation with spaces. This ensures case-insensitivity and avoids counting variations of the same word differently.
Word Counting: Count the occurrences of each word in the cleaned paragraph. A hash map (dictionary in Python) is ideal for this, storing words as keys and their counts as values.
Banned Word Filtering: Exclude words from the count that appear in the banned list. A set (or HashSet) is efficient for checking if a word is banned.
Finding the Most Frequent Word: Iterate through the word counts and identify the word with the highest count. A short walk-through of these steps on Example 1 follows below.
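To make the pipeline concrete, here is a minimal Python walk-through of the four steps on Example 1's input (variable names here are illustrative; the complete solutions follow below):

import re
from collections import Counter

paragraph = "Bob hit a ball, the hit BALL flew far after it was hit."
banned = ["hit"]

# Step 1: Preprocessing - lowercase and turn punctuation into spaces
cleaned = re.sub(r"[^\w\s]", " ", paragraph).lower()

# Step 2: Word Counting - hash map of word -> occurrences
counts = Counter(cleaned.split())  # Counter({'hit': 3, 'ball': 2, ...})

# Step 3: Banned Word Filtering - set gives O(1) membership checks
banned_set = set(banned)

# Step 4: Most Frequent Word - highest count among non-banned words
answer = max((w for w in counts if w not in banned_set), key=counts.get)
print(answer)  # -> "ball"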
Time and Space Complexity Analysis:
Time: O(N + M), where N is the length of paragraph and M is the total length of the banned list. The preprocessing step (cleaning the paragraph) takes O(N) time. Building the word count takes O(N) time in the average case (hash table lookups are typically O(1)). Checking if a word is banned takes O(1) on average using a set. Finally, finding the most frequent word takes O(N) in the worst case, if we need to iterate over all words.

Space: O(N + M). The word counts take O(N) space in the worst case, and the banned words set is O(M). Therefore, the overall space complexity is dominated by O(N).

Code Examples:
The solutions below demonstrate different approaches using several programming languages. Note that some optimizations are used to improve efficiency, but the core logic remains consistent.
Python:
import re
from collections import Counter
from typing import List

class Solution:
    def mostCommonWord(self, paragraph: str, banned: List[str]) -> str:
        # Preprocessing: lowercase and replace punctuation with spaces.
        # Replacing punctuation with '' would wrongly merge adjacent words
        # (e.g. "b,b" would become "bb"), so a space is used instead.
        cleaned_paragraph = re.sub(r'[^\w\s]', ' ', paragraph).lower()
        # Word Counting
        word_counts = Counter(cleaned_paragraph.split())
        # Finding the Most Frequent Non-Banned Word
        banned_set = set(banned)
        for word, count in word_counts.most_common():
            if word not in banned_set:
                return word
        return ""  # unreachable: a non-banned word is guaranteed to exist
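A quick sanity check against both examples (run as a plain script alongside the class above):

print(Solution().mostCommonWord(
    "Bob hit a ball, the hit BALL flew far after it was hit.", ["hit"]))  # ball
print(Solution().mostCommonWord("a.", []))  # a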
Java:
import java.util.*;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
class Solution {
public String mostCommonWord(String paragraph, String[] banned) {
// Preprocessing
paragraph = paragraph.toLowerCase();
Pattern p = Pattern.compile("[a-zA-Z]+"); // Matches only alphabetic words
Matcher m = p.matcher(paragraph);
Map<String, Integer> wordCounts = new HashMap<>();
Set<String> bannedSet = new HashSet<>(Arrays.asList(banned));
// Word Counting and Filtering
while (m.find()) {
String word = m.group();
if (!bannedSet.contains(word)) {
wordCounts.put(word, wordCounts.getOrDefault(word, 0) + 1);
}
}
// Finding Most Frequent Word
String mostFrequent = "";
int maxCount = 0;
for (Map.Entry<String, Integer> entry : wordCounts.entrySet()) {
if (entry.getValue() > maxCount) {
maxCount = entry.getValue();
mostFrequent = entry.getKey();
}
}
return mostFrequent;
}
}
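One design difference worth noting: the Java version filters banned words while counting, whereas the Python version counts every word and filters when selecting the answer. Both are correct; filtering during counting keeps the map slightly smaller, while filtering at the end keeps the counting loop simpler.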
Other Languages: The core logic is similar across other languages (C++, Go, TypeScript, Rust), with variations in library usage and syntax. The key is to use efficient data structures (hash maps/dictionaries and sets) and appropriate string manipulation techniques.