Substring with Concatenation of All Words

You are given a string s and an array of strings words. All the strings of words are of the same length.

A concatenated string is a string that exactly contains all the strings of any permutation of words concatenated.

For example, if words = ["ab","cd","ef"], then "abcdef", "abefcd", "cdabef", "cdefab", "efabcd", and "efcdab" are all concatenated strings. "acdbef" is not a concatenated string because it is not the concatenation of any permutation of words.

Return an array of the starting indices of all the concatenated substrings in s. You can return the answer in any order.

Example 1:
Input: s = "barfoothefoobarman", words = ["foo","bar"]
Output: [0,9]
Explanation: The substring starting at 0 is "barfoo". It is the concatenation of ["bar","foo"], which is a permutation of words. The substring starting at 9 is "foobar". It is the concatenation of ["foo","bar"], which is a permutation of words.

Example 2:
Input: s = "wordgoodgoodgoodbestword", words = ["word","good","best","word"]
Output: []
Explanation: There is no concatenated substring.

Example 3:
Input: s = "barfoofoobarthefoobarman", words = ["bar","foo","the"]
Output: [6,9,12]
Explanation: The substring starting at 6 is "foobarthe". It is the concatenation of ["foo","bar","the"]. The substring starting at 9 is "barthefoo". It is the concatenation of ["bar","the","foo"]. The substring starting at 12 is "thefoobar". It is the concatenation of ["the","foo","bar"].

Constraints:
1 <= s.length <= 10^4
1 <= words.length <= 5000
1 <= words[i].length <= 30
s and words[i] consist of lowercase English letters.
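The definition of a concatenated string can be checked directly, which may help when reading the examples. The helper below is a minimal sketch, not part of the original solution; the name is_concatenation is an assumption made for illustration. It splits the candidate into word-sized chunks and compares the chunk counts with the counts of words.

```python
from collections import Counter

def is_concatenation(candidate, words):
    """Return True if candidate is some permutation of words joined together.

    Hypothetical helper for illustration; assumes words is non-empty and
    all words have the same length, as the problem guarantees.
    """
    word_len = len(words[0])
    if len(candidate) != word_len * len(words):
        return False
    # Cut the candidate into consecutive chunks of word_len characters.
    chunks = [candidate[i:i + word_len]
              for i in range(0, len(candidate), word_len)]
    # Order does not matter, only how often each word appears.
    return Counter(chunks) == Counter(words)

print(is_concatenation("cdefab", ["ab", "cd", "ef"]))  # True
print(is_concatenation("acdbef", ["ab", "cd", "ef"]))  # False
```

Comparing counts rather than sets matters when words contains duplicates, as in Example 2.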

Solution Explanation: Substring with Concatenation of All Words

This problem asks for all starting indices of substrings of s that are concatenations of all the words in the input array words, each used as many times as it appears in words. All words in words have the same length.

Approach: The optimal approach combines a sliding window with hash tables (dictionaries) for efficient word counting.

Algorithm:

Preprocessing: Create a hash table (cnt) that stores the frequency of each word in the words array. This allows the required count of a word to be looked up in O(1) on average.

Sliding Window: Let n be the number of words in words and k the length of each word. A valid substring has length n * k and consists of k-character words, so its word boundaries are fixed by its starting index modulo k. For each offset from 0 to k - 1, make one pass over s with a left and a right pointer that both advance k characters at a time.

Inner Window (Word-by-Word): In each pass, the right pointer reads the next k-character word and extends the window by that word.

Counting Words: A second hash table (cnt1) counts the occurrences of each word inside the current window.

Handling Mismatches: If the word just read is not in cnt, no valid window can contain it, so we clear cnt1 and restart the window immediately after that word. If the word is in cnt but its count in cnt1 now exceeds its count in cnt, we shrink the window from the left one word at a time, decrementing cnt1, until the count is valid again.

Validation: After these steps every word in the window is within its allowed count. If the window spans exactly n * k characters, cnt1 is therefore identical to cnt, and we add the window's starting index to the results.

Time Complexity: O(m*k), where m is the length of string s and k is the length of each word. There are k passes, one per offset. Each pass reads about m/k words, and slicing and hashing a word costs O(k). Within a pass the left pointer only moves forward, so the inner while loop that shrinks the window runs at most m/k times in total. Each pass is therefore O(m), and the k passes together are O(m*k).

Space Complexity: O(n*k), where n is the number of words and k is the length of each word.
This comes from storing the word counts in the hash tables cnt and cnt1.

Code Examples (Python):

```python
from collections import Counter

def findSubstring(s, words):
    word_len = len(words[0])
    num_words = len(words)
    total_len = word_len * num_words
    result = []
    word_counts = Counter(words)  # Preprocessing: required count of each word

    # Try every starting index that leaves room for a full-length window.
    for i in range(len(s) - total_len + 1):
        substring = s[i:i + total_len]
        current_counts = Counter()
        valid = True
        # Walk the window one word at a time.
        for j in range(0, total_len, word_len):
            word = substring[j:j + word_len]
            if word not in word_counts:
                valid = False
                break
            current_counts[word] += 1
            if current_counts[word] > word_counts[word]:
                valid = False
                break
        if valid and current_counts == word_counts:
            result.append(i)
    return result
```

This Python code is a simpler variant of the approach described above: it checks a fixed-size window at every starting index and counts the words in it with a hash table, rebuilding the counts for each window instead of reusing them. It is easier to read, but its time complexity is O(m*n*k) in the worst case rather than O(m*k), because each of the up to m windows examines up to n words of length k. The space complexity remains O(n*k).
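The O(m*k) sliding-window approach from the explanation can be sketched as follows. This is a minimal sketch under stated assumptions, not the original multi-language code: the function name find_substring_sliding and the variables left, right, and used are illustrative choices, while cnt and cnt1 follow the names used in the explanation.

```python
from collections import Counter

def find_substring_sliding(s, words):
    """Sliding-window sketch; assumes words is non-empty with equal-length words."""
    m, n, k = len(s), len(words), len(words[0])
    cnt = Counter(words)          # required count of each word
    result = []
    for offset in range(k):       # one pass per word alignment
        left = right = offset
        cnt1 = Counter()          # counts of words inside the window
        used = 0                  # number of words inside the window
        while right + k <= m:
            word = s[right:right + k]
            right += k
            if word not in cnt:
                # Unknown word: no valid window can contain it, so restart.
                cnt1.clear()
                used = 0
                left = right
                continue
            cnt1[word] += 1
            used += 1
            # Too many copies of word: drop words from the left.
            while cnt1[word] > cnt[word]:
                cnt1[s[left:left + k]] -= 1
                left += k
                used -= 1
            if used == n:
                result.append(left)
    return result

print(find_substring_sliding("barfoothefoobarman", ["foo", "bar"]))  # [0, 9]
```

Indices are collected offset by offset, so the result is not necessarily sorted; the problem accepts any order.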

Also Explore

DSA Questions

Generate Parentheses

Merge k Sorted Lists

Swap Nodes in Pairs

Reverse Nodes in k-Group

Remove Duplicates from Sorted Array

Remove Element

Find the Index of the First Occurrence in a String

Divide Two Integers

Substring with Concatenation of All Words

Next Permutation

Longest Valid Parentheses

Search in Rotated Sorted Array

Find First and Last Position of Element in Sorted Array

Search Insert Position

Valid Sudoku

Sudoku Solver
