Find Duplicate Subtrees

Given the root of a binary tree, return all duplicate subtrees . For each kind of duplicate subtrees, you only need to return the root node of any one of them. Two trees are duplicate if they have the same structure with the same node values . Example 1: Input: root = [1,2,3,4,null,2,4,null,null,4] Output: [[2,4],[4]] Example 2: Input: root = [2,1,1] Output: [[1]] Example 3: Input: root = [2,2,2,3,null,3,null] Output: [[2,3],[3]] Constraints: The number of the nodes in the tree will be in the range [1, 5000] -200 <= Node.val <= 200

Problem: Find Duplicate Subtrees This problem asks to find all duplicate subtrees within a given binary tree. Two trees are considered duplicates if they have the same structure and node values. The solution should return a list containing the root node of each unique duplicate subtree. Approach The most efficient approach uses Depth-First Search (DFS) and a hash table (or map) to identify duplicate subtrees. Tree Serialization: We need a way to represent each subtree as a unique string. A common technique is to serialize the tree using a pre-order traversal, representing each node as node.val,left_subtree_serialization,right_subtree_serialization. This string uniquely identifies the subtree's structure and values. DFS Traversal: Perform a DFS traversal of the binary tree. For each node, recursively serialize its subtree using the method above. Hash Table (Map): Use a hash table (map) to store the serialized strings as keys and their counts as values. For each serialized string encountered during DFS, increment its count in the hash table. If the count becomes 2, it indicates a duplicate subtree, and its root node is added to the result list. Time and Space Complexity Time Complexity: O(N), where N is the number of nodes in the tree. The DFS traversal visits each node once, and serialization/hash table operations take constant time per node. Space Complexity: O(N) in the worst case, to store the serialized strings in the hash table. In the best case, if there are no duplicates, the space complexity is O(log N) (for balanced trees) or O(N) (for skewed trees) to maintain the recursion stack during DFS. Code Explanation (Python) # Definition for a binary tree node. # class TreeNode: # def __init__(self, val=0, left=None, right=None): # self.val = val # self.left = left # self.right = right class Solution: def findDuplicateSubtrees(self, root: TreeNode) -> List[TreeNode]: count = collections.Counter() # Hash table to store serialized subtrees and counts result = [] # list to store the root of the duplicate subtrees def dfs(node): if not node: return "#" # Empty subtree represented by "#" subtree_str = f"{node.val},{dfs(node.left)},{dfs(node.right)}" # subtree serialization using preorder traversal count[subtree_str] += 1 if count[subtree_str] == 2: # If count becomes 2, it's a duplicate result.append(node) return subtree_str dfs(root) return result :root {--copy-icon: url("data:image/svg+xml,%3Csvg xmlns='http://www.w3.org/2000/svg' viewBox='0 0 48 48'%3E%3Cpath fill='%23adadad' d='M16.187 9.5H12.25a1.75 1.75 0 0 0-1.75 1.75v28.5c0 .967.784 1.75 1.75 1.75h23.5a1.75 1.75 0 0 0 1.75-1.75v-28.5a1.75 1.75 0 0 0-1.75-1.75h-3.937a4.25 4.25 0 0 1-4.063 3h-7.5a4.25 4.25 0 0 1-4.063-3M31.813 7h3.937A4.25 4.25 0 0 1 40 11.25v28.5A4.25 4.25 0 0 1 35.75 44h-23.5A4.25 4.25 0 0 1 8 39.75v-28.5A4.25 4.25 0 0 1 12.25 7h3.937a4.25 4.25 0 0 1 4.063-3h7.5a4.25 4.25 0 0 1 4.063 3M18.5 8.25c0 .966.784 1.75 1.75 1.75h7.5a1.75 1.75 0 1 0 0-3.5h-7.5a1.75 1.75 0 0 0-1.75 1.75'/%3E%3C/svg%3E");--success-icon: url("data:image/svg+xml,%3Csvg xmlns='http://www.w3.org/2000/svg' viewBox='0 0 24 24'%3E%3Cpath fill='%2366ff85' d='M9 16.17L5.53 12.7a.996.996 0 1 0-1.41 1.41l4.18 4.18c.39.39 1.02.39 1.41 0L20.29 7.71a.996.996 0 1 0-1.41-1.41z'/%3E%3C/svg%3E");}pre:has(code) {position: relative;}pre button.rehype-pretty-copy {right: 1px;padding: 0;width: 24px;height: 24px;display: flex;margin-top: 2px;margin-right: 8px;position: absolute;border-radius: 25%;backdrop-filter: blur(3px);& span {width: 100%;aspect-ratio: 1 / 1;}& .ready {background-image: var(--copy-icon);}& .success {display: none; background-image: var(--success-icon);}}&.rehype-pretty-copied {& .success {display: block;} & .ready {display: none;}}pre button.rehype-pretty-copy.rehype-pretty-copied {opacity: 1;& .ready { display: none; }& .success { display: block; }} The Python code efficiently implements the described approach. The dfs function recursively serializes subtrees and updates the count dictionary. The result list stores the root nodes of duplicate subtrees as soon as they are identified. Code in Other Languages The approach is similar in other languages; the primary difference lies in the syntax for hash tables/maps and string manipulation. The provided code examples in Java, C++, Go, TypeScript, and Rust showcase these adaptations. All of them adhere to the same algorithmic strategy.

Also Explore

DSA Questions

Maximum Average Subarray II

DSA Questions

Set Mismatch

DSA Questions

Maximum Length of Pair Chain

DSA Questions

Palindromic Substrings

DSA Questions

Replace Words

DSA Questions

Dota2 Senate

DSA Questions

2 Keys Keyboard

DSA Questions

4 Keys Keyboard

DSA Questions

Find Duplicate Subtrees

DSA Questions

Two Sum IV - Input is a BST

DSA Questions

Maximum Binary Tree

DSA Questions

Print Binary Tree

DSA Questions

Coin Path

DSA Questions

Robot Return to Origin

DSA Questions

Find K Closest Elements

DSA Questions

Split Array into Consecutive Subsequences

DSA Questions

Find Duplicate Subtrees

Problem: Find Duplicate Subtrees

Approach

Time and Space Complexity

Code Explanation (Python)

Code in Other Languages

On This Page

Also Explore

Maximum Average Subarray II

Set Mismatch

Maximum Length of Pair Chain

Palindromic Substrings

Replace Words

Dota2 Senate

2 Keys Keyboard

4 Keys Keyboard

Find Duplicate Subtrees

Two Sum IV - Input is a BST

Maximum Binary Tree

Print Binary Tree

Coin Path

Robot Return to Origin

Find K Closest Elements

Split Array into Consecutive Subsequences

Remove 9