Delete Duplicate Folders in System - Problem

Due to a bug, there are many duplicate folders in a file system. You are given a 2D array paths, where paths[i] is an array representing an absolute path to the ith folder in the file system.

For example, ["one", "two", "three"] represents the path "/one/two/three".

Two folders (not necessarily on the same level) are identical if they contain the same non-empty set of identical subfolders and underlying subfolder structure. The folders do not need to be at the root level to be identical. If two or more folders are identical, then mark the folders as well as all their subfolders.

Once all the identical folders and their subfolders have been marked, the file system will delete all of them. The file system only runs the deletion once, so any folders that become identical after the initial deletion are not deleted.

Return the 2D array ans containing the paths of the remaining folders after deleting all the marked folders. The paths may be returned in any order.

Input & Output

Example 1 — Basic Duplicate Detection
$ Input: paths = [["a"], ["c"], ["d"], ["a", "b"], ["c", "b"], ["d", "a"]]
Output: [["d"], ["d", "a"]]
💡 Note: Folders /a and /c both contain subfolder "b", making them identical structures. Both /a and /c (and their subfolders) are deleted, leaving only /d and /d/a.
Example 2 — No Duplicates
$ Input: paths = [["a"], ["c"], ["a", "b"], ["c", "d"]]
Output: [["a"], ["c"], ["a", "b"], ["c", "d"]]
💡 Note: Folder /a contains subfolder "b" while /c contains subfolder "d". Since their structures are different, no folders are deleted.
Example 3 — Multiple Level Duplicates
$ Input: paths = [["a"], ["b"], ["a", "x"], ["a", "x", "y"], ["b", "x"], ["b", "x", "y"]]
Output: []
💡 Note: Folders /a and /b have identical structure (both contain x/y subfolder tree). All folders are deleted as duplicates.

Constraints

  • 1 ≤ paths.length ≤ 2 × 104
  • 1 ≤ paths[i].length ≤ 500
  • 1 ≤ paths[i][j].length ≤ 10
  • paths[i][j] consists of lowercase English letters

Visualization

Tap to expand
Delete Duplicate Folders in SystemInput: [["a"], ["c"], ["d"], ["a","b"], ["c","b"], ["d","a"]]/ahas: b/chas: b/dhas: aIdentical Structures Detected!/a and /c both contain subfolder "b"/aDELETED/cDELETED/dKEPTOutput: [["d"], ["d","a"]]Only unique folder structures remain
Understanding the Visualization
1
Input
File system with folder paths including duplicates
2
Process
Identify folders with identical subfolder structures
3
Output
Remaining folders after deleting all duplicates
Key Takeaway
🎯 Key Insight: Use hash-based signatures to efficiently identify folders with identical subfolder structures
Asked in
Google 15 Microsoft 12 Amazon 8
18.5K Views
Medium Frequency
~35 min Avg. Time
425 Likes
Ln 1, Col 1
Smart Actions
💡 Explanation
AI Ready
💡 Suggestion Tab to accept Esc to dismiss
// Output will appear here after running code
Code Editor Closed
Click the red button to reopen