Suffix Trees

Description

Suffix Trees
msladey
Flashcards by msladey, updated more than 1 year ago
msladey
Created by msladey about 9 years ago
10
0

Resource summary

Question Answer
What is the text indexing problem? Matching a pattern string P of length m to a text string T of length n. Finding all locations where P matches T.
For exact pattern matching, what is the time complexity of the best known algorithm? O(n) (KMP)
List 4 properties of an atomic suffix tree 1. The suffix tree contains every suffix of T as a root to leaf path. 2. Every edge is labelled with a character from T 3. No two edges leaving the same node have the same label. 4. Each leaf corresponds to a suffix (so there are n leaves)
How do you find a pattern in an atomic suffix tree? Start at the root and walk down the tree. Matches occur at the leaves of the subtree.
How does a compacted suffix tree differ from an atomic suffix tree? Each non-branching path in an atomic suffix tree is replaced with a single edge. Edges are labelled with substrings.
List 6 properties of a compacted suffix tree 1. It contains n leaves. 2. Every internal node has two or more children 3. Every edge is labelled with a substring 4. No two edges leaving the same node have the same first character 5. Each leaf is labelled with a location in T 6. Any root-to-leaf path spells out corresponding suffix
How can we construct a compacted suffix tree in O(n^2) time? Insert suffixes one at a time. Search for the new suffix in the partial suffix tree. Add a new edge and leaf for the new suffix (may require the breaking of an edge into two).
What are the time complexities of a Compacted Suffix Tree? O(n) space. O(m+occ) matching time, O(n) building time
How can we make sure that a compacted suffix tree always exists? Add a unique symbol $ to the end of the text
How can we construct a suffix array in O(n) time given a suffix tree? Perform a depth first search in lexicographical order, reporting the index corresponding to the leaf.
Show full summary Hide full summary

Similar

Mathematical Preliminaries
msladey
Suffix Arrays
msladey
Hamming Distance
msladey
Range Minimum Query
msladey
Data Structures & Algorithms
Reuben Caruana
Computer science unit 2
Somto Ibeme
Algorithms ♡
lauren ♥
Computational Thinking ♡
lauren ♥
Searching and Sorting Algorithms
Josh Calvert
Systems Software Revision
cocacolai
Grafy I.
Michal Roch