Sampled suffix array with minimizers.

Software: Practice and Experience (2017)

Abstract
Sampling (evenly) the suffixes from the suffix array is an old idea trading the pattern search time for reduced index space. A few years ago Claude et al. showed an alphabet sampling scheme allowing for more efficient pattern searches compared with the sparse suffix array, for long enough patterns. A drawback of their approach is the requirement that sought patterns need to contain at least one character from the chosen subalphabet. In this work, we propose an alternative suffix sampling approach with only a minimum pattern length as a requirement, which is more convenient in practice. Experiments show that our algorithm (in a few variants) achieves competitive time-space tradeoffs on most standard benchmark data. Copyright (C) 2017 John Wiley & Sons, Ltd.
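The sampling idea can be illustrated with a small sketch. This is not the authors' implementation; it assumes one common minimizer scheme in which, for every window of q characters, the start of the leftmost lexicographically smallest p-mer is sampled, and only the suffixes at sampled positions are kept. The names `minimizer_offset`, `build_sampled_sa`, and `find_all`, and the parameters `q` and `p`, are hypothetical and chosen for illustration. Under this assumption a pattern of length at least q shares its first window with every occurrence in the text, so the sampled position can be recomputed from the pattern alone.

```python
def minimizer_offset(window: str, p: int) -> int:
    """Offset of the leftmost lexicographically smallest p-mer in `window`."""
    best = 0
    for i in range(1, len(window) - p + 1):
        if window[i:i + p] < window[best:best + p]:
            best = i
    return best


def build_sampled_sa(text: str, q: int, p: int) -> list[int]:
    """Sample one minimizer position per q-window, then sort the sampled suffixes.

    Naive construction for illustration only (assumed scheme, not the paper's code).
    """
    sampled = set()
    for t in range(len(text) - q + 1):
        sampled.add(t + minimizer_offset(text[t:t + q], p))
    return sorted(sampled, key=lambda i: text[i:])


def find_all(text: str, ssa: list[int], pattern: str, q: int, p: int) -> list[int]:
    """Return all start positions of `pattern` in `text`; requires len(pattern) >= q."""
    if len(pattern) < q:
        raise ValueError("pattern must be at least q characters long")
    # The minimizer of the pattern's first window marks a sampled text position.
    off = minimizer_offset(pattern[:q], p)
    tail = pattern[off:]
    # Binary search for the first sampled suffix whose prefix is >= tail.
    lo, hi = 0, len(ssa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[ssa[mid]:ssa[mid] + len(tail)] < tail:
            lo = mid + 1
        else:
            hi = mid
    # Scan the run of suffixes starting with tail and verify the pattern prefix.
    hits = []
    while lo < len(ssa) and text.startswith(tail, ssa[lo]):
        start = ssa[lo] - off
        if start >= 0 and text.startswith(pattern, start):
            hits.append(start)
        lo += 1
    return sorted(hits)


text = "abracadabra_abracadabra"
ssa = build_sampled_sa(text, q=4, p=2)
occurrences = find_all(text, ssa, "cadabra", q=4, p=2)
```

In this sketch the only restriction on queries is the minimum pattern length q, which mirrors the requirement stated in the abstract; the construction and sorting are deliberately simple and do not reflect the engineering of the variants evaluated in the paper.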
Keywords
full-text indexing, sparse suffix array, sampled suffix array, minimizers