Probabilistic Models of k-mer Frequencies (Extended Abstract)

CONNECTING WITH COMPUTABILITY(2021)

引用 0|浏览2
暂无评分
摘要
In this article, we review existing probabilistic models for modeling abundance of fixed-length strings (k-mers) in DNA sequencing data. These models capture dependence of the abundance on various phenomena, such as the size and repeat content of the genome, heterozygosity levels, and sequencing error rate. This in turn allows to estimate these properties from k-mer abundance histograms observed in real data. We also briefly discuss the issue of comparing k-mer abundance between related sequencing samples and meaningfully summarizing the results.
更多
查看译文
关键词
k-mer abundance, DNA sequencing, Genome size
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要