MICon – A Contamination Detection Workflow for NGS Laboratories using Microhaplotype Locus and Supervised Learning.

The Journal of Molecular Diagnostics(2023)

引用 0|浏览4
暂无评分
摘要
Innovation in sequencing instrumentation is increasing the per-batch data volumes and decreasing the per-base costs. Multiplexed chemistry protocols after the addition of index tags have further contributed to efficient and cost-effective sequencer utilization. With these pooled processing strategies, however, comes an increased risk of sample contamination. Sample contamination poses a risk of missing critical variants in a patient sample or wrongly reporting variants derived from the contaminant, which are particularly relevant issues in oncology specimen testing where low variant allele frequencies have clinical relevance. Small custom targeted Next Generation Sequencing (NGS) panels yield limited variants and pose challenges in delineating true somatic variants versus contamination calls. A number of popular contamination identification tools have the ability to perform well in whole genome/exome sequencing data, but in smaller gene panels there are fewer variant candidates for the tools to perform accurately. To prevent clinical reporting of potentially contaminated samples in small NGS panels, we have developed MICon (Microhaplotype Contamination detection) - a novel contamination detection model that utilizes microhaplotype site variant allele frequencies. In a heterogeneous hold-out test cohort of 210 samples, the model demonstrated state-of-the-art performance with an AUROC of 0.995.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要