Robust in-silico identification of Cancer Cell Lines based on RNA and targeted DNA sequencing data

SCIENTIFIC REPORTS(2019)

Abstract
Cancer cell lines (CCLs) are an integral part of modern cancer research but are susceptible to misidentification. The increasing popularity of sequencing technologies motivates the in-silico identification of CCLs based on their mutational fingerprint, but care must be taken when identifying heterogeneous data. We recently developed the proof-of-concept Uniquorn 1 method, which could reliably identify heterogeneous sequencing data from selected sequencing technologies. Here we present Uniquorn 2, a generic and robust in-silico identification method for CCLs with DNA/RNA-seq and panel-seq information. We benchmarked Uniquorn 2 by cross-identifying 1612 RNA and 3596 panel-sized NGS profiles derived from 1516 CCLs, five repositories, four technologies and three major cancer panel designs. Our method achieves an accuracy of 96% for RNA-seq and 95% for mixed DNA-seq and RNA-seq identification. Even for a panel of only 94 cancer-related genes, accuracy remains at 82%, but it decreases with smaller panels. Uniquorn 2 is freely available as the R/Bioconductor package ‘Uniquorn’.
Keywords
Bioinformatics, Cancer, DNA sequencing, RNA sequencing, Targeted resequencing, Science, Humanities and Social Sciences, multidisciplinary