Regenotyper: Detecting Mislabeled Samples In Genetic Data

Konrad Zych,Basten L Snoek,Mark Elvin,Miriam Rodriguez,K Joeri Van der Velde,Danny Arends,Harm-Jan Westra,Morris A Swertz,Gino Poulin,Jan E Kammenga,Rainer Breitling,Ritsert C Jansen,Yang Li

PLOS ONE（2017）

引用 14|浏览43

暂无评分

摘要

In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the "ideal" genotype and identify "best-matched " labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a "data cleaning" step before standard data analysis.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要