COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study.

Journal of computational biology : a journal of computational molecular cell biology(2010)

引用 29|浏览54
暂无评分
摘要
The availability of high-density single nucleotide polymorphisms (SNPs) data has made genome-wide association study computationally challenging. Two-locus epistasis (gene-gene interaction) detection has attracted great research interest as a promising method for genetic analysis of complex diseases. In this article, we propose a general approach, COE, for efficient large scale gene-gene interaction analysis, which supports a wide range of tests. In particular, we show that many commonly used statistics are convex functions. From the observed values of the events in two-locus association test, we can develop an upper bound of the test value. Such an upper bound only depends on single-locus test and the genotype of the SNP-pair. We thus group and index SNP-pairs by their genotypes. This indexing structure can benefit the computation of all convex statistics. Utilizing the upper bound and the indexing structure, we can prune most of the SNP-pairs without compromising the optimality of the result. Our approach is especially efficient for large permutation test. Extensive experiments demonstrate that our approach provides orders of magnitude performance improvement over the brute force approach.
更多
查看译文
关键词
brute force approach,indexing structure,single-locus test,convex function,disease association study,general approach,two-locus association test,test value,efficient genome-wide two-locus epistasis,efficient large scale gene-gene,large permutation test,convex statistic,indexation,permutation test,upper bound,genome wide association study,single nucleotide polymorphism
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要