Extremely sparse models of linkage disequilibrium in ancestrally diverse association studies

bioRxiv (Cold Spring Harbor Laboratory)(2022)

引用 2|浏览18
暂无评分
摘要
Linkage disequilibrium (LD) is the correlation among nearby genetic variants. In genetic association studies, LD is often modeled using massive local correlation matrices, but this approach is slow, especially in ancestrally diverse studies. Here, we introduce LD graphical models (LDGMs), which are an extremely sparse and efficient representation of LD. LDGMs are derived from genome-wide genealogies; statistical relationships among alleles in the LDGM correspond to genealogical relationships among haplotypes. We publish LDGMs and ancestry specific LDGM precision matrices for 18 million common SNPs (MAF>1%) in five ancestry groups, validate their accuracy, and demonstrate order-of-magnitude improvements in runtime for commonly used LD matrix computations. We implement an extremely fast multi-ancestry polygenic prediction method, BLUPx-ldgm , which performs better than a similar method based on the reference LD correlation matrix. LDGMs will enable sophisticated methods that scale to ancestrally genetic association data across millions of variants and individuals. ### Competing Interest Statement BMN is a member of the scientific advisory board at Deep Genomics and Neumora, consultant of the scientific advisory board for Camp4 Therapeutics and consultant for Merck.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要