Protein-Protein interactions uncover candidate 'core genes' within omnigenic disease networks.

PLOS GENETICS(2020)

引用 12|浏览7
暂无评分
摘要
Author summary A recent theory suggests that only a small number of genes underpin the biology of a disease, these genes are called 'core genes', and for most diseases, these core genes remain unknown. The suggested methods for finding them requires complex and expensive experiments. We reasoned that if we merge currently available datasets in smart ways, we may be able to uncover these 'core genes'. Our method finds "hub" proteins by merging lists of genes previously linked with disease to information on how proteins interact with each other. We found that many of these hub proteins have central roles in disease, such as insulin for both A1C measurement and Type 2 Diabetes, BRCA1 in Breast cancer, and Amyloid Precursor Protein in Alzheimer's Disease. We think these 'hub' proteins are candidate 'core genes', and offer our method as a way to find 'core genes' by utilizing publicly available reference datasets. Genome wide association studies (GWAS) of human diseases have generally identified many loci associated with risk with relatively small effect sizes. The omnigenic model attempts to explain this observation by suggesting that diseases can be thought of as networks, where genes with direct involvement in disease-relevant biological pathways are named 'core genes', while peripheral genes influence disease risk via their interactions or regulatory effects on core genes. Here, we demonstrate a method for identifying candidate core genes solely from genes in or near disease-associated SNPs (GWAS hits) in conjunction with protein-protein interaction network data. Applied to 1,381 GWAS studies from 5 ancestries, we identify a total of 1,865 candidate core genes in 343 GWAS studies. Our analysis identifies several well-known disease-related genes that are not identified by GWAS, includingBRCA1in Breast Cancer, Amyloid Precursor Protein (APP) in Alzheimer's Disease,INSin A1C measurement and Type 2 Diabetes, andPCSK9in LDL cholesterol, amongst others. Notably candidate core genes are preferentially enriched for disease relevance over GWAS hits and are enriched for both Clinvar pathogenic variants and known drug targets-consistent with the predictions of the omnigenic model. We subsequently use parent term annotations provided by the GWAS catalog, to merge related GWAS studies and identify candidate core genes in over-arching disease processes such as cancer-where we identify 109 candidate core genes.
更多
查看译文
关键词
omnigenic disease networks,‘core genes,protein-protein
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要