谷歌浏览器插件
订阅小程序
在清言上使用

Essentiality, Protein-Protein Interactions and Evolutionary Properties are Key Predictors for Identifying Cancer Genes Using Machine Learning

bioRxiv(2021)

引用 0|浏览14
暂无评分
摘要
The identification of genes that may be linked to cancer is of great importance for the discovery of new drug targets. The rate at which cancer genes are being found experimentally is slow, however, due to the complexity of the identification and confirmation process, giving a narrow range of therapeutic targets to investigate and develop. One solution to this problem is to use predictive analysis techniques that can accurately identify cancer gene candidates in a timely fashion. Furthermore, the effort in identifying characteristics that are linked to cancer genes is crucial to further our understanding of this disease. These characteristics can be employed in recognising therapeutic drug targets. Here, we investigated whether certain genes’ properties can indicate the likelihood of it to be involved in the initiation or progression of cancer. We found that for cancer, the essentiality scores tend to be higher for cancer genes than for all protein coding human genes. A machine-learning model was developed and we found that essentiality related properties and properties arising from protein-protein interaction networks or evolution are particularly effective in predicting cancer-associated genes. We were also able to identify potential drug targets that have not been previously linked with cancer, but have the characteristics of cancer-related genes. Author Summary Mutations in numerous genes are known to be involved in cancer, yet there are undoubtedly many more to be discovered. We analysed a set of hundreds of cancer genes with the aim of finding out what makes them different from genes not known to be mutated in cancer. In particular, we found that genes that are essential for the survival of an organism are more likely to be involved in cancer. We used the gene properties that we examined to develop an artificial intelligence method that can accurately predict whether a gene is involved in cancer or not. Applying the method gives hundreds of non-cancer genes that resemble cancer genes. New discoveries of cancer genes are likely to be found within this set. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
cancer genes,machine learning,protein-protein
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要