Enhanced Parkinson’s Disease Genes Prediction

2022 18th International Computer Engineering Conference (ICENCO)(2022)

引用 0|浏览5
暂无评分
摘要
Identifying genes associated with Parkinson’s disease (PD) is an interesting research area that aids in the disease’s diagnosis and treatment. Most PD approaches were created only to find protein-coding genes and ignore long non-coding (lncRNA) genes, which are crucial for biological functions, disease transformation, and development. In this paper, the proposed system is developed to predict protein and lncRNA genes associated with PD, which consists of four steps. First, preprocessing step is to represent the genes as DNA FASTA sequences. Second, feature extraction methods are used to extract and discover the most discriminative features of the biological sequences, and the extracted features are fused. Third, the Adaboost technique is used as feature selection to reduce these fused feature vectors’ dimensionality. Finally, the gradient-boosted decision tree algorithm is used for diagnosing various test cases. The proposed prediction system achieves the best result compared with state-of-arts studies with an area under the curve (AUC) equal to 86.7%.
更多
查看译文
关键词
Parkinson’s Disease,Gene Analysis,Math-Feature,PyFeat,iLearn,Pse-in-One2.0,Adaboost(AB),Gradient Boosted Decision Tree(GBDT)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络