Random Forest in Splice Site Prediction of Human Genome

XIV MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING 2016(2016)

引用 6|浏览1
暂无评分
摘要
With the rapid growth of huge amounts of DNA sequence, genes identification has become an important task in bioinformatics. To detect genes, it is important to accurately predict splice sites, i.e. exon intron boundaries. Moreover, in biology where structures are described by a large number of features as splice sites, the feature selection is an important step toward the classification task. It provides useful biological knowledge and allows for a faster and better classification. Feature selection techniques can be divided into two groups: feature-ranking and feature-subset selection. This paper investigates the performance of combining support vector machine (SVM) with two different feature ranking methods, namely F-score and Random Forest feature ranking competitively in splice site detection of Human genome. Also a new classification method based on Random Forest for splice site prediction is presented.
更多
查看译文
关键词
Random forest, Feature ranking, Splice site prediction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要