Data Fusion Analysis for Determining Localization of Proteins Associated to Escherichia Coli

2022 IEEE COLOMBIAN CONFERENCE ON APPLICATIONS OF COMPUTATIONAL INTELLIGENCE (COLCACI 2022)(2022)

Abstract
In recent years, interest in protein analysis based on biomolecular features has grown rapidly. This has prompted exploration of machine learning models as an alternative for addressing the problems associated with these analyses. Support vector machines, artificial neural networks, and random forests were compared for the prediction of protein localization. Two main sources of data were used to train the models: information from the targeting signal and information from the protein sequences. A third scenario employed a fusion of both data sources. Four classes were established according to the subcellular localization of the protein: cytoplasm, periplasmic space, outer membrane, and inner membrane. Balanced accuracy ranged from 77% to 92%. The best-performing models were based on random forests and support vector machines.
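The abstract does not say how the two data sources were fused or how balanced accuracy was computed. As a minimal sketch, assuming feature-level fusion by simple concatenation and the usual definition of balanced accuracy as the mean of per-class recall, the two ideas could look like this. The function names and the class label strings are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def fuse_features(signal_features, sequence_features):
    """Assumed fusion scheme: concatenate both feature vectors per protein."""
    if len(signal_features) != len(sequence_features):
        raise ValueError("both sources must describe the same proteins")
    return [list(a) + list(b) for a, b in zip(signal_features, sequence_features)]


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall over the classes that occur in y_true."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    correct = defaultdict(int)  # correctly predicted samples per true class
    total = defaultdict(int)    # all samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


# Hypothetical labels: four cytoplasmic proteins and two periplasmic ones.
y_true = ["cytoplasm"] * 4 + ["periplasm"] * 2
y_pred = ["cytoplasm"] * 4 + ["periplasm", "cytoplasm"]
score = balanced_accuracy(y_true, y_pred)
```

With imbalanced classes, balanced accuracy weights each localization site equally, so a model that favors the majority class is penalized more than plain accuracy would suggest.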
Keywords
Bioinformatics, Proteomics, Proteins, Machine Learning, Localization Sites Prediction