PseUpred-ELPSO Is an Ensemble Learning Predictor with Particle Swarm Optimizer for Improving the Prediction of RNA Pseudouridine Sites

BIOLOGY-BASEL(2024)

引用 0|浏览0
暂无评分
摘要
Simple Summary RNA pseudouridine modifications are present in various RNAs across different organisms and play crucial roles in regulating gene expression during biological processes. The accurate identification of pseudouridine sites within RNA sequences is essential for understanding their functional mechanisms. This study proposes a novel ensemble learning predictor named PseUpred-ELPSO, which accurately predicts RNA pseudouridine sites. The predictor demonstrates excellent performance in both cross-validation and independent testing. A user-friendly web server has been established, making it a powerful tool for pseudouridine site identification.Abstract RNA pseudouridine modification exists in different RNA types of many species, and it has a significant role in regulating the expression of biological processes. To understand the functional mechanisms for RNA pseudouridine sites, the accurate identification of pseudouridine sites in RNA sequences is essential. Although several fast and inexpensive computational methods have been proposed, the challenge of improving recognition accuracy and generalization still exists. This study proposed a novel ensemble predictor called PseUpred-ELPSO for improved RNA pseudouridine site prediction. After analyzing the nucleotide composition preferences between RNA pseudouridine site sequences, two feature representations were determined and fed into the stacking ensemble framework. Then, using five tree-based machine learning classifiers as base classifiers, 30-dimensional RNA profiles are constructed to represent RNA sequences, and using the PSO algorithm, the weights of the RNA profiles were searched to further enhance the representation. A logistic regression classifier was used as a meta-classifier to complete the final predictions. Compared to the most advanced predictors, the performance of PseUpred-ELPSO is superior in both cross-validation and the independent test. Based on the PseUpred-ELPSO predictor, a free and easy-to-operate web server has been established, which will be a powerful tool for pseudouridine site identification.
更多
查看译文
关键词
pseudouridine,machine learning,RNA profile,One-Hot Encoding,K-mer,PSO
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要