ProBAPred: Inferring protein-protein binding affinity by incorporating protein sequence and structural features.

JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY(2018)

引用 4|浏览28
暂无评分
摘要
Protein-protein binding interaction is the most prevalent biological activity that mediates a great variety of biological processes. The increasing availability of experimental data of protein-protein interaction allows a systematic construction of protein-protein interaction networks, significantly contributing to a better understanding of protein functions and their roles in cellular pathways and human diseases. Compared to well-established classification for protein- protein interactions (PPIs), limited work has been conducted for estimating protein-protein binding free energy, which can provide informative real-value regression models for characterizing the protein-protein binding affinity. In this study, we propose a novel ensemble computational framework, termed ProBAPred (Protein-protein Binding Affinity Predictor), for quantitative estimation of protein-protein binding affinity. A large number of sequence and structural features, including physical-chemical properties, binding energy and conformation annotations, were collected and calculated from currently available protein binding complex datasets and the literature. Feature selection based on the WEKA package was performed to identify and characterize the most informative and contributing feature subsets. Experiments on the independent test showed that our ensemble method achieved the lowest Mean Absolute Error (MAE; 1.657 kcal/mol) and the second highest correlation coefficient (R-value = 0.467), compared with the existing methods. The datasets and source codes of ProBAPred, and the supplementary materials in this study can be downloaded at http://lightning.med.monash.edu/probapred/ for academic use. We anticipate that the developed ProBAPred regression models can facilitate computational characterization and experimental studies of protein-protein binding affinity.
更多
查看译文
关键词
Protein-protein binding affinity,regression model,sequence-derived features,structural features,feature selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要