On using physico-chemical properties of amino acids in string kernels for protein classification via support vector machines

Limin Li, Kiyoko F. Aoki-Kinoshita, Wai-Ki Ching, Hao Jiang

JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY(2015)

引用 1|浏览2
暂无评分
摘要
String kernels are popular tools for analyzing protein sequence data and they have been successfully applied to many computational biology problems. The traditional string kernels assume that different substrings are independent. However, substrings can be highly correlated due to their substructure relationship or common physico-chemical properties. This paper proposes two kinds of weighted spectrum kernels: The correlation spectrum kernel and the AA spectrum kernel. We evaluate their performances by predicting glycan-binding proteins of 12 glycans. The results show that the correlation spectrum kernel and the AA spectrum kernel perform significantly better than the spectrum kernel for nearly all the 12 glycans. By comparing the predictive power of AA spectrum kernels constructed by different physico-chemical properties, the authors can also identify the physicochemical properties which contributes the most to the glycan-protein binding. The results indicate that physico-chemical properties of amino acids in proteins play an important role in the mechanism of glycan-protein binding.
更多
查看译文
关键词
AAindex,AA spectrum kernel,correlation spectrum kernel,physico-chemical properties,string kernel,weighted spectrum kernel
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要