Discovering novel subsystems using comparative genomics.

BIOINFORMATICS(2011)

引用 6|浏览0
暂无评分
摘要
Key problems for computational genomics include discovering novel pathways in genome data, and discovering functional interaction partners for genes to define new members of partially elucidated pathways.We propose a novel method for the discovery of subsystems from annotated genomes. For each gene pair, a score measuring the likelihood that the two genes belong to a same subsystem is computed using genome context methods. Genes are then grouped based on these scores, and the resulting groups are filtered to keep only high-confidence groups. Since the method is based on genome context analysis, it relies solely on structural annotation of the genomes. The method can be used to discover new pathways, find missing genes from a known pathway, find new protein complexes or other kinds of functional groups and assign function to genes. We tested the accuracy of our method in Escherichia coli K-12. In one configuration of the system, we find that 31.6% of the candidate groups generated by our method match a known pathway or protein complex closely, and that we rediscover 31.2% of all known pathways and protein complexes of at least 4 genes. We believe that a significant proportion of the candidates that do not match any known group in E.coli K-12 corresponds to novel subsystems that may represent promising leads for future laboratory research. We discuss in-depth examples of these findings.Predicted subsystems are available at http://brg.ai.sri.com/pwy-discovery/journal.html.lferrer@ai.sri.comSupplementary data are available at Bioinformatics online.
更多
查看译文
关键词
predicted subsystems,novel method,genome context analysis,known group,genome context method,known pathway,genome data,new member,new pathway,protein complex,comparative genomics,discovering novel,genome,computational biology,algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要