A Biclique Approach to Reference-Anchored Gene Blocks and Its Applications to Genomic Islands.

JOURNAL OF COMPUTATIONAL BIOLOGY(2018)

引用 3|浏览28
暂无评分
摘要
We formalize a new problem variant in gene-block discovery, denoted Reference-Anchored Gene Blocks (RAGB), given a query sequence Q of length n, representing the gene array of a DNA element, a window size bound d on the length of a substring of interest in Q, and a set of target gene sequences . Our objective is to identify gene blocks in that are centered in a subset q of co-localized genes from Q, and contain genomes from in which the corresponding orthologs of the genes from q are also co-localized. We cast RAGB as a variant of a (colored) biclique problem in bipartite graphs, and analyze its parameterized complexity, as well as the parameterized complexity of other related problems. We give an time algorithm for the uncolored variant of our biclique problem, where m is the number of areas of interest that are parsed from the target sequences, and n and d are as defined earlier. Our algorithm can be adapted to compute all maximal bicliques in the graph within the same time complexity, and to handle edge weights with a slight increase to its time complexity. For the colored version of the problem, our algorithm has a time complexity of . We implement the algorithm and exemplify its application to the data mining of proteobacterial gene blocks that are centered in predicted proteobacterial genomic islands, leading to the identification of putatively mobilized clusters of virulence, pathogenicity, and resistance genes.
更多
查看译文
关键词
bicliques,bipartite graphs,gene blocks,genomic islands,parameterized complexity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要