Maxicircle architecture and evolutionary insights into Trypanosoma cruzi complex

PLOS NEGLECTED TROPICAL DISEASES(2021)

引用 6|浏览9
暂无评分
摘要
We sequenced maxicircles from T. cruzi strains representative of the species evolutionary diversity by using long-read sequencing, which allowed us to uncollapse their repetitive regions, finding that their real lengths range from 35 to 50 kb. T. cruzi maxicircles have a common architecture composed of four regions: coding region (CR), AT-rich region, short (SR) and long repeats (LR). Distribution of genes, both in order and in strand orientation are conserved, being the main differences the presence of deletions affecting genes coding for NADH dehydrogenase subunits, reinforcing biochemical findings that indicate that complex I is not functional in T. cruzi. Moreover, the presence of complete minicircles into maxicircles of some strains lead us to think about the origin of minicircles. Finally, a careful phylogenetic analysis was conducted using coding regions of maxicircles from up to 29 strains, and 1108 single copy nuclear genes from all of the DTUs, clearly establishing that taxonomically T. cruzi is a complex of species composed by group 1 that contains clades A (TcI), B (TcIII) and D (TcIV), and group 2 (1 and 2 do not coincide with groups I and II described decades ago) containing Glade C (TcII), being all hybrid strains of the BC type. Three variants of maxicircles exist in T. cruzi: a, b and c, in correspondence with clades A, B, and C from mitochondria) phylogenies. While A and C carry maxicircles a and c respectively, both clades B and D carry b maxicircle variant; hybrid strains also carry the b-variant. We then propose a new nomenclature that is self-descriptive and makes use of both the phylogenetic relationships and the maxicircle variants present in T. cruzi.
更多
查看译文
关键词
cruzi
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要