On the contingent nature of satellite DNA evolution

biorxiv(2021)

引用 0|浏览9
暂无评分
摘要
Background The full catalogue of satellite DNA (satDNA) within a same genome constitutes the satellitome. The Library Hypothesis predicts that satDNA in relative species reflects that in their common ancestor, but the evolutionary mechanisms and pathways of satDNA evolution have never been analyzed for full satellitomes. We compare here the satellitomes of two Oedipodine grasshoppers ( Locusta migratoria and Oedaleus decorus ) which shared their most recent common ancestor about 22.8 Ma ago. Results We found that about one-third of their satDNA families (near 60 in every species) showed sequence homology, and were grouped into 12 orthologous superfamilies. The turnover rate of consensus sequences was extremely variable among the 20 orthologous family pairs analyzed in both species. The satDNAs shared by both species showed poor association with sequence signatures and motives frequently argued as functional, except for short inverted repeats allowing short dyad symmetries and non-B DNA conformations. Orthologous satDNAs frequently showed different FISH pattern at both intra- and interspecific levels. We defined indices of homogenization and degeneration, and quantified the level of incomplete library sorting between species. Conclusions Our analyses revealed that satDNA degenerates through point mutation and rejuvenates through partial turnovers caused by massive tandem duplications (the so-called satDNA amplification). Remarkably, satDNA amplification increases homogenization, at intragenomic level, and diversification between species, thus constituting the basis for concerted evolution. We suggest a model of satDNA evolution by means of recursive cycles of amplification, degeneration, and rejuvenation, leading to mostly contingent evolutionary pathways where concerted evolution emerges promptly after lineages split. ### Competing Interest Statement The authors have declared no competing interest. * B-pattern : Banded pattern (pattern in FISH analyses) CEI: Concerted Evolution Index CI : Confidence Interval CTR : Consensus Turnover Rate dG: Gibbs free energy DIVPEAK: Divergence Peak FISH : Fluorescence In Situ Hybridization ILibS: Incomplete Library Sorting K2P : Kimura Two-Parameter (substitution model) Lmi: Locusta migratoria NS-pattern : No signal pattern (in FISH analyses) MAL : Maximum Array Length (observed in MinIon reads of L. migratoria ) MST : Minimum Spanning Tree Ode : Oedaleus decorus OSF : Orthologous Superfamily RL: Repeat Landscape RPS : Relative peak size RUL: Repeat Unit Length satDNA: satellite DNA SF: Superfamily TSI : Tandem Structure Index VIF : Variance inflation factors
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要