A high coverage Mesolithic aurochs genome and effective leveraging of ancient cattle genomes using whole genome imputation.

Molecular biology and evolution(2024)

引用 0|浏览0
暂无评分
摘要
Ancient genomic analyses are often restricted to utilising pseudo-haploid data due to low genome coverage. Leveraging low coverage data by imputation to calculate phased diploid genotypes that enable haplotype-based interrogation and SNP calling at unsequenced positions is highly desirable. This has not been investigated for ancient cattle genomes despite these being compelling subjects for archaeological, evolutionary and economic reasons. Here we test this approach by sequencing a Mesolithic European aurochs (18.49x; 9852-9376 calBCE), an Early Medieval European cow (18.69x; 427-580 calCE), and combine these with published individuals; two ancient and three modern. We downsample these genomes (0.25x, 0.5x, 1.0x, 2.0x) and impute diploid genotypes, utilising a reference panel of 171 published modern cattle genomes that we curated for 21.7 million (Mn) phased single-nucleotide polymorphisms (SNPs). We recover high densities of correct calls with an accuracy of >99.1% at variant sites for the lowest downsample depth of 0.25x, increasing to >99.5% for 2.0x (transversions only, minor allele frequency (MAF) ≥ 2.5%). The recovery of SNPs correlates with coverage, on average 58% of sites are recovered for 0.25x increasing to 87% for 2.0x, utilising an average of 3.5 million (Mn) transversions (MAF ≥2.5%), even in the aurochs, despite the highest temporal distance from the modern reference panel. Our imputed genomes behave similarly to directly called data in allele-frequency-based analyses; for example consistently identifying runs of homozygosity >2mb, including a long homozygous region in the Mesolithic European aurochs.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要