Function and constraint in enhancers with multiple evolutionary origins

Sarah L. Fong, John A. Capra

biorxiv(2022)

引用 1|浏览1
暂无评分
摘要
Motivation Thousands of human gene regulatory enhancers are composed of sequences with multiple evolutionary origins. These evolutionarily “complex” enhancers consist of older “core” sequences and younger “derived” sequences. However, the functional relationship between the sequences of different evolutionary origins within complex enhancers is poorly understood. Results We evaluated the function, selective pressures, and sequence variation across core and derived components of human complex enhancers. We find that both components are older than expected from the genomic background, and cores are enriched for derived sequences of similar evolutionary ages. Both components show strong evidence of biochemical activity in massively parallel report assays (MPRAs). However, core and derived sequences have distinct transcription factor (TF) binding preferences that are largely stable across evolutionary origins. Given these signatures of function, both core and derived sequences have substantial evidence of purifying selection. Nonetheless, derived sequences exhibit weaker purifying selection than adjacent cores. Derived sequences also tolerate more common genetic variation and are enriched compared to cores for eQTL associated with gene expression variability in human populations. Conclusions Both core and derived sequences have strong evidence of gene regulatory function, but derived sequences have distinct constraint profiles, TF binding preferences, and tolerance to variation compared with cores. We propose that the step-wise integration of younger derived and older core sequences has generated regulatory substrates with robust activity and the potential for functional variation. Our analyses demonstrate that synthesizing study of enhancer evolution and function can aid interpretation of regulatory sequence activity and functional variation across human populations. Significance Statement Thousands of human gene regulatory enhancers are mosaics of sequences from multiple evolutionary origins, yet how these different segments contribute to enhancer function is poorly understood. By dissecting their regulatory functions, transcription factor binding, constraint, and human genetic variation, we show that both older “core” and younger “derived” sequences in complex enhancers have strong evidence of gene regulatory function, but derived sequences are more likely to harbor genetic variants that influence function. Together, our results support a model in which the integration of sequences of different origins generates regulatory substrates with robust activity and the potential for functional variation. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要