Dissecting the regulatory activity and sequence content of loci with exceptional numbers of transcription factor associations.

Genome Research (2020)

Abstract
DNA-associated proteins (DAPs) classically regulate gene expression by binding to regulatory loci such as enhancers or promoters. As expanding catalogs of genome-wide DAP binding maps reveal thousands of loci that, unlike the majority of conventional enhancers and promoters, associate with dozens of different DAPs with apparently little regard for motif preference, an understanding of DAP association and coordination at such regulatory loci is essential to deciphering how these regions contribute to normal development and disease. In this study, we aggregated publicly available ChIP-seq data from 469 human DAPs assayed in three cell lines and integrated these data with an orthogonal data set of 352 nonredundant, in vitro-derived motifs mapped to the genome within DNase I hypersensitivity footprints to characterize regions with high numbers of DAP associations. We establish a generalizable definition for high occupancy target (HOT) loci and identify putative driver DAP motifs in HepG2 cells, including HNF4A, SP1, SP5, and ETV4, that are highly prevalent and show sequence conservation at HOT loci. The number of different DAPs associated with an element is positively associated with evidence of regulatory activity, and by systematically mutating 245 HOT loci with a massively parallel mutagenesis assay, we localized regulatory activity to a central core region that depends on the motif sequences of our previously nominated driver DAPs. In sum, this work leverages the increasingly large number of DAP motif and ChIP-seq data publicly available to explore how DAP associations contribute to genome-wide transcriptional regulation.