The distribution of human snps in gene promoter and coding regions

The distribution of human snps in gene promoter and coding regions(2005)

引用 23|浏览13
暂无评分
摘要
Genetic sequence variation not only affects an individual's anthropometric characteristics, but also influences his risk of disease and response to environment. As a result of high-throughput genotyping methods, millions of genetic variants have been reported in recent years. To explore those important to phenotype, a common strategy is to focus on the variants in functionally important gene regions. Here, the most abundant human genetic variants, Single Nucleotide Polymorphisms (SNPS), were studied in gene promoter and coding regions. Specifically, SNP distributions in transcription factor binding sites, DNA regions corresponding to protein motif and secondary structure were investigated for all of known human genes. Analysis of promoter SNPS in the gene promoters shows that more SNPs occur in the region close to transcriptional start sites than in the region further. SNPs caused by genetic mutation in transversion class are increased more in the former region. Transcription factor binding sites were computationally derived in gene promoters. More SNPs were observed in the putative binding sites than in the non-binding site sequence region, suggesting that the current information about transcription factor binding site sequence pattern may not be exhaustive. It may also suggest that SNPs are actively involved in influencing gene expression during evolution by affecting the transcription factor binding sites. Analysis of coding SNPs in the gene regions corresponding to protein motifs, alpha-helices and beta-sheets shows that the number of non-synonymous SNPs is significantly less than that of synonymous SNPs, although the total number of non-synonymous SNPs in all genes is larger. Non-synonymous SNPs are also more likely to influence amino acids in less conservative motif sites. There are only fewer than 10% of them may have functional impact to the gene products. A simulation on SNP occurrence also suggests that most non-synonymous SNPs only have neutral effects and are not under strong selection pressure. A functional SNP database was created as a byproduct to include SNP and gene sequence feature data, which can be accessed by two types of web services. One is a Distributed Annotation System (DAS) server, which delivers gene annotation data in DAS XML format, and the other is a general web server which supports data view and visualization.
更多
查看译文
关键词
binding site,human snps,transcription factor,synonymous snps,gene promoter,functionally important gene region,non-synonymous snps,gene expression,coding region,gene annotation data,gene product,coding snps
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要