1
|
Pluta A, Rola-Łuszczak M, Douville RN, Kuźmak J. Bovine leukemia virus long terminal repeat variability: identification of single nucleotide polymorphisms in regulatory sequences. Virol J 2018; 15:165. [PMID: 30359262 PMCID: PMC6202831 DOI: 10.1186/s12985-018-1062-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 09/25/2018] [Indexed: 12/03/2022] Open
Abstract
Background Limited data are available on the incidence of variations in nucleotide sequences of long terminal repeat (LTR) regions of Bovine Leukemia Virus (BLV). Consequently, the possible impact of SNPs on BLV LTR function are poorly elucidated. Thus, a detailed and representative study of full-length LTR sequences obtained from sixty-four BLV isolates from different geographical regions of Poland, Moldova, Croatia, Ukraine and Russia were analyzed for their genetic variability. Methods Overlap extension PCR, sequencing and Bayesian phylogenetic reconstruction of LTR sequences were performed. These analyses were followed by detailed sequence comparison, estimation of genetic heterogeneity and identification of transcription factor binding site (TFBS) modifications. Results Phylogenetic analysis of curated LTR sequences and those available in the GenBank database reflected the acknowledged env gene classification of BLV into 10 genotypes, and further clustered analysed sequences into three genotypes - G4, G7 and G8. Additional molecular studies revealed the presence of 97 point mutations distributed at 89 positions throughout all 64 LTR sequences. The highest rate of variability was noted in U3 and U5 subregions. However, the variability in regulatory sequences (VR) was assessed as lower than the variability within non-regulatory sequences (VNR) for both, U3 and U5 subregions. In contrast, VR value for R subregion, as well as for the total LTR, was higher than the VNR suggesting the existence of positive selection. Twelve unique SNPs for these LTR sequences localized in regulatory and non-regulatory elements were identified. The presence of different types of substitutions lead to the abrogation of present or to the creation of additional TFBS. Conclusion This study represents the largest study of LTR genetic variability of BLV field isolates from Eastern part of Europe. Phylogenetic analysis of LTRs supports the clustering BLV variants based on their geographic origin. The SNP screening showed variations modifying LTR regulatory sequences, as well as altering TFBS. These features warrant further exploration as they could be related to proviral load and distinctive regulation of BLV transcription and replication. Electronic supplementary material The online version of this article (10.1186/s12985-018-1062-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Aneta Pluta
- Department of Biochemistry, National Veterinary Research Institute, Puławy, Poland.
| | | | - Renée N Douville
- Department of Biology, The University of Winnipeg, Winnipeg, MB, Canada.,Department of Immunology, University of Manitoba, Winnipeg, MB, Canada
| | - Jacek Kuźmak
- Department of Biochemistry, National Veterinary Research Institute, Puławy, Poland
| |
Collapse
|
2
|
Ibrahim M, Osman M, Peprah E. Understanding the Evolutionary Biology of CVD From Analysis of Ancestral Population Genomes. Glob Heart 2017; 12:73-75. [PMID: 28302549 DOI: 10.1016/j.gheart.2017.01.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 01/13/2017] [Indexed: 01/29/2023] Open
Affiliation(s)
- Muntaser Ibrahim
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum, Khartoum, Sudan.
| | - Maha Osman
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum, Khartoum, Sudan
| | - Emmanuel Peprah
- Center for Translation Research and Implementation Science, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
3
|
Ma L, Zhao B, Chen K, Thomas A, Tuteja JH, He X, He C, White KP. Evolution of transcript modification by N6-methyladenosine in primates. Genome Res 2017; 27:385-392. [PMID: 28052920 PMCID: PMC5340966 DOI: 10.1101/gr.212563.116] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 12/19/2016] [Indexed: 11/24/2022]
Abstract
Phenotypic differences within populations and between closely related species are often driven by variation and evolution of gene expression. However, most analyses have focused on the effects of genomic variation at cis-regulatory elements such as promoters and enhancers that control transcriptional activity, and little is understood about the influence of post-transcriptional processes on transcript evolution. Post-transcriptional modification of RNA by N6-methyladenosine (m6A) has been shown to be widespread throughout the transcriptome, and this reversible mark can affect transcript stability and translation dynamics. Here we analyze m6A mRNA modifications in lymphoblastoid cell lines (LCLs) from human, chimpanzee and rhesus, and we identify patterns of m6A evolution among species. We find that m6A evolution occurs in parallel with evolution of consensus RNA sequence motifs known to be associated with the enzymatic complexes that regulate m6A dynamics, and expression evolution of m6A-modified genes occurs in parallel with m6A evolution.
Collapse
Affiliation(s)
- Lijia Ma
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Boxuan Zhao
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Biochemistry and Molecular Biology and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, USA.,Howard Hughes Medical Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Kai Chen
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Biochemistry and Molecular Biology and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Amber Thomas
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Jigyasa H Tuteja
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Xin He
- Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Chuan He
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Biochemistry and Molecular Biology and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, USA.,Howard Hughes Medical Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Kevin P White
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Human Genetics, The University of Chicago, Chicago, Illinois 60637, USA.,Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637, USA.,Tempus Health, Incorporated, Chicago, Illinois 60654, USA
| |
Collapse
|
4
|
Smith JD, McManus KF, Fraser HB. A novel test for selection on cis-regulatory elements reveals positive and negative selection acting on mammalian transcriptional enhancers. Mol Biol Evol 2013; 30:2509-18. [PMID: 23904330 DOI: 10.1093/molbev/mst134] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Measuring natural selection on genomic elements involved in the cis-regulation of gene expression--such as transcriptional enhancers and promoters--is critical for understanding the evolution of genomes, yet it remains a major challenge. Many studies have attempted to detect positive or negative selection in these noncoding elements by searching for those with the fastest or slowest rates of evolution, but this can be problematic. Here, we introduce a new approach to this issue, and demonstrate its utility on three mammalian transcriptional enhancers. Using results from saturation mutagenesis studies of these enhancers, we classified all possible point mutations as upregulating, downregulating, or silent, and determined which of these mutations have occurred on each branch of a phylogeny. Applying a framework analogous to Ka/Ks in protein-coding genes, we measured the strength of selection on upregulating and downregulating mutations, in specific branches as well as entire phylogenies. We discovered distinct modes of selection acting on different enhancers: although all three have experienced negative selection against downregulating mutations, the selection pressures on upregulating mutations vary. In one case, we detected positive selection for upregulation, whereas the other two had no detectable selection on upregulating mutations. Our methodology is applicable to the growing number of saturation mutagenesis data sets, and provides a detailed picture of the mode and strength of natural selection acting on cis-regulatory elements.
Collapse
|
5
|
Moreno-Hagelsieb G, Jokic P. The evolutionary dynamics of functional modules and the extraordinary plasticity of regulons: the Escherichia coli perspective. Nucleic Acids Res 2012; 40:7104-12. [PMID: 22618875 PMCID: PMC3424573 DOI: 10.1093/nar/gks443] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Using profiles of phylogenetic profiles (P-cubic) we compared the evolutionary dynamics of different kinds of functional associations. Ordered from most to least evolutionarily stable, these associations were genes in the same operons, genes whose products participate in the same biochemical pathway, genes coding for physically interacting proteins and genes in the same regulons. Regulons showed the most plastic functional interactions with evolutionary stabilities barely better than those of unrelated genes. Further regulon analyses showed that global regulators contain less evolutionarily stable associations than local regulators. Genes co-repressed by global regulators had a higher evolutionary conservation than genes co-activated by global regulators. However, the reverse was true for genes co-repressed and co-activated by local regulators. Of all the regulon-related associations, the relationship between regulators and their target genes showed the most evolutionary stability. Different negative data sets built to contrast against each of the analysed kinds of modules also differed in evolutionary conservation revealing further underlying genome organization. Applying P-cubic analyses to other genomes might help visualize genome organization, understand the evolutionary importance and plasticity of functional associations and compare the quality of data sets expected to reflect functional interactions, such as those coming from high-throughput experiments.
Collapse
Affiliation(s)
- Gabriel Moreno-Hagelsieb
- Department of Biology, Wilfrid Laurier University, 75 University Avenue West, Waterloo, ON, Canada N2L 3C5.
| | | |
Collapse
|
6
|
Abstract
Vast tracts of noncoding DNA contain elements that regulate gene expression in higher eukaryotes. Describing these regulatory elements and understanding how they evolve represent major challenges for biologists. Advances in the ability to survey genome-scale DNA sequence data are providing unprecedented opportunities to use evolutionary models and computational tools to identify functionally important elements and the mode of selection acting on them in multiple species. This chapter reviews some of the current methods that have been developed and applied on noncoding DNA, what they have shown us, and how they are limited. Results of several recent studies reveal that a significantly larger fraction of noncoding DNA in eukaryotic organisms is likely to be functional than previously believed, implying that the functional annotation of most noncoding DNA in these organisms is largely incomplete. In Drosophila, recent studies have further suggested that a large fraction of noncoding DNA divergence observed between species may be the product of recurrent adaptive substitution. Similar studies in humans have revealed a more complex pattern, with signatures of recurrent positive selection being largely concentrated in conserved noncoding DNA elements. Understanding these patterns and the extent to which they generalize to other organisms awaits the analysis of forthcoming genome-scale polymorphism and divergence data from more species.
Collapse
Affiliation(s)
- Ying Zhen
- Department of Ecology and Evolutionary Biology, The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | | |
Collapse
|
7
|
Rhoné B, Brandenburg JT, Austerlitz F. Impact of selection on genes involved in regulatory network: a modelling study. J Evol Biol 2011; 24:2087-98. [PMID: 21682788 DOI: 10.1111/j.1420-9101.2011.02335.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Complex phenotypes are often controlled by many interacting genes. One question emerging from such organization is how selection, acting at the phenotypic level, shapes the evolution of genes involved in regulatory networks controlling the phenotypes. We studied this issue through a matrix model of such networks. In a population submitted to selection, we simulated the evolution of a quantitative trait controlled by a set of loci that regulate each other through positive or negative interactions. Investigating several levels of selection intensity on the trait, we studied the evolution of regulation intensity between the genes and the evolution of the genetic diversity of those genes as an indirect measure of the strength of selection acting on them. We show that an increasing intensity of selection on the phenotype leads to an increased level of regulation between the loci. Moreover, we found that the genes responding more strongly to selection within the network were those evolving towards stronger regulatory action on the other genes and/or those that are the less regulated by the other genes. This observation is strongest for an intermediate level of selection. This may explain why several experimental studies have shown evidence of selection on regulatory genes inside gene networks.
Collapse
Affiliation(s)
- B Rhoné
- Ecologie, Systématique et Evolution, UMR 8079 CNRS/Université Paris-Sud/AgroParisTech, Université Paris-Sud 11, Orsay Cedex, France
| | | | | |
Collapse
|
8
|
Goering LM, Hunt PK, Heighington C, Busick C, Pennings PS, Hermisson J, Kumar S, Gibson G. Association of orthodenticle with natural variation for early embryonic patterning in Drosophila melanogaster. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2010; 312:841-54. [PMID: 19488993 DOI: 10.1002/jez.b.21299] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Although it is well established that cis-acting regulatory variation contributes to morphological evolution between species, few concrete examples of polymorphism affecting developmental patterning within species have been demonstrated. Early embryogenesis in Drosophila is initiated by a gradient of Bicoid morphogen activity that results in differential expression of multiple target genes. In a screen for genetic variation affecting this process, we surveyed 96 wild-type lines of Drosophila melanogaster for polymorphisms in binding sites within 16 Bicoid cis-regulatory response elements. One common polymorphism in the orthodenticle (otd) early head enhancer is associated with a complex series of indels/substitutions that define two distinct haplotypes. The middle region of this enhancer exhibits an unusual pattern of nucleotide diversity that does not easily fit into standard models of selection and demography. Population Gene Expression Maps, generated by extracting binary expression profiles from normalized embryo images, revealed a ventral reduction of otd transcript abundance in one of the haplotypes that was recapitulated in expression of transgenic constructs containing the two alleles. We thus demonstrate that even a process as robust as early developmental patterning is affected by standing genetic variation, intriguingly involving otd, whose morphogenetic function bicoid is thought to have displaced during dipteran evolution.
Collapse
Affiliation(s)
- Lisa M Goering
- Department of Genetics, North Carolina State University, Raleigh, North Carolina, USA.
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Athanasiadis G, Esteban E, Gayà -Vidal M, Dugoujon JM, Moschonas N, Chaabani H, Bissar-Tadmouri N, Harich N, Stoneking M, Moral P. Different Evolutionary Histories of the Coagulation Factor VII Gene in Human Populations? Ann Hum Genet 2010; 74:34-45. [DOI: 10.1111/j.1469-1809.2009.00557.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
10
|
A TOMM40 variable-length polymorphism predicts the age of late-onset Alzheimer's disease. THE PHARMACOGENOMICS JOURNAL 2009; 10:375-84. [PMID: 20029386 PMCID: PMC2946560 DOI: 10.1038/tpj.2009.69] [Citation(s) in RCA: 271] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The ɛ4 allele of the apolipoprotein E (APOE) gene is currently the strongest and most highly replicated genetic factor for risk and age of onset of late-onset Alzheimer's disease (LOAD). Using phylogenetic analysis, we have identified a polymorphic poly-T variant, rs10524523, in the translocase of outer mitochondrial membrane 40 homolog (TOMM40) gene that provides greatly increased precision in the estimation of age of LOAD onset for APOE ɛ3 carriers. In two independent clinical cohorts, longer lengths of rs10524523 are associated with a higher risk for LOAD. For APOE ɛ3/4 patients who developed LOAD after 60 years of age, individuals with long poly-T repeats linked to APOE ɛ3 develop LOAD on an average of 7 years earlier than individuals with shorter poly-T repeats linked to APOE ɛ3 (70.5 ± 1.2 years versus 77.6 ± 2.1 years, P=0.02, n=34). Independent mutation events at rs10524523 that occurred during Caucasian evolution have given rise to multiple categories of poly-T length variants at this locus. On replication, these results will have clinical utility for predictive risk estimates for LOAD and for enabling clinical disease prevention studies. In addition, these results show the effective use of a phylogenetic approach for analysis of haplotypes of polymorphisms, including structural polymorphisms, which contribute to complex diseases.
Collapse
|
11
|
Luca F, Kashyap S, Southard C, Zou M, Witonsky D, Di Rienzo A, Conzen SD. Adaptive variation regulates the expression of the human SGK1 gene in response to stress. PLoS Genet 2009; 5:e1000489. [PMID: 19461886 PMCID: PMC2679193 DOI: 10.1371/journal.pgen.1000489] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2009] [Accepted: 04/22/2009] [Indexed: 12/22/2022] Open
Abstract
The Serum and Glucocorticoid-regulated Kinase1 (SGK1) gene is a target of the glucocorticoid receptor (GR) and is central to the stress response in many human tissues. Because environmental stress varies across habitats, we hypothesized that natural selection shaped the geographic distribution of genetic variants regulating the level of SGK1 expression following GR activation. By combining population genetics and molecular biology methods, we identified a variant (rs9493857) with marked allele frequency differences between populations of African and European ancestry and with a strong correlation between allele frequency and latitude in worldwide population samples. This SNP is located in a GR-binding region upstream of SGK1 that was identified using a GR ChIP-chip. SNP rs9493857 also lies within a predicted binding site for Oct1, a transcription factor known to cooperate with the GR in the transactivation of target genes. Using ChIP assays, we show that both GR and Oct1 bind to this region and that the ancestral allele at rs9493857 binds the GR-Oct1 complex more efficiently than the derived allele. Finally, using a reporter gene assay, we demonstrate that the ancestral allele is associated with increased glucocorticoid-dependent gene expression when compared to the derived allele. Our results suggest a novel paradigm in which hormonal responsiveness is modulated by sequence variation in the regulatory regions of nuclear receptor target genes. Identifying such functional variants may shed light on the mechanisms underlying inter-individual variation in response to environmental stressors and to hormonal therapy, as well as in the susceptibility to hormone-dependent diseases. Susceptibility to many common human diseases including hypertension, heart disease, and the metabolic syndrome is associated with increased neuroendocrine signaling in response to environmental stressors. A key component of the human stress response involves increased systemic glucocorticoid secretion that in turn leads to glucocorticoid receptor (GR) activation. As a result, a variety of GR-expressing cell types undergo gene expression changes, thereby providing an integrated physiological response to stress. The SGK1 gene is a well-established GR target that promotes cellular homeostasis in response to stress. Here, we use a combination of population genetics and molecular biology approaches to identify an SNP (rs9493857) in a distant SGK1 GR-binding region with unusually large differences in allele frequency between populations of European and African ancestry. Furthermore, rs9493857 shows a strong correlation between allele frequency and distance from the equator, a pattern consistent with a varying selective advantage across environments. Indeed, the ancestral allele at rs9493857 results in increased GR-binding and glucocorticoid-regulated gene expression, suggesting that an increased stress response (i.e., glucocorticoid responsiveness) was advantageous in ancestral human populations. We speculate that, in modern times, such variation could favor the negative effects of a heightened glucocorticoid response, potentially predisposing individuals to chronic diseases such as metabolic syndrome and hypertension.
Collapse
Affiliation(s)
- Francesca Luca
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
| | - Sonal Kashyap
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Catherine Southard
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
| | - Min Zou
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - David Witonsky
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
| | - Anna Di Rienzo
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (ADR); (SDC)
| | - Suzanne D. Conzen
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (ADR); (SDC)
| |
Collapse
|
12
|
Bitocchi E, Nanni L, Rossi M, Rau D, Bellucci E, Giardini A, Buonamici A, Vendramin GG, Papa R. Introgression from modern hybrid varieties into landrace populations of maize (Zea mays ssp. mays L.) in central Italy. Mol Ecol 2009; 18:603-21. [PMID: 19215582 DOI: 10.1111/j.1365-294x.2008.04064.x] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Landraces are domesticated local plant varieties that did not experience a deliberate and intensive selection during a formal breeding programme. In Europe, maize landraces are still cultivated, particularly in marginal areas where traditional farming is often practiced. Here, we have studied the evolution of flint maize landraces from central Italy over 50 years of on-farm cultivation, when dent hybrid varieties were introduced and their use was widespread. We have compared an 'old' collection, obtained during the 1950s, before the introduction of hybrids, and a recent collection of maize landraces. For comparison, a sample of maize landraces from north Italy, and of improved germplasm, including hybrids and inbred lines were also used. A total of 296 genotypes were analysed using 21 microsatellites. Our results show that the maize landraces collected in the last 5-10 years have evolved directly from the flint landrace gene pool cultivated in central Italy before the introduction of modern hybrids. The population structure, diversity and linkage disequilibrium analyses indicate a significant amount of introgression from hybrid varieties into the recent landrace populations. No evidence of genetic erosion of the maize landraces was seen, suggesting that in situ conservation of landraces is an efficient strategy for preserving genetic diversity. Finally, the level of introgression detected was very variable among recent landraces, with most of them showing a low level of introgression; this suggests that coexistence between different types of agriculture is possible, with the adoption of correct practices that are aimed at avoiding introgression from undesired genetic sources.
Collapse
Affiliation(s)
- E Bitocchi
- Dipartimento di Scienze Ambientali e delle Produzioni Vegetali, Università Politecnica delle Marche, Via Brecce Bianche, 60131 Ancona, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Affiliation(s)
- Keyue Ding
- From the Division of Cardiovascular Diseases, Mayo Clinic, Rochester, Minn
| | - Iftikhar J. Kullo
- From the Division of Cardiovascular Diseases, Mayo Clinic, Rochester, Minn
| |
Collapse
|
14
|
Hofer T, Ray N, Wegmann D, Excoffier L. Large Allele Frequency Differences between Human Continental Groups are more Likely to have Occurred by Drift During range Expansions than by Selection. Ann Hum Genet 2009; 73:95-108. [PMID: 19040659 DOI: 10.1111/j.1469-1809.2008.00489.x] [Citation(s) in RCA: 135] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- T Hofer
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
| | | | | | | |
Collapse
|
15
|
Kudaravalli S, Veyrieras JB, Stranger BE, Dermitzakis ET, Pritchard JK. Gene expression levels are a target of recent natural selection in the human genome. Mol Biol Evol 2008; 26:649-58. [PMID: 19091723 PMCID: PMC2767089 DOI: 10.1093/molbev/msn289] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Changes in gene expression may represent an important mode of human adaptation. However, to date, there are relatively few known examples in which selection has been shown to act directly on levels or patterns of gene expression. In order to test whether single nucleotide polymorphisms (SNPs) that affect gene expression in cis are frequently targets of positive natural selection in humans, we analyzed genome-wide SNP and expression data from cell lines associated with the International HapMap Project. Using a haplotype-based test for selection that was designed to detect incomplete selective sweeps, we found that SNPs showing signals of selection are more likely than random SNPs to be associated with gene expression levels in cis. This signal is significant in the Yoruba (which is the population that shows the strongest signals of selection overall) and shows a trend in the same direction in the other HapMap populations. Our results argue that selection on gene expression levels is an important type of human adaptation. Finally, our work provides an analytical framework for tackling a more general problem that will become increasingly important: namely, testing whether selection signals overlap significantly with SNPs that are associated with phenotypes of interest.
Collapse
|
16
|
Myles S, Davison D, Barrett J, Stoneking M, Timpson N. Worldwide population differentiation at disease-associated SNPs. BMC Med Genomics 2008; 1:22. [PMID: 18533027 PMCID: PMC2440747 DOI: 10.1186/1755-8794-1-22] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2008] [Accepted: 06/04/2008] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Recent genome-wide association (GWA) studies have provided compelling evidence of association between genetic variants and common complex diseases. These studies have made use of cases and controls almost exclusively from populations of European ancestry and little is known about the frequency of risk alleles in other populations. The present study addresses the transferability of disease associations across human populations by examining levels of population differentiation at disease-associated single nucleotide polymorphisms (SNPs). METHODS We genotyped ~1000 individuals from 53 populations worldwide at 25 SNPs which show robust association with 6 complex human diseases (Crohn's disease, type 1 diabetes, type 2 diabetes, rheumatoid arthritis, coronary artery disease and obesity). Allele frequency differences between populations for these SNPs were measured using Fst. The Fst values for the disease-associated SNPs were compared to Fst values from 2750 random SNPs typed in the same set of individuals. RESULTS On average, disease SNPs are not significantly more differentiated between populations than random SNPs in the genome. Risk allele frequencies, however, do show substantial variation across human populations and may contribute to differences in disease prevalence between populations. We demonstrate that, in some cases, risk allele frequency differences are unusually high compared to random SNPs and may be due to the action of local (i.e. geographically-restricted) positive natural selection. Moreover, some risk alleles were absent or fixed in a population, which implies that risk alleles identified in one population do not necessarily account for disease prevalence in all human populations. CONCLUSION Although differences in risk allele frequencies between human populations are not unusually large and are thus likely not due to positive local selection, there is substantial variation in risk allele frequencies between populations which may account for differences in disease prevalence between human populations.
Collapse
Affiliation(s)
- Sean Myles
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany.
| | | | | | | | | |
Collapse
|
17
|
Levasseur A, Orlando L, Bailly X, Milinkovitch MC, Danchin EGJ, Pontarotti P. Conceptual bases for quantifying the role of the environment on gene evolution: the participation of positive selection and neutral evolution. Biol Rev Camb Philos Soc 2007; 82:551-72. [PMID: 17944617 DOI: 10.1111/j.1469-185x.2007.00024.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To demonstrate that a given change in the environment has contributed to the emergence of a given genotypic and phenotypic shift during the course of evolution, one should ask to what extent such shifts would have occurred without environmental change. Of course, such tests are rarely practical but phenotypic novelties can still be correlated to genomic shifts in response to environmental changes if enough information is available. We surveyed and re-evaluated the published data in order to estimate the role of environmental changes on the course of species and genomic evolution. Only a few published examples clearly demonstrate a causal link between a given environmental change and the fixation of a genomic variant resulting in functional modification (gain, loss or alteration of function). Many others suggested a link between a given phenotypic shift and a given environmental change but failed to identify the underlying genomic determinant(s) and/or the associated functional consequence(s). The proportion of genotypic and phenotypic variation that is fixed concomitantly with environmental changes is often considered adaptive and hence, the result of positive selection, even though alternative causes, such as genetic drift, are rarely investigated. Therefore, the second aim herein is to review evidence for the mechanisms leading to fixation.
Collapse
Affiliation(s)
- Anthony Levasseur
- Phylogenomics Laboratory, EA 3781 Evolution Biologique Université de Provence, Case 19, Pl. V. Hugo, 13331 Marseille Cedex 03, France.
| | | | | | | | | | | |
Collapse
|
18
|
Ding K, McDonough SJ, Kullo IJ. Evidence for positive selection in the C-terminal domain of the cholesterol metabolism gene PCSK9 based on phylogenetic analysis in 14 primate species. PLoS One 2007; 2:e1098. [PMID: 17971861 PMCID: PMC2034530 DOI: 10.1371/journal.pone.0001098] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2007] [Accepted: 10/08/2007] [Indexed: 12/31/2022] Open
Abstract
Background Cholesterol homeostasis is maintained through finely tuned mechanisms regulating intestinal absorption, hepatic biosynthesis and secretion as well as plasma clearance. Proprotein convertase subtilisin/kexin type 9 (PCSK9) is a secreted enzyme of the serine protease family that reduces cellular uptake of plasma low-density lipoprotein (LDL) cholesterol by promoting LDL receptor (LDL-R) degradation. Species-specific positive selection has been noted in the LDLR promoter, leading to differential expression of LDLR among primates. Whether PCSK9 experienced significant selective pressure to maintain a functional relationship with its target protein, LDL-R, is unknown. Methodology/Principal Findings We compiled the sequences of the coding regions of PCSK9 from 14 primate species in the clade of Hominoids, Old World monkeys and New World monkeys. To detect selective pressure at the protein level, the ratios of nonsynonymous/synonymous substitution rate (dN/dS) under different evolutionary models were calculated across the phylogeny of PCSK9. Maximum likelihood analyses of dN/dS ratios for the aligned coding region sequences among 14 primate species indicated that PCSK9 was subject to a strong functional constraint (i.e., purifying selection). However, positive selection was noted in the functional carboxyl-terminal (C-terminal) domain in many branches across the phylogeny, especially in the lineage leading to the orangutan. Furthermore, at least five positively selected amino acids were detected in this lineage using the branch-site model A. In a sliding-window analysis, several dN/dS peaks in the C-terminal domain in both the human and the orangutan branches were noted. Conclusions These results suggest that among primates, differential selective pressure has shaped evolutionary patterns in the functional domains of PCSK9, an important regulator of cholesterol homeostasis.
Collapse
Affiliation(s)
- Keyue Ding
- Division of Cardiovascular Diseases, Mayo Clinic and Foundation, Rochester, Minnesota, United States of America
| | - Samantha J. McDonough
- Division of Cardiovascular Diseases, Mayo Clinic and Foundation, Rochester, Minnesota, United States of America
| | - Iftikhar J. Kullo
- Division of Cardiovascular Diseases, Mayo Clinic and Foundation, Rochester, Minnesota, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
19
|
Baena A, Mootnick AR, Falvo JV, Tsytsykova AV, Ligeiro F, Diop OM, Brieva C, Gagneux P, O'Brien SJ, Ryder OA, Goldfeld AE. Primate TNF promoters reveal markers of phylogeny and evolution of innate immunity. PLoS One 2007; 2:e621. [PMID: 17637837 PMCID: PMC1905939 DOI: 10.1371/journal.pone.0000621] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2007] [Accepted: 06/12/2007] [Indexed: 11/18/2022] Open
Abstract
Background Tumor necrosis factor (TNF) is a critical cytokine in the immune response whose transcriptional activation is controlled by a proximal promoter region that is highly conserved in mammals and, in particular, primates. Specific single nucleotide polymorphisms (SNPs) upstream of the proximal human TNF promoter have been identified, which are markers of human ancestry. Methodology/Principal findings Using a comparative genomics approach we show that certain fixed genetic differences in the TNF promoter serve as markers of primate speciation. We also demonstrate that distinct alleles of most human TNF promoter SNPs are identical to fixed nucleotides in primate TNF promoters. Furthermore, we identify fixed genetic differences within the proximal TNF promoters of Asian apes that do not occur in African ape or human TNF promoters. Strikingly, protein-DNA binding assays and gene reporter assays comparing these Asian ape TNF promoters to African ape and human TNF promoters demonstrate that, unlike the fixed differences that we define that are associated with primate phylogeny, these Asian ape-specific fixed differences impair transcription factor binding at an Sp1 site and decrease TNF transcription induced by bacterial stimulation of macrophages. Conclusions/significance Here, we have presented the broadest interspecies comparison of a regulatory region of an innate immune response gene to date. We have characterized nucleotide positions in Asian ape TNF promoters that underlie functional changes in cell type- and stimulus-specific activation of the TNF gene. We have also identified ancestral TNF promoter nucleotide states in the primate lineage that correspond to human SNP alleles. These findings may reflect evolution of Asian and African apes under a distinct set of infectious disease pressures involving the innate immune response and TNF.
Collapse
Affiliation(s)
- Andres Baena
- The CBR Institute for Biomedical Research, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alan R. Mootnick
- Gibbon Conservation Center, Santa Clarita, California, United States of America
| | - James V. Falvo
- The CBR Institute for Biomedical Research, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alla V. Tsytsykova
- The CBR Institute for Biomedical Research, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Filipa Ligeiro
- The CBR Institute for Biomedical Research, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Ousmane M. Diop
- Laboratoire de Rétrovirologie, Institut Pasteur, Dakar, Senegal
| | - Claudia Brieva
- Unidad de Rescate y Rehabilitación de Animales Silvestres, Facultad de Medicina Veterinaria y Zootecnia, Universidad Nacional de Colombia, Bogotá, Colombia
| | - Pascal Gagneux
- Project for Explaining the Origin of Humans, Glycobiology Research and Training Center, Department of Medicine and Cellular and Molecular Medicine, University of California at San Diego, La Jolla, California, United States of America
| | - Stephen J. O'Brien
- Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland, United States of America
| | - Oliver A. Ryder
- Conservation and Research for Endangered Species, Zoological Society of San Diego, San Diego, California, United States of America
- Division of Biological Sciences, University of California at San Diego, La Jolla, California, United States of America
| | - Anne E. Goldfeld
- The CBR Institute for Biomedical Research, Harvard Medical School, Boston, Massachusetts, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
20
|
Labuda D, Labbé C, Langlois S, Lefebvre JF, Freytag V, Moreau C, Sawicki J, Beaulieu P, Pastinen T, Hudson TJ, Sinnett D. Patterns of variation in DNA segments upstream of transcription start sites. Hum Mutat 2007; 28:441-50. [PMID: 17274005 PMCID: PMC2683062 DOI: 10.1002/humu.20463] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
It is likely that evolutionary differences among species are driven by sequence changes in regulatory regions. Likewise, polymorphisms in the promoter regions may be responsible for interindividual differences at the level of populations. We present an unbiased survey of genetic variation in 2-kb segments upstream of the transcription start sites of 28 protein-coding genes, characterized in five population groups of different geographic origin. On average, we found 9.1 polymorphisms and 8.8 haplotypes per segment with corresponding nucleotide and haplotype diversities of 0.082% and 58%, respectively. We characterized these segments through different summary statistics, Hardy-Weinberg equilibria fixation index (Fst) estimates, and neutrality tests, as well as by analyzing the distributions of haplotype allelic classes, introduced here to assess the departure from neutrality and examined by coalescent simulations under a simple population model, assuming recombinations or different demography. Our results suggest that genetic diversity in some of these regions could have been shaped by purifying selection and driven by adaptive changes in the other, thus explaining the relatively large variance in the corresponding genetic diversity indices loci. However, some of these effects could be also due to linkage with surrounding sequences, and the neutralists' explanations cannot be ruled out given uncertainty in the underlying demographic histories and the possibility of random effects due to the small size of the studied segments.
Collapse
Affiliation(s)
- Damian Labuda
- Centre de Recherche, Hôpital Sainte-Justine, Montréal, Quebec, Canada.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Kehrer-Sawatzki H, Cooper DN. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons. Hum Mutat 2007; 28:99-130. [PMID: 17024666 DOI: 10.1002/humu.20420] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.
Collapse
|
22
|
Stinchcombe JR, Hoekstra HE. Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits. Heredity (Edinb) 2007; 100:158-70. [PMID: 17314923 DOI: 10.1038/sj.hdy.6800937] [Citation(s) in RCA: 380] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
A central challenge in evolutionary biology is to identify genes underlying ecologically important traits and describe the fitness consequences of naturally occurring variation at these loci. To address this goal, several novel approaches have been developed, including 'population genomics,' where a large number of molecular markers are scored in individuals from different environments with the goal of identifying markers showing unusual patterns of variation, potentially due to selection at linked sites. Such approaches are appealing because of (1) the increasing ease of generating large numbers of genetic markers, (2) the ability to scan the genome without measuring phenotypes and (3) the simplicity of sampling individuals without knowledge of their breeding history. Although such approaches are inherently applicable to non-model systems, to date these studies have been limited in their ability to uncover functionally relevant genes. By contrast, quantitative genetics has a rich history, and more recently, quantitative trait locus (QTL) mapping has had some success in identifying genes underlying ecologically relevant variation even in novel systems. QTL mapping, however, requires (1) genetic markers that specifically differentiate parental forms, (2) a focus on a particular measurable phenotype and (3) controlled breeding and maintenance of large numbers of progeny. Here we present current advances and suggest future directions that take advantage of population genomics and quantitative genetic approaches - in both model and non-model systems. Specifically, we discuss advantages and limitations of each method and argue that a combination of the two provides a powerful approach to uncovering the molecular mechanisms responsible for adaptation.
Collapse
Affiliation(s)
- J R Stinchcombe
- Department of Ecology and Evolutionary Biology, Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada.
| | | |
Collapse
|
23
|
Ding K, Kullo IJ. Molecular evolution of 5' flanking regions of 87 candidate genes for atherosclerotic cardiovascular disease. Genet Epidemiol 2007; 30:557-69. [PMID: 16799961 DOI: 10.1002/gepi.20169] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Inter-individual variation in quantitative traits as well as in disease susceptibility may be due to differences in the level and spatio-temporal pattern of gene expression. An evolutionary model of genetic variation in cis-regulatory regions may help identify loci of interest for the study of the genetic basis of complex diseases such as atherosclerotic cardiovascular disease (ASCVD). We studied the molecular evolution of 87 candidate genes for ASCVD to detect signatures of selection in 5' flanking regions. Resequenced data for these genes were available in 24 African-Americans, 23 European-Americans, and 1 chimpanzee (Pan troglodytes). Statistical tests of evolutionary neutrality (Tajima's D and Fay and Wu's H) were performed using coalescent simulations under a standard neutral model and a population structure model to differentiate selection from human demographic history. Evidence suggestive of selection was present in 5' flanking regions of 15 genes. A modified McDonald-Kreitman test was used to compare the ratio of putative functional and non-functional sites between and within species in 5' flanking regions. A significant excess or deficit of fixed changes over polymorphisms was noted in 16 genes. Of the 26 genes showing deviation from evolutionary neutrality based on the above two tests, 13 genes showed an unusual haplotype pattern in 5' flanking regions, providing supportive evidence of selection. These results indicate that selection may play a role in establishing variation in 5' flanking regions of a subset of candidate genes for ASCVD and motivate further studies of these loci in determining inter-individual susceptibility to ASCVD.
Collapse
Affiliation(s)
- Keyue Ding
- Division of Cardiovascular Diseases, Mayo Clinic and Foundation, 200 First Street SW, Rochester, MN 55905, USA
| | | |
Collapse
|
24
|
Demuth JP, Bie TD, Stajich JE, Cristianini N, Hahn MW. The evolution of mammalian gene families. PLoS One 2006; 1:e85. [PMID: 17183716 PMCID: PMC1762380 DOI: 10.1371/journal.pone.0000085] [Citation(s) in RCA: 220] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2006] [Accepted: 11/14/2006] [Indexed: 11/18/2022] Open
Abstract
Gene families are groups of homologous genes that are likely to have highly similar functions. Differences in family size due to lineage-specific gene duplication and gene loss may provide clues to the evolutionary forces that have shaped mammalian genomes. Here we analyze the gene families contained within the whole genomes of human, chimpanzee, mouse, rat, and dog. In total we find that more than half of the 9,990 families present in the mammalian common ancestor have either expanded or contracted along at least one lineage. Additionally, we find that a large number of families are completely lost from one or more mammalian genomes, and a similar number of gene families have arisen subsequent to the mammalian common ancestor. Along the lineage leading to modern humans we infer the gain of 689 genes and the loss of 86 genes since the split from chimpanzees, including changes likely driven by adaptive natural selection. Our results imply that humans and chimpanzees differ by at least 6% (1,418 of 22,000 genes) in their complement of genes, which stands in stark contrast to the oft-cited 1.5% difference between orthologous nucleotide sequences. This genomic “revolving door” of gene gain and loss represents a large number of genetic differences separating humans from our closest relatives.
Collapse
Affiliation(s)
- Jeffery P. Demuth
- Department of Biology and School of Informatics, Indiana UniversityBloomington, Indiana, United States of America
| | - Tijl De Bie
- School of Electronics and Computer Science, ISIS Group, University of SouthamptonSouthampton, United Kingdom
| | - Jason E. Stajich
- Department of Molecular Genetics and Microbiology, Duke UniversityDurham, North Carolina, United States of America
| | - Nello Cristianini
- Department of Statistics, University of California DavisDavis, California, United States of America
| | - Matthew W. Hahn
- Department of Biology and School of Informatics, Indiana UniversityBloomington, Indiana, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
25
|
Rockman MV, Hahn MW, Soranzo N, Zimprich F, Goldstein DB, Wray GA. Ancient and recent positive selection transformed opioid cis-regulation in humans. PLoS Biol 2006; 3:e387. [PMID: 16274263 PMCID: PMC1283535 DOI: 10.1371/journal.pbio.0030387] [Citation(s) in RCA: 141] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2005] [Accepted: 09/13/2005] [Indexed: 11/18/2022] Open
Abstract
Changes in the cis-regulation of neural genes likely contributed to the evolution of our species' unique attributes, but evidence of a role for natural selection has been lacking. We found that positive natural selection altered the cis-regulation of human prodynorphin, the precursor molecule for a suite of endogenous opioids and neuropeptides with critical roles in regulating perception, behavior, and memory. Independent lines of phylogenetic and population genetic evidence support a history of selective sweeps driving the evolution of the human prodynorphin promoter. In experimental assays of chimpanzee-human hybrid promoters, the selected sequence increases transcriptional inducibility. The evidence for a change in the response of the brain's natural opioids to inductive stimuli points to potential human-specific characteristics favored during evolution. In addition, the pattern of linked nucleotide and microsatellite variation among and within modern human populations suggests that recent selection, subsequent to the fixation of the human-specific mutations and the peopling of the globe, has favored different prodynorphin cis-regulatory alleles in different parts of the world.
Collapse
Affiliation(s)
- Matthew V Rockman
- Department of Biology, Duke University, Durham, North Carolina, USA.
| | | | | | | | | | | |
Collapse
|
26
|
Abstract
A new field of genetic analysis of global gene expression has emerged in recent years, driven by the realization that traditional techniques of linkage and association analysis can be applied to thousands of transcript levels measured by microarrays. Genetic dissection of transcript abundance has shed light on the architecture of quantitative traits, provided a new approach for connecting DNA sequence variation with phenotypic variation, and improved our understanding of transcriptional regulation and regulatory variation.
Collapse
Affiliation(s)
- Matthew V Rockman
- Lewis-Sigler Institute for Integrative Genomics and Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA
| | | |
Collapse
|
27
|
de Meaux J, Pop A, Mitchell-Olds T. Cis-regulatory evolution of chalcone-synthase expression in the genus Arabidopsis. Genetics 2006; 174:2181-202. [PMID: 17028316 PMCID: PMC1698642 DOI: 10.1534/genetics.106.064543] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2006] [Accepted: 09/26/2006] [Indexed: 02/06/2023] Open
Abstract
The contribution of cis-regulation to adaptive evolutionary change is believed to be essential, yet little is known about the evolutionary rules that govern regulatory sequences. Here, we characterize the short-term evolutionary dynamics of a cis-regulatory region within and among two closely related species, A. lyrata and A. halleri, and compare our findings to A. thaliana. We focused on the cis-regulatory region of chalcone synthase (CHS), a key enzyme involved in the synthesis of plant secondary metabolites. We observed patterns of nucleotide diversity that differ among species but do not depart from neutral expectations. Using intra- and interspecific F1 progeny, we have evaluated functional cis-regulatory variation in response to light and herbivory, environmental cues, which are known to induce CHS expression. We find that substantial cis-regulatory variation segregates within and among populations as well as between species, some of which results from interspecific genetic introgression. We further demonstrate that, in A. thaliana, CHS cis-regulation in response to herbivory is greater than in A. lyrata or A. halleri. Our work indicates that the evolutionary dynamics of a cis-regulatory region is characterized by pervasive functional variation, achieved mostly by modification of response modules to one but not all environmental cues. Our study did not detect the footprint of selection on this variation.
Collapse
Affiliation(s)
- Juliette de Meaux
- Genetics and Plant Breeding, Max Planck Institute for Plant Breeding Research, Cologne, Germany.
| | | | | |
Collapse
|
28
|
Donaldson IJ, Göttgens B. Evolution of candidate transcriptional regulatory motifs since the human-chimpanzee divergence. Genome Biol 2006; 7:R52. [PMID: 16808854 PMCID: PMC1779530 DOI: 10.1186/gb-2006-7-6-r52] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2006] [Revised: 06/01/2006] [Accepted: 06/09/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Despite the recent completion of the chimpanzee genome project, few functionally significant sequence differences between humans and chimpanzees have thus far been identified. Alteration in transcriptional regulatory mechanisms represents an important platform for evolutionary change, suggesting that a significant proportion of functional human-chimpanzee sequence differences may affect regulatory elements. RESULTS To explore this hypothesis, we performed genome-wide identification of conserved candidate transcription-factor binding sites that have evolved since the divergence of humans and chimpanzees. Analysis of candidate transcription-factor binding sites conserved between mouse and chimpanzee yet absent in human indicated that loss of candidate transcription-factor binding sites in the human lineage was not random but instead correlated with the biologic functions of associated genes. CONCLUSION Our data support the notion that changes in transcriptional regulation have contributed to the recent evolution of humans. Moreover, genes associated with mutated candidate transcription-factor binding sites highlight potential pathways underlying human-chimpanzee divergence.
Collapse
Affiliation(s)
- Ian J Donaldson
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 2XY, UK
| | - Berthold Göttgens
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 2XY, UK
| |
Collapse
|
29
|
Hahn MW. Detecting natural selection on cis-regulatory DNA. Genetica 2006; 129:7-18. [PMID: 16955334 DOI: 10.1007/s10709-006-0029-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2004] [Accepted: 06/25/2005] [Indexed: 10/24/2022]
Abstract
Changes in transcriptional regulation play an important role in the genetic basis for evolutionary change. Here I review a growing body of literature that seeks to determine the forces governing the non-coding regulatory sequences underlying these changes. I address the challenges present in studying natural selection without the familiar structure and regularity of protein-coding sequences, but show that most tests of neutrality that have been used for coding regions are applicable to non-coding regions, albeit with some caveats. While some experimental investment is necessary to identify heritable regulatory variation, the most basic inferences about selection require very little functional information. A growing body of research on cis-regulatory variation has uncovered all the forms of selection common to coding regions, in addition to novel forms of selection. An emerging pattern seems to be the ubiquity of local adaptation and balancing selection, possibly due to the greater freedom organisms have to fine-tune gene expression without changing protein function. It is clear from multiple single locus and whole genome studies of non-coding regulatory DNA that the effects of natural selection reach far beyond the start and stop codons.
Collapse
Affiliation(s)
- Matthew W Hahn
- Department of Biology and School of Informatics, Indiana University, Bloomington, IN, 47405, USA.
| |
Collapse
|
30
|
Moses AM, Pollard DA, Nix DA, Iyer VN, Li XY, Biggin MD, Eisen MB. Large-scale turnover of functional transcription factor binding sites in Drosophila. PLoS Comput Biol 2006; 2:e130. [PMID: 17040121 PMCID: PMC1599766 DOI: 10.1371/journal.pcbi.0020130] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2006] [Accepted: 08/21/2006] [Indexed: 11/30/2022] Open
Abstract
The gain and loss of functional transcription factor binding sites has been proposed as a major source of evolutionary change in cis-regulatory DNA and gene expression. We have developed an evolutionary model to study binding-site turnover that uses multiple sequence alignments to assess the evolutionary constraint on individual binding sites, and to map gain and loss events along a phylogenetic tree. We apply this model to study the evolutionary dynamics of binding sites of the Drosophila melanogaster transcription factor Zeste, using genome-wide in vivo (ChIP–chip) binding data to identify functional Zeste binding sites, and the genome sequences of D. melanogaster, D. simulans, D. erecta, and D. yakuba to study their evolution. We estimate that more than 5% of functional Zeste binding sites in D. melanogaster were gained along the D. melanogaster lineage or lost along one of the other lineages. We find that Zeste-bound regions have a reduced rate of binding-site loss and an increased rate of binding-site gain relative to flanking sequences. Finally, we show that binding-site gains and losses are asymmetrically distributed with respect to D. melanogaster, consistent with lineage-specific acquisition and loss of Zeste-responsive regulatory elements. Understanding the ways in which mutations in DNA result in alterations of an organism's form and function is a major goal of molecular evolutionary biology. Changes in gene expression were long-ago proposed as a source of evolutionary diversity, but it was only in the last few years that researchers described specific cases where identified changes in DNA cause differences in gene expression, which in turn affect morphology. Attention has now turned to understanding how such sequence changes produce their effect and whether additional examples of evolutionary novelty can be found by examining the growing number of available genome sequences. Moses et al. focus on transcription factor binding sites, pieces of DNA that serve as molecular switches to turn genes on and off. These switches are organized into larger units that function as molecular computers and ensure that genes are made when and where they are needed. Moses and colleagues introduce a set of new computational methods to study how these larger units of regulatory function evolve. While they find that most of these switches remain fixed in place, a substantial number are created or destroyed by mutations, yielding new insights into the evolutionary forces that shape animal morphology.
Collapse
Affiliation(s)
- Alan M Moses
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
| | - Daniel A Pollard
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
| | - David A Nix
- Department of Genome Sciences, Genomics Division, Ernest Orlando Lawrence Berkeley National Lab, Berkeley, California, United States of America
| | - Venky N Iyer
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Xiao-Yong Li
- Department of Genome Sciences, Genomics Division, Ernest Orlando Lawrence Berkeley National Lab, Berkeley, California, United States of America
| | - Mark D Biggin
- Department of Genome Sciences, Genomics Division, Ernest Orlando Lawrence Berkeley National Lab, Berkeley, California, United States of America
| | - Michael B Eisen
- Graduate Group in Biophysics, University of California Berkeley, Berkeley, California, United States of America
- Department of Genome Sciences, Genomics Division, Ernest Orlando Lawrence Berkeley National Lab, Berkeley, California, United States of America
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- Center for Integrative Genomics, University of California Berkeley, Berkeley, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
31
|
Wang QF, Prabhakar S, Wang Q, Moses AM, Chanan S, Brown M, Eisen MB, Cheng JF, Rubin EM, Boffelli D. Primate-specific evolution of an LDLR enhancer. Genome Biol 2006; 7:R68. [PMID: 16884525 PMCID: PMC1779597 DOI: 10.1186/gb-2006-7-8-r68] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2006] [Revised: 06/28/2006] [Accepted: 08/02/2006] [Indexed: 01/13/2023] Open
Abstract
Analysis of primate-specific evolution of the LDL receptor enhancer demonstrates a molecular mechanism by which ancestral mammalian regulatory elements can evolve to perform new functions. Background Sequence changes in regulatory regions have often been invoked to explain phenotypic divergence among species, but molecular examples of this have been difficult to obtain. Results In this study we identified an anthropoid primate-specific sequence element that contributed to the regulatory evolution of the low-density lipoprotein receptor. Using a combination of close and distant species genomic sequence comparisons coupled with in vivo and in vitro studies, we found that a functional cholesterol-sensing sequence motif arose and was fixed within a pre-existing enhancer in the common ancestor of anthropoid primates. Conclusion Our study demonstrates one molecular mechanism by which ancestral mammalian regulatory elements can evolve to perform new functions in the primate lineage leading to human.
Collapse
Affiliation(s)
- Qian-fei Wang
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Shyam Prabhakar
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Qianben Wang
- Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Alan M Moses
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Sumita Chanan
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Myles Brown
- Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Michael B Eisen
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Jan-Fang Cheng
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Edward M Rubin
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Dario Boffelli
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| |
Collapse
|
32
|
Fredman D, Sawyer SL, Strömqvist L, Mottagui-Tabar S, Kidd KK, Wahlestedt C, Chanock SJ, Brookes AJ. Nonsynonymous SNPs: validation characteristics, derived allele frequency patterns, and suggestive evidence for natural selection. Hum Mutat 2006; 27:173-86. [PMID: 16429399 DOI: 10.1002/humu.20289] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We experimentally investigated more than 1,200 entries in dbSNP that would change amino-acids (nsSNPs), using various subsets of DNA samples drawn from 18 global populations (approximately 1,000 subjects in total). First, we mined the data for any SNP features that correlated with a high validation rate. Useful predictors of valid SNPs included multiple submissions to dbSNP, having a dbSNP validation statement, and being present in a low number of ESTs. Together, these features improved validation rates by almost 10-fold. Higher-abundance SNPs (e.g., T/C variants) also validated more frequently. Second, we considered derived alleles and noted a considerably (approximately 10%) increased average derived allele frequency (DAF) in Europeans vs. Africans, plus a further increase in some other populations. This was not primarily due to an SNP ascertainment bias, nor to the effects of natural selection. Instead, it can be explained as a drift-based, progressive increase in DAF that occurs over many generations and becomes exaggerated during population bottlenecks. This observation could be used as the basis for novel DAF-based tests for comparing demographic histories. Finally, we considered individual marker patterns and identified 37 SNPs with allele frequency variance or FST values consistent with the effects of population-specific natural selection. Four particularly striking clusters of these markers were apparent, and three of these coincide with genes/regions from among only several dozen such domains previously suggested by others to carry signatures of selection.
Collapse
Affiliation(s)
- David Fredman
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
Variation among populations in gene expression should be related to the accumulation of random-neutral changes and evolution by natural selection. The following evolutionary analysis has general applicability to biological and medical science because it accounts for genetic relatedness and identifies patterns of expression variation that are affected by natural selection. To identify genes evolving by natural selection, we allocate the maximum among-population variation to genetic distance and then examine the remaining variation relative to a hypothesized important ecological parameter (temperature). These analyses measure the expression of metabolic genes in common-gardened populations of the fish Fundulus heteroclitus whose habitat is distributed along a steep thermal gradient. Although much of the variation in gene expression fits a null model of neutral drift, the variation in expression for 22% of the genes that regress with habitat temperature was far greater than could be accounted for by genetic distance alone. The most parsimonious explanation for among-population variation for these genes is evolution by natural selection. In addition, many metabolic genes have patterns of variation incongruent with neutral evolution: They have too much or too little variation. These patterns of biological variation in expression may reflect important physiological or ecological functions.
Collapse
Affiliation(s)
- Andrew Whitehead
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA.
| | | |
Collapse
|
34
|
Perry GH, Verrelli BC, Stone AC. Molecular evolution of the primate developmental genes MSX1 and PAX9. Mol Biol Evol 2005; 23:644-54. [PMID: 16326750 DOI: 10.1093/molbev/msj072] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In primates, the craniofacial skeleton and the dentition are marked by high levels of interspecific variation. Despite this, there are few comparative species studies conducted at the molecular level to investigate this functional diversity. We have determined nucleotide sequences of MSX1 and PAX9, two developmental genes, in a sample of 27 diverse primate species in order to identify coding or regulatory variation that may be associated with phenotypic diversity. Our analyses have identified four highly conserved noncoding sequences, including one that is conserved across primates and with dogs but not with mice. Although we find that substitution rates vary significantly across MSX1 exons, comparisons of nonsynonymous and synonymous substitution rates (dN/dS) suggest that, as a whole, MSX1 and PAX9 amino acid sequences have been under functional constraint throughout primate evolution. Compared to all other primates in our sample, our analysis of exon 1 in MSX1 finds an unusual pattern of amino acid substitution for Tarsius syrichta, a member of a lineage (tarsiers) that has many unique features among primates. For example, tarsiers are the only extant primates without deciduous incisors, and MSX1 is expressed exclusively in the incisor regions during the earliest stages of dental development. Our overall results provide insight into the utility of comparative species analyses of highly conserved developmental genes and their roles in the evolution of complex phenotypes.
Collapse
Affiliation(s)
- George H Perry
- School of Human Evolution and Social Change, Arizona State University, USA
| | | | | |
Collapse
|
35
|
Sabater-Lleal M, Soria JM, Bertranpetit J, Almasy L, Blangero J, Fontcuberta J, Calafell F. Human F7 sequence is split into three deep clades that are related to FVII plasma levels. Hum Genet 2005; 118:741-51. [PMID: 16292673 DOI: 10.1007/s00439-005-0045-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Accepted: 08/01/2005] [Indexed: 01/04/2023]
Abstract
It is widely accepted that FVII levels are strongly, consistently, and independently related to cardiovascular risk. These levels are influenced by genetic and environmental factors. Among the genetic factors, only a limited number of polymorphisms in the F7 gene have been reported, and they explain only a small proportion of the genetic variability. Recently, we have accomplished the complete dissection of the F7 quantitative trait locus responsible for all of the genetic variability observed in FVII levels. Now, we present the thorough study of the haplotype organization of F7 DNA sequence variation among individuals and the evolutionary processes that produced this variation, by sequencing 15 kb of genomic DNA sequence from the F7 locus in 40 unrelated individual (80 chromosomes) from the genetic analysis of idiopathic thrombophilia (GAIT) project as well as four non-human primate species. Our study revealed 49 polymorphisms, of which 39 SNPs were further considered. Genotyping of these DNA variations in the whole family-based GAIT sample helped resolve linkage phases, and a total of 37 distinct haplotypes were identified.Tajima's D was significantly positive in this sample, suggesting balancing selection. This parameter was a reflection of the phylogenetic structure of F7 haplotype, which was deeply split into three well-supported clades or haplogroups, suggesting that functional differences among F7 variants do not depend on a few single-site variations. Moreover, haplogroup 2 was associated with high FVII levels and haplogroup 3 with low levels. In this study, we have for the first time established a clear relation between genotypic variability structure and phenotypic variability of a particular quantitative trait involved in a complex disease.
Collapse
Affiliation(s)
- Maria Sabater-Lleal
- Unitat d'Hemostàsia i Trombosi, Hospital de la Santa Creu i Sant Pau, 08005 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
36
|
|
37
|
Abstract
Emerging knowledge about organismal evolution suggests that changes in the regulation of gene expression have played a major role - a thesis proposed 30 years ago by King and Wilson.
Collapse
Affiliation(s)
- Sean B Carroll
- Howard Hughes Medical Institute, Laboratory of Molecular Biology, University of Wisconsin, Madison, Wisconsin, USA.
| |
Collapse
|
38
|
Abstract
Elucidating the genetic basis of adaptive population divergence is a goal of central importance in evolutionary biology. In principle, it should be possible to identify chromosomal regions involved in adaptive divergence by screening genome-wide patterns of DNA polymorphism to detect the locus-specific signature of positive directional selection. In the case of spatially separated populations that inhabit different environments or sympatric populations that exploit different ecological niches, it is possible to identify loci that underlie divergently selected traits by comparing relative levels of differentiation among large numbers of unlinked markers. In this review I first address the question of whether diversifying selection on polygenic traits can be expected to produce predictable patterns of allelic variation at the underlying quantitative trait loci (QTL), and whether the locus-specific effects of selection can be reliably detected against the genome-wide backdrop of stochastic variability. I then review different approaches that have been developed to identify loci involved in adaptive population divergence and I discuss the relative merits of model-based approaches that rely on assumptions about population structure vs. model-free approaches that are based on empirical distributions of summary statistics. Finally, I consider the evolutionary and functional insights that might be gained by conducting genome scans for loci involved in adaptive population divergence.
Collapse
Affiliation(s)
- Jay F Storz
- Department of Biology, San Francisco State University, 1600 Holloway Avenue, San Francisco, CA 94132, USA.
| |
Collapse
|