1
|
Extreme purifying selection against point mutations in the human genome. Nat Commun 2022; 13:4312. [PMID: 35879308 PMCID: PMC9314448 DOI: 10.1038/s41467-022-31872-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 07/07/2022] [Indexed: 12/13/2022] Open
Abstract
Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of “ultraselection” by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we find abundant ultraselection in evolutionarily ancient miRNAs and neuronal protein-coding genes, as well as at splice sites. By contrast, we find much less ultraselection in other noncoding RNAs and transcription factor binding sites, and only modest levels in ultraconserved elements. We estimate that ~0.4–0.7% of the human genome is ultraselected, implying ~ 0.26–0.51 strongly deleterious mutations per generation. Overall, our study sheds new light on the genome-wide distribution of fitness effects by combining deep sequencing data and classical theory from population genetics. Previous work has investigated selection in the coding genome, but it is not as well characterized in the non-coding genome. By analyzing rare variants in 70k genome sequences from gnomAD, the authors detect very strong purifying selection ("ultraselection”) across the human genome, finding it in some microRNAs and coding sequences but generally rare in regulatory sequences.
Collapse
|
2
|
Grieneisen L, Muehlbauer AL, Blekhman R. Microbial control of host gene regulation and the evolution of host-microbiome interactions in primates. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190598. [PMID: 32772669 PMCID: PMC7435160 DOI: 10.1098/rstb.2019.0598] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/19/2020] [Indexed: 12/23/2022] Open
Abstract
Recent comparative studies have found evidence consistent with the action of natural selection on gene regulation across primate species. Other recent work has shown that the microbiome can regulate host gene expression in a wide range of relevant tissues, leading to downstream effects on immunity, metabolism and other biological systems in the host. In primates, even closely related host species can have large differences in microbiome composition. One potential consequence of these differences is that host species-specific microbial traits could lead to differences in gene expression that influence primate physiology and adaptation to local environments. Here, we will discuss and integrate recent findings from primate comparative genomics and microbiome research, and explore the notion that the microbiome can influence host evolutionary dynamics by affecting gene regulation across primate host species. This article is part of the theme issue 'The role of the microbiome in host evolution'.
Collapse
Affiliation(s)
- Laura Grieneisen
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - Amanda L. Muehlbauer
- Department of Ecology, Evolution and Behavior, University of Minnesota, Minneapolis, MN 55455, USA
| | - Ran Blekhman
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Ecology, Evolution and Behavior, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
3
|
Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet 2017; 49:618-624. [PMID: 28288115 PMCID: PMC5395419 DOI: 10.1038/ng.3810] [Citation(s) in RCA: 209] [Impact Index Per Article: 29.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 02/13/2017] [Indexed: 12/17/2022]
Abstract
Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which, therefore, are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the 'big data' available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters.
Collapse
Affiliation(s)
- Yi-Fei Huang
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Brad Gulko
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA.,Graduate Field of Computer Science, Cornell University, Ithaca, New York, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| |
Collapse
|
4
|
Siepel A, Arbiza L. Cis-regulatory elements and human evolution. Curr Opin Genet Dev 2014; 29:81-9. [PMID: 25218861 PMCID: PMC4258466 DOI: 10.1016/j.gde.2014.08.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Revised: 08/17/2014] [Accepted: 08/23/2014] [Indexed: 11/20/2022]
Abstract
Modification of gene regulation has long been considered an important force in human evolution, particularly through changes to cis-regulatory elements (CREs) that function in transcriptional regulation. For decades, however, the study of cis-regulatory evolution was severely limited by the available data. New data sets describing the locations of CREs and genetic variation within and between species have now made it possible to study CRE evolution much more directly on a genome-wide scale. Here, we review recent research on the evolution of CREs in humans based on large-scale genomic data sets. We consider inferences based on primate divergence, human polymorphism, and combinations of divergence and polymorphism. We then consider 'new frontiers' in this field stemming from recent research on transcriptional regulation.
Collapse
Affiliation(s)
- Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14853, USA.
| | - Leonardo Arbiza
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
5
|
Necsulea A, Kaessmann H. Evolutionary dynamics of coding and non-coding transcriptomes. Nat Rev Genet 2014; 15:734-48. [PMID: 25297727 DOI: 10.1038/nrg3802] [Citation(s) in RCA: 147] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Gene expression changes may underlie much of phenotypic evolution. The development of high-throughput RNA sequencing protocols has opened the door to unprecedented large-scale and cross-species transcriptome comparisons by allowing accurate and sensitive assessments of transcript sequences and expression levels. Here, we review the initial wave of the new generation of comparative transcriptomic studies in mammals and vertebrate outgroup species in the context of earlier work. Together with various large-scale genomic and epigenomic data, these studies have unveiled commonalities and differences in the dynamics of gene expression evolution for various types of coding and non-coding genes across mammalian lineages, organs, developmental stages, chromosomes and sexes. They have also provided intriguing new clues to the regulatory basis and phenotypic implications of evolutionary gene expression changes.
Collapse
Affiliation(s)
- Anamaria Necsulea
- Laboratory of Developmental Genomics, School of Life Sciences, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Henrik Kaessmann
- 1] Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland. [2] Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
6
|
Whittle CA, Sun Y, Johannesson H. Dynamics of transcriptome evolution in the model eukaryote Neurospora. J Evol Biol 2014; 27:1125-35. [PMID: 24848562 DOI: 10.1111/jeb.12386] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Revised: 03/23/2014] [Accepted: 03/28/2014] [Indexed: 12/27/2022]
Abstract
Mounting evidence indicates that changes in the transcriptome contribute significantly to the phenotypic differentiation of closely related species. Nonetheless, further genome-wide studies, spanning a broad range of organisms, are needed to decipher the factors driving transcriptome evolution. The model Neurospora (Ascomycota) comprises a simple system for empirically studying the evolutionary dynamics of the transcriptome. Here, we studied the evolution of gene expression in Neurospora crassa and Neurospora tetrasperma and show that patterns of transcriptome evolution are connected to genome evolution, tissue type and sexual identity (mating types, mat A and mat a) in these eukaryotes. Based on the comparisons of inter- and intraspecies expression divergence, our data reveal that rapid expression divergence is more apt to occur in sexual/female (SF) than vegetative/male (VM) tissues. In addition, interspecies gene expression and protein sequence divergence were strongly correlated for SF, but not VM, tissue. A correlation between transcriptome and protein evolution parallels findings from certain animals, but not yeast, and add support for the theory that expression evolution differs fundamentally among multicellular and unicellular eukaryotes. Finally, we found that sexual identity in these hermaphroditic Neurospora species is connected to interspecies expression divergence in a tissue-dependent manner: rapid divergence occurred for mat A- and mat a-biased genes from SF and VM tissues, respectively. Based on these findings, it is hypothesized that rapid interspecies transcriptome evolution is shifting the mating types of Neurospora towards distinct female and male phenotypes, that is, sexual dimorphism.
Collapse
Affiliation(s)
- C A Whittle
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden
| | | | | |
Collapse
|
7
|
Zhang Q. The role of mRNA-based duplication in the evolution of the primate genome. FEBS Lett 2013; 587:3500-7. [DOI: 10.1016/j.febslet.2013.08.042] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Revised: 08/24/2013] [Accepted: 08/30/2013] [Indexed: 12/28/2022]
|
8
|
Arbiza L, Gronau I, Aksoy BA, Hubisz MJ, Gulko B, Keinan A, Siepel A. Genome-wide inference of natural selection on human transcription factor binding sites. Nat Genet 2013; 45:723-9. [PMID: 23749186 DOI: 10.1038/ng.2658] [Citation(s) in RCA: 106] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Accepted: 05/08/2013] [Indexed: 11/09/2022]
Abstract
For decades, it has been hypothesized that gene regulation has had a central role in human evolution, yet much remains unknown about the genome-wide impact of regulatory mutations. Here we use whole-genome sequences and genome-wide chromatin immunoprecipitation and sequencing data to demonstrate that natural selection has profoundly influenced human transcription factor binding sites since the divergence of humans from chimpanzees 4-6 million years ago. Our analysis uses a new probabilistic method, called INSIGHT, for measuring the influence of selection on collections of short, interspersed noncoding elements. We find that, on average, transcription factor binding sites have experienced somewhat weaker selection than protein-coding genes. However, the binding sites of several transcription factors show clear evidence of adaptation. Several measures of selection are strongly correlated with predicted binding affinity. Overall, regulatory elements seem to contribute substantially to both adaptive substitutions and deleterious polymorphisms with key implications for human evolution and disease.
Collapse
Affiliation(s)
- Leonardo Arbiza
- Department of Biological Statistics & Computational Biology, Cornell University, Ithaca, NY, USA
| | | | | | | | | | | | | |
Collapse
|
9
|
Dupont PY, Guttin A, Issartel JP, Stepien G. Computational identification of transcriptionally co-regulated genes, validation with the four ANT isoform genes. BMC Genomics 2012; 13:482. [PMID: 22978616 PMCID: PMC3477019 DOI: 10.1186/1471-2164-13-482] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 08/16/2012] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The analysis of gene promoters is essential to understand the mechanisms of transcriptional regulation required under the effects of physiological processes, nutritional intake or pathologies. In higher eukaryotes, transcriptional regulation implies the recruitment of a set of regulatory proteins that bind on combinations of nucleotide motifs. We developed a computational analysis of promoter nucleotide sequences, to identify co-regulated genes by combining several programs that allowed us to build regulatory models and perform a crossed analysis on several databases. This strategy was tested on a set of four human genes encoding isoforms 1 to 4 of the mitochondrial ADP/ATP carrier ANT. Each isoform has a specific tissue expression profile linked to its role in cellular bioenergetics. RESULTS From their promoter sequence and from the phylogenetic evolution of these ANT genes in mammals, we constructed combinations of specific regulatory elements. These models were screened using the full human genome and databases of promoter sequences from human and several other mammalian species. For each of transcriptionally regulated ANT1, 2 and 4 genes, a set of co-regulated genes was identified and their over-expression was verified in microarray databases. CONCLUSIONS Most of the identified genes encode proteins with a cellular function and specificity in agreement with those of the corresponding ANT isoform. Our in silico study shows that the tissue specific gene expression is mainly driven by promoter regulatory sequences located up to about a thousand base pairs upstream the transcription start site. Moreover, this computational strategy on the study of regulatory pathways should provide, along with transcriptomics and metabolomics, data to construct cellular metabolic networks.
Collapse
Affiliation(s)
- Pierre-Yves Dupont
- INRA, UMR 1019, Unité de Nutrition Humaine, 63122, St Genès-Champanelle, France
- Université d'Auvergne, Unité de Nutrition Humaine, Clermont Université, BP 10448, 63000, Clermont-Ferrand, France
| | - Audrey Guttin
- Institut des Neurosciences, Equipe Nanomédecine et Cerveau, Inserm U836, 38700, La Tronche, France
- Université Joseph Fourier 1, Grenoble, 38041, France
- Plate-forme Transcriptome et Protéome Cliniques, Institut de Biologie et Pathologie, CHU Grenoble, 38043, Grenoble, France
| | - Jean-Paul Issartel
- Institut des Neurosciences, Equipe Nanomédecine et Cerveau, Inserm U836, 38700, La Tronche, France
- Université Joseph Fourier 1, Grenoble, 38041, France
- Plate-forme Transcriptome et Protéome Cliniques, Institut de Biologie et Pathologie, CHU Grenoble, 38043, Grenoble, France
- CNRS, 38042, Grenoble, France
| | - Georges Stepien
- INRA, UMR 1019, Unité de Nutrition Humaine, 63122, St Genès-Champanelle, France
- Université d'Auvergne, Unité de Nutrition Humaine, Clermont Université, BP 10448, 63000, Clermont-Ferrand, France
| |
Collapse
|
10
|
Warnefors M, Eyre-Walker A. A selection index for gene expression evolution and its application to the divergence between humans and chimpanzees. PLoS One 2012; 7:e34935. [PMID: 22529958 PMCID: PMC3329554 DOI: 10.1371/journal.pone.0034935] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2011] [Accepted: 03/09/2012] [Indexed: 12/22/2022] Open
Abstract
The importance of gene regulation in animal evolution is a matter of long-standing interest, but measuring the impact of selection on gene expression has proven a challenge. Here, we propose a selection index of gene expression as a straightforward method for assessing the mode and strength of selection operating on gene expression levels. The index is based on the widely used McDonald-Kreitman test and requires the estimation of four quantities: the within-species and between-species expression variances as well as the sequence heterozygosity and divergence of neutrally evolving sequences. We apply the method to data from human and chimpanzee lymphoblastoid cell lines and show that gene expression is in general under strong stabilizing selection. We also demonstrate how the same framework can be used to estimate the proportion of adaptive gene expression evolution.
Collapse
Affiliation(s)
- Maria Warnefors
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
11
|
DNA shape, genetic codes, and evolution. Curr Opin Struct Biol 2011; 21:342-7. [PMID: 21439813 DOI: 10.1016/j.sbi.2011.03.002] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2011] [Revised: 03/03/2011] [Accepted: 03/04/2011] [Indexed: 01/04/2023]
Abstract
Although the three-letter genetic code that maps nucleotide sequence to protein sequence is well known, there must exist other codes that are embedded in the human genome. Recent work points to sequence-dependent variation in DNA shape as one mechanism by which regulatory and other information could be encoded in DNA. Recent advances include the discovery of shape-dependent recognition of DNA that depends on minor groove width and electrostatics, the existence of overlapping codes in protein-coding regions of the genome, and evolutionary selection for compensatory changes in nucleotide composition that facilitate nucleosome occupancy. It is becoming clear that DNA shape is important to biological function, and therefore will be subject to evolutionary constraint.
Collapse
|
12
|
Harris EE. Nonadaptive processes in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2011; 143 Suppl 51:13-45. [PMID: 21086525 DOI: 10.1002/ajpa.21439] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Evolutionary biology has tended to focus on adaptive evolution by positive selection as the primum mobile of evolutionary trajectories in species while underestimating the importance of nonadaptive evolutionary processes. In this review, I describe evidence that suggests that primate and human evolution has been strongly influenced by nonadaptive processes, particularly random genetic drift and mutation. This is evidenced by three fundamental effects: a relative relaxation of selective constraints (i.e., purifying selection), a relative increase in the fixation of slightly deleterious mutations, and a general reduction in the efficacy of positive selection. These effects are observed in protein-coding, regulatory regions, and in gene expression data, as well as in an augmentation of fixation of large-scale mutations, including duplicated genes, mobile genetic elements, and nuclear mitochondrial DNA. The evidence suggests a general population-level explanation such as a reduction in effective population size (N(e)). This would have tipped the balance between the evolutionary forces of natural selection and random genetic drift toward genetic drift for variants having small selective effects. After describing these proximate effects, I describe the potential consequences of these effects for primate and human evolution. For example, an increase in the fixation of slightly deleterious mutations could potentially have led to an increase in the fixation rate of compensatory mutations that act to suppress the effects of slightly deleterious substitutions. The potential consequences of compensatory evolution for the evolution of novel gene functions and in potentially confounding the detection of positively selected genes are explored. The consequences of the passive accumulation of large-scale genomic mutations by genetic drift are unclear, though evidence suggests that new gene copies as well as insertions of transposable elements into genes can potentially lead to adaptive phenotypes. Finally, because a decrease in selective constraint at the genetic level is expected to have effects at the morphological level, I review studies that compare rates of morphological change in various mammalian and island populations where N(e) is reduced. Furthermore, I discuss evidence that suggests that craniofacial morphology in the Homo lineage has shifted from an evolutionary rate constrained by purifying selection toward a neutral evolutionary rate.
Collapse
Affiliation(s)
- Eugene E Harris
- Department of Biological Sciences and Geology, Queensborough Community College, City University of New York, Bayside, NY 10364, USA.
| |
Collapse
|
13
|
Lappalainen T, Dermitzakis ET. Evolutionary history of regulatory variation in human populations. Hum Mol Genet 2010; 19:R197-203. [DOI: 10.1093/hmg/ddq406] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
|
14
|
Giger T, Khaitovich P, Somel M, Lorenc A, Lizano E, Harris LW, Ryan MM, Lan M, Wayland MT, Bahn S, Pääbo S. Evolution of neuronal and endothelial transcriptomes in primates. Genome Biol Evol 2010; 2:284-92. [PMID: 20624733 PMCID: PMC2998193 DOI: 10.1093/gbe/evq018] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The study of gene expression evolution in vertebrates has hitherto focused on the analysis of transcriptomes in tissues of different species. However, because a tissue is made up of different cell types, and cell types differ with respect to their transcriptomes, the analysis of tissues offers a composite picture of transcriptome evolution. The isolation of individual cells from tissue sections opens up the opportunity to study gene expression evolution at the cell type level. We have stained neurons and endothelial cells in human brains by antibodies against cell type-specific marker proteins, isolated the cells using laser capture microdissection, and identified genes preferentially expressed in the two cell types. We analyze these two classes of genes with respect to their expression in 62 different human tissues, with respect to their expression in 44 human "postmortem" brains from different developmental stages and with respect to between-species brain expression differences. We find that genes preferentially expressed in neurons differ less across tissues and developmental stages than genes preferentially expressed in endothelial cells. We also observe less expression differences within primate species for neuronal transcriptomes. In stark contrast, we see more gene expression differences between humans, chimpanzees, and rhesus macaques relative to within-species differences in genes expressed preferentially in neurons than in genes expressed in endothelial cells. This suggests that neuronal and endothelial transcriptomes evolve at different rates within brain tissue.
Collapse
Affiliation(s)
- Thomas Giger
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Present address: Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- Corresponding author: E-mail:
| | - Philipp Khaitovich
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Present address: Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- Present address: Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Yue Yang Road, Shanghai, 200031, P.R. China
| | - Mehmet Somel
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Present address: Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- Present address: Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Yue Yang Road, Shanghai, 200031, P.R. China
| | - Anna Lorenc
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Max Planck Institute for Evolutionary Biology, Ploen, Germany
- Present address: Max Planck Institute for Evolutionary Biology, August-Thienemann-Strasse 2, 24306 Ploen, Germany
| | - Esther Lizano
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Present address: Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
| | - Laura W. Harris
- Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, United Kingdom
- Present address: Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, Institute of Biotechnology, University of Cambridge, Tennis Court Road, Cambridge CB21QT, United Kingdom
| | - Margaret M. Ryan
- Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, United Kingdom
- Present address: Department of Anatomy and Structural Biology, Otago School of Medical Sciences, P.O. Box 913, Dunedin, New Zealand
| | - Martin Lan
- Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, United Kingdom
- Present address: Psychiatry Department, New York Presbyterian Hospital, 525 East 68th Street, New York, NY 10065, USA
| | - Matthew T. Wayland
- Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, United Kingdom
- Present address: Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, Institute of Biotechnology, University of Cambridge, Tennis Court Road, Cambridge CB21QT, United Kingdom
| | - Sabine Bahn
- Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, United Kingdom
- Present address: Cambridge Centre for Neuropsychiatric Research, Department of Chemical Engineering and Biotechnology, Institute of Biotechnology, University of Cambridge, Tennis Court Road, Cambridge CB21QT, United Kingdom
| | - Svante Pääbo
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Present address: Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
| |
Collapse
|
15
|
Moses AM. Statistical tests for natural selection on regulatory regions based on the strength of transcription factor binding sites. BMC Evol Biol 2009; 9:286. [PMID: 19995462 PMCID: PMC2800119 DOI: 10.1186/1471-2148-9-286] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2009] [Accepted: 12/09/2009] [Indexed: 02/04/2023] Open
Abstract
Background Although cis-regulatory changes play an important role in evolution, it remains difficult to establish the contribution of natural selection to regulatory differences between species. For protein coding regions, powerful tests of natural selection have been developed based on comparisons of synonymous and non-synonymous substitutions, and analogous tests for regulatory regions would be of great utility. Results Here, tests for natural selection on regulatory regions are proposed based on nucleotide substitutions that occur in characterized transcription factor binding sites (an important type functional element within regulatory regions). In the absence of selection, these substitutions will tend to reduce the strength of existing binding sites. On the other hand, purifying selection will act to preserve the binding sites in regulatory regions, while positive selection can act to create or destroy binding sites, as well as change their strength. Using standard models of binding site strength and molecular evolution in the absence of selection, this intuition can be used to develop statistical tests for natural selection. Application of these tests to two well-characterized regulatory regions in Drosophila provides evidence for purifying selection. Conclusion This demonstrates that it is possible to develop tests for selection on regulatory regions based on the specific functional constrains on these sequences.
Collapse
Affiliation(s)
- Alan M Moses
- Department of Cell & Systems Biology, University of Toronto, Toronto, ON M5S 3B2, Canada.
| |
Collapse
|
16
|
Torgerson DG, Boyko AR, Hernandez RD, Indap A, Hu X, White TJ, Sninsky JJ, Cargill M, Adams MD, Bustamante CD, Clark AG. Evolutionary processes acting on candidate cis-regulatory regions in humans inferred from patterns of polymorphism and divergence. PLoS Genet 2009; 5:e1000592. [PMID: 19662163 PMCID: PMC2714078 DOI: 10.1371/journal.pgen.1000592] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2008] [Accepted: 07/10/2009] [Indexed: 01/30/2023] Open
Abstract
Analysis of polymorphism and divergence in the non-coding portion of the human genome yields crucial information about factors driving the evolution of gene regulation. Candidate cis-regulatory regions spanning more than 15,000 genes in 15 African Americans and 20 European Americans were re-sequenced and aligned to the chimpanzee genome in order to identify potentially functional polymorphism and to characterize and quantify departures from neutral evolution. Distortions of the site frequency spectra suggest a general pattern of selective constraint on conserved non-coding sites in the flanking regions of genes (CNCs). Moreover, there is an excess of fixed differences that cannot be explained by a Gamma model of deleterious fitness effects, suggesting the presence of positive selection on CNCs. Extensions of the McDonald-Kreitman test identified candidate cis-regulatory regions with high probabilities of positive and negative selection near many known human genes, the biological characteristics of which exhibit genome-wide trends that differ from patterns observed in protein-coding regions. Notably, there is a higher probability of positive selection in candidate cis-regulatory regions near genes expressed in the fetal brain, suggesting that a larger portion of adaptive regulatory changes has occurred in genes expressed during brain development. Overall we find that natural selection has played an important role in the evolution of candidate cis-regulatory regions throughout hominid evolution. It has been suggested that changes in gene expression may have played a more important role in the evolution of modern humans than changes in protein-coding sequences. In order to identify signatures of natural selection on candidate cis-regulatory regions, we examined single nucleotide polymorphisms obtained from the complete re-sequencing of conserved non-coding sites (CNCs) in the flanking regions of over 15,000 genes in 35 humans. Patterns of allele frequencies in CNCs indicate the presence of both positive and negative selection acting on standing variation within these candidate cis-regulatory regions, particularly for the 5′ and 3′ UTRs of genes. Gene-specific tests comparing levels of polymorphism and divergence identify several genes with strong signatures of selection on candidate cis-regulatory regions and suggest that the biological characteristics of genes subject to selection are different between coding and candidate cis-regulatory regions with respect to gene expression and function. For example, we find stronger signatures of positive selection in candidate cis-regulatory regions near genes expressed in the fetal brain, which we do not observe in a concurrent analysis on protein-coding regions. Our results suggest that both positive and negative selection have acted on candidate cis-regulatory regions and that the evolution of non-coding DNA has played an important role throughout hominid evolution.
Collapse
Affiliation(s)
- Dara G Torgerson
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|