1
|
Shaw DE, Naftaly AS, White MA. Positive Selection Drives cis-regulatory Evolution Across the Threespine Stickleback Y Chromosome. Mol Biol Evol 2024; 41:msae020. [PMID: 38306314 PMCID: PMC10899008 DOI: 10.1093/molbev/msae020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 12/31/2023] [Accepted: 01/24/2024] [Indexed: 02/04/2024] Open
Abstract
Allele-specific gene expression evolves rapidly on heteromorphic sex chromosomes. Over time, the accumulation of mutations on the Y chromosome leads to widespread loss of gametolog expression, relative to the X chromosome. It remains unclear if expression evolution on degrading Y chromosomes is primarily driven by mutations that accumulate through processes of selective interference, or if positive selection can also favor the down-regulation of coding regions on the Y chromosome that contain deleterious mutations. Identifying the relative rates of cis-regulatory sequence evolution across Y chromosomes has been challenging due to the limited number of reference assemblies. The threespine stickleback (Gasterosteus aculeatus) Y chromosome is an excellent model to identify how regulatory mutations accumulate on Y chromosomes due to its intermediate state of divergence from the X chromosome. A large number of Y-linked gametologs still exist across 3 differently aged evolutionary strata to test these hypotheses. We found that putative enhancer regions on the Y chromosome exhibited elevated substitution rates and decreased polymorphism when compared to nonfunctional sites, like intergenic regions and synonymous sites. This suggests that many cis-regulatory regions are under positive selection on the Y chromosome. This divergence was correlated with X-biased gametolog expression, indicating the loss of expression from the Y chromosome may be favored by selection. Our findings provide evidence that Y-linked cis-regulatory regions exhibit signs of positive selection quickly after the suppression of recombination and allow comparisons with recent theoretical models that suggest the rapid divergence of regulatory regions may be favored to mask deleterious mutations on the Y chromosome.
Collapse
Affiliation(s)
- Daniel E Shaw
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | | | - Michael A White
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
2
|
Barua A, Mikheyev AS. Toxin expression in snake venom evolves rapidly with constant shifts in evolutionary rates. Proc Biol Sci 2020; 287:20200613. [PMID: 32345154 PMCID: PMC7282918 DOI: 10.1098/rspb.2020.0613] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 03/30/2020] [Indexed: 12/21/2022] Open
Abstract
Key innovations provide ecological opportunity by enabling access to new resources, colonization of new environments, and are associated with adaptive radiation. The most well-known pattern associated with adaptive radiation is an early burst of phenotypic diversification. Venoms facilitate prey capture and are widely believed to be key innovations leading to adaptive radiation. However, few studies have estimated their evolutionary rate dynamics. Here, we test for patterns of adaptive evolution in venom gene expression data from 52 venomous snake species. By identifying shifts in tempo and mode of evolution along with models of phenotypic evolution, we show that snake venom exhibits the macroevolutionary dynamics expected of key innovations. Namely, all toxin families undergo shifts in their rates of evolution, likely in response to changes in adaptive optima. Furthermore, we show that rapid-pulsed evolution modelled as a Lévy process better fits snake venom evolution than conventional early burst or Ornstein-Uhlenbeck models. While our results support the idea of snake venom being a key innovation, the innovation of venom chemistry lacks clear mechanisms that would lead to reproductive isolation and thus adaptive radiation. Therefore, the extent to which venom directly influences the diversification process is still a matter of contention.
Collapse
Affiliation(s)
- Agneesh Barua
- Ecology and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, 1919-1 Tancha, Onna-son, Kunigami-gun, Okinawa-ken 904-0495, Japan
| | - Alexander S. Mikheyev
- Ecology and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, 1919-1 Tancha, Onna-son, Kunigami-gun, Okinawa-ken 904-0495, Japan
- Evolutionary genomics group, Australian National University, Canberra ACT 0200, Australia
| |
Collapse
|
3
|
On the specificity of gene regulatory networks: How does network co-option affect subsequent evolution? Curr Top Dev Biol 2020; 139:375-405. [PMID: 32450967 DOI: 10.1016/bs.ctdb.2020.03.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The process of multicellular organismal development hinges upon the specificity of developmental programs: for different parts of the organism to form unique features, processes must exist to specify each part. This specificity is thought to be hardwired into gene regulatory networks, which activate cohorts of genes in particular tissues at particular times during development. However, the evolution of gene regulatory networks sometimes occurs by mechanisms that sacrifice specificity. One such mechanism is network co-option, in which existing gene networks are redeployed in new developmental contexts. While network co-option may offer an efficient mechanism for generating novel phenotypes, losses of tissue specificity at redeployed network genes could restrict the ability of the affected traits to evolve independently. At present, there has not been a detailed discussion regarding how tissue specificity of network genes might be altered due to gene network co-option at its initiation, as well as how trait independence can be retained or restored after network co-option. A lack of clarity about network co-option makes it more difficult to speculate on the long-term evolutionary implications of this mechanism. In this review, we will discuss the possible initial outcomes of network co-option, outline the mechanisms by which networks may retain or subsequently regain specificity after network co-option, and comment on some of the possible evolutionary consequences of network co-option. We place special emphasis on the need to consider selectively-neutral outcomes of network co-option to improve our understanding of the role of this mechanism in trait evolution.
Collapse
|
4
|
Catalán A, Briscoe AD, Höhna S. Drift and Directional Selection Are the Evolutionary Forces Driving Gene Expression Divergence in Eye and Brain Tissue of Heliconius Butterflies. Genetics 2019; 213:581-594. [PMID: 31467133 PMCID: PMC6781903 DOI: 10.1534/genetics.119.302493] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 08/24/2019] [Indexed: 01/05/2023] Open
Abstract
Investigating gene expression evolution over micro- and macroevolutionary timescales will expand our understanding of the role of gene expression in adaptation and speciation. In this study, we characterized the evolutionary forces acting on gene expression levels in eye and brain tissue of five Heliconius butterflies with divergence times of ∼5-12 MYA. We developed and applied Brownian motion (BM) and Ornstein-Uhlenbeck (OU) models to identify genes whose expression levels are evolving through drift, stabilizing selection, or a lineage-specific shift. We found that 81% of the genes evolve under genetic drift. When testing for branch-specific shifts in gene expression, we detected 368 (16%) shift events. Genes showing a shift toward upregulation have significantly lower gene expression variance than those genes showing a shift leading toward downregulation. We hypothesize that directional selection is acting in shifts causing upregulation, since transcription is costly. We further uncovered through simulations that parameter estimation of OU models is biased when using small phylogenies and only becomes reliable with phylogenies having ≥ 50 taxa. Therefore, we developed a new statistical test based on BM to identify highly conserved genes (i.e., evolving under strong stabilizing selection), which comprised 3% of the orthoclusters. In conclusion, we found that drift is the dominant evolutionary force driving gene expression evolution in eye and brain tissue in Heliconius Nevertheless, the higher proportion of genes evolving under directional than under stabilizing selection might reflect species-specific selective pressures on vision and the brain that are necessary to fulfill species-specific requirements.
Collapse
Affiliation(s)
- Ana Catalán
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, 75236, Sweden
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
| | - Adriana D Briscoe
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697
| | - Sebastian Höhna
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
- Department of Earth and Environmental Sciences, Paleontology and Geobiology, 80333 Munich, Germany
- GeoBio-Center, Ludwig-Maximilians-Universität München, 80333 Munich, Germany
| |
Collapse
|
5
|
Assis R. Lineage-Specific Expression Divergence in Grasses Is Associated with Male Reproduction, Host-Pathogen Defense, and Domestication. Genome Biol Evol 2019; 11:207-219. [PMID: 30398650 PMCID: PMC6331041 DOI: 10.1093/gbe/evy245] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/03/2018] [Indexed: 02/02/2023] Open
Abstract
Poaceae (grasses) is an agriculturally important and widely distributed family of plants with extraordinary phenotypic diversity, much of which was generated under recent lineage-specific evolution. Yet, little is known about the genes and functional modules involved in the lineage-specific divergence of grasses. Here, I address this question on a genome-wide scale by applying a novel branch-based statistic of lineage-specific expression divergence, LED, to RNA-seq data from nine tissues of the wild grass Brachypodium distachyon and its domesticated relatives Oryza sativa japonica (rice) and Sorghum bicolor (sorghum). I find that LED is generally smallest in B. distachyon and largest in O. sativa japonica, which underwent domestication earlier than S. bicolor, supporting the hypothesis that domestication may increase the rate of lineage-specific expression divergence in grasses. Moreover, in all three species, LED is positively correlated with protein-coding sequence divergence and tissue specificity, and negatively correlated with network connectivity. Further analysis reveals that genes with large LED are often primarily expressed in anther, implicating lineage-specific expression divergence in the evolution of male reproductive phenotypes. Gene ontology enrichment analysis also identifies an overrepresentation of terms related to male reproduction in the two domesticated grasses, as well as to those involved in host-pathogen defense in all three species. Last, examinations of genes with the largest LED reveal that their lineage-specific expression divergence may have contributed to antimicrobial functions in B. distachyon, to enhanced adaptation and yield during domestication in O. sativa japonica, and to defense against a widespread and devastating fungal pathogen in S. bicolor. Together, these findings suggest that lineage-specific expression divergence in grasses may increase under domestication and preferentially target rapidly evolving genes involved in male reproduction, host-pathogen defense, and the origin of domesticated phenotypes.
Collapse
Affiliation(s)
- Raquel Assis
- Department of Biology, Pennsylvania State University, University Park
| |
Collapse
|
6
|
Hodgins-Davis A, Rice DP, Townsend JP. Gene Expression Evolves under a House-of-Cards Model of Stabilizing Selection. Mol Biol Evol 2015; 32:2130-40. [PMID: 25901014 PMCID: PMC4592357 DOI: 10.1093/molbev/msv094] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Divergence in gene regulation is hypothesized to underlie much of phenotypic evolution, but the role of natural selection in shaping the molecular phenotype of gene expression continues to be debated. To resolve the mode of gene expression, evolution requires accessible theoretical predictions for the effect of selection over long timescales. Evolutionary quantitative genetic models of phenotypic evolution can provide such predictions, yet those predictions depend on the underlying hypotheses about the distributions of mutational and selective effects that are notoriously difficult to disentangle. Here, we draw on diverse genomic data sets including expression profiles of natural genetic variation and mutation accumulation lines, empirical estimates of genomic mutation rates, and inferences of genetic architecture to differentiate contrasting hypotheses for the roles of stabilizing selection and mutation in shaping natural expression variation. Our analysis suggests that gene expression evolves in a domain of phenotype space well fit by the House-of-Cards (HC) model. Although the strength of selection inferred is sensitive to the number of loci controlling gene expression, the model is not. The consistency of these results across evolutionary time from budding yeast through fruit fly implies that this model is general and that mutational effects on gene expression are relatively large. Empirical estimates of the genetic architecture of gene expression traits imply that selection provides modest constraints on gene expression levels for most genes, but that the potential for regulatory evolution is high. Our prediction using data from laboratory environments should encourage the collection of additional data sets allowing for more nuanced parameterizations of HC models for gene expression.
Collapse
Affiliation(s)
- Andrea Hodgins-Davis
- Department of Ecology and Evolutionary Biology, Yale University Department of Biostatistics, School of Public Health, Yale University
| | - Daniel P Rice
- Department of Ecology and Evolutionary Biology, Yale University Department of Organismic and Evolutionary Biology, Harvard University
| | - Jeffrey P Townsend
- Department of Ecology and Evolutionary Biology, Yale University Department of Biostatistics, School of Public Health, Yale University Program in Computational Biology and Bioinformatics, Yale University
| |
Collapse
|
7
|
Sensitivity of quantitative traits to mutational effects and number of loci. Theor Popul Biol 2015; 102:85-93. [PMID: 25840144 DOI: 10.1016/j.tpb.2015.03.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Revised: 03/20/2015] [Accepted: 03/24/2015] [Indexed: 12/18/2022]
Abstract
When models of quantitative genetic variation are built from population genetic first principles, several assumptions are often made. One of the most important assumptions is that traits are controlled by many genes of small effect. This leads to a prediction of a Gaussian trait distribution in the population, via the Central Limit Theorem. Since these biological assumptions are often unknown or untrue, we characterized how finite numbers of loci or large mutational effects can impact the sampling distribution of a quantitative trait. To do so, we developed a neutral coalescent-based framework, allowing us to gain a detailed understanding of how number of loci and the underlying mutational model impacts the distribution of a quantitative trait. Through both analytical theory and simulation we found the normality assumption was highly sensitive to the details of the mutational process, with the greatest discrepancies arising when the number of loci was small or the mutational kernel was heavy-tailed. In particular, skewed mutational effects will produce skewed trait distributions and fat-tailed mutational kernels result in multimodal sampling distributions, even for traits controlled by a large number of loci. Since selection models and robust neutral models may produce qualitatively similar sampling distributions, we advise extra caution should be taken when interpreting model-based results for poorly understood systems of quantitative traits.
Collapse
|
8
|
Somel M, Rohlfs R, Liu X. Transcriptomic insights into human brain evolution: acceleration, neutrality, heterochrony. Curr Opin Genet Dev 2014; 29:110-9. [PMID: 25233113 DOI: 10.1016/j.gde.2014.09.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Revised: 08/31/2014] [Accepted: 09/01/2014] [Indexed: 01/09/2023]
Abstract
Primate brain transcriptome comparisons within the last 12 years have yielded interesting but contradictory observations on how the transcriptome evolves, and its adaptive role in human cognitive evolution. Since the human-chimpanzee common ancestor, the human prefrontal cortex transcriptome seems to have evolved more than that of the chimpanzee. But at the same time, most expression differences among species, especially those observed in adults, appear as consequences of neutral evolution at cis-regulatory sites. Adaptive expression changes in the human brain may be rare events involving timing shifts, or heterochrony, in specific neurodevelopmental processes. Disentangling adaptive and neutral expression changes, and associating these with human-specific features of the brain require improved methods, comparisons across more species, and further work on comparative development.
Collapse
Affiliation(s)
- Mehmet Somel
- Department of Biology, Middle East Technical University, Ankara, Turkey.
| | - Rori Rohlfs
- Department of Integrative Biology, University of California, Berkeley, CA, USA
| | - Xiling Liu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Shanghai, China
| |
Collapse
|
9
|
8.2% of the Human genome is constrained: variation in rates of turnover across functional element classes in the human lineage. PLoS Genet 2014; 10:e1004525. [PMID: 25057982 PMCID: PMC4109858 DOI: 10.1371/journal.pgen.1004525] [Citation(s) in RCA: 133] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 06/05/2014] [Indexed: 01/27/2023] Open
Abstract
Ten years on from the finishing of the human reference genome sequence, it remains unclear what fraction of the human genome confers function, where this sequence resides, and how much is shared with other mammalian species. When addressing these questions, functional sequence has often been equated with pan-mammalian conserved sequence. However, functional elements that are short-lived, including those contributing to species-specific biology, will not leave a footprint of long-lasting negative selection. Here, we address these issues by identifying and characterising sequence that has been constrained with respect to insertions and deletions for pairs of eutherian genomes over a range of divergences. Within noncoding sequence, we find increasing amounts of mutually constrained sequence as species pairs become more closely related, indicating that noncoding constrained sequence turns over rapidly. We estimate that half of present-day noncoding constrained sequence has been gained or lost in approximately the last 130 million years (half-life in units of divergence time, d1/2 = 0.25–0.31). While enriched with ENCODE biochemical annotations, much of the short-lived constrained sequences we identify are not detected by models optimized for wider pan-mammalian conservation. Constrained DNase 1 hypersensitivity sites, promoters and untranslated regions have been more evolutionarily stable than long noncoding RNA loci which have turned over especially rapidly. By contrast, protein coding sequence has been highly stable, with an estimated half-life of over a billion years (d1/2 = 2.1–5.0). From extrapolations we estimate that 8.2% (7.1–9.2%) of the human genome is presently subject to negative selection and thus is likely to be functional, while only 2.2% has maintained constraint in both human and mouse since these species diverged. These results reveal that the evolutionary history of the human genome has been highly dynamic, particularly for its noncoding yet biologically functional fraction. Nearly 99% of the human genome does not encode proteins, and while there recently has been extensive biochemical annotation of the remaining noncoding fraction, it remains unclear whether or not the bulk of these DNA sequences have important functional roles. By comparing the genome sequences of different species we identify genomic regions that have evolved unexpectedly slowly, a signature of natural selection upon functional sequence. Using a high resolution evolutionary approach to find sequence showing evolutionary signatures of functionality we estimate that a total of 8.2% (7.1–9.2%) of the human genome is presently functional, more than three times as much than is functional and shared between human and mouse. This implies that there is an abundance of sequences with short lived lineage-specific functionality. As expected, most of the sequence involved in this functional “turnover” is noncoding, while protein coding sequence is stably preserved over longer evolutionary timescales. More generally, we find that the rate of functional turnover varies significantly across categories of functional noncoding elements. Our results provide a pan-mammalian and whole genome perspective on how rapidly different classes of sequence have gained and lost functionality down the human lineage.
Collapse
|
10
|
Rohlfs RV, Harrigan P, Nielsen R. Modeling gene expression evolution with an extended Ornstein-Uhlenbeck process accounting for within-species variation. Mol Biol Evol 2014; 31:201-11. [PMID: 24113538 PMCID: PMC3879452 DOI: 10.1093/molbev/mst190] [Citation(s) in RCA: 82] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Much of the phenotypic variation observed between even closely related species may be driven by differences in gene expression levels. The current availability of reliable techniques like RNA-Seq, which can quantify expression levels across species, has enabled comparative studies. Ornstein-Uhlenbeck (OU) processes have been proposed to model gene expression evolution as they model both random drift and stabilizing selection and can be extended to model changes in selection regimes. The OU models provide a statistical framework that allows comparisons of specific hypotheses of selective regimes, including random drift, constrained drift, and expression level shifts. In this way, inferences may be made about the mode of selection acting on the expression level of a gene. We augment this model to include within-species expression variance, allowing for modeling of nonevolutionary expression variance that could be caused by individual genetic, environmental, or technical variation. Through simulations, we explore the reliability of parameter estimates and the extent to which different selective regimes can be distinguished using phylogenies of varying size using both the typical OU model and our extended model. We find that if individual variation is not accounted for, nonevolutionary expression variation is often mistaken for strong stabilizing selection. The methods presented in this article are increasingly relevant as comparative expression data becomes more available and researchers turn to expression as a primary evolving phenotype.
Collapse
Affiliation(s)
- Rori V. Rohlfs
- Department of Integrative Biology, University of California, Berkeley
| | - Patrick Harrigan
- Division of Bioinformatics, University of California, San Francisco
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California, Berkeley
| |
Collapse
|
11
|
Schraiber JG, Mostovoy Y, Hsu TY, Brem RB. Inferring evolutionary histories of pathway regulation from transcriptional profiling data. PLoS Comput Biol 2013; 9:e1003255. [PMID: 24130471 PMCID: PMC3794907 DOI: 10.1371/journal.pcbi.1003255] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Accepted: 08/20/2013] [Indexed: 01/09/2023] Open
Abstract
One of the outstanding challenges in comparative genomics is to interpret the evolutionary importance of regulatory variation between species. Rigorous molecular evolution-based methods to infer evidence for natural selection from expression data are at a premium in the field, and to date, phylogenetic approaches have not been well-suited to address the question in the small sets of taxa profiled in standard surveys of gene expression. We have developed a strategy to infer evolutionary histories from expression profiles by analyzing suites of genes of common function. In a manner conceptually similar to molecular evolution models in which the evolutionary rates of DNA sequence at multiple loci follow a gamma distribution, we modeled expression of the genes of an a priori-defined pathway with rates drawn from an inverse gamma distribution. We then developed a fitting strategy to infer the parameters of this distribution from expression measurements, and to identify gene groups whose expression patterns were consistent with evolutionary constraint or rapid evolution in particular species. Simulations confirmed the power and accuracy of our inference method. As an experimental testbed for our approach, we generated and analyzed transcriptional profiles of four Saccharomyces yeasts. The results revealed pathways with signatures of constrained and accelerated regulatory evolution in individual yeasts and across the phylogeny, highlighting the prevalence of pathway-level expression change during the divergence of yeast species. We anticipate that our pathway-based phylogenetic approach will be of broad utility in the search to understand the evolutionary relevance of regulatory change. Comparative transcriptomic studies routinely identify thousands of genes differentially expressed between species. The central question in the field is whether and how such regulatory changes have been the product of natural selection. Can the signal of evolutionarily relevant expression divergence be detected amid the noise of changes resulting from genetic drift? Our work develops a theory of gene expression variation among a suite of genes that function together. We derive a formalism that relates empirical observations of expression of pathway genes in divergent species to the underlying strength of natural selection on expression output. We show that fitting this type of model to simulated data accurately recapitulates the parameters used to generate the simulation. We then make experimental measurements of gene expression in a panel of single-celled eukaryotic yeast species. To these data we apply our inference method, and identify pathways with striking evidence for accelerated or constrained regulatory evolution, in particular species and across the phylogeny. Our method provides a key advance over previous approaches in that it maximizes the power of rigorous molecular-evolution analysis of regulatory variation even when data are relatively sparse. As such, the theory and tools we have developed will likely find broad application in the field of comparative genomics.
Collapse
Affiliation(s)
- Joshua G. Schraiber
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Yulia Mostovoy
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Tiffany Y. Hsu
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Rachel B. Brem
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, California, United States of America
- * E-mail:
| |
Collapse
|
12
|
Paris M, Kaplan T, Li XY, Villalta JE, Lott SE, Eisen MB. Extensive divergence of transcription factor binding in Drosophila embryos with highly conserved gene expression. PLoS Genet 2013; 9:e1003748. [PMID: 24068946 PMCID: PMC3772039 DOI: 10.1371/journal.pgen.1003748] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2013] [Accepted: 07/10/2013] [Indexed: 11/19/2022] Open
Abstract
To better characterize how variation in regulatory sequences drives divergence in gene expression, we undertook a systematic study of transcription factor binding and gene expression in blastoderm embryos of four species, which sample much of the diversity in the 40 million-year old genus Drosophila: D. melanogaster, D. yakuba, D. pseudoobscura and D. virilis. We compared gene expression, measured by mRNA-seq, to the genome-wide binding, measured by ChIP-seq, of four transcription factors involved in early anterior-posterior patterning. We found that mRNA levels are much better conserved than individual transcription factor binding events, and that changes in a gene's expression were poorly explained by changes in adjacent transcription factor binding. However, highly bound sites, sites in regions bound by multiple factors and sites near genes are conserved more frequently than other binding, suggesting that a considerable amount of transcription factor binding is weakly or non-functional and not subject to purifying selection.
Collapse
Affiliation(s)
- Mathilde Paris
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Tommy Kaplan
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel
| | - Xiao Yong Li
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America
| | | | - Susan E. Lott
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- Department of Evolution and Ecology, University of California, Davis, California, United States of America
| | - Michael B. Eisen
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America
| |
Collapse
|
13
|
Landis MJ, Schraiber JG, Liang M. Phylogenetic analysis using Lévy processes: finding jumps in the evolution of continuous traits. Syst Biol 2012; 62:193-204. [PMID: 23034385 DOI: 10.1093/sysbio/sys086] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Gaussian processes, a class of stochastic processes including Brownian motion and the Ornstein-Uhlenbeck process, are widely used to model continuous trait evolution in statistical phylogenetics. Under such processes, observations at the tips of a phylogenetic tree have a multivariate Gaussian distribution, which may lead to suboptimal model specification under certain evolutionary conditions, as supposed in models of punctuated equilibrium or adaptive radiation. To consider non-normally distributed continuous trait evolution, we introduce a method to compute posterior probabilities when modeling continuous trait evolution as a Lévy process. Through data simulation and model testing, we establish that single-rate Brownian motion (BM) and Lévy processes with jumps generate distinct patterns in comparative data. We then analyzed body mass and endocranial volume measurements for 126 primates. We rejected single-rate BM in favor of a Lévy process with jumps for each trait, with the lineage leading to most recent common ancestor of great apes showing particularly strong evidence against single-rate BM.
Collapse
Affiliation(s)
- Michael J Landis
- Department of Integrative Biology, University of California, Berkeley, CA 94720-3140, USA
| | | | | |
Collapse
|
14
|
Conservation and divergence in Toll-like receptor 4-regulated gene expression in primary human versus mouse macrophages. Proc Natl Acad Sci U S A 2012; 109:E944-53. [PMID: 22451944 DOI: 10.1073/pnas.1110156109] [Citation(s) in RCA: 235] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Evolutionary change in gene expression is generally considered to be a major driver of phenotypic differences between species. We investigated innate immune diversification by analyzing interspecies differences in the transcriptional responses of primary human and mouse macrophages to the Toll-like receptor (TLR)-4 agonist lipopolysaccharide (LPS). By using a custom platform permitting cross-species interrogation coupled with deep sequencing of mRNA 5' ends, we identified extensive divergence in LPS-regulated orthologous gene expression between humans and mice (24% of orthologues were identified as "divergently regulated"). We further demonstrate concordant regulation of human-specific LPS target genes in primary pig macrophages. Divergently regulated orthologues were enriched for genes encoding cellular "inputs" such as cell surface receptors (e.g., TLR6, IL-7Rα) and functional "outputs" such as inflammatory cytokines/chemokines (e.g., CCL20, CXCL13). Conversely, intracellular signaling components linking inputs to outputs were typically concordantly regulated. Functional consequences of divergent gene regulation were confirmed by showing LPS pretreatment boosts subsequent TLR6 responses in mouse but not human macrophages, in keeping with mouse-specific TLR6 induction. Divergently regulated genes were associated with a large dynamic range of gene expression, and specific promoter architectural features (TATA box enrichment, CpG island depletion). Surprisingly, regulatory divergence was also associated with enhanced interspecies promoter conservation. Thus, the genes controlled by complex, highly conserved promoters that facilitate dynamic regulation are also the most susceptible to evolutionary change.
Collapse
|
15
|
Harrison PW, Wright AE, Mank JE. The evolution of gene expression and the transcriptome-phenotype relationship. Semin Cell Dev Biol 2011; 23:222-9. [PMID: 22210502 DOI: 10.1016/j.semcdb.2011.12.004] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2011] [Revised: 12/09/2011] [Accepted: 12/16/2011] [Indexed: 11/26/2022]
Abstract
Changes in gene expression underlie the adaptive evolution in many complex phenotypes, and the recent increase in the availability of multi-species comparative transcriptome data has made it possible to scan whole transcriptomes for loci that have experienced adaptive changes in expression. However, despite the increase in data availability, current models of gene expression evolution often do not account for the complexities and inherent noise associated with transcriptome data. Additionally, in contrast to current models of gene sequence evolution, models of transcriptome evolution often lack the sophistication to effectively determine whether transcriptional differences between species or within a clade are the result of neutral or adaptive processes. In this review, we discuss the tools, methods and models that define our current understanding of the relationship between gene expression and complex phenotype evolution. Our goal is to summarize what we know about the evolution of global gene expression patterns underlying complex traits, as well to identify some of the questions that remain to be answered.
Collapse
Affiliation(s)
- Peter W Harrison
- University of Oxford, Edward Grey institute, Department of Zoology, South Parks Road, Oxford OX1 3PS, United Kingdom
| | | | | |
Collapse
|