1
|
Everman ER, Macdonald SJ. Gene expression variation underlying tissue-specific responses to copper stress in Drosophila melanogaster. G3 (BETHESDA, MD.) 2024; 14:jkae015. [PMID: 38262701 PMCID: PMC11021028 DOI: 10.1093/g3journal/jkae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/04/2024] [Accepted: 01/08/2024] [Indexed: 01/25/2024]
Abstract
Copper is one of a handful of biologically necessary heavy metals that is also a common environmental pollutant. Under normal conditions, copper ions are required for many key physiological processes. However, in excess, copper results in cell and tissue damage ranging in severity from temporary injury to permanent neurological damage. Because of its biological relevance, and because many conserved copper-responsive genes respond to nonessential heavy metal pollutants, copper resistance in Drosophila melanogaster is a useful model system with which to investigate the genetic control of the heavy metal stress response. Because heavy metal toxicity has the potential to differently impact specific tissues, we genetically characterized the control of the gene expression response to copper stress in a tissue-specific manner in this study. We assessed the copper stress response in head and gut tissue of 96 inbred strains from the Drosophila Synthetic Population Resource using a combination of differential expression analysis and expression quantitative trait locus mapping. Differential expression analysis revealed clear patterns of tissue-specific expression. Tissue and treatment specific responses to copper stress were also detected using expression quantitative trait locus mapping. Expression quantitative trait locus associated with MtnA, Mdr49, Mdr50, and Sod3 exhibited both genotype-by-tissue and genotype-by-treatment effects on gene expression under copper stress, illuminating tissue- and treatment-specific patterns of gene expression control. Together, our data build a nuanced description of the roles and interactions between allelic and expression variation in copper-responsive genes, provide valuable insight into the genomic architecture of susceptibility to metal toxicity, and highlight candidate genes for future functional characterization.
Collapse
Affiliation(s)
- Elizabeth R Everman
- School of Biological Sciences, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA
| | - Stuart J Macdonald
- Molecular Biosciences, University of Kansas, 1200 Sunnyside Ave, Lawrence, KS 66045, USA
| |
Collapse
|
2
|
Everman ER, Macdonald SJ. Gene expression variation underlying tissue-specific responses to copper stress in Drosophila melanogaster. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.12.548746. [PMID: 37503205 PMCID: PMC10370140 DOI: 10.1101/2023.07.12.548746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Copper is one of a handful of biologically necessary heavy metals that is also a common environmental pollutant. Under normal conditions, copper ions are required for many key physiological processes. However, in excess, copper quickly results in cell and tissue damage that can range in severity from temporary injury to permanent neurological damage. Because of its biological relevance, and because many conserved copper-responsive genes also respond to other non-essential heavy metal pollutants, copper resistance in Drosophila melanogaster is a useful model system with which to investigate the genetic control of the response to heavy metal stress. Because heavy metal toxicity has the potential to differently impact specific tissues, we genetically characterized the control of the gene expression response to copper stress in a tissue-specific manner in this study. We assessed the copper stress response in head and gut tissue of 96 inbred strains from the Drosophila Synthetic Population Resource (DSPR) using a combination of differential expression analysis and expression quantitative trait locus (eQTL) mapping. Differential expression analysis revealed clear patterns of tissue-specific expression, primarily driven by a more pronounced gene expression response in gut tissue. eQTL mapping of gene expression under control and copper conditions as well as for the change in gene expression following copper exposure (copper response eQTL) revealed hundreds of genes with tissue-specific local cis-eQTL and many distant trans-eQTL. eQTL associated with MtnA, Mdr49, Mdr50, and Sod3 exhibited genotype by environment effects on gene expression under copper stress, illuminating several tissue- and treatment-specific patterns of gene expression control. Together, our data build a nuanced description of the roles and interactions between allelic and expression variation in copper-responsive genes, provide valuable insight into the genomic architecture of susceptibility to metal toxicity, and highlight many candidate genes for future functional characterization.
Collapse
Affiliation(s)
- Elizabeth R Everman
- 1200 Sunnyside Ave, University of Kansas, Molecular Biosciences, Lawrence, KS 66045, USA
- 730 Van Vleet Oval, University of Oklahoma, Biology, Norman, OK 73019, USA
| | - Stuart J Macdonald
- 1200 Sunnyside Ave, University of Kansas, Molecular Biosciences, Lawrence, KS 66045, USA
- 1200 Sunnyside Ave, University of Kansas, Center for Computational Biology, Lawrence, KS 66045, USA
| |
Collapse
|
3
|
Everman ER, Macdonald SJ, Kelly JK. The genetic basis of adaptation to copper pollution in Drosophila melanogaster. Front Genet 2023; 14:1144221. [PMID: 37082199 PMCID: PMC10110907 DOI: 10.3389/fgene.2023.1144221] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 03/21/2023] [Indexed: 04/22/2023] Open
Abstract
Introduction: Heavy metal pollutants can have long lasting negative impacts on ecosystem health and can shape the evolution of species. The persistent and ubiquitous nature of heavy metal pollution provides an opportunity to characterize the genetic mechanisms that contribute to metal resistance in natural populations. Methods: We examined variation in resistance to copper, a common heavy metal contaminant, using wild collections of the model organism Drosophila melanogaster. Flies were collected from multiple sites that varied in copper contamination risk. We characterized phenotypic variation in copper resistance within and among populations using bulked segregant analysis to identify regions of the genome that contribute to copper resistance. Results and Discussion: Copper resistance varied among wild populations with a clear correspondence between resistance level and historical exposure to copper. We identified 288 SNPs distributed across the genome associated with copper resistance. Many SNPs had population-specific effects, but some had consistent effects on copper resistance in all populations. Significant SNPs map to several novel candidate genes involved in refolding disrupted proteins, energy production, and mitochondrial function. We also identified one SNP with consistent effects on copper resistance in all populations near CG11825, a gene involved in copper homeostasis and copper resistance. We compared the genetic signatures of copper resistance in the wild-derived populations to genetic control of copper resistance in the Drosophila Synthetic Population Resource (DSPR) and the Drosophila Genetic Reference Panel (DGRP), two copper-naïve laboratory populations. In addition to CG11825, which was identified as a candidate gene in the wild-derived populations and previously in the DSPR, there was modest overlap of copper-associated SNPs between the wild-derived populations and laboratory populations. Thirty-one SNPs associated with copper resistance in wild-derived populations fell within regions of the genome that were associated with copper resistance in the DSPR in a prior study. Collectively, our results demonstrate that the genetic control of copper resistance is highly polygenic, and that several loci can be clearly linked to genes involved in heavy metal toxicity response. The mixture of parallel and population-specific SNPs points to a complex interplay between genetic background and the selection regime that modifies the effects of genetic variation on copper resistance.
Collapse
Affiliation(s)
| | - Stuart J. Macdonald
- Molecular Biosciences, University of Kansas, Lawrence, KS, United States
- Center for Computational Biology, University of Kansas, Lawrence, KS, United States
| | - John K. Kelly
- Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, United States
| |
Collapse
|
4
|
Green L, Coronado-Zamora M, Radío S, Rech GE, Salces-Ortiz J, González J. The genomic basis of copper tolerance in Drosophila is shaped by a complex interplay of regulatory and environmental factors. BMC Biol 2022; 20:275. [PMID: 36482348 PMCID: PMC9733279 DOI: 10.1186/s12915-022-01479-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 11/24/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Escalation in industrialization and anthropogenic activity have resulted in an increase of pollutants released into the environment. Of these pollutants, heavy metals such as copper are particularly concerning due to their bio-accumulative nature. Due to its highly heterogeneous distribution and its dual nature as an essential micronutrient and toxic element, the genetic basis of copper tolerance is likely shaped by a complex interplay of genetic and environmental factors. RESULTS In this study, we utilized the natural variation present in multiple populations of Drosophila melanogaster collected across Europe to screen for variation in copper tolerance. We found that latitude and the degree of urbanization at the collection sites, rather than any other combination of environmental factors, were linked to copper tolerance. While previously identified copper-related genes were not differentially expressed in tolerant vs. sensitive strains, genes involved in metabolism, reproduction, and protease induction contributed to the differential stress response. Additionally, the greatest transcriptomic and physiological responses to copper toxicity were seen in the midgut, where we found that preservation of gut acidity is strongly linked to greater tolerance. Finally, we identified transposable element insertions likely to play a role in copper stress response. CONCLUSIONS Overall, by combining genome-wide approaches with environmental association analysis, and functional analysis of candidate genes, our study provides a unique perspective on the genetic and environmental factors that shape copper tolerance in natural D. melanogaster populations and identifies new genes, transposable elements, and physiological traits involved in this complex phenotype.
Collapse
Affiliation(s)
- Llewellyn Green
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| | - Marta Coronado-Zamora
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| | - Santiago Radío
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| | - Gabriel E. Rech
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| | - Judit Salces-Ortiz
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| | - Josefa González
- grid.5612.00000 0001 2172 2676Institute of Evolutionary Biology, CSIC, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
5
|
Ramnarine TJS, Grath S, Parsch J. Natural variation in the transcriptional response of Drosophila melanogaster to oxidative stress. G3-GENES GENOMES GENETICS 2021; 12:6409858. [PMID: 34747443 PMCID: PMC8727983 DOI: 10.1093/g3journal/jkab366] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 10/15/2021] [Indexed: 11/26/2022]
Abstract
Broadly distributed species must cope with diverse and changing environmental conditions, including various forms of stress. Cosmopolitan populations of Drosophila melanogaster are more tolerant to oxidative stress than those from the species’ ancestral range in sub-Saharan Africa, and the degree of tolerance is associated with an insertion/deletion polymorphism in the 3′ untranslated region of the Metallothionein A (MtnA) gene that varies clinally in frequency. We examined oxidative stress tolerance and the transcriptional response to oxidative stress in cosmopolitan and sub-Saharan African populations of D. melanogaster, including paired samples with allelic differences at the MtnA locus. We found that the effect of the MtnA polymorphism on oxidative stress tolerance was dependent on the genomic background, with the deletion allele increasing tolerance only in a northern, temperate population. Genes that were differentially expressed under oxidative stress included MtnA and other metallothioneins, as well as those involved in glutathione metabolism and other genes known to be part of the oxidative stress response or the general stress response. A gene coexpression analysis revealed further genes and pathways that respond to oxidative stress including those involved in additional metabolic processes, autophagy, and apoptosis. There was a significant overlap among the genes induced by oxidative and cold stress, which suggests a shared response pathway to these two stresses. Interestingly, the MtnA deletion was associated with consistent changes in the expression of many genes across all genomic backgrounds, regardless of the expression level of the MtnA gene itself. We hypothesize that this is an indirect effect driven by the loss of microRNA binding sites within the MtnA 3′ untranslated region.
Collapse
Affiliation(s)
- Timothy J S Ramnarine
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität (LMU) München, Planegg-Martinsried 82152, Germany
| | - Sonja Grath
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität (LMU) München, Planegg-Martinsried 82152, Germany
| | - John Parsch
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität (LMU) München, Planegg-Martinsried 82152, Germany
| |
Collapse
|
6
|
SHARMA KAMALDEV, PATIL GAURAV, KIRAN ASHA. Characterization and differential expression of sucrose and starch metabolism genes in contrasting chickpea (Cicer arietinum L.) genotypes under low temperature. J Genet 2021. [DOI: 10.1007/s12041-021-01317-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
7
|
Calatayud S, Garcia-Risco M, Palacios Ò, Capdevila M, Cañestro C, Albalat R. Tunicates Illuminate the Enigmatic Evolution of Chordate Metallothioneins by Gene Gains and Losses, Independent Modular Expansions, and Functional Convergences. Mol Biol Evol 2021; 38:4435-4448. [PMID: 34146103 PMCID: PMC8476144 DOI: 10.1093/molbev/msab184] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
To investigate novel patterns and processes of protein evolution, we have focused in the metallothioneins (MTs), a singular group of metal-binding, cysteine-rich proteins that, due to their high degree of sequence diversity, still represents a "black hole" in Evolutionary Biology. We have identified and analyzed more than 160 new MTs in nonvertebrate chordates (especially in 37 species of ascidians, 4 thaliaceans, and 3 appendicularians) showing that prototypic tunicate MTs are mono-modular proteins with a pervasive preference for cadmium ions, whereas vertebrate and cephalochordate MTs are bimodular proteins with diverse metal preferences. These structural and functional differences imply a complex evolutionary history of chordate MTs-including de novo emergence of genes and domains, processes of convergent evolution, events of gene gains and losses, and recurrent amplifications of functional domains-that would stand for an unprecedented case in the field of protein evolution.
Collapse
Affiliation(s)
- Sara Calatayud
- Departament de Genètica, Microbiologia i Estadística and Institut de Recerca de la Biodiversitat (IRBio), Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Mario Garcia-Risco
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Òscar Palacios
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Mercè Capdevila
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Cristian Cañestro
- Departament de Genètica, Microbiologia i Estadística and Institut de Recerca de la Biodiversitat (IRBio), Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Ricard Albalat
- Departament de Genètica, Microbiologia i Estadística and Institut de Recerca de la Biodiversitat (IRBio), Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| |
Collapse
|
8
|
Stephan W. Rapid Evolutionary Adaptation in Response to Selection on Quantitative Traits. Life (Basel) 2021; 11:life11080797. [PMID: 34440541 PMCID: PMC8398862 DOI: 10.3390/life11080797] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 07/29/2021] [Accepted: 08/03/2021] [Indexed: 11/25/2022] Open
Abstract
Evolutionary adaptation after sudden environmental changes can occur very rapidly. The mechanisms facilitating rapid adaptation range from strong positive directional selection leading to large shifts in the allele frequencies at a few loci (selective sweeps) to polygenic selection causing small changes in allele frequencies at many loci. In addition, combinations of these two extreme mechanisms may also result in fast evolution. In recent years, following reports of new case studies of rapid adaptation, population genetic models have been proposed to explain these observations. In these models, the role of the major selective forces (positive directional and stabilizing selection) is highlighted as well as the genetic architecture of quantitative traits. Furthermore, the factors limiting the speed of adaptation are analyzed, in particular, the effects of random genetic drift and demography due to finite population size.
Collapse
Affiliation(s)
- Wolfgang Stephan
- Natural History Museum, 10115 Berlin, Germany;
- Faculty of Biology, Evolutionary Biology, Ludwig-Maximilian University of Munich, 82152 Planegg-Martinsried, Germany
| |
Collapse
|
9
|
Calatayud S, Garcia-Risco M, Capdevila M, Cañestro C, Palacios Ò, Albalat R. Modular Evolution and Population Variability of Oikopleura dioica Metallothioneins. Front Cell Dev Biol 2021; 9:702688. [PMID: 34277643 PMCID: PMC8283569 DOI: 10.3389/fcell.2021.702688] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 06/09/2021] [Indexed: 01/29/2023] Open
Abstract
Chordate Oikopleura dioica probably is the fastest evolving metazoan reported so far, and thereby, a suitable system in which to explore the limits of evolutionary processes. For this reason, and in order to gain new insights on the evolution of protein modularity, we have investigated the organization, function and evolution of multi-modular metallothionein (MT) proteins in O. dioica. MTs are a heterogeneous group of modular proteins defined by their cysteine (C)-rich domains, which confer the capacity of coordinating different transition metal ions. O. dioica has two MTs, a bi-modular OdiMT1 consisting of two domains (t-12C and 12C), and a multi-modular OdiMT2 with six t-12C/12C repeats. By means of mass spectrometry and spectroscopy of metal-protein complexes, we have shown that the 12C domain is able to autonomously bind four divalent metal ions, although the t-12C/12C pair –as it is found in OdiMT1– is the optimized unit for divalent metal binding. We have also shown a direct relationship between the number of the t-12C/12C repeats and the metal-binding capacity of the MTs, which means a stepwise mode of functional and structural evolution for OdiMT2. Finally, after analyzing four different O. dioica populations worldwide distributed, we have detected several OdiMT2 variants with changes in their number of t-12C/12C domain repeats. This finding reveals that the number of repeats fluctuates between current O. dioica populations, which provides a new perspective on the evolution of domain repeat proteins.
Collapse
Affiliation(s)
- Sara Calatayud
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Mario Garcia-Risco
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Mercè Capdevila
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Cristian Cañestro
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Òscar Palacios
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Ricard Albalat
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| |
Collapse
|
10
|
Calatayud S, Garcia-Risco M, Pedrini-Martha V, Eernisse DJ, Dallinger R, Palacios Ò, Capdevila M, Albalat R. Modularity in Protein Evolution: Modular Organization and De Novo Domain Evolution in Mollusk Metallothioneins. Mol Biol Evol 2021; 38:424-436. [PMID: 32915992 PMCID: PMC7826182 DOI: 10.1093/molbev/msaa230] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Metallothioneins (MTs) are proteins devoted to the control of metal homeostasis and detoxification, and therefore, MTs have been crucial for the adaptation of the living beings to variable situations of metal bioavailability. The evolution of MTs is, however, not yet fully understood, and to provide new insights into it, we have investigated the MTs in the diverse classes of Mollusks. We have shown that most molluskan MTs are bimodular proteins that combine six domains—α, β1, β2, β3, γ, and δ—in a lineage-specific manner. We have functionally characterized the Neritimorpha β3β1 and the Patellogastropoda γβ1 MTs, demonstrating the metal-binding capacity of the new γ domain. Our results have revealed a modular organization of mollusk MT, whose evolution has been impacted by duplication, loss, and de novo emergence of domains. MTs represent a paradigmatic example of modular evolution probably driven by the structural and functional requirements of metal binding.
Collapse
Affiliation(s)
- Sara Calatayud
- Departament de Genètica, Microbiologia i Estadística and Institut de Recerca de la Biodiversitat (IRBio), Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Mario Garcia-Risco
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Veronika Pedrini-Martha
- Department of Zoology and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Douglas J Eernisse
- Department of Biological Science, California State University Fullerton, Fullerton, CA
| | - Reinhard Dallinger
- Department of Zoology and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Òscar Palacios
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Mercè Capdevila
- Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Ricard Albalat
- Departament de Genètica, Microbiologia i Estadística and Institut de Recerca de la Biodiversitat (IRBio), Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| |
Collapse
|
11
|
Clifton BD, Jimenez J, Kimura A, Chahine Z, Librado P, Sánchez-Gracia A, Abbassi M, Carranza F, Chan C, Marchetti M, Zhang W, Shi M, Vu C, Yeh S, Fanti L, Xia XQ, Rozas J, Ranz JM. Understanding the Early Evolutionary Stages of a Tandem Drosophilamelanogaster-Specific Gene Family: A Structural and Functional Population Study. Mol Biol Evol 2021; 37:2584-2600. [PMID: 32359138 PMCID: PMC7475035 DOI: 10.1093/molbev/msaa109] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Gene families underlie genetic innovation and phenotypic diversification. However, our understanding of the early genomic and functional evolution of tandemly arranged gene families remains incomplete as paralog sequence similarity hinders their accurate characterization. The Drosophila melanogaster-specific gene family Sdic is tandemly repeated and impacts sperm competition. We scrutinized Sdic in 20 geographically diverse populations using reference-quality genome assemblies, read-depth methodologies, and qPCR, finding that ∼90% of the individuals harbor 3-7 copies as well as evidence of population differentiation. In strains with reliable gene annotations, copy number variation (CNV) and differential transposable element insertions distinguish one structurally distinct version of the Sdic region per strain. All 31 annotated copies featured protein-coding potential and, based on the protein variant encoded, were categorized into 13 paratypes differing in their 3' ends, with 3-5 paratypes coexisting in any strain examined. Despite widespread gene conversion, the only copy present in all strains has functionally diverged at both coding and regulatory levels under positive selection. Contrary to artificial tandem duplications of the Sdic region that resulted in increased male expression, CNV in cosmopolitan strains did not correlate with expression levels, likely as a result of differential genome modifier composition. Duplicating the region did not enhance sperm competitiveness, suggesting a fitness cost at high expression levels or a plateau effect. Beyond facilitating a minimally optimal expression level, Sdic CNV acts as a catalyst of protein and regulatory diversity, showcasing a possible evolutionary path recently formed tandem multigene families can follow toward long-term consolidation in eukaryotic genomes.
Collapse
Affiliation(s)
- Bryan D Clifton
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Jamie Jimenez
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Ashlyn Kimura
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Zeinab Chahine
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Pablo Librado
- Laboratoire AMIS CNRS UMR 5288, Faculté de Médicine de Purpan, Université Paul Sabatier, Toulouse, France
| | - Alejandro Sánchez-Gracia
- Departament de Genètica, Microbiologia i Estadistica, Universitat de Barcelona, Barcelona, Spain.,Institut de Recerca de la Biodiversitat, Universitat de Barcelona, Barcelona, Spain
| | - Mashya Abbassi
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Francisco Carranza
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Carolus Chan
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Marcella Marchetti
- Istituto Pasteur Italia, Fondazione Cenci-Bolognetti, Rome, Italy.,Department of Biology and Biotechnology "C. Darwin", Sapienza University of Rome, Rome, Italy
| | - Wanting Zhang
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei Province, China
| | - Mijuan Shi
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei Province, China
| | - Christine Vu
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| | - Shudan Yeh
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA.,Department of Life Sciences, National Central University, Taoyuan City, Zhongli District, Taiwan
| | - Laura Fanti
- Istituto Pasteur Italia, Fondazione Cenci-Bolognetti, Rome, Italy.,Department of Biology and Biotechnology "C. Darwin", Sapienza University of Rome, Rome, Italy
| | - Xiao-Qin Xia
- Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei Province, China
| | - Julio Rozas
- Departament de Genètica, Microbiologia i Estadistica, Universitat de Barcelona, Barcelona, Spain.,Institut de Recerca de la Biodiversitat, Universitat de Barcelona, Barcelona, Spain
| | - José M Ranz
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA
| |
Collapse
|
12
|
Stephan W, John S. Polygenic Adaptation in a Population of Finite Size. ENTROPY 2020; 22:e22080907. [PMID: 33286676 PMCID: PMC7517530 DOI: 10.3390/e22080907] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Revised: 07/31/2020] [Accepted: 08/15/2020] [Indexed: 12/15/2022]
Abstract
Polygenic adaptation in response to selection on quantitative traits has become an important topic in evolutionary biology. Here we review the recent literature on models of polygenic adaptation. In particular, we focus on a model that includes mutation and both directional and stabilizing selection on a highly polygenic trait in a population of finite size (thus experiencing random genetic drift). Assuming that a sudden environmental shift of the fitness optimum occurs while the population is in a stochastic equilibrium, we analyze the adaptation of the trait to the new optimum. When the shift is not too large relative to the equilibrium genetic variance and this variance is determined by loci with mostly small effects, the approach of the mean phenotype to the optimum can be approximated by a rapid exponential process (whose rate is proportional to the genetic variance). During this rapid phase the underlying changes to allele frequencies, however, may depend strongly on genetic drift. While trait-increasing alleles with intermediate equilibrium frequencies are dominated by selection and contribute positively to changes of the trait mean (i.e., are aligned with the direction of the optimum shift), alleles with low or high equilibrium frequencies show more of a random dynamics, which is expected when drift is dominating. A strong effect of drift is also predicted for population size bottlenecks. Our simulations show that the presence of a bottleneck results in a larger deviation of the population mean of the trait from the fitness optimum, which suggests that more loci experience the influence of drift.
Collapse
Affiliation(s)
- Wolfgang Stephan
- Leibniz Institute for Evolution and Biodiversity Science, Natural History Museum, 10115 Berlin, Germany;
| | - Sona John
- Department of Life Science Systems, Technical University of Munich, 85354 Freising, Germany
- Correspondence:
| |
Collapse
|
13
|
Barua A, Mikheyev AS. Many Options, Few Solutions: Over 60 My Snakes Converged on a Few Optimal Venom Formulations. Mol Biol Evol 2020; 36:1964-1974. [PMID: 31220860 PMCID: PMC6736290 DOI: 10.1093/molbev/msz125] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Gene expression changes contribute to complex trait variations in both individuals and populations. However, the evolution of gene expression underlying complex traits over macroevolutionary timescales remains poorly understood. Snake venoms are proteinaceous cocktails where the expression of each toxin can be quantified and mapped to a distinct genomic locus and traced for millions of years. Using a phylogenetic generalized linear mixed model, we analyzed expression data of toxin genes from 52 snake species spanning the 3 venomous snake families and estimated phylogenetic covariance, which acts as a measure of evolutionary constraint. We find that evolution of toxin combinations is not constrained. However, although all combinations are in principle possible, the actual dimensionality of phylomorphic space is low, with envenomation strategies focused around only four major toxin families: metalloproteases, three-finger toxins, serine proteases, and phospholipases A2. Although most extant snakes prioritize either a single or a combination of major toxin families, they are repeatedly recruited and lost. We find that over macroevolutionary timescales, the venom phenotypes were not shaped by phylogenetic constraints, which include important microevolutionary constraints such as epistasis and pleiotropy, but more likely by ecological filtering that permits a small number of optimal solutions. As a result, phenotypic optima were repeatedly attained by distantly related species. These results indicate that venoms evolve by selection on biochemistry of prey envenomation, which permit diversity through parallelism, and impose strong limits, since only a few of the theoretically possible strategies seem to work well and are observed in extant snakes.
Collapse
Affiliation(s)
- Agneesh Barua
- Okinawa Institute of Science and Technology Graduate University, Onna, Japan
| | - Alexander S Mikheyev
- Okinawa Institute of Science and Technology Graduate University, Onna, Japan.,Evolutionary Genomics Research Group, Ecology and Evolution Unit, Australian National University, Canberra, Australia
| |
Collapse
|
14
|
Koropoulis A, Alachiotis N, Pavlidis P. Detecting Positive Selection in Populations Using Genetic Data. Methods Mol Biol 2020; 2090:87-123. [PMID: 31975165 DOI: 10.1007/978-1-0716-0199-0_5] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
High-throughput genomic sequencing allows to disentangle the evolutionary forces acting in populations. Among evolutionary forces, positive selection has received a lot of attention because it is related to the adaptation of populations in their environments, both biotic and abiotic. Positive selection, also known as Darwinian selection, occurs when an allele is favored by natural selection. The frequency of the favored allele increases in the population and, due to genetic hitchhiking, neighboring linked variation diminishes, creating so-called selective sweeps. Such a process leaves traces in genomes that can be detected in a future time point. Detecting traces of positive selection in genomes is achieved by searching for signatures introduced by selective sweeps, such as regions of reduced variation, a specific shift of the site frequency spectrum, and particular linkage disequilibrium (LD) patterns in the region. A variety of approaches can be used for detecting selective sweeps, ranging from simple implementations that compute summary statistics to more advanced statistical approaches, e.g., Bayesian approaches, maximum-likelihood-based methods, and machine learning methods. In this chapter, we discuss selective sweep detection methodologies on the basis of their capacity to analyze whole genomes or just subgenomic regions, and on the specific polymorphism patterns they exploit as selective sweep signatures. We also summarize the results of comparisons among five open-source software releases (SweeD, SweepFinder, SweepFinder2, OmegaPlus, and RAiSD) regarding sensitivity, specificity, and execution times. Furthermore, we test and discuss machine learning methods and present a thorough performance analysis. In equilibrium neutral models or mild bottlenecks, most methods are able to detect selective sweeps accurately. Methods and tools that rely on linkage disequilibrium (LD) rather than single SNPs exhibit higher true positive rates than the site frequency spectrum (SFS)-based methods under the model of a single sweep or recurrent hitchhiking. However, their false positive rate is elevated when a misspecified demographic model is used to build the distribution of the statistic under the null hypothesis. Both LD and SFS-based approaches suffer from decreased accuracy on localizing the true target of selection in bottleneck scenarios. Furthermore, we present an extensive analysis of the effects of gene flow on selective sweep detection, a problem that has been understudied in selective sweep literature.
Collapse
Affiliation(s)
- Angelos Koropoulis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
- Computer Science Department, University of Crete, Crete, Heraklion, Greece
| | - Nikolaos Alachiotis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece.
| |
Collapse
|
15
|
Three Quantitative Trait Loci Explain More than 60% of Variation for Chill Coma Recovery Time in a Natural Population of Drosophila ananassae. G3-GENES GENOMES GENETICS 2019; 9:3715-3725. [PMID: 31690597 PMCID: PMC6829138 DOI: 10.1534/g3.119.400453] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Ectothermic species such as insects are particularly vulnerable to climatic fluctuations. Nevertheless, many insects that evolved and diversified in the tropics have successfully colonized temperate regions all over the globe. To shed light on the genetic basis of cold tolerance in such species, we conducted a quantitative trait locus (QTL) mapping experiment for chill coma recovery time (CCRT) in Drosophila ananassae, a cosmopolitan species that has expanded its range from tropical to temperate regions. We created a mapping population of recombinant inbred advanced intercross lines (RIAILs) from two founder strains with diverging CCRT phenotypes. The RIAILs were phenotyped for their CCRT and, together with the founder strains, genotyped for polymorphic markers with double-digest restriction site-associated DNA (ddRAD) sequencing. Using a hierarchical mapping approach that combined standard interval mapping and a multiple-QTL model, we mapped three QTL which altogether explained 64% of the phenotypic variance. For two of the identified QTL, we found evidence of epistasis. To narrow down the list of cold tolerance candidate genes, we cross-referenced the QTL intervals with genes that we previously identified as differentially expressed in response to cold in D. ananassae, and with thermotolerance candidate genes of D. melanogaster. Among the 58 differentially expressed genes that were contained within the QTL, GF15058 showed a significant interaction of the CCRT phenotype and gene expression. Further, we identified the orthologs of four D. melanogaster thermotolerance candidate genes, MtnA, klarsicht, CG5246 (D.ana/GF17132) and CG10383 (D.ana/GF14829) as candidates for cold tolerance in D. ananassae.
Collapse
|
16
|
Catalán A, Briscoe AD, Höhna S. Drift and Directional Selection Are the Evolutionary Forces Driving Gene Expression Divergence in Eye and Brain Tissue of Heliconius Butterflies. Genetics 2019; 213:581-594. [PMID: 31467133 PMCID: PMC6781903 DOI: 10.1534/genetics.119.302493] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 08/24/2019] [Indexed: 01/05/2023] Open
Abstract
Investigating gene expression evolution over micro- and macroevolutionary timescales will expand our understanding of the role of gene expression in adaptation and speciation. In this study, we characterized the evolutionary forces acting on gene expression levels in eye and brain tissue of five Heliconius butterflies with divergence times of ∼5-12 MYA. We developed and applied Brownian motion (BM) and Ornstein-Uhlenbeck (OU) models to identify genes whose expression levels are evolving through drift, stabilizing selection, or a lineage-specific shift. We found that 81% of the genes evolve under genetic drift. When testing for branch-specific shifts in gene expression, we detected 368 (16%) shift events. Genes showing a shift toward upregulation have significantly lower gene expression variance than those genes showing a shift leading toward downregulation. We hypothesize that directional selection is acting in shifts causing upregulation, since transcription is costly. We further uncovered through simulations that parameter estimation of OU models is biased when using small phylogenies and only becomes reliable with phylogenies having ≥ 50 taxa. Therefore, we developed a new statistical test based on BM to identify highly conserved genes (i.e., evolving under strong stabilizing selection), which comprised 3% of the orthoclusters. In conclusion, we found that drift is the dominant evolutionary force driving gene expression evolution in eye and brain tissue in Heliconius Nevertheless, the higher proportion of genes evolving under directional than under stabilizing selection might reflect species-specific selective pressures on vision and the brain that are necessary to fulfill species-specific requirements.
Collapse
Affiliation(s)
- Ana Catalán
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, 75236, Sweden
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
| | - Adriana D Briscoe
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697
| | - Sebastian Höhna
- Division of Evolutionary Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
- Department of Earth and Environmental Sciences, Paleontology and Geobiology, 80333 Munich, Germany
- GeoBio-Center, Ludwig-Maximilians-Universität München, 80333 Munich, Germany
| |
Collapse
|
17
|
Durmaz E, Rajpurohit S, Betancourt N, Fabian DK, Kapun M, Schmidt P, Flatt T. A clinal polymorphism in the insulin signaling transcription factor foxo contributes to life-history adaptation in Drosophila. Evolution 2019; 73:1774-1792. [PMID: 31111462 PMCID: PMC6771989 DOI: 10.1111/evo.13759] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Revised: 05/04/2019] [Accepted: 05/06/2019] [Indexed: 12/11/2022]
Abstract
A fundamental aim of adaptation genomics is to identify polymorphisms that underpin variation in fitness traits. In Drosophila melanogaster, latitudinal life-history clines exist on multiple continents and make an excellent system for dissecting the genetics of adaptation. We have previously identified numerous clinal single-nucleotide polymorphism in insulin/insulin-like growth factor signaling (IIS), a pathway known from mutant studies to affect life history. However, the effects of natural variants in this pathway remain poorly understood. Here we investigate how two clinal alternative alleles at foxo, a transcriptional effector of IIS, affect fitness components (viability, size, starvation resistance, fat content). We assessed this polymorphism from the North American cline by reconstituting outbred populations, fixed for either the low- or high-latitude allele, from inbred DGRP lines. Because diet and temperature modulate IIS, we phenotyped alleles across two temperatures (18°C, 25°C) and two diets differing in sugar source and content. Consistent with clinal expectations, the high-latitude allele conferred larger body size and reduced wing loading. Alleles also differed in starvation resistance and expression of insulin-like receptor, a transcriptional target of FOXO. Allelic reaction norms were mostly parallel, with few GxE interactions. Together, our results suggest that variation in IIS makes a major contribution to clinal life-history adaptation.
Collapse
Affiliation(s)
- Esra Durmaz
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| | - Subhash Rajpurohit
- Department of BiologyUniversity of PennsylvaniaPhiladelphiaPennsylvania19140
- Division of Biological and Life SciencesAhmedabad UniversityAhmedabadIndia
| | - Nicolas Betancourt
- Department of BiologyUniversity of PennsylvaniaPhiladelphiaPennsylvania19140
| | - Daniel K. Fabian
- European Molecular Biology LaboratoryEuropean Bioinformatics InstituteWellcome Genome Campus, HinxtonCambridgeUnited Kingdom
- Institut für PopulationsgenetikVetmeduni ViennaViennaAustria
- Vienna Graduate School of Population, GeneticsViennaAustria
| | - Martin Kapun
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| | - Paul Schmidt
- Department of BiologyUniversity of PennsylvaniaPhiladelphiaPennsylvania19140
| | - Thomas Flatt
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| |
Collapse
|
18
|
Buchberger E, Reis M, Lu TH, Posnien N. Cloudy with a Chance of Insights: Context Dependent Gene Regulation and Implications for Evolutionary Studies. Genes (Basel) 2019; 10:E492. [PMID: 31261769 PMCID: PMC6678813 DOI: 10.3390/genes10070492] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Revised: 06/20/2019] [Accepted: 06/26/2019] [Indexed: 12/20/2022] Open
Abstract
Research in various fields of evolutionary biology has shown that divergence in gene expression is a key driver for phenotypic evolution. An exceptional contribution of cis-regulatory divergence has been found to contribute to morphological diversification. In the light of these findings, the analysis of genome-wide expression data has become one of the central tools to link genotype and phenotype information on a more mechanistic level. However, in many studies, especially if general conclusions are drawn from such data, a key feature of gene regulation is often neglected. With our article, we want to raise awareness that gene regulation and thus gene expression is highly context dependent. Genes show tissue- and stage-specific expression. We argue that the regulatory context must be considered in comparative expression studies.
Collapse
Affiliation(s)
- Elisa Buchberger
- University Göttingen, Göttingen Center for Molecular Biosciences (GZMB), Dpt. of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany.
| | - Micael Reis
- University Göttingen, Göttingen Center for Molecular Biosciences (GZMB), Dpt. of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany.
| | - Ting-Hsuan Lu
- University Göttingen, Göttingen Center for Molecular Biosciences (GZMB), Dpt. of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany.
- International Max Planck Research School for Genome Science, Am Fassberg 11, 37077 Göttingen, Germany.
| | - Nico Posnien
- University Göttingen, Göttingen Center for Molecular Biosciences (GZMB), Dpt. of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany.
| |
Collapse
|
19
|
Jiang J, Liu L, Gao Y, Shi L, Li Y, Liang W, Sun D. Determination of genetic associations between indels in 11 candidate genes and milk composition traits in Chinese Holstein population. BMC Genet 2019; 20:48. [PMID: 31138106 PMCID: PMC6537361 DOI: 10.1186/s12863-019-0751-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 05/20/2019] [Indexed: 01/20/2023] Open
Abstract
Background We have previously identified 11 promising candidate genes for milk composition traits by resequencing the whole genomes of 8 Holstein bulls with extremely high and low estimated breeding values for milk protein and fat percentages (high and low groups), including FCGR2B, CENPE, RETSAT, ACSBG2, NFKB2, TBC1D1, NLK, MAP3K1, SLC30A2, ANGPT1 and UGDH those contained 25 indels between high and low groups. In this study, the purpose was to further examine whether these candidates have significant genetic effects on milk protein and fat traits. Results With PCR product sequencing, 13 indels identified by whole genome resequencing were successfully genotyped. With association analysis in 769 Chinese Holstein cows, we found that the indel in FCGR2B was significantly associated with milk yield, protein yield and protein percentage (P = 0.0041 to 0.0297); five indels in CENPE and one indel in MAP3K1 were markedly relevant to milk yield, fat yield and protein yield (P < 0.0001 to 0.0073); polymorphism in RETSAT was evidently associated with milk yield, fat yield, protein yield and protein percentage (P = 0.0001 to 0.0237); variant in ACSBG2 affected fat yield and protein percentage (P = 0.0088 and 0.0052); one indel in TBC1D1 was with respect to fat percentage and protein percentage (P = 0.0224 and 0.0209). Significant associations were shown between indels in NLK and protein yield and protein percentage (P = 0.0012 to 0.0257); variant in UGDH was related to the milk yield (P = 0.0312). The two exonic indels in FCGR2B and CENPE were predicted to change the mRNA and protein secondary structures, and resulted in the corresponding protein dysfunction. Conclusion Our findings presented here provide the first evidence for the associations of eight functional genes with milk yield and composition traits in dairy cattle. Electronic supplementary material The online version of this article (10.1186/s12863-019-0751-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jianping Jiang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China.,College of Animal Science and Technology, Guangxi University, Nanning, 530004, China
| | - Lin Liu
- Beijing Dairy Cattle Center, Beijing, 100085, China
| | - Yahui Gao
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China
| | - Lijun Shi
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China
| | - Yanhua Li
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China.,Beijing Dairy Cattle Center, Beijing, 100085, China
| | - Weijun Liang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China
| | - Dongxiao Sun
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, China Agricultural University, 2 Yuanmingyuan West Road, Beijing, 100193, China.
| |
Collapse
|
20
|
Abstract
For almost 20 years, many inference methods have been developed to detect selective sweeps and localize the targets of directional selection in the genome. These methods are based on population genetic models that describe the effect of a beneficial allele (e.g., a new mutation) on linked neutral variation (driven by directional selection from a single copy to fixation). Here, I discuss these models, ranging from selective sweeps in a panmictic population of constant size to evolutionary traffic when simultaneous sweeps at multiple loci interfere, and emphasize the important role of demography and population structure in data analysis. In the past 10 years, soft sweeps that may arise after an environmental change from directional selection on standing variation have become a focus of population genetic research. In contrast to selective sweeps, they are caused by beneficial alleles that were neutrally segregating in a population before the environmental change or were present at a mutation-selection balance in appreciable frequency.
Collapse
|
21
|
Ramnarine TJS, Glaser-Schmitt A, Catalán A, Parsch J. Population Genetic and Functional Analysis of a cis-Regulatory Polymorphism in the DrosophilamelanogasterMetallothionein A gene. Genes (Basel) 2019; 10:E147. [PMID: 30769915 PMCID: PMC6410240 DOI: 10.3390/genes10020147] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Revised: 02/08/2019] [Accepted: 02/08/2019] [Indexed: 12/24/2022] Open
Abstract
Although gene expression can vary extensively within and among populations, the genetic basis of this variation and the evolutionary forces that maintain it are largely unknown. In Drosophilamelanogaster, a 49-bp insertion/deletion (indel) polymorphism in the Metallothionein A (MtnA) gene is associated with variation in MtnA expression and oxidative stress tolerance. To better understand the functional and evolutionary significance of this polymorphism, we investigated it in several worldwide populations. In a German population, the deletion was present at a high and stable frequency over multiple seasons and years, and was associated with increased MtnA expression. There was, however, no evidence that the polymorphism was maintained by overdominant, seasonally fluctuating, or sexually antagonistic selection. The deletion was rare in a population from the species' ancestral range in sub-Saharan Africa and is likely the result of non-African admixture, suggesting that it spread to high frequency following the species' out-of-Africa expansion. Using data from a North American population, we found that the deletion was associated with MtnA expression and tolerance to oxidative stress induced by menadione sodium bisulfite. Our results are consistent with the deletion being selectively favored in temperate populations due to the increased MtnA expression and oxidative stress tolerance that it confers.
Collapse
Affiliation(s)
- Timothy J S Ramnarine
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Grosshaderner Str. 2, 82152 Planegg-Martinsried, Germany.
| | - Amanda Glaser-Schmitt
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Grosshaderner Str. 2, 82152 Planegg-Martinsried, Germany.
| | - Ana Catalán
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Grosshaderner Str. 2, 82152 Planegg-Martinsried, Germany.
- Evolutionary Biology Centre (EBC), Uppsala University, Norbyvägen 14-18 75236, Uppsala, Sweden.
| | - John Parsch
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Grosshaderner Str. 2, 82152 Planegg-Martinsried, Germany.
| |
Collapse
|
22
|
Functional Analysis of a Putative Target of Spatially Varying Selection in the Menin1 Gene of Drosophila melanogaster. G3-GENES GENOMES GENETICS 2019; 9:73-80. [PMID: 30404774 PMCID: PMC6325912 DOI: 10.1534/g3.118.200818] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While significant effort has been devoted to investigating the potential influence of spatially varying selection on genomic variation, relatively little effort has been devoted to experimental analysis of putative variants or genes experiencing such selection. Previous population genetic work identified an amino acid polymorphism in the Mnn1 gene as one of the most strongly latitudinally differentiated SNPs in the genome of Drosophila melanogaster in the United States and Australia. Here we report the results of our transgenic analysis of this amino acid polymorphism. Genotypes carrying alternative Mnn1 alleles differed in multiple phenotypes in a direction generally consistent with phenotypic differences previously observed along latitudinal clines. These results support inferences from earlier population genomic work that this variant influences fitness, and support the idea that the alleles exhibiting clines may be likely to have pleiotropic effects that are correlated along the axes favored by natural selection.
Collapse
|
23
|
Gene Regulatory Variation in Drosophila melanogaster Renal Tissue. Genetics 2018; 210:287-301. [PMID: 29976765 DOI: 10.1534/genetics.118.301073] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 07/03/2018] [Indexed: 12/13/2022] Open
Abstract
Genetic variation influencing levels of gene expression is abundant in natural populations, and may exert its effects through complex mechanisms that depend on an organism's genetic background and the tissue in which expression is measured. We investigated natural variation in gene expression in the Malpighian tubules of three inbred Drosophila melanogaster strains and their F1 hybrids. One of the strains was from a population in the species' ancestral range (Zambia), while the other two were from a more recently derived population (Sweden). Although closely related, the two Swedish strains differed greatly in terms of their expression inheritance when hybridized with the Zambian strain, with one Swedish strain showing a large excess of genes with recessive expression inheritance, as well as a large number of genes with overdominant inheritance. Although most expression variation could be attributed to trans-regulation, there were ∼200 genes that showed allele-specific expression differences in each of the between-population hybrids, indicating that cis-regulation contributes as well. The cis-regulated genes were enriched with cytochrome P450 genes, and the upstream regions of six of these genes were incorporated into transgenic reporter gene constructs to test their effects on expression. Differential expression was observed for five of the six reporter genes in the Malpighian tubule, suggesting that a large proportion of cis-regulatory variation lies directly upstream of the affected gene. In most cases, the differential expression was specific to the Malpighian tubule or greater in this tissue than in the rest of the body, highlighting the importance of single-tissue studies of gene expression variation.
Collapse
|
24
|
Croset V, Treiber CD, Waddell S. Cellular diversity in the Drosophila midbrain revealed by single-cell transcriptomics. eLife 2018; 7:34550. [PMID: 29671739 PMCID: PMC5927767 DOI: 10.7554/elife.34550] [Citation(s) in RCA: 173] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Accepted: 04/18/2018] [Indexed: 12/12/2022] Open
Abstract
To understand the brain, molecular details need to be overlaid onto neural wiring diagrams so that synaptic mode, neuromodulation and critical signaling operations can be considered. Single-cell transcriptomics provide a unique opportunity to collect this information. Here we present an initial analysis of thousands of individual cells from Drosophila midbrain, that were acquired using Drop-Seq. A number of approaches permitted the assignment of transcriptional profiles to several major brain regions and cell-types. Expression of biosynthetic enzymes and reuptake mechanisms allows all the neurons to be typed according to the neurotransmitter or neuromodulator that they produce and presumably release. Some neuropeptides are preferentially co-expressed in neurons using a particular fast-acting transmitter, or monoamine. Neuromodulatory and neurotransmitter receptor subunit expression illustrates the potential of these molecules in generating complexity in neural circuit function. This cell atlas dataset provides an important resource to link molecular operations to brain regions and complex neural processes.
Collapse
Affiliation(s)
- Vincent Croset
- Centre for Neural Circuits and Behaviour, The University of Oxford, Oxford, United Kingdom
| | - Christoph D Treiber
- Centre for Neural Circuits and Behaviour, The University of Oxford, Oxford, United Kingdom
| | - Scott Waddell
- Centre for Neural Circuits and Behaviour, The University of Oxford, Oxford, United Kingdom
| |
Collapse
|
25
|
Rybina OY, Rozovsky YM, Veselkina ER, Pasyukova EG. Polycomb/Trithorax group-dependent regulation of the neuronal gene Lim3 involved in Drosophila lifespan control. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2018; 1861:451-462. [PMID: 29555581 DOI: 10.1016/j.bbagrm.2018.03.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 02/18/2018] [Accepted: 03/14/2018] [Indexed: 12/14/2022]
Abstract
Molecular mechanisms governing gene expression and defining complex phenotypes are central to understanding the basics of development and aging. Here, we demonstrate that naturally occurring polymorphisms of the Lim3 regulatory region that are associated with variation in gene expression and Drosophila lifespan control are located exclusively in the Polycomb response element (PRE). We find that the Polycomb group (PcG) protein Polycomb (PC) is bound to the PRE only in embryos where Lim3 is present in both repressed and active states. In contrast, the Trithorax group (TrxG) protein absent, small, or homeotic discs 1 (ASH1) is bound downstream of the PRE, to a region adjacent to the Lim3 transcription start site in embryos and adult flies, in which Lim3 is in an active state. Furthermore, mutations in Pc and ash1 genes affect Lim3 expression depending on the structural integrity of the Lim3 PRE, thus confirming functional interactions between these proteins and Lim3 regulatory region. In addition, we demonstrate that the evolutionary conserved Lim3 core promoter provides basic Lim3 expression, whereas structural changes in the Lim3 PRE of distal promoter provide stage-, and tissue-specific Lim3 expression. Therefore, we hypothesize that PcG/TrxG proteins, which are directly involved in Lim3 transcription regulation, participate in lifespan control.
Collapse
Affiliation(s)
- Olga Y Rybina
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia; Federal State-Financed Educational Institution of Higher Professional Education, Moscow State Pedagogical University, M. Pirogovskaya Str. 1/1, Moscow 119991, Russia.
| | - Yakov M Rozovsky
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| | - Ekaterina R Veselkina
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| | - Elena G Pasyukova
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| |
Collapse
|
26
|
Glaser-Schmitt A, Parsch J. Functional characterization of adaptive variation within a cis-regulatory element influencing Drosophila melanogaster growth. PLoS Biol 2018; 16:e2004538. [PMID: 29324742 PMCID: PMC5783415 DOI: 10.1371/journal.pbio.2004538] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2017] [Revised: 01/24/2018] [Accepted: 12/18/2017] [Indexed: 11/18/2022] Open
Abstract
Gene expression variation is a major contributor to phenotypic diversity within species and is thought to play an important role in adaptation. However, examples of adaptive regulatory polymorphism are rare, especially those that have been characterized at both the molecular genetic level and the organismal level. In this study, we perform a functional analysis of the Drosophila melanogaster CG9509 enhancer, a cis-regulatory element that shows evidence of adaptive evolution in populations outside the species’ ancestral range in sub-Saharan Africa. Using site-directed mutagenesis and transgenic reporter gene assays, we determined that 3 single nucleotide polymorphisms are responsible for the difference in CG9509 expression that is observed between sub-Saharan African and cosmopolitan populations. Interestingly, while 2 of these variants appear to have been the targets of a selective sweep outside of sub-Saharan Africa, the variant with the largest effect on expression remains polymorphic in cosmopolitan populations, suggesting it may be subject to a different mode of selection. To elucidate the function of CG9509, we performed a series of functional and tolerance assays on flies in which CG9509 expression was disrupted. We found that CG9509 plays a role in larval growth and influences adult body and wing size, as well as wing loading. Furthermore, variation in several of these traits was associated with variation within the CG9509 enhancer. The effect on growth appears to result from a modulation of active ecdysone levels and expression of growth factors. Taken together, our findings suggest that selection acted on 3 sites within the CG9509 enhancer to increase CG9509 expression and, as a result, reduce wing loading as D. melanogaster expanded out of sub-Saharan Africa. Much of the phenotypic variation that is observed within species is thought to be caused by variation in gene expression. Variants within cis-regulatory elements, which affect the expression of nearby genes within the same DNA strand, are thought to be an abundant resource upon which natural selection can act. Understanding the functional consequences of adaptive cis-regulatory changes is important, as it can help elucidate the mechanisms underlying phenotypic evolution in general and provide insight into the development and maintenance of biodiversity. However, functional analyses of these types of changes remain rare. Here we present a functional analysis of an adaptively evolving enhancer element of a D. melanogaster gene called CG9509, of previously unknown function. We show that 3 single nucleotide polymorphisms located within the enhancer of this gene are responsible for an increase in CG9509 expression in cosmopolitan populations (outside of south and central Africa) relative to sub-Saharan populations, which include ancestral populations. We further show that CG9509 is involved in the regulation of growth rate and body size determination and propose that the CG9509 enhancer underwent positive selection to reduce wing loading as the species expanded out of sub-Saharan Africa.
Collapse
Affiliation(s)
- Amanda Glaser-Schmitt
- Faculty of Biology, Ludwig-Maximilians-Universität München, Munich, Germany
- * E-mail: (AGS); (JP)
| | - John Parsch
- Faculty of Biology, Ludwig-Maximilians-Universität München, Munich, Germany
- * E-mail: (AGS); (JP)
| |
Collapse
|
27
|
Macias-Muñoz A, McCulloch KJ, Briscoe AD. Copy Number Variation and Expression Analysis Reveals a Nonorthologous Pinta Gene Family Member Involved in Butterfly Vision. Genome Biol Evol 2017; 9:3398-3412. [PMID: 29136137 PMCID: PMC5739039 DOI: 10.1093/gbe/evx230] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/08/2017] [Indexed: 02/06/2023] Open
Abstract
Vertebrate (cellular retinaldehyde-binding protein) and Drosophila (prolonged depolarization afterpotential is not apparent [PINTA]) proteins with a CRAL-TRIO domain transport retinal-based chromophores that bind to opsin proteins and are necessary for phototransduction. The CRAL-TRIO domain gene family is composed of genes that encode proteins with a common N-terminal structural domain. Although there is an expansion of this gene family in Lepidoptera, there is no lepidopteran ortholog of pinta. Further, the function of these genes in lepidopterans has not yet been established. Here, we explored the molecular evolution and expression of CRAL-TRIO domain genes in the butterfly Heliconius melpomene in order to identify a member of this gene family as a candidate chromophore transporter. We generated and searched a four tissue transcriptome and searched a reference genome for CRAL-TRIO domain genes. We expanded an insect CRAL-TRIO domain gene phylogeny to include H. melpomene and used 18 genomes from 4 subspecies to assess copy number variation. A transcriptome-wide differential expression analysis comparing four tissue types identified a CRAL-TRIO domain gene, Hme CTD31, upregulated in heads suggesting a potential role in vision for this CRAL-TRIO domain gene. RT-PCR and immunohistochemistry confirmed that Hme CTD31 and its protein product are expressed in the retina, specifically in primary and secondary pigment cells and in tracheal cells. Sequencing of eye protein extracts that fluoresce in the ultraviolet identified Hme CTD31 as a possible chromophore binding protein. Although we found several recent duplications and numerous copy number variants in CRAL-TRIO domain genes, we identified a single copy pinta paralog that likely binds the chromophore in butterflies.
Collapse
Affiliation(s)
- Aide Macias-Muñoz
- Department of Ecology and Evolutionary Biology, University of California, Irvine
| | - Kyle J McCulloch
- Department of Ecology and Evolutionary Biology, University of California, Irvine.,FAS Center for Systems Biology, Harvard University
| | - Adriana D Briscoe
- Department of Ecology and Evolutionary Biology, University of California, Irvine
| |
Collapse
|
28
|
Yablonovitch AL, Fu J, Li K, Mahato S, Kang L, Rashkovetsky E, Korol AB, Tang H, Michalak P, Zelhof AC, Nevo E, Li JB. Regulation of gene expression and RNA editing in Drosophila adapting to divergent microclimates. Nat Commun 2017; 8:1570. [PMID: 29146998 PMCID: PMC5691062 DOI: 10.1038/s41467-017-01658-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 10/05/2017] [Indexed: 12/30/2022] Open
Abstract
Determining the mechanisms by which a species adapts to its environment is a key endeavor in the study of evolution. In particular, relatively little is known about how transcriptional processes are fine-tuned to adjust to different environmental conditions. Here we study Drosophila melanogaster from 'Evolution Canyon' in Israel, which consists of two opposing slopes with divergent microclimates. We identify several hundred differentially expressed genes and dozens of differentially edited sites between flies from each slope, correlate these changes with genetic differences, and use CRISPR mutagenesis to validate that an intronic SNP in prominin regulates its editing levels. We also demonstrate that while temperature affects editing levels at more sites than genetic differences, genetically regulated sites tend to be less affected by temperature. This work shows the extent to which gene expression and RNA editing differ between flies from different microclimates, and provides insights into the regulation responsible for these differences.
Collapse
Affiliation(s)
- Arielle L Yablonovitch
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.,Biophysics Program, Stanford University, Stanford, CA, 94305, USA
| | - Jeremy Fu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Kexin Li
- Institute of Evolution, University of Haifa, Haifa, 3498838, Israel.,Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, 100093, Beijing, China
| | - Simpla Mahato
- Department of Biology, Indiana University, Bloomington, IN, 47405, USA
| | - Lin Kang
- Edward Via College of Osteopathic Medicine, Blacksburg, VA, 24060, USA
| | | | - Abraham B Korol
- Institute of Evolution, University of Haifa, Haifa, 3498838, Israel
| | - Hua Tang
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Pawel Michalak
- Edward Via College of Osteopathic Medicine, Blacksburg, VA, 24060, USA.,Biocomplexity Institute, Virginia Tech, Blacksburg, VA, 24061, USA.,Center for One Health Research, Virginia-Maryland College of Veterinary Medicine, Blacksburg, VA, 24060, USA
| | - Andrew C Zelhof
- Department of Biology, Indiana University, Bloomington, IN, 47405, USA
| | - Eviatar Nevo
- Institute of Evolution, University of Haifa, Haifa, 3498838, Israel.
| | - Jin Billy Li
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA. .,Biophysics Program, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
29
|
Zdraljevic S, Andersen EC. Natural diversity facilitates the discovery of conserved chemotherapeutic response mechanisms. Curr Opin Genet Dev 2017; 47:41-47. [PMID: 28892780 DOI: 10.1016/j.gde.2017.08.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Revised: 07/15/2017] [Accepted: 08/05/2017] [Indexed: 11/15/2022]
Abstract
Organismal fitness depends on adaptation to complex niches where chemical compounds and pathogens are omnipresent. These stresses can lead to the fixation of alleles in both xenobiotic responses and proliferative signaling pathways that promote survival in these niches. However, both xenobiotic responses and proliferative pathways vary within and among species. For example, genetic differences can accumulate within populations because xenobiotic exposures are not constant and selection is variable. Additionally, neutral genetic variation can accumulate in conserved proliferative pathway genes because these systems are robust to genetic perturbations given their essential roles in normal cell-fate specification. For these reasons, sensitizing mutations or chemical perturbations can disrupt pathways and reveal cryptic variation. With this fundamental view of how organisms respond to cytotoxic compounds and cryptic variation in conserved signaling pathways, it is not surprising that human patients have highly variable responses to chemotherapeutic compounds. These different responses result in the low FDA-approval rates for chemotherapeutics and underscore the need for new approaches to understand these diseases and therapeutic interventions. Model organisms, especially the classic invertebrate systems of Caenorhabditis elegans and Drosophila melanogaster, can be used to combine studies of natural variation across populations with responses to both xenobiotic compounds and chemotherapeutics targeted to conserved proliferative signaling pathways.
Collapse
Affiliation(s)
- Stefan Zdraljevic
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL 60208, USA; Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Erik C Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA; Robert H. Lurie Comprehensive Cancer Center of Northwestern University, Chicago, IL 60611, USA.
| |
Collapse
|
30
|
Pavlidis P, Alachiotis N. A survey of methods and tools to detect recent and strong positive selection. ACTA ACUST UNITED AC 2017; 24:7. [PMID: 28405579 PMCID: PMC5385031 DOI: 10.1186/s40709-017-0064-0] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2016] [Accepted: 03/29/2017] [Indexed: 01/25/2023]
Abstract
Positive selection occurs when an allele is favored by natural selection. The frequency of the favored allele increases in the population and due to genetic hitchhiking the neighboring linked variation diminishes, creating so-called selective sweeps. Detecting traces of positive selection in genomes is achieved by searching for signatures introduced by selective sweeps, such as regions of reduced variation, a specific shift of the site frequency spectrum, and particular LD patterns in the region. A variety of methods and tools can be used for detecting sweeps, ranging from simple implementations that compute summary statistics such as Tajima's D, to more advanced statistical approaches that use combinations of statistics, maximum likelihood, machine learning etc. In this survey, we present and discuss summary statistics and software tools, and classify them based on the selective sweep signature they detect, i.e., SFS-based vs. LD-based, as well as their capacity to analyze whole genomes or just subgenomic regions. Additionally, we summarize the results of comparisons among four open-source software releases (SweeD, SweepFinder, SweepFinder2, and OmegaPlus) regarding sensitivity, specificity, and execution times. In equilibrium neutral models or mild bottlenecks, both SFS- and LD-based methods are able to detect selective sweeps accurately. Methods and tools that rely on LD exhibit higher true positive rates than SFS-based ones under the model of a single sweep or recurrent hitchhiking. However, their false positive rate is elevated when a misspecified demographic model is used to represent the null hypothesis. When the correct (or similar to the correct) demographic model is used instead, the false positive rates are considerably reduced. The accuracy of detecting the true target of selection is decreased in bottleneck scenarios. In terms of execution time, LD-based methods are typically faster than SFS-based methods, due to the nature of required arithmetic.
Collapse
Affiliation(s)
- Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 70013 Crete, Greece
| | - Nikolaos Alachiotis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 70013 Crete, Greece
| |
Collapse
|