1
|
Wang X, Ingvarsson PK. Quantifying adaptive evolution and the effects of natural selection across the Norway spruce genome. Mol Ecol 2023; 32:5288-5304. [PMID: 37622583 DOI: 10.1111/mec.17106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 08/07/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023]
Abstract
Detecting natural selection is one of the major goals of evolutionary genomics. Here, we sequenced the whole genome of 25 Picea abies individuals and quantified the amount of selection across the genome. Using an estimate of the distribution of fitness effects, we showed that both negative selection and the rate of positively selected substitutions are very limited in coding regions. We found a positive correlation between the rate of adaptive substitutions and recombination rate and a negative correlation between the rate of adaptive substitutions and gene density, suggesting a widespread influence from Hill-Robertson interference on the efficiency of protein adaptation in P. abies. Finally, the distinct population statistics between genomic regions under either positive or balancing selection with that under neutral regions indicated the impact of natural selection on the genomic architecture of Norway spruce. Further gene ontology enrichment analysis for genes located in regions identified as undergoing either positive or long-term balancing selection also highlighted the specific molecular functions and biological processes that appear to be targets of selection in Norway spruce.
Collapse
Affiliation(s)
- Xi Wang
- Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
2
|
Fifer JE, Yasuda N, Yamakita T, Bove CB, Davies SW. Genetic divergence and range expansion in a western North Pacific coral. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 813:152423. [PMID: 34942242 DOI: 10.1016/j.scitotenv.2021.152423] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Revised: 12/09/2021] [Accepted: 12/11/2021] [Indexed: 06/14/2023]
Abstract
Coral poleward range expansions have recently been observed in response to warming oceans. Range expansion can lead to reduced genetic diversity and increased frequency of deleterious mutations that were rare in core populations, potentially limiting the ability for adaptation and persistence in novel environments. Successful expansions that overcome these founder effects and colonize new habitat have been attributed to multiple introductions from different sources, hybridization with native populations, or rapid adaptive evolution. Here, we investigate population genomic patterns of the reef-building coral Acropora hyacinthus along a latitudinal cline that includes a well-established range expansion front in Japan using 2b-RAD sequencing. A total of 184 coral samples were collected across seven sites spanning from ~24°N to near its northern range front at ~33°N. We uncover the presence of three cryptic lineages of A. hyacinthus, which occupy discrete reefs within this region. Only one lineage is present along the expansion front and we find evidence for its historical occupation of marginal habitats. Within this lineage we also find evidence of bottleneck pressures associated with expansion events including higher clonality, increased linkage disequilibrium, and lower genetic diversity in range edge populations compared to core populations. Asymmetric migration between populations was also detected with lower migration from edge sites. Lastly, we describe genomic signatures of local adaptation potentially attributed to lower winter temperatures experienced at the more recently expanded northern populations. Together these data illuminate the genomic consequences of range expansion in a coral and highlight how adaptation to discrete environments along expansion fronts may facilitate further range expansion in this temperate coral lineage.
Collapse
Affiliation(s)
- James E Fifer
- Department of Biology, Boston University, Boston, MA 02215, USA.
| | - Nina Yasuda
- Department of Marine Biology and Environmental Sciences, Faculty of Agriculture, University of Miyazaki, 1-1 Gakuenkibanadainishi, Miyazaki 889-2192, Japan.
| | - Takehisa Yamakita
- Marine Biodiversity and Environmental Assessment Research Center, Japan Agency for Marine-Earth Science and Technology, 2-15 Natsushimacho, Yokosuka, Kanagawa 237-0061, Japan
| | - Colleen B Bove
- Department of Biology, Boston University, Boston, MA 02215, USA
| | - Sarah W Davies
- Department of Biology, Boston University, Boston, MA 02215, USA
| |
Collapse
|
3
|
Abstract
It is known that methods to estimate the rate of adaptive evolution, which are based on the McDonald–Kreitman test, can be biased by changes in effective population size. Here, we demonstrate theoretically that changes in population size can also generate an artifactual correlation between the rate of adaptive evolution and any factor that is correlated to the strength of selection acting against deleterious mutations. In this context, we have investigated whether several site-level factors influence the rate of adaptive evolution in the divergence of humans and chimpanzees, two species that have been inferred to have undergone population size contraction since they diverged. We find that the rate of adaptive evolution, relative to the rate of mutation, is higher for more exposed amino acids, lower for amino acid pairs that are more dissimilar in terms of their polarity, volume, and lower for amino acid pairs that are subject to stronger purifying selection, as measured by the ratio of the numbers of nonsynonymous to synonymous polymorphisms (pN/pS). All of these correlations are opposite to the artifactual correlations expected under contracting population size. We therefore conclude that these correlations are genuine.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Ana Filipa Moutinho
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plon, Germany
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Corresponding author: E-mail:
| |
Collapse
|
4
|
Bergman J, Eyre-Walker A. Does Adaptive Protein Evolution Proceed by Large or Small Steps at the Amino Acid Level? Mol Biol Evol 2019; 36:990-998. [PMID: 30903659 DOI: 10.1093/molbev/msz033] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
A long-standing question in evolutionary biology is the relative contribution of large and small effect mutations to the adaptive process. We have investigated this question in proteins by estimating the rate of adaptive evolution between all pairs of amino acids separated by one mutational step using a McDonald-Kreitman type approach and genome-wide data from several Drosophila species. We find that the rate of adaptive evolution is highest among amino acids that are more similar. This is partly due to the fact that the proportion of mutations that are adaptive is higher among more similar amino acids. We also find that the rate of neutral evolution between amino acids is higher among more similar amino acids. Overall our results suggest that both the adaptive and nonadaptive evolution of proteins are dominated by substitutions between similar amino acids.
Collapse
Affiliation(s)
- Juraj Bergman
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Vienna Graduate School of Population Genetics, Wien, Austria
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
5
|
Vigué L, Eyre-Walker A. The comparative population genetics of Neisseria meningitidis and Neisseria gonorrhoeae. PeerJ 2019; 7:e7216. [PMID: 31293838 PMCID: PMC6599670 DOI: 10.7717/peerj.7216] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 05/30/2019] [Indexed: 12/31/2022] Open
Abstract
Neisseria meningitidis and N. gonorrhoeae are closely related pathogenic bacteria. To compare their population genetics, we compiled a dataset of 1,145 genes found across 20 N. meningitidis and 15 N. gonorrhoeae genomes. We find that N. meningitidis is seven-times more diverse than N. gonorrhoeae in their combined core genome. Both species have acquired the majority of their diversity by recombination with divergent strains, however, we find that N. meningitidis has acquired more of its diversity by recombination than N. gonorrhoeae. We find that linkage disequilibrium (LD) declines rapidly across the genomes of both species. Several observations suggest that N. meningitidis has a higher effective population size than N. gonorrhoeae; it is more diverse, the ratio of non-synonymous to synonymous polymorphism is lower, and LD declines more rapidly to a lower asymptote in N. meningitidis. The two species share a modest amount of variation, half of which seems to have been acquired by lateral gene transfer and half from their common ancestor. We investigate whether diversity varies across the genome of each species and find that it does. Much of this variation is due to different levels of lateral gene transfer. However, we also find some evidence that the effective population size varies across the genome. We test for adaptive evolution in the core genome using a McDonald–Kreitman test and by considering the diversity around non-synonymous sites that are fixed for different alleles in the two species. We find some evidence for adaptive evolution using both approaches.
Collapse
|
6
|
Fantastic Beasts and How To Sequence Them: Ecological Genomics for Obscure Model Organisms. Trends Genet 2017; 34:121-132. [PMID: 29198378 DOI: 10.1016/j.tig.2017.11.002] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Revised: 10/30/2017] [Accepted: 11/07/2017] [Indexed: 01/05/2023]
Abstract
The application of genomic approaches to 'obscure model organisms' (OMOs), meaning species with no prior genomic resources, enables increasingly sophisticated studies of the genomic basis of evolution, acclimatization, and adaptation in real ecological contexts. I consider here ecological questions that can be addressed using OMOs, and indicate optimal sequencing and data-handling solutions for each case. With this I hope to promote the diversity of OMO-based projects that would capitalize on the peculiarities of the natural history of OMOs and could feasibly be completed within the scope of a single PhD thesis.
Collapse
|
7
|
Liu HQ, Li Y, Irwin DM, Zhang YP, Wu DD. Integrative analysis of young genes, positively selected genes and lncRNAs in the development of Drosophila melanogaster. BMC Evol Biol 2014; 14:241. [PMID: 25470998 PMCID: PMC4258281 DOI: 10.1186/s12862-014-0241-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 11/10/2014] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Young genes and genes under positive selection commonly contribute to adaptive phenotypic evolution. Early developmental stages are very important for establishing phenotypes, which might be helpful for studying the evolutionary patterns of these rapidly evolving genes. RESULTS Here, we performed a weighted gene co-expression network analysis to identify modules of co-expressed genes at different stages of Drosophila melanogaster development. We found that young genes, including duplicated, orphan, and young lncRNA genes, are significantly enriched among modules associated with specific developmental stages. In addition, genes undergoing rapid amino acid sequence evolution driven by positive selection showed a similar proportion of essentiality with other genes, and enrichment in modules for specific developmental stages. CONCLUSIONS Our integrative analysis revealed important roles for the origin of new genes and rapid amino acid sequence evolution in development that may account for specific phenotype evolution in Drosophila melanogaster.
Collapse
Affiliation(s)
| | | | | | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
| | | |
Collapse
|
8
|
Gayà-Vidal M, Albà MM. Uncovering adaptive evolution in the human lineage. BMC Genomics 2014; 15:599. [PMID: 25030307 PMCID: PMC4124166 DOI: 10.1186/1471-2164-15-599] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 07/10/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The recent increase in human polymorphism data, together with the availability of genome sequences from several primate species, provides an unprecedented opportunity to investigate how natural selection has shaped human evolution. RESULTS We compared human branch-specific substitutions with variation data in the current human population to measure the impact of adaptive evolution on human protein coding genes. The use of single nucleotide polymorphisms (SNPs) with high derived allele frequencies (DAFs) minimized the influence of segregating slightly deleterious mutations and improved the estimation of the number of adaptive sites. Using DAF ≥ 60% we showed that the proportion of adaptive substitutions is 0.2% in the complete gene set. However, the percentage rose to 40% when we focused on genes that are specifically accelerated in the human branch with respect to the chimpanzee branch, or on genes that show signatures of adaptive selection at the codon level by the maximum likelihood based branch-site test. In general, neural genes are enriched in positive selection signatures. Genes with multiple lines of evidence of positive selection include taxilin beta, which is involved in motor nerve regeneration and syntabulin, and is required for the formation of new presynaptic boutons. CONCLUSIONS We combined several methods to detect adaptive evolution in human coding sequences at a genome-wide level. The use of variation data, in addition to sequence divergence information, uncovered previously undetected positive selection signatures in neural genes.
Collapse
Affiliation(s)
| | - M Mar Albà
- Evolutionary Genomics Group IMIM-UPF Research Programme on Biomedical Informatics, Barcelona Biomedical Research Park (PRBB), Aiguader 88, 08003 Barcelona, Spain.
| |
Collapse
|
9
|
Arbiza L, Gronau I, Aksoy BA, Hubisz MJ, Gulko B, Keinan A, Siepel A. Genome-wide inference of natural selection on human transcription factor binding sites. Nat Genet 2013; 45:723-9. [PMID: 23749186 DOI: 10.1038/ng.2658] [Citation(s) in RCA: 106] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Accepted: 05/08/2013] [Indexed: 11/09/2022]
Abstract
For decades, it has been hypothesized that gene regulation has had a central role in human evolution, yet much remains unknown about the genome-wide impact of regulatory mutations. Here we use whole-genome sequences and genome-wide chromatin immunoprecipitation and sequencing data to demonstrate that natural selection has profoundly influenced human transcription factor binding sites since the divergence of humans from chimpanzees 4-6 million years ago. Our analysis uses a new probabilistic method, called INSIGHT, for measuring the influence of selection on collections of short, interspersed noncoding elements. We find that, on average, transcription factor binding sites have experienced somewhat weaker selection than protein-coding genes. However, the binding sites of several transcription factors show clear evidence of adaptation. Several measures of selection are strongly correlated with predicted binding affinity. Overall, regulatory elements seem to contribute substantially to both adaptive substitutions and deleterious polymorphisms with key implications for human evolution and disease.
Collapse
Affiliation(s)
- Leonardo Arbiza
- Department of Biological Statistics & Computational Biology, Cornell University, Ithaca, NY, USA
| | | | | | | | | | | | | |
Collapse
|
10
|
Soylemez O, Kondrashov FA. Estimating the rate of irreversibility in protein evolution. Genome Biol Evol 2013; 4:1213-22. [PMID: 23132897 PMCID: PMC3542581 DOI: 10.1093/gbe/evs096] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Whether or not evolutionary change is inherently irreversible remains a controversial
topic. Some examples of evolutionary irreversibility are known; however, this question has
not been comprehensively addressed at the molecular level. Here, we use data from 221
human genes with known pathogenic mutations to estimate the rate of irreversibility in
protein evolution. For these genes, we reconstruct ancestral amino acid sequences along
the mammalian phylogeny and identify ancestral amino acid states that match known
pathogenic mutations. Such cases represent inherent evolutionary irreversibility because,
at the present moment, reversals to these ancestral amino acid states are impossible for
the human lineage. We estimate that approximately 10% of all amino acid
substitutions along the mammalian phylogeny are irreversible, such that a return to the
ancestral amino acid state would lead to a pathogenic phenotype. For a subset of 51 genes
with high rates of irreversibility, as much as 40% of all amino acid evolution was
estimated to be irreversible. Because pathogenic phenotypes do not resemble ancestral
phenotypes, the molecular nature of the high rate of irreversibility in proteins is best
explained by evolution with a high prevalence of compensatory, epistatic interactions
between amino acid sites. Under such mode of protein evolution, once an amino acid
substitution is fixed, the probability of its reversal declines as the protein sequence
accumulates changes that affect the phenotypic manifestation of the ancestral state. The
prevalence of epistasis in evolution indicates that the observed high rate of
irreversibility in protein evolution is an inherent property of protein structure and
function.
Collapse
Affiliation(s)
- Onuralp Soylemez
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | | |
Collapse
|
11
|
Bell CG, Wilson GA, Butcher LM, Roos C, Walter L, Beck S. Human-specific CpG "beacons" identify loci associated with human-specific traits and disease. Epigenetics 2012; 7:1188-99. [PMID: 22968434 PMCID: PMC3469460 DOI: 10.4161/epi.22127] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Regulatory change has long been hypothesized to drive the delineation of the human phenotype from other closely related primates. Here we provide evidence that CpG dinucleotides play a special role in this process. CpGs enable epigenome variability via DNA methylation, and this epigenetic mark functions as a regulatory mechanism. Therefore, species-specific CpGs may influence species-specific regulation. We report non-polymorphic species-specific CpG dinucleotides (termed “CpG beacons”) as a distinct genomic feature associated with CpG island (CGI) evolution, human traits and disease. Using an inter-primate comparison, we identified 21 extreme CpG beacon clusters (≥ 20/kb peaks, empirical p < 1.0 × 10−3) in humans, which include associations with four monogenic developmental and neurological disease related genes (Benjamini-Hochberg corrected p = 6.03 × 10−3). We also demonstrate that beacon-mediated CpG density gain in CGIs correlates with reduced methylation in these species in orthologous CGIs over time, via human, chimpanzee and macaque MeDIP-seq. Therefore mapping into both the genomic and epigenomic space the identified CpG beacon clusters define points of intersection where a substantial two-way interaction between genetic sequence and epigenetic state has occurred. Taken together, our data support a model for CpG beacons to contribute to CGI evolution from genesis to tissue-specific to constitutively active CGIs.
Collapse
Affiliation(s)
- Christopher G Bell
- Medical Genomics, UCL Cancer Institute, University College London, London, UK.
| | | | | | | | | | | |
Collapse
|
12
|
Warnefors M, Eyre-Walker A. A selection index for gene expression evolution and its application to the divergence between humans and chimpanzees. PLoS One 2012; 7:e34935. [PMID: 22529958 PMCID: PMC3329554 DOI: 10.1371/journal.pone.0034935] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2011] [Accepted: 03/09/2012] [Indexed: 12/22/2022] Open
Abstract
The importance of gene regulation in animal evolution is a matter of long-standing interest, but measuring the impact of selection on gene expression has proven a challenge. Here, we propose a selection index of gene expression as a straightforward method for assessing the mode and strength of selection operating on gene expression levels. The index is based on the widely used McDonald-Kreitman test and requires the estimation of four quantities: the within-species and between-species expression variances as well as the sequence heterozygosity and divergence of neutrally evolving sequences. We apply the method to data from human and chimpanzee lymphoblastoid cell lines and show that gene expression is in general under strong stabilizing selection. We also demonstrate how the same framework can be used to estimate the proportion of adaptive gene expression evolution.
Collapse
Affiliation(s)
- Maria Warnefors
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
13
|
Carneiro M, Albert FW, Melo-Ferreira J, Galtier N, Gayral P, Blanco-Aguiar JA, Villafuerte R, Nachman MW, Ferrand N. Evidence for widespread positive and purifying selection across the European rabbit (Oryctolagus cuniculus) genome. Mol Biol Evol 2012; 29:1837-49. [PMID: 22319161 DOI: 10.1093/molbev/mss025] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The nearly neutral theory of molecular evolution predicts that the efficacy of both positive and purifying selection is a function of the long-term effective population size (N(e)) of a species. Under this theory, the efficacy of natural selection should increase with N(e). Here, we tested this simple prediction by surveying ~1.5 to 1.8 Mb of protein coding sequence in the two subspecies of the European rabbit (Oryctolagus cuniculus algirus and O. c. cuniculus), a mammal species characterized by high levels of nucleotide diversity and N(e) estimates for each subspecies on the order of 1 × 10(6). When the segregation of slightly deleterious mutations and demographic effects were taken into account, we inferred that >60% of amino acid substitutions on the autosomes were driven to fixation by positive selection. Moreover, we inferred that a small fraction of new amino acid mutations (<4%) are effectively neutral (defined as 0 < N(e)s < 1) and that this fraction was negatively correlated with a gene's expression level. Consistent with models of recurrent adaptive evolution, we detected a negative correlation between levels of synonymous site polymorphism and the rate of protein evolution, although the correlation was weak and nonsignificant. No systematic X chromosome-autosome difference was found in the efficacy of selection. For example, the proportion of adaptive substitutions was significantly higher on the X chromosome compared with the autosomes in O. c. algirus but not in O. c. cuniculus. Our findings support widespread positive and purifying selection in rabbits and add to a growing list of examples suggesting that differences in N(e) among taxa play a substantial role in determining rates and patterns of protein evolution.
Collapse
Affiliation(s)
- Miguel Carneiro
- Centro de Investigação em Biodiversidade e Recursos Genéticos, Vairão, Portugal.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Downing T, Imamura H, Decuypere S, Clark TG, Coombs GH, Cotton JA, Hilley JD, de Doncker S, Maes I, Mottram JC, Quail MA, Rijal S, Sanders M, Schönian G, Stark O, Sundar S, Vanaerschot M, Hertz-Fowler C, Dujardin JC, Berriman M. Whole genome sequencing of multiple Leishmania donovani clinical isolates provides insights into population structure and mechanisms of drug resistance. Genome Res 2011; 21:2143-56. [PMID: 22038251 PMCID: PMC3227103 DOI: 10.1101/gr.123430.111] [Citation(s) in RCA: 311] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2011] [Accepted: 08/23/2011] [Indexed: 11/24/2022]
Abstract
Visceral leishmaniasis is a potentially fatal disease endemic to large parts of Asia and Africa, primarily caused by the protozoan parasite Leishmania donovani. Here, we report a high-quality reference genome sequence for a strain of L. donovani from Nepal, and use this sequence to study variation in a set of 16 related clinical lines, isolated from visceral leishmaniasis patients from the same region, which also differ in their response to in vitro drug susceptibility. We show that whole-genome sequence data reveals genetic structure within these lines not shown by multilocus typing, and suggests that drug resistance has emerged multiple times in this closely related set of lines. Sequence comparisons with other Leishmania species and analysis of single-nucleotide diversity within our sample showed evidence of selection acting in a range of surface- and transport-related genes, including genes associated with drug resistance. Against a background of relative genetic homogeneity, we found extensive variation in chromosome copy number between our lines. Other forms of structural variation were significantly associated with drug resistance, notably including gene dosage and the copy number of an experimentally verified circular episome present in all lines and described here for the first time. This study provides a basis for more powerful molecular profiling of visceral leishmaniasis, providing additional power to track the drug resistance and epidemiology of an important human pathogen.
Collapse
Affiliation(s)
- Tim Downing
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Hideo Imamura
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Saskia Decuypere
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Taane G. Clark
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Graham H. Coombs
- Strathclyde Institute of Pharmacy and Biomedical and Sciences, University of Strathclyde, Glasgow G4 0RE, United Kingdom
| | - James A. Cotton
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - James D. Hilley
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8TA, Scotland, United Kingdom
| | - Simonne de Doncker
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Ilse Maes
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Jeremy C. Mottram
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8TA, Scotland, United Kingdom
| | - Mike A. Quail
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Suman Rijal
- B.P. Koirala Institute of Health Sciences, Ghopa, Dharan, Nepal
| | - Mandy Sanders
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Gabriele Schönian
- Institut für Mikrobiologie und Hygiene, Charité Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Olivia Stark
- Institut für Mikrobiologie und Hygiene, Charité Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Shyam Sundar
- Institute of Medical Sciences, Banaras Hindu University, Varanasi, India
| | - Manu Vanaerschot
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Christiane Hertz-Fowler
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Jean-Claude Dujardin
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| |
Collapse
|
15
|
Weighing the evidence for adaptation at the molecular level. Trends Genet 2011; 27:343-9. [PMID: 21775012 DOI: 10.1016/j.tig.2011.06.003] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2011] [Revised: 06/10/2011] [Accepted: 06/10/2011] [Indexed: 11/24/2022]
Abstract
The abundance of genome polymorphism and divergence data has provided unprecedented insight into how mutation, drift and natural selection shape genome evolution. Application of the McDonald-Kreitman (MK) test to such data indicates a pervasive influence of positive selection, particularly in Drosophila species. However, evidence for positive selection in other species ranging from yeast to humans is often weak or absent. Although evidence for positive selection could be obscured in some species, there is also reason to believe that the frequency of adaptive substitutions could be overestimated as a result of epistatic fitness effects or hitchhiking of deleterious mutations. Based on these considerations it is argued that the common assumption of independence among sites must be relaxed before abandoning the neutral theory of molecular evolution.
Collapse
|
16
|
Suzuki Y. Statistical methods for detecting natural selection from genomic data. Genes Genet Syst 2011; 85:359-76. [PMID: 21415566 DOI: 10.1266/ggs.85.359] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
In the study of molecular and phenotypic evolution, understanding the relative importance of random genetic drift and positive selection as the mechanisms for driving divergences between populations and maintaining polymorphisms within populations has been a central issue. A variety of statistical methods has been developed for detecting natural selection operating at the amino acid and nucleotide sequence levels. These methods may be largely classified into those aimed at detecting recurrent and/or recent/ongoing natural selection by utilizing the divergence and/or polymorphism data. Using these methods, pervasive positive selection has been identified for protein-coding and non-coding sequences in the genomic analysis of some organisms. However, many of these methods have been criticized by using computer simulation and real data analysis to produce excessive false-positives and to be sensitive to various disturbing factors. Importantly, some of these methods have been invalidated experimentally. These facts indicate that many of the statistical methods for detecting natural selection are unreliable. In addition, the signals that have been believed as the evidence for fixations of advantageous mutations due to positive selection may also be interpreted as the evidence for fixations of deleterious mutations due to random genetic drift. The genomic diversity data are rapidly accumulating in various organisms, and detection of natural selection may play a critical role for clarifying the relative role of random genetic drift and positive selection in molecular and phenotypic evolution. It is therefore important to develop reliable statistical methods that are unbiased as well as robust against various disturbing factors, for inferring natural selection.
Collapse
Affiliation(s)
- Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, Japan.
| |
Collapse
|
17
|
Harris EE. Nonadaptive processes in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2011; 143 Suppl 51:13-45. [PMID: 21086525 DOI: 10.1002/ajpa.21439] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Evolutionary biology has tended to focus on adaptive evolution by positive selection as the primum mobile of evolutionary trajectories in species while underestimating the importance of nonadaptive evolutionary processes. In this review, I describe evidence that suggests that primate and human evolution has been strongly influenced by nonadaptive processes, particularly random genetic drift and mutation. This is evidenced by three fundamental effects: a relative relaxation of selective constraints (i.e., purifying selection), a relative increase in the fixation of slightly deleterious mutations, and a general reduction in the efficacy of positive selection. These effects are observed in protein-coding, regulatory regions, and in gene expression data, as well as in an augmentation of fixation of large-scale mutations, including duplicated genes, mobile genetic elements, and nuclear mitochondrial DNA. The evidence suggests a general population-level explanation such as a reduction in effective population size (N(e)). This would have tipped the balance between the evolutionary forces of natural selection and random genetic drift toward genetic drift for variants having small selective effects. After describing these proximate effects, I describe the potential consequences of these effects for primate and human evolution. For example, an increase in the fixation of slightly deleterious mutations could potentially have led to an increase in the fixation rate of compensatory mutations that act to suppress the effects of slightly deleterious substitutions. The potential consequences of compensatory evolution for the evolution of novel gene functions and in potentially confounding the detection of positively selected genes are explored. The consequences of the passive accumulation of large-scale genomic mutations by genetic drift are unclear, though evidence suggests that new gene copies as well as insertions of transposable elements into genes can potentially lead to adaptive phenotypes. Finally, because a decrease in selective constraint at the genetic level is expected to have effects at the morphological level, I review studies that compare rates of morphological change in various mammalian and island populations where N(e) is reduced. Furthermore, I discuss evidence that suggests that craniofacial morphology in the Homo lineage has shifted from an evolutionary rate constrained by purifying selection toward a neutral evolutionary rate.
Collapse
Affiliation(s)
- Eugene E Harris
- Department of Biological Sciences and Geology, Queensborough Community College, City University of New York, Bayside, NY 10364, USA.
| |
Collapse
|
18
|
Nei M, Suzuki Y, Nozawa M. The neutral theory of molecular evolution in the genomic era. Annu Rev Genomics Hum Genet 2010; 11:265-89. [PMID: 20565254 DOI: 10.1146/annurev-genom-082908-150129] [Citation(s) in RCA: 162] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The neutral theory of molecular evolution has been widely accepted and is the guiding principle for studying evolutionary genomics and the molecular basis of phenotypic evolution. Recent data on genomic evolution are generally consistent with the neutral theory. However, many recently published papers claim the detection of positive Darwinian selection via the use of new statistical methods. Examination of these methods has shown that their theoretical bases are not well established and often result in high rates of false-positive and false-negative results. When the deficiencies of these statistical methods are rectified, the results become largely consistent with the neutral theory. At present, genome-wide analyses of natural selection consist of collections of single-locus analyses. However, because phenotypic evolution is controlled by the interaction of many genes, the study of natural selection ought to take such interactions into account. Experimental studies of evolution will also be crucial.
Collapse
Affiliation(s)
- Masatoshi Nei
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, PA 16802, USA.
| | | | | |
Collapse
|
19
|
Toll-Riera M, Laurie S, Albà MM. Lineage-specific variation in intensity of natural selection in mammals. Mol Biol Evol 2010; 28:383-98. [PMID: 20688808 DOI: 10.1093/molbev/msq206] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The molecular clock hypothesis states that protein-coding genes evolve at an approximately constant rate. However, this is only expected to be true as long as the function and the tertiary structure of the molecule remain unaltered. An important implication of this statement is that significant deviations in the rate of evolution of a gene with respect to the species clock are likely to reflect functional and/or structural alterations. Here, we present a method to identify such deviations and apply it to a data set of 2,929 high-quality coding sequence alignments corresponding to one-to-one orthologous genes from six mammalian species--human, macaque, mouse, rat, cow, and dog. Deviated branches are defined as those that present significant alterations in both the rate of nonsynonymous substitutions (dN) and the selective pressure (dN/dS). Strikingly, we find that as many as 24.5% of the genes show branch-specific deviations in dN and dN/dS, though this is a relatively well-conserved set of genes. Around half of these genes show branch-specific acceleration of evolutionary rates. Positive selection (PS) tests based on divergence data only identify 17.7% of the accelerated branches. Failure to identify PS in accelerated branches with an excess of radical amino acid replacements suggests that these tests are conservative. Interestingly, genes with accelerated branches are significantly enriched in neural proteins, indicating that this type of protein might play a more important role than previously thought in species diversification, although they are generally not detected by PS tests. We discuss in detail several examples of genes that show lineage-specific evolutionary rate acceleration and are involved in synaptic transmission, chemosensory perception, and ubiquitination.
Collapse
Affiliation(s)
- Macarena Toll-Riera
- Evolutionary Genomics Group, Research Programme on Biomedical Informatics, Fundació Institut Municipal d'Investigació Mèdica, Barcelona Biomedical Research Park, Barcelona, Spain
| | | | | |
Collapse
|
20
|
Olson MS, Robertson AL, Takebayashi N, Silim S, Schroeder WR, Tiffin P. Nucleotide diversity and linkage disequilibrium in balsam poplar (Populus balsamifera). THE NEW PHYTOLOGIST 2010; 186:526-536. [PMID: 20122131 DOI: 10.1111/j.1469-8137.2009.03174.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
*Current perceptions that poplars have high levels of nucleotide variation, large effective population sizes, and rapid decay of linkage disequilibrium are based primarily on studies from one poplar species, Populus tremula. *We analysed 590 gene fragments (average length 565 bp) from each of 15 individuals from different populations from throughout the range of Populus balsamifera. *Nucleotide diversity (theta(total) = 0.0028, pi = 0.0027) was low compared with other trees and model agricultural systems. Patterns of nucleotide diversity and site frequency spectra were consistent with purifying selection on replacement and intron sites. When averaged across all loci we found no evidence for decay of linkage disequilibrium across 750 bp, consistent with the low estimates of the scaled recombination parameter, rho = 0.0092. *Compared with P. tremula, a well studied congener with a similar distribution, P. balsamifera has low diversity and low effective recombination, both of which indicate a lower effective population size in P. balsamifera. Patterns of diversity and linkage indicate that there is considerable variation in population genomic patterns among poplar species and unlike P. tremula, association mapping techniques in balsam poplar should consider sampling single nucleotide polymorphisms (SNPs) at well-spaced intervals.
Collapse
Affiliation(s)
- Matthew S Olson
- Institute of Arctic Biology and Department of Biology and Wildlife, University of Alaska Fairbanks, AK 99775, USA.
| | | | | | | | | | | |
Collapse
|
21
|
Enard D, Depaulis F, Roest Crollius H. Human and non-human primate genomes share hotspots of positive selection. PLoS Genet 2010; 6:e1000840. [PMID: 20140238 PMCID: PMC2816677 DOI: 10.1371/journal.pgen.1000840] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Accepted: 01/06/2010] [Indexed: 12/17/2022] Open
Abstract
Among primates, genome-wide analysis of recent positive selection is currently limited to the human species because it requires extensive sampling of genotypic data from many individuals. The extent to which genes positively selected in human also present adaptive changes in other primates therefore remains unknown. This question is important because a gene that has been positively selected independently in the human and in other primate lineages may be less likely to be involved in human specific phenotypic changes such as dietary habits or cognitive abilities. To answer this question, we analysed heterozygous Single Nucleotide Polymorphisms (SNPs) in the genomes of single human, chimpanzee, orangutan, and macaque individuals using a new method aiming to identify selective sweeps genome-wide. We found an unexpectedly high number of orthologous genes exhibiting signatures of a selective sweep simultaneously in several primate species, suggesting the presence of hotspots of positive selection. A similar significant excess is evident when comparing genes positively selected during recent human evolution with genes subjected to positive selection in their coding sequence in other primate lineages and identified using a different test. These findings are further supported by comparing several published human genome scans for positive selection with our findings in non-human primate genomes. We thus provide extensive evidence that the co-occurrence of positive selection in humans and in other primates at the same genetic loci can be measured with only four species, an indication that it may be a widespread phenomenon. The identification of positive selection in humans alongside other primates is a powerful tool to outline those genes that were selected uniquely during recent human evolution.
Collapse
Affiliation(s)
- David Enard
- DYOGEN Lab, CNRS UMR8541, Ecole Normale Supérieure, Paris,
France
| | - Frantz Depaulis
- Laboratoire Ecologie et Evolution, CNRS UMR7625, Ecole Normale
Supérieure, UPMC Paris Universitas, Paris, France
| | | |
Collapse
|
22
|
Halligan DL, Oliver F, Eyre-Walker A, Harr B, Keightley PD. Evidence for pervasive adaptive protein evolution in wild mice. PLoS Genet 2010; 6:e1000825. [PMID: 20107605 PMCID: PMC2809770 DOI: 10.1371/journal.pgen.1000825] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2009] [Accepted: 12/21/2009] [Indexed: 11/18/2022] Open
Abstract
The relative contributions of neutral and adaptive substitutions to molecular evolution has been one of the most controversial issues in evolutionary biology for more than 40 years. The analysis of within-species nucleotide polymorphism and between-species divergence data supports a widespread role for adaptive protein evolution in certain taxa. For example, estimates of the proportion of adaptive amino acid substitutions (alpha) are 50% or more in enteric bacteria and Drosophila. In contrast, recent estimates of alpha for hominids have been at most 13%. Here, we estimate alpha for protein sequences of murid rodents based on nucleotide polymorphism data from multiple genes in a population of the house mouse subspecies Mus musculus castaneus, which inhabits the ancestral range of the Mus species complex and nucleotide divergence between M. m. castaneus and M. famulus or the rat. We estimate that 57% of amino acid substitutions in murids have been driven by positive selection. Hominids, therefore, are exceptional in having low apparent levels of adaptive protein evolution. The high frequency of adaptive amino acid substitutions in wild mice is consistent with their large effective population size, leading to effective natural selection at the molecular level. Effective natural selection also manifests itself as a paucity of effectively neutral nonsynonymous mutations in M. m. castaneus compared to humans.
Collapse
Affiliation(s)
- Daniel L. Halligan
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Fiona Oliver
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Adam Eyre-Walker
- Centre for the Study of Evolution and School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Bettina Harr
- Max-Planck-Institute for Evolutionary Biology, Plön, Germany
| | - Peter D. Keightley
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
23
|
Eyre-Walker A, Keightley PD. Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. Mol Biol Evol 2009; 26:2097-108. [PMID: 19535738 DOI: 10.1093/molbev/msp119] [Citation(s) in RCA: 297] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The prevalence of adaptive evolution relative to genetic drift is a central problem in molecular evolution. Methods to estimate the fraction of adaptive nucleotide substitutions (alpha) have been developed, based on the McDonald-Kreitman test, that contrast polymorphism and divergence between selectively and neutrally evolving sites. However, these methods are expected to give downwardly biased estimates of alpha if there are slightly deleterious mutations, because these inflate polymorphism relative to divergence. Here, we estimate alpha by simultaneously estimating the distribution of fitness effects of new mutations at selected sites from the site frequency spectrum and the number of adaptive substitutions. We test the method using simulations. If data meet the assumptions of the analysis model, estimates of alpha show little bias, even when there is little or no recombination. However, population size differences between the divergence and polymorphism phases may cause alpha to be over or underestimated by a predictable factor that depends on the magnitude of the population size change and the shape of the distribution of effects of deleterious mutations. We analyze several data sets of protein-coding genes and noncoding regions from hominids and Drosophila. In Drosophila genes, we estimate that approximately 50% of amino acid substitutions and approximately 20% of substitutions in introns are adaptive. In protein-coding and noncoding data sets of humans, comparison to macaque sequences reveals little evidence for adaptive substitutions. However, the true frequency of adaptive substitutions in human-coding DNA could be as high as 40%, because estimates based on current polymorphism may be strongly downwardly biased by a decrease in the effective population size along the human lineage.
Collapse
Affiliation(s)
- Adam Eyre-Walker
- Centre for the Study of Evolution and School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | | |
Collapse
|
24
|
Ellegren H. A selection model of molecular evolution incorporating the effective population size. Evolution 2009; 63:301-5. [PMID: 19215289 DOI: 10.1111/j.1558-5646.2008.00560.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden.
| |
Collapse
|
25
|
Pathogenic adaptation of intracellular bacteria by rewiring a cis-regulatory input function. Proc Natl Acad Sci U S A 2009; 106:3982-7. [PMID: 19234126 DOI: 10.1073/pnas.0811669106] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The acquisition of DNA by horizontal gene transfer enables bacteria to adapt to previously unexploited ecological niches. Although horizontal gene transfer and mutation of protein-coding sequences are well-recognized forms of pathogen evolution, the evolutionary significance of cis-regulatory mutations in creating phenotypic diversity through altered transcriptional outputs is not known. We show the significance of regulatory mutation for pathogen evolution by mapping and then rewiring a cis-regulatory module controlling a gene required for murine typhoid. Acquisition of a binding site for the Salmonella pathogenicity island-2 regulator, SsrB, enabled the srfN gene, ancestral to the Salmonella genus, to play a role in pathoadaptation of S. typhimurium to a host animal. We identified the evolved cis-regulatory module and quantified the fitness gain that this regulatory output accrues for the bacterium using competitive infections of host animals. Our findings highlight a mechanism of pathogen evolution involving regulatory mutation that is selected because of the fitness advantage the new regulatory output provides the incipient clones.
Collapse
|
26
|
Axelsson E, Ellegren H. Quantification of adaptive evolution of genes expressed in avian brain and the population size effect on the efficacy of selection. Mol Biol Evol 2009; 26:1073-9. [PMID: 19188264 DOI: 10.1093/molbev/msp019] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Whether protein evolution is mainly due to fixation of beneficial alleles by positive selection or to random genetic drift has remained a contentious issue over the years. Here, we use two genomewide polymorphism data sets collected from chicken populations, together with divergence data from >5,000 chicken-zebra finch gene orthologs expressed in brain, to assess the amount of adaptive evolution in protein-coding genes of birds. First, we show that estimates of the fixation index (FI, the ratio of fixed nonsynonymous-to-synonymous changes over the ratio of the corresponding polymorphisms) are highly dependent on the character of the underlying data sets. Second, by using polymorphism data from high-frequency alleles, to avoid the confounding effect of slightly deleterious mutations segregating at low frequency, we estimate that about 20% of amino acid changes have been brought to fixation through positive selection during avian evolution. This estimate is intermediate to that obtained in humans (lower) and flies as well as bacteria (higher), and is consistent with population genetics theory that stipulates a positive relationship between the efficiency of selection and the effective population size. Further, by comparing the FIs for common and all alleles, we estimate that approximately 20% of nonsynonymous variation segregating in chicken populations represent slightly deleterious mutations, which is less than in Drosophila. Overall, these results highlight the link between the effective population size and positive as well as negative selection.
Collapse
Affiliation(s)
- Erik Axelsson
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | | |
Collapse
|
27
|
Abstract
Human genes responsible for inherited diseases are important for the understanding of human disease. We investigated the degree of polymorphism and divergence in the human disease genes to elucidate the effect of natural selection on human disease genes. In particular, the effect of disease dominance was incorporated into the analysis. Both dominant disease genes (DDG) and recessive disease genes (RDG) had a higher mutation rate per site and encoded longer proteins than the nondisease genes, which exposed the disease genes to a faster flux of new mutations. Using an unbiased polymorphism dataset, we found that, proportionally, RDG harbor more nonsynonymous polymorphisms compared with DDG. We estimated the selection intensity on the disease genes using polymorphism and divergence data and determined whether the different patterns of polymorphism and divergence between DDG and RDG could be explained by the difference in only dominance. Even after the dominance effect was considered, the selection intensity on RDG was significantly different from DDG, suggesting that the deleterious effect of the dominant and recessive disease mutations are fundamentally different.
Collapse
|
28
|
Abstract
Models that explicitly account for the effect of selection on new mutations have been proposed to account for "codon bias" or the excess of "preferred" codons that results from selection for translational efficiency and/or accuracy. In principle, such models can be applied to any mutation that results in a preferred allele, but in most cases, the fitness effect of a specific mutation cannot be predicted. Here we show that it is possible to assign preferred and unpreferred states to amino acid changing mutations that occur in protein domains. We propose that mutations that lead to more common amino acids (at a given position in a domain) can be considered "preferred alleles" just as are synonymous mutations leading to codons for more abundant tRNAs. We use genome-scale polymorphism data to show that alleles for preferred amino acids in protein domains occur at higher frequencies in the population, as has been shown for preferred codons. We show that this effect is quantitative, such that there is a correlation between the shift in frequency of preferred alleles and the predicted fitness effect. As expected, we also observe a reduction in the numbers of polymorphisms and substitutions at more important positions in domains, consistent with stronger selection at those positions. We examine the derived allele frequency distribution and polymorphism to divergence ratios of preferred and unpreferred differences and find evidence for both negative and positive selections acting to maintain protein domains in the human population. Finally, we analyze a model for selection on amino acid preferences in protein domains and find that it is consistent with the quantitative effects that we observe.
Collapse
Affiliation(s)
- Alan M Moses
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, UK.
| | | |
Collapse
|
29
|
Oliveira DCSG, Raychoudhury R, Lavrov DV, Werren JH. Rapidly evolving mitochondrial genome and directional selection in mitochondrial genes in the parasitic wasp nasonia (hymenoptera: pteromalidae). Mol Biol Evol 2008; 25:2167-80. [PMID: 18653734 PMCID: PMC2727384 DOI: 10.1093/molbev/msn159] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2008] [Indexed: 11/13/2022] Open
Abstract
We sequenced the nearly complete mtDNA of 3 species of parasitic wasps, Nasonia vitripennis (2 strains), Nasonia giraulti, and Nasonia longicornis, including all 13 protein-coding genes and the 2 rRNAs, and found unusual patterns of mitochondrial evolution. The Nasonia mtDNA has a unique gene order compared with other insect mtDNAs due to multiple rearrangements. The mtDNAs of these wasps also show nucleotide substitution rates over 30 times faster than nuclear protein-coding genes, indicating among the highest substitution rates found in animal mitochondria (normally <10 times faster). A McDonald and Kreitman test shows that the between-species frequency of fixed replacement sites relative to silent sites is significantly higher compared with within-species polymorphisms in 2 mitochondrial genes of Nasonia, atp6 and atp8, indicating directional selection. Consistent with this interpretation, the Ka/Ks (nonsynonymous/synonymous substitution rates) ratios are higher between species than within species. In contrast, cox1 shows a signature of purifying selection for amino acid sequence conservation, although rates of amino acid substitutions are still higher than for comparable insects. The mitochondrial-encoded polypeptides atp6 and atp8 both occur in F0F1ATP synthase of the electron transport chain. Because malfunction in this fundamental protein severely affects fitness, we suggest that the accelerated accumulation of replacements is due to beneficial mutations necessary to compensate mild-deleterious mutations fixed by random genetic drift or Wolbachia sweeps in the fast evolving mitochondria of Nasonia. We further propose that relatively high rates of amino acid substitution in some mitochondrial genes can be driven by a "Compensation-Draft Feedback"; increased fixation of mildly deleterious mutations results in selection for compensatory mutations, which lead to fixation of additional deleterious mutations in nonrecombining mitochondrial genomes, thus accelerating the process of amino acid substitutions.
Collapse
|
30
|
Hughes AL. Near neutrality: leading edge of the neutral theory of molecular evolution. Ann N Y Acad Sci 2008; 1133:162-79. [PMID: 18559820 DOI: 10.1196/annals.1438.001] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The nearly neutral theory represents a development of Kimura's neutral theory of molecular evolution that makes testable predictions that go beyond a mere null model. Recent evidence has strongly supported several of these predictions, including the prediction that slightly deleterious variants will accumulate in a species that has undergone a severe bottleneck or in cases where recombination is reduced or absent. Because bottlenecks often occur in speciation and slightly deleterious mutations in coding regions will usually be nonsynonymous, we should expect that the ratio of nonsynonymous to synonymous fixed differences between species should often exceed the ratio of nonsynonymous to synonymous polymorphisms within species. Many data support this prediction, although they have often been wrongly interpreted as evidence for positive Darwinian selection. The use of conceptually flawed tests for positive selection has become widespread in recent years, seriously harming the quest for an understanding of genome evolution. When properly analyzed, many (probably most) claimed cases of positive selection will turn out to involve the fixation of slightly deleterious mutations by genetic drift in bottlenecked populations. Slightly deleterious variants are a transient feature of evolution in the long term, but they have substantially affected contemporary species, including our own.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Coker Life Sciences Bldg., 700 Sumter St., Columbia, South Carolina 29208, USA.
| |
Collapse
|
31
|
|
32
|
Studer RA, Penel S, Duret L, Robinson-Rechavi M. Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes. Genome Res 2008; 18:1393-402. [PMID: 18562677 DOI: 10.1101/gr.076992.108] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
A stringent branch-site codon model was used to detect positive selection in vertebrate evolution. We show that the test is robust to the large evolutionary distances involved. Positive selection was detected in 77% of 884 genes studied. Most positive selection concerns a few sites on a single branch of the phylogenetic tree: Between 0.9% and 4.7% of sites are affected by positive selection depending on the branches. No functional category was overrepresented among genes under positive selection. Surprisingly, whole genome duplication had no effect on the prevalence of positive selection, whether the fish-specific genome duplication or the two rounds at the origin of vertebrates. Thus positive selection has not been limited to a few gene classes, or to specific evolutionary events such as duplication, but has been pervasive during vertebrate evolution.
Collapse
Affiliation(s)
- Romain A Studer
- Department of Ecology and Evolution, Biophore, Lausanne University, CH-1015 Lausanne, Switzerland
| | | | | | | |
Collapse
|
33
|
Charlesworth J, Eyre-Walker A. The McDonald-Kreitman test and slightly deleterious mutations. Mol Biol Evol 2008; 25:1007-15. [PMID: 18195052 DOI: 10.1093/molbev/msn005] [Citation(s) in RCA: 138] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It is possible to estimate the proportion of substitutions that are due to adaptive evolution using the numbers of silent and nonsilent polymorphisms and substitutions in a McDonald and Kreitman-type analysis. Unfortunately, this estimate of adaptive evolution is biased downward by the segregation of slightly deleterious mutations. It has been suggested that 1 way to cope with the effects of these slightly deleterious mutations is to remove low-frequency polymorphisms from the analysis. We investigate the performance of this method theoretically. We show that although removing low-frequency polymorphisms does indeed reduce the bias in the estimate of adaptive evolution, the estimate is always downwardly biased, often to the extent that one would not be able to detect adaptive evolution, even if it existed. The method is reasonably satisfactory, only if the rate of adaptive evolution is high and the distribution of fitness effects for slightly deleterious mutations is very leptokurtic. Our analysis suggests that adaptive evolution could be quite prevalent in humans (>8%) and still not be detectable using current methodologies. Our analysis also suggests that the level of adaptive evolution has probably been underestimated, possibly substantially, in both bacteria and Drosophila.
Collapse
Affiliation(s)
- Jane Charlesworth
- Centre for the Study of Evolution, University of Sussex, Brighton, United Kingdom
| | | |
Collapse
|
34
|
Hanada K, Shiu SH, Li WH. The Nonsynonymous/Synonymous Substitution Rate Ratio versus the Radical/Conservative Replacement Rate Ratio in the Evolution of Mammalian Genes. Mol Biol Evol 2007; 24:2235-41. [PMID: 17652332 DOI: 10.1093/molbev/msm152] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
There are 2 ways to infer selection pressures in the evolution of protein-coding genes, the nonsynonymous and synonymous substitution rate ratio (K(A)/K(S)) and the radical and conservative amino acid replacement rate ratio (K(R)/K(C)). Because the K(R)/K(C) ratio depends on the definition of radical and conservative changes in the classification of amino acids, we develop an amino acid classification that maximizes the correlation between K(A)/K(S) and K(R)/K(C). An analysis of 3,375 orthologous gene groups among 5 mammalian species shows that our classification gives a significantly higher correlation coefficient between the 2 ratios than those of existing classifications. However, there are many orthologous gene groups with a low K(A)/K(S) but a high K(R)/K(C) ratio. Examining the functions of these genes, we found an overrepresentation of functional categories related to development. To determine if the overrepresentation is stage specific, we examined the expression patterns of these genes at different developmental stages of the mouse. Interestingly, these genes are highly expressed in the early middle stage of development (blastocyst to amnion). It is commonly thought that developmental genes tend to be conservative in evolution, but some molecular changes in developmental stages should have contributed to morphological divergence in adult mammals. Therefore, we propose that the relaxed pressures indicated by the K(R)/K(C) ratio but not by K(A)/K(S) in the early middle stage of development may be important for the morphological divergence of mammals at the adult stage, whereas purifying selection detected by K(A)/K(S) occurs in the early middle developmental stage.
Collapse
Affiliation(s)
- Kousuke Hanada
- Department of Ecology and Evolution, University of Chicago, USA
| | | | | |
Collapse
|
35
|
Abstract
An important tenet of evolutionary developmental biology ("evo devo") is that adaptive mutations affecting morphology are more likely to occur in the cis-regulatory regions than in the protein-coding regions of genes. This argument rests on two claims: (1) the modular nature of cis-regulatory elements largely frees them from deleterious pleiotropic effects, and (2) a growing body of empirical evidence appears to support the predominant role of gene regulatory change in adaptation, especially morphological adaptation. Here we discuss and critique these assertions. We first show that there is no theoretical or empirical basis for the evo devo contention that adaptations involving morphology evolve by genetic mechanisms different from those involving physiology and other traits. In addition, some forms of protein evolution can avoid the negative consequences of pleiotropy, most notably via gene duplication. In light of evo devo claims, we then examine the substantial data on the genetic basis of adaptation from both genome-wide surveys and single-locus studies. Genomic studies lend little support to the cis-regulatory theory: many of these have detected adaptation in protein-coding regions, including transcription factors, whereas few have examined regulatory regions. Turning to single-locus studies, we note that the most widely cited examples of adaptive cis-regulatory mutations focus on trait loss rather than gain, and none have yet pinpointed an evolved regulatory site. In contrast, there are many studies that have both identified structural mutations and functionally verified their contribution to adaptation and speciation. Neither the theoretical arguments nor the data from nature, then, support the claim for a predominance of cis-regulatory mutations in evolution. Although this claim may be true, it is at best premature. Adaptation and speciation probably proceed through a combination of cis-regulatory and structural mutations, with a substantial contribution of the latter.
Collapse
Affiliation(s)
- Hopi E Hoekstra
- Department of Organismic and Evolutionary Biology and the Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, Massachusetts 02138, USA.
| | | |
Collapse
|
36
|
Abstract
Recent studies of developmental biology have shown that the genes controlling phenotypic characters expressed in the early stage of development are highly conserved and that recent evolutionary changes have occurred primarily in the characters expressed in later stages of development. Even the genes controlling the latter characters are generally conserved, but there is a large component of neutral or nearly neutral genetic variation within and between closely related species. Phenotypic evolution occurs primarily by mutation of genes that interact with one another in the developmental process. The enormous amount of phenotypic diversity among different phyla or classes of organisms is a product of accumulation of novel mutations and their conservation that have facilitated adaptation to different environments. Novel mutations may be incorporated into the genome by natural selection (elimination of preexisting genotypes) or by random processes such as genetic and genomic drift. However, once the mutations are incorporated into the genome, they may generate developmental constraints that will affect the future direction of phenotypic evolution. It appears that the driving force of phenotypic evolution is mutation, and natural selection is of secondary importance.
Collapse
Affiliation(s)
- Masatoshi Nei
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, 328 Mueller Laboratory, University Park, PA 16802, USA.
| |
Collapse
|
37
|
Hughes AL. Looking for Darwin in all the wrong places: the misguided quest for positive selection at the nucleotide sequence level. Heredity (Edinb) 2007; 99:364-73. [PMID: 17622265 DOI: 10.1038/sj.hdy.6801031] [Citation(s) in RCA: 189] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Recent years have seen an explosion of interest in evidence for positive Darwinian selection at the molecular level. This quest has been hampered by the use of statistical methods that fail adequately to rule out alternative hypotheses, particularly the relaxation of purifying selection and the effects of population bottlenecks, during which the effectiveness of purifying selection is reduced. A further problem has been the assumption that positive selection will generally involve repeated amino-acid changes to a single protein. This model was derived from the case of the vertebrate major histocompatibility complex (MHC), but the MHC proteins are unusual in being involved in protein-protein recognition and in a co-evolutionary process of pathogens. There is no reason to suppose that repeated amino-acid changes to a single protein are involved in selectively advantageous phenotypes in general. Rather adaptive phenotypes are much more likely to result from other causes, including single amino-acid changes; deletion or silencing of genes or changes in the pattern of gene expression.
Collapse
Affiliation(s)
- A L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA.
| |
Collapse
|