51
|
Charlesworth B, Campos JL, Jackson BC. Faster-X evolution: Theory and evidence from Drosophila. Mol Ecol 2018; 27:3753-3771. [PMID: 29431881 DOI: 10.1111/mec.14534] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Revised: 01/31/2018] [Accepted: 02/06/2018] [Indexed: 12/13/2022]
Abstract
A faster rate of adaptive evolution of X-linked genes compared with autosomal genes can be caused by the fixation of recessive or partially recessive advantageous mutations, due to the full expression of X-linked mutations in hemizygous males. Other processes, including recombination rate and mutation rate differences between X chromosomes and autosomes, may also cause faster evolution of X-linked genes. We review population genetics theory concerning the expected relative values of variability and rates of evolution of X-linked and autosomal DNA sequences. The theoretical predictions are compared with data from population genomic studies of several species of Drosophila. We conclude that there is evidence for adaptive faster-X evolution of several classes of functionally significant nucleotides. We also find evidence for potential differences in mutation rates between X-linked and autosomal genes, due to differences in mutational bias towards GC to AT mutations. Many aspects of the data are consistent with the male hemizygosity model, although not all possible confounding factors can be excluded.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Benjamin C Jackson
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
52
|
Wu DD, Irwin DM. Evolution of Trichocyte Keratin Associated Proteins. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1054:47-56. [PMID: 29797267 DOI: 10.1007/978-981-10-8195-8_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The major components of hair are keratins and keratin associated proteins (KRTAPs). KRTAPs form the interfilamentous matrix between intermediate filament bundles through extensive disulfide bond cross-linking with the numerous cysteine residues in hair keratins. A variable number of approximately100-180 genes compose the KRTAP gene family in mammals. KRTAP gene family members present a typical pattern of concerted evolution, and its evolutionary features are consistent with the evolution of mammalian hair. KRATP genes might be more important in determining the structure of cashmere fibers in domestic mammals like sheep and goats. KRTAP gene variants thus should provide information for improved wool by sheep and goat breeding.
Collapse
Affiliation(s)
- Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| |
Collapse
|
53
|
Alves I, Houle AA, Hussin JG, Awadalla P. The impact of recombination on human mutation load and disease. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160465. [PMID: 29109227 PMCID: PMC5698626 DOI: 10.1098/rstb.2016.0465] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2017] [Indexed: 12/21/2022] Open
Abstract
Recombination promotes genomic integrity among cells and tissues through double-strand break repair, and is critical for gamete formation and fertility through a strict regulation of the molecular mechanisms associated with proper chromosomal disjunction. In humans, congenital defects and recurrent structural abnormalities can be attributed to aberrant meiotic recombination. Moreover, mutations affecting genes involved in recombination pathways are directly linked to pathologies including infertility and cancer. Recombination is among the most prominent mechanism shaping genome variation, and is associated with not only the structuring of genomic variability, but is also tightly linked with the purging of deleterious mutations from populations. Together, these observations highlight the multiple roles of recombination in human genetics: its ability to act as a major force of evolution, its molecular potential to maintain genome repair and integrity in cell division and its mutagenic cost impacting disease evolution.This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Collapse
Affiliation(s)
- Isabel Alves
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
| | - Armande Ang Houle
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
- Department of Molecular Genetics, University of Toronto, Medical Science Building, 1 King's College Circle, Toronto, Ontario, Canada M5S 1A8
| | - Julie G Hussin
- Montreal Heart Institute, Department of Medicine, University of Montreal, 5000 Rue Bélanger, Montréal, Quebec, Canada H1T 1C8
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Philip Awadalla
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
- Department of Molecular Genetics, University of Toronto, Medical Science Building, 1 King's College Circle, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
54
|
Niu Z, Xue Q, Wang H, Xie X, Zhu S, Liu W, Ding X. Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus. Int J Mol Sci 2017; 18:E2307. [PMID: 29099062 PMCID: PMC5713276 DOI: 10.3390/ijms18112307] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 09/27/2017] [Accepted: 10/20/2017] [Indexed: 01/03/2023] Open
Abstract
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences. Dendrobium is one of the largest genera in the orchid species. Evolutionary studies of the plastomic organization and base composition are limited in this genus. In this study, we obtained the high-quality plastome sequences of D. loddigesii and D. devonianum. The comparison results showed a nearly identical organization in Dendrobium plastomes, indicating that the plastomic organization is highly conserved in Dendrobium genus. Furthermore, the impact of three evolutionary forces-selection, mutational biases, and GC-biased gene conversion (gBGC)-on the variation of GC content in Dendrobium plastomes was evaluated. Our results revealed: (1) consistent GC content evolution trends and mutational biases in single-copy (SC) and inverted repeats (IRs) regions; and (2) that gBGC has influenced the plastome-wide GC content evolution. These results suggest that both mutational biases and gBGC affect GC content in the plastomes of Dendrobium genus.
Collapse
Affiliation(s)
- Zhitao Niu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Qingyun Xue
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Hui Wang
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Xuezhu Xie
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Shuying Zhu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Wei Liu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| | - Xiaoyu Ding
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
| |
Collapse
|
55
|
Yahalomi D, Haddas-Sasson M, Rubinstein ND, Feldstein T, Diamant A, Huchon D. The Multipartite Mitochondrial Genome of Enteromyxum leei (Myxozoa): Eight Fast-Evolving Megacircles. Mol Biol Evol 2017; 34:1551-1556. [PMID: 28333349 DOI: 10.1093/molbev/msx072] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Myxozoans are a large group of poorly characterized cnidarian parasites. To gain further insight into their evolution, we sequenced the mitochondrial (mt) genome of Enteromyxum leei and reevaluate the mt genome structure of Kudoa iwatai. Although the typical animal mt genome is a compact, 13-25 kb, circular chromosome, the mt genome of E. leei was found to be fragmented into eight circular chromosomes of ∼23 kb, making it the largest described animal mt genome. Each chromosome was found to harbor a large noncoding region (∼15 kb), nearly identical between chromosomes. The protein coding genes show an unusually high rate of sequence evolution and possess little similarity to their cnidarian homologs. Only five protein coding genes could be identified and no tRNA genes. Surprisingly, the mt genome of K. iwatai was also found to be composed of two chromosomes. These observations confirm the remarkable plasticity of myxozoan mt genomes.
Collapse
Affiliation(s)
- Dayana Yahalomi
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Michal Haddas-Sasson
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Nimrod D Rubinstein
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Tamar Feldstein
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel.,The Steinhardt Museum of Natural History and Israel National Center for Biodiversity Studies, Tel Aviv University, Tel Aviv, Israel
| | - Arik Diamant
- National Center for Mariculture, Israel Oceanographic and Limnological Research, Eilat, Israel
| | - Dorothée Huchon
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel.,The Steinhardt Museum of Natural History and Israel National Center for Biodiversity Studies, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
56
|
Evolutionary forces affecting synonymous variations in plant genomes. PLoS Genet 2017; 13:e1006799. [PMID: 28531201 PMCID: PMC5460877 DOI: 10.1371/journal.pgen.1006799] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Revised: 06/06/2017] [Accepted: 05/04/2017] [Indexed: 01/04/2023] Open
Abstract
Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow to jointly estimating the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones. In protein coding genes, base composition strongly varies within and among plant genomes, especially at positions where changes do not alter the coded protein (synonymous variations). Some species, such as the model plant Arabidopsis thaliana, are relatively GC-poor and homogeneous while others, such as grasses, are highly heterogeneous and GC-rich. The causes of these variations are still debated: are they mainly due to selective or neutral processes? Answering to this question is important to correctly infer whether variations in base composition may have functional roles or not. We extended a population genetics method to jointly estimate the different forces that may affect synonymous variations and applied it to genomic datasets in 11 flowering plant species. We found that GC-biased gene conversion, a neutral process associated with recombination that mimics selection by favouring G and C bases, is a widespread and stronger process than selection and that it could explain the large variation in base composition observed in plant genomes. Our results bear implications for analysing plant genomes and for correctly interpreting what could be functional or not.
Collapse
|
57
|
Y chromosome palindromes and gene conversion. Hum Genet 2017; 136:605-619. [PMID: 28303348 DOI: 10.1007/s00439-017-1777-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2016] [Accepted: 03/07/2017] [Indexed: 02/02/2023]
Abstract
The presence of large and near-identical inverted repeat sequences (called palindromes) is a common feature of the constitutively haploid sex chromosomes of different species. Despite the fact palindromes originated in a non-recombining context, they have evolved a strong recombinational activity in the form of abundant arm-to-arm gene conversion. Their independent appearance in different species suggests they can have a profound biological significance that has yet to be fully clarified. It has been theorized that natural selection may have favored palindromic organization of male-specific genes and that the establishment of intra-palindrome gene conversion has strong adaptive significance. Arm-to-arm gene conversion allows the efficient removal of deleterious mutations, increases the fixation rate of beneficial mutations and has played an important role in modulating the equilibrium between gene loss and acquisition during Y chromosome evolution. Additionally, a palindromic organization of duplicates could favor the formation of unusual chromatin structures and could optimize the use of gene conversion as a mechanism to maintain the structural integrity of male-specific genes. In this review, we describe the structural features of palindromes on mammalian sex chromosomes and summarize different hypotheses regarding palindrome evolution and the functional benefits of arm-to-arm gene conversion on the unique haploid portion of the nuclear genome.
Collapse
|
58
|
Role of Ectopic Gene Conversion in the Evolution of a Candida krusei Pleiotropic Drug Resistance Transporter Family. Genetics 2017; 205:1619-1639. [PMID: 28159755 PMCID: PMC5378117 DOI: 10.1534/genetics.116.194811] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 01/31/2017] [Indexed: 11/18/2022] Open
Abstract
Gene duplications enable the evolution of novel gene function, but strong positive selection is required to preserve advantageous mutations in a population. This is because frequent ectopic gene conversions (EGCs) between highly similar, tandem-duplicated, sequences, can rapidly remove fate-determining mutations by replacing them with the neighboring parent gene sequences. Unfortunately, the high sequence similarities between tandem-duplicated genes severely hamper empirical studies of this important evolutionary process, because deciphering their correct sequences is challenging. In this study, we employed the eukaryotic model organism Saccharomyces cerevisiae to clone and functionally characterize all 30 alleles of an important pair of tandem-duplicated multidrug efflux pump genes, ABC1 and ABC11, from seven strains of the diploid pathogenic yeast Candida krusei Discovery and functional characterization of their closest ancestor, C. krusei ABC12, helped elucidate the evolutionary history of the entire gene family. Our data support the proposal that the pleiotropic drug resistance (PDR) transporters Abc1p and Abc11p have evolved by concerted evolution for ∼134 MY. While >90% of their sequences remained identical, very strong purifying selection protected six short DNA patches encoding just 18 core amino acid (aa) differences in particular trans membrane span (TMS) regions causing two distinct efflux pump functions. A proline-kink change at the bottom of Abc11p TMS3 was possibly fate determining. Our data also enabled the first empirical estimates for key parameters of eukaryotic gene evolution, they provided rare examples of intron loss, and PDR transporter phylogeny confirmed that C. krusei belongs to a novel, yet unnamed, third major Saccharomycotina lineage.
Collapse
|
59
|
Badouin H, Gladieux P, Gouzy J, Siguenza S, Aguileta G, Snirc A, Le Prieur S, Jeziorski C, Branca A, Giraud T. Widespread selective sweeps throughout the genome of model plant pathogenic fungi and identification of effector candidates. Mol Ecol 2017; 26:2041-2062. [DOI: 10.1111/mec.13976] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2016] [Revised: 12/15/2016] [Accepted: 12/19/2016] [Indexed: 12/11/2022]
Affiliation(s)
- H. Badouin
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| | - P. Gladieux
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- UMR BGPI; Campus International de Baillarguet; INRA; 34398 Montpellier France
| | - J. Gouzy
- Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR441; INRA; 31326 Castanet-Tolosan France
- Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR2594; CNRS; 31326 Castanet-Tolosan France
| | - S. Siguenza
- Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR441; INRA; 31326 Castanet-Tolosan France
- Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR2594; CNRS; 31326 Castanet-Tolosan France
| | - G. Aguileta
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| | - A. Snirc
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| | - S. Le Prieur
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| | - C. Jeziorski
- Genotoul; GeT-PlaGe; INRA Auzeville 31326 Castanet-Tolosan France
- UAR1209; INRA Auzeville 31326 Castanet-Tolosan France
| | - A. Branca
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| | - T. Giraud
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
| |
Collapse
|
60
|
Dong X, Wang X, Zhang F, Tian W. Genome-Wide Identification of Regulatory Sequences Undergoing Accelerated Evolution in the Human Genome. Mol Biol Evol 2016; 33:2565-75. [PMID: 27401230 PMCID: PMC5026254 DOI: 10.1093/molbev/msw128] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes.
Collapse
Affiliation(s)
- Xinran Dong
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, P.R. China
| | - Xiao Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, P.R. China
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, P.R. China
| | - Weidong Tian
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, P.R. China Children's Hospital of Fudan University, Shanghai, P.R. China
| |
Collapse
|
61
|
Choi JY, Aquadro CF. Recent and Long-Term Selection Across Synonymous Sites in Drosophila ananassae. J Mol Evol 2016; 83:50-60. [PMID: 27481397 DOI: 10.1007/s00239-016-9753-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2015] [Accepted: 07/23/2016] [Indexed: 11/28/2022]
Abstract
In Drosophila, many studies have examined the short- or long-term evolution occurring across synonymous sites. Few, however, have examined both the recent and long-term evolution to gain a complete view of this selection. Here we have analyzed Drosophila ananassae DNA polymorphism and divergence data using several different methods, and have identified evidence of positive selection favoring preferred codons in both recent and long-term evolutionary time scale. Further in D. ananassae, the strength of selection for preferred codons was stronger on the X chromosome compared to the autosomes. We show that this stronger selection is not due to higher gene expression of X-linked genes. Analysis of the selectively neutral introns indicated that the X chromosome also had a preference for GC over AT nucleotides, potentially from GC-biased gene conversions (gcBGCs) that can also affect the base composition of synonymous sites. Thus selection for preferred codons and gcBGC both seem to be partially responsible for shaping the D. ananassae synonymous site evolution.
Collapse
Affiliation(s)
- Jae Young Choi
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA.
| | - Charles F Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| |
Collapse
|
62
|
Korunes KL, Noor MAF. Gene conversion and linkage: effects on genome evolution and speciation. Mol Ecol 2016; 26:351-364. [DOI: 10.1111/mec.13736] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Revised: 06/07/2016] [Accepted: 06/22/2016] [Indexed: 12/12/2022]
|
63
|
Stevison LS, Woerner AE, Kidd JM, Kelley JL, Veeramah KR, McManus KF, Bustamante CD, Hammer MF, Wall JD. The Time Scale of Recombination Rate Evolution in Great Apes. Mol Biol Evol 2016; 33:928-45. [PMID: 26671457 PMCID: PMC5870646 DOI: 10.1093/molbev/msv331] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
We present three linkage-disequilibrium (LD)-based recombination maps generated using whole-genome sequence data from 10 Nigerian chimpanzees, 13 bonobos, and 15 western gorillas, collected as part of the Great Ape Genome Project (Prado-Martinez J, et al. 2013. Great ape genetic diversity and population history. Nature 499:471-475). We also identified species-specific recombination hotspots in each group using a modified LDhot framework, which greatly improves statistical power to detect hotspots at varying strengths. We show that fewer hotspots are shared among chimpanzee subspecies than within human populations, further narrowing the time scale of complete hotspot turnover. Further, using species-specific PRDM9 sequences to predict potential binding sites (PBS), we show higher predicted PRDM9 binding in recombination hotspots as compared to matched cold spot regions in multiple great ape species, including at least one chimpanzee subspecies. We found that correlations between broad-scale recombination rates decline more rapidly than nucleotide divergence between species. We also compared the skew of recombination rates at centromeres and telomeres between species and show a skew from chromosome means extending as far as 10-15 Mb from chromosome ends. Further, we examined broad-scale recombination rate changes near a translocation in gorillas and found minimal differences as compared to other great ape species perhaps because the coordinates relative to the chromosome ends were unaffected. Finally, on the basis of multiple linear regression analysis, we found that various correlates of recombination rate persist throughout the African great apes including repeats, diversity, and divergence. Our study is the first to analyze within- and between-species genome-wide recombination rate variation in several close relatives.
Collapse
Affiliation(s)
- Laurie S Stevison
- Institute for Human Genetics, University of California San Francisco Department of Biological Sciences, Auburn University
| | - August E Woerner
- Arizona Research Laboratories, Division of Biotechnology, University of Arizona Department of Genetics, University of Arizona
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan Department of Computational Medicine & Bioinformatics, University of Michigan
| | - Joanna L Kelley
- School of Biological Sciences, Washington State University Department of Genetics, Stanford University
| | - Krishna R Veeramah
- Arizona Research Laboratories, Division of Biotechnology, University of Arizona Department of Ecology and Evolution, Stony Brook University
| | - Kimberly F McManus
- Department of Biology, Stanford University Department of Biomedical Informatics, Stanford University
| | | | - Michael F Hammer
- Arizona Research Laboratories, Division of Biotechnology, University of Arizona Department of Ecology and Evolutionary Biology, University of Arizona Department of Anthropology, University of Arizona
| | - Jeffrey D Wall
- Institute for Human Genetics, University of California San Francisco Department of Epidemiology & Biostatistics, University of California San Francisco
| |
Collapse
|
64
|
Sundararajan A, Dukowic-Schulze S, Kwicklis M, Engstrom K, Garcia N, Oviedo OJ, Ramaraj T, Gonzales MD, He Y, Wang M, Sun Q, Pillardy J, Kianian SF, Pawlowski WP, Chen C, Mudge J. Gene Evolutionary Trajectories and GC Patterns Driven by Recombination in Zea mays. FRONTIERS IN PLANT SCIENCE 2016; 7:1433. [PMID: 27713757 PMCID: PMC5031598 DOI: 10.3389/fpls.2016.01433] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 09/08/2016] [Indexed: 05/20/2023]
Abstract
Recombination occurring during meiosis is critical for creating genetic variation and plays an essential role in plant evolution. In addition to creating novel gene combinations, recombination can affect genome structure through altering GC patterns. In maize (Zea mays) and other grasses, another intriguing GC pattern exists. Maize genes show a bimodal GC content distribution that has been attributed to nucleotide bias in the third, or wobble, position of the codon. Recombination may be an underlying driving force given that recombination sites are often associated with high GC content. Here we explore the relationship between recombination and genomic GC patterns by comparing GC gene content at each of the three codon positions (GC1, GC2, and GC3, collectively termed GCx) to instances of a variable GC-rich motif that underlies double strand break (DSB) hotspots and to meiocyte-specific gene expression. Surprisingly, GCx bimodality in maize cannot be fully explained by the codon wobble hypothesis. High GCx genes show a strong overlap with the DSB hotspot motif, possibly providing a mechanism for the high evolutionary rates seen in these genes. On the other hand, genes that are turned on in meiosis (early prophase I) are biased against both high GCx genes and genes with the DSB hotspot motif, possibly allowing important meiotic genes to avoid DSBs. Our data suggests a strong link between the GC-rich motif underlying DSB hotspots and high GCx genes.
Collapse
Affiliation(s)
| | | | | | | | - Nathan Garcia
- National Center for Genome Resources, Santa FeNM, USA
| | | | | | | | - Yan He
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
| | - Minghui Wang
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Qi Sun
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Jaroslaw Pillardy
- Biotechnology Resource Center Bioinformatics Facility, Cornell University, IthacaNY, USA
| | - Shahryar F. Kianian
- Cereal Disease Laboratory, United States Department of Agriculture – Agricultural Research Service, St. PaulMN, USA
| | - Wojciech P. Pawlowski
- Section of Plant Biology, School of Integrative Plant Science, Cornell University, IthacaNY, USA
| | - Changbin Chen
- Department of Horticultural Science, University of Minnesota, St. PaulMN, USA
| | - Joann Mudge
- National Center for Genome Resources, Santa FeNM, USA
- *Correspondence: Joann Mudge,
| |
Collapse
|
65
|
Du K, Yang L, He S. Phylogenomic analyses reveal a molecular signature linked to subterranean adaptation in rodents. BMC Evol Biol 2015; 15:287. [PMID: 26683336 PMCID: PMC4683706 DOI: 10.1186/s12862-015-0564-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 12/08/2015] [Indexed: 11/15/2022] Open
Abstract
Background Genome-wide signatures of convergent evolution are widely expected but rarely revealed in animals. Subterranean rodent genome and transcriptome data produced by next-generation sequencing facilitate the use of phylogenetic methods to infer non-synonymous and synonymous substitution rates within coding regions, which can reveal changes at the molecular level that are correlated with the dramatic shift from a terrestrial to subterranean habitat. Results Our study used previously sequenced genome or transcriptome data of two subterranean rodents, the blind mole rat and naked mole rat, and their terrestrial relatives, the mouse and guinea pig, to investigate the genetic basis of rodent subterranean adaptation. An analysis of 4996 orthologous genes revealed that the substitution pace of coding sequences was significantly slower in the blind mole rat than in the mouse, and slower in the naked mole rat than in the guinea pig. The dN/dS ratio was significantly higher in the blind mole rat than in the mouse and in the naked mole rat than in the guinea pig. These patterns are most likely related to the longer generation time and lower effective population size of subterranean rodents caused by subterranean ecological constraints. We also identified some genes and gene ontology (GO) categories that might be candidates for adaptation to subterranean life. Conclusions Our study reveals a case of subterranean convergent evolution in rodents that is correlated with change in the pace and mode of molecular evolution observed at the genome scale. We believe that this genomic signature could have also evolved in other cases of subterranean convergence. Additionally, the genes that displayed the most radical changes in their patterns of evolution and their associated GO categories provide a strong basis for further comparative and functional studies, and potentially reveal molecular signatures of adaptation to subterranean life. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0564-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kang Du
- Key Laboratory of Aquatic Biodiversity and Conservation of the Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei, 430072, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Liandong Yang
- Key Laboratory of Aquatic Biodiversity and Conservation of the Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei, 430072, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Shunping He
- Key Laboratory of Aquatic Biodiversity and Conservation of the Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei, 430072, China.
| |
Collapse
|
66
|
Bolívar P, Mugal CF, Nater A, Ellegren H. Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill-Robertson Interference, in an Avian System. Mol Biol Evol 2015; 33:216-27. [PMID: 26446902 PMCID: PMC4693978 DOI: 10.1093/molbev/msv214] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
Collapse
Affiliation(s)
- Paulina Bolívar
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Carina F Mugal
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Nater
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
67
|
The relationship of recombination rate, genome structure, and patterns of molecular evolution across angiosperms. BMC Evol Biol 2015; 15:194. [PMID: 26377000 PMCID: PMC4574184 DOI: 10.1186/s12862-015-0473-3] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 09/01/2015] [Indexed: 12/31/2022] Open
Abstract
Background Although homologous recombination affects the efficacy of selection in populations, the pattern of recombination rate evolution and its effects on genome evolution across plants are largely unknown. Recombination can reduce genome size by enabling the removal of LTR retrotransposons, alter codon usage by GC biased gene conversion, contribute to complex histories of gene duplication and loss through tandem duplication, and enhance purifying selection on genes. Therefore, variation in recombination rate across species may explain some of the variation in genomic architecture as well as rates of molecular evolution. We used phylogenetic comparative methods to investigate the evolution of global meiotic recombination rate in angiosperms and its effects on genome architecture and selection at the molecular level using genetic maps and genome sequences from thirty angiosperm species. Results Recombination rate is negatively correlated with genome size, which is likely caused by the removal of LTR retrotransposons. After correcting recombination rates for euchromatin content, we also found an association between global recombination rate and average gene family size. This suggests a role for recombination in the preservation of duplicate genes or expansion of gene families. An analysis of the correlation between the ratio of nonsynonymous to synonymous substitution rates (dN/dS) and recombination rate in 3748 genes indicates that higher recombination rates are associated with an increased efficacy of purifying selection, suggesting that global recombination rates affect variation in rates of molecular evolution across distantly related angiosperm species, not just between populations. We also identified shifts in dN/dS for recombination proteins that are associated with shifts in global recombination rate across our sample of angiosperms. Conclusions Although our analyses only reveal correlations, not mechanisms, and do not include potential covariates of recombination rate, like effective population size, they suggest that global recombination rates may play an important role in shaping the macroevolutionary patterns of gene and genome evolution in plants. Interspecific recombination rate variation is tightly correlated with genome size as well as variation in overall LTR retrotransposon abundances. Recombination may shape gene-to-gene variation in dN/dS between species, which might impact the overall gene duplication and loss rates. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0473-3) contains supplementary material, which is available to authorized users.
Collapse
|
68
|
Acosta S, Carela M, Garcia-Gonzalez A, Gines M, Vicens L, Cruet R, Massey SE. DNA Repair Is Associated with Information Content in Bacteria, Archaea, and DNA Viruses. J Hered 2015; 106:644-59. [PMID: 26320243 DOI: 10.1093/jhered/esv055] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 07/07/2015] [Indexed: 11/13/2022] Open
Abstract
The concept of a "proteomic constraint" proposes that DNA repair capacity is positively correlated with the information content of a genome, which can be approximated to the size of the proteome (P). This in turn implies that DNA repair genes are more likely to be present in genomes with larger values of P. This stands in contrast to the common assumption that informational genes have a core function and so are evenly distributed across organisms. We examined the presence/absence of 18 DNA repair genes in bacterial genomes. A positive relationship between gene presence and P was observed for 17 genes in the total dataset, and 16 genes when only nonintracellular bacteria were examined. A marked reduction of DNA repair genes was observed in intracellular bacteria, consistent with their reduced value of P. We also examined archaeal and DNA virus genomes, and show that the presence of DNA repair genes is likewise related to a larger value of P. In addition, the products of the bacterial genes mutY, vsr, and ndk, involved in the correction of GC/AT mutations, are strongly associated with reduced genome GC content. We therefore propose that a reduction in information content leads to a loss of DNA repair genes and indirectly to a reduction in genome GC content in bacteria by exposure to the underlying AT mutation bias. The reduction in P may also indirectly lead to the increase in substitution rates observed in intracellular bacteria via loss of DNA repair genes.
Collapse
Affiliation(s)
- Sharlene Acosta
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Miguelina Carela
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Aurian Garcia-Gonzalez
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Mariela Gines
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Luis Vicens
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Ricardo Cruet
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Steven E Massey
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey).
| |
Collapse
|
69
|
Wu CS, Chaw SM. Evolutionary Stasis in Cycad Plastomes and the First Case of Plastome GC-Biased Gene Conversion. Genome Biol Evol 2015; 7:2000-9. [PMID: 26116919 PMCID: PMC4524490 DOI: 10.1093/gbe/evv125] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/22/2015] [Indexed: 01/09/2023] Open
Abstract
In angiosperms, gene conversion has been known to reduce the mutational load of plastid genomes (the plastomes). Particularly, more frequent gene conversions in inverted repeat (IR) than in single copy (SC) regions result in contrasting substitution rates between these two regions. However, little has been known about the effect of gene conversion in the evolution of gymnosperm plastomes. Cycads (Cycadophyta) are the second largest gymnosperm group. Evolutionary study of their plastomes is limited to the basal cycad genus, Cycas. In this study, we addressed three questions. 1) Do the plastomes of other cycad genera evolve slowly as previously observed in the plastome of Cycas taitungensis? 2) Do substitution rates differ between their SC and IR regions? And 3) Does gene conversion occur in the cycad plastomes? If yes, is it AT-biased or GC-biased? Plastomes of eight species from other eight genera of cycads were sequenced. These plastomes are highly conserved in genome organization. Excluding ginkgo, cycad plastomes have significantly lower synonymous and nonsynonymous substitution rates than other gymnosperms, reflecting their evolutionary stasis in nucleotide mutations. In the IRs of cycad plastomes, the reduced substitution rates and GC-biased mutations are associated with a GC-biased gene conversion (gBGC) mechanism. Further investigations suggest that in cycads, gBGC is able to rectify plastome-wide mutations. Therefore, this study is the first to uncover the plastomic gBGC in seed plants. We also propose a gBGC model to interpret the dissimilar evolutionary patterns as well as the compositionally biased mutations in the SC and IR regions of cycad plastomes.
Collapse
Affiliation(s)
- Chung-Shien Wu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Shu-Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
70
|
Glémin S, Arndt PF, Messer PW, Petrov D, Galtier N, Duret L. Quantification of GC-biased gene conversion in the human genome. Genome Res 2015; 25:1215-28. [PMID: 25995268 PMCID: PMC4510005 DOI: 10.1101/gr.185488.114] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Accepted: 05/18/2015] [Indexed: 11/25/2022]
Abstract
Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC.
Collapse
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France; Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Peter F Arndt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Philipp W Messer
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
| | - Dmitri Petrov
- Department of Biology, Stanford University, Stanford, California 94305-5020, USA
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, 69622 Villeurbanne, France
| |
Collapse
|
71
|
Wu M, Moore RC. The Evolutionary Tempo of Sex Chromosome Degradation in Carica papaya. J Mol Evol 2015; 80:265-77. [PMID: 25987354 DOI: 10.1007/s00239-015-9680-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2014] [Accepted: 05/02/2015] [Indexed: 10/23/2022]
Abstract
Genes on non-recombining heterogametic sex chromosomes may degrade over time through the irreversible accumulation of deleterious mutations. In papaya, the non-recombining male-specific region of the Y (MSY) consists of two evolutionary strata corresponding to chromosomal inversions occurring approximately 7.0 and 1.9 MYA. The step-wise recombination suppression between the papaya X and Y allows for a temporal examination of the degeneration progress of the young Y chromosome. Comparative evolutionary analyses of 55 X/Y gene pairs showed that Y-linked genes have more unfavorable substitutions than X-linked genes. However, this asymmetric evolutionary pattern is confined to the oldest stratum, and is only observed when recently evolved pseudogenes are included in the analysis, indicating a slow degeneration tempo of the papaya Y chromosome. Population genetic analyses of coding sequence variation of six Y-linked focal loci in the oldest evolutionary stratum detected an excess of nonsynonymous polymorphism and reduced codon bias relative to autosomal loci. However, this pattern was also observed for corresponding X-linked loci. Both the MSY and its corresponding X-specific region are pericentromeric where recombination has been shown to be greatly reduced. Like the MSY region, overall selective efficacy on the X-specific region may be reduced due to the interference of selective forces between highly linked loci, or the Hill-Robertson effect, that is accentuated in regions of low or suppressed recombination. Thus, a pattern of gene decay on the X-specific region may be explained by relaxed purifying selection and widespread genetic hitchhiking due to its pericentromeric location.
Collapse
Affiliation(s)
- Meng Wu
- Botany Graduate Program, Department of Biology, Miami University, Oxford, OH, 45056, USA
| | | |
Collapse
|
72
|
Banerjee P, Chaube R, Joy KP. Molecular cloning, sequencing and tissue expression of vasotocin and isotocin precursor genes from Ostariophysian catfishes: phylogeny and evolutionary considerations in teleosts. Front Neurosci 2015; 9:166. [PMID: 26029040 PMCID: PMC4432659 DOI: 10.3389/fnins.2015.00166] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2014] [Accepted: 04/23/2015] [Indexed: 11/30/2022] Open
Abstract
Basic and neutral neurohypophyseal (NH) nonapeptides have evolved from vasotocin (VT) by a gene duplication at the base of the gnathostome lineage. In teleosts, VT and IT are the basic and neutral peptides, respectively. In the present study, VT and IT precursor genes of Heteropneustes fossilis and Clarias batrachus (Siluriformes, Ostariophysi) were cloned and sequenced. The channel catfish Icatalurus punctatus NH precursor sequences were obtained from EST database. The catfish NH sequences were used along with the available Acanthopterygii and other vertebrate NH precursor sequences to draw phylogenetic inference on the evolutionary history of the teleost NH peptides. Synteny analysis of the NH gene loci in various teleost species was done to complement the phylogenetic analysis. In H. fossilis, the NH transcripts were also sequenced from the ovary. The cloned genes and the deduced precursor proteins showed conserved characteristics of the NH nonapeptide precursors. The genes are expressed in brain and ovary (follicular envelope) of H. fossilis with higher transcript abundance in the brain. The addition of the catfish sequences in the phylogenetic analysis revealed that the VT and IT precursors of the species-rich superorders of teleosts have a distinct phylogenetic history with the Acanthopterygii VT and IT precursors sharing a less evolutionary distance and the Ostariophysi VT and IT having a greater evolutionary distance. The genomic location of VT and IT precursors, and synteny analysis of the NH loci lend support to the phylogenetic inference and suggest a footprint of fish- specific whole genome duplication (3R) and subsequent diploidization in the NH loci. The VT and IT precursor genes are most likely lineage-specific paralogs resulting from differential losses of the 3R NH paralogs in the two superorders. The independent yet consistent retention of VT and IT in the two superorders might be directed by a stringent ligand-receptor selectivity.
Collapse
Affiliation(s)
- Putul Banerjee
- Department of Zoology, Centre of Advanced Study, Banaras Hindu UniversityVaranasi, India
| | - Radha Chaube
- Zoology Department, Mahila Mahavidhylaya, Banaras Hindu UniversityVaranasi, India
| | - Keerikkattil P. Joy
- Department of Zoology, Centre of Advanced Study, Banaras Hindu UniversityVaranasi, India
- *Correspondence: Keerikkattil P. Joy, Department of Zoology, Centre of Advanced Study, Faculty of Science, Banaras Hindu University, Varanasi 221005, India
| |
Collapse
|
73
|
Burgarella C, Gayral P, Ballenghien M, Bernard A, David P, Jarne P, Correa A, Hurtrez-Boussès S, Escobar J, Galtier N, Glémin S. Molecular Evolution of Freshwater Snails with Contrasting Mating Systems. Mol Biol Evol 2015; 32:2403-16. [PMID: 25980005 DOI: 10.1093/molbev/msv121] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Because mating systems affect population genetics and ecology, they are expected to impact the molecular evolution of species. Self-fertilizing species experience reduced effective population size, recombination rates, and heterozygosity, which in turn should decrease the efficacy of natural selection, both adaptive and purifying, and the strength of meiotic drive processes such as GC-biased gene conversion. The empirical evidence is only partly congruent with these predictions, depending on the analyzed species, some, but not all, of the expected effects have been observed. One possible reason is that self-fertilization is an evolutionary dead-end, so that most current selfers recently evolved self-fertilization, and their genome has not yet been strongly impacted by selfing. Here, we investigate the molecular evolution of two groups of freshwater snails in which mating systems have likely been stable for several millions of years. Analyzing coding sequence polymorphism, divergence, and expression levels, we report a strongly reduced genetic diversity, decreased efficacy of purifying selection, slower rate of adaptive evolution, and weakened codon usage bias/GC-biased gene conversion in the selfer Galba compared with the outcrosser Physa, in full agreement with theoretical expectations. Our results demonstrate that self-fertilization, when effective in the long run, is a major driver of population genomic and molecular evolutionary processes. Despite the genomic effects of selfing, Galba truncatula seems to escape the demographic consequences of the genetic load. We suggest that the particular ecology of the species may buffer the negative consequences of selfing, shedding new light on the dead-end hypothesis.
Collapse
Affiliation(s)
- Concetta Burgarella
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Philippe Gayral
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Marion Ballenghien
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Aurélien Bernard
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | | | | | - Ana Correa
- MIVEGEC (Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution, Contrôle), UMR (UM1-UM2-CNRS 5290-IRD224), IRD, Montpellier, France
| | - Sylvie Hurtrez-Boussès
- MIVEGEC (Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution, Contrôle), UMR (UM1-UM2-CNRS 5290-IRD224), IRD, Montpellier, France
| | - Juan Escobar
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution, UMR, CNRS 5554, Université Montpellier II, Montpellier, France
| |
Collapse
|
74
|
Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution. Genetics 2015; 200:873-90. [PMID: 25948563 DOI: 10.1534/genetics.115.177386] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 04/28/2015] [Indexed: 01/07/2023] Open
Abstract
Inference of gene sequences in ancestral species has been widely used to test hypotheses concerning the process of molecular sequence evolution. However, the approach may produce spurious results, mainly because using the single best reconstruction while ignoring the suboptimal ones creates systematic biases. Here we implement methods to correct for such biases and use computer simulation to evaluate their performance when the substitution process is nonstationary. The methods we evaluated include parsimony and likelihood using the single best reconstruction (SBR), averaging over reconstructions weighted by the posterior probabilities (AWP), and a new method called expected Markov counting (EMC) that produces maximum-likelihood estimates of substitution counts for any branch under a nonstationary Markov model. We simulated base composition evolution on a phylogeny for six species, with different selective pressures on G+C content among lineages, and compared the counts of nucleotide substitutions recorded during simulation with the inference by different methods. We found that large systematic biases resulted from (i) the use of parsimony or likelihood with SBR, (ii) the use of a stationary model when the substitution process is nonstationary, and (iii) the use of the Hasegawa-Kishino-Yano (HKY) model, which is too simple to adequately describe the substitution process. The nonstationary general time reversible (GTR) model, used with AWP or EMC, accurately recovered the substitution counts, even in cases of complex parameter fluctuations. We discuss model complexity and the compromise between bias and variance and suggest that the new methods may be useful for studying complex patterns of nucleotide substitution in large genomic data sets.
Collapse
|
75
|
Clément Y, Fustier MA, Nabholz B, Glémin S. The bimodal distribution of genic GC content is ancestral to monocot species. Genome Biol Evol 2014; 7:336-48. [PMID: 25527839 PMCID: PMC4316631 DOI: 10.1093/gbe/evu278] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
In grasses such as rice or maize, the distribution of genic GC content is well known to be bimodal. It is mainly driven by GC content at third codon positions (GC3 for short). This feature is thought to be specific to grasses as closely related species like banana have a unimodal GC3 distribution. GC3 is associated with numerous genomics features and uncovering the origin of this peculiar distribution will help understanding the potential roles and consequences of GC3 variations within and between genomes. Until recently, the origin of the peculiar GC3 distribution in grasses has remained unknown. Thanks to the recent publication of several complete genomes and transcriptomes of nongrass monocots, we studied more than 1,000 groups of one-to-one orthologous genes in seven grasses and three outgroup species (banana, palm tree, and yam). Using a maximum likelihood-based method, we reconstructed GC3 at several ancestral nodes. We found that the bimodal GC3 distribution observed in extant grasses is ancestral to both grasses and most monocot species, and that other species studied here have lost this peculiar structure. We also found that GC3 in grass lineages is globally evolving very slowly and that the decreasing GC3 gradient observed from 5′ to 3′ along coding sequences is also conserved and ancestral to monocots. This result strongly challenges the previous views on the specificity of grass genomes and we discuss its implications for the possible causes of the evolution of GC content in monocots.
Collapse
Affiliation(s)
- Yves Clément
- Montpellier SupAgro, Unité Mixte de Recherche 1334, Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, Montpellier, France Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | | | - Benoit Nabholz
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| |
Collapse
|
76
|
Figuet E, Ballenghien M, Romiguier J, Galtier N. Biased gene conversion and GC-content evolution in the coding sequences of reptiles and vertebrates. Genome Biol Evol 2014; 7:240-50. [PMID: 25527834 PMCID: PMC4316630 DOI: 10.1093/gbe/evu277] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins.
Collapse
Affiliation(s)
- Emeric Figuet
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| | - Marion Ballenghien
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| | - Jonathan Romiguier
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France Department of Ecology and Evolution, Biophore, University of Lausanne, Switzerland
| | - Nicolas Galtier
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| |
Collapse
|
77
|
Sloan DB, Wu Z. History of plastid DNA insertions reveals weak deletion and at mutation biases in angiosperm mitochondrial genomes. Genome Biol Evol 2014; 6:3210-21. [PMID: 25416619 PMCID: PMC4986453 DOI: 10.1093/gbe/evu253] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Angiosperm mitochondrial genomes exhibit many unusual properties, including heterogeneous nucleotide composition and exceptionally large and variable genome sizes. Determining the role of nonadaptive mechanisms such as mutation bias in shaping the molecular evolution of these unique genomes has proven challenging because their dynamic structures generally prevent identification of homologous intergenic sequences for comparative analyses. Here, we report an analysis of angiosperm mitochondrial DNA sequences that are derived from inserted plastid DNA (mtpts). The availability of numerous completely sequenced plastid genomes allows us to infer the evolutionary history of these insertions, including the specific nucleotide substitutions and indels that have occurred because their incorporation into the mitochondrial genome. Our analysis confirmed that many mtpts have a complex history, including frequent gene conversion and multiple examples of horizontal transfer between divergent angiosperm lineages. Nevertheless, it is clear that the majority of extant mtpt sequence in angiosperms is the product of recent transfer (or gene conversion) and is subject to rapid loss/deterioration, suggesting that most mtpts are evolving relatively free from functional constraint. The evolution of mtpt sequences reveals a pattern of biased mutational input in angiosperm mitochondrial genomes, including an excess of small deletions over insertions and a skew toward nucleotide substitutions that increase AT content. However, these mutation biases are far weaker than have been observed in many other cellular genomes, providing insight into some of the notable features of angiosperm mitochondrial architecture, including the retention of large intergenic regions and the relatively neutral GC content found in these regions.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins
| | - Zhiqiang Wu
- Department of Biology, Colorado State University, Fort Collins
| |
Collapse
|
78
|
Babbitt GA, Alawad MA, Schulze KV, Hudson AO. Synonymous codon bias and functional constraint on GC3-related DNA backbone dynamics in the prokaryotic nucleoid. Nucleic Acids Res 2014; 42:10915-26. [PMID: 25200075 PMCID: PMC4176184 DOI: 10.1093/nar/gku811] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
While mRNA stability has been demonstrated to control rates of translation, generating both global and local synonymous codon biases in many unicellular organisms, this explanation cannot adequately explain why codon bias strongly tracks neighboring intergene GC content; suggesting that structural dynamics of DNA might also influence codon choice. Because minor groove width is highly governed by 3-base periodicity in GC, the existence of triplet-based codons might imply a functional role for the optimization of local DNA molecular dynamics via GC content at synonymous sites (≈GC3). We confirm a strong association between GC3-related intrinsic DNA flexibility and codon bias across 24 different prokaryotic multiple whole-genome alignments. We develop a novel test of natural selection targeting synonymous sites and demonstrate that GC3-related DNA backbone dynamics have been subject to moderate selective pressure, perhaps contributing to our observation that many genes possess extreme DNA backbone dynamics for their given protein space. This dual function of codons may impose universal functional constraints affecting the evolution of synonymous and non-synonymous sites. We propose that synonymous sites may have evolved as an 'accessory' during an early expansion of a primordial genetic code, allowing for multiplexed protein coding and structural dynamic information within the same molecular context.
Collapse
Affiliation(s)
- Gregory A Babbitt
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| | - Mohammed A Alawad
- B. Thomas Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| | - Katharina V Schulze
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX, USA 77030
| | - André O Hudson
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| |
Collapse
|
79
|
Zaghloul L, Drillon G, Boulos RE, Argoul F, Thermes C, Arneodo A, Audit B. Large replication skew domains delimit GC-poor gene deserts in human. Comput Biol Chem 2014; 53 Pt A:153-65. [PMID: 25224847 DOI: 10.1016/j.compbiolchem.2014.08.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2014] [Indexed: 01/25/2023]
Abstract
Besides their large-scale organization in isochores, mammalian genomes display megabase-sized regions, spanning both genes and intergenes, where the strand nucleotide composition asymmetry decreases linearly, possibly due to replication activity. These so-called skew-N domains cover about a third of the human genome and are bordered by two skew upward jumps that were hypothesized to compose a subset of "master" replication origins active in the germline. Skew-N domains were shown to exhibit a particular gene organization. Genes with CpG-rich promoters likely expressed in the germline are over represented near the master replication origins, with large genes being co-oriented with replication fork progression, which suggests some coordination of replication and transcription. In this study, we describe another skew structure that covers ∼13% of the human genome and that is bordered by putative master replication origins similar to the ones flanking skew-N domains. These skew-split-N domains have a shape reminiscent of a N, but split in half, leaving in the center a region of null skew whose length increases with domain size. These central regions (median size ∼860 kb) have a homogeneous composition, i.e. both a null and constant skew and a constant and low GC content. They correspond to heterochromatin gene deserts found in low-GC isochores with an average gene density of 0.81 promoters/Mb as compared to 7.73 promoters/Mb genome wide. The analysis of epigenetic marks and replication timing data confirms that, in these late replicating heterochomatic regions, the initiation of replication is likely to be random. This contrasts with the transcriptionally active euchromatin state found around the bordering well positioned master replication origins. Altogether skew-N domains and skew-split-N domains cover about 50% of the human genome.
Collapse
Affiliation(s)
- Lamia Zaghloul
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Guénola Drillon
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Rasha E Boulos
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Françoise Argoul
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Claude Thermes
- Centre de Génétique Moléculaire, CNRS UPR 3404, Gif-sur-Yvette, France
| | - Alain Arneodo
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Benjamin Audit
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR 5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France.
| |
Collapse
|
80
|
Murat F, Zhang R, Guizard S, Flores R, Armero A, Pont C, Steinbach D, Quesneville H, Cooke R, Salse J. Shared subgenome dominance following polyploidization explains grass genome evolutionary plasticity from a seven protochromosome ancestor with 16K protogenes. Genome Biol Evol 2014; 6:12-33. [PMID: 24317974 PMCID: PMC3914691 DOI: 10.1093/gbe/evt200] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Modern plant genomes are diploidized paleopolyploids. We revisited grass genome paleohistory in response to the diploidization process through a detailed investigation of the evolutionary fate of duplicated blocks. Ancestrally duplicated genes can be conserved, deleted, and shuffled, defining dominant (bias toward duplicate retention) and sensitive (bias toward duplicate erosion) chromosomal fragments. We propose a new grass genome paleohistory deriving from an ancestral karyotype structured in seven protochromosomes containing 16,464 protogenes and following evolutionary rules where 1) ancestral shared polyploidizations shaped conserved dominant (D) and sensitive (S) subgenomes, 2) subgenome dominance is revealed by both gene deletion and shuffling from the S blocks, 3) duplicate deletion/movement may have been mediated by single-/double-stranded illegitimate recombination mechanisms, 4) modern genomes arose through centromeric fusion of protochromosomes, leading to functional monocentric neochromosomes, 5) the fusion of two dominant blocks leads to supradominant neochromosomes (D + D = D) with higher ancestral gene retention compared with D + S = D (i.e., fusion of blocks with opposite sensitivity) or even S + S = S (i.e., fusion of two sensitive ancestral blocks). A new user-friendly online tool named "PlantSyntenyViewer," available at http://urgi.versailles.inra.fr/synteny-cereal, presents the refined comparative genomics data.
Collapse
Affiliation(s)
- Florent Murat
- INRA/UBP UMR 1095 GDEC (Génétique, Diversité et Ecophysiologie des Céréales), Clermont Ferrand, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
81
|
Hinch AG, Altemose N, Noor N, Donnelly P, Myers SR. Recombination in the human Pseudoautosomal region PAR1. PLoS Genet 2014; 10:e1004503. [PMID: 25033397 PMCID: PMC4102438 DOI: 10.1371/journal.pgen.1004503] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 05/27/2014] [Indexed: 12/19/2022] Open
Abstract
The pseudoautosomal region (PAR) is a short region of homology between the mammalian X and Y chromosomes, which has undergone rapid evolution. A crossover in the PAR is essential for the proper disjunction of X and Y chromosomes in male meiosis, and PAR deletion results in male sterility. This leads the human PAR with the obligatory crossover, PAR1, to having an exceptionally high male crossover rate, which is 17-fold higher than the genome-wide average. However, the mechanism by which this obligatory crossover occurs remains unknown, as does the fine-scale positioning of crossovers across this region. Recent research in mice has suggested that crossovers in PAR may be mediated independently of the protein PRDM9, which localises virtually all crossovers in the autosomes. To investigate recombination in this region, we construct the most fine-scale genetic map containing directly observed crossovers to date using African-American pedigrees. We leverage recombination rates inferred from the breakdown of linkage disequilibrium in human populations and investigate the signatures of DNA evolution due to recombination. Further, we identify direct PRDM9 binding sites using ChIP-seq in human cells. Using these independent lines of evidence, we show that, in contrast with mouse, PRDM9 does localise peaks of recombination in the human PAR1. We find that recombination is a far more rapid and intense driver of sequence evolution in PAR1 than it is on the autosomes. We also show that PAR1 hotspot activities differ significantly among human populations. Finally, we find evidence that PAR1 hotspot positions have changed between human and chimpanzee, with no evidence of sharing among the hottest hotspots. We anticipate that the genetic maps built and validated in this work will aid research on this vital and fascinating region of the genome.
Collapse
Affiliation(s)
- Anjali G. Hinch
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Nicolas Altemose
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Nudrat Noor
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Peter Donnelly
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Simon R. Myers
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| |
Collapse
|
82
|
Glémin S, Clément Y, David J, Ressayre A. GC content evolution in coding regions of angiosperm genomes: a unifying hypothesis. Trends Genet 2014; 30:263-70. [PMID: 24916172 DOI: 10.1016/j.tig.2014.05.002] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2014] [Revised: 05/09/2014] [Accepted: 05/13/2014] [Indexed: 01/06/2023]
Abstract
In angiosperms (as in other species), GC content varies along and between genes, within a genome, and between genomes of different species, but the reason for this distribution is still an open question. Grass genomes are particularly intriguing because they exhibit a strong bimodal distribution of genic GC content and a sharp 5'-3' decreasing GC content gradient along most genes. Here, we propose a unifying model to explain the main patterns of GC content variation at the gene and genome scale. We argue that GC content patterns could be mainly determined by the interactions between gene structure, recombination patterns, and GC-biased gene conversion. Recent studies on fine-scale recombination maps in angiosperms support this hypothesis and previous results also fit this model. We propose that our model could be used as a null hypothesis to search for additional forces that affect GC content in angiosperms.
Collapse
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, UMR 5554 CNRS, Université Montpellier 2, F-34095 Montpellier, France.
| | - Yves Clément
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, UMR 5554 CNRS, Université Montpellier 2, F-34095 Montpellier, France; Montpellier SupAgro, Unité Mixte de Recherche 1334 Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34398 Montpellier, France
| | - Jacques David
- Montpellier SupAgro, Unité Mixte de Recherche 1334 Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34398 Montpellier, France
| | - Adrienne Ressayre
- INRA, UMR de Génétique Végétale, INRA/CNRS/Univ Paris-Sud/AgroParistech, Ferme du Moulon, F-91190 Gif sur Yvette, France
| |
Collapse
|
83
|
Castric V, Billiard S, Vekemans X. Trait transitions in explicit ecological and genomic contexts: plant mating systems as case studies. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 781:7-36. [PMID: 24277293 DOI: 10.1007/978-94-007-7347-9_2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Plants are astonishingly diverse in how they reproduce sexually, and the study of plant mating systems provides some of the most compelling cases of parallel and independent evolutionary transitions. In this chapter, we review how the massive amount of genomic data being produced is allowing long-standing predictions from ecological and evolutionary theory to be put to test. After a review of theoretical predictions about the importance of considering the genomic architecture of the mating system, we focus on a set of recent discoveries on how the mating system is controlled in a variety of model and non-model species. In parallel, genomic approaches have revealed the complex interaction between the evolution of genes controlling mating systems and genome evolution, both genome-wide and in the mating system control region. In several cases, major transitions in the mating system can be clearly associated with important ecological changes, hence illuminating an important interplay between ecological and genomic approaches. We also list a number of major unsolved questions that remain for the field, and highlight foreseeable conceptual developments that are likely to play a major role in our understanding of how plant mating systems evolve in Nature.
Collapse
Affiliation(s)
- Vincent Castric
- Laboratoire de Génétique et Evolution des Populations Végétales (GEPV), UMR 8198; CNRS, Université Lille 1, Sciences et Technologies, Cité Scientifique, Villeneuve d'Ascq, France,
| | | | | |
Collapse
|
84
|
Gotea V, Elnitski L. Ascertaining regions affected by GC-biased gene conversion through weak-to-strong mutational hotspots. Genomics 2014; 103:349-56. [PMID: 24727706 DOI: 10.1016/j.ygeno.2014.04.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Revised: 03/31/2014] [Accepted: 04/04/2014] [Indexed: 12/24/2022]
Abstract
A major objective for evolutionary biology is to identify regions affected by positive selection. High dN/dS values for proteins and accelerated lineage-specific substitution rates for non-coding regions are considered classic signatures of positive selection. However, these could also be the result of non-adaptive phenomena, such as GC-biased gene conversion (gBGC), which favors the fixation of strong (C/G) over weak (A/T) nucleotides. Recent estimates indicate that gBGC affected up to 20% of regions with signatures of positive selection. Here we evaluate the impact of gBGC through its molecular signature of weak-to-strong mutational hotspots. We implemented specific modifications to the test proposed by Tang and Lewontin (1999) for identifying regions of differential variability and applied it to regions previously investigated for the influence of gBGC. While we found significant agreement with previous reports, our results suggest a smaller influence of gBGC than previously estimated, warranting further development of methods for its detection.
Collapse
Affiliation(s)
- Valer Gotea
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Laura Elnitski
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
85
|
Opposing forces of A/T-biased mutations and G/C-biased gene conversions shape the genome of the nematode Pristionchus pacificus. Genetics 2014; 196:1145-52. [PMID: 24414549 DOI: 10.1534/genetics.113.159863] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Base substitution mutations are a major source of genetic novelty and mutation accumulation line (MAL) studies revealed a nearly universal AT bias in de novo mutation spectra. While a comparison of de novo mutation spectra with the actual nucleotide composition in the genome suggests the existence of general counterbalancing mechanisms, little is known about the evolutionary and historical details of these opposing forces. Here, we correlate MAL-derived mutation spectra with patterns observed from population resequencing. Variation observed in natural populations has already been subject to evolutionary forces. Distinction between rare and common alleles, the latter of which are close to fixation and of presumably older age, can provide insight into mutational processes and their influence on genome evolution. We provide a genome-wide analysis of de novo mutations in 22 MALs of the nematode Pristionchus pacificus and compare the spectra with natural variants observed in resequencing of 104 natural isolates. MALs show an AT bias of 5.3, one of the highest values observed to date. In contrast, the AT bias in natural variants is much lower. Specifically, rare derived alleles show an AT bias of 2.4, whereas common derived alleles close to fixation show no AT bias at all. These results indicate the existence of a strong opposing force and they suggest that the GC content of the P. pacificus genome is in equilibrium. We discuss GC-biased gene conversion as a potential mechanism acting against AT-biased mutations. This study provides insight into genome evolution by combining MAL studies with natural variation.
Collapse
|
86
|
Lee YCG, Langley CH, Begun DJ. Differential strengths of positive selection revealed by hitchhiking effects at small physical scales in Drosophila melanogaster. Mol Biol Evol 2013; 31:804-16. [PMID: 24361994 DOI: 10.1093/molbev/mst270] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The long time scale of adaptive evolution makes it difficult to directly observe the spread of most beneficial mutations through natural populations. Therefore, inferring attributes of beneficial mutations by studying the genomic signals left by directional selection is an important component of population genetics research. One kind of signal is a trough in nearby neutral genetic variation due to selective fixation of initially rare alleles, a phenomenon known as "genetic hitchhiking." Accumulated evidence suggests that a considerable fraction of substitutions in the Drosophila genome results from positive selection, most of which are expected to have small selection coefficients and influence the population genetics of sites in the immediate vicinity. Using Drosophila melanogaster population genomic data, we found that the heterogeneity in synonymous polymorphism surrounding different categories of coding fixations is readily observable even within 25 bp of focal substitutions, which we interpret as the result of small-scale hitchhiking effects. The strength of natural selection on different sites appears to be quite heterogeneous. Particularly, neighboring fixations that changed amino acid polarities in a way that maintained the overall polarities of a protein were under stronger selection than other categories of fixations. Interestingly, we found that substitutions in slow-evolving genes are associated with stronger hitchhiking effects. This is consistent with the idea that adaptive evolution may involve few substitutions with large effects or many substitutions with small effects. Because our approach only weakly depends on the numbers of recent nonsynonymous substitutions, it can provide a complimentary view to the adaptive evolution inferred by other divergence-based evolutionary genetic methods.
Collapse
Affiliation(s)
- Yuh Chwen G Lee
- Department of Evolution and Ecology and Center for Population Biology, University of California, Davis
| | | | | |
Collapse
|
87
|
Yruela I, Contreras-Moreira B. Genetic recombination is associated with intrinsic disorder in plant proteomes. BMC Genomics 2013; 14:772. [PMID: 24206529 PMCID: PMC3828576 DOI: 10.1186/1471-2164-14-772] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Accepted: 10/31/2013] [Indexed: 11/18/2022] Open
Abstract
Background Intrinsically disordered proteins, found in all living organisms, are essential for basic cellular functions and complement the function of ordered proteins. It has been shown that protein disorder is linked to the G + C content of the genome. Furthermore, recent investigations have suggested that the evolutionary dynamics of the plant nucleus adds disordered segments to open reading frames alike, and these segments are not necessarily conserved among orthologous genes. Results In the present work the distribution of intrinsically disordered proteins along the chromosomes of several representative plants was analyzed. The reported results support a non-random distribution of disordered proteins along the chromosomes of Arabidopsis thaliana and Oryza sativa, two model eudicot and monocot plant species, respectively. In fact, for most chromosomes positive correlations between the frequency of disordered segments of 30+ amino acids and both recombination rates and G + C content were observed. Conclusions These analyses demonstrate that the presence of disordered segments among plant proteins is associated with the rates of genetic recombination of their encoding genes. Altogether, these findings suggest that high recombination rates, as well as chromosomal rearrangements, could induce disordered segments in proteins during evolution.
Collapse
Affiliation(s)
- Inmaculada Yruela
- Estación Experimental de Aula Dei, Consejo Superior de Investigaciones Científicas (EEAD-CSIC), Avda, Montañana, 1005, Zaragoza 50059, Spain.
| | | |
Collapse
|
88
|
Robinson MC, Stone EA, Singh ND. Population genomic analysis reveals no evidence for GC-biased gene conversion in Drosophila melanogaster. Mol Biol Evol 2013; 31:425-33. [PMID: 24214536 DOI: 10.1093/molbev/mst220] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Gene conversion is the nonreciprocal exchange of genetic material between homologous chromosomes. Multiple lines of evidence from a variety of taxa strongly suggest that gene conversion events are biased toward GC-bearing alleles. However, in Drosophila, the data have largely been indirect and unclear, with some studies supporting the predictions of a GC-biased gene conversion model and other data showing contradictory findings. Here, we test whether gene conversion events are GC-biased in Drosophila melanogaster using whole-genome polymorphism and divergence data. Our results provide no support for GC-biased gene conversion and thus suggest that this process is unlikely to significantly contribute to patterns of polymorphism and divergence in this system.
Collapse
Affiliation(s)
- Matthew C Robinson
- Department of Biological Sciences, Program in Genetics, North Carolina State University
| | | | | |
Collapse
|
89
|
Clément Y, Arndt PF. Meiotic recombination strongly influences GC-content evolution in short regions in the mouse genome. Mol Biol Evol 2013; 30:2612-8. [PMID: 24030552 DOI: 10.1093/molbev/mst154] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Meiotic recombination is known to influence GC-content evolution in large regions of mammalian genomes by favoring the fixation of G and C alleles and increasing the rate of A/T to G/C substitutions. This process is known as GC-biased gene conversion (gBGC). Until recently, genome-wide measures of fine-scale recombination activity were unavailable in mice. Additionally, comparative studies focusing on mouse were limited as the closest organism with its genome fully sequenced was rat. Here, we make use of the recent mapping of double strand breaks (DSBs), the first step of meiotic recombination, in the mouse genome and of the sequencing of mouse closely related subspecies to analyze the fine-scale evolutionary signature of meiotic recombination on GC-content evolution in recombination hotspots, short regions that undergo extreme rates of recombination. We measure substitution rates around DSB hotspots and observe that gBGC is affecting a very short region (≈ 1 kbp) in length around these hotspots. Furthermore, we can infer that the locations of hotspots evolved rapidly during mouse evolution.
Collapse
Affiliation(s)
- Yves Clément
- Montpellier SupAgro, Unité Mixte de Recherche 1334, Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, Montpellier, France
| | | |
Collapse
|
90
|
Capra JA, Hubisz MJ, Kostka D, Pollard KS, Siepel A. A model-based analysis of GC-biased gene conversion in the human and chimpanzee genomes. PLoS Genet 2013; 9:e1003684. [PMID: 23966869 PMCID: PMC3744432 DOI: 10.1371/journal.pgen.1003684] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2013] [Accepted: 06/14/2013] [Indexed: 01/03/2023] Open
Abstract
GC-biased gene conversion (gBGC) is a recombination-associated process that favors the fixation of G/C alleles over A/T alleles. In mammals, gBGC is hypothesized to contribute to variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations, but its prevalence and general functional consequences remain poorly understood. gBGC is difficult to incorporate into models of molecular evolution and so far has primarily been studied using summary statistics from genomic comparisons. Here, we introduce a new probabilistic model that captures the joint effects of natural selection and gBGC on nucleotide substitution patterns, while allowing for correlations along the genome in these effects. We implemented our model in a computer program, called phastBias, that can accurately detect gBGC tracts about 1 kilobase or longer in simulated sequence alignments. When applied to real primate genome sequences, phastBias predicts gBGC tracts that cover roughly 0.3% of the human and chimpanzee genomes and account for 1.2% of human-chimpanzee nucleotide differences. These tracts fall in clusters, particularly in subtelomeric regions; they are enriched for recombination hotspots and fast-evolving sequences; and they display an ongoing fixation preference for G and C alleles. They are also significantly enriched for disease-associated polymorphisms, suggesting that they contribute to the fixation of deleterious alleles. The gBGC tracts provide a unique window into historical recombination processes along the human and chimpanzee lineages. They supply additional evidence of long-term conservation of megabase-scale recombination rates accompanied by rapid turnover of hotspots. Together, these findings shed new light on the evolutionary, functional, and disease implications of gBGC. The phastBias program and our predicted tracts are freely available. Interpreting patterns of DNA sequence variation in the genomes of closely related species is critically important for understanding the causes and functional effects of nucleotide substitutions. Classical models describe patterns of substitution in terms of the fundamental forces of mutation, recombination, neutral drift, and natural selection. However, an entirely separate force, called GC-biased gene conversion (gBGC), also appears to have an important influence on substitution patterns in many species. gBGC is a recombination-associated evolutionary process that favors the fixation of strong (G/C) over weak (A/T) alleles. In mammals, gBGC is thought to promote variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations. However, its genome-wide influence remains poorly understood, in part because, it is difficult to incorporate gBGC into statistical models of evolution. In this paper, we describe a new evolutionary model that jointly describes the effects of selection and gBGC and apply it to the human and chimpanzee genomes. Our genome-wide predictions of gBGC tracts indicate that gBGC has been an important force in recent human evolution. Our publicly available computer program, called phastBias, and our genome-wide predictions will enable other researchers to consider gBGC in their analyses.
Collapse
Affiliation(s)
- John A. Capra
- Gladstone Institutes, University of California, San Francisco, California, United States of America
| | - Melissa J. Hubisz
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Dennis Kostka
- Department of Developmental Biology and Computational & Systems Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Katherine S. Pollard
- Gladstone Institutes, University of California, San Francisco, California, United States of America
- Institute for Human Genetics and Division of Biostatistics, University of California, San Francisco, California, United States of America
- * E-mail: (KSP); (AS)
| | - Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- * E-mail: (KSP); (AS)
| |
Collapse
|
91
|
Gioti A, Stajich JE, Johannesson H. Neurospora and the dead-end hypothesis: genomic consequences of selfing in the model genus. Evolution 2013; 67:3600-16. [PMID: 24299411 DOI: 10.1111/evo.12206] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2012] [Accepted: 06/24/2013] [Indexed: 12/24/2022]
Abstract
It is becoming increasingly evident that adoption of different reproductive strategies, such as sexual selfing and asexuality, greatly impacts genome evolution. In this study, we test theoretical predictions on genomic maladaptation of selfing lineages using empirical data from the model fungus Neurospora. We sequenced the genomes of four species representing distinct transitions to selfing within the history of the genus, as well as the transcriptome of one of these, and compared with available data from three outcrossing species. Our results provide evidence for a relaxation of purifying selection in protein-coding genes and for a reduced efficiency of transposable element silencing by Repeat Induced Point mutation. A reduction in adaptive evolution was also identified in the form of reduced codon usage bias in highly expressed genes of selfing Neurospora, but this result may be confounded by mutational bias. Potentially counteracting these negative effects, the nucleotide substitution rate and the spread of transposons is reduced in selfing species. We suggest that differences in substitution rate relate to the absence, in selfing Neurospora, of the asexual pathway producing conidia. Our results support the dead-end theory and show that Neurospora genomes bear signatures of both sexual and asexual reproductive mode.
Collapse
Affiliation(s)
- Anastasia Gioti
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, 752 36 Uppsala, Sweden.
| | | | | |
Collapse
|
92
|
Hallast P, Balaresque P, Bowden GR, Ballereau S, Jobling MA. Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions. PLoS Genet 2013; 9:e1003666. [PMID: 23935520 PMCID: PMC3723533 DOI: 10.1371/journal.pgen.1003666] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 06/07/2013] [Indexed: 11/19/2022] Open
Abstract
The male-specific region of the human Y chromosome (MSY) includes eight large inverted repeats (palindromes) in which arm-to-arm similarity exceeds 99.9%, due to gene conversion activity. Here, we studied one of these palindromes, P6, in order to illuminate the dynamics of the gene conversion process. We genotyped ten paralogous sequence variants (PSVs) within the arms of P6 in 378 Y chromosomes whose evolutionary relationships within the SNP-defined Y phylogeny are known. This allowed the identification of 146 historical gene conversion events involving individual PSVs, occurring at a rate of 2.9–8.4×10−4 events per generation. A consideration of the nature of nucleotide change and the ancestral state of each PSV showed that the conversion process was significantly biased towards the fixation of G or C nucleotides (GC-biased), and also towards the ancestral state. Determination of haplotypes by long-PCR allowed likely co-conversion of PSVs to be identified, and suggested that conversion tract lengths are large, with a mean of 2068 bp, and a maximum in excess of 9 kb. Despite the frequent formation of recombination intermediates implied by the rapid observed gene conversion activity, resolution via crossover is rare: only three inversions within P6 were detected in the sample. An analysis of chimpanzee and gorilla P6 orthologs showed that the ancestral state bias has existed in all three species, and comparison of human and chimpanzee sequences with the gorilla outgroup confirmed that GC bias of the conversion process has apparently been active in both the human and chimpanzee lineages. The sex-determining role of the human Y chromosome makes it male-specific, and always present in only a single copy. This solo lifestyle has endowed it with some bizarre features, among which are eight large DNA units constituting about a quarter of the chromosome's length, and containing many genes important for sperm production. These units are called palindromes, since, taking into account the polarity of the DNA strands, the sequence is the same read from either end of the unit. We investigated the details of a process (gene conversion) that transfers sequence variants in one half of a palindrome into the other, thereby maintaining >99.9% similarity between the halves. We analysed patterns of sequence variants within one palindrome in a set of Y chromosomes whose evolutionary relationships are known. This allowed us to identify past gene conversion events, and to demonstrate a bias towards events that eliminate new variants, and retain old ones. Gene conversion has therefore acted during human evolution to retard sequence change in these regions. Analysis of the chimpanzee and gorilla versions of the palindrome shows that the dynamic processes we see in human Y chromosomes have a deep evolutionary history.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | | | - Georgina R. Bowden
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Stéphane Ballereau
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Mark A. Jobling
- Department of Genetics, University of Leicester, Leicester, United Kingdom
- * E-mail:
| |
Collapse
|
93
|
Romiguier J, Ranwez V, Delsuc F, Galtier N, Douzery EJP. Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals. Mol Biol Evol 2013; 30:2134-44. [PMID: 23813978 DOI: 10.1093/molbev/mst116] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Despite the rapid increase of size in phylogenomic data sets, a number of important nodes on animal phylogeny are still unresolved. Among these, the rooting of the placental mammal tree is still a controversial issue. One difficulty lies in the pervasive phylogenetic conflicts among genes, with each one telling its own story, which may be reliable or not. Here, we identified a simple criterion, that is, the GC content, which substantially helps in determining which gene trees best reflect the species tree. We assessed the ability of 13,111 coding sequence alignments to correctly reconstruct the placental phylogeny. We found that GC-rich genes induced a higher amount of conflict among gene trees and performed worse than AT-rich genes in retrieving well-supported, consensual nodes on the placental tree. We interpret this GC effect mainly as a consequence of genome-wide variations in recombination rate. Indeed, recombination is known to drive GC-content evolution through GC-biased gene conversion and might be problematic for phylogenetic reconstruction, for instance, in an incomplete lineage sorting context. When we focused on the AT-richest fraction of the data set, the resolution level of the placental phylogeny was greatly increased, and a strong support was obtained in favor of an Afrotheria rooting, that is, Afrotheria as the sister group of all other placentals. We show that in mammals most conflicts among gene trees, which have so far hampered the resolution of the placental tree, are concentrated in the GC-rich regions of the genome. We argue that the GC content-because it is a reliable indicator of the long-term recombination rate-is an informative criterion that could help in identifying the most reliable molecular markers for species tree inference.
Collapse
Affiliation(s)
- Jonathan Romiguier
- CNRS, Université Montpellier, Institut des Sciences de l'Evolution, Montpellier, France.
| | | | | | | | | |
Collapse
|
94
|
Cutter AD, Jovelin R, Dey A. Molecular hyperdiversity and evolution in very large populations. Mol Ecol 2013; 22:2074-95. [PMID: 23506466 PMCID: PMC4065115 DOI: 10.1111/mec.12281] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 02/06/2023]
Abstract
The genomic density of sequence polymorphisms critically affects the sensitivity of inferences about ongoing sequence evolution, function and demographic history. Most animal and plant genomes have relatively low densities of polymorphisms, but some species are hyperdiverse with neutral nucleotide heterozygosity exceeding 5%. Eukaryotes with extremely large populations, mimicking bacterial and viral populations, present novel opportunities for studying molecular evolution in sexually reproducing taxa with complex development. In particular, hyperdiverse species can help answer controversial questions about the evolution of genome complexity, the limits of natural selection, modes of adaptation and subtleties of the mutation process. However, such systems have some inherent complications and here we identify topics in need of theoretical developments. Close relatives of the model organisms Caenorhabditis elegans and Drosophila melanogaster provide known examples of hyperdiverse eukaryotes, encouraging functional dissection of resulting molecular evolutionary patterns. We recommend how best to exploit hyperdiverse populations for analysis, for example, in quantifying the impact of noncrossover recombination in genomes and for determining the identity and micro-evolutionary selective pressures on noncoding regulatory elements.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
95
|
Lesecque Y, Mouchiroud D, Duret L. GC-biased gene conversion in yeast is specifically associated with crossovers: molecular mechanisms and evolutionary significance. Mol Biol Evol 2013; 30:1409-19. [PMID: 23505044 PMCID: PMC3649680 DOI: 10.1093/molbev/mst056] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
GC-biased gene conversion (gBGC) is a process associated with recombination that favors the transmission of GC alleles over AT alleles during meiosis. gBGC plays a major role in genome evolution in many eukaryotes. However, the molecular mechanisms of gBGC are still unknown. Different steps of the recombination process could potentially cause gBGC: the formation of double-strand breaks (DSBs), the invasion of the homologous or sister chromatid, and the repair of mismatches in heteroduplexes. To investigate these models, we analyzed a genome-wide data set of crossovers (COs) and noncrossovers (NCOs) in Saccharomyces cerevisiae. We demonstrate that the overtransmission of GC alleles is specific to COs and that it occurs among conversion tracts in which all alleles are converted from the same donor haplotype. Thus, gBGC results from a process that leads to long-patch repair. We show that gBGC is associated with longer tracts and that it is driven by the nature (GC or AT) of the alleles located at the extremities of the tract. These observations invalidate the hypotheses that gBGC is due to the base excision repair machinery or to a bias in DSB formation and suggest that in S. cerevisiae, gBGC is caused by the mismatch repair (MMR) system. We propose that the presence of nicks on both DNA strands during CO resolution could be the cause of the bias in MMR activity. Our observations are consistent with the hypothesis that gBGC is a nonadaptive consequence of a selective pressure to limit the mutation rate in mitotic cells.
Collapse
Affiliation(s)
- Yann Lesecque
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon 1, Villeurbanne, France
| | | | | |
Collapse
|
96
|
Backström N, Zhang Q, Edwards SV. Evidence from a House Finch (Haemorhous mexicanus) Spleen Transcriptome for Adaptive Evolution and Biased Gene Conversion in Passerine Birds. Mol Biol Evol 2013; 30:1046-50. [DOI: 10.1093/molbev/mst033] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
|
97
|
Hazzouri KM, Escobar JS, Ness RW, Killian Newman L, Randle AM, Kalisz S, Wright SI. Comparative population genomics in Collinsia sister species reveals evidence for reduced effective population size, relaxed selection, and evolution of biased gene conversion with an ongoing mating system shift. Evolution 2013; 67:1263-78. [PMID: 23617907 DOI: 10.1111/evo.12027] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2012] [Accepted: 11/15/2012] [Indexed: 10/27/2022]
Abstract
Selfing species experience reduced effective recombination rates and effective population size, which can lead to reductions in polymorphism and the efficacy of natural selection. Here, we use illumina transcriptome sequencing and population resequencing to test for changes in polymorphism, base composition, and selection in the selfing angiosperm Collinsia rattanii (Plantaginaceae) compared with its more outcrossing sister species Collinsia linearis. Coalescent analysis indicates intermediate species divergence (500,000-1 million years) with no ongoing gene flow, but also evidence that the C. rattanii clade remains polymorphic for floral morphology and mating system, suggesting either an ongoing shift to selfing or a potential reversal from selfing to outcrossing. We identify a significant reduction in polymorphism in C. rattanii, particularly within populations. Analysis of polymorphisms suggests an elevated ratio of unique nonsynonymous to synonymous polymorphism in C. rattanii, consistent with relaxed selection in selfing lineages. We additionally find higher linkage disequilibrium and differentiation, lower GC content at variable sites, and reduced expression of genes important in pollen production and pollinator attraction in C. rattanii compared with C. linearis. Together, our results highlight the potential for rapid shifts in the efficacy of selection, gene expression and base composition associated with ongoing evolution of selfing.
Collapse
Affiliation(s)
- Khaled M Hazzouri
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | | | | | | | | | | | | |
Collapse
|
98
|
Campos JL, Zeng K, Parker DJ, Charlesworth B, Haddrill PR. Codon usage bias and effective population sizes on the X chromosome versus the autosomes in Drosophila melanogaster. Mol Biol Evol 2012. [PMID: 23204387 PMCID: PMC3603305 DOI: 10.1093/molbev/mss222] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Codon usage bias (CUB) in Drosophila is higher for X-linked genes than for autosomal genes. One possible explanation is that the higher effective recombination rate for genes on the X chromosome compared with the autosomes reduces their susceptibility to Hill–Robertson effects, and thus enhances the efficacy of selection on codon usage. The genome sequence of D. melanogaster was used to test this hypothesis. Contrary to expectation, it was found that, after correcting for the effective recombination rate, CUB remained higher on the X than on the autosomes. In contrast, an analysis of polymorphism data from a Rwandan population showed that mean nucleotide site diversity at 4-fold degenerate sites for genes on the X is approximately three-quarters of the autosomal value after correcting for the effective recombination rate, compared with approximate equality before correction. In addition, these data show that selection for preferred versus unpreferred synonymous variants is stronger on the X than the autosomes, which accounts for the higher CUB of genes on the X chromosome. This difference in the strength of selection does not appear to reflect the effects of dominance of mutations affecting codon usage, differences in gene expression levels between X and autosomes, or differences in mutational bias. Its cause therefore remains unexplained. The stronger selection on CUB on the X chromosome leads to a lower rate of synonymous site divergence compared with the autosomes; this will cause a stronger upward bias for X than A in estimates of the proportion of nonsynonymous mutations fixed by positive selection, for methods based on the McDonald–Kreitman test.
Collapse
Affiliation(s)
- Jose L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | | | | | |
Collapse
|
99
|
Tortereau F, Servin B, Frantz L, Megens HJ, Milan D, Rohrer G, Wiedmann R, Beever J, Archibald AL, Schook LB, Groenen MAM. A high density recombination map of the pig reveals a correlation between sex-specific recombination and GC content. BMC Genomics 2012; 13:586. [PMID: 23152986 PMCID: PMC3499283 DOI: 10.1186/1471-2164-13-586] [Citation(s) in RCA: 121] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Accepted: 08/09/2012] [Indexed: 12/18/2022] Open
Abstract
Background The availability of a high-density SNP genotyping chip and a reference genome sequence of the pig (Sus scrofa) enabled the construction of a high-density linkage map. A high-density linkage map is an essential tool for further fine-mapping of quantitative trait loci (QTL) for a variety of traits in the pig and for a better understanding of mechanisms underlying genome evolution. Results Four different pig pedigrees were genotyped using the Illumina PorcineSNP60 BeadChip. Recombination maps for the autosomes were computed for each individual pedigree using a common set of markers. The resulting genetic maps comprised 38,599 SNPs, including 928 SNPs not positioned on a chromosome in the current assembly of the pig genome (build 10.2). The total genetic length varied according to the pedigree, from 1797 to 2149 cM. Female maps were longer than male maps, with a notable exception for SSC1 where male maps are characterized by a higher recombination rate than females in the region between 91–250 Mb. The recombination rates varied among chromosomes and along individual chromosomes, regions with high recombination rates tending to cluster close to the chromosome ends, irrespective of the position of the centromere. Correlations between main sequence features and recombination rates were investigated and significant correlations were obtained for all the studied motifs. Regions characterized by high recombination rates were enriched for specific GC-rich sequence motifs as compared to low recombinant regions. These correlations were higher in females than in males, and females were found to be more recombinant than males at regions where the GC content was greater than 0.4. Conclusions The analysis of the recombination rate along the pig genome highlighted that the regions exhibiting higher levels of recombination tend to cluster around the ends of the chromosomes irrespective of the location of the centromere. Major sex-differences in recombination were observed: females had a higher recombination rate within GC-rich regions and exhibited a stronger correlation between recombination rates and specific sequence features.
Collapse
Affiliation(s)
- Flavie Tortereau
- Wageningen University, Animal Breeding and Genomics Centre, 6700AH, Wageningen, The Netherlands
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
100
|
McGaugh SE, Heil CSS, Manzano-Winkler B, Loewe L, Goldstein S, Himmel TL, Noor MAF. Recombination modulates how selection affects linked sites in Drosophila. PLoS Biol 2012; 10:e1001422. [PMID: 23152720 PMCID: PMC3496668 DOI: 10.1371/journal.pbio.1001422] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 10/05/2012] [Indexed: 11/18/2022] Open
Abstract
Recombination rate in Drosophila species shapes the impact of selection in the genome and is positively correlated with nucleotide diversity. One of the most influential observations in molecular evolution has been a strong association between local recombination rate and nucleotide polymorphisms across the genome. This is interpreted as evidence for ubiquitous natural selection. The alternative explanation, that recombination is mutagenic, has been rejected by the absence of a similar association between local recombination rate and nucleotide divergence between species. However, many recent studies show that recombination rates are often very different even in closely related species, questioning whether an association between recombination rate and divergence between species has been tested satisfactorily. To circumvent this problem, we directly surveyed recombination across approximately 43% of the D. pseudoobscura physical genome in two separate recombination maps and 31% of the D. miranda physical genome, and we identified both global and local differences in recombination rate between these two closely related species. Using only regions with conserved recombination rates between and within species and accounting for multiple covariates, our data support the conclusion that recombination is positively related to diversity because recombination modulates Hill–Robertson effects in the genome and not because recombination is predominately mutagenic. Finally, we find evidence for dips in diversity around nonsynonymous substitutions. We infer that at least some of this reduction in diversity resulted from selective sweeps and examine these dips in the context of recombination rate. Individuals within a species differ in the DNA sequences of their genes. This sequence variation affects how well individuals survive or reproduce and is transmitted to their offspring. Genes near each other on individual chromosomes tend to be passed to offspring together—neighboring genes are unlikely to be separated by exchanges of genetic material derived from different parents during meiotic recombination. When genes are inherited together, however, the evolutionary forces acting on one gene can interfere with variation at its neighbors. Thus, variation at multiple genes can be lost if natural selection acts on one gene in close proximity. Recombination can prevent or reduce this loss of variation, but previous tests of this phenomenon failed to account for recombination rate differences between species. In this study, we show that some parts of the genome differ in recombination rate between two species of fruit fly, Drosophila pseudoobscura and D. miranda. Avoiding an assumption made in previous studies, we then examine sequence variation within and between fly species in those parts of the genome that have conserved recombination rates. Based on the results, we conclude that recombination indeed preserves variation within species that would otherwise have been eliminated by natural selection.
Collapse
Affiliation(s)
- Suzanne E McGaugh
- Biology Department, Duke University, Durham, North Carolina, United States of America.
| | | | | | | | | | | | | |
Collapse
|