1
Zhao ZY, Wu JW, Xu CG, Nong Y, Huang YF, Lai KD. Molecular identification and studies on genetic diversity and structure-related GC heterogeneity of Spatholobus Suberectus based on ITS2. Sci Rep 2024; 14:23523. [PMID: 39384849 PMCID: PMC11464735 DOI: 10.1038/s41598-024-75763-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2024] [Accepted: 10/08/2024] [Indexed: 10/11/2024]
Abstract
This study aimed to determine the role of internal transcribed spacer 2 (ITS2) in the identification of Spatholobus suberectus and to explore the genetic diversity of S. suberectus. A total of 292 ITS2s from S. suberectus and 17 other plant species were analysed. S. suberectus was clustered separately in the phylogenetic tree. The genetic distance between species was greater than that within S. suberectus. Synonymous substitution rate (Ks) analysis revealed that ITS2 diverged the most recently within S. suberectus (Ks = 0.0022). These findings suggested that ITS2 is suitable for the identification of S. suberectus. The ITS2s were divided into 8 haplotypes and 4 evolutionary branches on the basis of secondary structure, indicating that there was variation within S. suberectus. Evolutionary analysis revealed that the GC content of paired regions (pGC) was greater than that of unpaired regions (upGC), and the pGC showed a decreasing trend, whereas the upGC remained unchanged. Single-base mutation was the main cause of base pair substitution. In both the initial state and the equilibrium state, the substitution rate of GC was higher than that of AU. The increase in the GC content was partly attributed to GC-biased gene conversion (gBGC). High GC content reflected the high recombination and mutation rates of ITS2, which is the basis for species identification and genetic diversity. We characterized the sequence and structural characteristics of S. suberectus ITS2 in detail, providing a reference and basis for the identification of S. suberectus and its products, as well as the protection and utilization of wild resources.
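The pGC and upGC quantities named in the abstract can be illustrated with a minimal Python sketch. It assumes the secondary structure is available in dot-bracket notation, where "(" and ")" mark paired bases and "." marks unpaired ones. The function name `paired_unpaired_gc` and the toy hairpin are hypothetical and are not taken from the paper, whose actual pipeline is not described here.

```python
def paired_unpaired_gc(seq: str, structure: str) -> tuple[float, float]:
    """Return (pGC, upGC): GC fraction at paired and unpaired positions."""
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    seq = seq.upper()
    # Split bases by whether the dot-bracket symbol marks a base pair.
    paired = [b for b, s in zip(seq, structure) if s in "()"]
    unpaired = [b for b, s in zip(seq, structure) if s == "."]

    def gc_fraction(bases: list[str]) -> float:
        # An empty region is reported as 0.0 rather than raising.
        return sum(b in "GC" for b in bases) / len(bases) if bases else 0.0

    return gc_fraction(paired), gc_fraction(unpaired)


# Toy hairpin (hypothetical, not an ITS2 sequence): GC-only stem, AU-only loop.
pgc, upgc = paired_unpaired_gc("GGCAUUUUGCC", "(((.....)))")
print(f"pGC={pgc:.2f} upGC={upgc:.2f}")
```

In this toy case the stem is entirely G and C while the loop is A and U, so pGC exceeds upGC, the same direction of difference the abstract reports for ITS2.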
Affiliation(s)
- Zi-Yi Zhao
  - Guangxi Key Laboratory of Traditional Chinese Medicine Quality Standards, Guangxi Institute of Chinese Medicine & Pharmaceutical Science, Nanning, 530022, China
- Jia-Wen Wu
  - College of Horticulture and Landscape Architecture, Northeast Agricultural University, Harbin, 150000, China
- Chuan-Gui Xu
  - Guangxi Key Laboratory of Traditional Chinese Medicine Quality Standards, Guangxi Institute of Chinese Medicine & Pharmaceutical Science, Nanning, 530022, China
- You Nong
  - Guangxi Key Laboratory of Traditional Chinese Medicine Quality Standards, Guangxi Institute of Chinese Medicine & Pharmaceutical Science, Nanning, 530022, China
- Yun-Feng Huang
  - Guangxi Key Laboratory of Traditional Chinese Medicine Quality Standards, Guangxi Institute of Chinese Medicine & Pharmaceutical Science, Nanning, 530022, China
- Ke-Dao Lai
  - Guangxi Key Laboratory of Traditional Chinese Medicine Quality Standards, Guangxi Institute of Chinese Medicine & Pharmaceutical Science, Nanning, 530022, China
2
Zavala B, Dineen L, Fisher KJ, Opulente DA, Harrison MC, Wolters JF, Shen XX, Zhou X, Groenewald M, Hittinger CT, Rokas A, LaBella AL. Genomic factors shaping codon usage across the Saccharomycotina subphylum. bioRxiv 2024:2024.05.23.595506. [PMID: 38826271 PMCID: PMC11142207 DOI: 10.1101/2024.05.23.595506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/04/2024]
Abstract
Codon usage bias, or the unequal use of synonymous codons, is observed across genes, genomes, and between species. The biased use of synonymous codons has been implicated in many cellular functions, such as translation dynamics and transcript stability, but can also be shaped by neutral forces. The Saccharomycotina, the fungal subphylum containing the yeasts Saccharomyces cerevisiae and Candida albicans, has been a model system for studying codon usage. We characterized codon usage across 1,154 strains from 1,051 species to gain insight into the biases, molecular mechanisms, evolution, and genomic features contributing to codon usage patterns across the subphylum. We found evidence of a general preference for A/T-ending codons and correlations between codon usage bias, GC content, and tRNA-ome size. Codon usage bias is also distinct between the 12 orders within the subphylum to such a degree that yeasts can be classified into orders with an accuracy greater than 90% using a machine learning algorithm trained on codon usage. We also characterized the degree to which codon usage bias is impacted by translational selection. Interestingly, the degree of translational selection was influenced by a combination of genome features and assembly metrics that included the number of coding sequences, BUSCO count, and genome length. Our analysis also revealed an extreme bias in codon usage in the Saccharomycodales associated with a lack of predicted arginine tRNAs. The order contains 24 species, and 23 are computationally predicted to lack tRNAs that decode CGN codons, leaving only the AGN codons to encode arginine. Analysis of Saccharomycodales gene expression, tRNA sequences, and codon evolution suggests that extreme avoidance of the CGN codons is associated with a decline in arginine tRNA function.
Codon usage bias within the Saccharomycotina is generally consistent with previous investigations in fungi, which show a role for both genomic features and GC bias in shaping codon usage. However, we find cases of extreme codon usage preference and avoidance along yeast lineages, suggesting additional forces may be shaping the evolution of specific codons.
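One common way to quantify the kind of codon preference and avoidance described above is relative synonymous codon usage (RSCU): the observed count of a codon divided by the count expected if all synonymous codons were used equally. The sketch below is a generic illustration for the arginine family under the standard genetic code (CGN plus AGA and AGG). It is not the authors' pipeline, and the function name `rscu` and the toy coding sequence are hypothetical.

```python
from collections import Counter

# Arginine codons in the standard genetic code (DNA alphabet).
ARG_CODONS = ("CGT", "CGC", "CGA", "CGG", "AGA", "AGG")


def rscu(cds: str, family: tuple[str, ...] = ARG_CODONS) -> dict[str, float]:
    """Relative synonymous codon usage for one synonymous codon family."""
    cds = cds.upper().replace("U", "T")
    # Read in-frame codons, ignoring any trailing partial codon.
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    counts = Counter(c for c in codons if c in family)
    total = sum(counts.values())
    if total == 0:
        return {c: 0.0 for c in family}
    expected = total / len(family)  # count per codon under equal usage
    return {c: counts[c] / expected for c in family}


# Toy CDS (hypothetical): ATG AGA AGA AGG CGT AGA TAA
values = rscu("ATGAGAAGAAGGCGTAGATAA")
for codon, value in values.items():
    print(codon, round(value, 2))
```

An RSCU of 1 means a codon is used as often as expected under equal usage. Values near 0 for all four CGN codons would correspond to the avoidance pattern the abstract reports for the Saccharomycodales.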
3
Komluski J, Habig M, Stukenbrock EH. Repeat-Induced Point Mutation and Gene Conversion Coinciding with Heterochromatin Shape the Genome of a Plant-Pathogenic Fungus. mBio 2023:e0329022. [PMID: 37093087 DOI: 10.1128/mbio.03290-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/25/2023]
Abstract
Meiosis is associated with genetic changes in the genome via recombination, gene conversion, and mutations. The occurrence of gene conversion and mutations during meiosis may further be influenced by the chromatin conformation, similar to the effect of the chromatin conformation on the mitotic mutation rate. To date, however, the exact distribution and type of meiosis-associated changes and the role of the chromatin conformation in this context are largely unexplored. Here, we determine recombination, gene conversion, and de novo mutations using whole-genome sequencing of all meiotic products of 23 individual meioses in Zymoseptoria tritici, an important pathogen of wheat. We confirm a high genome-wide recombination rate of 65 centimorgan (cM)/Mb and see higher recombination rates on the accessory compared to core chromosomes. A substantial fraction of 0.16% of all polymorphic markers was affected by gene conversions, showing a weak GC-bias and occurring at higher frequency in regions of constitutive heterochromatin, indicated by the histone modification H3K9me3. The de novo mutation rate associated with meiosis was approximately three orders of magnitude higher than the corresponding mitotic mutation rate. Importantly, repeat-induced point mutation (RIP), a fungal defense mechanism against duplicated sequences, is active in Z. tritici and responsible for the majority of these de novo meiotic mutations. Our results indicate that the genetic changes associated with meiosis are a major source of variability in the genome of an important plant pathogen and shape its evolutionary trajectory. IMPORTANCE The impact of meiosis on the genome composition via gene conversion and mutations is mostly poorly understood, in particular, for non-model species. Here, we sequenced all four meiotic products for 23 individual meioses and determined the genetic changes caused by meiosis for the important fungal wheat pathogen Zymoseptoria tritici.
We found a high rate of gene conversions and an effect of the chromatin conformation on gene conversion rates. Higher conversion rates were found in regions enriched with H3K9me3, a mark for constitutive heterochromatin. Most importantly, meiosis was associated with a much higher frequency of de novo mutations than mitosis; 78% of the meiotic mutations were caused by repeat-induced point mutations, a fungal defense mechanism against duplicated sequences. In conclusion, the genetic changes associated with meiosis are a major factor shaping the genome of this fungal pathogen.
Affiliation(s)
- Jovan Komluski
  - Environmental Genomics, Christian-Albrechts University of Kiel, Kiel, Germany
  - Max Planck Institute for Evolutionary Biology, Plön, Germany
- Michael Habig
  - Environmental Genomics, Christian-Albrechts University of Kiel, Kiel, Germany
  - Max Planck Institute for Evolutionary Biology, Plön, Germany
- Eva H Stukenbrock
  - Environmental Genomics, Christian-Albrechts University of Kiel, Kiel, Germany
  - Max Planck Institute for Evolutionary Biology, Plön, Germany
4
Lee B, Cyrill SL, Lee W, Melchiotti R, Andiappan AK, Poidinger M, Rötzschke O. Analysis of archaic human haplotypes suggests that 5hmC acts as an epigenetic guide for NCO recombination. BMC Biol 2022; 20:173. [PMID: 35927700 PMCID: PMC9354366 DOI: 10.1186/s12915-022-01353-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/01/2021] [Accepted: 06/17/2022] [Indexed: 11/17/2022]
Abstract
Background: Non-crossover (NCO) refers to a mechanism of homologous recombination in which short tracks of DNA are copied between homologue chromatids. The allelic changes are typically restricted to one or few SNPs, which potentially allow for the gradual adaptation and maturation of haplotypes. It is assumed to be a stochastic process but the analysis of archaic and modern human haplotypes revealed a striking variability in local NCO recombination rates. Methods: NCO recombination rates of 1.9 million archaic SNPs shared with Denisovan hominids were defined by a linkage study and correlated with functional and genomic annotations as well as ChIP-Seq data from modern humans. Results: We detected a strong correlation between NCO recombination rates and the function of the respective region: low NCO rates were evident in introns and quiescent intergenic regions but high rates in splice sites, exons, 5′- and 3′-UTRs, as well as CpG islands. Correlations with ChIP-Seq data from ENCODE and other public sources further identified epigenetic modifications that associated directly with these recombination events. A particularly strong association was observed for 5-hydroxymethylcytosine marks (5hmC), which were enriched in virtually all of the functional regions associated with elevated NCO rates, including CpG islands and ‘poised’ bivalent regions. Conclusion: Our results suggest that 5hmC marks may guide the NCO machinery specifically towards functionally relevant regions and, as an intermediate of oxidative demethylation, may open a pathway for environmental influence by specifically targeting recently opened gene loci. Supplementary Information: The online version contains supplementary material available at 10.1186/s12915-022-01353-9.
Affiliation(s)
- Bernett Lee
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore; Present address: Lee Kong Chian School of Medicine, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
- Samantha Leeanne Cyrill
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore; Present address: Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY, 11724, USA
- Wendy Lee
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
- Rossella Melchiotti
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
- Anand Kumar Andiappan
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
- Michael Poidinger
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore; Present address: Murdoch Children's Research Institute, Royal Children's Hospital, Flemington Road, Parkville, Victoria, 3052, Australia
- Olaf Rötzschke
  - Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
5
Cope AL, Shah P. Intragenomic variation in non-adaptive nucleotide biases causes underestimation of selection on synonymous codon usage. PLoS Genet 2022; 18:e1010256. [PMID: 35714134 PMCID: PMC9246145 DOI: 10.1371/journal.pgen.1010256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2021] [Revised: 06/30/2022] [Accepted: 05/13/2022] [Indexed: 11/20/2022]
Abstract
Patterns of non-uniform usage of synonymous codons vary across genes in an organism and between species across all domains of life. This codon usage bias (CUB) is due to a combination of non-adaptive (e.g. mutation biases) and adaptive (e.g. natural selection for translation efficiency/accuracy) evolutionary forces. Most models quantify the effects of mutation bias and selection on CUB assuming uniform mutational and other non-adaptive forces across the genome. However, non-adaptive nucleotide biases can vary within a genome due to processes such as biased gene conversion (BGC), potentially obfuscating signals of selection on codon usage. Moreover, genome-wide estimates of non-adaptive nucleotide biases are lacking for non-model organisms. We combine an unsupervised learning method with a population genetics model of synonymous coding sequence evolution to assess the impact of intragenomic variation in non-adaptive nucleotide bias on quantification of natural selection on synonymous codon usage across 49 Saccharomycotina yeasts. We find that in the absence of a priori information, unsupervised learning can be used to identify genes evolving under different non-adaptive nucleotide biases. We find that the impact of intragenomic variation in non-adaptive nucleotide bias varies widely, even among closely-related species. We show that the overall strength and direction of translational selection can be underestimated by failing to account for intragenomic variation in non-adaptive nucleotide biases. Interestingly, genes falling into clusters identified by machine learning are also physically clustered across chromosomes. Our results indicate the need for more nuanced models of sequence evolution that systematically incorporate the effects of variable non-adaptive nucleotide biases on codon frequencies.
Affiliation(s)
- Alexander L. Cope
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
  - Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
  - Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, United States of America
- Premal Shah
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
  - Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
6
Cope AL, Gilchrist MA. Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach. BMC Genomics 2022; 23:408. [PMID: 35637464 PMCID: PMC9153123 DOI: 10.1186/s12864-022-08635-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 09/08/2021] [Accepted: 05/03/2022] [Indexed: 11/28/2022]
Abstract
Background: Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, and/or protein folding. Given the critical link between protein folding and protein function, numerous studies have analyzed the relationship between codon usage and protein structure. The results from these studies have often been contradictory, likely reflecting the differing methods used for measuring codon usage and the failure to appropriately control for confounding factors, such as differences in amino acid usage between protein structures and changes in the frequency of different structures with gene expression. Results: Here we take an explicit population genetics approach to quantify codon-specific shifts in natural selection related to protein structure in S. cerevisiae and E. coli. Unlike other metrics of codon usage, our approach explicitly separates the effects of natural selection, scaled by gene expression, and mutation bias while naturally accounting for a region’s amino acid usage. Bayesian model comparisons suggest selection on codon usage varies only slightly between helix, sheet, and coil secondary structures and, similarly, between structured and intrinsically-disordered regions. Similarly, in contrast to previous findings, we find selection on codon usage only varies slightly at the termini of helices in E. coli. Using simulated data, we show that previous work indicating “non-optimal” codons are enriched at the beginning of helices in S. cerevisiae reflected a failure to control for various confounding factors (e.g. amino acid biases, gene expression, etc.), rather than selection to modulate cotranslational folding.
Conclusions: Our results reveal a weak relationship between codon usage and protein structure, indicating that differences in selection on codon usage between structures are slight. In addition to the magnitude of differences in selection between protein structures being slight, the observed shifts appear to be idiosyncratic and largely codon-specific rather than systematic reversals in the nature of selection. Overall, our work demonstrates the statistical power and benefits of studying selective shifts on codon usage or other genomic features from an explicitly evolutionary approach. Limitations of this approach and future potential research avenues are discussed. Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-022-08635-0.
Affiliation(s)
- Alexander L Cope
  - Genome Science and Technology, University of Tennessee, Knoxville, United States; Current Address: Department of Genetics, Rutgers University, Piscataway, United States
- Michael A Gilchrist
  - Genome Science and Technology, University of Tennessee, Knoxville, United States
  - National Institute for Mathematical and Biological Synthesis, Knoxville, TN, United States
  - Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, United States
7
Targeted Inter-Homologs Recombination in Arabidopsis Euchromatin and Heterochromatin. Int J Mol Sci 2021; 22:ijms222212096. [PMID: 34829981 PMCID: PMC8622013 DOI: 10.3390/ijms222212096] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/30/2021] [Revised: 11/04/2021] [Accepted: 11/05/2021] [Indexed: 12/20/2022]
Abstract
Homologous recombination (HR) typically occurs during meiosis between homologs, at a few unplanned locations along the chromosomes. In this study, we tested whether targeted recombination between homologous chromosomes can be achieved via Clustered Regularly Interspaced Short Palindromic Repeats-associated protein Cas9 (CRISPR-Cas9)-induced DNA double-strand break (DSB) repair in Arabidopsis thaliana. Our experimental system includes targets for DSB induction in euchromatic and heterochromatic genomic regions of hybrid F1 plants, in one or both parental chromosomes, using phenotypic and molecular markers to measure Non-Homologous End Joining and HR repair. We present several lines of evidence showing that targeted DSBs can be repaired via HR using a homologous chromosome as the template in various chromatin contexts, including in pericentric regions. Targeted crossover was rare, whereas gene conversion events were the most frequent outcome of HR and were found in both “hot” and “cold” regions. The length of the conversion tracts was variable, ranging from 5 to 7505 bp. In addition, a typical feature of these tracts was that they were often interrupted. Our findings pave the way for the use of targeted gene conversion for precise breeding.
8
Bergero R, Ellis P, Haerty W, Larcombe L, Macaulay I, Mehta T, Mogensen M, Murray D, Nash W, Neale MJ, O'Connor R, Ottolini C, Peel N, Ramsey L, Skinner B, Suh A, Summers M, Sun Y, Tidy A, Rahbari R, Rathje C, Immler S. Meiosis and beyond - understanding the mechanistic and evolutionary processes shaping the germline genome. Biol Rev Camb Philos Soc 2021; 96:822-841. [PMID: 33615674 PMCID: PMC8246768 DOI: 10.1111/brv.12680] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 11/26/2019] [Revised: 12/15/2020] [Accepted: 12/15/2020] [Indexed: 12/11/2022]
Abstract
The separation of germ cell populations from the soma is part of the evolutionary transition to multicellularity. Only genetic information present in the germ cells will be inherited by future generations, and any molecular processes affecting the germline genome are therefore likely to be passed on. Despite its prevalence across taxonomic kingdoms, we are only starting to understand details of the underlying micro-evolutionary processes occurring at the germline genome level. These include segregation, recombination, mutation and selection and can occur at any stage during germline differentiation and mitotic germline proliferation to meiosis and post-meiotic gamete maturation. Selection acting on germ cells at any stage from the diploid germ cell to the haploid gametes may cause significant deviations from Mendelian inheritance and may be more widespread than previously assumed. The mechanisms that affect and potentially alter the genomic sequence and allele frequencies in the germline are pivotal to our understanding of heritability. With the rise of new sequencing technologies, we are now able to address some of these unanswered questions. In this review, we comment on the most recent developments in this field and identify current gaps in our knowledge.
Affiliation(s)
- Roberta Bergero
  - Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3JT, U.K.
- Peter Ellis
  - School of Biosciences, University of Kent, Canterbury, CT2 7NJ, U.K.
- Lee Larcombe
  - Applied Exomics Ltd, Stevenage Bioscience Catalyst, Stevenage, SG1 2FX, U.K.
- Iain Macaulay
  - Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, U.K.
- Tarang Mehta
  - Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, U.K.
- Mette Mogensen
  - School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, U.K.
- David Murray
  - School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, U.K.
- Will Nash
  - Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, U.K.
- Matthew J. Neale
  - Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Brighton, BN1 9RH, U.K.
- Ned Peel
  - Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, U.K.
- Luke Ramsey
  - The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, U.K.
- Ben Skinner
  - School of Life Sciences, University of Essex, Colchester, CO4 3SQ, U.K.
- Alexander Suh
  - School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, U.K.
  - Department of Organismal Biology, Uppsala University, Norbyvägen 18D, Uppsala, 752 36, Sweden
- Michael Summers
  - School of Biosciences, University of Kent, Canterbury, CT2 7NJ, U.K.
  - The Bridge Centre, 1 St Thomas Street, London Bridge, London, SE1 9RY, U.K.
- Yu Sun
  - Norwich Medical School, University of East Anglia, Norwich Research Park, Colney Ln, Norwich, NR4 7UG, U.K.
- Alison Tidy
  - School of Biosciences, University of Nottingham, Plant Science, Sutton Bonington Campus, Sutton Bonington, LE12 5RD, U.K.
- Claudia Rathje
  - School of Biosciences, University of Kent, Canterbury, CT2 7NJ, U.K.
- Simone Immler
  - School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, U.K.
9
Charlesworth D, Zhang Y, Bergero R, Graham C, Gardner J, Yong L. Using GC Content to Compare Recombination Patterns on the Sex Chromosomes and Autosomes of the Guppy, Poecilia reticulata, and Its Close Outgroup Species. Mol Biol Evol 2021; 37:3550-3562. [PMID: 32697821 DOI: 10.1093/molbev/msaa187] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Indexed: 12/26/2022]
Abstract
Genetic and physical mapping of the guppy (Poecilia reticulata) have shown that recombination patterns differ greatly between males and females. Crossover events occur evenly across the chromosomes in females, but in male meiosis they are restricted to the tip furthest from the centromere of each chromosome, creating very high recombination rates per megabase, as in pseudoautosomal regions of mammalian sex chromosomes. We used GC content to indirectly infer recombination patterns on guppy chromosomes, based on evidence that recombination is associated with GC-biased gene conversion, so that genome regions with high recombination rates should be detectable by high GC content. We used intron sequences and third positions of codons to make comparisons between sequences that are matched, as far as possible, and are all probably under weak selection. Almost all guppy chromosomes, including the sex chromosome (LG12), have very high GC values near their assembly ends, suggesting high recombination rates due to strong crossover localization in male meiosis. Our test does not suggest that the guppy XY pair has stronger crossover localization than the autosomes, or than the homologous chromosome in the close relative, the platyfish (Xiphophorus maculatus). We therefore conclude that the guppy XY pair has not recently undergone an evolutionary change to a different recombination pattern, or reduced its crossover rate, but that the guppy evolved Y-linkage due to acquiring a male-determining factor that also conferred the male crossover pattern. We also identify the centromere ends of guppy chromosomes, which were not determined in the genome assembly.
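The GC content at third codon positions (often called GC3) that the abstract uses as a recombination proxy can be sketched as follows. This is a generic illustration, assuming an in-frame coding sequence on the DNA alphabet. The function name `gc3` and the toy sequence are hypothetical and do not reproduce the authors' analysis.

```python
def gc3(cds: str) -> float:
    """GC fraction at third codon positions of an in-frame coding sequence."""
    cds = cds.upper()
    # Slice every third base starting at index 2, i.e. codon position 3.
    third = [b for b in cds[2::3] if b in "ACGT"]  # skip N and other symbols
    if not third:
        return 0.0
    return sum(b in "GC" for b in third) / len(third)


# Toy CDS (hypothetical): ATG GCC AAG TTA has third positions G, C, G, A.
value = gc3("ATGGCCAAGTTA")
print(value)
```

Averaging such values over genes binned by chromosomal position would give the kind of along-chromosome GC profile the abstract describes, with elevated values expected where recombination is concentrated.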
Affiliation(s)
- Deborah Charlesworth
  - Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- Yexin Zhang
  - Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- Roberta Bergero
  - Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- Chay Graham
  - Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Jim Gardner
  - Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- Lengxob Yong
  - Centre for Ecology and Conservation, University of Exeter, Falmouth, Cornwall, United Kingdom
10
Schweizer G, Haider MB, Barroso GV, Rössel N, Münch K, Kahmann R, Dutheil JY. Population Genomics of the Maize Pathogen Ustilago maydis: Demographic History and Role of Virulence Clusters in Adaptation. Genome Biol Evol 2021; 13:evab073. [PMID: 33837781 PMCID: PMC8120014 DOI: 10.1093/gbe/evab073] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Accepted: 04/06/2021] [Indexed: 11/14/2022]
Abstract
The tight interaction between pathogens and their hosts results in reciprocal selective forces that impact the genetic diversity of the interacting species. The footprints of this selection differ between pathosystems because of distinct life-history traits, demographic histories, or genome architectures. Here, we studied the genome-wide patterns of genetic diversity of 22 isolates of the causative agent of the corn smut disease, Ustilago maydis, originating from five locations in Mexico, the presumed center of origin of this species. In this species, many genes encoding secreted effector proteins reside in so-called virulence clusters in the genome, an arrangement that is so far not found in other filamentous plant pathogens. Using a combination of population genomic statistical analyses, we assessed the geographical, historical, and genome-wide variation of genetic diversity in this fungal pathogen. We report evidence of two partially admixed subpopulations that are only loosely associated with geographic origin. Using the multiple sequentially Markov coalescent model, we inferred the demographic history of the two pathogen subpopulations over the last 0.5 Myr. We show that both populations experienced a recent strong bottleneck starting around 10,000 years ago, coinciding with the assumed time of maize domestication. Although the genome average genetic diversity is low compared with other fungal pathogens, we estimated that the rate of nonsynonymous adaptive substitutions is three times higher in genes located within virulence clusters compared with nonclustered genes, including nonclustered effector genes. These results highlight the role that these singular genomic regions play in the evolution of this pathogen.
Collapse
Affiliation(s)
- Gabriel Schweizer
  - Department of Organismic Interactions, Max-Planck-Institute for Terrestrial Microbiology, Marburg, Germany
- Muhammad Bilal Haider
  - Max-Planck-Institute for Evolutionary Biology, Research Group Molecular Systems Evolution, Plön, Germany
- Gustavo V Barroso
  - Max-Planck-Institute for Evolutionary Biology, Research Group Molecular Systems Evolution, Plön, Germany
- Nicole Rössel
  - Department of Organismic Interactions, Max-Planck-Institute for Terrestrial Microbiology, Marburg, Germany
- Karin Münch
  - Department of Organismic Interactions, Max-Planck-Institute for Terrestrial Microbiology, Marburg, Germany
- Regine Kahmann
  - Department of Organismic Interactions, Max-Planck-Institute for Terrestrial Microbiology, Marburg, Germany
- Julien Y Dutheil
  - Department of Organismic Interactions, Max-Planck-Institute for Terrestrial Microbiology, Marburg, Germany
  - Max-Planck-Institute for Evolutionary Biology, Research Group Molecular Systems Evolution, Plön, Germany
  - Institute of Evolutionary Sciences of Montpellier, University of Montpellier 2, France
11
Boman J, Mugal CF, Backström N. The Effects of GC-Biased Gene Conversion on Patterns of Genetic Diversity among and across Butterfly Genomes. Genome Biol Evol 2021; 13:evab064. [PMID: 33760095 PMCID: PMC8175052 DOI: 10.1093/gbe/evab064] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Accepted: 03/22/2021] [Indexed: 12/28/2022] Open
Abstract
Recombination reshuffles the alleles of a population through crossover and gene conversion. These mechanisms have considerable consequences on the evolution and maintenance of genetic diversity. Crossover, for example, can increase genetic diversity by breaking the linkage between selected and nearby neutral variants. Bias in favor of G or C alleles during gene conversion may instead promote the fixation of one allele over the other, thus decreasing diversity. Mutation bias from G or C to A and T opposes GC-biased gene conversion (gBGC). Less recognized is that these two processes may, when balanced, promote genetic diversity. Here, we investigate how gBGC and mutation bias shape genetic diversity patterns in wood white butterflies (Leptidea sp.). This constitutes the first in-depth investigation of gBGC in butterflies. Using 60 resequenced genomes from six populations of three species, we find substantial variation in the strength of gBGC across lineages. When modeling the balance of gBGC and mutation bias and comparing analytical results with empirical data, we reject gBGC as the main determinant of genetic diversity in these butterfly species. As alternatives, we consider linked selection and GC content. We find evidence that high values of both reduce diversity. We also show that the joint effects of gBGC and mutation bias can give rise to a diversity pattern which resembles the signature of linked selection. Consequently, gBGC should be considered when interpreting the effects of linked selection on levels of genetic diversity.
Affiliation(s)
- Jesper Boman
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
- Carina F Mugal
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
- Niclas Backström
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
12
Hämälä T, Tiffin P. Biased Gene Conversion Constrains Adaptation in Arabidopsis thaliana. Genetics 2020; 215:831-846. [PMID: 32414868 PMCID: PMC7337087 DOI: 10.1534/genetics.120.303335] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 03/22/2020] [Accepted: 05/14/2020] [Indexed: 02/01/2023] Open
Abstract
Reduction of fitness due to deleterious mutations imposes a limit to adaptive evolution. By characterizing features that influence this genetic load we may better understand constraints on responses to both natural and human-mediated selection. Here, using whole-genome, transcriptome, and methylome data from >600 Arabidopsis thaliana individuals, we set out to identify important features influencing selective constraint. Our analyses reveal that multiple factors underlie the accumulation of maladaptive mutations, including gene expression level, gene network connectivity, and gene-body methylation. We then focus on a feature with major effect, nucleotide composition. The ancestral vs. derived status of segregating alleles suggests that GC-biased gene conversion, a recombination-associated process that increases the frequency of G and C nucleotides regardless of their fitness effects, shapes sequence patterns in A. thaliana. Through estimation of mutational effects, we present evidence that biased gene conversion hinders the purging of deleterious mutations and contributes to a genome-wide signal of decreased efficacy of selection. By comparing these results to two outcrossing relatives, Arabidopsis lyrata and Capsella grandiflora, we find that protein evolution in A. thaliana is as strongly affected by biased gene conversion as in the outcrossing species. Last, we perform simulations to show that natural levels of outcrossing in A. thaliana are sufficient to facilitate biased gene conversion despite increased homozygosity due to selfing. Together, our results show that even predominantly selfing taxa are susceptible to biased gene conversion, suggesting that it may constitute an important constraint to adaptation among plant species.
Affiliation(s)
- Tuomas Hämälä
  - Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
- Peter Tiffin
  - Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
13
Rommel Fuentes R, Hesselink T, Nieuwenhuis R, Bakker L, Schijlen E, van Dooijeweert W, Diaz Trivino S, de Haan JR, Sanchez Perez G, Zhang X, Fransz P, de Jong H, van Dijk ADJ, de Ridder D, Peters SA. Meiotic recombination profiling of interspecific hybrid F1 tomato pollen by linked read sequencing. Plant J 2020; 102:480-492. [PMID: 31820490 DOI: 10.1111/tpj.14640] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 09/24/2019] [Revised: 11/25/2019] [Accepted: 12/04/2019] [Indexed: 06/10/2023]
Abstract
Genome-wide screening of pooled pollen samples from a single interspecific F1 hybrid, obtained from a cross between tomato (Solanum lycopersicum) and its wild relative Solanum pimpinellifolium, using linked read sequencing of the haploid nuclei allowed profiling of the crossover (CO) and gene conversion (GC) landscape. We observed a striking overlap between cold regions of CO in the male gametes and our previously established F6 recombinant inbred lines (RILs) population. COs were overrepresented in non-coding regions in the gene promoter and 5'UTR regions of genes. Poly-A/T and AT-rich motifs were found enriched in 1 kb promoter regions flanking the CO sites. Non-crossover associated allelic and ectopic GCs were detected in most chromosomes, confirming that, besides CO, GC also represents a source of genetic diversity and genome plasticity in tomato. Furthermore, we identified processed break junctions pointing at the involvement of both homology directed and non-homology directed repair pathways, suggesting a recombination machinery in tomato that is more complex than currently anticipated.
Affiliation(s)
- Roven Rommel Fuentes
  - Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Thamara Hesselink
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Ronald Nieuwenhuis
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Linda Bakker
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Elio Schijlen
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Willem van Dooijeweert
  - Centre for Genetic Resources, Wageningen University and Research, Wageningen, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Sara Diaz Trivino
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Jorn R de Haan
  - Genetwister Technologies B.V., Nieuwe Kanaal 7b, 6709 PA, Wageningen, The Netherlands
- Gabino Sanchez Perez
  - Genetwister Technologies B.V., Nieuwe Kanaal 7b, 6709 PA, Wageningen, The Netherlands
- Xinyue Zhang
  - Swammerdam Institute for Life Sciences, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
- Paul Fransz
  - Swammerdam Institute for Life Sciences, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
- Hans de Jong
  - Laboratory of Genetics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Aalt D J van Dijk
  - Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
  - Biometris, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Dick de Ridder
  - Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Sander A Peters
  - Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
14
Adapting Biased Gene Conversion theory to account for intensive GC-content deterioration in the human genome by novel mutations. PLoS One 2020; 15:e0232167. [PMID: 32353016 PMCID: PMC7192473 DOI: 10.1371/journal.pone.0232167] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 10/01/2019] [Accepted: 04/09/2020] [Indexed: 12/23/2022] Open
Abstract
We examined seventy million well-characterized human mutations, and their impact on G+C-compositional dynamics, in order to understand the formation and maintenance of major genomic nucleotide sequence patterns. Among novel mutations, those that change a strong (S) base pair G:C/C:G to a weak (W) pair A:T/T:A occur at nearly twice the frequency of the opposite mutations. Such imbalance puts strong downward pressure on overall GC-content. However, along protracted paths to fixation, S→W mutations are much less likely to propagate than W→S mutations. The magnitude of relative propagation disadvantages for S→W mutations is inexplicable by any currently-accepted model. This fact forced us to re-examine the quantitative features of Biased Gene Conversion (BGC) theory. Revised parameters of BGC that, per average individual, convert 7–14 W base pairs into S pairs, would account for the S-content turnover differences between new and old mutations, and make BGC an instrumental force for nucleotide dynamics and evolution. BGC should thus be considered seriously in both theories and biomedical practice. In particular, BGC should be taken into account during allele imputations, where missing SNP alleles are computationally predicted based on the information about several neighboring alleles. Finally, we analyzed the effect of neighboring nucleotide context on the mutation frequencies, dynamics, and GC-composition turnover. For this purpose, we examined genomic regions having extremely biased nucleotide compositions (enriched for S-, W-, purine/pyrimidine strand asymmetry, or AC/GT-strand asymmetry). It was found that point mutations in these regions preferentially degrade the nucleotide inhomogeneities, decreasing the sequence biases. Degradation of sequence bias is highest for novel mutations, and considerably lower for older mutations (those widespread across populations). 
Besides BGC, there may be additional, still uncharacterized molecular mechanisms that either preserve genomic regions with biased nucleotide compositions from mutational degradation or fail to degrade such inhomogeneities in specific chromosomal regions.
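As a rough illustration of the imbalance described in this abstract, the sketch below uses the standard two-state equilibrium expression GC* = 1 / (1 + λ·e^(−B)). Here λ is the ratio of S→W to W→S mutation rates and B is a population-scaled conversion bias. This is a textbook-style approximation, not the revised BGC parameterisation proposed by the authors; the function name and the example value of B are assumptions chosen for illustration only.

```python
import math


def equilibrium_gc(lam, B=0.0):
    """Equilibrium GC fraction under a two-state (W/S) model.

    lam : ratio of S->W to W->S mutation rates (mutation bias)
    B   : population-scaled strength of GC-biased conversion
          (B = 0 means mutation pressure alone)
    """
    return 1.0 / (1.0 + lam * math.exp(-B))


# S->W mutations arising about twice as often as W->S (lam = 2), no conversion bias:
print(round(equilibrium_gc(2.0), 3))

# Hypothetical bias B = ln 2, chosen so that it exactly offsets lam = 2:
print(round(equilibrium_gc(2.0, math.log(2)), 3))
```

With λ = 2 and no bias, the expected GC fraction settles near one third, which is the "downward pressure" the abstract refers to. A positive B pushes the equilibrium back up.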
15
Lim MCW, Witt CC, Graham CH, Dávalos LM. Parallel Molecular Evolution in Pathways, Genes, and Sites in High-Elevation Hummingbirds Revealed by Comparative Transcriptomics. Genome Biol Evol 2019; 11:1552-1572. [PMID: 31028697 PMCID: PMC6553502 DOI: 10.1093/gbe/evz101] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Accepted: 05/12/2019] [Indexed: 12/13/2022] Open
Abstract
High-elevation organisms experience shared environmental challenges that include low oxygen availability, cold temperatures, and intense ultraviolet radiation. Consequently, repeated evolution of the same genetic mechanisms may occur across high-elevation taxa. To test this prediction, we investigated the extent to which the same biochemical pathways, genes, or sites were subject to parallel molecular evolution for 12 Andean hummingbird species (family: Trochilidae) representing several independent transitions to high elevation across the phylogeny. Across high-elevation species, we discovered parallel evolution for several pathways and genes with evidence of positive selection. In particular, positively selected genes were frequently part of cellular respiration, metabolism, or cell death pathways. To further examine the role of elevation in our analyses, we compared results for low- and high-elevation species and tested different thresholds for defining elevation categories. In analyses with different elevation thresholds, positively selected genes reflected similar functions and pathways, even though there were almost no specific genes in common. For example, EPAS1 (HIF2α), which has been implicated in high-elevation adaptation in other vertebrates, shows a signature of positive selection when high-elevation is defined broadly (>1,500 m), but not when defined narrowly (>2,500 m). Although a few biochemical pathways and genes change predictably as part of hummingbird adaptation to high-elevation conditions, independent lineages have rarely adapted via the same substitutions.
Affiliation(s)
- Marisa C W Lim
  - Department of Ecology and Evolution, Stony Brook University
- Christopher C Witt
  - Museum of Southwestern Biology and Department of Biology, University of New Mexico
- Catherine H Graham
  - Department of Ecology and Evolution, Stony Brook University
  - Swiss Federal Research Institute (WSL), Birmensdorf, Switzerland
- Liliana M Dávalos
  - Department of Ecology and Evolution, Stony Brook University
  - Consortium for Inter-Disciplinary Environmental Research, Stony Brook University
16
Kawakami T, Wallberg A, Olsson A, Wintermantel D, de Miranda JR, Allsopp M, Rundlöf M, Webster MT. Substantial Heritable Variation in Recombination Rate on Multiple Scales in Honeybees and Bumblebees. Genetics 2019; 212:1101-1119. [PMID: 31152071 PMCID: PMC6707477 DOI: 10.1534/genetics.119.302008] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 02/12/2019] [Accepted: 05/30/2019] [Indexed: 12/30/2022] Open
Abstract
Meiotic recombination shuffles genetic variation and promotes correct segregation of chromosomes. Rates of recombination vary on several scales, both within genomes and between individuals, and this variation is affected by both genetic and environmental factors. Social insects have extremely high rates of recombination, although the evolutionary causes of this are not known. Here, we estimate rates of crossovers and gene conversions in 22 colonies of the honeybee, Apis mellifera, and 9 colonies of the bumblebee, Bombus terrestris, using direct sequencing of 299 haploid drone offspring. We confirm that both species have extremely elevated crossover rates, with higher rates measured in the highly eusocial honeybee than the primitively social bumblebee. There are also significant differences in recombination rate between subspecies of honeybee. There is substantial variation in genome-wide recombination rate between individuals of both A. mellifera and B. terrestris, and the distributions of these rates overlap between species. A large proportion of interindividual variation in recombination rate is heritable, which indicates the presence of variation in trans-acting factors that influence recombination genome-wide. We infer that levels of crossover interference are significantly lower in honeybees compared to bumblebees, which may be one mechanism that contributes to higher recombination rates in honeybees. We also find a significant increase in recombination rate with distance from the centromere, mirrored by methylation differences. We detect a strong transmission bias due to GC-biased gene conversion associated with noncrossover gene conversions. Our results shed light on the mechanistic causes of extreme rates of recombination in social insects and the genetic architecture of recombination rate variation.
Affiliation(s)
- Takeshi Kawakami
  - Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, 752 36, Sweden
  - Department of Animal and Plant Sciences, University of Sheffield, S10 2TN, United Kingdom
- Andreas Wallberg
  - Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05, Sweden
- Anna Olsson
  - Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05, Sweden
- Dimitry Wintermantel
  - INRA, UE 1255 APIS, Le Magneraud, 17700 Surgères, France
  - Centre d'Etudes Biologiques de Chizé, UMR 7372, CNRS and Université de La Rochelle, 79360 Villiers-en-Bois, France
- Joachim R de Miranda
  - Department of Ecology, Swedish University of Agricultural Sciences, Uppsala 750 07, Sweden
- Mike Allsopp
  - Plant Protection Research Institute, Agricultural Research Council, Stellenbosch, 7608, South Africa
- Maj Rundlöf
  - Department of Biology, Lund University, 223 62, Sweden
- Matthew T Webster
  - Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05, Sweden
17
Borges R, Szöllősi GJ, Kosiol C. Quantifying GC-Biased Gene Conversion in Great Ape Genomes Using Polymorphism-Aware Models. Genetics 2019; 212:1321-1336. [PMID: 31147380 PMCID: PMC6707462 DOI: 10.1534/genetics.119.302074] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 09/13/2018] [Accepted: 05/20/2019] [Indexed: 11/18/2022] Open
Abstract
As multi-individual population-scale data become available, more complex modeling strategies are needed to quantify genome-wide patterns of nucleotide usage and associated mechanisms of evolution. Recently, the multivariate neutral Moran model was proposed. However, it was shown insufficient to explain the distribution of alleles in great apes. Here, we propose a new model that includes allelic selection. Our theoretical results constitute the basis of a new Bayesian framework to estimate mutation rates and selection coefficients from population data. We apply the new framework to a great ape dataset, where we found patterns of allelic selection that match those of genome-wide GC-biased gene conversion (gBGC). In particular, we show that great apes have patterns of allelic selection that vary in intensity, a feature that we correlated with great apes' distinct demographies. We also demonstrate that the AT/GC toggling effect decreases the probability of a substitution, promoting more polymorphisms in the base composition of great ape genomes. We further assess the impact of GC-bias in molecular analysis, and find that mutation rates and genetic distances are estimated under bias when gBGC is not properly accounted for. Our results contribute to the discussion on the tempo and mode of gBGC evolution, while stressing the need for gBGC-aware models in population genetics and phylogenetics.
Affiliation(s)
- Rui Borges
  - Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
- Gergely J Szöllősi
  - Department of Biological Physics, MTA-ELTE "Lendulet" Evolutionary Genomics Research Group, Eötvös University, Pázmány P. stny. 1A, Budapest 1117, Hungary
- Carolin Kosiol
  - Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
  - Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
18
A century of bias in genetics and evolution. Heredity (Edinb) 2019; 123:33-43. [PMID: 31189901 DOI: 10.1038/s41437-019-0194-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 11/08/2018] [Revised: 01/29/2019] [Accepted: 01/29/2019] [Indexed: 02/08/2023] Open
Abstract
Mendel proposed that the heritable material is particulate and that transmission of alleles is unbiased. An assumption of unbiased transmission was necessary to show how variation can be preserved in the absence of selection, so overturning an early objection to Darwinism. In the second half of the twentieth century, it was widely recognised that even strongly deleterious alleles can invade if they have strongly biased transmission (i.e. strong segregation distortion). The spread of alleles with distorted segregation can explain many curiosities. More recently, the selectionist-neutralist duopoly was broken by the realisation that biased gene conversion can explain phenomena such as mammalian isochore structures. An initial focus on unbiased transmission in 1919 has thus given way to an interest in biased transmission in 2019. A focus on very weak bias is now possible owing to technological advances, although technical biases may put a limit on resolving power. To understand the relevance of weak bias we could profit from having the concept of the effectively Mendelian allele, a companion to the effectively neutral allele. Understanding the implications of unbiased and biased transmission may, I suggest, be a good way to teach evolution so as to avoid psychological biases.
19
Lim MCW, Witt CC, Graham CH, Dávalos LM. Divergent Fine-Scale Recombination Landscapes between a Freshwater and Marine Population of Threespine Stickleback Fish. Genome Biol Evol 2019; 11:1573-1585. [PMID: 31028697 PMCID: PMC6553502 DOI: 10.1093/gbe/evz090] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Accepted: 04/17/2019] [Indexed: 12/27/2022] Open
Abstract
Meiotic recombination is a highly conserved process that has profound effects on genome evolution. At a fine-scale, recombination rates can vary drastically across genomes, often localized into small recombination "hotspots" with highly elevated rates, surrounded by regions with little recombination. In most species studied, the location of hotspots within genomes is highly conserved across broad evolutionary timescales. The main exception to this pattern is in mammals, where hotspot location can evolve rapidly among closely related species and even among populations within a species. Hotspot position in mammals is controlled by the gene, Prdm9, whereas in species with conserved hotspots, a functional Prdm9 is typically absent. Due to a limited number of species where recombination rates have been estimated at a fine-scale, it remains unclear whether hotspot conservation is always associated with the absence of a functional Prdm9. Threespine stickleback fish (Gasterosteus aculeatus) are an excellent model to examine the evolution of recombination over short evolutionary timescales. Using a linkage disequilibrium-based approach, we found recombination rates indeed varied at a fine-scale across the genome, with many regions organized into narrow hotspots. Hotspots had highly divergent landscapes between stickleback populations, where only ∼15% of these hotspots were shared. Our results indicate that fine-scale recombination rates may be diverging between closely related populations of threespine stickleback fish. Interestingly, we found only a weak association of a PRDM9 binding motif within hotspots, which suggests that threespine stickleback fish may possess a novel mechanism for targeting recombination hotspots at a fine-scale.
Affiliation(s)
- Marisa C W Lim
  - Department of Ecology and Evolution, Stony Brook University
- Christopher C Witt
  - Museum of Southwestern Biology and Department of Biology, University of New Mexico
- Catherine H Graham
  - Department of Ecology and Evolution, Stony Brook University
  - Swiss Federal Research Institute (WSL), Birmensdorf, Switzerland
- Liliana M Dávalos
  - Department of Ecology and Evolution, Stony Brook University
  - Consortium for Inter-Disciplinary Environmental Research, Stony Brook University
20
Heissl A, Betancourt AJ, Hermann P, Povysil G, Arbeithuber B, Futschik A, Ebner T, Tiemann-Boege I. The impact of poly-A microsatellite heterologies in meiotic recombination. Life Sci Alliance 2019; 2:e201900364. [PMID: 31023833 PMCID: PMC6485458 DOI: 10.26508/lsa.201900364] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 03/04/2019] [Revised: 03/27/2019] [Accepted: 03/29/2019] [Indexed: 12/12/2022] Open
Abstract
Meiosis strongly influences the transmission and evolution of heterozygous poly-A repeats as measured experimentally in a large collection of single recombination products in a human hotspot. Meiotic recombination has strong, but poorly understood effects on short tandem repeat (STR) instability. Here, we screened thousands of single recombinant products with sperm typing to characterize the role of polymorphic poly-A repeats at a human recombination hotspot in terms of hotspot activity and STR evolution. We show that the length asymmetry between heterozygous poly-A’s strongly influences the recombination outcome: a heterology of 10 A’s (9A/19A) reduces the number of crossovers and elevates the frequency of non-crossovers, complex recombination products, and long conversion tracts. Moreover, the length of the heterology also influences the STR transmission during meiotic repair with a strong and significant insertion bias for the short heterology (6A/7A) and a deletion bias for the long heterology (9A/19A). In spite of this opposing insertion-/deletion-biased gene conversion, we find that poly-A’s are enriched at human recombination hotspots that could have important consequences in hotspot activation.
Affiliation(s)
- Angelika Heissl
  - Institute of Biophysics, Johannes Kepler University, Linz, Austria
- Philipp Hermann
  - Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
- Gundula Povysil
  - Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
- Andreas Futschik
  - Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
- Thomas Ebner
  - Department of Gynecology, Obstetrics and Gynecological Endocrinology, Kepler University Clinic, Linz, Austria
21
Galtier N, Roux C, Rousselle M, Romiguier J, Figuet E, Glémin S, Bierne N, Duret L. Codon Usage Bias in Animals: Disentangling the Effects of Natural Selection, Effective Population Size, and GC-Biased Gene Conversion. Mol Biol Evol 2018; 35:1092-1103. [PMID: 29390090 DOI: 10.1093/molbev/msy015] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Indexed: 12/13/2022] Open
Abstract
Selection on codon usage bias is well documented in a number of microorganisms. Whether codon usage is also generally shaped by natural selection in large organisms, despite their relatively small effective population size (Ne), is unclear. In animals, the population genetics of codon usage bias has only been studied in a handful of model organisms so far, and can be affected by confounding, nonadaptive processes such as GC-biased gene conversion and experimental artefacts. Using population transcriptomics data, we analyzed the relationship between codon usage, gene expression, allele frequency distribution, and recombination rate in 30 nonmodel species of animals, each from a different family, covering a wide range of effective population sizes. We disentangled the effects of translational selection and GC-biased gene conversion on codon usage by separately analyzing GC-conservative and GC-changing mutations. We report evidence for effective translational selection on codon usage in large-Ne species of animals, but not in small-Ne ones, in agreement with the nearly neutral theory of molecular evolution. C- and T-ending codons tend to be preferred over synonymous G- and A-ending ones, for reasons that remain to be determined. In contrast, we uncovered a conspicuous effect of GC-biased gene conversion, which is widespread in animals and the main force determining the fate of AT↔GC mutations. Intriguingly, the strength of its effect was uncorrelated with Ne.
Affiliation(s)
- Nicolas Galtier
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Camille Roux
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
  - UMR 8198 - Evo-Eco-Paleo, CNRS, Université de Lille-Sciences et Technologies, Villeneuve d'Ascq, France
- Marjolaine Rousselle
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Jonathan Romiguier
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Emeric Figuet
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Sylvain Glémin
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
  - Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Nicolas Bierne
  - UMR5554, Institut des Sciences de l'Evolution, University Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Laurent Duret
  - Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France
22
Rousselle M, Laverré A, Figuet E, Nabholz B, Galtier N. Influence of Recombination and GC-biased Gene Conversion on the Adaptive and Nonadaptive Substitution Rate in Mammals versus Birds. Mol Biol Evol 2019; 36:458-471. [PMID: 30590692 PMCID: PMC6389324 DOI: 10.1093/molbev/msy243] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Indexed: 12/22/2022] Open
Abstract
Recombination is expected to affect functional sequence evolution in several ways. On the one hand, recombination is thought to improve the efficiency of multilocus selection by dissipating linkage disequilibrium. On the other hand, natural selection can be counteracted by recombination-associated transmission distorters such as GC-biased gene conversion (gBGC), which tends to promote G and C alleles irrespective of their fitness effect in high-recombining regions. It has been suggested that gBGC might impact coding sequence evolution in vertebrates, and particularly the ratio of nonsynonymous to synonymous substitution rates (dN/dS). However, distinctive gBGC patterns have been reported in mammals and birds, maybe reflecting the documented contrasts in evolutionary dynamics of recombination rate between these two taxa. Here, we explore how recombination and gBGC affect coding sequence evolution in mammals and birds by analyzing proteome-wide data in six species of Galloanserae (fowls) and six species of catarrhine primates. We estimated the dN/dS ratio and rates of adaptive and nonadaptive evolution in bins of genes of increasing recombination rate, separately analyzing AT → GC, GC → AT, and G ↔ C/A ↔ T mutations. We show that in both taxa, recombination and gBGC entail a decrease in dN/dS. Our analysis indicates that recombination enhances the efficiency of purifying selection by lowering Hill-Robertson effects, whereas gBGC leads to an overestimation of the adaptive rate of AT → GC mutations. Finally, we report a mutagenic effect of recombination, which is independent of gBGC.
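The three mutation classes analysed separately in this abstract (AT → GC, GC → AT, and G ↔ C / A ↔ T) can be sketched with a small classifier. This is a minimal illustration, not the authors' pipeline. It assumes each mutation is already polarised into an ancestral and a derived base, and the function name and class labels are hypothetical.

```python
STRONG = {"G", "C"}  # S bases
WEAK = {"A", "T"}    # W bases


def classify_mutation(ancestral, derived):
    """Return 'W->S', 'S->W', or 'GC-conservative' for a point mutation."""
    a, d = ancestral.upper(), derived.upper()
    valid = STRONG | WEAK
    if a not in valid or d not in valid or a == d:
        raise ValueError(f"not a point mutation between A/C/G/T: {ancestral}>{derived}")
    if a in WEAK and d in STRONG:
        return "W->S"   # AT -> GC, the class favoured by gBGC
    if a in STRONG and d in WEAK:
        return "S->W"   # GC -> AT, the class opposed by gBGC
    return "GC-conservative"  # G <-> C or A <-> T, unaffected by gBGC


for anc, der in [("A", "G"), ("C", "T"), ("G", "C"), ("A", "T")]:
    print(anc, der, classify_mutation(anc, der))
```

Binning substitutions this way lets the GC-conservative class serve as a reference that gBGC should not distort, which is the contrast the abstract relies on.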
Affiliation(s)
- Alexandre Laverré
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Emeric Figuet
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Benoit Nabholz
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
- Nicolas Galtier
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
|
23
|
Korunes KL, Noor MAF. Pervasive gene conversion in chromosomal inversion heterozygotes. Mol Ecol 2018; 28:1302-1315. [PMID: 30387889 DOI: 10.1111/mec.14921] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Received: 07/31/2018] [Revised: 09/27/2018] [Accepted: 10/22/2018] [Indexed: 12/30/2022]
Abstract
Chromosomal inversions shape recombination landscapes, and species differing by inversions may exhibit reduced gene flow in these regions of the genome. Though single crossovers within inversions are not usually recovered from inversion heterozygotes, the recombination barrier imposed by inversions is nuanced by noncrossover gene conversion. Here, we provide a genomewide empirical analysis of gene conversion rates both within species and in species hybrids. We estimate that gene conversion occurs at a rate of 1 × 10⁻⁵ to 2.5 × 10⁻⁵ converted sites per bp per generation in experimental crosses within Drosophila pseudoobscura and between D. pseudoobscura and its naturally hybridizing sister species D. persimilis. This analysis is the first direct empirical assessment of gene conversion rates within inversions of a species hybrid. Our data show that gene conversion rates in interspecies hybrids are at least as high as within-species estimates of gene conversion rates, and gene conversion occurs regularly within and around inverted regions of species hybrids, even near inversion breakpoints. We also found that several gene conversion events appeared to be mitotic rather than meiotic in origin. Finally, we observed that gene conversion rates are higher in regions of lower local sequence divergence, yet our observed gene conversion rates in more divergent inverted regions were at least as high as in less divergent collinear regions. Given our observed high rates of gene conversion despite the sequence differentiation between species, especially in inverted regions, gene conversion has the potential to reduce the efficacy of inversions as barriers to recombination over evolutionary time.
|
24
|
Corcoran P, Gossmann TI, Barton HJ, Slate J, Zeng K. Determinants of the Efficacy of Natural Selection on Coding and Noncoding Variability in Two Passerine Species. Genome Biol Evol 2018; 9:2987-3007. [PMID: 29045655 PMCID: PMC5714183 DOI: 10.1093/gbe/evx213] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Accepted: 10/16/2017] [Indexed: 02/06/2023]
Abstract
Population genetic theory predicts that selection should be more effective when the effective population size (Ne) is larger, and that the efficacy of selection should correlate positively with recombination rate. Here, we analyzed the genomes of ten great tits and ten zebra finches. Nucleotide diversity at 4-fold degenerate sites indicates that zebra finches have a 2.83-fold larger Ne. We obtained clear evidence that purifying selection is more effective in zebra finches. The proportion of substitutions at 0-fold degenerate sites fixed by positive selection (α) is high in both species (great tit 48%; zebra finch 64%) and is significantly higher in zebra finches. When α was estimated on GC-conservative changes (i.e., between A and T and between G and C), the estimates reduced in both species (great tit 22%; zebra finch 53%). A theoretical model presented herein suggests that failing to control for the effects of GC-biased gene conversion (gBGC) is potentially a contributor to the overestimation of α, and that this effect cannot be alleviated by first fitting a demographic model to neutral variants. We present the first estimates in birds for α in the untranslated regions, and found evidence for substantial adaptive changes. Finally, although purifying selection is stronger in high-recombination regions, we obtained mixed evidence for α increasing with recombination rate, especially after accounting for gBGC. These results highlight that it is important to consider the potential confounding effects of gBGC when quantifying selection and that our understanding of what determines the efficacy of selection is incomplete.
Affiliation(s)
- Pádraic Corcoran
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
- Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
|
25
|
Tiemann-Boege I, Schwarz T, Striedner Y, Heissl A. The consequences of sequence erosion in the evolution of recombination hotspots. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0462. [PMID: 29109225 PMCID: PMC5698624 DOI: 10.1098/rstb.2016.0462] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Accepted: 09/05/2017] [Indexed: 12/18/2022]
Abstract
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Affiliation(s)
- Irene Tiemann-Boege
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
- Theresa Schwarz
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
- Yasmin Striedner
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
- Angelika Heissl
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
|
26
|
Abstract
Recombination often differs markedly between males and females. Here we present the first analysis of sex-specific recombination in Gasterosteus sticklebacks. Using whole-genome sequencing of 15 crosses between G. aculeatus and G. nipponicus, we localized 698 crossovers with a median resolution of 2.3 kb. We also used a bioinformatic approach to infer historical sex-averaged recombination patterns for both species. Recombination is greater in females than males on all chromosomes, and overall map length is 1.64 times longer in females. The locations of crossovers differ strikingly between sexes. Crossovers cluster toward chromosome ends in males, but are distributed more evenly across chromosomes in females. Suppression of recombination near the centromeres in males causes crossovers to cluster at the ends of long arms in acrocentric chromosomes, and greatly reduces crossing over on short arms. The effect of centromeres on recombination is much weaker in females. Genomic differentiation between G. aculeatus and G. nipponicus is strongly correlated with recombination rate, and patterns of differentiation along chromosomes are strongly influenced by male-specific telomere and centromere effects. We found no evidence for fine-scale correlations between recombination and local gene content in either sex. We discuss hypotheses for the origin of sexual dimorphism in recombination and its consequences for sexually antagonistic selection and sex chromosome evolution.
|
27
|
Dutta R, Saha-Mandal A, Cheng X, Qiu S, Serpen J, Fedorova L, Fedorov A. 1000 human genomes carry widespread signatures of GC biased gene conversion. BMC Genomics 2018; 19:256. [PMID: 29661137 PMCID: PMC5902838 DOI: 10.1186/s12864-018-4593-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 09/28/2017] [Accepted: 03/12/2018] [Indexed: 11/23/2022]
Abstract
BACKGROUND: GC-Biased Gene Conversion (gBGC) is one of the important theories put forward to explain profound long-range non-randomness in nucleotide compositions along mammalian chromosomes. Nucleotide changes due to gBGC are hard to distinguish from regular mutations. Here, we present an algorithm for analysis of millions of known SNPs that detects a subset of so-called "SNP flip-over" events representing recent gBGC nucleotide changes, which occurred in previous generations via non-crossover meiotic recombination.
RESULTS: This algorithm has been applied in a large-scale analysis of 1092 sequenced human genomes. Altogether, 56,328 regions on all autosomes have been examined, which revealed 223,955 putative gBGC cases leading to SNP flip-overs. We detected a strong bias (11.7% ± 0.2% excess) in AT → GC over GC → AT base pair changes within the entire set of putative gBGC cases.
CONCLUSIONS: On average, a human gamete acquires 7 SNP flip-over events, in which one allele is replaced by its complementary allele during the process of meiotic non-crossover recombination. In each meiosis event, on average, gBGC results in replacement of 7 AT base pairs by GC base pairs, while only 6 GC pairs are replaced by AT pairs. Therefore, every human gamete is enriched by one GC pair. Happening over millions of years of evolution, this bias may be a noticeable force in changing the nucleotide composition landscape along chromosomes.
Affiliation(s)
- Rajib Dutta
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Center for Cardiovascular and Pulmonary Research, Nationwide Children’s Hospital, 700 Children’s Dr, Columbus, OH USA
- Arnab Saha-Mandal
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Biochemistry and Molecular Biology Graduate Program, Cumming School of Medicine, University of Calgary, Calgary, AB T2N4N1 Canada
- Xi Cheng
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Shuhao Qiu
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Jasmine Serpen
- SURF Program, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- College of Arts and Sciences, Washington University in St. Louis, 1 Brookings Dr, St. Louis, MO 63130 USA
- Alexei Fedorov
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
|
28
|
Camiolo S, Porru C, Benítez-Cabello A, Rodríguez-Gómez F, Calero-Delgado B, Porceddu A, Budroni M, Mannazzu I, Jiménez-Díaz R, Arroyo-López FN. Genome overview of eight Candida boidinii strains isolated from human activities and wild environments. Stand Genomic Sci 2017; 12:70. [PMID: 29213357 PMCID: PMC5712119 DOI: 10.1186/s40793-017-0281-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 03/02/2017] [Accepted: 11/21/2017] [Indexed: 11/10/2022]
Affiliation(s)
- Salvatore Camiolo
- Dipartimento di Agraria, Università degli Studi di Sassari, Viale Italia 39, Sassari, Italy
- Cinzia Porru
- Dipartimento di Agraria, Università degli Studi di Sassari, Viale Italia 39, Sassari, Italy
- Antonio Benítez-Cabello
- Food Biotechnology Department, Instituto de la Grasa (C.S.I.C.), University Campus Pablo de Olavide, Building 46, Crta. de Utrera km 1, 41013 Seville, Spain
- Francisco Rodríguez-Gómez
- Food Biotechnology Department, Instituto de la Grasa (C.S.I.C.), University Campus Pablo de Olavide, Building 46, Crta. de Utrera km 1, 41013 Seville, Spain
- Beatríz Calero-Delgado
- Food Biotechnology Department, Instituto de la Grasa (C.S.I.C.), University Campus Pablo de Olavide, Building 46, Crta. de Utrera km 1, 41013 Seville, Spain
- Andrea Porceddu
- Dipartimento di Agraria, Università degli Studi di Sassari, Viale Italia 39, Sassari, Italy
- Marilena Budroni
- Dipartimento di Agraria, Università degli Studi di Sassari, Viale Italia 39, Sassari, Italy
- Ilaria Mannazzu
- Dipartimento di Agraria, Università degli Studi di Sassari, Viale Italia 39, Sassari, Italy
- Rufino Jiménez-Díaz
- Food Biotechnology Department, Instituto de la Grasa (C.S.I.C.), University Campus Pablo de Olavide, Building 46, Crta. de Utrera km 1, 41013 Seville, Spain
- Francisco Noé Arroyo-López
- Food Biotechnology Department, Instituto de la Grasa (C.S.I.C.), University Campus Pablo de Olavide, Building 46, Crta. de Utrera km 1, 41013 Seville, Spain
|
29
|
Mazumdar P, Binti Othman R, Mebus K, Ramakrishnan N, Ann Harikrishna J. Codon usage and codon pair patterns in non-grass monocot genomes. Ann Bot 2017; 120:893-909. [PMID: 29155926 PMCID: PMC5710610 DOI: 10.1093/aob/mcx112] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Received: 01/17/2017] [Accepted: 09/19/2017] [Indexed: 05/19/2023]
Abstract
BACKGROUND AND AIMS: Studies on codon usage in monocots have focused on grasses, and observed patterns of this taxon were generalized to all monocot species. Here, non-grass monocot species were analysed to investigate the differences between grass and non-grass monocots.
METHODS: First, studies of codon usage in monocots were reviewed. The current information was then extended regarding codon usage, as well as codon-pair context bias, using four completely sequenced non-grass monocot genomes (Musa acuminata, Musa balbisiana, Phoenix dactylifera and Spirodela polyrhiza) for which comparable transcriptome datasets are available. Measurements were taken regarding relative synonymous codon usage, effective number of codons, derived optimal codon and GC content and then the relationships investigated to infer the underlying evolutionary forces.
KEY RESULTS: The research identified optimal codons, rare codons and preferred codon-pair context in the non-grass monocot species studied. In contrast to the bimodal distribution of GC3 (GC content in third codon position) in grasses, non-grass monocots showed a unimodal distribution. Disproportionate use of G and C (and of A and T) in two- and four-codon amino acids detected in the analysis rules out the mutational bias hypothesis as an explanation of genomic variation in GC content. There was found to be a positive relationship between CAI (codon adaptation index; predicts the level of expression of a gene) and GC3. In addition, a strong correlation was observed between coding and genomic GC content and negative correlation of GC3 with gene length, indicating a strong impact of GC-biased gene conversion (gBGC) in shaping codon usage and nucleotide composition in non-grass monocots.
CONCLUSION: Optimal codons in these non-grass monocots show a preference for G/C in the third codon position. These results support the concept that codon usage and nucleotide composition in non-grass monocots are mainly driven by gBGC.
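As a rough illustration of the GC3 measure named in this abstract (GC content at the third codon position), a minimal Python sketch follows. The function name `gc3` and the toy sequence are hypothetical and are not taken from the paper; the sketch assumes an in-frame coding sequence and simply counts G and C, so ambiguous bases are treated as non-GC.

```python
def gc3(cds: str) -> float:
    """Return the fraction of G/C bases at third codon positions.

    Assumes `cds` is an in-frame coding sequence (hypothetical helper,
    not the authors' pipeline).
    """
    seq = cds.upper()
    # Ignore a trailing partial codon so only complete codons are counted.
    usable = len(seq) - len(seq) % 3
    third_positions = seq[2:usable:3]
    if not third_positions:
        raise ValueError("sequence contains no complete codon")
    gc_count = sum(base in "GC" for base in third_positions)
    return gc_count / len(third_positions)


# Toy example: codons ATG GCC AAA TTC have third bases G, C, A, C.
print(gc3("ATGGCCAAATTC"))  # 0.75
```

Applied per gene, values like this are what a unimodal or bimodal GC3 distribution would be built from.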
Affiliation(s)
- Purabi Mazumdar
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- RofinaYasmin Binti Othman
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
- Katharina Mebus
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- N Ramakrishnan
- Electrical and Computer System Engineering, School of Engineering, Monash University Malaysia, Bandar Sunway, Malaysia
- Jennifer Ann Harikrishna
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
|
30
|
Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC bias. Nat Ecol Evol 2017; 2:164-173. [PMID: 29158556 PMCID: PMC5733138 DOI: 10.1038/s41559-017-0372-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Received: 04/05/2017] [Accepted: 10/09/2017] [Indexed: 11/29/2022]
Abstract
GC-favoring gene conversion enables fixation of deleterious alleles, disturbs tests of natural selection and potentially explains both the evolution of recombination as well as the commonly reported intra-genomic correlation between G+C content and recombination rate. In addition, gene conversion disturbs linkage disequilibrium, potentially affecting the ability to detect causative variants. However, the importance and generality of these effects is unresolved, not simply because direct analyses are technically challenging but also because prior within- and between-species discrepant results can be hard to appraise owing to methodological differences. Here we report results of methodologically uniform whole-genome sequencing of all tetrad products in Saccharomyces, Neurospora, Chlamydomonas and Arabidopsis. The proportion of polymorphic markers converted varies over three orders of magnitude between species (from 2% of markers converted in yeast to only ~0.005% in the two plants) with at least 87.5% of the variance in per tetrad conversion rates being between-species. This is largely owing to differences in recombination rate and median tract length. Despite three of the species showing a positive GC-recombination correlation, there is no significant net AT->GC conversion bias in any, despite relatively high resolution in the two taxa (Saccharomyces and Neurospora) with relatively common gene conversion. The absence of a GC bias means: 1) that there should be no presumption that gene conversion is GC biased, nor 2) that a GC-recombination correlation necessarily implies biased gene conversion, 3) that Ka/Ks tests should be unaffected in these species and 4) it is unlikely that gene conversion explains the evolution of recombination.
|
31
|
Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates. Proc Natl Acad Sci U S A 2017; 114:12779-12784. [PMID: 29138319 DOI: 10.1073/pnas.1708151114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Indexed: 12/18/2022]
Abstract
Gene conversion is the copying of a genetic sequence from a "donor" region to an "acceptor." In nonallelic gene conversion (NAGC), the donor and the acceptor are at distinct genetic loci. Despite the role NAGC plays in various genetic diseases and the concerted evolution of gene families, the parameters that govern NAGC are not well characterized. Here, we survey duplicate gene families and identify converted tracts in 46% of them. These conversions reflect a large GC bias of NAGC. We develop a sequence evolution model that leverages substantially more information in duplicate sequences than used by previous methods and use it to estimate the parameters that govern NAGC in humans: a mean converted tract length of 250 bp and a probability of [Formula: see text] per generation for a nucleotide to be converted (an order of magnitude higher than the point mutation rate). Despite this high baseline rate, we show that NAGC slows down as duplicate sequences diverge, until an eventual "escape" of the sequences from its influence. As a result, NAGC has a small average effect on the sequence divergence of duplicates. This work improves our understanding of the NAGC mechanism and the role that it plays in the evolution of gene duplicates.
|
32
|
Niu Z, Xue Q, Wang H, Xie X, Zhu S, Liu W, Ding X. Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus. Int J Mol Sci 2017; 18:E2307. [PMID: 29099062 PMCID: PMC5713276 DOI: 10.3390/ijms18112307] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Received: 08/29/2017] [Revised: 09/27/2017] [Accepted: 10/20/2017] [Indexed: 01/03/2023]
Abstract
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences. Dendrobium is one of the largest genera of orchids. Evolutionary studies of the plastomic organization and base composition are limited in this genus. In this study, we obtained the high-quality plastome sequences of D. loddigesii and D. devonianum. The comparison results showed a nearly identical organization in Dendrobium plastomes, indicating that the plastomic organization is highly conserved in the Dendrobium genus. Furthermore, the impact of three evolutionary forces, namely selection, mutational biases, and GC-biased gene conversion (gBGC), on the variation of GC content in Dendrobium plastomes was evaluated. Our results revealed: (1) consistent GC content evolution trends and mutational biases in single-copy (SC) and inverted repeats (IRs) regions; and (2) that gBGC has influenced the plastome-wide GC content evolution. These results suggest that both mutational biases and gBGC affect GC content in the plastomes of the Dendrobium genus.
Affiliation(s)
- Zhitao Niu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Qingyun Xue
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Hui Wang
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Xuezhu Xie
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Shuying Zhu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Wei Liu
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
- Xiaoyu Ding
- College of Life Sciences, Nanjing Normal University, Nanjing 210023, China.
|
33
|
Krasovec M, Eyre-Walker A, Sanchez-Ferandin S, Piganeau G. Spontaneous Mutation Rate in the Smallest Photosynthetic Eukaryotes. Mol Biol Evol 2017; 34:1770-1779. [PMID: 28379581 PMCID: PMC5455958 DOI: 10.1093/molbev/msx119] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Indexed: 12/20/2022]
Abstract
Mutation is the ultimate source of genetic variation, and knowledge of mutation rates is fundamental for our understanding of all evolutionary processes. High throughput sequencing of mutation accumulation lines has provided genome wide spontaneous mutation rates in a dozen model species, but estimates from nonmodel organisms from much of the diversity of life are very limited. Here, we report mutation rates in four haploid marine bacterial-sized photosynthetic eukaryotic algae: Bathycoccus prasinos, Ostreococcus tauri, Ostreococcus mediterraneus, and Micromonas pusilla. The spontaneous mutation rate between species varies from μ = 4.4 × 10⁻¹⁰ to 9.8 × 10⁻¹⁰ mutations per nucleotide per generation. Within genomes, there is a two-fold increase of the mutation rate in intergenic regions, consistent with an optimization of mismatch and transcription-coupled DNA repair in coding sequences. Additionally, we show that deviation from the equilibrium GC content increases the mutation rate by ∼2% to ∼12% because of a GC bias in coding sequences. More generally, the difference between the observed and equilibrium GC content of genomes explains some of the inter-specific variation in mutation rates.
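For readers unfamiliar with the term, the equilibrium GC content mentioned in this abstract is commonly computed, under a simple two-state mutation model with no selection or gBGC, as the AT → GC rate divided by the sum of the AT → GC and GC → AT rates. The sketch below assumes that textbook model; the function name and the example rates are hypothetical and are not values from the paper.

```python
def equilibrium_gc(rate_at_to_gc: float, rate_gc_to_at: float) -> float:
    """Equilibrium GC fraction under a two-state mutation model.

    rate_at_to_gc: mutation rate per AT site toward GC (assumed input)
    rate_gc_to_at: mutation rate per GC site toward AT (assumed input)
    """
    total = rate_at_to_gc + rate_gc_to_at
    if total <= 0:
        raise ValueError("at least one rate must be positive")
    return rate_at_to_gc / total


# Illustrative rates only: GC -> AT mutations twice as frequent as AT -> GC.
print(equilibrium_gc(1.0, 2.0))
```

A genome whose observed GC content sits above this value is expected to lose GC under mutation alone, which is the kind of deviation the abstract refers to.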
Affiliation(s)
- Marc Krasovec
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
- Adam Eyre-Walker
- Evolution, behaviour and environment, School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Sophie Sanchez-Ferandin
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
- Gwenael Piganeau
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, Biologie Intégrative des Organismes Marins (BIOM), Observatoire Océanologique, Banyuls/Mer, France
|
34
|
Evolutionary forces affecting synonymous variations in plant genomes. PLoS Genet 2017; 13:e1006799. [PMID: 28531201 PMCID: PMC5460877 DOI: 10.1371/journal.pgen.1006799] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 12/08/2016] [Revised: 06/06/2017] [Accepted: 05/04/2017] [Indexed: 01/04/2023]
Abstract
Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow joint estimation of the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones.
In protein coding genes, base composition strongly varies within and among plant genomes, especially at positions where changes do not alter the coded protein (synonymous variations). Some species, such as the model plant Arabidopsis thaliana, are relatively GC-poor and homogeneous while others, such as grasses, are highly heterogeneous and GC-rich. The causes of these variations are still debated: are they mainly due to selective or neutral processes? Answering this question is important to correctly infer whether variations in base composition may have functional roles or not. We extended a population genetics method to jointly estimate the different forces that may affect synonymous variations and applied it to genomic datasets in 11 flowering plant species. We found that GC-biased gene conversion, a neutral process associated with recombination that mimics selection by favouring G and C bases, is a widespread and stronger process than selection and that it could explain the large variation in base composition observed in plant genomes. Our results bear implications for analysing plant genomes and for correctly interpreting what could be functional or not.
|
35
|
Bossert S, Murray EA, Blaimer BB, Danforth BN. The impact of GC bias on phylogenetic accuracy using targeted enrichment phylogenomic data. Mol Phylogenet Evol 2017; 111:149-157. [PMID: 28390323 DOI: 10.1016/j.ympev.2017.03.022] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Received: 02/01/2017] [Revised: 03/06/2017] [Accepted: 03/24/2017] [Indexed: 01/08/2023]
Abstract
The field of sequence based phylogenetic analyses is currently being transformed by novel hybrid-based targeted enrichment methods, such as the use of ultraconserved elements (UCEs). Rather than analyzing relationships among organisms using a small number of genes, these methods now allow us to evaluate relationships with many hundreds to thousands of individual gene loci. However, the inclusion of thousands of loci does not necessarily overcome the long-standing challenge of incongruence among phylogenetic trees derived from different genes or gene regions. One factor that impacts the level of incongruence in phylogenomic data sets is the level of GC bias. GC rich gene regions are prone to higher recombination rates than AT rich regions, driven by a process referred to as "GC biased gene conversion". As a result, high GC content can be negatively associated with phylogenetic accuracy, but the extent to which this impacts incongruence among UCEs is currently unstudied. We investigated the impact of GC content on phylogeny reconstruction using in silico captured UCE data for the corbiculate bees (Hymenoptera: Apidae). The phylogeny of this group has been the subject of extensive study, and incongruence among gene trees is thought to be a source of phylogenetic error. We conducted coalescent- and concatenation-based analyses of 810 individual gene loci from all 13 currently available bee genomes, including 8 corbiculate taxa. Both coalescent- and concatenation-based methods converged on a single topology for the corbiculate tribes. In contrast to concatenation, the coalescent-based methods revealed significant topological conflict at nodes involving the orchid bees (Euglossini) and honeybees (Apini). Partitioning the loci by GC content reveals decreasing support for the inferred topology with increasing GC bias. 
Based on the results of this study, we report the first evidence that GC biased gene conversion may contribute to topological incongruence in studies based on ultraconserved elements.
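The GC-based partitioning described in this abstract can be illustrated with a minimal sketch. It assumes each locus is available as a plain nucleotide string; the function names `gc_content` and `partition_by_gc`, the locus names, and the choice of three bins are hypothetical and are not taken from the study.

```python
def gc_content(seq):
    """Fraction of G/C among unambiguous A/C/G/T bases (gaps and Ns are ignored)."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else 0.0


def partition_by_gc(loci, n_bins=3):
    """Rank loci by GC content and split them into n_bins groups of near-equal size.

    `loci` maps a locus name to its sequence. The first bin holds the most
    AT-rich loci and the last bin the most GC-rich ones.
    """
    ranked = sorted(loci, key=lambda name: gc_content(loci[name]))
    size, extra = divmod(len(ranked), n_bins)
    bins, start = [], 0
    for i in range(n_bins):
        end = start + size + (1 if i < extra else 0)
        bins.append(ranked[start:end])
        start = end
    return bins


# Hypothetical toy loci, for illustration only.
loci = {"uce-1": "ATATATGC", "uce-2": "GCGCGCAT", "uce-3": "ATGCATGC"}
bins = partition_by_gc(loci, n_bins=3)
```

Each bin could then be analysed separately, so that support for a topology can be compared across low-GC and high-GC subsets.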
Affiliation(s)
- Silas Bossert
  - Department of Entomology, Cornell University, Ithaca, New York, USA
- Bonnie B Blaimer
  - Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- Bryan N Danforth
  - Department of Entomology, Cornell University, Ithaca, New York, USA

36. Badouin H, Gladieux P, Gouzy J, Siguenza S, Aguileta G, Snirc A, Le Prieur S, Jeziorski C, Branca A, Giraud T. Widespread selective sweeps throughout the genome of model plant pathogenic fungi and identification of effector candidates. Mol Ecol 2017; 26:2041-2062. [DOI: 10.1111/mec.13976] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Received: 05/28/2016] [Revised: 12/15/2016] [Accepted: 12/19/2016] [Indexed: 12/11/2022]
Affiliation(s)
- H. Badouin
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- P. Gladieux
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
  - UMR BGPI; Campus International de Baillarguet; INRA; 34398 Montpellier France
- J. Gouzy
  - Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR441; INRA; 31326 Castanet-Tolosan France
  - Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR2594; CNRS; 31326 Castanet-Tolosan France
- S. Siguenza
  - Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR441; INRA; 31326 Castanet-Tolosan France
  - Laboratoire des Interactions Plantes-Microorganismes (LIPM); UMR2594; CNRS; 31326 Castanet-Tolosan France
- G. Aguileta
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- A. Snirc
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- S. Le Prieur
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- C. Jeziorski
  - Genotoul; GeT-PlaGe; INRA Auzeville 31326 Castanet-Tolosan France
  - UAR1209; INRA Auzeville 31326 Castanet-Tolosan France
- A. Branca
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France
- T. Giraud
  - Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech; Université Paris-Saclay; 91400 Orsay France

37. Liu H, Jia Y, Sun X, Tian D, Hurst LD, Yang S. Direct Determination of the Mutation Rate in the Bumblebee Reveals Evidence for Weak Recombination-Associated Mutation and an Approximate Rate Constancy in Insects. Mol Biol Evol 2017; 34:119-130. [PMID: 28007973 PMCID: PMC5854123 DOI: 10.1093/molbev/msw226] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Indexed: 01/06/2023] Open
Abstract
Accurate knowledge of the mutation rate provides a base line for inferring expected rates of evolution, for testing evolutionary hypotheses and for estimation of key parameters. Advances in sequencing technology now permit direct estimates of the mutation rate from sequencing of close relatives. Within insects there have been three prior such estimates, two in nonsocial insects (Drosophila: 2.8 × 10⁻⁹ per bp per haploid genome per generation; Heliconius: 2.9 × 10⁻⁹) and one in a social species, the honeybee (3.4 × 10⁻⁹). Might the honeybee's rate be ∼20% higher because it has an exceptionally high recombination rate and recombination may be directly or indirectly mutagenic? To address this possibility, we provide a direct estimate of the mutation rate in the bumblebee (Bombus terrestris), this being a close relative of the honeybee but with a much lower recombination rate. We confirm that the crossover rate of the bumblebee is indeed much lower than that of honeybees (8.7 cM/Mb vs. 37 cM/Mb). Importantly, we find no significant difference in the mutation rates: we estimate for bumblebees a rate of 3.6 × 10⁻⁹ per haploid genome per generation (95% confidence intervals 2.38 × 10⁻⁹ and 5.37 × 10⁻⁹) which is just 5% higher than the estimate for honeybees. Both genomes have approximately one new mutation per haploid genome per generation. While we find evidence for a direct coupling between recombination and mutation (also seen in honeybees), the effect is so weak as to leave almost no footprint on any between-species differences. The similarity in mutation rates suggests an approximate constancy of the mutation rate in insects.
Affiliation(s)
- Haoxuan Liu
  - State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Yanxiao Jia
  - State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Xiaoguang Sun
  - State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Dacheng Tian
  - State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
- Laurence D Hurst
  - Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Sihai Yang
  - State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China

38. Korunes KL, Noor MAF. Gene conversion and linkage: effects on genome evolution and speciation. Mol Ecol 2016; 26:351-364. [DOI: 10.1111/mec.13736] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Received: 01/15/2016] [Revised: 06/07/2016] [Accepted: 06/22/2016] [Indexed: 12/12/2022]

39. Smeds L, Mugal CF, Qvarnström A, Ellegren H. High-Resolution Mapping of Crossover and Non-crossover Recombination Events by Whole-Genome Re-sequencing of an Avian Pedigree. PLoS Genet 2016; 12:e1006044. [PMID: 27219623 PMCID: PMC4878770 DOI: 10.1371/journal.pgen.1006044] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Received: 11/05/2015] [Accepted: 04/19/2016] [Indexed: 01/04/2023] Open
Abstract
Recombination is an engine of genetic diversity and therefore constitutes a key process in evolutionary biology and genetics. While the outcome of crossover recombination can readily be detected as shuffled alleles by following the inheritance of markers in pedigreed families, the more precise location of both crossover and non-crossover recombination events has been difficult to pinpoint. As a consequence, we lack a detailed portrait of the recombination landscape for most organisms and knowledge on how this landscape impacts on sequence evolution at a local scale. To localize recombination events with high resolution in an avian system, we performed whole-genome re-sequencing at high coverage of a complete three-generation collared flycatcher pedigree. We identified 325 crossovers at a median resolution of 1.4 kb, with 86% of the events localized to <10 kb intervals. Observed crossover rates were in excellent agreement with data from linkage mapping, were 52% higher in male (3.56 cM/Mb) than in female meiosis (2.28 cM/Mb), and increased towards chromosome ends in male but not female meiosis. Crossover events were non-randomly distributed in the genome with several distinct hot-spots and a concentration to genic regions, with the highest density in promoters and CpG islands. We further identified 267 non-crossovers, whose location was significantly associated with crossover locations. We detected a significant transmission bias (0.18) in favour of 'strong' (G, C) over 'weak' (A, T) alleles at non-crossover events, providing direct evidence for the process of GC-biased gene conversion in an avian system. The approach taken in this study should be applicable to any species and would thereby help to provide a more comprehensive portrait of the recombination landscape across organism groups.
Affiliation(s)
- Linnéa Smeds
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Carina F. Mugal
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Anna Qvarnström
  - Department of Animal Ecology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Hans Ellegren
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden

40. Yahara K, Didelot X, Jolley KA, Kobayashi I, Maiden MCJ, Sheppard SK, Falush D. The Landscape of Realized Homologous Recombination in Pathogenic Bacteria. Mol Biol Evol 2016; 33:456-71. [PMID: 26516092 PMCID: PMC4866539 DOI: 10.1093/molbev/msv237] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Indexed: 12/24/2022] Open
Abstract
Recombination enhances the adaptive potential of organisms by allowing genetic variants to be tested on multiple genomic backgrounds. Its distribution in the genome can provide insight into the evolutionary forces that underlie traits, such as the emergence of pathogenicity. Here, we examined landscapes of realized homologous recombination of 500 genomes from ten bacterial species and found all species have "hot" regions with elevated rates relative to the genome average. We examined the size, gene content, and chromosomal features associated with these regions and the correlations between closely related species. The recombination landscape is variable and evolves rapidly. For example in Salmonella, only short regions of around 1 kb in length are hot whereas in the closely related species Escherichia coli, some hot regions exceed 100 kb, spanning many genes. Only Streptococcus pyogenes shows evidence for the positive correlation between GC content and recombination that has been reported for several eukaryotes. Genes with function related to the cell surface/membrane are often found in recombination hot regions but E. coli is the only species where genes annotated as "virulence associated" are consistently hotter. There is also evidence that some genes with "housekeeping" functions tend to be overrepresented in cold regions. For example, ribosomal proteins showed low recombination in all of the species. Among specific genes, transferrin-binding proteins are recombination hot in all three of the species in which they were found, and are subject to interspecies recombination.
Affiliation(s)
- Koji Yahara
  - Biostatistics Center, Kurume University, Kurume, Fukuoka, Japan
  - College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom
- Xavier Didelot
  - Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
- Keith A Jolley
  - Department of Zoology, University of Oxford, Oxford, United Kingdom
- Ichizo Kobayashi
  - Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Samuel K Sheppard
  - College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom
  - Department of Zoology, University of Oxford, Oxford, United Kingdom
- Daniel Falush
  - College of Medicine, Institute of Life Science, Swansea University, Swansea, United Kingdom
  - Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan

41. Sundararajan A, Dukowic-Schulze S, Kwicklis M, Engstrom K, Garcia N, Oviedo OJ, Ramaraj T, Gonzales MD, He Y, Wang M, Sun Q, Pillardy J, Kianian SF, Pawlowski WP, Chen C, Mudge J. Gene Evolutionary Trajectories and GC Patterns Driven by Recombination in Zea mays. Front Plant Sci 2016; 7:1433. [PMID: 27713757 PMCID: PMC5031598 DOI: 10.3389/fpls.2016.01433] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 07/20/2016] [Accepted: 09/08/2016] [Indexed: 05/20/2023]
Abstract
Recombination occurring during meiosis is critical for creating genetic variation and plays an essential role in plant evolution. In addition to creating novel gene combinations, recombination can affect genome structure through altering GC patterns. In maize (Zea mays) and other grasses, another intriguing GC pattern exists. Maize genes show a bimodal GC content distribution that has been attributed to nucleotide bias in the third, or wobble, position of the codon. Recombination may be an underlying driving force given that recombination sites are often associated with high GC content. Here we explore the relationship between recombination and genomic GC patterns by comparing GC gene content at each of the three codon positions (GC1, GC2, and GC3, collectively termed GCx) to instances of a variable GC-rich motif that underlies double strand break (DSB) hotspots and to meiocyte-specific gene expression. Surprisingly, GCx bimodality in maize cannot be fully explained by the codon wobble hypothesis. High GCx genes show a strong overlap with the DSB hotspot motif, possibly providing a mechanism for the high evolutionary rates seen in these genes. On the other hand, genes that are turned on in meiosis (early prophase I) are biased against both high GCx genes and genes with the DSB hotspot motif, possibly allowing important meiotic genes to avoid DSBs. Our data suggests a strong link between the GC-rich motif underlying DSB hotspots and high GCx genes.
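The GC1, GC2 and GC3 quantities named in this abstract are the GC fractions at the first, second and third codon positions. A minimal sketch of that calculation follows, assuming an in-frame coding sequence supplied as a plain string; the function name `gc_by_codon_position` and the example sequence are hypothetical and not part of the study's pipeline.

```python
def gc_by_codon_position(cds):
    """Return (GC1, GC2, GC3) for an in-frame coding sequence.

    Trailing bases that do not complete a codon are dropped, and any
    character other than A, C, G or T is ignored at its position.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # length of the whole-codon prefix
    fractions = []
    for pos in range(3):
        # Every third base, starting at codon position pos + 1.
        counted = [b for b in cds[pos:usable:3] if b in "ACGT"]
        gc = sum(1 for b in counted if b in "GC")
        fractions.append(gc / len(counted) if counted else 0.0)
    return tuple(fractions)


# Hypothetical toy sequence with codons ATG, GCC, GCG.
gc1, gc2, gc3 = gc_by_codon_position("ATGGCCGCG")
```

Here GC3 is 1.0 because all three third-position bases are G or C, while GC1 and GC2 are each two thirds.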
Affiliation(s)
- Nathan Garcia
  - National Center for Genome Resources, Santa Fe, NM, USA
- Yan He
  - Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
- Minghui Wang
  - Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
  - Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
- Qi Sun
  - Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
- Jaroslaw Pillardy
  - Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, NY, USA
- Shahryar F. Kianian
  - Cereal Disease Laboratory, United States Department of Agriculture – Agricultural Research Service, St. Paul, MN, USA
- Wojciech P. Pawlowski
  - Section of Plant Biology, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
- Changbin Chen
  - Department of Horticultural Science, University of Minnesota, St. Paul, MN, USA
- Joann Mudge
  - National Center for Genome Resources, Santa Fe, NM, USA
  - *Correspondence: Joann Mudge

42. Keith N, Tucker AE, Jackson CE, Sung W, Lucas Lledó JI, Schrider DR, Schaack S, Dudycha JL, Ackerman M, Younge AJ, Shaw JR, Lynch M. High mutational rates of large-scale duplication and deletion in Daphnia pulex. Genome Res 2016; 26:60-9. [PMID: 26518480 PMCID: PMC4691751 DOI: 10.1101/gr.191338.115] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Received: 02/20/2015] [Accepted: 10/13/2015] [Indexed: 02/06/2023]
Abstract
Knowledge of the genome-wide rate and spectrum of mutations is necessary to understand the origin of disease and the genetic variation driving all evolutionary processes. Here, we provide a genome-wide analysis of the rate and spectrum of mutations obtained in two Daphnia pulex genotypes via separate mutation-accumulation (MA) experiments. Unlike most MA studies that utilize haploid, homozygous, or self-fertilizing lines, D. pulex can be propagated ameiotically while maintaining a naturally heterozygous, diploid genome, allowing the capture of the full spectrum of genomic changes that arise in a heterozygous state. While base-substitution mutation rates are similar to those in other multicellular eukaryotes (about 4 × 10⁻⁹ per site per generation), we find that the rates of large-scale (>100 kb) de novo copy-number variants (CNVs) are significantly elevated relative to those seen in previous MA studies. The heterozygosity maintained in this experiment allowed for estimates of gene-conversion processes. While most of the conversion tract lengths we report are similar to those generated by meiotic processes, we also find larger tract lengths that are indicative of mitotic processes. Comparison of MA lines to natural isolates reveals that a majority of large-scale CNVs in natural populations are removed by purifying selection. The mutations observed here share similarities with disease-causing, complex, large-scale CNVs, thereby demonstrating that MA studies in D. pulex serve as a system for studying the processes leading to such alterations.
Affiliation(s)
- Nathan Keith
  - School of Public and Environmental Affairs, Indiana University, Bloomington, Indiana 47405, USA
- Abraham E Tucker
  - Biology Department, Southern Arkansas University, Magnolia, Arkansas 71753, USA
- Craig E Jackson
  - School of Public and Environmental Affairs, Indiana University, Bloomington, Indiana 47405, USA
- Way Sung
  - Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
- Daniel R Schrider
  - Department of Genetics, Rutgers University, Piscataway, New Jersey 08854, USA
- Sarah Schaack
  - Biology Department, Reed College, Portland, Oregon 97202, USA
- Jeffry L Dudycha
  - Department of Biological Sciences, University of South Carolina, Columbia, South Carolina 29208, USA
- Matthew Ackerman
  - Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
- Andrew J Younge
  - School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405, USA
- Joseph R Shaw
  - School of Public and Environmental Affairs, Indiana University, Bloomington, Indiana 47405, USA
  - School of Biosciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
- Michael Lynch
  - Department of Biology, Indiana University, Bloomington, Indiana 47405, USA

43. Ness RW, Kraemer SA, Colegrave N, Keightley PD. Direct Estimate of the Spontaneous Mutation Rate Uncovers the Effects of Drift and Recombination in the Chlamydomonas reinhardtii Plastid Genome. Mol Biol Evol 2015; 33:800-8. [DOI: 10.1093/molbev/msv272] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Indexed: 12/13/2022] Open

44. Hillmer M, Wagner D, Summerer A, Daiber M, Mautner VF, Messiaen L, Cooper DN, Kehrer-Sawatzki H. Fine mapping of meiotic NAHR-associated crossovers causing large NF1 deletions. Hum Mol Genet 2015; 25:484-96. [PMID: 26614388 DOI: 10.1093/hmg/ddv487] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Received: 11/12/2015] [Accepted: 11/19/2015] [Indexed: 02/06/2023] Open
Abstract
Large deletions encompassing the NF1 gene and its flanking regions belong to the group of genomic disorders caused by copy number changes that are mediated by the local genomic architecture. Although nonallelic homologous recombination (NAHR) is known to be a major mutational mechanism underlying such genomic copy number changes, the sequence determinants of NAHR location and frequency are still poorly understood since few high-resolution mapping studies of NAHR hotspots have been performed to date. Here, we have characterized two NAHR hotspots, PRS1 and PRS2, separated by 20 kb and located within the low-copy repeats NF1-REPa and NF1-REPc, which flank the human NF1 gene region. High-resolution mapping of the crossover sites identified in 78 type 1 NF1 deletions mediated by NAHR indicated that PRS2 is a much stronger NAHR hotspot than PRS1 since 80% of these deletions exhibited crossovers within PRS2, whereas 20% had crossovers within PRS1. The identification of the most common strand exchange regions of these 78 deletions served to demarcate the cores of the PRS1 and PRS2 hotspots encompassing 1026 and 1976 bp, respectively. Several sequence features were identified that may influence hotspot intensity and direct the positional preference of NAHR to the hotspot cores. These features include regions of perfect sequence identity encompassing 700 bp at the hotspot core, the presence of PRDM9 binding sites perfectly matching the consensus motif for the most common PRDM9 variant, specific pre-existing patterns of histone modification and open chromatin conformations that are likely to facilitate PRDM9 binding.
Affiliation(s)
- Morten Hillmer
  - Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
- David Wagner
  - Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
- Anna Summerer
  - Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
- Michaela Daiber
  - Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
- Victor-Felix Mautner
  - Department of Neurology, University Hospital Hamburg Eppendorf, 20246 Hamburg, Germany
- Ludwine Messiaen
  - Medical Genomics Laboratory, Department of Genetics, University of Alabama at Birmingham, Birmingham, AL 35242, USA
- David N Cooper
  - Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK

45. Mugal CF, Weber CC, Ellegren H. GC-biased gene conversion links the recombination landscape and demography to genomic base composition. Bioessays 2015; 37:1317-26. [DOI: 10.1002/bies.201500058] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Indexed: 12/22/2022]
Affiliation(s)
- Carina F. Mugal
  - Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
- Claudia C. Weber
  - Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
  - Department of Biology; Center for Computational Genetics and Genomics; Temple University; Philadelphia PA USA
- Hans Ellegren
  - Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden

46. Bolívar P, Mugal CF, Nater A, Ellegren H. Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill-Robertson Interference, in an Avian System. Mol Biol Evol 2015; 33:216-27. [PMID: 26446902 PMCID: PMC4693978 DOI: 10.1093/molbev/msv214] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Indexed: 12/11/2022] Open
Abstract
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
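The four substitution classes used in this abstract (S-to-S, S-to-W, W-to-S and W-to-W) follow from treating G and C as "strong" and A and T as "weak". The sketch below only counts observed differences between an assumed ancestral and a derived sequence of equal length; it is a simplified illustration, not the rate-estimation model used by the authors, and the names `classify` and `count_classes` are hypothetical.

```python
from collections import Counter

STRONG = {"G", "C"}  # "strong" (S) bases
WEAK = {"A", "T"}    # "weak" (W) bases


def classify(ancestral, derived):
    """Label one site as 'S-to-S', 'S-to-W', 'W-to-S' or 'W-to-W'.

    Returns None when the bases are identical or either is not A, C, G or T.
    """
    a, d = ancestral.upper(), derived.upper()
    valid = STRONG | WEAK
    if a == d or a not in valid or d not in valid:
        return None
    return "{}-to-{}".format("S" if a in STRONG else "W",
                             "S" if d in STRONG else "W")


def count_classes(ancestral_seq, derived_seq):
    """Count substitutions per class between two aligned sequences."""
    if len(ancestral_seq) != len(derived_seq):
        raise ValueError("sequences must be aligned to the same length")
    counts = Counter()
    for a, d in zip(ancestral_seq, derived_seq):
        label = classify(a, d)
        if label is not None:
            counts[label] += 1
    return counts


# Hypothetical aligned pair, for illustration only.
counts = count_classes("AACGGT", "GTCAGC")
```

Under gBGC, the W-to-S count would be expected to rise relative to the S-to-W count in regions of higher recombination, which is the pattern the abstract reports for substitution rates.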
Affiliation(s)
- Paulina Bolívar
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Carina F Mugal
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Alexander Nater
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Hans Ellegren
  - Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden

47. The relationship of recombination rate, genome structure, and patterns of molecular evolution across angiosperms. BMC Evol Biol 2015; 15:194. [PMID: 26377000 PMCID: PMC4574184 DOI: 10.1186/s12862-015-0473-3] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Received: 02/19/2015] [Accepted: 09/01/2015] [Indexed: 12/31/2022] Open
Abstract
Background Although homologous recombination affects the efficacy of selection in populations, the pattern of recombination rate evolution and its effects on genome evolution across plants are largely unknown. Recombination can reduce genome size by enabling the removal of LTR retrotransposons, alter codon usage by GC biased gene conversion, contribute to complex histories of gene duplication and loss through tandem duplication, and enhance purifying selection on genes. Therefore, variation in recombination rate across species may explain some of the variation in genomic architecture as well as rates of molecular evolution. We used phylogenetic comparative methods to investigate the evolution of global meiotic recombination rate in angiosperms and its effects on genome architecture and selection at the molecular level using genetic maps and genome sequences from thirty angiosperm species. Results Recombination rate is negatively correlated with genome size, which is likely caused by the removal of LTR retrotransposons. After correcting recombination rates for euchromatin content, we also found an association between global recombination rate and average gene family size. This suggests a role for recombination in the preservation of duplicate genes or expansion of gene families. An analysis of the correlation between the ratio of nonsynonymous to synonymous substitution rates (dN/dS) and recombination rate in 3748 genes indicates that higher recombination rates are associated with an increased efficacy of purifying selection, suggesting that global recombination rates affect variation in rates of molecular evolution across distantly related angiosperm species, not just between populations. We also identified shifts in dN/dS for recombination proteins that are associated with shifts in global recombination rate across our sample of angiosperms. 
Conclusions Although our analyses only reveal correlations, not mechanisms, and do not include potential covariates of recombination rate, like effective population size, they suggest that global recombination rates may play an important role in shaping the macroevolutionary patterns of gene and genome evolution in plants. Interspecific recombination rate variation is tightly correlated with genome size as well as variation in overall LTR retrotransposon abundances. Recombination may shape gene-to-gene variation in dN/dS between species, which might impact the overall gene duplication and loss rates.

48. Glémin S, Arndt PF, Messer PW, Petrov D, Galtier N, Duret L. Quantification of GC-biased gene conversion in the human genome. Genome Res 2015; 25:1215-28. [PMID: 25995268 PMCID: PMC4510005 DOI: 10.1101/gr.185488.114] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Received: 10/08/2014] [Accepted: 05/18/2015] [Indexed: 11/25/2022]
Abstract
Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC.
Collapse
Affiliation(s)
- Sylvain Glémin
  - Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France; Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
- Peter F Arndt
  - Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
- Philipp W Messer
  - Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Dmitri Petrov
  - Department of Biology, Stanford University, Stanford, California 94305-5020, USA
- Nicolas Galtier
  - Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France
- Laurent Duret
  - Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, 69622 Villeurbanne, France
49
Williams AL, Genovese G, Dyer T, Altemose N, Truax K, Jun G, Patterson N, Myers SR, Curran JE, Duggirala R, Blangero J, Reich D, Przeworski M. Non-crossover gene conversions show strong GC bias and unexpected clustering in humans. eLife 2015; 4. [PMID: 25806687 PMCID: PMC4404656 DOI: 10.7554/elife.04637] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Received: 09/05/2014] [Accepted: 03/20/2015] [Indexed: 12/15/2022] Open
Abstract
Although the past decade has seen tremendous progress in our understanding of fine-scale recombination, little is known about non-crossover (NCO) gene conversion. We report the first genome-wide study of NCO events in humans. Using SNP array data from 98 meioses, we identified 103 sites affected by NCO, of which 50/52 were confirmed in sequence data. Overlap with double-strand break (DSB) hotspots indicates that most of the events are likely of meiotic origin. We estimate that a site is involved in an NCO at a rate of 5.9 × 10⁻⁶/bp/generation, consistent with sperm-typing studies, and infer that tract lengths span at least an order of magnitude. Observed NCO events show strong allelic bias at heterozygous AT/GC SNPs, with 68% (58–78%) transmitting GC alleles (p = 5 × 10⁻⁴). Strikingly, in 4 of 15 regions with resequencing data, multiple disjoint NCO tracts cluster in close proximity (∼20–30 kb), a phenomenon not previously seen in mammals.
Affiliation(s)
- Amy L Williams
  - Department of Biological Sciences, Columbia University, New York, United States
- Giulio Genovese
  - Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
- Thomas Dyer
  - Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
- Nicolas Altemose
  - Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- Katherine Truax
  - Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
- Goo Jun
  - Department of Biostatistics, University of Michigan, Ann Arbor, United States
- Nick Patterson
  - Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
- Simon R Myers
  - Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- Joanne E Curran
  - Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
- Ravi Duggirala
  - Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
- John Blangero
  - Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
- David Reich
  - Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
- Molly Przeworski
  - Department of Biological Sciences, Columbia University, New York, United States
50
Lassalle F, Périan S, Bataillon T, Nesme X, Duret L, Daubin V. GC-Content evolution in bacterial genomes: the biased gene conversion hypothesis expands. PLoS Genet 2015; 11:e1004941. [PMID: 25659072 PMCID: PMC4450053 DOI: 10.1371/journal.pgen.1004941] [Citation(s) in RCA: 135] [Impact Index Per Article: 15.0] [Received: 05/16/2014] [Accepted: 12/08/2014] [Indexed: 11/29/2022] Open
Abstract
The characterization of functional elements in genomes relies on the identification of the footprints of natural selection. In this quest, taking into account neutral evolutionary processes such as mutation and genetic drift is crucial because these forces can generate patterns that may obscure or mimic signatures of selection. In mammals, and probably in many eukaryotes, another such confounding factor called GC-Biased Gene Conversion (gBGC) has been documented. This mechanism generates patterns identical to what is expected under selection for higher GC-content, specifically in highly recombining genomic regions. Recent results have suggested that a mysterious selective force favouring higher GC-content exists in Bacteria but the possibility that it could be gBGC has been excluded. Here, we show that gBGC is probably at work in most if not all bacterial species. First we find a consistent positive relationship between the GC-content of a gene and evidence of intra-genic recombination throughout a broad spectrum of bacterial clades. Second, we show that the evolutionary force responsible for this pattern is acting independently from selection on codon usage, and could potentially interfere with selection in favor of optimal AU-ending codons. A comparison with data from human populations shows that the intensity of gBGC in Bacteria is comparable to what has been reported in mammals. We propose that gBGC is not restricted to sexual Eukaryotes but also widespread among Bacteria and could therefore be an ancestral feature of cellular organisms. We argue that if gBGC occurs in bacteria, it can account for previously unexplained observations, such as the apparent non-equilibrium of base substitution patterns and the heterogeneity of gene composition within bacterial genomes. Because gBGC produces patterns similar to positive selection, it is essential to take this process into account when studying the evolutionary forces at work in bacterial genomes. 
Classical population genetics models indicate that the efficiency of selection, and hence adaptation, depends on a number of non-selective factors, such as the size of a population or the intensity of recombination. In the last 10 years, evidence has accumulated that another mechanism called GC-Biased Gene Conversion (gBGC) can interfere with selection and even mimic its effects. This phenomenon, which arises from a particularity of the recombination machinery, was first thought to be restricted to sexual eukaryotic organisms. Here, we show that this mechanism probably exists in Bacteria and has a strong impact on their genome evolution. This discovery not only explains many previously unconnected features of bacterial genome evolution, but also highlights the importance of non-adaptive evolutionary processes in Bacteria.
Affiliation(s)
- Florent Lassalle
  - Université de Lyon, Lyon, France
  - Université Lyon 1, Villeurbanne, France
  - CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
  - CNRS, UMR 5557, Ecologie Microbienne, Villeurbanne, France
  - INRA, USC 1364, Ecologie Microbienne, Villeurbanne, France
  - Ecole Normale Supérieure de Lyon, Lyon, France
- Séverine Périan
  - Université de Lyon, Lyon, France
  - Université Lyon 1, Villeurbanne, France
  - CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
- Thomas Bataillon
  - Aarhus University, Bioinformatics Research Center, Århus, Denmark
  - Université de Lyon, Lyon, France
- Xavier Nesme
  - Université de Lyon, Lyon, France
  - Université Lyon 1, Villeurbanne, France
  - CNRS, UMR 5557, Ecologie Microbienne, Villeurbanne, France
  - INRA, USC 1364, Ecologie Microbienne, Villeurbanne, France
- Laurent Duret
  - Université de Lyon, Lyon, France
  - Université Lyon 1, Villeurbanne, France
  - CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
- Vincent Daubin
  - Université de Lyon, Lyon, France
  - Université Lyon 1, Villeurbanne, France
  - CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France