1
|
Shevtsov A, Izbanova U, Amirgazin A, Kairzhanova A, Dauletov A, Kiyan V, Vergnaud G. Genetic Homogeneity of Francisella tularensis subsp. mediasiatica Strains in Kazakhstan. Pathogens 2024; 13:581. [PMID: 39057808 PMCID: PMC11279412 DOI: 10.3390/pathogens13070581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 07/05/2024] [Accepted: 07/06/2024] [Indexed: 07/28/2024] Open
Abstract
Tularemia is an acute febrile disease caused by the Gram-negative bacillus Francisella tularensis. Based on genetic and phenotypic characteristics, three subspecies are distinguished: tularensis, holarctica, and mediasiatica. F. tularensis subsp. mediasiatica remains the least studied subspecies. Over the past decade, new foci of distribution of F. tularensis subsp. mediasiatica have been discovered in Russia (Siberia), expanding the possible distribution area by thousands of kilometers. This article provides whole genome single nucleotide polymorphism (wgSNP) and polymorphic tandem repeats (MLVA) analyses of 28 mediasiatica strains isolated between 1965 and 2004 in Kazakhstan. Despite high genetic homogeneity, MLVA with eleven loci (MLVA11) demonstrates a high discriminatory ability (diversity index, 0.9497). The topological structure of the trees based on wgSNP and MLVA is not comparable; however, clustering remains congruent for most outbreaks, with the exception of two strains from one outbreak that are identical in terms of wgSNP but differ at three tandem repeat loci. Based on wgSNP, the strains are assigned to one of the three currently known mediasiatica sublineages, lineage M.I, together with other historical strains maintained in collections in Russia and Sweden. wgSNP shows limited previously unknown genetic diversity, with the M.I lineage size being only 118 SNPs. The wgSNP genotype is not strongly correlated with year and place of isolation.
Collapse
Affiliation(s)
- Alexandr Shevtsov
- National Center for Biotechnology, Astana 010000, Kazakhstan; (A.A.); (A.K.); (A.D.); (V.K.)
| | - Uinkul Izbanova
- Aikimbayev’s National Scientific Center for Especially Dangerous Infections, Almaty 050000, Kazakhstan
| | - Asylulan Amirgazin
- National Center for Biotechnology, Astana 010000, Kazakhstan; (A.A.); (A.K.); (A.D.); (V.K.)
| | - Alma Kairzhanova
- National Center for Biotechnology, Astana 010000, Kazakhstan; (A.A.); (A.K.); (A.D.); (V.K.)
| | - Ayan Dauletov
- National Center for Biotechnology, Astana 010000, Kazakhstan; (A.A.); (A.K.); (A.D.); (V.K.)
| | - Vladimir Kiyan
- National Center for Biotechnology, Astana 010000, Kazakhstan; (A.A.); (A.K.); (A.D.); (V.K.)
| | - Gilles Vergnaud
- Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, 91198 Gif-sur-Yvette, France
| |
Collapse
|
2
|
Lynch M, Ali F, Lin T, Wang Y, Ni J, Long H. The divergence of mutation rates and spectra across the Tree of Life. EMBO Rep 2023; 24:e57561. [PMID: 37615267 PMCID: PMC10561183 DOI: 10.15252/embr.202357561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/01/2023] [Accepted: 08/02/2023] [Indexed: 08/25/2023] Open
Abstract
Owing to advances in genome sequencing, genome stability has become one of the most scrutinized cellular traits across the Tree of Life. Despite its centrality to all things biological, the mutation rate (per nucleotide site per generation) ranges over three orders of magnitude among species and several-fold within individual phylogenetic lineages. Within all major organismal groups, mutation rates scale negatively with the effective population size of a species and with the amount of functional DNA in the genome. This relationship is most parsimoniously explained by the drift-barrier hypothesis, which postulates that natural selection typically operates to reduce mutation rates until further improvement is thwarted by the power of random genetic drift. Despite this constraint, the molecular mechanisms underlying DNA replication fidelity and repair are free to wander, provided the performance of the entire system is maintained at the prevailing level. The evolutionary flexibility of the mutation rate bears on the resolution of several prior conundrums in phylogenetic and population-genetic analysis and raises challenges for future applications in these areas.
Collapse
Affiliation(s)
- Michael Lynch
- Biodesign Center for Mechanisms of EvolutionArizona State UniversityTempeAZUSA
| | - Farhan Ali
- Biodesign Center for Mechanisms of EvolutionArizona State UniversityTempeAZUSA
| | - Tongtong Lin
- Institute of Evolution and Marine Biodiversity, KLMMEOcean University of ChinaQingdaoChina
| | - Yaohai Wang
- Institute of Evolution and Marine Biodiversity, KLMMEOcean University of ChinaQingdaoChina
| | - Jiahao Ni
- Institute of Evolution and Marine Biodiversity, KLMMEOcean University of ChinaQingdaoChina
| | - Hongan Long
- Institute of Evolution and Marine Biodiversity, KLMMEOcean University of ChinaQingdaoChina
| |
Collapse
|
3
|
Wu C, Paradis NJ, Lakernick PM, Hryb M. L-shaped distribution of the relative substitution rate (c/μ) observed for SARS-COV-2's genome, inconsistent with the selectionist theory, the neutral theory and the nearly neutral theory but a near-neutral balanced selection theory: Implication on "neutralist-selectionist" debate. Comput Biol Med 2023; 153:106522. [PMID: 36638615 PMCID: PMC9814386 DOI: 10.1016/j.compbiomed.2022.106522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Revised: 12/17/2022] [Accepted: 12/31/2022] [Indexed: 01/07/2023]
Abstract
The genomic substitution rate (GSR) of SARS-CoV-2 exhibits a molecular clock feature and does not change under fluctuating environmental factors such as the infected human population (10°-107), vaccination etc. The molecular clock feature is believed to be inconsistent with the selectionist theory (ST). The GSR shows lack of dependence on the effective population size, suggesting Ohta's nearly neutral theory (ONNT) is not applicable to this virus. Big variation of the substitution rate within its genome is also inconsistent with Kimura's neutral theory (KNT). Thus, all three existing evolution theories fail to explain the evolutionary nature of this virus. In this paper, we proposed a Segment Substitution Rate Model (SSRM) under non-neutral selections and pointed out that a balanced mechanism between negative and positive selection of some segments that could also lead to the molecular clock feature. We named this hybrid mechanism as near-neutral balanced selection theory (NNBST) and examined if it was followed by SARS-CoV-2 using the three independent sets of SARS-CoV-2 genomes selected by the Nextstrain team. Intriguingly, the relative substitution rate of this virus exhibited an L-shaped probability distribution consisting with NNBST rather than Poisson distribution predicted by KNT or an asymmetric distribution predicted by ONNT in which nearly neutral sites are believed to be slightly deleterious only, or the distribution that is lack of nearly neutral sites predicted by ST. The time-dependence of the substitution rates for some segments and their correlation with the vaccination were observed, supporting NNBST. Our relative substitution rate method provides a tool to resolve the long standing "neutralist-selectionist" controversy. Implications of NNBST in resolving Lewontin's Paradox is also discussed.
Collapse
Affiliation(s)
- Chun Wu
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, NJ, 08028, USA; Department of Biological & Biomedical Sciences, Rowan University, Glassboro, NJ, 08028, USA.
| | - Nicholas J Paradis
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, NJ, 08028, USA
| | - Phillip M Lakernick
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, NJ, 08028, USA
| | - Mariya Hryb
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, NJ, 08028, USA
| |
Collapse
|
4
|
Belinky F, Bykova A, Yurchenko V, Rogozin IB. No evidence for widespread positive selection on double substitutions within codons in primates and yeasts. Front Genet 2022; 13:991249. [PMID: 36159983 PMCID: PMC9500374 DOI: 10.3389/fgene.2022.991249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open
Abstract
Nucleotide substitutions in protein-coding genes can be divided into synonymous (S) and non-synonymous (N) ones that alter amino acids (including nonsense mutations causing stop codons). The S substitutions are expected to have little effect on function. The N substitutions almost always are affected by strong purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases can modulate the deleterious effect of single N substitutions and, thus, could be subjected to the positive selection. This effect has been demonstrated for mutations in the serine codons, stop codons and double N substitutions in prokaryotes. In all abovementioned cases, a novel technique was applied that allows elucidating the effects of selection on double substitutions considering mutational biases. Here, we applied the same technique to study double N substitutions in eukaryotic lineages of primates and yeast. We identified markedly fewer cases of purifying selection relative to prokaryotes and no evidence of codon double substitutions under positive selection. This is consistent with previous studies of serine codons in primates and yeast. In general, the obtained results strongly suggest that there are major differences between studied pro- and eukaryotes; double substitutions in primates and yeasts largely reflect mutational biases and are not hallmarks of selection. This is especially important in the context of detection of positive selection in codons because it has been suggested that multiple mutations in codons cause false inferences of lineage-specific site positive selection. It is likely that this concern is applicable to previously studied prokaryotes but not to primates and yeasts where markedly fewer double substitutions are affected by positive selection.
Collapse
Affiliation(s)
- Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Anastassia Bykova
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
- *Correspondence: Vyacheslav Yurchenko, ; Igor B. Rogozin,
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
- *Correspondence: Vyacheslav Yurchenko, ; Igor B. Rogozin,
| |
Collapse
|
5
|
Reha-Krantz LJ, Goodman MF. John W. (Jan) Drake: A Biochemical View of a Geneticist Par Excellence. Genetics 2020; 216:827-836. [PMID: 33268388 PMCID: PMC7768258 DOI: 10.1534/genetics.120.303813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Accepted: 10/22/2020] [Indexed: 11/18/2022] Open
Abstract
John W. Drake died 02-02-2020, a mathematical palindrome, which he would have enjoyed, given his love of "word play and logic," as stated in his obituary and echoed by his family, friends, students, and colleagues. Many aspects of Jan's career have been reviewed previously, including his early years as a Caltech graduate student, and when he was editor-in-chief, with the devoted assistance of his wife Pam, of this journal for 15 impactful years. During his editorship, he raised the profile of GENETICS as the flagship journal of the Genetics Society of America and inspired and contributed to the creation of the Perspectives column, coedited by Jim Crow and William Dove. At the same time, Jan was building from scratch the Laboratory of Molecular Genetics on the newly established Research Triangle Park campus of the National Institute of Environmental Health Science, which he headed for 30 years. This commentary offers a unique perspective on Jan's legacy; we showcase Jan's 1969 benchmark discovery of antimutagenic T4 DNA polymerases and the research by three generations (and counting) of scientists whose research stems from that groundbreaking discovery. This is followed by a brief discussion of Jan's passion: his overriding interest in analyzing mutation rates across species. Several anecdotal stories are included to bring alive one of Jan's favorite phrases, "to think like a geneticist." We feature Jan's genetical approach to mutation studies, along with the biochemistry of DNA polymerase function, our area of expertise. But in the end, we acknowledge, as Jan did, that genetics, also known as in vivo biochemistry, prevails.
Collapse
Affiliation(s)
- Linda J Reha-Krantz
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Myron F Goodman
- Department of Biological Sciences, University of Southern California, Los Angeles, California 90089
| |
Collapse
|
6
|
Belinky F, Sela I, Rogozin IB, Koonin EV. Crossing fitness valleys via double substitutions within codons. BMC Biol 2019; 17:105. [PMID: 31842858 PMCID: PMC6916188 DOI: 10.1186/s12915-019-0727-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 11/20/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Single nucleotide substitutions in protein-coding genes can be divided into synonymous (S), with little fitness effect, and non-synonymous (N) ones that alter amino acids and thus generally have a greater effect. Most of the N substitutions are affected by purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases potentially could alleviate the deleterious effect of single substitutions, making them subject to positive selection. To elucidate the effects of selection on double substitutions in all codons, it is critical to differentiate selection from mutational biases. RESULTS We addressed the evolutionary regimes of within-codon double substitutions in 37 groups of closely related prokaryotic genomes from diverse phyla by comparing the fractions of double substitutions within codons to those of the equivalent double S substitutions in adjacent codons. Under the assumption that substitutions occur one at a time, all within-codon double substitutions can be represented as "ancestral-intermediate-final" sequences (where "intermediate" refers to the first single substitution and "final" refers to the second substitution) and can be partitioned into four classes: (1) SS, S intermediate-S final; (2) SN, S intermediate-N final; (3) NS, N intermediate-S final; and (4) NN, N intermediate-N final. We found that the selective pressure on the second substitution markedly differs among these classes of double substitutions. Analogous to single S (synonymous) substitutions, SS double substitutions evolve neutrally, whereas analogous to single N (non-synonymous) substitutions, SN double substitutions are subject to purifying selection. In contrast, NS show positive selection on the second step because the original amino acid is recovered. The NN double substitutions are heterogeneous and can be subject to either purifying or positive selection, or evolve neutrally, depending on the amino acid similarity between the final or intermediate and the ancestral states. CONCLUSIONS The results of the present, comprehensive analysis of the evolutionary landscape of within-codon double substitutions reaffirm the largely conservative regime of protein evolution. However, the second step of a double substitution can be subject to positive selection when the first step is deleterious. Such positive selection can result in frequent crossing of valleys on the fitness landscape.
Collapse
Affiliation(s)
- Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Itamar Sela
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
7
|
Amos W. Flanking heterozygosity influences the relative probability of different base substitutions in humans. ROYAL SOCIETY OPEN SCIENCE 2019; 6:191018. [PMID: 31598319 PMCID: PMC6774961 DOI: 10.1098/rsos.191018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 08/30/2019] [Indexed: 06/10/2023]
Abstract
Understanding when, where and which mutations are mostly likely to occur impacts many areas of evolutionary biology, from genetic diseases to phylogenetic reconstruction. Africans and non-African humans differ in the mutability of different triplet base combinations. Africans and non-Africans also differ in mutation rate, possibly because heterozygosity is mutagenic, such that diversity lost when humans expanded out of Africa also lowered the mutation rate. I show that these phenomena are linked: as flanking heterozygosity increases, some triplets become progressively more mutable while others become less so. Africans and non-African show near-identical patterns of dependence on heterozygosity. Thus, the striking differences in triplet mutation frequency between Africans and non-Africans, at least in part, seem to be an emergent property, driven by the way changes in heterozygosity 'out of Africa' have differentially impacted the mutability of different triplets. As heterozygosity decreased, the mutation spectrum outside Africa became enriched for triplet mutations that are favoured by low heterozygosity while those favoured by high heterozygosity became relatively rarer.
Collapse
|
8
|
Luo B, Edge AK, Tolg C, Turley EA, Dean CB, Hill KA, Kulperger RJ. Spatial statistical tools for genome-wide mutation cluster detection under a microarray probe sampling system. PLoS One 2018; 13:e0204156. [PMID: 30252889 PMCID: PMC6155535 DOI: 10.1371/journal.pone.0204156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2018] [Accepted: 09/04/2018] [Indexed: 11/30/2022] Open
Abstract
Mutation cluster analysis is critical for understanding certain mutational mechanisms relevant to genetic disease, diversity, and evolution. Yet, whole genome sequencing for detection of mutation clusters is prohibitive with high cost for most organisms and population surveys. Single nucleotide polymorphism (SNP) genotyping arrays, like the Mouse Diversity Genotyping Array, offer an alternative low-cost, screening for mutations at hundreds of thousands of loci across the genome using experimental designs that permit capture of de novo mutations in any tissue. Formal statistical tools for genome-wide detection of mutation clusters under a microarray probe sampling system are yet to be established. A challenge in the development of statistical methods is that microarray detection of mutation clusters is constrained to select SNP loci captured by probes on the array. This paper develops a Monte Carlo framework for cluster testing and assesses test statistics for capturing potential deviations from spatial randomness which are motivated by, and incorporate, the array design. While null distributions of the test statistics are established under spatial randomness via the homogeneous Poisson process, power performance of the test statistics is evaluated under postulated types of Neyman-Scott clustering processes through Monte Carlo simulation. A new statistic is developed and recommended as a screening tool for mutation cluster detection. The statistic is demonstrated to be excellent in terms of its robustness and power performance, and useful for cluster analysis in settings of missing data. The test statistic can also be generalized to any one dimensional system where every site is observed, such as DNA sequencing data. The paper illustrates how the informal graphical tools for detecting clusters may be misleading. The statistic is used for finding clusters of putative SNP differences in a mixture of different mouse genetic backgrounds and clusters of de novo SNP differences arising between tissues with development and carcinogenesis.
Collapse
Affiliation(s)
- Bin Luo
- Department of Statistical and Actuarial Sciences, Western University, London, Ontario, Canada
- * E-mail: (BL); (CBD); (KAH); (RJK)
| | - Alanna K. Edge
- Department of Biology, Western University, London, Ontario, Canada
| | - Cornelia Tolg
- London Regional Cancer Program, Lawson Health Research Institute, London, Ontario, Canada
| | - Eva A. Turley
- London Regional Cancer Program, Lawson Health Research Institute, London, Ontario, Canada
| | - C. B. Dean
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada
- * E-mail: (BL); (CBD); (KAH); (RJK)
| | - Kathleen A. Hill
- Department of Biology, Western University, London, Ontario, Canada
- * E-mail: (BL); (CBD); (KAH); (RJK)
| | - R. J. Kulperger
- Department of Statistical and Actuarial Sciences, Western University, London, Ontario, Canada
- * E-mail: (BL); (CBD); (KAH); (RJK)
| |
Collapse
|
9
|
Woo AC, Faure L, Dapa T, Matic I. Heterogeneity of spontaneous DNA replication errors in single isogenic Escherichia coli cells. SCIENCE ADVANCES 2018; 4:eaat1608. [PMID: 29938224 PMCID: PMC6010332 DOI: 10.1126/sciadv.aat1608] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 05/14/2018] [Indexed: 06/06/2023]
Abstract
Despite extensive knowledge of the molecular mechanisms that control mutagenesis, it is not known how spontaneous mutations are produced in cells with fully operative mutation-prevention systems. By using a mutation assay that allows visualization of DNA replication errors and stress response transcriptional reporters, we examined populations of isogenic Escherichia coli cells growing under optimal conditions without exogenous stress. We found that spontaneous DNA replication errors in proliferating cells arose more frequently in subpopulations experiencing endogenous stresses, such as problems with proteostasis, genome maintenance, and reactive oxidative species production. The presence of these subpopulations of phenotypic mutators is not expected to affect the average mutation frequency or to reduce the mean population fitness in a stable environment. However, these subpopulations can contribute to overall population adaptability in fluctuating environments by serving as a reservoir of increased genetic variability.
Collapse
Affiliation(s)
- Anthony C. Woo
- INSERM U1001, Université Paris Descartes, Sorbonne Paris Cité, Faculté de Médecine Paris Descartes, 24 rue du Faubourg Saint-Jacques, 75014 Paris, France
| | - Louis Faure
- INSERM U1001, Université Paris Descartes, Sorbonne Paris Cité, Faculté de Médecine Paris Descartes, 24 rue du Faubourg Saint-Jacques, 75014 Paris, France
| | - Tanja Dapa
- INSERM U1001, Université Paris Descartes, Sorbonne Paris Cité, Faculté de Médecine Paris Descartes, 24 rue du Faubourg Saint-Jacques, 75014 Paris, France
| | - Ivan Matic
- INSERM U1001, Université Paris Descartes, Sorbonne Paris Cité, Faculté de Médecine Paris Descartes, 24 rue du Faubourg Saint-Jacques, 75014 Paris, France
- Centre National de la Recherche Scientifique, 75016 Paris, France
| |
Collapse
|
10
|
Vergnaud G, Midoux C, Blouin Y, Bourkaltseva M, Krylov V, Pourcel C. Transposition Behavior Revealed by High-Resolution Description of Pseudomonas Aeruginosa Saltovirus Integration Sites. Viruses 2018; 10:v10050245. [PMID: 29735891 PMCID: PMC5977238 DOI: 10.3390/v10050245] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Revised: 04/27/2018] [Accepted: 05/04/2018] [Indexed: 01/08/2023] Open
Abstract
Transposable phages, also called saltoviruses, of which the Escherichia coli phage Mu is the reference, are temperate phages that multiply their genome through replicative transposition at multiple sites in their host chromosome. The viral genome is packaged together with host DNA at both ends. In the present work, genome sequencing of three Pseudomonas aeruginosa transposable phages, HW12, 2P1, and Ab30, incidentally gave us access to the location of thousands of replicative integration sites and revealed the existence of a variable number of hotspots. Taking advantage of deep sequencing, we then designed an experiment to study 13,000,000 transposon integration sites of bacteriophage Ab30. The investigation revealed the presence of 42 transposition hotspots adjacent to bacterial interspersed mosaic elements (BIME) accounting for 5% of all transposition sites. The rest of the sites appeared widely distributed with the exception of coldspots associated with low G-C content segments, including the putative O-antigen biosynthesis cluster. Surprisingly, 0.4% of the transposition events occurred in a copy of the phage genome itself, indicating that the previously described immunity against such events is slightly leaky. This observation allowed drawing an image of the phage chromosome supercoiling into four loops.
Collapse
Affiliation(s)
- Gilles Vergnaud
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198 Gif-sur-Yvette CEDEX, France.
| | - Cédric Midoux
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198 Gif-sur-Yvette CEDEX, France.
| | - Yann Blouin
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198 Gif-sur-Yvette CEDEX, France.
| | - Maria Bourkaltseva
- I. I. Mechnikov Research Institute for Vaccines & Sera, Moscow 105064, Russia.
| | - Victor Krylov
- I. I. Mechnikov Research Institute for Vaccines & Sera, Moscow 105064, Russia.
| | - Christine Pourcel
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198 Gif-sur-Yvette CEDEX, France.
| |
Collapse
|
11
|
Robert L, Ollion J, Robert J, Song X, Matic I, Elez M. Mutation dynamics and fitness effects followed in single cells. Science 2018; 359:1283-1286. [DOI: 10.1126/science.aan0797] [Citation(s) in RCA: 87] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 10/19/2017] [Accepted: 01/30/2018] [Indexed: 12/12/2022]
|
12
|
Fan H, Guo W. A genome-wide investigation of microsatellite mismatches and the association with body mass among bird species. PeerJ 2018; 6:e4495. [PMID: 29576965 PMCID: PMC5857172 DOI: 10.7717/peerj.4495] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 02/21/2018] [Indexed: 01/26/2023] Open
Abstract
Mutation rate is usually found to covary with many life history traits of animals such as body mass, which has been readily explained by the higher number of mutation opportunities per unit time. Although the precise reason for the pattern is not yet clear, to determine the universality of this pattern, we tested whether life history traits impact another form of genetic mutation, the motif mismatches in microsatellites. Employing published genome sequences from 65 avian species, we explored the motif mismatches patterns of microsatellites in birds on a genomic level and assessed the relationship between motif mismatches and body mass in a phylogenetic context. We found that small-bodied species have a higher average mismatches and we suggested that higher heterozygosity in imperfect microsatellites lead to the increase of motif mismatches. Our results obtained from this study imply that a negative body mass trend in mutation rate may be a general pattern of avian molecular evolution.
Collapse
Affiliation(s)
- Haiying Fan
- Department of Ecology, College of Life Sciences, Wuhan University, Wuhan, China
| | - Weibin Guo
- Department of Ecology, College of Life Sciences, Wuhan University, Wuhan, China
| |
Collapse
|
13
|
Alexander HK, Mayer SI, Bonhoeffer S. Population Heterogeneity in Mutation Rate Increases the Frequency of Higher-Order Mutants and Reduces Long-Term Mutational Load. Mol Biol Evol 2017; 34:419-436. [PMID: 27836985 PMCID: PMC5850754 DOI: 10.1093/molbev/msw244] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Mutation rate is a crucial evolutionary parameter that has typically been treated as a constant in population genetic analyses. However, the propensity to mutate is likely to vary among co-existing individuals within a population, due to genetic polymorphisms, heterogeneous environmental influences, and random physiological fluctuations. We review the evidence for mutation rate heterogeneity and explore its consequences by extending classic population genetic models to allow an arbitrary distribution of mutation rate among individuals, either with or without inheritance. With this general new framework, we rigorously establish the effects of heterogeneity at various evolutionary timescales. In a single generation, variation of mutation rate about the mean increases the probability of producing zero or many simultaneous mutations on a genome. Over multiple generations of mutation and selection, heterogeneity accelerates the appearance of both deleterious and beneficial multi-point mutants. At mutation-selection balance, higher-order mutant frequencies are likewise boosted, while lower-order mutants exhibit subtler effects; nonetheless, population mean fitness is always enhanced. We quantify the dependencies on moments of the mutation rate distribution and selection coefficients, and clarify the role of mutation rate inheritance. While typical methods of estimating mutation rate will recover only the population mean, analyses assuming mutation rate is fixed to this mean could underestimate the potential for multi-locus adaptation, including medically relevant evolution in pathogenic and cancerous populations. We discuss the potential to empirically parameterize mutation rate distributions, which have to date hardly been quantified.
Collapse
Affiliation(s)
- Helen K. Alexander
- Institute of Integrative Biology, Department of Environmental Systems Science, ETH Zürich, Switzerland
| | - Stephanie I. Mayer
- Institute of Integrative Biology, Department of Environmental Systems Science, ETH Zürich, Switzerland
| | - Sebastian Bonhoeffer
- Institute of Integrative Biology, Department of Environmental Systems Science, ETH Zürich, Switzerland
| |
Collapse
|
14
|
Abstract
Bioinformatic analysis can not only accelerate drug target identification and drug candidate screening and refinement, but also facilitate characterization of side effects and predict drug resistance. High-throughput data such as genomic, epigenetic, genome architecture, cistromic, transcriptomic, proteomic, and ribosome profiling data have all made significant contribution to mechanismbased drug discovery and drug repurposing. Accumulation of protein and RNA structures, as well as development of homology modeling and protein structure simulation, coupled with large structure databases of small molecules and metabolites, paved the way for more realistic protein-ligand docking experiments and more informative virtual screening. I present the conceptual framework that drives the collection of these high-throughput data, summarize the utility and potential of mining these data in drug discovery, outline a few inherent limitations in data and software mining these data, point out news ways to refine analysis of these diverse types of data, and highlight commonly used software and databases relevant to drug discovery.
Collapse
Affiliation(s)
- Xuhua Xia
- Department of Biology, Faculty of Science, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
- Ottawa Institute of Systems Biology, Ottawa K1H 8M5, Canada
| |
Collapse
|
15
|
Nshogozabahizi JC, Dench J, Aris-Brosou S. Widespread Historical Contingency in Influenza Viruses. Genetics 2017; 205:409-420. [PMID: 28049709 PMCID: PMC5223518 DOI: 10.1534/genetics.116.193979] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 11/04/2016] [Indexed: 11/18/2022] Open
Abstract
In systems biology and genomics, epistasis characterizes the impact that a substitution at a particular location in a genome can have on a substitution at another location. This phenomenon is often implicated in the evolution of drug resistance or to explain why particular "disease-causing" mutations do not have the same outcome in all individuals. Hence, uncovering these mutations and their locations in a genome is a central question in biology. However, epistasis is notoriously difficult to uncover, especially in fast-evolving organisms. Here, we present a novel statistical approach that replies on a model developed in ecology and that we adapt to analyze genetic data in fast-evolving systems such as the influenza A virus. We validate the approach using a two-pronged strategy: extensive simulations demonstrate a low-to-moderate sensitivity with excellent specificity and precision, while analyses of experimentally validated data recover known interactions, including in a eukaryotic system. We further evaluate the ability of our approach to detect correlated evolution during antigenic shifts or at the emergence of drug resistance. We show that in all cases, correlated evolution is prevalent in influenza A viruses, involving many pairs of sites linked together in chains; a hallmark of historical contingency. Strikingly, interacting sites are separated by large physical distances, which entails either long-range conformational changes or functional tradeoffs, for which we find support with the emergence of drug resistance. Our work paves a new way for the unbiased detection of epistasis in a wide range of organisms by performing whole-genome scans.
Collapse
Affiliation(s)
| | - Jonathan Dench
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
| | - Stéphane Aris-Brosou
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
- Department of Mathematics and Statistics, University of Ottawa, Ontario K1N 6N5, Canada
| |
Collapse
|
16
|
Evolutionary switches between two serine codon sets are driven by selection. Proc Natl Acad Sci U S A 2016; 113:13109-13113. [PMID: 27799560 DOI: 10.1073/pnas.1615832113] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Serine is the only amino acid that is encoded by two disjoint codon sets so that a tandem substitution of two nucleotides is required to switch between the two sets. Previously published evidence suggests that, for the most evolutionarily conserved serines, the codon set switch occurs by simultaneous substitution of two nucleotides. Here we report a genome-wide reconstruction of the evolution of serine codons in triplets of closely related species from diverse prokaryotes and eukaryotes. The results indicate that the great majority of codon set switches proceed by two consecutive nucleotide substitutions, via a threonine or cysteine intermediate, and are driven by selection. These findings imply a strong pressure of purifying selection in protein evolution, which in the case of serine codon set switches occurs via an initial deleterious substitution quickly followed by a second, compensatory substitution. The result is frequent reversal of amino acid replacements and, at short evolutionary distances, pervasive homoplasy.
Collapse
|
17
|
Abstract
It has been long understood that mutation distribution is not completely random across genomic space and in time. Indeed, recent surprising discoveries identified multiple simultaneous mutations occurring in tiny regions within chromosomes while the rest of the genome remains relatively mutation-free. Mechanistic elucidation of these phenomena, called mutation showers, mutation clusters, or kataegis, in parallel with findings of abundant clustered mutagenesis in cancer genomes, is ongoing. So far, the combination of factors most important for clustered mutagenesis is the induction of DNA lesions within unusually long and persistent single-strand DNA intermediates. In addition to being a fascinating phenomenon, clustered mutagenesis also became an indispensable tool for identifying a previously unrecognized major source of mutation in cancer, APOBEC cytidine deaminases. Future research on clustered mutagenesis may shed light onto important mechanistic details of genome maintenance, with potentially profound implications for human health.
Collapse
Affiliation(s)
- Kin Chan
- Mechanisms of Genome Dynamics Group, National Institute of Environmental Health Sciences, Department of Health and Human Services, National Institutes of Health, Durham, North Carolina 27709; ,
| | - Dmitry A Gordenin
- Mechanisms of Genome Dynamics Group, National Institute of Environmental Health Sciences, Department of Health and Human Services, National Institutes of Health, Durham, North Carolina 27709; ,
| |
Collapse
|
18
|
Zhu W, Cooper DN, Zhao Q, Wang Y, Liu R, Li Q, Férec C, Wang Y, Chen JM. Concurrent nucleotide substitution mutations in the human genome are characterized by a significantly decreased transition/transversion ratio. Hum Mutat 2015; 36:333-41. [PMID: 25546635 DOI: 10.1002/humu.22749] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2014] [Accepted: 12/17/2014] [Indexed: 01/16/2023]
Abstract
There is accumulating evidence that the number of multiple-nucleotide substitutions (MNS) occurring in closely spaced sites in eukaryotic genomes is significantly higher than would be predicted from the random accumulation of independently generated single-nucleotide substitutions (SNS). Although this excess can in principle be accounted for by the concept of transient hypermutability, a general mutational signature of concurrent MNS mutations has not so far been evident. Employing a dataset (N = 449) of "concurrent" double MNS mutations causing human inherited disease, we have identified just such a mutational signature: concurrently generated double MNS mutations exhibit a >twofold lower transition/transversion ratio (termed RTs/Tv ) than independently generated de novo SNS mutations (<0.80 vs. 2.10; P = 2.69 × 10(-14) ). We replicated this novel finding through a similar analysis employing two double MNS variant datasets with differing abundances of concurrent events (150,521 variants with both substitutions on the same haplotypic lineage vs. 94,875 variants whose component substitutions were on different haplotypic lineages) plus 5,430,874 SNS variants, all being derived from the whole-genome sequencing of seven Chinese individuals. Evaluation of the newly observed mutational signature in diverse contexts provides solid support for the postulated role of translesion synthesis DNA polymerases in transient hypermutability.
Collapse
Affiliation(s)
- Wenjuan Zhu
- Beijing Genomics Institute (BGI)-Shenzhen, Shenzhen, China
| | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Helfrich S, Pfeifer E, Krämer C, Sachs CC, Wiechert W, Kohlheyer D, Nöh K, Frunzke J. Live cell imaging of SOS and prophage dynamics in isogenic bacterial populations. Mol Microbiol 2015; 98:636-50. [PMID: 26235130 DOI: 10.1111/mmi.13147] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/23/2015] [Indexed: 12/15/2022]
Abstract
Almost all bacterial genomes contain DNA of viral origin, including functional prophages or degenerated phage elements. A frequent but often unnoted phenomenon is the spontaneous induction of prophage elements (SPI) even in the absence of an external stimulus. In this study, we have analyzed SPI of the large, degenerated prophage CGP3 (187 kbp), which is integrated into the genome of the Gram-positive Corynebacterium glutamicum ATCC 13032. Time-lapse fluorescence microscopy of fluorescent reporter strains grown in microfluidic chips revealed the sporadic induction of the SOS response as a prominent trigger of CGP3 SPI but also displayed a considerable fraction (∼30%) of RecA-independent SPI. Whereas approx. 20% of SOS-induced cells recovered from this stress and resumed growth, the spontaneous induction of CGP3 always led to a stop of growth and likely cell death. A carbon source starvation experiment clearly emphasized that SPI only occurs in actively proliferating cells, whereas sporadic SOS induction was still observed in resting cells. These data highlight the impact of sporadic DNA damage on the activity of prophage elements and provide a time-resolved, quantitative description of SPI as general phenomenon of bacterial populations.
Collapse
Affiliation(s)
- Stefan Helfrich
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Eugen Pfeifer
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Christina Krämer
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Christian Carsten Sachs
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Wolfgang Wiechert
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Dietrich Kohlheyer
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Katharina Nöh
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Julia Frunzke
- Institut für Bio- und Geowissenschaften, IBG-1: Biotechnologie, Forschungszentrum Jülich, 52425, Jülich, Germany
| |
Collapse
|
20
|
Disruption of Transcriptional Coactivator Sub1 Leads to Genome-Wide Re-distribution of Clustered Mutations Induced by APOBEC in Active Yeast Genes. PLoS Genet 2015; 11:e1005217. [PMID: 25941824 PMCID: PMC4420506 DOI: 10.1371/journal.pgen.1005217] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2015] [Accepted: 04/13/2015] [Indexed: 12/21/2022] Open
Abstract
Mutations in genomes of species are frequently distributed non-randomly, resulting in mutation clusters, including recently discovered kataegis in tumors. DNA editing deaminases play the prominent role in the etiology of these mutations. To gain insight into the enigmatic mechanisms of localized hypermutagenesis that lead to cluster formation, we analyzed the mutational single nucleotide variations (SNV) data obtained by whole-genome sequencing of drug-resistant mutants induced in yeast diploids by AID/APOBEC deaminase and base analog 6-HAP. Deaminase from sea lamprey, PmCDA1, induced robust clusters, while 6-HAP induced a few weak ones. We found that PmCDA1, AID, and APOBEC1 deaminases preferentially mutate the beginning of the actively transcribed genes. Inactivation of transcription initiation factor Sub1 strongly reduced deaminase-induced can1 mutation frequency, but, surprisingly, did not decrease the total SNV load in genomes. However, the SNVs in the genomes of the sub1 clones were re-distributed, and the effect of mutation clustering in the regions of transcription initiation was even more pronounced. At the same time, the mutation density in the protein-coding regions was reduced, resulting in the decrease of phenotypically detected mutants. We propose that the induction of clustered mutations by deaminases involves: a) the exposure of ssDNA strands during transcription and loss of protection of ssDNA due to the depletion of ssDNA-binding proteins, such as Sub1, and b) attainment of conditions favorable for APOBEC action in subpopulation of cells, leading to enzymatic deamination within the currently expressed genes. This model is applicable to both the initial and the later stages of oncogenic transformation and explains variations in the distribution of mutations and kataegis events in different tumor cells. Genomes of tumors are heavily enriched with mutations. Some of these mutations are distributed non-randomly, forming mutational clusters. Editing cytosine deaminases from APOBEC superfamily are responsible for the formation of many of these clusters. We have expressed APOBEC enzyme in diploid yeast cells and found that most of the mutations occur in the beginning of the active genes, where transcription starts. Clusters of mutations overlapped with promoters/transcription start sites. This is likely due to the weaker protection of ssDNA, an ultimate APOBEC deaminase enzyme target, in the beginning of the genes. This hypothesis was reinforced by the finding that inactivation of Sub1 transcription initiation factor, which is found predominantly in the regions of transcription initiation, leads to further increase in mutagenesis in the beginning of the genes. Interestingly, the total number of mutations in the genomes of Sub1-deficient clones did not change, despite the 100-fold decrease in frequency of mutants in a reporter gene. Thus, the drastic change in genome-wide distribution of mutations can be caused by inactivation of a single gene. We propose that the loss of ssDNA protection factors causes formation of mutation clusters in human cancer.
Collapse
|
21
|
Lind PA, Farr AD, Rainey PB. Experimental evolution reveals hidden diversity in evolutionary pathways. eLife 2015; 4. [PMID: 25806684 PMCID: PMC4395868 DOI: 10.7554/elife.07074] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Accepted: 03/24/2015] [Indexed: 11/13/2022] Open
Abstract
Replicate populations of natural and experimental organisms often show evidence of parallel genetic evolution, but the causes are unclear. The wrinkly spreader morph of Pseudomonas fluorescens arises repeatedly during experimental evolution. The mutational causes reside exclusively within three pathways. By eliminating these, 13 new mutational pathways were discovered with the newly arising WS types having fitnesses similar to those arising from the commonly passaged routes. Our findings show that parallel genetic evolution is strongly biased by constraints and we reveal the genetic bases. From such knowledge, and in instances where new phenotypes arise via gene activation, we suggest a set of principles: evolution proceeds firstly via pathways subject to negative regulation, then via promoter mutations and gene fusions, and finally via activation by intragenic gain-of-function mutations. These principles inform evolutionary forecasting and have relevance to interpreting the diverse array of mutations associated with clinically identical instances of disease in humans. DOI:http://dx.doi.org/10.7554/eLife.07074.001 Different living things often develop similar strategies to adapt to the environments in which they live. Sometimes two species that share a common ancestor independently evolve the same trait by changing the exact same genes. This is called ‘parallel evolution’, and it has led some scientists to ask: are there certain traits that can only evolve in a limited number of ways? Or are there other ways to evolve the same trait that, for some reason, are not explored? Experimentally, investigating these questions is challenging, but parallel evolution occurs in the laboratory as well as in the wild. Many commonly studied organisms—such as fruit flies or bacteria—can be used in relevant studies, because they can be grown in large numbers and then exposed to identical environments. However, if this method fails to find a new way that a trait can evolve, it doesn't mean that alternative mechanisms do not exist. Lind et al. used a different approach that instead relies on removing all of the known pathways that can be mutated to produce a given trait and then seeing if that trait can still evolve via mutations elsewhere. The experiments involved a bacterium called Pseudomonas fluorescens that can evolve to grow flattened and wrinkled colonies (instead of smooth, round ones) when it has to compete for access to oxygen. Previous experiments had shown that the evolution of the so-called ‘wrinkly spreader’ form can be caused by mutations in one of three biological pathways. But P. fluorescens can survive unharmed without these pathways, which enabled Lind et al. to ask if there might be other ways that this trait could evolve. Bacteria without these three pathways were engineered and then grown under oxygen-deprived conditions. This experiment produced 91 new mutants that each had the wrinkly spreader phenotype. Further experiments revealed that together these mutants represented 13 previously unrecognized ways that the ‘wrinkly spreader’ phenotype can evolve. The new rare mutants had similar fitness as the previously known, common ones—so this cannot explain why they hadn't been seen before. Lind et al. instead suggest a set of principles to explain why these newly discovered pathways are rarely mutated and how genetic constraints can bias the outcome of evolution. Further work could investigate whether these principles can help us to predict the course of evolution in other biological contexts, such as in the evolution of antibiotic resistance. DOI:http://dx.doi.org/10.7554/eLife.07074.002
Collapse
Affiliation(s)
- Peter A Lind
- New Zealand Institute for Advanced Study, Massey University, Auckland, New Zealand
| | - Andrew D Farr
- New Zealand Institute for Advanced Study, Massey University, Auckland, New Zealand
| | - Paul B Rainey
- New Zealand Institute for Advanced Study, Massey University, Auckland, New Zealand
| |
Collapse
|
22
|
Long H, Sung W, Miller SF, Ackerman MS, Doak TG, Lynch M. Mutation rate, spectrum, topology, and context-dependency in the DNA mismatch repair-deficient Pseudomonas fluorescens ATCC948. Genome Biol Evol 2014; 7:262-71. [PMID: 25539726 PMCID: PMC4316635 DOI: 10.1093/gbe/evu284] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
High levels of genetic diversity exist among natural isolates of the bacterium Pseudomonas fluorescens, and are especially elevated around the replication terminus of the genome, where strain-specific genes are found. In an effort to understand the role of genetic variation in the evolution of Pseudomonas, we analyzed 31,106 base substitutions from 45 mutation accumulation lines of P. fluorescens ATCC948, naturally deficient for mismatch repair, yielding a base-substitution mutation rate of 2.34 × 10−8 per site per generation (SE: 0.01 × 10−8) and a small-insertion-deletion mutation rate of 1.65 × 10−9 per site per generation (SE: 0.03 × 10−9). We find that the spectrum of mutations in prophage regions, which often contain virulence factors and antibiotic resistance, is highly similar to that in the intergenic regions of the host genome. Our results show that the mutation rate varies around the chromosome, with the lowest mutation rate found near the origin of replication. Consistent with observations from other studies, we find that site-specific mutation rates are heavily influenced by the immediately flanking nucleotides, indicating that mutations are context dependent.
Collapse
Affiliation(s)
- Hongan Long
- Department of Biology, Indiana University, Bloomington
| | - Way Sung
- Department of Biology, Indiana University, Bloomington
| | | | | | - Thomas G Doak
- Department of Biology, Indiana University, Bloomington National Center for Genome Analysis Support, Indiana University, Bloomington
| | - Michael Lynch
- Department of Biology, Indiana University, Bloomington
| |
Collapse
|
23
|
Amos W, Filipe LN. Microsatellite frequencies vary with body mass and body temperature in mammals, suggesting correlated variation in mutation rate. PeerJ 2014; 2:e663. [PMID: 25392761 PMCID: PMC4226647 DOI: 10.7717/peerj.663] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2014] [Accepted: 10/21/2014] [Indexed: 11/20/2022] Open
Abstract
Substitution rate is often found to correlate with life history traits such as body mass, a predictor of population size and longevity, and body temperature. The underlying mechanism is unclear but most models invoke either natural selection or factors such as generation length that change the number of mutation opportunities per unit time. Here we use published genome sequences from 69 mammals to ask whether life history traits impact another form of genetic mutation, the high rates of predominantly neutral slippage in microsatellites. We find that the length-frequency distributions of three common dinucleotide motifs differ greatly between even closely related species. These frequency differences correlate with body mass and body temperature and can be used to predict the phenotype of an unknown species. Importantly, different length microsatellites show complicated patterns of excess and deficit that cannot be explained by a simple model where species with short generation lengths have experienced more mutations. Instead, the patterns probably require changes in mutation rate that impact alleles of different length to different extents. Body temperature plausibly influences mutation rate by modulating the propensity for slippage. Existing hypotheses struggle to account for a link between body mass and mutation rate. However, body mass correlates inversely with population size, which in turn predicts heterozygosity. We suggest that heterozygote instability, HI, the idea that heterozygous sites show increased mutability, could provide a plausible link between body mass and mutation rate.
Collapse
|
24
|
Lenz C, Haerty W, Golding GB. Increased substitution rates surrounding low-complexity regions within primate proteins. Genome Biol Evol 2014; 6:655-65. [PMID: 24572016 PMCID: PMC3971593 DOI: 10.1093/gbe/evu042] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Previous studies have found that DNA-flanking low-complexity regions (LCRs) have an increased substitution rate. Here, the substitution rate was confirmed to increase in the vicinity of LCRs in several primate species, including humans. This effect was also found among human sequences from the 1000 Genomes Project. A strong correlation was found between average substitution rate per site and distance from the LCR, as well as the proportion of genes with gaps in the alignment at each site and distance from the LCR. Along with substitution rates, dN/dS ratios were also determined for each site, and the proportion of sites undergoing negative selection was found to have a negative relationship with distance from the LCR.
Collapse
Affiliation(s)
- Carolyn Lenz
- Department of Biology, McMaster University, Hamilton, Ontario, Canada
| | | | | |
Collapse
|
25
|
Abstract
Models of codon evolution have attracted particular interest because of their unique capabilities to detect selection forces and their high fit when applied to sequence evolution. We described here a novel approach for modeling codon evolution, which is based on Kronecker product of matrices. The 61 × 61 codon substitution rate matrix is created using Kronecker product of three 4 × 4 nucleotide substitution matrices, the equilibrium frequency of codons, and the selection rate parameter. The entities of the nucleotide substitution matrices and selection rate are considered as parameters of the model, which are optimized by maximum likelihood. Our fully mechanistic model allows the instantaneous substitution matrix between codons to be fully estimated with only 19 parameters instead of 3,721, by using the biological interdependence existing between positions within codons. We illustrate the properties of our models using computer simulations and assessed its relevance by comparing the AICc measures of our model and other models of codon evolution on simulations and a large range of empirical data sets. We show that our model fits most biological data better compared with the current codon models. Furthermore, the parameters in our model can be interpreted in a similar way as the exchangeability rates found in empirical codon models.
Collapse
Affiliation(s)
- Maryam Zaheri
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Linda Dib
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Nicolas Salamin
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| |
Collapse
|
26
|
Roberts SA, Gordenin DA. Clustered and genome-wide transient mutagenesis in human cancers: Hypermutation without permanent mutators or loss of fitness. Bioessays 2014; 36:382-393. [PMID: 24615916 DOI: 10.1002/bies.201300140] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The gain of a selective advantage in cancer as well as the establishment of complex traits during evolution require multiple genetic alterations, but how these mutations accumulate over time is currently unclear. There is increasing evidence that a mutator phenotype perpetuates the development of many human cancers. While in some cases the increased mutation rate is the result of a genetic disruption of DNA repair and replication or environmental exposures, other evidence suggests that endogenous DNA damage induced by AID/APOBEC cytidine deaminases can result in transient localized hypermutation generating simultaneous, closely spaced (i.e. "clustered") multiple mutations. Here, we discuss mechanisms that lead to mutation cluster formation, the biological consequences of their formation in cancer and evidence suggesting that APOBEC mutagenesis can also occur genome-wide. This raises the possibility that dysregulation of these enzymes may enable rapid malignant transformation by increasing mutation rates without the loss of fitness associated with permanent mutators.
Collapse
Affiliation(s)
- Steven A Roberts
- Chromosome Stability Group, Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, Durham, NC, USA
| | | |
Collapse
|
27
|
Chen JM, Cooper DN, Férec C. A new and more accurate estimate of the rate of concurrent tandem-base substitution mutations in the human germline: ∼0.4% of the single-nucleotide substitution mutation rate. Hum Mutat 2014; 35:392-4. [PMID: 24375656 DOI: 10.1002/humu.22501] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Accepted: 12/19/2013] [Indexed: 01/08/2023]
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U1078, Brest, France; Etablissement Français du Sang (EFS) - Bretagne, Brest, France; Faculté de Médecine et des Sciences de la Santé, Université de Bretagne Occidentale (UBO), Brest, France; Laboratoire de Génétique Moléculaire et d'Histocompatibilité, Centre Hospitalier Universitaire (CHU) Brest, Hôpital Morvan, Brest, France
| | | | | |
Collapse
|
28
|
McGrath M, Gey van Pittius NC, van Helden PD, Warren RM, Warner DF. Mutation rate and the emergence of drug resistance in Mycobacterium tuberculosis. J Antimicrob Chemother 2013; 69:292-302. [DOI: 10.1093/jac/dkt364] [Citation(s) in RCA: 99] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
|
29
|
Kroll E, Coyle S, Dunn B, Koniges G, Aragon A, Edwards J, Rosenzweig F. Starvation-associated genome restructuring can lead to reproductive isolation in yeast. PLoS One 2013; 8:e66414. [PMID: 23894280 PMCID: PMC3722211 DOI: 10.1371/journal.pone.0066414] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Accepted: 05/04/2013] [Indexed: 12/03/2022] Open
Abstract
Knowledge of the mechanisms that lead to reproductive isolation is essential for understanding population structure and speciation. While several models have been advanced to explain post-mating reproductive isolation, experimental data supporting most are indirect. Laboratory investigations of this phenomenon are typically carried out under benign conditions, which result in low rates of genetic change unlikely to initiate reproductive isolation. Previously, we described an experimental system using the yeast Saccharomyces cerevisiae where starvation served as a proxy to any stress that decreases reproduction and/or survivorship. We showed that novel lineages with restructured genomes quickly emerged in starved populations, and that these survivors were more fit than their ancestors when re-starved. Here we show that certain yeast lineages that survive starvation have become reproductively isolated from their ancestor. We further demonstrate that reproductive isolation arises from genomic rearrangements, whose frequency in starving yeast is several orders of magnitude greater than an unstarved control. By contrast, the frequency of point mutations is less than 2-fold greater. In a particular case, we observe that a starved lineage becomes reproductively isolated as a direct result of the stress-related accumulation of a single chromosome. We recapitulate this result by demonstrating that introducing an extra copy of one or several chromosomes into naïve, i.e. unstarved, yeast significantly diminishes their fertility. This type of reproductive barrier, whether arising spontaneously or via genetic manipulation, can be removed by making a lineage euploid for the altered chromosomes. Our model provides direct genetic evidence that reproductive isolation can arise frequently in stressed populations via genome restructuring without the precondition of geographic isolation.
Collapse
Affiliation(s)
- Evgueny Kroll
- Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America.
| | | | | | | | | | | | | |
Collapse
|
30
|
Abstract
Because spontaneous mutation is the source of all genetic diversity, measuring mutation rates can reveal how natural selection drives patterns of variation within and between species. We sequenced eight genomes produced by a mutation-accumulation experiment in Drosophila melanogaster. Our analysis reveals that point mutation and small indel rates vary significantly between the two different genetic backgrounds examined. We also find evidence that ∼2% of mutational events affect multiple closely spaced nucleotides. Unlike previous similar experiments, we were able to estimate genome-wide rates of large deletions and tandem duplications. These results suggest that, at least in inbred lines like those examined here, mutational pressures may result in net growth rather than contraction of the Drosophila genome. By comparing our mutation rate estimates to polymorphism data, we are able to estimate the fraction of new mutations that are eliminated by purifying selection. These results suggest that ∼99% of duplications and deletions are deleterious--making them 10 times more likely to be removed by selection than nonsynonymous mutations. Our results illuminate not only the rates of new small- and large-scale mutations, but also the selective forces that they encounter once they arise.
Collapse
|
31
|
Amos W. Variation in heterozygosity predicts variation in human substitution rates between populations, individuals and genomic regions. PLoS One 2013; 8:e63048. [PMID: 23646173 PMCID: PMC3639965 DOI: 10.1371/journal.pone.0063048] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2012] [Accepted: 03/28/2013] [Indexed: 01/11/2023] Open
Abstract
The "heterozygote instability" (HI) hypothesis suggests that gene conversion events focused on heterozygous sites during meiosis locally increase the mutation rate, but this hypothesis remains largely untested. As humans left Africa they lost variability, which, if HI operates, should have reduced the mutation rate in non-Africans. Relative substitution rates were quantified in diverse humans using aligned whole genome sequences from the 1,000 genomes project. Substitution rate is consistently greater in Africans than in non-Africans, but only in diploid regions of the genome, consistent with a role for heterozygosity. Analysing the same data partitioned into a series of non-overlapping 2 Mb windows reveals a strong, non-linear correlation between the amount of heterozygosity lost "out of Africa" and the difference in substitution rate between Africans and non-Africans. Putative recent mutations, derived variants that occur only once among the 80 human chromosomes sampled, occur preferentially at the centre of 2 Kb windows that have elevated heterozygosity compared both with the same region in a closely related population and with an immediately adjacent region in the same population. More than half of all substitutions appear attributable to variation in heterozygosity. This observation provides strong support for HI with implications for many branches of evolutionary biology.
Collapse
Affiliation(s)
- William Amos
- Department of Zoology, Cambridge University, Cambridge, Cambridgeshire, United Kingdom.
| |
Collapse
|
32
|
Shee C, Gibson JL, Rosenberg SM. Two mechanisms produce mutation hotspots at DNA breaks in Escherichia coli. Cell Rep 2012; 2:714-21. [PMID: 23041320 PMCID: PMC3607216 DOI: 10.1016/j.celrep.2012.08.033] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Revised: 08/06/2012] [Accepted: 08/30/2012] [Indexed: 11/30/2022] Open
Abstract
Mutation hotspots and showers occur across phylogeny and profoundly influence genome evolution, yet the mechanisms that produce hotspots remain obscure. We report that DNA double-strand breaks (DSBs) provoke mutation hotspots via stress-induced mutation in Escherichia coli. With tet reporters placed 2 kb to 2 Mb (half the genome) away from an I-SceI site, RpoS/DinB-dependent mutations occur maximally within the first 2 kb and decrease logarithmically to ∼60 kb. A weak mutation tail extends to 1 Mb. Hotspotting occurs independently of I-site/tet-reporter-pair position in the genome, upstream and downstream in the replication path. RecD, which allows RecBCD DSB-exonuclease activity, is required for strong local but not long-distance hotspotting, indicating that double-strand resection and gap-filling synthesis underlie local hotspotting, and newly illuminating DSB resection in vivo. Hotspotting near DSBs opens the possibility that specific genomic regions could be targeted for mutagenesis, and could also promote concerted evolution (coincident mutations) within genes/gene clusters, an important issue in the evolution of protein functions.
Collapse
Affiliation(s)
- Chandan Shee
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | |
Collapse
|
33
|
Chen JM, Férec C, Cooper DN. Transient hypermutability, chromothripsis and replication-based mechanisms in the generation of concurrent clustered mutations. Mutat Res 2012; 750:52-9. [PMID: 22100908 DOI: 10.1016/j.mrrev.2011.10.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Revised: 10/25/2011] [Accepted: 10/27/2011] [Indexed: 12/21/2022]
Abstract
Clustered mutations may be broadly defined as the presence of two or more mutations within a spatially localized genomic region on a single chromosome. Known instances vary in terms of both the number and type of the component mutations, ranging from two closely spaced point mutations to tens or even hundreds of genomic rearrangements. Although clustered mutations can represent the observable net result of independent lesions sequentially acquired over multiple cell cycles, they can also be generated in a simultaneous or quasi-simultaneous manner within a single cell cycle. This review focuses on those mechanisms known to underlie the latter type. Both gene conversion and transient hypermutability are capable of generating closely spaced multiple mutations. However, a recently described phenomenon in human cancer cells, known as 'chromothripsis', has provided convincing evidence that tens to hundreds of genomic rearrangements can sometimes be generated simultaneously via a single catastrophic event. The distinctive genomic features observed in the derivative chromosomes, together with the highly characteristic junction sequences, point to non-homologous end joining (NHEJ) as being the likely underlying mutational mechanism. By contrast, replication-based mechanisms such as microhomology-mediated break-induced replication (MMBIR) which involves serial replication slippage or serial template switching probably account for those complex genomic rearrangements that comprise multiple duplications and/or triplications.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | | | | |
Collapse
|
34
|
Roberts SA, Sterling J, Thompson C, Harris S, Mav D, Shah R, Klimczak LJ, Kryukov GV, Malc E, Mieczkowski PA, Resnick MA, Gordenin DA. Clustered mutations in yeast and in human cancers can arise from damaged long single-strand DNA regions. Mol Cell 2012; 46:424-35. [PMID: 22607975 DOI: 10.1016/j.molcel.2012.03.030] [Citation(s) in RCA: 323] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2012] [Revised: 03/14/2012] [Accepted: 03/31/2012] [Indexed: 10/28/2022]
Abstract
Mutations are typically perceived as random, independent events. We describe here nonrandom clustered mutations in yeast and in human cancers. Genome sequencing of yeast grown under chronic alkylation damage identified mutation clusters that extend up to 200 kb. A predominance of "strand-coordinated" changes of either cytosines or guanines in the same strand, mutation patterns, and genetic controls indicated that simultaneous mutations were generated by base alkylation in abnormally long single-strand DNA (ssDNA) formed at double-strand breaks (DSBs) and replication forks. Significantly, we found mutation clusters with analogous features in sequenced human cancers. Strand-coordinated clusters of mutated cytosines or guanines often resided near chromosome rearrangement breakpoints and were highly enriched with a motif targeted by APOBEC family cytosine-deaminases, which strongly prefer ssDNA. These data indicate that hypermutation via multiple simultaneous changes in randomly formed ssDNA is a general phenomenon that may be an important mechanism producing rapid genetic variation.
Collapse
Affiliation(s)
- Steven A Roberts
- Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, NIH, DHHS, Durham, NC 27709, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Gottlieb B, Beitel LK, Nadarajah A, Paliouras M, Trifiro M. The androgen receptor gene mutations database: 2012 update. Hum Mutat 2012; 33:887-94. [DOI: 10.1002/humu.22046] [Citation(s) in RCA: 329] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Accepted: 01/18/2012] [Indexed: 12/18/2022]
|
36
|
Schrider DR, Hourmozdi JN, Hahn MW. Pervasive multinucleotide mutational events in eukaryotes. Curr Biol 2011; 21:1051-4. [PMID: 21636278 DOI: 10.1016/j.cub.2011.05.013] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Revised: 04/12/2011] [Accepted: 05/05/2011] [Indexed: 10/18/2022]
Abstract
Many aspects of mutational processes are nonrandom, from the preponderance of transitions relative to transversions to the higher rate of mutation at CpG dinucleotides [1]. However, it is still often assumed that single-nucleotide mutations are independent of one another, each being caused by separate mutational events. The occurrence of multiple, closely spaced substitutions appears to violate assumptions of independence and is often interpreted as evidence for the action of adaptive natural selection [2, 3], balancing selection [4], or compensatory evolution [5, 6]. Here we provide evidence of a frequent, widespread multinucleotide mutational process active throughout eukaryotes. Genomic data from mutation-accumulation experiments, parent-offspring trios, and human polymorphisms all show that simultaneous nucleotide substitutions occur within short stretches of DNA. Regardless of species, such multinucleotide mutations (MNMs) consistently comprise ~3% of the total number of nucleotide substitutions. These results imply that previous adaptive interpretations of multiple, closely spaced substitutions may have been unwarranted and that MNMs must be considered when interpreting sequence data.
Collapse
Affiliation(s)
- Daniel R Schrider
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA
| | | | | |
Collapse
|
37
|
Forche A, Abbey D, Pisithkul T, Weinzierl MA, Ringstrom T, Bruck D, Petersen K, Berman J. Stress alters rates and types of loss of heterozygosity in Candida albicans. mBio 2011; 2:e00129-11. [PMID: 21791579 PMCID: PMC3143845 DOI: 10.1128/mbio.00129-11] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Accepted: 06/30/2011] [Indexed: 01/27/2023] Open
Abstract
UNLABELLED Genetic diversity is often generated during adaptation to stress, and in eukaryotes some of this diversity is thought to arise via recombination and reassortment of alleles during meiosis. Candida albicans, the most prevalent pathogen of humans, has no known meiotic cycle, and yet it is a heterozygous diploid that undergoes mitotic recombination during somatic growth. It has been shown that clinical isolates as well as strains passaged once through a mammalian host undergo increased levels of recombination. Here, we tested the hypothesis that stress conditions increase rates of mitotic recombination in C. albicans, which is measured as loss of heterozygosity (LOH) at specific loci. We show that LOH rates are elevated during in vitro exposure to oxidative stress, heat stress, and antifungal drugs. In addition, an increase in stress severity correlated well with increased LOH rates. LOH events can arise through local recombination, through homozygosis of longer tracts of chromosome arms, or by whole-chromosome homozygosis. Chromosome arm homozygosis was most prevalent in cultures grown under conventional lab conditions. Importantly, exposure to different stress conditions affected the levels of different types of LOH events, with oxidative stress causing increased recombination, while fluconazole and high temperature caused increases in events involving whole chromosomes. Thus, C. albicans generates increased amounts and different types of genetic diversity in response to a range of stress conditions, a process that we term "stress-induced LOH" that arises either by elevating rates of recombination and/or by increasing rates of chromosome missegregation. IMPORTANCE Stress-induced mutagenesis fuels the evolution of bacterial pathogens and is mainly driven by genetic changes via mitotic recombination. Little is known about this process in other organisms. Candida albicans, an opportunistic fungal pathogen, causes infections that require adaptation to different host environmental niches. We measured the rates of LOH and the types of LOH events that appeared in the absence and in the presence of physiologically relevant stresses and found that stress causes a significant increase in the rates of LOH and that this increase is proportional to the degree of stress. Furthermore, the types of LOH events that arose differed in a stress-dependent manner, indicating that eukaryotic cells generate increased genetic diversity in response to a range of stress conditions. We propose that this "stress-induced LOH" facilitates the rapid adaptation of C. albicans, which does not undergo meiosis, to changing environments within the host.
Collapse
Affiliation(s)
- A. Forche
- Department of Biology, Bowdoin College, Brunswick, Maine, USA, and
| | - D. Abbey
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| | - T. Pisithkul
- Department of Biology, Bowdoin College, Brunswick, Maine, USA, and
| | - M. A. Weinzierl
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| | - T. Ringstrom
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| | - D. Bruck
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| | - K. Petersen
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| | - J. Berman
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
38
|
Rani AQ, Malueka RG, Sasongko TH, Awano H, Lee T, Yagi M, Zilfalil BA, Salmi ABR, Takeshima Y, Zabidi-Hussin ZAMH, Matsuo M. Two closely spaced nonsense mutations in the DMD gene in a Malaysian family. Mol Genet Metab 2011; 103:303-4. [PMID: 21514860 DOI: 10.1016/j.ymgme.2011.04.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2011] [Revised: 04/02/2011] [Accepted: 04/02/2011] [Indexed: 12/31/2022]
Abstract
In Duchenne muscular dystrophy (DMD), identification of one nonsense mutation in the DMD gene has been considered an endpoint of genetic diagnosis. Here, we identified two closely spaced nonsense mutations in the DMD gene. In a Malaysian DMD patient two nonsense mutations (p.234S>X and p.249Q>X, respectively) were identified within exon 8. The proband's mother carried both mutations on one allele. Multiple mutations may explain the occasional discrepancies between genotype and phenotype in dystrophinopathy.
Collapse
Affiliation(s)
- Abdul Qawee Rani
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Kelantan, Malaysia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Sun S, Evans BJ, Golding GB. "Patchy-tachy" leads to false positives for recombination. Mol Biol Evol 2011; 28:2549-59. [PMID: 21498600 DOI: 10.1093/molbev/msr076] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Indirect tests have detected recombination in mitochondrial DNA (mtDNA) from many animal lineages, including mammals. However, it is possible that features of the molecular evolutionary process without recombination could be incorrectly inferred by indirect tests as being due to recombination. We have identified one such example, which we call "patchy-tachy" (PT), where different partitions of sequences evolve at different rates, that leads to an excess of false positives for recombination inferred by indirect tests. To explore this phenomena, we characterized the false positive rates of six widely used indirect tests for recombination using simulations of general models for mtDNA evolution with PT but without recombination. All tests produced 30-99% false positives for recombination, although the conditions that produced the maximal level of false positives differed between the tests. To evaluate the degree to which conditions that exacerbate false positives are found in published sequence data, we turned to 20 animal mtDNA data sets in which recombination is suggested by indirect tests. Using a model where different regions of the sequences were free to evolve at different rates in different lineages, we demonstrated that PT is prevalent in many data sets in which recombination was previously inferred using indirect tests. Taken together, our results argue that PT without recombination is a viable alternative explanation for detection of widespread recombination in animal mtDNA using indirect tests.
Collapse
Affiliation(s)
- Stephanie Sun
- Department of Biology, McMaster University, Hamilton, Ontario, Canada
| | | | | |
Collapse
|
40
|
Chen JM, Férec C, Cooper DN. Gene conversion in human genetic disease. Genes (Basel) 2010; 1:550-63. [PMID: 24710102 PMCID: PMC3966225 DOI: 10.3390/genes1030550] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Revised: 11/12/2010] [Accepted: 11/17/2010] [Indexed: 01/01/2023] Open
Abstract
Gene conversion is a specific type of homologous recombination that involves the unidirectional transfer of genetic material from a ‘donor’ sequence to a highly homologous ‘acceptor’. We have recently reviewed the molecular mechanisms underlying gene conversion, explored the key part that this process has played in fashioning extant human genes, and performed a meta-analysis of gene-conversion events known to have caused human genetic disease. Here we shall briefly summarize some of the latest developments in the study of pathogenic gene conversion events, including (i) the emerging idea of minimal efficient sequence homology (MESH) for homologous recombination, (ii) the local DNA sequence features that appear to predispose to gene conversion, (iii) a mechanistic comparison of gene conversion and transient hypermutability, and (iv) recently reported examples of pathogenic gene conversion events.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - Claude Férec
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK.
| |
Collapse
|
41
|
Amos W, Bryant C. Using human demographic history to infer natural selection reveals contrasting patterns on different families of immune genes. Proc Biol Sci 2010; 278:1587-94. [PMID: 21068042 DOI: 10.1098/rspb.2010.2056] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Detecting regions of the human genome that are, or have been, influenced by natural selection remains an important goal for geneticists. Many methods are used to infer selection, but there is a general reliance on an accurate understanding of how mutation and recombination events are distributed, and the well-known link between these processes and their evolutionary transience introduces uncertainty into inferences. Here, we present and apply two new, independent approaches; one based on single nucleotide polymorphisms (SNPs) that exploits geographical patterns in how humans lost variability as we colonized the world, the other based on the relationship between microsatellite repeat number and heterozygosity. We show that the two methods give concordant results. Of these, the SNP-based method is both widely applicable and detects selection over a well-defined time interval, the last 50 000 years. Analysis of all human genes by their Gene Ontology codes reveals how accelerated and decelerated loss of variability are both preferentially associated with immune genes. Applied to 168 immune genes used as the focus of a previous study, we show that members of the same gene family tend to yield similar indices of selection, even when located on different chromosomes. We hope our approach will provide a useful tool with which to infer where selection has acted to shape the human genome.
Collapse
Affiliation(s)
- William Amos
- Department of Zoology, University of Cambridge, Downing Street, Cambridge CB2 3EJ, UK.
| | | |
Collapse
|
42
|
Magee AM, Aspinall S, Rice DW, Cusack BP, Sémon M, Perry AS, Stefanović S, Milbourne D, Barth S, Palmer JD, Gray JC, Kavanagh TA, Wolfe KH. Localized hypermutation and associated gene losses in legume chloroplast genomes. Genome Res 2010; 20:1700-10. [PMID: 20978141 DOI: 10.1101/gr.111955.110] [Citation(s) in RCA: 178] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Point mutations result from errors made during DNA replication or repair, so they are usually expected to be homogeneous across all regions of a genome. However, we have found a region of chloroplast DNA in plants related to sweetpea (Lathyrus) whose local point mutation rate is at least 20 times higher than elsewhere in the same molecule. There are very few precedents for such heterogeneity in any genome, and we suspect that the hypermutable region may be subject to an unusual process such as repeated DNA breakage and repair. The region is 1.5 kb long and coincides with a gene, ycf4, whose rate of evolution has increased dramatically. The product of ycf4, a photosystem I assembly protein, is more divergent within the single genus Lathyrus than between cyanobacteria and other angiosperms. Moreover, ycf4 has been lost from the chloroplast genome in Lathyrus odoratus and separately in three other groups of legumes. Each of the four consecutive genes ycf4-psaI-accD-rps16 has been lost in at least one member of the legume "inverted repeat loss" clade, despite the rarity of chloroplast gene losses in angiosperms. We established that accD has relocated to the nucleus in Trifolium species, but were unable to find nuclear copies of ycf4 or psaI in Lathyrus. Our results suggest that, as well as accelerating sequence evolution, localized hypermutation has contributed to the phenomenon of gene loss or relocation to the nucleus.
Collapse
Affiliation(s)
- Alan M Magee
- Smurfit Institute of Genetics, Trinity College, Dublin, Ireland
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Jiang X, Mu B, Huang Z, Zhang M, Wang X, Tao S. Impacts of mutation effects and population size on mutation rate in asexual populations: a simulation study. BMC Evol Biol 2010; 10:298. [PMID: 20920286 PMCID: PMC2958918 DOI: 10.1186/1471-2148-10-298] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2010] [Accepted: 09/30/2010] [Indexed: 12/03/2022] Open
Abstract
Background In any natural population, mutation is the primary source of genetic variation required for evolutionary novelty and adaptation. Nevertheless, most mutations, especially those with phenotypic effects, are harmful and are consequently removed by natural selection. For this reason, under natural selection, an organism will evolve to a lower mutation rate. Overall, the action of natural selection on mutation rate is related to population size and mutation effects. Although theoretical work has intensively investigated the relationship between natural selection and mutation rate, most of these studies have focused on individual competition within a population, rather than on competition among populations. The aim of the present study was to use computer simulations to investigate how natural selection adjusts mutation rate among asexually reproducing subpopulations with different mutation rates. Results The competition results for the different subpopulations showed that a population could evolve to an "optimum" mutation rate during long-term evolution, and that this rate was modulated by both population size and mutation effects. A larger population could evolve to a higher optimum mutation rate than could a smaller population. The optimum mutation rate depended on both the fraction and the effects of beneficial mutations, rather than on the effects of deleterious ones. The optimum mutation rate increased with either the fraction or the effects of beneficial mutations. When strongly favored mutations appeared, the optimum mutation rate was elevated to a much higher level. The competition time among the subpopulations also substantially shortened. Conclusions Competition at the population level revealed that the evolution of the mutation rate in asexual populations was determined by both population size and mutation effects. The most striking finding was that beneficial mutations, rather than deleterious mutations, were the leading force that modulated the optimum mutation rate. The initial configuration of the population appeared to have no effect on these conclusions, confirming the robustness of the simulation method developed in the present study. These findings might further explain the lower mutation rates observed in most asexual organisms, as well as the higher mutation rates in some viruses.
Collapse
Affiliation(s)
- Xiaoqian Jiang
- Bioinformatics center, Northwest A&F University, Yangling, Shaanxi 712100, China
| | | | | | | | | | | |
Collapse
|
44
|
Affiliation(s)
- Jacques Ninio
- Laboratoire de Physique Statistique de l'Ecole Normale Supérieure, UMR 8550 of the CNRS, UPMC Université Paris 06 and Université Paris Diderot, Paris, France.
| |
Collapse
|
45
|
Mutation biases and mutation rate variation around very short human microsatellites revealed by human-chimpanzee-orangutan genomic sequence alignments. J Mol Evol 2010; 71:192-201. [PMID: 20700734 DOI: 10.1007/s00239-010-9377-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2010] [Accepted: 07/26/2010] [Indexed: 01/21/2023]
Abstract
I have studied mutation patterns around very short microsatellites, focusing mainly on sequences carrying only two repeat units. By using human-chimpanzee-orangutan alignments, inferences can be made about both the relative rates of mutations and which bases have mutated. I find remarkable non-randomness, with mutation rate depending on a base's position relative to the microsatellite, the identity of the base itself and the motif in the microsatellite. Comparing the patterns around AC2 with those around other four-base combinations reveals that AC2 does not stand out as being special in the sense that non-repetitive tetramers also generate strong mutation biases. However, comparing AC2 and AC3 with AC4 reveals a step change in both the rate and nature of mutations occurring, suggesting a transition state, AC4 exhibiting an alternating high-low mutation rate pattern consistent with the sequence patterning seen around longer microsatellites. Surprisingly, most changes in repeat number occur through base substitutions rather than slippage, and the relative probability of gaining versus losing a repeat in this way varies greatly with repeat number. Slippage mutations reveal rather similar patterns of mutability compared with point mutations, being rare at two repeats where most cause the loss of a repeat, with both mutation rate and the proportion of expansion mutations increasing up to 6-8 repeats. Inferences about longer repeat tracts are hampered by uncertainties about the proportion of multi-species alignments that fail due to multi-repeat mutations and other rearrangements.
Collapse
|
46
|
Elez M, Murray AW, Bi LJ, Zhang XE, Matic I, Radman M. Seeing mutations in living cells. Curr Biol 2010; 20:1432-7. [PMID: 20674359 DOI: 10.1016/j.cub.2010.06.071] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Revised: 05/26/2010] [Accepted: 06/22/2010] [Indexed: 10/19/2022]
Abstract
BACKGROUND Evolution depends on mutations: rare errors in the transmission of genetic information. Experimentally, mutations have been found by detecting altered phenotypes or sequencing complete genomes, but most mutations do not have overt phenotypes, and sequencing is expensive and has limited time resolution. The major source of mutations is DNA replication errors. Nearly all mistakes in DNA replication are detected and repaired by the mismatch repair machinery. RESULTS We use a functional, fluorescently labeled derivative of one of the key mismatch repair proteins (MutL) to see and count the small fraction of errors in Escherichia coli that does not get repaired and is converted into stable mutations by the next round of DNA replication. Over a 300-fold range, there is a linear relationship between the frequency of fluorescent foci and the genetically measured mutation frequency, and the mean frequency of fluorescent foci agrees well with estimates of the global mutation rate. CONCLUSION We describe a method for detecting the majority of genomic mutations emerging in living cells, independently of their potential phenotype. The distribution of emerging mutations per cell is roughly Poisson distributed, suggesting that all the cells in the population have roughly the same mutation rate.
Collapse
Affiliation(s)
- Marina Elez
- University Paris-Descartes Medical School, Inserm Unit 1001, 75730 Paris Cedex 15, France
| | | | | | | | | | | |
Collapse
|
47
|
Yang Y, Gordenin DA, Resnick MA. A single-strand specific lesion drives MMS-induced hyper-mutability at a double-strand break in yeast. DNA Repair (Amst) 2010; 9:914-21. [PMID: 20663718 DOI: 10.1016/j.dnarep.2010.06.005] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2010] [Revised: 06/13/2010] [Accepted: 06/14/2010] [Indexed: 11/17/2022]
Abstract
Localized hyper-mutability (LHM) can be important in evolution, immunity, and genetic diseases. We previously reported that single-strand DNA (ssDNA) can be an important source of damage-induced LHM in yeast. Here, we establish that the generation of LHM by methyl methanesulfonate (MMS) during repair of a chromosomal double-strand break (DSB) can result in over 0.2 mutations/kb, which is approximately 20,000-fold higher than the MMS-induced mutation density without a DSB. The MMS-induced mutations associated with DSB repair were primarily due to substitutions via translesion DNA synthesis at damaged cytosines, even though there are nearly 10 times more MMS-induced lesions at other bases. Based on this mutation bias, the promutagenic lesion dominating LHM is likely 3-methylcytosine, which is single-strand specific. Thus, the dramatic increase in mutagenesis at a DSB is concluded to result primarily from the generation of non-repairable lesions in ssDNA associated with DSB repair along with efficient induction of highly mutagenic ssDNA-specific lesions. These findings with MMS-induced LHM have broad biological implications for unrepaired damage generated in ssDNA and possibly ssRNA.
Collapse
Affiliation(s)
- Yong Yang
- Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, NIH, DHHS, Research Triangle Park, NC 27709, United States
| | | | | |
Collapse
|
48
|
García-Villada L, Drake JW. Mutational clusters generated by non-processive polymerases: A case study using DNA polymerase betain vitro. DNA Repair (Amst) 2010; 9:871-8. [PMID: 20627824 DOI: 10.1016/j.dnarep.2010.05.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2010] [Accepted: 05/03/2010] [Indexed: 11/28/2022]
Abstract
Available DNA mutational spectra reveal that the number of mutants with multiple mutations ("multiples") is usually greater than expected from a random distribution of mutations among mutants. These overloads imply the occurrence of non-random clusters of mutations, probably generated during episodes of low-fidelity DNA synthesis. Excess multiples have been reported not only for viruses, bacteria, and eukaryotic cells but also for the DNA polymerases of phages T4 and RB69 in vitro. In the simplest case of a purified polymerase, non-random clusters may be generated by a subfraction of phenotypic variants able to introduce more errors per cycle of DNA synthesis than the normal enzyme. According to this hypothesis, excess multiples are not expected with non-processive polymerases even if they harbor rare mutator variants. DNA polymerase beta (Pol beta) is a mammalian DNA-repair polymerase with very low processivity. Although several Pol beta mutational spectra have been described, there is conflicting evidence on whether or not excess multiples occur, with spectra based on the HSV-tk system tending to show excess multiples. Excess multiples generated by Pol beta or any of its mutants might imply that the excesses of multiples observed in numerous other systems, especially those with processive polymerases, could be artifactual. Here, the distributions of mutations generated by native and recombinant rat Pol beta and by the Pol beta(Y265C) mutator were analyzed in the M13mp2 lacZalpha system. Our results present no evidence for a significant excess of multiples over the expected numbers with any of the Pol beta enzymes tested in this system. The reported excess of Pol beta-generated multiples in the HSV-tk system may reflect a reduced efficiency of detection of base substitutions that cause weak phenotypes, which in turn may artifactually increase the frequency of multiples.
Collapse
Affiliation(s)
- Libertad García-Villada
- Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA.
| | | |
Collapse
|
49
|
Thébaud G, Chadœuf J, Morelli MJ, McCauley JW, Haydon DT. The relationship between mutation frequency and replication strategy in positive-sense single-stranded RNA viruses. Proc Biol Sci 2010; 277:809-17. [PMID: 19906671 PMCID: PMC2842737 DOI: 10.1098/rspb.2009.1247] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2009] [Accepted: 10/22/2009] [Indexed: 01/21/2023] Open
Abstract
For positive-sense single-stranded RNA virus genomes, there is a trade-off between the mutually exclusive tasks of transcription, translation and encapsidation. The replication strategy that maximizes the intracellular growth rate of the virus requires iterative genome transcription from positive to negative, and back to positive sense. However, RNA viruses experience high mutation rates, and the proportion of genomes with lethal mutations increases with the number of replication cycles. Thus, intracellular mutant frequency will depend on the replication strategy. Introducing apparently realistic mutation rates into a model of viral replication demonstrates that strategies that maximize viral growth rate could result in an average of 26 mutations per genome by the time plausible numbers of positive strands have been generated, and that virus viability could be as low as 0.1 per cent. At high mutation rates or when a high proportion of mutations are deleterious, the optimal strategy shifts towards synthesizing more negative strands per positive strand, and in extremis towards a 'stamping-machine' replication mode where all the encapsidated genomes come from only two transcriptional steps. We conclude that if viral mutation rates are as high as current estimates suggest, either mutation frequency must be considerably higher than generally anticipated and the proportion of viable viruses produced extremely small, or replication strategies cannot be optimized to maximize viral growth rate. Mechanistic models linking mutation frequency to replication mechanisms coupled with data generated through new deep-sequencing technologies could play an important role in improving the estimates of viral mutation rate.
Collapse
Affiliation(s)
- Gaël Thébaud
- Institut National de la Recherche Agronomique (INRA), UMR BGPI, Cirad TA A-54/K, Campus de Baillarguet, 34398 Montpellier cedex 5, France
| | - Joël Chadœuf
- INRA, UR546 Biostatistique et Processus Spatiaux, Domaine Saint-Paul, 84914 Avignon, France
| | - Marco J. Morelli
- Boyd Orr Centre for Population and Ecosystem Health, Faculty of Biomedical and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK
| | - John W. McCauley
- The National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Daniel T. Haydon
- Boyd Orr Centre for Population and Ecosystem Health, Faculty of Biomedical and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK
| |
Collapse
|
50
|
Abstract
A central problem in evolutionary theory concerns the mechanisms by which adaptations requiring multiple mutations emerge in natural populations. We develop a series of expressions that clarify the scaling of the time to establishment of complex adaptations with population size, mutation rate, magnitude of the selective disadvantage of intermediate-state alleles, and the complexity of the adaptation. In general, even in the face of deleterious intermediate steps, the time to establishment is minimized in populations with very large size. Under a broad range of conditions, the time to establishment also scales by no more than the square of the mutation rate, regardless of the number of sites contributing to the adaptive change, demonstrating that the emergence of complex adaptations is only weakly constrained by the independent acquisition of mutations at the underlying sites. Mutator alleles with deleterious side effects have only moderate effects on the rate of adaptation in large populations but can cause a quantum decrease in the time to establishment of some adaptive alleles in small populations, although probably not at a high enough rate to offset the increased deleterious mutation load. Transient hypermutability, whereby a subset of gamete-producing cells mutate at an elevated rate in a nonheritable manner, may also elevate the rate of adaptation, although the effect is modest and appears to result from a simple increase in the rate of transitions between intermediate states rather than from the saltational production of doublet mutations. Taken together, these results illustrate the plausibility of the relatively rapid emergence of specific complex adaptations by conventional population genetic mechanisms and provide insight into the relative incidences of various paths of allelic adaptation in organisms with different population genetic features.
Collapse
|