Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Schrider DR, Hourmozdi JN, Hahn MW. Pervasive multinucleotide mutational events in eukaryotes. Curr Biol 2011;21:1051-4. [PMID: 21636278 DOI: 10.1016/j.cub.2011.05.013] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Revised: 04/12/2011] [Accepted: 05/05/2011] [Indexed: 10/18/2022]

For:	Schrider DR, Hourmozdi JN, Hahn MW. Pervasive multinucleotide mutational events in eukaryotes. Curr Biol 2011;21:1051-4. [PMID: 21636278 DOI: 10.1016/j.cub.2011.05.013] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Revised: 04/12/2011] [Accepted: 05/05/2011] [Indexed: 10/18/2022]

Number

Cited by Other Article(s)

Catania EM, Dubs NM, Soumen S, Barkman TJ. The Mutational Road not Taken: Using Ancestral Sequence Resurrection to Evaluate the Evolution of Plant Enzyme Substrate Preferences. Genome Biol Evol 2024;16:evae016. [PMID: 38290535 PMCID: PMC10853004 DOI: 10.1093/gbe/evae016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 01/19/2024] [Indexed: 02/01/2024] Open

Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024;2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]

Lynch M, Ali F, Lin T, Wang Y, Ni J, Long H. The divergence of mutation rates and spectra across the Tree of Life. EMBO Rep 2023;24:e57561. [PMID: 37615267 PMCID: PMC10561183 DOI: 10.15252/embr.202357561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/01/2023] [Accepted: 08/02/2023] [Indexed: 08/25/2023] Open

Gitschlag BL, Cano AV, Payne JL, McCandlish DM, Stoltzfus A. Mutation and Selection Induce Correlations between Selection Coefficients and Mutation Rates. Am Nat 2023;202:534-557. [PMID: 37792926 DOI: 10.1086/726014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]

Suárez-Menéndez M, Bérubé M, Furni F, Rivera-León VE, Heide-Jørgensen MP, Larsen F, Sears R, Ramp C, Eriksson BK, Etienne RS, Robbins J, Palsbøll PJ. Wild pedigrees inform mutation rates and historic abundance in baleen whales. Science 2023;381:990-995. [PMID: 37651509 DOI: 10.1126/science.adf2160] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 07/25/2023] [Indexed: 09/02/2023]

Lucaci AG, Zehr JD, Enard D, Thornton JW, Kosakovsky Pond SL. Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses. Mol Biol Evol 2023;40:msad150. [PMID: 37395787 PMCID: PMC10336034 DOI: 10.1093/molbev/msad150] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/15/2023] [Accepted: 06/26/2023] [Indexed: 07/04/2023] Open

Abstract

Inference and interpretation of evolutionary processes, in particular of the types and targets of natural selection affecting coding sequences, are critically influenced by the assumptions built into statistical models and tests. If certain aspects of the substitution process (even when they are not of direct interest) are presumed absent or are modeled with too crude of a simplification, estimates of key model parameters can become biased, often systematically, and lead to poor statistical performance. Previous work established that failing to accommodate multinucleotide (or multihit, MH) substitutions strongly biases dN/dS-based inference towards false-positive inferences of diversifying episodic selection, as does failing to model variation in the rate of synonymous substitution (SRV) among sites. Here, we develop an integrated analytical framework and software tools to simultaneously incorporate these sources of evolutionary complexity into selection analyses. We found that both MH and SRV are ubiquitous in empirical alignments, and incorporating them has a strong effect on whether or not positive selection is detected (1.4-fold reduction) and on the distributions of inferred evolutionary rates. With simulation studies, we show that this effect is not attributable to reduced statistical power caused by using a more complex model. After a detailed examination of 21 benchmark alignments and a new high-resolution analysis showing which parts of the alignment provide support for positive selection, we show that MH substitutions occurring along shorter branches in the tree explain a significant fraction of discrepant results in selection detection. Our results add to the growing body of literature which examines decades-old modeling assumptions (including MH) and finds them to be problematic for comparative genomic data analysis. Because multinucleotide substitutions have a significant impact on natural selection detection even at the level of an entire gene, we recommend that selection analyses of this type consider their inclusion as a matter of routine. To facilitate this procedure, we developed, implemented, and benchmarked a simple and well-performing model testing selection detection framework able to screen an alignment for positive selection with two biologically important confounding processes: site-to-site synonymous rate variation, and multinucleotide instantaneous substitutions.

Collapse

Cano AV, Gitschlag BL, Rozhoňová H, Stoltzfus A, McCandlish DM, Payne JL. Mutation bias and the predictability of evolution. Philos Trans R Soc Lond B Biol Sci 2023;378:20220055. [PMID: 37004719 PMCID: PMC10067271 DOI: 10.1098/rstb.2022.0055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023] Open

Fan W, Liu F, Jia Q, Du H, Chen W, Ruan J, Lei J, Li DZ, Mower JP, Zhu A. Fragaria mitogenomes evolve rapidly in structure but slowly in sequence and incur frequent multinucleotide mutations mediated by microinversions. THE NEW PHYTOLOGIST 2022;236:745-759. [PMID: 35731093 DOI: 10.1111/nph.18334] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 06/16/2022] [Indexed: 06/15/2023]

Hasan AR, Lachapelle J, El-Shawa SA, Potjewyd R, Ford SA, Ness RW. Salt stress alters the spectrum of de novo mutation available to selection during experimental adaptation of Chlamydomonas reinhardtii. Evolution 2022;76:2450-2463. [PMID: 36036481 DOI: 10.1111/evo.14604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 08/12/2022] [Indexed: 01/22/2023]

Mohiuddin M, Kooy RF, Pearson CE. De novo mutations, genetic mosaicism and human disease. Front Genet 2022;13:983668. [PMID: 36226191 PMCID: PMC9550265 DOI: 10.3389/fgene.2022.983668] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 09/08/2022] [Indexed: 11/23/2022] Open

Belinky F, Bykova A, Yurchenko V, Rogozin IB. No evidence for widespread positive selection on double substitutions within codons in primates and yeasts. Front Genet 2022;13:991249. [PMID: 36159983 PMCID: PMC9500374 DOI: 10.3389/fgene.2022.991249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open

Abstract

Nucleotide substitutions in protein-coding genes can be divided into synonymous (S) and non-synonymous (N) ones that alter amino acids (including nonsense mutations causing stop codons). The S substitutions are expected to have little effect on function. The N substitutions almost always are affected by strong purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases can modulate the deleterious effect of single N substitutions and, thus, could be subjected to the positive selection. This effect has been demonstrated for mutations in the serine codons, stop codons and double N substitutions in prokaryotes. In all abovementioned cases, a novel technique was applied that allows elucidating the effects of selection on double substitutions considering mutational biases. Here, we applied the same technique to study double N substitutions in eukaryotic lineages of primates and yeast. We identified markedly fewer cases of purifying selection relative to prokaryotes and no evidence of codon double substitutions under positive selection. This is consistent with previous studies of serine codons in primates and yeast. In general, the obtained results strongly suggest that there are major differences between studied pro- and eukaryotes; double substitutions in primates and yeasts largely reflect mutational biases and are not hallmarks of selection. This is especially important in the context of detection of positive selection in codons because it has been suggested that multiple mutations in codons cause false inferences of lineage-specific site positive selection. It is likely that this concern is applicable to previously studied prokaryotes but not to primates and yeasts where markedly fewer double substitutions are affected by positive selection.

Collapse

Matsen FA, Ralph PL. Enabling Inference for Context-Dependent Models of Mutation by Bounding the Propagation of Dependency. J Comput Biol 2022;29:802-824. [PMID: 35776513 PMCID: PMC9419934 DOI: 10.1089/cmb.2021.0644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Póti Á, Szikriszt B, Gervai JZ, Chen D, Szüts D. Characterisation of the spectrum and genetic dependence of collateral mutations induced by translesion DNA synthesis. PLoS Genet 2022;18:e1010051. [PMID: 35130276 PMCID: PMC8870599 DOI: 10.1371/journal.pgen.1010051] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 02/24/2022] [Accepted: 01/21/2022] [Indexed: 11/18/2022] Open

Abstract

Translesion DNA synthesis (TLS) is a fundamental damage bypass pathway that utilises specialised polymerases with relaxed template specificity to achieve replication through damaged DNA. Misinsertions by low fidelity TLS polymerases may introduce additional mutations on undamaged DNA near the original lesion site, which we termed collateral mutations. In this study, we used whole genome sequencing datasets of chicken DT40 and several human cell lines to obtain evidence for collateral mutagenesis in higher eukaryotes. We found that cisplatin and UVC radiation frequently induce close mutation pairs within 25 base pairs that consist of an adduct-associated primary and a downstream collateral mutation, and genetically linked their formation to TLS activity involving PCNA ubiquitylation and polymerase κ. PCNA ubiquitylation was also indispensable for close mutation pairs observed amongst spontaneously arising base substitutions in cell lines with disrupted homologous recombination. Collateral mutation pairs were also found in melanoma genomes with evidence of UV exposure. We showed that collateral mutations frequently copy the upstream base, and extracted a base substitution signature that describes collateral mutagenesis in the presented dataset regardless of the primary mutagenic process. Using this mutation signature, we showed that collateral mutagenesis creates approximately 10–20% of non-paired substitutions as well, underscoring the importance of the process.

DNA base substitutions are the most common form of genomic mutations, formed both spontaneously and in response to environmental mutagens. One of the main mechanisms of base substitution mutagenesis is translesion synthesis, a process that relies on specialised DNA polymerases to replicate damaged DNA templates. In addition to incorrect base insertions at the site of lesions in the template, translesion polymerases may also generate ‘collateral’ mutations away from the lesion due to their lower accuracy in selecting the correct incoming nucleotide. In this study, we surveyed the whole genome sequence of experimental cell clones to examine the extent and genetic dependence of collateral mutagenesis in higher eukaryotes. Looking for close mutation pairs, we found that collateral mutations frequently occur near primary lesions generated by cisplatin or ultraviolet radiation in chicken and human cells, but are restricted to a short distance of approximately 25 base pairs. By analysing their sequence context, we showed that collateral mutations can also occur near correctly bypassed primary lesions and may be responsible for a considerable proportion of all base substitution mutations.

Collapse

Bergeron LA, Besenbacher S, Turner T, Versoza CJ, Wang RJ, Price AL, Armstrong E, Riera M, Carlson J, Chen HY, Hahn MW, Harris K, Kleppe AS, López-Nandam EH, Moorjani P, Pfeifer SP, Tiley GP, Yoder AD, Zhang G, Schierup MH. The mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates. eLife 2022;11:73577. [PMID: 35018888 PMCID: PMC8830884 DOI: 10.7554/elife.73577] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 01/11/2022] [Indexed: 11/13/2022] Open

Sepúlveda-Yáñez JH, Alvarez Saravia D, Pilzecker B, van Schouwenburg PA, van den Burg M, Veelken H, Navarrete MA, Jacobs H, Koning MT. Tandem Substitutions in Somatic Hypermutation. Front Immunol 2022;12:807015. [PMID: 35069591 PMCID: PMC8781386 DOI: 10.3389/fimmu.2021.807015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/16/2021] [Indexed: 11/13/2022] Open

Protein innovation through template switching in the Saccharomyces cerevisiae lineage. Sci Rep 2021;11:22558. [PMID: 34799587 PMCID: PMC8604942 DOI: 10.1038/s41598-021-01736-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 10/27/2021] [Indexed: 11/08/2022] Open

Jiang P, Ollodart AR, Sudhesh V, Herr AJ, Dunham MJ, Harris K. A modified fluctuation assay reveals a natural mutator phenotype that drives mutation spectrum variation within Saccharomyces cerevisiae. eLife 2021;10:68285. [PMID: 34523420 PMCID: PMC8497059 DOI: 10.7554/elife.68285] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 09/14/2021] [Indexed: 12/23/2022] Open

Kritioti E, Theodosiou A, Parpaite T, Alexandrou A, Nicolaou N, Papaevripidou I, Séjourné N, Coste B, Christophidou-Anastasiadou V, Tanteles GA, Sismani C. Unravelling the genetic causes of multiple malformation syndromes: A whole exome sequencing study of the Cypriot population. PLoS One 2021;16:e0253562. [PMID: 34324503 PMCID: PMC8320927 DOI: 10.1371/journal.pone.0253562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 06/08/2021] [Indexed: 11/19/2022] Open

Bohutínská M, Handrick V, Yant L, Schmickl R, Kolář F, Bomblies K, Paajanen P. De Novo Mutation and Rapid Protein (Co-)evolution during Meiotic Adaptation in Arabidopsis arenosa. Mol Biol Evol 2021;38:1980-1994. [PMID: 33502506 PMCID: PMC8097281 DOI: 10.1093/molbev/msab001] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Norn C, André I, Theobald DL. A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices. Protein Sci 2021;30:2057-2068. [PMID: 34218472 PMCID: PMC8442976 DOI: 10.1002/pro.4155] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 12/30/2022]

Moreira A, Croze M, Delehelle F, Cussat-Blanc S, Luga H, Mollereau C, Balaresque P. Hearing Sensitivity of Primates: Recurrent and Episodic Positive Selection in Hair Cells and Stereocilia Protein-Coding Genes. Genome Biol Evol 2021;13:6302699. [PMID: 34137817 PMCID: PMC8358225 DOI: 10.1093/gbe/evab133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/06/2021] [Indexed: 12/29/2022] Open

Sandler G, Wright SI, Agrawal AF. Patterns and Causes of Signed Linkage Disequilibria in Flies and Plants. Mol Biol Evol 2021;38:4310-4321. [PMID: 34097067 PMCID: PMC8476167 DOI: 10.1093/molbev/msab169] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes. PLoS One 2021;16:e0248337. [PMID: 33711070 PMCID: PMC7954308 DOI: 10.1371/journal.pone.0248337] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 02/24/2021] [Indexed: 01/03/2023] Open

Walker CR, Scally A, De Maio N, Goldman N. Short-range template switching in great ape genomes explored using pair hidden Markov models. PLoS Genet 2021;17:e1009221. [PMID: 33651813 PMCID: PMC7954356 DOI: 10.1371/journal.pgen.1009221] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 03/12/2021] [Accepted: 02/10/2021] [Indexed: 12/14/2022] Open

Abstract

Many complex genomic rearrangements arise through template switch errors, which occur in DNA replication when there is a transient polymerase switch to an alternate template nearby in three-dimensional space. While typically investigated at kilobase-to-megabase scales, the genomic and evolutionary consequences of this mutational process are not well characterised at smaller scales, where they are often interpreted as clusters of independent substitutions, insertions and deletions. Here we present an improved statistical approach using pair hidden Markov models, and use it to detect and describe short-range template switches underlying clusters of mutations in the multi-way alignment of hominid genomes. Using robust statistics derived from evolutionary genomic simulations, we show that template switch events have been widespread in the evolution of the great apes’ genomes and provide a parsimonious explanation for the presence of many complex mutation clusters in their phylogenetic context. Larger-scale mechanisms of genome rearrangement are typically associated with structural features around breakpoints, and accordingly we show that atypical patterns of secondary structure formation and DNA bending are present at the initial template switch loci. Our methods improve on previous non-probabilistic approaches for computational detection of template switch mutations, allowing the statistical significance of events to be assessed. By specifying realistic evolutionary parameters based on the genomes and taxa involved, our methods can be readily adapted to other intra- or inter-species comparisons.

DNA replication is an imperfect process which causes the mutations that give rise to genetic diversity during the evolution of genomes. While many mutations are independent, single-nucleotide substitutions or small insertions and deletions, some mutations arise as nonindependent clusters of substitutions and larger scale chromosomal rearrangements. Large-scale rearrangements (also called structural variants) in particular can have a profound impact on genome evolution and contribute to both germline and somatic disease in humans. The replication-based mechanisms underlying structural variation typically involve a polymerase switch event in which a large segment of DNA is copied using a template from an alternate location in the genome. Methods for identifying these template switch mutations lack the power to detect smaller scale rearrangements which can arise through the same replication-based pathways. Here we outline a model which can detect and assess the statistical significance of such small-scale template switches within their evolutionary context. We show that these events are widespread in the evolution of great apes and that the genomic features associated with these small-scale rearrangements are similar to those of large-scale structural variants.

Collapse

Jones CT, Youssef N, Susko E, Bielawski JP. A Phenotype-Genotype Codon Model for Detecting Adaptive Evolution. Syst Biol 2021;69:722-738. [PMID: 31730199 DOI: 10.1093/sysbio/syz075] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 11/09/2019] [Accepted: 11/11/2019] [Indexed: 01/03/2023] Open

Abstract

A central objective in biology is to link adaptive evolution in a gene to structural and/or functional phenotypic novelties. Yet most analytic methods make inferences mainly from either phenotypic data or genetic data alone. A small number of models have been developed to infer correlations between the rate of molecular evolution and changes in a discrete or continuous life history trait. But such correlations are not necessarily evidence of adaptation. Here, we present a novel approach called the phenotype-genotype branch-site model (PG-BSM) designed to detect evidence of adaptive codon evolution associated with discrete-state phenotype evolution. An episode of adaptation is inferred under standard codon substitution models when there is evidence of positive selection in the form of an elevation in the nonsynonymous-to-synonymous rate ratio $\omega$ to a value $\omega > 1$. As it is becoming increasingly clear that $\omega > 1$ can occur without adaptation, the PG-BSM was formulated to infer an instance of adaptive evolution without appealing to evidence of positive selection. The null model makes use of a covarion-like component to account for general heterotachy (i.e., random changes in the evolutionary rate at a site over time). The alternative model employs samples of the phenotypic evolutionary history to test for phenomenological patterns of heterotachy consistent with specific mechanisms of molecular adaptation. These include 1) a persistent increase/decrease in $\omega$ at a site following a change in phenotype (the pattern) consistent with an increase/decrease in the functional importance of the site (the mechanism); and 2) a transient increase in $\omega$ at a site along a branch over which the phenotype changed (the pattern) consistent with a change in the site's optimal amino acid (the mechanism). Rejection of the null is followed by post hoc analyses to identify sites with strongest evidence for adaptation in association with changes in the phenotype as well as the most likely evolutionary history of the phenotype. Simulation studies based on a novel method for generating mechanistically realistic signatures of molecular adaptation show that the PG-BSM has good statistical properties. Analyses of real alignments show that site patterns identified post hoc are consistent with the specific mechanisms of adaptation included in the alternate model. Further simulation studies show that the covarion-like component of the PG-BSM plays a crucial role in mitigating recently discovered statistical pathologies associated with confounding by accounting for heterotachy-by-any-cause. [Adaptive evolution; branch-site model; confounding; mutation-selection; phenotype-genotype.].

Collapse

Cohen ZP, Brevik K, Chen YH, Hawthorne DJ, Weibel BD, Schoville SD. Elevated rates of positive selection drive the evolution of pestiferousness in the Colorado potato beetle (Leptinotarsa decemlineata, Say). Mol Ecol 2020;30:237-254. [PMID: 33095936 DOI: 10.1111/mec.15703] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Revised: 09/28/2020] [Accepted: 10/15/2020] [Indexed: 12/16/2022]

Abstract

Contextualizing evolutionary history and identifying genomic features of an insect that might contribute to its pest status is important in developing early detection and control tactics. In order to understand the evolution of pestiferousness, which we define as the accumulation of traits that contribute to an insect population's success in an agroecosystem, we tested the importance of known genomic properties associated with rapid adaptation in the Colorado potato beetle (CPB), Leptinotarsa decemlineata Say. Within the leaf beetle genus Leptinotarsa, only CPB, and a few populations therein, has risen to pest status on cultivated nightshades, Solanum. Using whole genomes from ten closely related Leptinotarsa species native to the United States, we reconstructed a high-quality species tree and used this phylogenetic framework to assess evolutionary patterns in four genomic features of rapid adaptation: standing genetic variation, gene family expansion and contraction, transposable element abundance and location, and positive selection at protein-coding genes. Throughout approximately 20 million years of history, Leptinotarsa species show little evidence of gene family turnover and transposable element variation. However, there is a clear pattern of CPB experiencing higher rates of positive selection on protein-coding genes. We determine that these rates are associated with greater standing genetic variation due to larger effective population size, which supports the theory that the demographic history contributes to rates of protein evolution. Furthermore, we identify a suite of coding genes under positive selection that are putatively associated with pestiferousness in the Colorado potato beetle lineage. They are involved in the biological processes of xenobiotic detoxification, chemosensation and hormone function.

Collapse

Sackton TB. Studying Natural Selection in the Era of Ubiquitous Genomes. Trends Genet 2020;36:792-803. [DOI: 10.1016/j.tig.2020.07.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 07/10/2020] [Accepted: 07/13/2020] [Indexed: 01/15/2023]

Comparative Analysis of Sequence Polymorphism in Complete Organelle Genomes of the ‘Golden Tide’ Seaweed Sargassum horneri between Korean and Chinese Forms. SUSTAINABILITY 2020. [DOI: 10.3390/su12187280] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Low Base-Substitution Mutation Rate but High Rate of Slippage Mutations in the Sequence Repeat-Rich Genome of Dictyostelium discoideum. G3-GENES GENOMES GENETICS 2020;10:3445-3452. [PMID: 32732307 PMCID: PMC7466956 DOI: 10.1534/g3.120.401578] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Estimation of the Genome-Wide Mutation Rate and Spectrum in the Archaeal Species Haloferax volcanii. Genetics 2020;215:1107-1116. [PMID: 32513815 DOI: 10.1534/genetics.120.303299] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 05/26/2020] [Indexed: 12/26/2022] Open

Wang Q, Pierce-Hoffman E, Cummings BB, Alföldi J, Francioli LC, Gauthier LD, Hill AJ, O'Donnell-Luria AH, Karczewski KJ, MacArthur DG. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes. Nat Commun 2020;11:2539. [PMID: 32461613 PMCID: PMC7253413 DOI: 10.1038/s41467-019-12438-5] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open

Affiliation(s)

Qingbo Wang Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
Emma Pierce-Hoffman Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Beryl B Cummings Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA Program in Biomedical and Biological Sciences, Harvard Medical School, Boston, MA, 02115, USA
Jessica Alföldi Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Laurent C Francioli Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Laura D Gauthier Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Andrew J Hill Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
Anne H O'Donnell-Luria Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Konrad J Karczewski Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Daniel G MacArthur Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA. Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia. Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.

Collapse

The Tempo and Mode of Angiosperm Mitochondrial Genome Divergence Inferred from Intraspecific Variation in Arabidopsis thaliana. G3-GENES GENOMES GENETICS 2020;10:1077-1086. [PMID: 31964685 PMCID: PMC7056966 DOI: 10.1534/g3.119.401023] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Mugal CF, Kutschera VE, Botero-Castro F, Wolf JBW, Kaj I. Polymorphism Data Assist Estimation of the Nonsynonymous over Synonymous Fixation Rate Ratio ω for Closely Related Species. Mol Biol Evol 2020;37:260-279. [PMID: 31504782 PMCID: PMC6984366 DOI: 10.1093/molbev/msz203] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Belinky F, Sela I, Rogozin IB, Koonin EV. Crossing fitness valleys via double substitutions within codons. BMC Biol 2019;17:105. [PMID: 31842858 PMCID: PMC6916188 DOI: 10.1186/s12915-019-0727-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 11/20/2019] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

Single nucleotide substitutions in protein-coding genes can be divided into synonymous (S), with little fitness effect, and non-synonymous (N) ones that alter amino acids and thus generally have a greater effect. Most of the N substitutions are affected by purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases potentially could alleviate the deleterious effect of single substitutions, making them subject to positive selection. To elucidate the effects of selection on double substitutions in all codons, it is critical to differentiate selection from mutational biases.

RESULTS

We addressed the evolutionary regimes of within-codon double substitutions in 37 groups of closely related prokaryotic genomes from diverse phyla by comparing the fractions of double substitutions within codons to those of the equivalent double S substitutions in adjacent codons. Under the assumption that substitutions occur one at a time, all within-codon double substitutions can be represented as "ancestral-intermediate-final" sequences (where "intermediate" refers to the first single substitution and "final" refers to the second substitution) and can be partitioned into four classes: (1) SS, S intermediate-S final; (2) SN, S intermediate-N final; (3) NS, N intermediate-S final; and (4) NN, N intermediate-N final. We found that the selective pressure on the second substitution markedly differs among these classes of double substitutions. Analogous to single S (synonymous) substitutions, SS double substitutions evolve neutrally, whereas analogous to single N (non-synonymous) substitutions, SN double substitutions are subject to purifying selection. In contrast, NS show positive selection on the second step because the original amino acid is recovered. The NN double substitutions are heterogeneous and can be subject to either purifying or positive selection, or evolve neutrally, depending on the amino acid similarity between the final or intermediate and the ancestral states.

CONCLUSIONS

The results of the present, comprehensive analysis of the evolutionary landscape of within-codon double substitutions reaffirm the largely conservative regime of protein evolution. However, the second step of a double substitution can be subject to positive selection when the first step is deleterious. Such positive selection can result in frequent crossing of valleys on the fitness landscape.

Collapse

Beichman AC, Koepfli KP, Li G, Murphy W, Dobrynin P, Kliver S, Tinker MT, Murray MJ, Johnson J, Lindblad-Toh K, Karlsson EK, Lohmueller KE, Wayne RK. Aquatic Adaptation and Depleted Diversity: A Deep Dive into the Genomes of the Sea Otter and Giant Otter. Mol Biol Evol 2019;36:2631-2655. [PMID: 31212313 PMCID: PMC7967881 DOI: 10.1093/molbev/msz101] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Affiliation(s)

Annabel C Beichman Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA
Klaus-Peter Koepfli Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russian Federation
Gang Li College of Life Science, Shaanxi Normal University, Xi’an, Shaanxi, China
William Murphy Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
Pasha Dobrynin Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russian Federation
Sergei Kliver Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russian Federation
Martin T Tinker Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA
Michael J Murray Monterey Bay Aquarium, Monterey, CA
Jeremy Johnson Vertebrate Genome Biology, Broad Institute of MIT and Harvard, Cambridge, MA
Kerstin Lindblad-Toh Vertebrate Genome Biology, Broad Institute of MIT and Harvard, Cambridge, MA Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
Elinor K Karlsson Vertebrate Genome Biology, Broad Institute of MIT and Harvard, Cambridge, MA Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA
Kirk E Lohmueller Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
Robert K Wayne Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA

Collapse

Dapper AL, Payseur BA. Molecular evolution of the meiotic recombination pathway in mammals. Evolution 2019;73:2368-2389. [PMID: 31579931 DOI: 10.1111/evo.13850] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 09/07/2019] [Indexed: 02/06/2023]

Goldmann JM, Veltman JA, Gilissen C. De Novo Mutations Reflect Development and Aging of the Human Germline. Trends Genet 2019;35:828-839. [PMID: 31610893 DOI: 10.1016/j.tig.2019.08.005] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 08/15/2019] [Accepted: 08/28/2019] [Indexed: 01/19/2023]

Richards JK, Stukenbrock EH, Carpenter J, Liu Z, Cowger C, Faris JD, Friesen TL. Local adaptation drives the diversification of effectors in the fungal wheat pathogen Parastagonospora nodorum in the United States. PLoS Genet 2019;15:e1008223. [PMID: 31626626 PMCID: PMC6821140 DOI: 10.1371/journal.pgen.1008223] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 10/30/2019] [Accepted: 08/25/2019] [Indexed: 12/22/2022] Open

Abstract

Filamentous fungi rapidly evolve in response to environmental selection pressures in part due to their genomic plasticity. Parastagonospora nodorum, a fungal pathogen of wheat and causal agent of septoria nodorum blotch, responds to selection pressure exerted by its host, influencing the gain, loss, or functional diversification of virulence determinants, known as effector genes. Whole genome resequencing of 197 P. nodorum isolates collected from spring, durum, and winter wheat production regions of the United States enabled the examination of effector diversity and genomic regions under selection specific to geographically discrete populations. 1,026,859 SNPs/InDels were used to identify novel loci, as well as SnToxA and SnTox3 as factors in disease. Genes displaying presence/absence variation, predicted effector genes, and genes localized on an accessory chromosome had significantly higher pN/pS ratios, indicating a higher rate of sequence evolution. Population structure analyses indicated two P. nodorum populations corresponding to the Upper Midwest (Population 1) and Southern/Eastern United States (Population 2). Prevalence of SnToxA varied greatly between the two populations which correlated with presence of the host sensitivity gene Tsn1 in the most prevalent cultivars in the corresponding regions. Additionally, 12 and 5 candidate effector genes were observed to be under diversifying selection among isolates from Population 1 and 2, respectively, but under purifying selection or neutrally evolving in the opposite population. Selective sweep analysis revealed 10 and 19 regions that had recently undergone positive selection in Population 1 and 2, respectively, involving 92 genes in total. When comparing genes with and without presence/absence variation, those genes exhibiting this variation were significantly closer to transposable elements. Taken together, these results indicate that P. nodorum is rapidly adapting to distinct selection pressures unique to spring and winter wheat production regions by rapid adaptive evolution and various routes of genomic diversification, potentially facilitated through transposable element activity.

Collapse

Delmont TO, Kiefl E, Kilinc O, Esen OC, Uysal I, Rappé MS, Giovannoni S, Eren AM. Single-amino acid variants reveal evolutionary processes that shape the biogeography of a global SAR11 subclade. eLife 2019;8:46497. [PMID: 31478833 PMCID: PMC6721796 DOI: 10.7554/elife.46497] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Accepted: 08/13/2019] [Indexed: 12/14/2022] Open

Amos W. Flanking heterozygosity influences the relative probability of different base substitutions in humans. ROYAL SOCIETY OPEN SCIENCE 2019;6:191018. [PMID: 31598319 PMCID: PMC6774961 DOI: 10.1098/rsos.191018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 08/30/2019] [Indexed: 06/10/2023]

Rana S, Valentin K, Bartsch I, Glöckner G. Loss of a chloroplast encoded function could influence species range in kelp. Ecol Evol 2019;9:8759-8770. [PMID: 31410278 PMCID: PMC6686309 DOI: 10.1002/ece3.5428] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 05/16/2019] [Accepted: 06/15/2019] [Indexed: 12/25/2022] Open

Kaplanis J, Akawi N, Gallone G, McRae JF, Prigmore E, Wright CF, Fitzpatrick DR, Firth HV, Barrett JC, Hurles ME. Exome-wide assessment of the functional impact and pathogenicity of multinucleotide mutations. Genome Res 2019;29:1047-1056. [PMID: 31227601 PMCID: PMC6633265 DOI: 10.1101/gr.239756.118] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 05/24/2019] [Indexed: 01/25/2023]

Prendergast JGD, Pugh C, Harris SE, Hume DA, Deary IJ, Beveridge A. Linked Mutations at Adjacent Nucleotides Have Shaped Human Population Differentiation and Protein Evolution. Genome Biol Evol 2019;11:759-775. [PMID: 30689878 PMCID: PMC6424222 DOI: 10.1093/gbe/evz014] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/18/2019] [Indexed: 02/06/2023] Open

Ghafari M, Weissman DB. The expected time to cross extended fitness plateaus. Theor Popul Biol 2019;129:54-67. [PMID: 31054850 DOI: 10.1016/j.tpb.2019.03.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2018] [Revised: 12/28/2018] [Accepted: 03/05/2019] [Indexed: 10/25/2022]

Bowden R, Davies RW, Heger A, Pagnamenta AT, de Cesare M, Oikkonen LE, Parkes D, Freeman C, Dhalla F, Patel SY, Popitsch N, Ip CLC, Roberts HE, Salatino S, Lockstone H, Lunter G, Taylor JC, Buck D, Simpson MA, Donnelly P. Sequencing of human genomes with nanopore technology. Nat Commun 2019;10:1869. [PMID: 31015479 PMCID: PMC6478738 DOI: 10.1038/s41467-019-09637-5] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 03/19/2019] [Indexed: 12/17/2022] Open

Affiliation(s)

Rory Bowden Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Robert W Davies Genomics plc, Oxford, OX1 1JD, UK Program in Genetics and Genomic Biology and The Centre for Applied Genomics, Hospital for Sick Children, Toronto, M5G 0A4, Canada
Andreas Heger Genomics plc, Oxford, OX1 1JD, UK
Alistair T Pagnamenta Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
Mariateresa de Cesare Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Laura E Oikkonen Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Duncan Parkes Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Colin Freeman Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Fatima Dhalla Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK Developmental Immunology Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, OX3 9DS, UK
Smita Y Patel Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK Clinical Immunology Group, National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
Niko Popitsch Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK Children's Cancer Research Institute, St. Anna Kinderkrebsforschung, 1090, Vienna, Austria
Camilla L C Ip Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Hannah E Roberts Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Silvia Salatino Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Helen Lockstone Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Gerton Lunter Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK Genomics plc, Oxford, OX1 1JD, UK
Jenny C Taylor Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
David Buck Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Michael A Simpson Genomics plc, Oxford, OX1 1JD, UK
Peter Donnelly Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK. Genomics plc, Oxford, OX1 1JD, UK. Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.

Collapse

Dunn KA, Kenney T, Gu H, Bielawski JP. Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates. BMC Evol Biol 2019;19:22. [PMID: 30642241 PMCID: PMC6332903 DOI: 10.1186/s12862-018-1326-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/11/2018] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

An excess of nonsynonymous substitutions, over neutrality, is considered evidence of positive Darwinian selection. Inference for proteins often relies on estimation of the nonsynonymous to synonymous ratio (ω = dN/dS) within a codon model. However, to ease computational difficulties, ω is typically estimated assuming an idealized substitution process where (i) all nonsynonymous substitutions have the same rate (regardless of impact on organism fitness) and (ii) instantaneous double and triple (DT) nucleotide mutations have zero probability (despite evidence that they can occur). It follows that estimates of ω represent an imperfect summary of the intensity of selection, and that tests based on the ω > 1 threshold could be negatively impacted.

RESULTS

We developed a general-purpose parametric (GPP) modelling framework for codons. This novel approach allows specification of all possible instantaneous codon substitutions, including multiple nonsynonymous rates (MNRs) and instantaneous DT nucleotide changes. Existing codon models are specified as special cases of the GPP model. We use GPP models to implement likelihood ratio tests for ω > 1 that accommodate MNRs and DT mutations. Through both simulation and real data analysis, we find that failure to model MNRs and DT mutations reduces power in some cases and inflates false positives in others. False positives under traditional M2a and M8 models were very sensitive to DT changes. This was exacerbated by the choice of frequency parameterization (GY vs. MG), with rates sometimes > 90% under MG. By including MNRs and DT mutations, accuracy and power was greatly improved under the GPP framework. However, we also find that over-parameterized models can perform less well, and this can contribute to degraded performance of LRTs.

CONCLUSIONS

We suggest GPP models should be used alongside traditional codon models. Further, all codon models should be deployed within an experimental design that includes (i) assessing robustness to model assumptions, and (ii) investigation of non-standard behaviour of MLEs. As the goal of every analysis is to avoid false conclusions, more work is needed on model selection methods that consider both the increase in fit engendered by a model parameter and the degree to which that parameter is affected by un-modelled evolutionary processes.

Collapse

Looking for Darwin in Genomic Sequences: Validity and Success Depends on the Relationship Between Model and Data. Methods Mol Biol 2019;1910:399-426. [PMID: 31278672 DOI: 10.1007/978-1-4939-9074-0_13] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Abstract

Codon substitution models (CSMs) are commonly used to infer the history of natural section for a set of protein-coding sequences, often with the explicit goal of detecting the signature of positive Darwinian selection. However, the validity and success of CSMs used in conjunction with the maximum likelihood (ML) framework is sometimes challenged with claims that the approach might too often support false conclusions. In this chapter, we use a case study approach to identify four legitimate statistical difficulties associated with inference of evolutionary events using CSMs. These include: (1) model misspecification, (2) low information content, (3) the confounding of processes, and (4) phenomenological load, or PL. While past criticisms of CSMs can be connected to these issues, the historical critiques were often misdirected, or overstated, because they failed to recognize that the success of any model-based approach depends on the relationship between model and data. Here, we explore this relationship and provide a candid assessment of the limitations of CSMs to extract historical information from extant sequences. To aid in this assessment, we provide a brief overview of: (1) a more realistic way of thinking about the process of codon evolution framed in terms of population genetic parameters, and (2) a novel presentation of the ML statistical framework. We then divide the development of CSMs into two broad phases of scientific activity and show that the latter phase is characterized by increases in model complexity that can sometimes negatively impact inference of evolutionary mechanisms. Such problems are not yet widely appreciated by the users of CSMs. These problems can be avoided by using a model that is appropriate for the data; but, understanding the relationship between the data and a fitted model is a difficult task. We argue that the only way to properly understand that relationship is to perform in silico experiments using a generating process that can mimic the data as closely as possible. The mutation-selection modeling framework (MutSel) is presented as the basis of such a generating process. We contend that if complex CSMs continue to be developed for testing explicit mechanistic hypotheses, then additional analyses such as those described in here (e.g., penalized LRTs and estimation of PL) will need to be applied alongside the more traditional inferential methods.

Collapse

Fine-Grained Analysis of Spontaneous Mutation Spectrum and Frequency in Arabidopsis thaliana. Genetics 2018;211:703-714. [PMID: 30514707 PMCID: PMC6366913 DOI: 10.1534/genetics.118.301721] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 11/29/2018] [Indexed: 01/17/2023] Open

Senra MVX, Sung W, Ackerman M, Miller SF, Lynch M, Soares CAG. An Unbiased Genome-Wide View of the Mutation Rate and Spectrum of the Endosymbiotic Bacterium Teredinibacter turnerae. Genome Biol Evol 2018;10:723-730. [PMID: 29415256 PMCID: PMC5833318 DOI: 10.1093/gbe/evy027] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/02/2018] [Indexed: 12/14/2022] Open

Thomas GWC, Wang RJ, Puri A, Harris RA, Raveendran M, Hughes DST, Murali SC, Williams LE, Doddapaneni H, Muzny DM, Gibbs RA, Abee CR, Galinski MR, Worley KC, Rogers J, Radivojac P, Hahn MW. Reproductive Longevity Predicts Mutation Rates in Primates. Curr Biol 2018;28:3193-3197.e5. [PMID: 30270182 DOI: 10.1016/j.cub.2018.08.050] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 07/26/2018] [Accepted: 08/22/2018] [Indexed: 12/30/2022]

Affiliation(s)

Gregg W C Thomas Department of Biology, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA; Department of Computer Science, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA.
Richard J Wang Department of Biology, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA
Arthi Puri Department of Computer Science, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA
R Alan Harris Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Muthuswamy Raveendran Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Daniel S T Hughes Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Shwetha C Murali Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Lawrence E Williams Keeling Center for Comparative Medicine and Research, University of Texas, MD Anderson Cancer Center, 650 Cool Water Drive, Bastrop, TX 78602, USA
Harsha Doddapaneni Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Donna M Muzny Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Richard A Gibbs Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Christian R Abee Keeling Center for Comparative Medicine and Research, University of Texas, MD Anderson Cancer Center, 650 Cool Water Drive, Bastrop, TX 78602, USA
Mary R Galinski Emory Vaccine Center, Yerkes National Primate Research Center, Emory University, 201 Dowman Drive, Atlanta, GA, USA; Division of Infectious Diseases, Department of Medicine, Emory University, 201 Dowman Drive, Atlanta, GA, USA
Kim C Worley Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Jeffrey Rogers Human Genome Sequencing Center, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA
Predrag Radivojac Department of Computer Science, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA
Matthew W Hahn Department of Biology, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA; Department of Computer Science, Indiana University, 107 S. Indiana Avenue, Bloomington, IN 47405, USA.

Collapse