Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Harris K, Nielsen R. Error-prone polymerase activity causes multinucleotide mutations in humans. Genome Res 2014;24:1445-54. [PMID: 25079859 PMCID: PMC4158752 DOI: 10.1101/gr.170696.113] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

For:	Harris K, Nielsen R. Error-prone polymerase activity causes multinucleotide mutations in humans. Genome Res 2014;24:1445-54. [PMID: 25079859 PMCID: PMC4158752 DOI: 10.1101/gr.170696.113] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Number

Cited by Other Article(s)

Prentout D, Bykova D, Hoge C, Hooper DM, McDiarmid CS, Wu F, Griffith SC, de Manuel M, Przeworski M. Conservation of mutation and recombination parameters between mammals and zebra finch. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.05.611523. [PMID: 39282267 PMCID: PMC11398497 DOI: 10.1101/2024.09.05.611523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/21/2024]

Abstract

Most of our understanding of the fundamental processes of mutation and recombination stems from a handful of disparate model organisms and pedigree studies of mammals, with little known about other vertebrates. To gain a broader comparative perspective, we focused on the zebra finch (Taeniopygia castanotis), which, like other birds, differs from mammals in its karyotype (which includes many micro-chromosomes), in the mechanism by which recombination is directed to the genome, and in aspects of ontogenesis. We collected genome sequences from three generation pedigrees that provide information about 80 meioses, inferring 202 single-point de novo mutations, 1,174 crossovers, and 275 non-crossovers. On that basis, we estimated a sex-averaged mutation rate of 5.0 × 10-9 per base pair per generation, on par with mammals that have a similar generation time. Also as in mammals, we found a paternal germline mutation bias at later stages of gametogenesis (of 1.7 to 1) but no discernible difference between sexes in early development. We also examined recombination patterns, and found that the sex-averaged crossover rate on macro-chromosomes (1.05 cM/Mb) is again similar to values observed in mammals, as is the spatial distribution of crossovers, with a pronounced enrichment near telomeres. In contrast, non-crossover rates are more uniformly distributed. On micro-chromosomes, sex-averaged crossover rates are substantially higher (4.21 cM/Mb), as expected from crossover homeostasis, and both crossover and non-crossover events are more uniformly distributed. At a finer scale, recombination events overlap CpG islands more often than expected by chance, as expected in the absence of PRDM9. Despite differences in the mechanism by which recombination events are specified and the presence of many micro-chromosomes, estimates of the degree of GC-biased gene conversion (59%), the mean non-crossover conversion tract length (~23 bp), and the non-crossover to crossover ratio (6.7:1) are all comparable to those reported in primates and mice. The conservation of mutation and recombination properties from zebra finch to mammals suggest that these processes have evolved under stabilizing selection.

Collapse

Iyengar BR, Grandchamp A, Bornberg-Bauer E. How antisense transcripts can evolve to encode novel proteins. Nat Commun 2024;15:6187. [PMID: 39043684 PMCID: PMC11266595 DOI: 10.1038/s41467-024-50550-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 07/12/2024] [Indexed: 07/25/2024] Open

Lynch M, Ali F, Lin T, Wang Y, Ni J, Long H. The divergence of mutation rates and spectra across the Tree of Life. EMBO Rep 2023;24:e57561. [PMID: 37615267 PMCID: PMC10561183 DOI: 10.15252/embr.202357561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/01/2023] [Accepted: 08/02/2023] [Indexed: 08/25/2023] Open

Lucaci AG, Zehr JD, Enard D, Thornton JW, Kosakovsky Pond SL. Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses. Mol Biol Evol 2023;40:msad150. [PMID: 37395787 PMCID: PMC10336034 DOI: 10.1093/molbev/msad150] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/15/2023] [Accepted: 06/26/2023] [Indexed: 07/04/2023] Open

Abstract

Inference and interpretation of evolutionary processes, in particular of the types and targets of natural selection affecting coding sequences, are critically influenced by the assumptions built into statistical models and tests. If certain aspects of the substitution process (even when they are not of direct interest) are presumed absent or are modeled with too crude of a simplification, estimates of key model parameters can become biased, often systematically, and lead to poor statistical performance. Previous work established that failing to accommodate multinucleotide (or multihit, MH) substitutions strongly biases dN/dS-based inference towards false-positive inferences of diversifying episodic selection, as does failing to model variation in the rate of synonymous substitution (SRV) among sites. Here, we develop an integrated analytical framework and software tools to simultaneously incorporate these sources of evolutionary complexity into selection analyses. We found that both MH and SRV are ubiquitous in empirical alignments, and incorporating them has a strong effect on whether or not positive selection is detected (1.4-fold reduction) and on the distributions of inferred evolutionary rates. With simulation studies, we show that this effect is not attributable to reduced statistical power caused by using a more complex model. After a detailed examination of 21 benchmark alignments and a new high-resolution analysis showing which parts of the alignment provide support for positive selection, we show that MH substitutions occurring along shorter branches in the tree explain a significant fraction of discrepant results in selection detection. Our results add to the growing body of literature which examines decades-old modeling assumptions (including MH) and finds them to be problematic for comparative genomic data analysis. Because multinucleotide substitutions have a significant impact on natural selection detection even at the level of an entire gene, we recommend that selection analyses of this type consider their inclusion as a matter of routine. To facilitate this procedure, we developed, implemented, and benchmarked a simple and well-performing model testing selection detection framework able to screen an alignment for positive selection with two biologically important confounding processes: site-to-site synonymous rate variation, and multinucleotide instantaneous substitutions.

Collapse

Iyengar BR, Bornberg-Bauer E. Neutral Models of De Novo Gene Emergence Suggest that Gene Evolution has a Preferred Trajectory. Mol Biol Evol 2023;40:msad079. [PMID: 37011142 PMCID: PMC10118301 DOI: 10.1093/molbev/msad079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 03/01/2023] [Accepted: 03/28/2023] [Indexed: 04/05/2023] Open

Silva SR, Miranda VFO, Michael TP, Płachno BJ, Matos RG, Adamec L, Pond SLK, Lucaci AG, Pinheiro DG, Varani AM. The phylogenomics and evolutionary dynamics of the organellar genomes in carnivorous Utricularia and Genlisea species (Lentibulariaceae). Mol Phylogenet Evol 2023;181:107711. [PMID: 36693533 DOI: 10.1016/j.ympev.2023.107711] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 01/13/2023] [Accepted: 01/18/2023] [Indexed: 01/22/2023]

Gupta MK, Vadde R. Next-generation development and application of codon model in evolution. Front Genet 2023;14:1091575. [PMID: 36777719 PMCID: PMC9911445 DOI: 10.3389/fgene.2023.1091575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open

Wang Y, Zhang L, Zhou Y, Ma W, Li M, Guo P, Feng L, Fu C. Using landscape genomics to assess local adaptation and genomic vulnerability of a perennial herb Tetrastigma hemsleyanum (Vitaceae) in subtropical China. Front Genet 2023;14:1150704. [PMID: 37144128 PMCID: PMC10151583 DOI: 10.3389/fgene.2023.1150704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 04/04/2023] [Indexed: 05/06/2023] Open

Abstract

Understanding adaptive genetic variation of plant populations and their vulnerabilities to climate change are critical to preserve biodiversity and subsequent management interventions. To this end, landscape genomics may represent a cost-efficient approach for investigating molecular signatures underlying local adaptation. Tetrastigma hemsleyanum is, in its native habitat, a widespread perennial herb of warm-temperate evergreen forest in subtropical China. Its ecological and medicinal values constitute a significant revenue for local human populations and ecosystem. Using 30,252 single nucleotide polymorphisms (SNPs) derived from reduced-representation genome sequencing in 156 samples from 24 sites, we conducted a landscape genomics study of the T. hemsleyanum to elucidate its genomic variation across multiple climate gradients and genomic vulnerability to future climate change. Multivariate methods identified that climatic variation explained more genomic variation than that of geographical distance, which implied that local adaptation to heterogeneous environment might represent an important source of genomic variation. Among these climate variables, winter precipitation was the strongest predictor of the contemporary genetic structure. F _ST outlier tests and environment association analysis totally identified 275 candidate adaptive SNPs along the genetic and environmental gradients. SNP annotations of these putatively adaptive loci uncovered gene functions associated with modulating flowering time and regulating plant response to abiotic stresses, which have implications for breeding and other special agricultural aims on the basis of these selection signatures. Critically, modelling revealed that the high genomic vulnerability of our focal species via a mismatch between current and future genotype-environment relationships located in central-northern region of the T. hemsleyanum's range, where populations require proactive management efforts such as assistant adaptation to cope with ongoing climate change. Taken together, our results provide robust evidence of local climate adaption for T. hemsleyanum and further deepen our understanding of adaptation basis of herbs in subtropical China.

Collapse

Parada-Márquez JF, Maldonado-Rodriguez ND, Triana-Fonseca P, Contreras-Bravo NC, Calderón-Ospina CA, Restrepo CM, Morel A, Ortega-Recalde OJ, Silgado-Guzmán DF, Angulo-Aguado M, Fonseca-Mendoza DJ. Pharmacogenomic profile of actionable molecular variants related to drugs commonly used in anesthesia: WES analysis reveals new mutations. Front Pharmacol 2023;14:1047854. [PMID: 37021041 PMCID: PMC10069477 DOI: 10.3389/fphar.2023.1047854] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 03/06/2023] [Indexed: 04/07/2023] Open

Abstract

Background: Genetic interindividual variability is associated with adverse drug reactions (ADRs) and affects the response to common drugs used in anesthesia. Despite their importance, these variants remain largely underexplored in Latin-American countries. This study describes rare and common variants found in genes related to metabolism of analgesic and anaesthetic drug in the Colombian population. Methods: We conducted a study that included 625 Colombian healthy individuals. We generated a subset of 14 genes implicated in metabolic pathways of common medications used in anesthesia and assessed them by whole-exome sequencing (WES). Variants were filtered using two pipelines: A) novel or rare (minor allele frequency-MAF <1%) variants including missense, loss-of-function (LoF, e.g., frameshift, nonsense), and splice site variants with potential deleterious effect and B) clinically validated variants described in the PharmGKB (categories 1, 2 and 3) and/or ClinVar databases. For rare and novel missense variants, we applied an optimized prediction framework (OPF) to assess the functional impact of pharmacogenetic variants. Allelic, genotypic frequencies and Hardy-Weinberg equilibrium were calculated. We compare our allelic frequencies with these from populations described in the gnomAD database. Results: Our study identified 148 molecular variants potentially related to variability in the therapeutic response to 14 drugs commonly used in anesthesiology. 83.1% of them correspond to rare and novel missense variants classified as pathogenic according to the pharmacogenetic optimized prediction framework, 5.4% were loss-of-function (LoF), 2.7% led to potential splicing alterations and 8.8% were assigned as actionable or informative pharmacogenetic variants. Novel variants were confirmed by Sanger sequencing. Allelic frequency comparison showed that the Colombian population has a unique pharmacogenomic profile for anesthesia drugs with some allele frequencies different from other populations. Conclusion: Our results demonstrated high allelic heterogeneity among the analyzed sampled, enriched by rare (91.2%) variants in pharmacogenes related to common drugs used in anesthesia. The clinical implications of these results highlight the importance of implementation of next-generation sequencing data into pharmacogenomic approaches and personalized medicine.

Collapse

Affiliation(s)

Juan Fernando Parada-Márquez Department of Molecular Diagnosis, Genética Molecular de Colombia SAS, Bogotá, Colombia
Nicolás David Maldonado-Rodriguez Department of Molecular Diagnosis, Genética Molecular de Colombia SAS, Bogotá, Colombia
Paula Triana-Fonseca Department of Molecular Diagnosis, Genética Molecular de Colombia SAS, Bogotá, Colombia
Nora Constanza Contreras-Bravo School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia
Carlos Alberto Calderón-Ospina School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia
Carlos M. Restrepo School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia
Adrien Morel School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia
Oscar Javier Ortega-Recalde School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia
Daniel Felipe Silgado-Guzmán Department of Molecular Diagnosis, Genética Molecular de Colombia SAS, Bogotá, Colombia
Mariana Angulo-Aguado School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia *Correspondence: Mariana Angulo-Aguado, ; Dora Janeth Fonseca-Mendoza,
Dora Janeth Fonseca-Mendoza School of Medicine and Health Sciences, Center for Research in Genetics and Genomics (CIGGUR), Institute of Translational Medicine (IMT), Universidad Del Rosario, Bogotá, Colombia *Correspondence: Mariana Angulo-Aguado, ; Dora Janeth Fonseca-Mendoza,

Collapse

Patton DL, Cardenas T, Mele P, Navarro J, Sung W. CDMAP/CDVIS: context-dependent mutation analysis package and visualization software. G3 (BETHESDA, MD.) 2022;13:6887836. [PMID: 36917690 PMCID: PMC10085751 DOI: 10.1093/g3journal/jkac299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 10/17/2022] [Indexed: 12/15/2022]

Hasan AR, Lachapelle J, El-Shawa SA, Potjewyd R, Ford SA, Ness RW. Salt stress alters the spectrum of de novo mutation available to selection during experimental adaptation of Chlamydomonas reinhardtii. Evolution 2022;76:2450-2463. [PMID: 36036481 DOI: 10.1111/evo.14604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 08/12/2022] [Indexed: 01/22/2023]

Belinky F, Bykova A, Yurchenko V, Rogozin IB. No evidence for widespread positive selection on double substitutions within codons in primates and yeasts. Front Genet 2022;13:991249. [PMID: 36159983 PMCID: PMC9500374 DOI: 10.3389/fgene.2022.991249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open

Abstract

Nucleotide substitutions in protein-coding genes can be divided into synonymous (S) and non-synonymous (N) ones that alter amino acids (including nonsense mutations causing stop codons). The S substitutions are expected to have little effect on function. The N substitutions almost always are affected by strong purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases can modulate the deleterious effect of single N substitutions and, thus, could be subjected to the positive selection. This effect has been demonstrated for mutations in the serine codons, stop codons and double N substitutions in prokaryotes. In all abovementioned cases, a novel technique was applied that allows elucidating the effects of selection on double substitutions considering mutational biases. Here, we applied the same technique to study double N substitutions in eukaryotic lineages of primates and yeast. We identified markedly fewer cases of purifying selection relative to prokaryotes and no evidence of codon double substitutions under positive selection. This is consistent with previous studies of serine codons in primates and yeast. In general, the obtained results strongly suggest that there are major differences between studied pro- and eukaryotes; double substitutions in primates and yeasts largely reflect mutational biases and are not hallmarks of selection. This is especially important in the context of detection of positive selection in codons because it has been suggested that multiple mutations in codons cause false inferences of lineage-specific site positive selection. It is likely that this concern is applicable to previously studied prokaryotes but not to primates and yeasts where markedly fewer double substitutions are affected by positive selection.

Collapse

Löytynoja A. Thousands of human mutation clusters are explained by short-range template switching. Genome Res 2022;32:1437-1447. [PMID: 35760560 PMCID: PMC9435742 DOI: 10.1101/gr.276478.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 06/21/2022] [Indexed: 02/03/2023]

Matsen FA, Ralph PL. Enabling Inference for Context-Dependent Models of Mutation by Bounding the Propagation of Dependency. J Comput Biol 2022;29:802-824. [PMID: 35776513 PMCID: PMC9419934 DOI: 10.1089/cmb.2021.0644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Chen DS, Clark AG, Wolfner MF. Octopaminergic/tyraminergic Tdc2 neurons regulate biased sperm usage in female Drosophila melanogaster. Genetics 2022;221:6613932. [PMID: 35736370 DOI: 10.1093/genetics/iyac097] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/04/2022] [Indexed: 11/14/2022] Open

Póti Á, Szikriszt B, Gervai JZ, Chen D, Szüts D. Characterisation of the spectrum and genetic dependence of collateral mutations induced by translesion DNA synthesis. PLoS Genet 2022;18:e1010051. [PMID: 35130276 PMCID: PMC8870599 DOI: 10.1371/journal.pgen.1010051] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 02/24/2022] [Accepted: 01/21/2022] [Indexed: 11/18/2022] Open

Abstract

Translesion DNA synthesis (TLS) is a fundamental damage bypass pathway that utilises specialised polymerases with relaxed template specificity to achieve replication through damaged DNA. Misinsertions by low fidelity TLS polymerases may introduce additional mutations on undamaged DNA near the original lesion site, which we termed collateral mutations. In this study, we used whole genome sequencing datasets of chicken DT40 and several human cell lines to obtain evidence for collateral mutagenesis in higher eukaryotes. We found that cisplatin and UVC radiation frequently induce close mutation pairs within 25 base pairs that consist of an adduct-associated primary and a downstream collateral mutation, and genetically linked their formation to TLS activity involving PCNA ubiquitylation and polymerase κ. PCNA ubiquitylation was also indispensable for close mutation pairs observed amongst spontaneously arising base substitutions in cell lines with disrupted homologous recombination. Collateral mutation pairs were also found in melanoma genomes with evidence of UV exposure. We showed that collateral mutations frequently copy the upstream base, and extracted a base substitution signature that describes collateral mutagenesis in the presented dataset regardless of the primary mutagenic process. Using this mutation signature, we showed that collateral mutagenesis creates approximately 10–20% of non-paired substitutions as well, underscoring the importance of the process.

DNA base substitutions are the most common form of genomic mutations, formed both spontaneously and in response to environmental mutagens. One of the main mechanisms of base substitution mutagenesis is translesion synthesis, a process that relies on specialised DNA polymerases to replicate damaged DNA templates. In addition to incorrect base insertions at the site of lesions in the template, translesion polymerases may also generate ‘collateral’ mutations away from the lesion due to their lower accuracy in selecting the correct incoming nucleotide. In this study, we surveyed the whole genome sequence of experimental cell clones to examine the extent and genetic dependence of collateral mutagenesis in higher eukaryotes. Looking for close mutation pairs, we found that collateral mutations frequently occur near primary lesions generated by cisplatin or ultraviolet radiation in chicken and human cells, but are restricted to a short distance of approximately 25 base pairs. By analysing their sequence context, we showed that collateral mutations can also occur near correctly bypassed primary lesions and may be responsible for a considerable proportion of all base substitution mutations.

Collapse

Yao Y, Sun K, Yang Q, Zhou Z, Shao C, Qian X, Tang Q, Xie J. Assessing Autosomal InDel Loci With Multiple Insertions or Deletions of Random DNA Sequences in Human Genome. Front Genet 2022;12:809815. [PMID: 35178073 PMCID: PMC8844376 DOI: 10.3389/fgene.2021.809815] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/27/2021] [Indexed: 11/13/2022] Open

Sepúlveda-Yáñez JH, Alvarez Saravia D, Pilzecker B, van Schouwenburg PA, van den Burg M, Veelken H, Navarrete MA, Jacobs H, Koning MT. Tandem Substitutions in Somatic Hypermutation. Front Immunol 2022;12:807015. [PMID: 35069591 PMCID: PMC8781386 DOI: 10.3389/fimmu.2021.807015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/16/2021] [Indexed: 11/13/2022] Open

Lu K, Hsiao YC, Liu CW, Schoeny R, Gentry R, Starr TB. A Review of Stable Isotope Labeling and Mass Spectrometry Methods to Distinguish Exogenous from Endogenous DNA Adducts and Improve Dose-Response Assessments. Chem Res Toxicol 2021;35:7-29. [PMID: 34910474 DOI: 10.1021/acs.chemrestox.1c00212] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zverinova S, Guryev V. Variant calling: Considerations, practices, and developments. Hum Mutat 2021;43:976-985. [PMID: 34882898 PMCID: PMC9545713 DOI: 10.1002/humu.24311] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 11/02/2021] [Accepted: 12/03/2021] [Indexed: 11/10/2022]

Protein innovation through template switching in the Saccharomyces cerevisiae lineage. Sci Rep 2021;11:22558. [PMID: 34799587 PMCID: PMC8604942 DOI: 10.1038/s41598-021-01736-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 10/27/2021] [Indexed: 11/08/2022] Open

Jiang P, Ollodart AR, Sudhesh V, Herr AJ, Dunham MJ, Harris K. A modified fluctuation assay reveals a natural mutator phenotype that drives mutation spectrum variation within Saccharomyces cerevisiae. eLife 2021;10:68285. [PMID: 34523420 PMCID: PMC8497059 DOI: 10.7554/elife.68285] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 09/14/2021] [Indexed: 12/23/2022] Open

Bohutínská M, Handrick V, Yant L, Schmickl R, Kolář F, Bomblies K, Paajanen P. De Novo Mutation and Rapid Protein (Co-)evolution during Meiotic Adaptation in Arabidopsis arenosa. Mol Biol Evol 2021;38:1980-1994. [PMID: 33502506 PMCID: PMC8097281 DOI: 10.1093/molbev/msab001] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Norn C, André I, Theobald DL. A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices. Protein Sci 2021;30:2057-2068. [PMID: 34218472 PMCID: PMC8442976 DOI: 10.1002/pro.4155] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 12/30/2022]

Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes. PLoS One 2021;16:e0248337. [PMID: 33711070 PMCID: PMC7954308 DOI: 10.1371/journal.pone.0248337] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 02/24/2021] [Indexed: 01/03/2023] Open

Walker CR, Scally A, De Maio N, Goldman N. Short-range template switching in great ape genomes explored using pair hidden Markov models. PLoS Genet 2021;17:e1009221. [PMID: 33651813 PMCID: PMC7954356 DOI: 10.1371/journal.pgen.1009221] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 03/12/2021] [Accepted: 02/10/2021] [Indexed: 12/14/2022] Open

Abstract

Many complex genomic rearrangements arise through template switch errors, which occur in DNA replication when there is a transient polymerase switch to an alternate template nearby in three-dimensional space. While typically investigated at kilobase-to-megabase scales, the genomic and evolutionary consequences of this mutational process are not well characterised at smaller scales, where they are often interpreted as clusters of independent substitutions, insertions and deletions. Here we present an improved statistical approach using pair hidden Markov models, and use it to detect and describe short-range template switches underlying clusters of mutations in the multi-way alignment of hominid genomes. Using robust statistics derived from evolutionary genomic simulations, we show that template switch events have been widespread in the evolution of the great apes’ genomes and provide a parsimonious explanation for the presence of many complex mutation clusters in their phylogenetic context. Larger-scale mechanisms of genome rearrangement are typically associated with structural features around breakpoints, and accordingly we show that atypical patterns of secondary structure formation and DNA bending are present at the initial template switch loci. Our methods improve on previous non-probabilistic approaches for computational detection of template switch mutations, allowing the statistical significance of events to be assessed. By specifying realistic evolutionary parameters based on the genomes and taxa involved, our methods can be readily adapted to other intra- or inter-species comparisons.

DNA replication is an imperfect process which causes the mutations that give rise to genetic diversity during the evolution of genomes. While many mutations are independent, single-nucleotide substitutions or small insertions and deletions, some mutations arise as nonindependent clusters of substitutions and larger scale chromosomal rearrangements. Large-scale rearrangements (also called structural variants) in particular can have a profound impact on genome evolution and contribute to both germline and somatic disease in humans. The replication-based mechanisms underlying structural variation typically involve a polymerase switch event in which a large segment of DNA is copied using a template from an alternate location in the genome. Methods for identifying these template switch mutations lack the power to detect smaller scale rearrangements which can arise through the same replication-based pathways. Here we outline a model which can detect and assess the statistical significance of such small-scale template switches within their evolutionary context. We show that these events are widespread in the evolution of great apes and that the genomic features associated with these small-scale rearrangements are similar to those of large-scale structural variants.

Collapse

Taliun D, Harris DN, Kessler MD, Carlson J, Szpiech ZA, Torres R, Taliun SAG, Corvelo A, Gogarten SM, Kang HM, Pitsillides AN, LeFaive J, Lee SB, Tian X, Browning BL, Das S, Emde AK, Clarke WE, Loesch DP, Shetty AC, Blackwell TW, Smith AV, Wong Q, Liu X, Conomos MP, Bobo DM, Aguet F, Albert C, Alonso A, Ardlie KG, Arking DE, Aslibekyan S, Auer PL, Barnard J, Barr RG, Barwick L, Becker LC, Beer RL, Benjamin EJ, Bielak LF, Blangero J, Boehnke M, Bowden DW, Brody JA, Burchard EG, Cade BE, Casella JF, Chalazan B, Chasman DI, Chen YDI, Cho MH, Choi SH, Chung MK, Clish CB, Correa A, Curran JE, Custer B, Darbar D, Daya M, de Andrade M, DeMeo DL, Dutcher SK, Ellinor PT, Emery LS, Eng C, Fatkin D, Fingerlin T, Forer L, Fornage M, Franceschini N, Fuchsberger C, Fullerton SM, Germer S, Gladwin MT, Gottlieb DJ, Guo X, Hall ME, He J, Heard-Costa NL, Heckbert SR, Irvin MR, Johnsen JM, Johnson AD, Kaplan R, Kardia SLR, Kelly T, Kelly S, Kenny EE, Kiel DP, Klemmer R, Konkle BA, Kooperberg C, Köttgen A, Lange LA, Lasky-Su J, Levy D, Lin X, Lin KH, Liu C, Loos RJF, Garman L, Gerszten R, Lubitz SA, Lunetta KL, Mak ACY, Manichaikul A, Manning AK, Mathias RA, McManus DD, McGarvey ST, Meigs JB, Meyers DA, Mikulla JL, Minear MA, Mitchell BD, Mohanty S, Montasser ME, Montgomery C, Morrison AC, Murabito JM, Natale A, Natarajan P, Nelson SC, North KE, O'Connell JR, Palmer ND, Pankratz N, Peloso GM, Peyser PA, Pleiness J, Post WS, Psaty BM, Rao DC, Redline S, Reiner AP, Roden D, Rotter JI, Ruczinski I, Sarnowski C, Schoenherr S, Schwartz DA, Seo JS, Seshadri S, Sheehan VA, Sheu WH, Shoemaker MB, Smith NL, Smith JA, Sotoodehnia N, Stilp AM, Tang W, Taylor KD, Telen M, Thornton TA, Tracy RP, Van Den Berg DJ, Vasan RS, Viaud-Martinez KA, Vrieze S, Weeks DE, Weir BS, Weiss ST, Weng LC, Willer CJ, Zhang Y, Zhao X, Arnett DK, Ashley-Koch AE, Barnes KC, Boerwinkle E, Gabriel S, Gibbs R, Rice KM, Rich SS, Silverman EK, Qasba P, Gan W, Papanicolaou GJ, Nickerson DA, Browning SR, Zody MC, Zöllner S, Wilson JG, Cupples LA, Laurie CC, Jaquish CE, Hernandez RD, O'Connor TD, Abecasis GR. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 2021;590:290-299. [PMID: 33568819 PMCID: PMC7875770 DOI: 10.1038/s41586-021-03205-y] [Citation(s) in RCA: 965] [Impact Index Per Article: 321.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 01/07/2021] [Indexed: 02/08/2023]

Affiliation(s)

Daniel Taliun Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Daniel N Harris Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Michael D Kessler Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Jedidiah Carlson Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA Department of Genome Sciences, University of Washington, Seattle, WA, USA
Zachary A Szpiech Department of Biology, Pennsylvania State University, University Park, PA, USA Institute for Computational and Data Sciences, Pennsylvania State University, University Park, PA, USA
Raul Torres Biomedical Sciences Graduate Program, University of California, San Francisco, San Francisco, CA, USA
Sarah A Gagliano Taliun Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
André Corvelo New York Genome Center, New York, NY, USA
Stephanie M Gogarten Department of Biostatistics, University of Washington, Seattle, WA, USA
Hyun Min Kang Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Achilleas N Pitsillides Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Jonathon LeFaive Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Seung-Been Lee Department of Genome Sciences, University of Washington, Seattle, WA, USA
Xiaowen Tian Department of Biostatistics, University of Washington, Seattle, WA, USA
Brian L Browning Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA, USA
Sayantan Das Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Anne-Katrin Emde New York Genome Center, New York, NY, USA
Wayne E Clarke New York Genome Center, New York, NY, USA
Douglas P Loesch Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Amol C Shetty Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Thomas W Blackwell Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Albert V Smith Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Quenna Wong Department of Biostatistics, University of Washington, Seattle, WA, USA
Xiaoming Liu USF Genomics, College of Public Health, University of South Florida, Tampa, FL, USA
Matthew P Conomos Department of Biostatistics, University of Washington, Seattle, WA, USA
Dean M Bobo Icahn School of Medicine at Mount Sinai, New York, NY, USA
François Aguet The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Christine Albert Massachusetts General Hospital, Boston, MA, USA
Alvaro Alonso Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Kristin G Ardlie The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Dan E Arking McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Stella Aslibekyan University of Alabama, Birmingham, AL, USA
Paul L Auer Zilber School of Public Health, University of Wisconsin Milwaukee, Milwaukee, WI, USA
John Barnard Cleveland Clinic, Cleveland, OH, USA
R Graham Barr Department of Medicine, Columbia University Medical Center, New York, NY, USA Department of Epidemiology, Columbia University Medical Center, New York, NY, USA
Lucas Barwick The Emmes Corporation, Rockville, MD, USA
Lewis C Becker Johns Hopkins University, Baltimore, MD, USA
Rebecca L Beer National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Emelia J Benjamin Department of Medicine, Boston University School of Medicine, Boston, MA, USA Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA Framingham Heart Study, Framingham, MA, USA
Lawrence F Bielak Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI, USA
John Blangero Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
Michael Boehnke Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Donald W Bowden Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
Jennifer A Brody Department of Medicine, University of Washington, Seattle, WA, USA Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
Esteban G Burchard Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA Department of Medicine, University of California, San Francisco, San Francisco, CA, USA
Brian E Cade Department of Medicine, Harvard Medical School, Boston, MA, USA Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
James F Casella Department of Pediatrics, Johns Hopkins University, Baltimore, MD, USA Division of Pediatric Hematology, Johns Hopkins University, Baltimore, MD, USA
Brandon Chalazan Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
Daniel I Chasman Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA Harvard Medical School, Boston, MA, USA
Yii-Der Ida Chen The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA, USA
Michael H Cho Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
Seung Hoan Choi The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Mina K Chung Department of Cardiovascular Medicine, Heart & Vascular Institute, Cleveland Clinic, Cleveland, OH, USA Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
Clary B Clish Metabolomics Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Adolfo Correa Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA Department of Pediatrics, University of Mississippi Medical Center, Jackson, MS, USA Department of Population Health Science, University of Mississippi Medical Center, Jackson, MS, USA
Joanne E Curran Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
Brian Custer Vitalant Research Institute, San Francisco, CA, USA Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, USA
Dawood Darbar Department of Medicine, University of Illinois at Chicago, Chicago, IL, USA
Michelle Daya Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Mariza de Andrade Mayo Clinic, Rochester, MN, USA
Dawn L DeMeo Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
Susan K Dutcher McDonnell Genome Institute, Washington University, St Louis, MO, USA Department of Genetics, Washington University, St Louis, MO, USA
Patrick T Ellinor Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Leslie S Emery Department of Biostatistics, University of Washington, Seattle, WA, USA
Celeste Eng Department of Medicine, University of California, San Francisco, San Francisco, CA, USA
Diane Fatkin Molecular Cardiology Division, Victor Chang Cardiac Research Institute, Darlinghurst, New South Wales, Australia Faculty of Medicine, University of New South Wales, Kensington, New South Wales, Australia Cardiology Department, St Vincent's Hospital, Darlinghurst, New South Wales, Australia
Tasha Fingerlin National Jewish Health, Center for Genes, Environment and Health, Denver, CO, USA
Lukas Forer Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
Myriam Fornage Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, TX, USA
Nora Franceschini Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA
Christian Fuchsberger Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria Institute for Biomedicine, Eurac Research, Bolzano, Italy
Stephanie M Fullerton Department of Bioethics & Humanities, University of Washington School of Medicine, Seattle, WA, USA
Soren Germer New York Genome Center, New York, NY, USA
Mark T Gladwin Pittsburgh Heart, Lung, Blood and Vascular Medicine Institute, University of Pittsburgh, Pittsburgh, PA, USA Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Daniel J Gottlieb VA Boston Healthcare System, Boston, MA, USA Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
Xiuqing Guo The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA, USA
Michael E Hall Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
Jiang He Department of Epidemiology, Tulane University, New Orleans, LA, USA Tulane University Translational Science Institute, Tulane University, New Orleans, LA, USA
Nancy L Heard-Costa Framingham Heart Study, Framingham, MA, USA Department of Neurology, Boston University School of Medicine, Boston, MA, USA
Susan R Heckbert Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA Department of Epidemiology, University of Washington, Seattle, WA, USA
Marguerite R Irvin Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AL, USA
Jill M Johnsen Department of Medicine, University of Washington, Seattle, WA, USA Bloodworks Northwest Research Institute, Seattle, WA, USA
Andrew D Johnson Framingham Heart Study, Framingham, MA, USA Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Framingham, MA, USA
Robert Kaplan Albert Einstein College of Medicine, New York, NY, USA
Sharon L R Kardia Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI, USA
Tanika Kelly Department of Epidemiology, Tulane University, New Orleans, LA, USA
Shannon Kelly Department of Epidemiology, Vitalant Research Institute, San Francisco, CA, USA Department of Pediatrics, UCSF Benioff Children's Hospital, Oakland, CA, USA Division of Pediatric Hematology, UCSF Benioff Children's Hospital, Oakland, CA, USA
Eimear E Kenny Icahn School of Medicine at Mount Sinai, New York, NY, USA
Douglas P Kiel The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Hinda and Arthur Marcus Institute for Aging Research, Hebrew SeniorLife, Boston, MA, USA Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Robert Klemmer Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Barbara A Konkle Department of Medicine, University of Washington, Seattle, WA, USA Bloodworks Northwest Research Institute, Seattle, WA, USA
Charles Kooperberg Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Anna Köttgen Department of Epidemiology, Johns Hopkins University, Baltimore, MD, USA Institute of Genetic Epidemiology, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
Leslie A Lange Department of Medicine, University of Colorado at Denver, Aurora, CO, USA
Jessica Lasky-Su Department of Medicine, Harvard Medical School, Boston, MA, USA Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA Brigham and Women's Hospital, Boston, MA, USA
Daniel Levy Department of Medicine, Boston University School of Medicine, Boston, MA, USA Framingham Heart Study, Framingham, MA, USA Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Framingham, MA, USA
Xihong Lin Biostatistics and Statistics, Harvard University, Boston, MA, USA
Keng-Han Lin Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Chunyu Liu Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Ruth J F Loos The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Lori Garman Department of Genes and Human Disease, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
Robert Gerszten Beth Israel Deaconess Medical Center, Boston, MA, USA
Steven A Lubitz Massachusetts General Hospital, Boston, MA, USA
Kathryn L Lunetta Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Angel C Y Mak Department of Medicine, University of California, San Francisco, San Francisco, CA, USA
Ani Manichaikul Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Alisa K Manning Department of Medicine, Harvard Medical School, Boston, MA, USA Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA, USA Metabolism Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Rasika A Mathias Department of Medicine, Johns Hopkins University, Baltimore, MD, USA
David D McManus Cardiovascular Medicine, University of Massachusetts Medical School, Worcester, MA, USA
Stephen T McGarvey International Health Institute, Brown University, Providence, RI, USA Department of Epidemiology, Brown University, Providence, RI, USA Department of Anthropology, Brown University, Providence, RI, USA
James B Meigs Division of General Internal Medicine, Massachusetts General Hospital, Harvard Medical School, The Broad Institute of MIT and Harvard, Boston, MA, USA
Deborah A Meyers University of Arizona, Tucson, AZ, USA
Julie L Mikulla National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Mollie A Minear National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Braxton D Mitchell Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Geriatrics Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD, USA
Sanghamitra Mohanty Texas Cardiac Arrhythmia Institute, St David's Medical Center, Austin, TX, USA Department of Internal Medicine, Dell Medical School, Austin, TX, USA
May E Montasser Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Courtney Montgomery Department of Genes and Human Disease, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
Alanna C Morrison Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, USA
Joanne M Murabito Department of Medicine, Boston University School of Medicine, Boston, MA, USA
Andrea Natale Texas Cardiac Arrhythmia Institute, St David's Medical Center, Austin, TX, USA
Pradeep Natarajan Department of Medicine, Harvard Medical School, Boston, MA, USA Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Sarah C Nelson Department of Biostatistics, University of Washington, Seattle, WA, USA
Kari E North Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA
Jeffrey R O'Connell Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Nicholette D Palmer Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
Nathan Pankratz Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, USA
Gina M Peloso Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Patricia A Peyser Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI, USA
Jacob Pleiness Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Wendy S Post Division of Cardiology, Department of Medicine, Johns Hopkins University, Baltimore, MD, USA
Bruce M Psaty Department of Medicine, University of Washington, Seattle, WA, USA Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA Department of Epidemiology, University of Washington, Seattle, WA, USA Department of Health Services, University of Washington, Seattle, WA, USA Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
D C Rao Division of Biostatistics, Washington University in St Louis, St Louis, MO, USA
Susan Redline Department of Medicine, Harvard Medical School, Boston, MA, USA Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
Alexander P Reiner Department of Epidemiology, University of Washington, Seattle, WA, USA Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Dan Roden Vanderbilt University Medical Center, Nashville, TN, USA
Jerome I Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA, USA
Ingo Ruczinski Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Chloé Sarnowski Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Sebastian Schoenherr Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
David A Schwartz University of Colorado at Denver, Denver, CO, USA
Jeong-Sun Seo Precision Medicine Center, Seoul National University Bundang Hospital, Seongnam, Republic of Korea Macrogen Inc, Seoul, Republic of Korea Gong Wu Genomic Medicine Institute, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
Sudha Seshadri Framingham Heart Study, Framingham, MA, USA Glenn Biggs Institute for Alzheimer's and Neurodegenerative Diseases, University of Texas Health Sciences Center at San Antonio, San Antonio, TX, USA
Vivien A Sheehan Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, USA Aflac Cancer and Blood Disorders Center, Children's Healthcare of Atlanta, Atlanta, GA, USA
Wayne H Sheu Taichung Veterans General Hospital Taiwan, Taichung City, Taiwan
M Benjamin Shoemaker Vanderbilt University Medical Center, Nashville, TN, USA
Nicholas L Smith Department of Epidemiology, University of Washington, Seattle, WA, USA Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA, USA
Jennifer A Smith Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI, USA Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Nona Sotoodehnia Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
Adrienne M Stilp Department of Biostatistics, University of Washington, Seattle, WA, USA
Weihong Tang Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Kent D Taylor The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA, USA
Marilyn Telen Duke University, Durham, NC, USA
Timothy A Thornton Department of Biostatistics, University of Washington, Seattle, WA, USA
Russell P Tracy Department of Pathology & Laboratory Medicine, University of Vermont Larner College of Medicine, Burlington, VT, USA
David J Van Den Berg Center for Genetic Epidemiology, Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USA
Ramachandran S Vasan Department of Medicine, Boston University School of Medicine, Boston, MA, USA Framingham Heart Study, Framingham, MA, USA
Karine A Viaud-Martinez Illumina Laboratory Services, Illumina Inc, San Diego, CA, USA
Scott Vrieze Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Daniel E Weeks Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, USA Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, USA
Bruce S Weir Department of Biostatistics, University of Washington, Seattle, WA, USA
Scott T Weiss Department of Medicine, Harvard Medical School, Boston, MA, USA Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA Brigham and Women's Hospital, Boston, MA, USA
Lu-Chen Weng Massachusetts General Hospital, Boston, MA, USA
Cristen J Willer Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA Department of Internal Medicine-Cardiology, University of Michigan, Ann Arbor, MI, USA Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
Yingze Zhang Pittsburgh Heart, Lung, Blood and Vascular Medicine Institute, University of Pittsburgh, Pittsburgh, PA, USA Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Xutong Zhao Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
Donna K Arnett Department of Epidemiology, University of Kentucky, Lexington, KY, USA
Allison E Ashley-Koch Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC, USA
Kathleen C Barnes Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Eric Boerwinkle University of Texas Health Science Center at Houston, Houston, TX, USA Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
Stacey Gabriel The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Richard Gibbs Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
Kenneth M Rice Department of Biostatistics, University of Washington, Seattle, WA, USA
Stephen S Rich Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Edwin K Silverman Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
Pankaj Qasba National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Weiniu Gan National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
George J Papanicolaou National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Deborah A Nickerson Department of Genome Sciences, University of Washington, Seattle, WA, USA Northwest Genomics Center, Seattle, WA, USA Brotman Baty Institute, Seattle, WA, USA
Sharon R Browning Department of Biostatistics, University of Washington, Seattle, WA, USA
Michael C Zody New York Genome Center, New York, NY, USA
Sebastian Zöllner Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
James G Wilson Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, USA
L Adrienne Cupples Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA. Framingham Heart Study, Framingham, MA, USA.
Cathy C Laurie Department of Biostatistics, University of Washington, Seattle, WA, USA.
Cashell E Jaquish National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA.
Ryan D Hernandez Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA. Department of Human Genetics, McGill University, Montreal, Quebec, Canada. Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA, USA. Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA. Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
Timothy D O'Connor Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA. Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA. Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA.
Gonçalo R Abecasis Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA.

Collapse

Jones CT, Youssef N, Susko E, Bielawski JP. A Phenotype-Genotype Codon Model for Detecting Adaptive Evolution. Syst Biol 2021;69:722-738. [PMID: 31730199 DOI: 10.1093/sysbio/syz075] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 11/09/2019] [Accepted: 11/11/2019] [Indexed: 01/03/2023] Open

Abstract

A central objective in biology is to link adaptive evolution in a gene to structural and/or functional phenotypic novelties. Yet most analytic methods make inferences mainly from either phenotypic data or genetic data alone. A small number of models have been developed to infer correlations between the rate of molecular evolution and changes in a discrete or continuous life history trait. But such correlations are not necessarily evidence of adaptation. Here, we present a novel approach called the phenotype-genotype branch-site model (PG-BSM) designed to detect evidence of adaptive codon evolution associated with discrete-state phenotype evolution. An episode of adaptation is inferred under standard codon substitution models when there is evidence of positive selection in the form of an elevation in the nonsynonymous-to-synonymous rate ratio $\omega$ to a value $\omega > 1$. As it is becoming increasingly clear that $\omega > 1$ can occur without adaptation, the PG-BSM was formulated to infer an instance of adaptive evolution without appealing to evidence of positive selection. The null model makes use of a covarion-like component to account for general heterotachy (i.e., random changes in the evolutionary rate at a site over time). The alternative model employs samples of the phenotypic evolutionary history to test for phenomenological patterns of heterotachy consistent with specific mechanisms of molecular adaptation. These include 1) a persistent increase/decrease in $\omega$ at a site following a change in phenotype (the pattern) consistent with an increase/decrease in the functional importance of the site (the mechanism); and 2) a transient increase in $\omega$ at a site along a branch over which the phenotype changed (the pattern) consistent with a change in the site's optimal amino acid (the mechanism). Rejection of the null is followed by post hoc analyses to identify sites with strongest evidence for adaptation in association with changes in the phenotype as well as the most likely evolutionary history of the phenotype. Simulation studies based on a novel method for generating mechanistically realistic signatures of molecular adaptation show that the PG-BSM has good statistical properties. Analyses of real alignments show that site patterns identified post hoc are consistent with the specific mechanisms of adaptation included in the alternate model. Further simulation studies show that the covarion-like component of the PG-BSM plays a crucial role in mitigating recently discovered statistical pathologies associated with confounding by accounting for heterotachy-by-any-cause. [Adaptive evolution; branch-site model; confounding; mutation-selection; phenotype-genotype.].

Collapse

Srivastava K, Doescher A, Wagner FF, Flegel WA. NG_007494.1(RHD):c.[4A>T;5G>C;6_7insG] with an RhD-negative phenotype. Transfusion 2020;60:E45-E47. [PMID: 33043462 DOI: 10.1111/trf.16115] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 07/17/2020] [Accepted: 07/19/2020] [Indexed: 12/01/2022]

Mas-Ponte D, Supek F. DNA mismatch repair promotes APOBEC3-mediated diffuse hypermutation in human cancers. Nat Genet 2020;52:958-968. [PMID: 32747826 PMCID: PMC7610516 DOI: 10.1038/s41588-020-0674-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 06/30/2020] [Indexed: 01/12/2023]

Wang Q, Pierce-Hoffman E, Cummings BB, Alföldi J, Francioli LC, Gauthier LD, Hill AJ, O'Donnell-Luria AH, Karczewski KJ, MacArthur DG. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes. Nat Commun 2020;11:2539. [PMID: 32461613 PMCID: PMC7253413 DOI: 10.1038/s41467-019-12438-5] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open

Affiliation(s)

Qingbo Wang Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
Emma Pierce-Hoffman Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Beryl B Cummings Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA Program in Biomedical and Biological Sciences, Harvard Medical School, Boston, MA, 02115, USA
Jessica Alföldi Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Laurent C Francioli Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Laura D Gauthier Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Andrew J Hill Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
Anne H O'Donnell-Luria Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Konrad J Karczewski Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Daniel G MacArthur Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA. Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia. Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.

Collapse

Li C, Luscombe NM. Nucleosome positioning stability is a modulator of germline mutation rate variation across the human genome. Nat Commun 2020;11:1363. [PMID: 32170069 PMCID: PMC7070026 DOI: 10.1038/s41467-020-15185-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 02/23/2020] [Indexed: 02/08/2023] Open

The Tempo and Mode of Angiosperm Mitochondrial Genome Divergence Inferred from Intraspecific Variation in Arabidopsis thaliana. G3-GENES GENOMES GENETICS 2020;10:1077-1086. [PMID: 31964685 PMCID: PMC7056966 DOI: 10.1534/g3.119.401023] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Satoh Y, Asakawa JI, Nishimura M, Kuo T, Shinkai N, Cullings HM, Minakuchi Y, Sese J, Toyoda A, Shimada Y, Nakamura N, Uchimura A. Characteristics of induced mutations in offspring derived from irradiated mouse spermatogonia and mature oocytes. Sci Rep 2020;10:37. [PMID: 31913321 PMCID: PMC6949229 DOI: 10.1038/s41598-019-56881-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 12/18/2019] [Indexed: 01/07/2023] Open

Affiliation(s)

Yasunari Satoh Department of Molecular Biosciences, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan.
Jun-Ichi Asakawa Department of Molecular Biosciences, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan
Mayumi Nishimura Department of Radiation Effects Research, National Institute of Radiological Sciences (NIRS), National Institutes for Quantum and Radiological Science and Technology (QST), Chiba, 263-8555, Japan
Tony Kuo Artificial Intelligence Research Center, AIST, 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan.,Real World Big-Data Computation Open Innovation Laboratory, AIST-Tokyo Tech, 2-12-1 Okayama, Meguro-ku, Tokyo, 152-8550, Japan
Norio Shinkai Artificial Intelligence Research Center, AIST, 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan
Harry M Cullings Department of Statistics, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan
Yohei Minakuchi Comparative Genomics Laboratory, National Institute of Genetics, Mishima, 411-8540, Japan
Jun Sese Artificial Intelligence Research Center, AIST, 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan.,Real World Big-Data Computation Open Innovation Laboratory, AIST-Tokyo Tech, 2-12-1 Okayama, Meguro-ku, Tokyo, 152-8550, Japan.,Humanome Lab, Inc., L-HUB 3F, 1-4, Shumomiyabi-cho, Sinjuku-ku, Tokyo, 162-0822, Japan
Atsushi Toyoda Comparative Genomics Laboratory, National Institute of Genetics, Mishima, 411-8540, Japan
Yoshiya Shimada Department of Radiological Sciences, Graduate School of Human Health Sciences, Tokyo Metropolitan University, Tokyo, 116-8551, Japan.,Executive Director, QST, Chiba, 263-8555, Japan
Nori Nakamura Department of Molecular Biosciences, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan
Arikuni Uchimura Department of Molecular Biosciences, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan.

Collapse

Belinky F, Sela I, Rogozin IB, Koonin EV. Crossing fitness valleys via double substitutions within codons. BMC Biol 2019;17:105. [PMID: 31842858 PMCID: PMC6916188 DOI: 10.1186/s12915-019-0727-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 11/20/2019] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

Single nucleotide substitutions in protein-coding genes can be divided into synonymous (S), with little fitness effect, and non-synonymous (N) ones that alter amino acids and thus generally have a greater effect. Most of the N substitutions are affected by purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases potentially could alleviate the deleterious effect of single substitutions, making them subject to positive selection. To elucidate the effects of selection on double substitutions in all codons, it is critical to differentiate selection from mutational biases.

RESULTS

We addressed the evolutionary regimes of within-codon double substitutions in 37 groups of closely related prokaryotic genomes from diverse phyla by comparing the fractions of double substitutions within codons to those of the equivalent double S substitutions in adjacent codons. Under the assumption that substitutions occur one at a time, all within-codon double substitutions can be represented as "ancestral-intermediate-final" sequences (where "intermediate" refers to the first single substitution and "final" refers to the second substitution) and can be partitioned into four classes: (1) SS, S intermediate-S final; (2) SN, S intermediate-N final; (3) NS, N intermediate-S final; and (4) NN, N intermediate-N final. We found that the selective pressure on the second substitution markedly differs among these classes of double substitutions. Analogous to single S (synonymous) substitutions, SS double substitutions evolve neutrally, whereas analogous to single N (non-synonymous) substitutions, SN double substitutions are subject to purifying selection. In contrast, NS show positive selection on the second step because the original amino acid is recovered. The NN double substitutions are heterogeneous and can be subject to either purifying or positive selection, or evolve neutrally, depending on the amino acid similarity between the final or intermediate and the ancestral states.

CONCLUSIONS

The results of the present, comprehensive analysis of the evolutionary landscape of within-codon double substitutions reaffirm the largely conservative regime of protein evolution. However, the second step of a double substitution can be subject to positive selection when the first step is deleterious. Such positive selection can result in frequent crossing of valleys on the fitness landscape.

Collapse

Gagunashvili AN, Ocaka L, Kelberman D, Munot P, Bacchelli C, Beales PL, Ganesan V. Novel missense variants in the RNF213 gene from a European family with Moyamoya disease. Hum Genome Var 2019;6:35. [PMID: 31645973 PMCID: PMC6804521 DOI: 10.1038/s41439-019-0066-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/06/2019] [Accepted: 06/21/2019] [Indexed: 01/30/2023] Open

Kaplanis J, Akawi N, Gallone G, McRae JF, Prigmore E, Wright CF, Fitzpatrick DR, Firth HV, Barrett JC, Hurles ME. Exome-wide assessment of the functional impact and pathogenicity of multinucleotide mutations. Genome Res 2019;29:1047-1056. [PMID: 31227601 PMCID: PMC6633265 DOI: 10.1101/gr.239756.118] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 05/24/2019] [Indexed: 01/25/2023]

Prendergast JGD, Pugh C, Harris SE, Hume DA, Deary IJ, Beveridge A. Linked Mutations at Adjacent Nucleotides Have Shaped Human Population Differentiation and Protein Evolution. Genome Biol Evol 2019;11:759-775. [PMID: 30689878 PMCID: PMC6424222 DOI: 10.1093/gbe/evz014] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/18/2019] [Indexed: 02/06/2023] Open

Dunn KA, Kenney T, Gu H, Bielawski JP. Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates. BMC Evol Biol 2019;19:22. [PMID: 30642241 PMCID: PMC6332903 DOI: 10.1186/s12862-018-1326-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/11/2018] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

An excess of nonsynonymous substitutions, over neutrality, is considered evidence of positive Darwinian selection. Inference for proteins often relies on estimation of the nonsynonymous to synonymous ratio (ω = dN/dS) within a codon model. However, to ease computational difficulties, ω is typically estimated assuming an idealized substitution process where (i) all nonsynonymous substitutions have the same rate (regardless of impact on organism fitness) and (ii) instantaneous double and triple (DT) nucleotide mutations have zero probability (despite evidence that they can occur). It follows that estimates of ω represent an imperfect summary of the intensity of selection, and that tests based on the ω > 1 threshold could be negatively impacted.

RESULTS

We developed a general-purpose parametric (GPP) modelling framework for codons. This novel approach allows specification of all possible instantaneous codon substitutions, including multiple nonsynonymous rates (MNRs) and instantaneous DT nucleotide changes. Existing codon models are specified as special cases of the GPP model. We use GPP models to implement likelihood ratio tests for ω > 1 that accommodate MNRs and DT mutations. Through both simulation and real data analysis, we find that failure to model MNRs and DT mutations reduces power in some cases and inflates false positives in others. False positives under traditional M2a and M8 models were very sensitive to DT changes. This was exacerbated by the choice of frequency parameterization (GY vs. MG), with rates sometimes > 90% under MG. By including MNRs and DT mutations, accuracy and power was greatly improved under the GPP framework. However, we also find that over-parameterized models can perform less well, and this can contribute to degraded performance of LRTs.

CONCLUSIONS

We suggest GPP models should be used alongside traditional codon models. Further, all codon models should be deployed within an experimental design that includes (i) assessing robustness to model assumptions, and (ii) investigation of non-standard behaviour of MLEs. As the goal of every analysis is to avoid false conclusions, more work is needed on model selection methods that consider both the increase in fit engendered by a model parameter and the degree to which that parameter is affected by un-modelled evolutionary processes.

Collapse

Looking for Darwin in Genomic Sequences: Validity and Success Depends on the Relationship Between Model and Data. Methods Mol Biol 2019;1910:399-426. [PMID: 31278672 DOI: 10.1007/978-1-4939-9074-0_13] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Abstract

Codon substitution models (CSMs) are commonly used to infer the history of natural section for a set of protein-coding sequences, often with the explicit goal of detecting the signature of positive Darwinian selection. However, the validity and success of CSMs used in conjunction with the maximum likelihood (ML) framework is sometimes challenged with claims that the approach might too often support false conclusions. In this chapter, we use a case study approach to identify four legitimate statistical difficulties associated with inference of evolutionary events using CSMs. These include: (1) model misspecification, (2) low information content, (3) the confounding of processes, and (4) phenomenological load, or PL. While past criticisms of CSMs can be connected to these issues, the historical critiques were often misdirected, or overstated, because they failed to recognize that the success of any model-based approach depends on the relationship between model and data. Here, we explore this relationship and provide a candid assessment of the limitations of CSMs to extract historical information from extant sequences. To aid in this assessment, we provide a brief overview of: (1) a more realistic way of thinking about the process of codon evolution framed in terms of population genetic parameters, and (2) a novel presentation of the ML statistical framework. We then divide the development of CSMs into two broad phases of scientific activity and show that the latter phase is characterized by increases in model complexity that can sometimes negatively impact inference of evolutionary mechanisms. Such problems are not yet widely appreciated by the users of CSMs. These problems can be avoided by using a model that is appropriate for the data; but, understanding the relationship between the data and a fitted model is a difficult task. We argue that the only way to properly understand that relationship is to perform in silico experiments using a generating process that can mimic the data as closely as possible. The mutation-selection modeling framework (MutSel) is presented as the basis of such a generating process. We contend that if complex CSMs continue to be developed for testing explicit mechanistic hypotheses, then additional analyses such as those described in here (e.g., penalized LRTs and estimation of PL) will need to be applied alongside the more traditional inferential methods.

Collapse

Multinucleotide mutations cause false inferences of lineage-specific positive selection. Nat Ecol Evol 2018;2:1280-1288. [PMID: 29967485 PMCID: PMC6093625 DOI: 10.1038/s41559-018-0584-5] [Citation(s) in RCA: 88] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 05/18/2018] [Indexed: 11/08/2022]

Rahman A, Hallgrímsdóttir I, Eisen M, Pachter L. Association mapping from sequencing reads using k-mers. eLife 2018;7:e32920. [PMID: 29897334 PMCID: PMC6044908 DOI: 10.7554/elife.32920] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 06/08/2018] [Indexed: 01/05/2023] Open

Germline de novo mutation clusters arise during oocyte aging in genomic regions with high double-strand-break incidence. Nat Genet 2018;50:487-492. [PMID: 29507425 DOI: 10.1038/s41588-018-0071-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2017] [Accepted: 01/29/2018] [Indexed: 11/08/2022]

Harris K. Reading the genome like a history book. Science 2017;358:1265. [DOI: 10.1126/science.aar2003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Assaf ZJ, Tilk S, Park J, Siegal ML, Petrov DA. Deep sequencing of natural and experimental populations of Drosophila melanogaster reveals biases in the spectrum of new mutations. Genome Res 2017;27:1988-2000. [PMID: 29079675 PMCID: PMC5741049 DOI: 10.1101/gr.219956.116] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Accepted: 10/20/2017] [Indexed: 11/25/2022]

Seitz H. Issues in current microRNA target identification methods. RNA Biol 2017;14:831-834. [PMID: 28430005 DOI: 10.1080/15476286.2017.1320469] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open

Löytynoja A, Goldman N. Short template switch events explain mutation clusters in the human genome. Genome Res 2017;27:1039-1049. [PMID: 28385709 PMCID: PMC5453318 DOI: 10.1101/gr.214973.116] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 03/28/2017] [Indexed: 01/19/2023]

Seplyarskiy VB, Andrianova MA, Bazykin GA. APOBEC3A/B-induced mutagenesis is responsible for 20% of heritable mutations in the TpCpW context. Genome Res 2016;27:175-184. [PMID: 27940951 PMCID: PMC5287224 DOI: 10.1101/gr.210336.116] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2016] [Accepted: 12/01/2016] [Indexed: 12/18/2022]

Besenbacher S, Sulem P, Helgason A, Helgason H, Kristjansson H, Jonasdottir A, Jonasdottir A, Magnusson OT, Thorsteinsdottir U, Masson G, Kong A, Gudbjartsson DF, Stefansson K. Multi-nucleotide de novo Mutations in Humans. PLoS Genet 2016;12:e1006315. [PMID: 27846220 PMCID: PMC5147774 DOI: 10.1371/journal.pgen.1006315] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Accepted: 08/22/2016] [Indexed: 01/23/2023] Open

Novembre J, Peter BM. Recent advances in the study of fine-scale population structure in humans. Curr Opin Genet Dev 2016;41:98-105. [PMID: 27662060 DOI: 10.1016/j.gde.2016.08.007] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2016] [Revised: 08/18/2016] [Accepted: 08/24/2016] [Indexed: 01/17/2023]