51
|
Zhang M, Yang Q, Ai H, Huang L. Revisiting the Evolutionary History of Pigs via De Novo Mutation Rate Estimation in A Three-generation Pedigree. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:1040-1052. [PMID: 35181533 DOI: 10.1016/j.gpb.2022.02.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 12/20/2021] [Accepted: 02/09/2022] [Indexed: 12/30/2022]
Abstract
The mutation rate used in the previous analyses of pig evolution and demographics was cursory and hence invited potential bias in inferring evolutionary history. Herein, we estimated the de novo mutation rate of pigs as 3.6 × 10-9 per base per generation using high-quality whole-genome sequencing data from nine individuals in a three-generation pedigree through stringent filtering and validation. Using this mutation rate, we re-investigated the evolutionary history of pigs. The estimated divergence time of ∼ 10 kiloyears ago (KYA) between European wild and domesticated pigs was consistent with the domestication time of European pigs based on archaeological evidence. However, other divergence events inferred here were not as ancient as previously described. Our estimates suggested that Sus speciation occurred ∼ 1.36 million years ago (MYA); European wild pigs split from Asian wild pigs only ∼ 219 KYA; and south and north Chinese wild pigs split ∼ 25 KYA. Meanwhile, our results showed that the most recent divergence event between Chinese wild and domesticated pigs occurred in the Hetao plain, North China, approximately 20 KYA, supporting the possibly independent domestication in North China along the middle Yellow River. We also found that the maximum effective population size of pigs was ∼ 6 times larger than the previous estimate. An archaic migration from other Sus species originating ∼ 2 MYA to European pigs was detected during western colonization of pigs; this interfered with the previous demographic inference. Our de novo mutation rate estimation and its consequences for demographic history inference reasonably provide a new vision regarding the evolutionary history of pigs.
Collapse
Affiliation(s)
- Mingpeng Zhang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China
| | - Qiang Yang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China
| | - Huashui Ai
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China.
| | - Lusheng Huang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China.
| |
Collapse
|
52
|
Xu J, Li L, Ren J, Zhong X, Xie C, Zheng A, Abudukadier A, Tuerxun M, Zhang S, Tang L, Hairoula D, Zou X. Whole-Exome Sequencing Implicates the USP34 rs777591A > G Intron Variant in Chronic Obstructive Pulmonary Disease in a Kashi Cohort. Front Cell Dev Biol 2022; 9:792027. [PMID: 35198563 PMCID: PMC8859106 DOI: 10.3389/fcell.2021.792027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Accepted: 12/08/2021] [Indexed: 12/17/2022] Open
Abstract
Genetic factors are important factors in chronic obstructive pulmonary disease (COPD) onset. Plenty of risk and new causative genes for COPD have been identified in patients of the Chinese Han population. In contrast, we know considerably little concerning the genetics in the Kashi COPD population (Uyghur). This study aims at clarifying the genetic maps regarding COPD susceptibility in Kashi (China). Whole-exome sequencing (WES) was used to analyze three Uyghur families with COPD in Kashi (eight patients and one healthy control). Sanger sequencing was also used to verify the WES results in 541 unrelated Uyghur COPD patients and 534 Uyghur healthy controls. WES showed 72 single nucleotide variants (SNVs), two deletions, and small insertions (InDels), 26 copy number variants (CNVs), and 34 structural variants (SVs), including g.71230620T > A (rs12449210T > A, NC_000,016.10) in the HYDIN axonemal central pair apparatus protein (HYDIN) gene and g.61190482A > G (rs777591A > G, NC_000002.12) in the ubiquitin-specific protease 34 (USP34) gene. After Sanger sequencing, we found that rs777591“AA” under different genetic models except for the dominant model (adjusted OR = 0.8559, 95%CI 0.6568–1.115, p > .05), could significantly reduce COPD risk, but rs12449210T > A was not related to COPD. In stratified analysis of smoking status, rs777591“AA” reduced COPD risk significantly among the nonsmoker group. Protein and mRNA expression of USP34 in cigarette smoke extract-treated BEAS-2b cells increased significantly compared with those in the control group. Our findings associate the USP34 rs777591“AA” genotype as a protector factor in COPD.
Collapse
Affiliation(s)
- Jingran Xu
- Department of Medical College, Shihezi University, Shihezi, China
| | - Li Li
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Jie Ren
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Xuemei Zhong
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Chengxin Xie
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Aifang Zheng
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Ayiguzali Abudukadier
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Maimaitiaili Tuerxun
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Sujie Zhang
- Department of Medical College, Shihezi University, Shihezi, China
| | - Lifeng Tang
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Dilare Hairoula
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Xiaoguang Zou
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
- *Correspondence: Xiaoguang Zou,
| |
Collapse
|
53
|
Li W, Almirantis Y, Provata A. Revisiting the neutral dynamics derived limiting guanine-cytosine content using human de novo point mutation data. Meta Gene 2022. [DOI: 10.1016/j.mgene.2021.100994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
54
|
Fine human genetic map based on UK10K data set. Hum Genet 2022; 141:273-281. [DOI: 10.1007/s00439-021-02415-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 12/03/2021] [Indexed: 11/04/2022]
|
55
|
Melamed D, Nov Y, Malik A, Yakass MB, Bolotin E, Shemer R, Hiadzi EK, Skorecki KL, Livnat A. De novo mutation rates at the single-mutation resolution in a human HBB gene-region associated with adaptation and genetic disease. Genome Res 2022; 32:488-498. [PMID: 35031571 PMCID: PMC8896469 DOI: 10.1101/gr.276103.121] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 01/10/2022] [Indexed: 11/25/2022]
Abstract
Although it is known that the mutation rate varies across the genome, previous estimates were based on averaging across various numbers of positions. Here, we describe a method to measure the origination rates of target mutations at target base positions and apply it to a 6-bp region in the human hemoglobin subunit beta (HBB) gene and to the identical, paralogous hemoglobin subunit delta (HBD) region in sperm cells from both African and European donors. The HBB region of interest (ROI) includes the site of the hemoglobin S (HbS) mutation, which protects against malaria, is common in Africa, and has served as a classic example of adaptation by random mutation and natural selection. We found a significant correspondence between de novo mutation rates and past observations of alleles in carriers, showing that mutation rates vary substantially in a mutation-specific manner that contributes to the site frequency spectrum. We also found that the overall point mutation rate is significantly higher in Africans than in Europeans in the HBB region studied. Finally, the rate of the 20A→T mutation, called the “HbS mutation” when it appears in HBB, is significantly higher than expected from the genome-wide average for this mutation type. Nine instances were observed in the African HBB ROI, where it is of adaptive significance, representing at least three independent originations; no instances were observed elsewhere. Further studies will be needed to examine mutation rates at the single-mutation resolution across these and other loci and organisms and to uncover the molecular mechanisms responsible.
Collapse
|
56
|
Bergeron LA, Besenbacher S, Turner T, Versoza CJ, Wang RJ, Price AL, Armstrong E, Riera M, Carlson J, Chen HY, Hahn MW, Harris K, Kleppe AS, López-Nandam EH, Moorjani P, Pfeifer SP, Tiley GP, Yoder AD, Zhang G, Schierup MH. The mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates. eLife 2022; 11:73577. [PMID: 35018888 PMCID: PMC8830884 DOI: 10.7554/elife.73577] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 01/11/2022] [Indexed: 11/13/2022] Open
Abstract
In the past decade, several studies have estimated the human per-generation germline mutation rate using large pedigrees. More recently, estimates for various nonhuman species have been published. However, methodological differences among studies in detecting germline mutations and estimating mutation rates make direct comparisons difficult. Here, we describe the many different steps involved in estimating pedigree-based mutation rates, including sampling, sequencing, mapping, variant calling, filtering, and appropriately accounting for false-positive and false-negative rates. For each step, we review the different methods and parameter choices that have been used in the recent literature. Additionally, we present the results from a ‘Mutationathon,’ a competition organized among five research labs to compare germline mutation rate estimates for a single pedigree of rhesus macaques. We report almost a twofold variation in the final estimated rate among groups using different post-alignment processing, calling, and filtering criteria, and provide details into the sources of variation across studies. Though the difference among estimates is not statistically significant, this discrepancy emphasizes the need for standardized methods in mutation rate estimations and the difficulty in comparing rates from different studies. Finally, this work aims to provide guidelines for computational and statistical benchmarks for future studies interested in identifying germline mutations from pedigrees.
Collapse
Affiliation(s)
- Lucie A Bergeron
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Søren Besenbacher
- Department of Molecular Medicine (MOMA), Aarhus University, Aarhus N, Denmark
| | - Tychele Turner
- Department of Genetics, Washington University in St. Louis, Saint Louis, United States
| | - Cyril J Versoza
- Center for Evolution and Medicine, Arizona State University, Tempe, United States
| | - Richard J Wang
- Department of Biology, Indiana University, Bloomington, United States
| | - Alivia Lee Price
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Ellie Armstrong
- Department of Biology, Stanford University, Stanford, United States
| | - Meritxell Riera
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| | - Jedidiah Carlson
- Department of Genome Sciences, University of Washington, Seattle, United States
| | - Hwei-Yen Chen
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, United States
| | - Kelley Harris
- Department of Genome Sciences, University of Washington, Seattle, United States
| | | | | | - Priya Moorjani
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Susanne P Pfeifer
- School of Life Sciences, Arizona State University, Tempe, United States
| | - George P Tiley
- Department of Biology, Duke University, Durham, United States
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, United States
| | - Guojie Zhang
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | |
Collapse
|
57
|
Oud MS, Smits RM, Smith HE, Mastrorosa FK, Holt GS, Houston BJ, de Vries PF, Alobaidi BKS, Batty LE, Ismail H, Greenwood J, Sheth H, Mikulasova A, Astuti GDN, Gilissen C, McEleny K, Turner H, Coxhead J, Cockell S, Braat DDM, Fleischer K, D’Hauwers KWM, Schaafsma E, Nagirnaja L, Conrad DF, Friedrich C, Kliesch S, Aston KI, Riera-Escamilla A, Krausz C, Gonzaga-Jauregui C, Santibanez-Koref M, Elliott DJ, Vissers LELM, Tüttelmann F, O’Bryan MK, Ramos L, Xavier MJ, van der Heijden GW, Veltman JA. A de novo paradigm for male infertility. Nat Commun 2022; 13:154. [PMID: 35013161 PMCID: PMC8748898 DOI: 10.1038/s41467-021-27132-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 11/02/2021] [Indexed: 12/29/2022] Open
Abstract
De novo mutations are known to play a prominent role in sporadic disorders with reduced fitness. We hypothesize that de novo mutations play an important role in severe male infertility and explain a portion of the genetic causes of this understudied disorder. To test this hypothesis, we utilize trio-based exome sequencing in a cohort of 185 infertile males and their unaffected parents. Following a systematic analysis, 29 of 145 rare (MAF < 0.1%) protein-altering de novo mutations are classified as possibly causative of the male infertility phenotype. We observed a significant enrichment of loss-of-function de novo mutations in loss-of-function-intolerant genes (p-value = 1.00 × 10-5) in infertile men compared to controls. Additionally, we detected a significant increase in predicted pathogenic de novo missense mutations affecting missense-intolerant genes (p-value = 5.01 × 10-4) in contrast to predicted benign de novo mutations. One gene we identify, RBM5, is an essential regulator of male germ cell pre-mRNA splicing and has been previously implicated in male infertility in mice. In a follow-up study, 6 rare pathogenic missense mutations affecting this gene are observed in a cohort of 2,506 infertile patients, whilst we find no such mutations in a cohort of 5,784 fertile men (p-value = 0.03). Our results provide evidence for the role of de novo mutations in severe male infertility and point to new candidate genes affecting fertility.
Collapse
Affiliation(s)
- M. S. Oud
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, The Netherlands
| | - R. M. Smits
- grid.10417.330000 0004 0444 9382Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, The Netherlands
| | - H. E. Smith
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - F. K. Mastrorosa
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - G. S. Holt
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - B. J. Houston
- grid.1008.90000 0001 2179 088XSchool of BioSciences, Faculty of Science, The University of Melbourne, Parkville, VIC Australia
| | - P. F. de Vries
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, The Netherlands
| | - B. K. S. Alobaidi
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - L. E. Batty
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - H. Ismail
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - J. Greenwood
- grid.420004.20000 0004 0444 2244Department of Genetic Medicine, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - H. Sheth
- Foundation for Research in Genetics and Endocrinology, Institute of Human Genetics, Ahmedabad, India
| | - A. Mikulasova
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - G. D. N. Astuti
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands ,grid.412032.60000 0001 0744 0787Division of Human Genetics, Center for Biomedical Research, Faculty of Medicine, Diponegoro University, Semarang, Indonesia
| | - C. Gilissen
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands
| | - K. McEleny
- grid.420004.20000 0004 0444 2244Newcastle Fertility Centre, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - H. Turner
- grid.420004.20000 0004 0444 2244Department of Cellular Pathology, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - J. Coxhead
- grid.1006.70000 0001 0462 7212Genomics Core Facility, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - S. Cockell
- Bioinformatics Support Unit, Faculty of Medical Sciences New, castle University, Newcastle upon Tyne, UK
| | - D. D. M. Braat
- grid.10417.330000 0004 0444 9382Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, The Netherlands
| | - K. Fleischer
- grid.10417.330000 0004 0444 9382Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, The Netherlands
| | - K. W. M. D’Hauwers
- grid.10417.330000 0004 0444 9382Department of Urology, Radboudumc, Nijmegen, The Netherlands
| | - E. Schaafsma
- grid.10417.330000 0004 0444 9382Department of Pathology, Radboudumc, Nijmegen, The Netherlands
| | | | - L. Nagirnaja
- grid.5288.70000 0000 9758 5690Division of Genetics, Oregon National Primate Research Center, Oregon Health & Science University, Beaverton, OR USA
| | - D. F. Conrad
- grid.5288.70000 0000 9758 5690Division of Genetics, Oregon National Primate Research Center, Oregon Health & Science University, Beaverton, OR USA
| | - C. Friedrich
- grid.5949.10000 0001 2172 9288Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | - S. Kliesch
- grid.16149.3b0000 0004 0551 4246Centre of Reproductive Medicine and Andrology, Department of Clinical and Surgical Andrology, University Hospital Münster, Münster, Germany
| | - K. I. Aston
- grid.223827.e0000 0001 2193 0096Department of Surgery, Division of Urology, University of Utah School of Medicine, Salt Lake City, UT USA
| | - A. Riera-Escamilla
- grid.418813.70000 0004 1767 1951Andrology Department, Fundació Puigvert, Universitat Autònoma de Barcelona, Instituto de Investigaciones Biomédicas Sant Pau (IIB-Sant Pau), Barcelona, Catalonia Spain
| | - C. Krausz
- grid.8404.80000 0004 1757 2304Department of Biomedical, Experimental and Clinical Sciences “Mario Serio”, University of Florence, Florence, Italy
| | - C. Gonzaga-Jauregui
- grid.418961.30000 0004 0472 2713Regeneron Genetics Center, Tarrytown, NY USA
| | - M. Santibanez-Koref
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - D. J. Elliott
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - L. E. L. M. Vissers
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, The Netherlands
| | - F. Tüttelmann
- grid.5949.10000 0001 2172 9288Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | - M. K. O’Bryan
- grid.1008.90000 0001 2179 088XSchool of BioSciences, Faculty of Science, The University of Melbourne, Parkville, VIC Australia
| | - L. Ramos
- grid.10417.330000 0004 0444 9382Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, The Netherlands
| | - M. J. Xavier
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - G. W. van der Heijden
- grid.10417.330000 0004 0444 9382Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, The Netherlands ,grid.10417.330000 0004 0444 9382Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, The Netherlands
| | - J. A. Veltman
- grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
58
|
Sepúlveda-Yáñez JH, Alvarez Saravia D, Pilzecker B, van Schouwenburg PA, van den Burg M, Veelken H, Navarrete MA, Jacobs H, Koning MT. Tandem Substitutions in Somatic Hypermutation. Front Immunol 2022; 12:807015. [PMID: 35069591 PMCID: PMC8781386 DOI: 10.3389/fimmu.2021.807015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/16/2021] [Indexed: 11/13/2022] Open
Abstract
Upon antigen recognition, activation-induced cytosine deaminase initiates affinity maturation of the B-cell receptor by somatic hypermutation (SHM) through error-prone DNA repair pathways. SHM typically creates single nucleotide substitutions, but tandem substitutions may also occur. We investigated incidence and sequence context of tandem substitutions by massive parallel sequencing of V(D)J repertoires in healthy human donors. Mutation patterns were congruent with SHM-derived single nucleotide mutations, delineating initiation of the tandem substitution by AID. Tandem substitutions comprised 5,7% of AID-induced mutations. The majority of tandem substitutions represents single nucleotide juxtalocations of directly adjacent sequences. These observations were confirmed in an independent cohort of healthy donors. We propose a model where tandem substitutions are predominantly generated by translesion synthesis across an apyramidinic site that is typically created by UNG. During replication, apyrimidinic sites transiently adapt an extruded configuration, causing skipping of the extruded base. Consequent strand decontraction leads to the juxtalocation, after which exonucleases repair the apyramidinic site and any directly adjacent mismatched base pairs. The mismatch repair pathway appears to account for the remainder of tandem substitutions. Tandem substitutions may enhance affinity maturation and expedite the adaptive immune response by overcoming amino acid codon degeneracies or mutating two adjacent amino acid residues simultaneously.
Collapse
Affiliation(s)
- Julieta H Sepúlveda-Yáñez
- Department of Hematology, Leiden University Medical Center, Leiden, Netherlands
- School of Medicine, University of Magallanes, Punta Arenas, Chile
| | | | - Bas Pilzecker
- Department of Tumor Immunology, Radboud Institute for Molecular Life Sciences, Nijmegen, Netherlands
- Division of Tumor Biology and Immunology, Netherlands Cancer Institute, Amsterdam, Netherlands
| | | | - Mirjam van den Burg
- Department of Pediatrics, Leiden University Medical Center, Leiden, Netherlands
| | - Hendrik Veelken
- Department of Hematology, Leiden University Medical Center, Leiden, Netherlands
| | | | - Heinz Jacobs
- Division of Tumor Biology and Immunology, Netherlands Cancer Institute, Amsterdam, Netherlands
| | - Marvyn T Koning
- Department of Hematology, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
59
|
Harder A. Do non-pathogenic variants of DNA mismatch repair genes modify neurofibroma load in neurofibromatosis type 1? Childs Nerv Syst 2022; 38:705-713. [PMID: 34997843 PMCID: PMC8940751 DOI: 10.1007/s00381-021-05436-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 12/13/2021] [Indexed: 01/07/2023]
Abstract
Non-pathogenic mismatch repair (MMR) gene variants can be associated with decreased MMR capacity in several settings. Due to an increased mutation rate, reduced MMR capacity leads to accumulation of somatic sequence changes in tumour suppressor genes such as in the neurofibromatosis type 1 (NF1) gene. Patients with autosomal dominant NF1 typically develop neurofibromas ranging from single to thousands. Concerning the number of neurofibromas NF1 patients face a situation that is still not predictable. A few studies suggested that germline non-pathogenic MMR gene variants modify the number of neurofibromas in NF1 and by this mechanism may promote the extent of neurofibroma manifestation. This review represents first evidence that specific non-pathogenic single nucleotide variants of MMR genes act as a modifier of neurofibroma manifestation in NF1, highlighting MSH2 re4987188 as the best analysed non-pathogenic variant so far. In summary, besides MSH2 promotor methylation, specific non-pathogenic germline MSH2 variants are associated with the extent of neurofibroma manifestation. Those variants can serve as a biomarker to facilitate better mentoring of NF1 patients at risk.
Collapse
Affiliation(s)
- Anja Harder
- Institute of Pathology, Medical Faculty, Martin Luther University Halle-Wittenberg, Halle (Saale), 06120, Germany.
- Institute of Neuropathology, University Hospital Münster, Münster, Germany.
- Faculty of Health Sciences, Joint Faculty, Potsdam, Germany.
| |
Collapse
|
60
|
DNA Methyltransferases and DNA Damage. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1389:349-361. [DOI: 10.1007/978-3-031-11454-0_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
61
|
Jiang L, Jiang H, Dai S, Chen Y, Song Y, Tang CSM, Pang SYY, Ho SL, Wang B, Garcia-Barcelo MM, Tam PKH, Cherny SS, Li MJ, Sham PC, Li M. Deviation from baseline mutation burden provides powerful and robust rare-variants association test for complex diseases. Nucleic Acids Res 2021; 50:e34. [PMID: 34931221 PMCID: PMC8989543 DOI: 10.1093/nar/gkab1234] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/19/2021] [Accepted: 12/04/2021] [Indexed: 02/07/2023] Open
Abstract
Identifying rare variants that contribute to complex diseases is challenging because of the low statistical power in current tests comparing cases with controls. Here, we propose a novel and powerful rare variants association test based on the deviation of the observed mutation burden of a gene in cases from a baseline predicted by a weighted recursive truncated negative-binomial regression (RUNNER) on genomic features available from public data. Simulation studies show that RUNNER is substantially more powerful than state-of-the-art rare variant association tests and has reasonable type 1 error rates even for stratified populations or in small samples. Applied to real case-control data, RUNNER recapitulates known genes of Hirschsprung disease and Alzheimer's disease missed by current methods and detects promising new candidate genes for both disorders. In a case-only study, RUNNER successfully detected a known causal gene of amyotrophic lateral sclerosis. The present study provides a powerful and robust method to identify susceptibility genes with rare risk variants for complex diseases.
Collapse
Affiliation(s)
- Lin Jiang
- Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Research Center of Medical Sciences, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China.,Key Laboratory of Tropical Disease Control (Sun Yat-sen University), Ministry of Education, Sun Yat-sen University, Guangzhou, China.,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China
| | - Hui Jiang
- Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Key Laboratory of Tropical Disease Control (Sun Yat-sen University), Ministry of Education, Sun Yat-sen University, Guangzhou, China.,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China
| | - Sheng Dai
- Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Key Laboratory of Tropical Disease Control (Sun Yat-sen University), Ministry of Education, Sun Yat-sen University, Guangzhou, China.,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China
| | - Ying Chen
- Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Key Laboratory of Tropical Disease Control (Sun Yat-sen University), Ministry of Education, Sun Yat-sen University, Guangzhou, China.,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China
| | - Youqiang Song
- School of Biomedical Sciences, the University of Hong Kong, Hong Kong, SAR China.,State Key Laboratory of Brain and Cognitive Sciences, the University of Hong Kong, Hong Kong, SAR China
| | - Clara Sze-Man Tang
- Department of Surgery, the University of Hong Kong, Hong Kong, SAR China.,Dr. Li Dak-Sum Research Centre, The University of Hong Kong - Karolinska Institutet Collaboration in Regenerative Medicine, Hong Kong, SAR China
| | - Shirley Yin-Yu Pang
- Division of Neurology, Department of Medicine, the University of Hong Kong, Hong Kong, SAR China
| | - Shu-Leong Ho
- Division of Neurology, Department of Medicine, the University of Hong Kong, Hong Kong, SAR China
| | - Binbin Wang
- Department of Genetics, National Research Institute for Family Planning, Beijing, China
| | | | - Paul Kwong-Hang Tam
- Department of Surgery, the University of Hong Kong, Hong Kong, SAR China.,Dr. Li Dak-Sum Research Centre, The University of Hong Kong - Karolinska Institutet Collaboration in Regenerative Medicine, Hong Kong, SAR China.,Faculty of Medicine, Macau University of Science and Technology, Macau, SAR China
| | | | - Mulin Jun Li
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Medical University, Tianjin 300070, China
| | - Pak Chung Sham
- The Centre for PanorOmic Sciences, the University of Hong Kong, Hong Kong, SAR China.,State Key Laboratory of Brain and Cognitive Sciences, the University of Hong Kong, Hong Kong, SAR China.,Department of Psychiatry, the University of Hong Kong, Hong Kong, SAR China
| | - Miaoxin Li
- Program in Bioinformatics, Zhongshan School of Medicine and The Fifth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Key Laboratory of Tropical Disease Control (Sun Yat-sen University), Ministry of Education, Sun Yat-sen University, Guangzhou, China.,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China.,The Centre for PanorOmic Sciences, the University of Hong Kong, Hong Kong, SAR China.,Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, China
| |
Collapse
|
62
|
Karipcin S, Wei S, Williams Z. Getting to the point (mutation). Fertil Steril 2021; 116:1359. [PMID: 34602259 DOI: 10.1016/j.fertnstert.2021.09.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 09/07/2021] [Indexed: 11/15/2022]
Affiliation(s)
- Sinem Karipcin
- Columbia University Fertility Center, New York, New York
| | - Shan Wei
- Columbia University Fertility Center, New York, New York
| | - Zev Williams
- Columbia University Fertility Center, New York, New York
| |
Collapse
|
63
|
Breuss MW, Yang X, Gleeson JG. Sperm mosaicism: implications for genomic diversity and disease. Trends Genet 2021; 37:890-902. [PMID: 34158173 PMCID: PMC9484299 DOI: 10.1016/j.tig.2021.05.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 05/26/2021] [Accepted: 05/27/2021] [Indexed: 12/18/2022]
Abstract
While sperm mosaicism has few consequences for men, the offspring and future generations are unwitting recipients of gonadal cell mutations, often yielding severe disease. Recent studies, fueled by emergent technologies, show that sperm mosaicism is a common source of de novo mutations (DNMs) that underlie severe pediatric disease as well as human genetic diversity. Sperm mosaicism can be divided into three types: Type I arises during sperm meiosis and is non-age dependent; Type II arises in spermatogonia and increases as men age; and Type III arises during paternal embryogenesis, spreads throughout the body, and contributes stably to sperm throughout life. Where Types I and II confer little risk of recurrence, Type III may confer identifiable risk to future offspring. These mutations are likely to be the single largest contributor to human genetic diversity. New sequencing approaches may leverage this framework to evaluate and reduce disease risk for future generations.
Collapse
Affiliation(s)
- Martin W Breuss
- Department of Pediatrics, Section of Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO, USA
| | - Xiaoxu Yang
- Rady Children's Institute for Genomic Medicine, Department of Neurosciences, University of California, San Diego, CA, USA
| | - Joseph G Gleeson
- Rady Children's Institute for Genomic Medicine, Department of Neurosciences, University of California, San Diego, CA, USA.
| |
Collapse
|
64
|
Seplyarskiy VB, Sunyaev S. The origin of human mutation in light of genomic data. Nat Rev Genet 2021; 22:672-686. [PMID: 34163020 DOI: 10.1038/s41576-021-00376-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/06/2021] [Indexed: 02/05/2023]
Abstract
Despite years of active research into the role of DNA repair and replication in mutagenesis, surprisingly little is known about the origin of spontaneous human mutation in the germ line. With the advent of high-throughput sequencing, genome-scale data have revealed statistical properties of mutagenesis in humans. These properties include variation of the mutation rate and spectrum along the genome at different scales in relation to epigenomic features and dependency on parental age. Moreover, mutations originated in mothers are less frequent than mutations originated in fathers and have a distinct genomic distribution. Statistical analyses that interpret these patterns in the context of known biochemistry can provide mechanistic models of mutagenesis in humans.
Collapse
Affiliation(s)
- Vladimir B Seplyarskiy
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Shamil Sunyaev
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA. .,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
65
|
Moore L, Cagan A, Coorens THH, Neville MDC, Sanghvi R, Sanders MA, Oliver TRW, Leongamornlert D, Ellis P, Noorani A, Mitchell TJ, Butler TM, Hooks Y, Warren AY, Jorgensen M, Dawson KJ, Menzies A, O'Neill L, Latimer C, Teng M, van Boxtel R, Iacobuzio-Donahue CA, Martincorena I, Heer R, Campbell PJ, Fitzgerald RC, Stratton MR, Rahbari R. The mutational landscape of human somatic and germline cells. Nature 2021; 597:381-386. [PMID: 34433962 DOI: 10.1038/s41586-021-03822-7] [Citation(s) in RCA: 139] [Impact Index Per Article: 46.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 07/13/2021] [Indexed: 12/31/2022]
Abstract
Over the course of an individual's lifetime, normal human cells accumulate mutations1. Here we compare the mutational landscape in 29 cell types from the soma and germline using multiple samples from the same individuals. Two ubiquitous mutational signatures, SBS1 and SBS5/40, accounted for the majority of acquired mutations in most cell types, but their absolute and relative contributions varied substantially. SBS18, which potentially reflects oxidative damage2, and several additional signatures attributed to exogenous and endogenous exposures contributed mutations to subsets of cell types. The rate of mutation was lowest in spermatogonia, the stem cells from which sperm are generated and from which most genetic variation in the human population is thought to originate. This was due to low rates of ubiquitous mutational processes and may be partially attributable to a low rate of cell division in basal spermatogonia. These results highlight similarities and differences in the maintenance of the germline and soma.
Collapse
Affiliation(s)
- Luiza Moore
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
- Department of Pathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Alex Cagan
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Tim H H Coorens
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Matthew D C Neville
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Rashesh Sanghvi
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Mathijs A Sanders
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
- Department of Hematology, Erasmus University Medical Center, Rotterdam, Netherlands
| | - Thomas R W Oliver
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
- Department of Pathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | | | - Peter Ellis
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
- Inivata, Cambridge, UK
| | - Ayesha Noorani
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Thomas J Mitchell
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
- Department of Surgery, University of Cambridge, Cambridge, UK
| | - Timothy M Butler
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Yvette Hooks
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Anne Y Warren
- Department of Pathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Mette Jorgensen
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Kevin J Dawson
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Andrew Menzies
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Laura O'Neill
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Calli Latimer
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Mabel Teng
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Ruben van Boxtel
- Princess Máxima Center for Pediatric Oncology and Oncode Institute, Utrecht, Netherlands
| | - Christine A Iacobuzio-Donahue
- Department of Pathology, Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Inigo Martincorena
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | - Rakesh Heer
- Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
- Newcastle Urology, Freeman Hospital, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Peter J Campbell
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK
| | | | - Michael R Stratton
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK.
| | - Raheleh Rahbari
- Cancer, Ageing and Somatic Mutation (CASM), Wellcome Sanger Institute, Hinxton, UK.
| |
Collapse
|
66
|
Trost B, Loureiro LO, Scherer SW. Discovery of genomic variation across a generation. Hum Mol Genet 2021; 30:R174-R186. [PMID: 34296264 PMCID: PMC8490016 DOI: 10.1093/hmg/ddab209] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 07/09/2021] [Accepted: 07/19/2021] [Indexed: 11/12/2022] Open
Abstract
Over the past 30 years (the timespan of a generation), advances in genomics technologies have revealed tremendous and unexpected variation in the human genome and have provided increasingly accurate answers to long-standing questions of how much genetic variation exists in human populations and to what degree the DNA complement changes between parents and offspring. Tracking the characteristics of these inherited and spontaneous (or de novo) variations has been the basis of the study of human genetic disease. From genome-wide microarray and next-generation sequencing scans, we now know that each human genome contains over 3 million single nucleotide variants when compared with the ~ 3 billion base pairs in the human reference genome, along with roughly an order of magnitude more DNA—approximately 30 megabase pairs (Mb)—being ‘structurally variable’, mostly in the form of indels and copy number changes. Additional large-scale variations include balanced inversions (average of 18 Mb) and complex, difficult-to-resolve alterations. Collectively, ~1% of an individual’s genome will differ from the human reference sequence. When comparing across a generation, fewer than 100 new genetic variants are typically detected in the euchromatic portion of a child’s genome. Driven by increasingly higher-resolution and higher-throughput sequencing technologies, newer and more accurate databases of genetic variation (for instance, more comprehensive structural variation data and phasing of combinations of variants along chromosomes) of worldwide populations will emerge to underpin the next era of discovery in human molecular genetics.
Collapse
Affiliation(s)
- Brett Trost
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Livia O Loureiro
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Stephen W Scherer
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada.,McLaughlin Centre and Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
67
|
Koire A, Katsonis P, Kim YW, Buchovecky C, Wilson SJ, Lichtarge O. A method to delineate de novo missense variants across pathways prioritizes genes linked to autism. Sci Transl Med 2021; 13:13/594/eabc1739. [PMID: 34011629 DOI: 10.1126/scitranslmed.abc1739] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Accepted: 03/01/2021] [Indexed: 12/31/2022]
Abstract
Genotype-phenotype relationships shape health and population fitness but remain difficult to predict and interpret. Here, we apply an evolutionary action method to de novo missense variants in whole-exome sequences of individuals with autism spectrum disorder (ASD) to unravel genes and pathways connected to ASD. Evolutionary action predicts the impact of missense variants on protein function by measuring the fitness effect based on phylogenetic distances and substitution odds in homologous gene sequences. By examining de novo missense variants in 2384 individuals with ASD (probands) compared to matched siblings without ASD, we found missense variants in 398 genes representing 23 pathways that were biased toward higher evolutionary action scores than expected by random chance; these pathways were involved in axonogenesis, synaptic transmission, and neurodevelopment. The predicted fitness impact of de novo and inherited missense variants in candidate genes correlated with the IQ of individuals with ASD, even for new gene candidates. Taking an evolutionary action method, we detected those missense variants most likely to contribute to ASD pathogenesis and elucidated their phenotypic impact. This approach could be applied to integrate missense variants across a patient cohort to identify genes contributing to a shared phenotype in other complex diseases.
Collapse
Affiliation(s)
- Amanda Koire
- Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, TX, USA.,Medical Scientist Training Program, Baylor College of Medicine, Houston, TX, USA.,Department of Psychiatry, Brigham and Women's Hospital, Boston, MA, USA.,Harvard Medical School, Boston, MA, USA
| | - Panagiotis Katsonis
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Young Won Kim
- Program in Integrative Molecular and Biomedical Sciences, Baylor College of Medicine, Houston, TX, USA
| | - Christie Buchovecky
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Division of Carrier Screening and Prenatal Testing, SEMA4, Stamford, CT, USA
| | - Stephen J Wilson
- Department of Biochemistry, Baylor College of Medicine, Houston, TX, USA
| | - Olivier Lichtarge
- Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, TX, USA. .,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Department of Biochemistry, Baylor College of Medicine, Houston, TX, USA
| |
Collapse
|
68
|
Auboeuf D. The Physics-Biology continuum challenges darwinism: Evolution is directed by the homeostasis-dependent bidirectional relation between genome and phenotype. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2021; 167:121-139. [PMID: 34097984 DOI: 10.1016/j.pbiomolbio.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 05/19/2021] [Accepted: 05/31/2021] [Indexed: 10/21/2022]
Abstract
The physics-biology continuum relies on the fact that life emerged from prebiotic molecules. Here, I argue that life emerged from the coupling between nucleic acid and protein synthesis during which proteins (or proto-phenotypes) maintained the physicochemical parameter equilibria (or proto-homeostasis) in the proximity of their encoding nucleic acids (or proto-genomes). This protected the proto-genome physicochemical integrity (i.e., atomic composition) from environmental physicochemical constraints, and therefore increased the probability of reproducing the proto-genome without variation. From there, genomes evolved depending on the biological activities they generated in response to environmental fluctuations. Thus, a genome maintaining homeostasis (i.e., internal physicochemical parameter equilibria), despite and in response to environmental fluctuations, maintains its physicochemical integrity and has therefore a higher probability to be reproduced without variation. Consequently, descendants have a higher probability to share the same phenotype than their parents. Otherwise, the genome is modified during replication as a consequence of the imbalance of the internal physicochemical parameters it generates, until new mutation-deriving biological activities maintain homeostasis in offspring. In summary, evolution depends on feedforward and feedback loops between genome and phenotype, as the internal physicochemical conditions that a genome generates ─ through its derived phenotype in response to environmental fluctuations ─ in turn either guarantee its stability or direct its variation. Evolution may not be explained by the Darwinism-derived, unidirectional principle (random mutations-phenotypes-natural selection) but rather by the bidirectional relationship between genome and phenotype, in which the phenotype in interaction with the environment directs the evolution of the genome it derives from.
Collapse
Affiliation(s)
- Didier Auboeuf
- ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée D'Italie, Site Jacques Monod, F-69007, Lyon, France.
| |
Collapse
|
69
|
Stability across the Whole Nuclear Genome in the Presence and Absence of DNA Mismatch Repair. Cells 2021; 10:cells10051224. [PMID: 34067668 PMCID: PMC8156620 DOI: 10.3390/cells10051224] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 05/13/2021] [Accepted: 05/14/2021] [Indexed: 01/06/2023] Open
Abstract
We describe the contribution of DNA mismatch repair (MMR) to the stability of the eukaryotic nuclear genome as determined by whole-genome sequencing. To date, wild-type nuclear genome mutation rates are known for over 40 eukaryotic species, while measurements in mismatch repair-defective organisms are fewer in number and are concentrated on Saccharomyces cerevisiae and human tumors. Well-studied organisms include Drosophila melanogaster and Mus musculus, while less genetically tractable species include great apes and long-lived trees. A variety of techniques have been developed to gather mutation rates, either per generation or per cell division. Generational rates are described through whole-organism mutation accumulation experiments and through offspring–parent sequencing, or they have been identified by descent. Rates per somatic cell division have been estimated from cell line mutation accumulation experiments, from systemic variant allele frequencies, and from widely spaced samples with known cell divisions per unit of tissue growth. The latter methods are also used to estimate generational mutation rates for large organisms that lack dedicated germlines, such as trees and hyphal fungi. Mechanistic studies involving genetic manipulation of MMR genes prior to mutation rate determination are thus far confined to yeast, Arabidopsis thaliana, Caenorhabditis elegans, and one chicken cell line. A great deal of work in wild-type organisms has begun to establish a sound baseline, but far more work is needed to uncover the variety of MMR across eukaryotes. Nonetheless, the few MMR studies reported to date indicate that MMR contributes 100-fold or more to genome stability, and they have uncovered insights that would have been impossible to obtain using reporter gene assays.
Collapse
|
70
|
Bergeron LA, Besenbacher S, Bakker J, Zheng J, Li P, Pacheco G, Sinding MHS, Kamilari M, Gilbert MTP, Schierup MH, Zhang G. The germline mutational process in rhesus macaque and its implications for phylogenetic dating. Gigascience 2021; 10:giab029. [PMID: 33954793 PMCID: PMC8099771 DOI: 10.1093/gigascience/giab029] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 01/05/2021] [Accepted: 03/29/2021] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Understanding the rate and pattern of germline mutations is of fundamental importance for understanding evolutionary processes. RESULTS Here we analyzed 19 parent-offspring trios of rhesus macaques (Macaca mulatta) at high sequencing coverage of ∼76× per individual and estimated a mean rate of 0.77 × 10-8de novo mutations per site per generation (95% CI: 0.69 × 10-8 to 0.85 × 10-8). By phasing 50% of the mutations to parental origins, we found that the mutation rate is positively correlated with the paternal age. The paternal lineage contributed a mean of 81% of the de novo mutations, with a trend of an increasing male contribution for older fathers. Approximately 3.5% of de novo mutations were shared between siblings, with no parental bias, suggesting that they arose from early development (postzygotic) stages. Finally, the divergence times between closely related primates calculated on the basis of the yearly mutation rate of rhesus macaque generally reconcile with divergence estimated with molecular clock methods, except for the Cercopithecoidea/Hominoidea molecular divergence dated at 58 Mya using our new estimate of the yearly mutation rate. CONCLUSIONS When compared to the traditional molecular clock methods, new estimated rates from pedigree samples can provide insights into the evolution of well-studied groups such as primates.
Collapse
Affiliation(s)
- Lucie A Bergeron
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - Søren Besenbacher
- Department of Molecular Medicine, Aarhus University, Brendstrupgårdsvej 21A, 8200 Aarhus N, Denmark
| | - Jaco Bakker
- Animal Science Department, Biomedical Primate Research Centre, Lange Kleiweg 161, 2288 GJ Rijswijk, Netherlands
| | - Jiao Zheng
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, Guangdong, China
| | - Panyi Li
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
| | - George Pacheco
- Section for Evolutionary Genomics, The GLOBE Institute, University of Copenhagen, Oester Voldgade 5-7, 1350 Copenhagen K, Denmark
| | - Mikkel-Holger S Sinding
- Department of genetics, Trinity College Dublin, 2 college green, D02 VF25, Dublin, Ireland
- Greenland Institute of Natural Resources, Kivioq 2, 3900 Nuuk, Greenland
| | - Maria Kamilari
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - M Thomas P Gilbert
- Section for Evolutionary Genomics, The GLOBE Institute, University of Copenhagen, Oester Voldgade 5-7, 1350 Copenhagen K, Denmark
- Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology (NTNU), NO-7491 Trondheim, Norway
| | - Mikkel H Schierup
- Bioinformatics Research Centre, Aarhus University, C.F.Møllers Allé 8, 8000, Aarhus C, Denmark
| | - Guojie Zhang
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
71
|
Yeager M, Machiela MJ, Kothiyal P, Dean M, Bodelon C, Suman S, Wang M, Mirabello L, Nelson CW, Zhou W, Palmer C, Ballew B, Colli LM, Freedman ND, Dagnall C, Hutchinson A, Vij V, Maruvka Y, Hatch M, Illienko I, Belayev Y, Nakamura N, Chumak V, Bakhanova E, Belyi D, Kryuchkov V, Golovanov I, Gudzenko N, Cahoon EK, Albert P, Drozdovitch V, Little MP, Mabuchi K, Stewart C, Getz G, Bazyka D, Berrington de Gonzalez A, Chanock SJ. Lack of transgenerational effects of ionizing radiation exposure from the Chernobyl accident. Science 2021; 372:725-729. [PMID: 33888597 DOI: 10.1126/science.abg2365] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 04/12/2021] [Indexed: 12/15/2022]
Abstract
Effects of radiation exposure from the Chernobyl nuclear accident remain a topic of interest. We investigated germline de novo mutations (DNMs) in children born to parents employed as cleanup workers or exposed to occupational and environmental ionizing radiation after the accident. Whole-genome sequencing of 130 children (born 1987-2002) and their parents did not reveal an increase in the rates, distributions, or types of DNMs relative to the results of previous studies. We find no elevation in total DNMs, regardless of cumulative preconception gonadal paternal [mean = 365 milligrays (mGy), range = 0 to 4080 mGy] or maternal (mean = 19 mGy, range = 0 to 550 mGy) exposure to ionizing radiation. Thus, we conclude that, over this exposure range, evidence is lacking for a substantial effect on germline DNMs in humans, suggesting minimal impact from transgenerational genetic effects.
Collapse
Affiliation(s)
- Meredith Yeager
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA. .,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Mitchell J Machiela
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Prachi Kothiyal
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,SymbioSeq LLC, Arlington, VA 20148, USA
| | - Michael Dean
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Clara Bodelon
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Shalabh Suman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Mingyi Wang
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Chase W Nelson
- Biodiversity Research Center, Academia Sinica, Taipei, 11529, Taiwan.,Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Weiyin Zhou
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Cameron Palmer
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Bari Ballew
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Leandro M Colli
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Department of Medical Imaging, Hematology, and Oncology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, 14049-900, Brazil
| | - Neal D Freedman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Casey Dagnall
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Amy Hutchinson
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.,Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD 21701, USA
| | - Vibha Vij
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Yosi Maruvka
- Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA.,Center for Cancer Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Maureen Hatch
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Iryna Illienko
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - Yuri Belayev
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - Nori Nakamura
- Department of Molecular Biosciences, Radiation Effects Research Foundation, 5-2 Hijiyama Park, Minami-ku, Hiroshima, 732-0815, Japan
| | - Vadim Chumak
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - Elena Bakhanova
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - David Belyi
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - Victor Kryuchkov
- Burnasyan Federal Medical and Biophysical Centre, 46 Zhivopisnaya Street, Moscow, 123182, Russia
| | - Ivan Golovanov
- Burnasyan Federal Medical and Biophysical Centre, 46 Zhivopisnaya Street, Moscow, 123182, Russia
| | - Natalia Gudzenko
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | - Elizabeth K Cahoon
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Paul Albert
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Vladimir Drozdovitch
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Mark P Little
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Kiyohiko Mabuchi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA
| | - Chip Stewart
- Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Gad Getz
- Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA.,Center for Cancer Research, Massachusetts General Hospital, Boston, MA 02114, USA.,Department of Pathology, Massachusetts General Hospital, Boston, MA 02114, USA.,Harvard Medical School, Boston, MA 02115, USA
| | - Dimitry Bazyka
- National Research Centre for Radiation Medicine, 53 Yu. Illienka Street, Kyiv, 04050, Ukraine
| | | | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20892, USA.
| |
Collapse
|
72
|
Shima H, Tokuhiro E, Okamoto S, Nagamori M, Ogata T, Narumi S, Nakamura A, Izumi Y, Jinno T, Suzuki E, Fukami M. SOX10 Mutation Screening for 117 Patients with Kallmann Syndrome. J Endocr Soc 2021; 5:bvab056. [PMID: 34095692 PMCID: PMC8170842 DOI: 10.1210/jendso/bvab056] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Indexed: 11/19/2022] Open
Abstract
Introduction Kallmann syndrome (KS) is a genetically heterogeneous condition characterized by hypogonadotropic hypogonadism (HH) and olfactory dysfunction. Although SOX10, a causative gene for Waardenburg syndrome (WS) and peripheral demyelinating neuropathy, central demyelination, WS, and Hirschsprung disease (PCWH) has previously been implicated in KS, the clinical significance of SOX10 variants as the cause of KS remains uncertain. Patients and Methods A total of 117 patients with KS underwent mutation screening of SOX10 and 14 other causative genes for KS/HH. Rare SOX10 variants were subjected to in silico and in vitro analyses. We also examined clinical data of the patients and their parents with SOX10 variants. Results Sequence analysis identified 2 heterozygous variants of SOX10 (c.1225G > T, p.Gly409* and c.475C > T, p.Arg159Trp) in patients 1–3, as well as in the parents of patients 1 and 3. The variants were assessed as pathogenic/likely pathogenic, according to the American College of Medical Genomics guidelines. Both variants lacked in vitro transactivating activity for the MITF promoter and exerted no dominant-negative effects. Patients 1–3 carried no pathogenic variants in other genes examined. The patients presented with typical KS, while such features were absent in the parents of patients 1 and 3. None of the 5 variant-positive individuals exhibited hypopigmentation, while 1 and 2 individuals exhibited complete and partial hearing loss, respectively. Conclusion These results provide evidence that SOX10 haploinsufficiency accounts for a small percentage of KS cases. SOX10 haploinsufficiency is likely to be associated with a broad phenotypic spectrum, which includes KS without other clinical features of WS/PCWH.
Collapse
Affiliation(s)
- Hirohito Shima
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Etsuro Tokuhiro
- Department of Pediatrics, Fujisawa City Hospital, 251-8550, Fujisawa, Japan
| | - Shingo Okamoto
- Okamoto Internal Medicine and Pediatrics Clinic, 633-0064, Sakurai, Japan
| | - Mariko Nagamori
- Department of Pediatrics, University of Toyama, 930-0194, Toyama, Japan
| | - Tsutomu Ogata
- Department of Pediatrics, Hamamatsu University School of Medicine, 431-3192, Hamamatsu, Japan
| | - Satoshi Narumi
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Akie Nakamura
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Yoko Izumi
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Tomoko Jinno
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Erina Suzuki
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| | - Maki Fukami
- Department of Molecular Endocrinology, National Research Institute for Child Health and Development, 157-8535 Tokyo, Japan
| |
Collapse
|
73
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
74
|
Niehus S, Jónsson H, Schönberger J, Björnsson E, Beyter D, Eggertsson HP, Sulem P, Stefánsson K, Halldórsson BV, Kehr B. PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes. Nat Commun 2021; 12:730. [PMID: 33526789 PMCID: PMC7851401 DOI: 10.1038/s41467-020-20850-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 12/14/2020] [Indexed: 12/14/2022] Open
Abstract
Thousands of genomic structural variants (SVs) segregate in the human population and can impact phenotypic traits and diseases. Their identification in whole-genome sequence data of large cohorts is a major computational challenge. Most current approaches identify SVs in single genomes and afterwards merge the identified variants into a joint call set across many genomes. We describe the approach PopDel, which directly identifies deletions of about 500 to at least 10,000 bp in length in data of many genomes jointly, eliminating the need for subsequent variant merging. PopDel scales to tens of thousands of genomes as we demonstrate in evaluations on up to 49,962 genomes. We show that PopDel reliably reports common, rare and de novo deletions. On genomes with available high-confidence reference call sets PopDel shows excellent recall and precision. Genotype inheritance patterns in up to 6794 trios indicate that genotypes predicted by PopDel are more reliable than those of previous SV callers. Furthermore, PopDel's running time is competitive with the fastest tested previous tools. The demonstrated scalability and accuracy of PopDel enables routine scans for deletions in large-scale sequencing studies.
Collapse
Affiliation(s)
- Sebastian Niehus
- Regensburg Center for Interventional Immunology (RCI), Regensburg, Germany
- Berlin Institute of Health (BIH), Berlin, Germany
- Charité-Universitätsmedizin Berlin, Berlin, Germany
| | | | - Janina Schönberger
- Regensburg Center for Interventional Immunology (RCI), Regensburg, Germany
- Berlin Institute of Health (BIH), Berlin, Germany
| | - Eythór Björnsson
- deCODE genetics/Amgen Inc., Reykjavík, Iceland
- Faculty of Medicine, School of Heath Sciences, University of Iceland, Reykjavík, Iceland
- Department of Internal Medicine, Landspítali-The National University Hospital of Iceland, Reykjavík, Iceland
| | | | | | | | - Kári Stefánsson
- deCODE genetics/Amgen Inc., Reykjavík, Iceland
- Faculty of Medicine, School of Heath Sciences, University of Iceland, Reykjavík, Iceland
| | - Bjarni V Halldórsson
- deCODE genetics/Amgen Inc., Reykjavík, Iceland
- School of Science and Engineering, Reykjavik University, Reykjavík, Iceland
| | - Birte Kehr
- Regensburg Center for Interventional Immunology (RCI), Regensburg, Germany.
- Berlin Institute of Health (BIH), Berlin, Germany.
- Charité-Universitätsmedizin Berlin, Berlin, Germany.
- Univeristät Regensburg, Regensburg, Germany.
| |
Collapse
|
75
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The impact of purifying and background selection on the inference of population history: problems and prospects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 33501439 PMCID: PMC7836109 DOI: 10.1101/2020.04.28.066365] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Current procedures for inferring population history generally assume complete neutrality - that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects (DFE) and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the DFE as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne 3012, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
76
|
Zhang Y, Wang R, Liu Z, Jiang S, Du L, Qiu K, Li F, Wang Q, Jin J, Chen X, Li Z, Wu J, Zhang N. Distinct genetic patterns of shared and unique genes across four neurodevelopmental disorders. Am J Med Genet B Neuropsychiatr Genet 2021; 186:3-15. [PMID: 32929885 DOI: 10.1002/ajmg.b.32821] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 06/04/2020] [Accepted: 08/15/2020] [Indexed: 01/09/2023]
Abstract
Neurodevelopmental disorders, including autism spectrum disorder (ASD), intellectual disability (ID), developmental disorders (DD) and epileptic encephalopathy (EE), have a strong clinical comorbidity, which indicates a common genetic etiology across various disorders. However, the underlying genetic mechanisms of comorbidity and specificity remain unknown across neurodevelopmental disorders. Based on de novo mutations, we compared systematically the functional characteristics between shared and unique genes under these disorders, as well as the spatiotemporal trajectory of development in brain and common molecular pathways of all shared genes. We observed that shared genes present more constrained against functional rare genetic variation, and harbor more pathogenic rare variants than do unique genes in each disorder. Furthermore, 71 shared genes formed two clusters related to synaptic transmission, transcription regulation and chromatin regulator. Particularly, we also found that two core genes STXBP1 and SCN2A, that were shared by the four neurodevelopmental disorders showed prominent pleiotropy. Our findings shed light on the shared and specific patterns across neurodevelopmental disorders and will enable us to further comprehend the etiology and provide valuable information for the diagnosis of neurodevelopmental disorders.
Collapse
Affiliation(s)
- Yijia Zhang
- Reproductive Center, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Ruochen Wang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Zhenwei Liu
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Shan Jiang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Lifeng Du
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Kairui Qiu
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Fengxia Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Qiongdan Wang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Jing Jin
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Xiaomin Chen
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Zhongshan Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Jinyu Wu
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Na Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China.,Medicine & Technology School of Zunyi Medical University, Zunyi, China
| |
Collapse
|
77
|
Housh K, Jha JS, Haldar T, Amin SBM, Islam T, Wallace A, Gomina A, Guo X, Nel C, Wyatt JW, Gates KS. Formation and repair of unavoidable, endogenous interstrand cross-links in cellular DNA. DNA Repair (Amst) 2020; 98:103029. [PMID: 33385969 DOI: 10.1016/j.dnarep.2020.103029] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 02/08/2023]
Abstract
Genome integrity is essential for life and, as a result, DNA repair systems evolved to remove unavoidable DNA lesions from cellular DNA. Many forms of life possess the capacity to remove interstrand DNA cross-links (ICLs) from their genome but the identity of the naturally-occurring, endogenous substrates that drove the evolution and retention of these DNA repair systems across a wide range of life forms remains uncertain. In this review, we describe more than a dozen chemical processes by which endogenous ICLs plausibly can be introduced into cellular DNA. The majority involve DNA degradation processes that introduce aldehyde residues into the double helix or reactions of DNA with endogenous low molecular weight aldehyde metabolites. A smaller number of the cross-linking processes involve reactions of DNA radicals generated by oxidation.
Collapse
Affiliation(s)
- Kurt Housh
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Jay S Jha
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Tuhin Haldar
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Saosan Binth Md Amin
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Tanhaul Islam
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Amanda Wallace
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Anuoluwapo Gomina
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Xu Guo
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Christopher Nel
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Jesse W Wyatt
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States
| | - Kent S Gates
- University of Missouri, Department of Chemistry, 125 Chemistry Building, Columbia, MO 65211, United States; University of Missouri, Department of Biochemistry, Columbia, MO 65211, United States.
| |
Collapse
|
78
|
Marian AJ. Clinical Interpretation and Management of Genetic Variants. ACTA ACUST UNITED AC 2020; 5:1029-1042. [PMID: 33145465 PMCID: PMC7591931 DOI: 10.1016/j.jacbts.2020.05.013] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 05/22/2020] [Accepted: 05/27/2020] [Indexed: 01/31/2023]
Abstract
The human genome contains approximately 4 million variants, whose population frequencies vary according to the ethnic backgrounds. Genetic diversity of humans in part determines interindividual variability in susceptibility to diseases, response to therapy, and the clinical outcomes. Genetic variants exert a gradient of biological and clinical effect sizes. In general, variants with the largest effect sizes are responsible for the single-gene disorders, whereas those with moderate and modest effect sizes are responsible for oligogenic and polygenic diseases, respectively. A phenotype is the consequence of nonlinear stochastic interactions among multiple genetic and nongenetic determinants. Discerning pathogenicity of the genetic variants, identified through genetic testing, in the clinical phenotype is challenging and requires complementary expertise in human molecular genetics and clinical medicine.
Genetic variants are major determinants of susceptibility to disease, response to therapy, and clinical outcomes. Advances in the short-read sequencing technologies, despite some shortcomings, have enabled identification of the vast majority of the genetic variants in each genome. The major challenge is in identifying the pathogenic variants in cardiovascular diseases. The yield of the genetic testing has been limited because of technological shortcomings and our incomplete understanding of the genetic basis of cardiovascular disorders. To advance the field, a shift to long-read sequencing platforms is necessary. In addition, to discern the pathogenic variants, genetic diseases should be considered as a continuum and the genetic variants as probabilistic factors with a gradient of effect sizes. Moreover, disease-specific physician-scientists with expertise in the clinical medicine and molecular genetics are best equipped to discern functional and clinical significance of the genetic variants. The changes would be expected to enhance clinical utilities of the genetic discoveries.
Collapse
Affiliation(s)
- Ali J Marian
- Center for Cardiovascular Genetics, Institute of Molecular Medicine and Department of Medicine, University of Texas Health Sciences Center at Houston, Houston, Texas
| |
Collapse
|
79
|
The Impact of DNA Methylation Dynamics on the Mutation Rate During Human Germline Development. G3-GENES GENOMES GENETICS 2020; 10:3337-3346. [PMID: 32727923 PMCID: PMC7466984 DOI: 10.1534/g3.120.401511] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
DNA methylation is a dynamic epigenetic modification found in most eukaryotic genomes. It is known to lead to a high CpG to TpG mutation rate. However, the relationship between the methylation dynamics in germline development and the germline mutation rate remains unexplored. In this study, we used whole genome bisulfite sequencing (WGBS) data of cells at 13 stages of human germline development and rare variants from the 1000 Genome Project as proxies for germline mutations to investigate the correlation between dynamic methylation levels and germline mutation rates at different scales. At the single-site level, we found a significant correlation between methylation and the germline point mutation rate at CpG sites during germline developmental stages. Then we explored the mutability of methylation dynamics in all stages. Our results also showed a broad correlation between the regional methylation level and the rate of C > T mutation at CpG sites in all genomic regions, especially in intronic regions; a similar link was also seen at all chromosomal levels. Our findings indicate that the dynamic DNA methylome during human germline development has a broader mutational impact than is commonly assumed.
Collapse
|
80
|
Jiang S, Zhou D, Wang YY, Jia P, Wan C, Li X, He G, Cao D, Jiang X, Kendler KS, Tsuang M, Mize T, Wu JS, Lu Y, He L, Chen J, Zhao Z, Chen X. Identification of de novo mutations in prenatal neurodevelopment-associated genes in schizophrenia in two Han Chinese patient-sibling family-based cohorts. Transl Psychiatry 2020; 10:307. [PMID: 32873781 PMCID: PMC7463022 DOI: 10.1038/s41398-020-00987-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 08/01/2020] [Accepted: 08/10/2020] [Indexed: 12/31/2022] Open
Abstract
Schizophrenia (SCZ) is a severe psychiatric disorder with a strong genetic component. High heritability of SCZ suggests a major role for transmitted genetic variants. Furthermore, SCZ is also associated with a marked reduction in fecundity, leading to the hypothesis that alleles with large effects on risk might often occur de novo. In this study, we conducted whole-genome sequencing for 23 families from two cohorts with unaffected siblings and parents. Two nonsense de novo mutations (DNMs) in GJC1 and HIST1H2AD were identified in SCZ patients. Ten genes (DPYSL2, NBPF1, SDK1, ZNF595, ZNF718, GCNT2, SNX9, AACS, KCNQ1, and MSI2) were found to carry more DNMs in SCZ patients than their unaffected siblings by burden test. Expression analyses indicated that these DNM implicated genes showed significantly higher expression in prefrontal cortex in prenatal stage. The DNM in the GJC1 gene is highly likely a loss function mutation (pLI = 0.94), leading to the dysregulation of ion channel in the glutamatergic excitatory neurons. Analysis of rare variants in independent exome sequencing dataset indicates that GJC1 has significantly more rare variants in SCZ patients than in unaffected controls. Data from genome-wide association studies suggested that common variants in the GJC1 gene may be associated with SCZ and SCZ-related traits. Genes co-expressed with GJC1 are involved in SCZ, SCZ-associated pathways, and drug targets. These evidences suggest that GJC1 may be a risk gene for SCZ and its function may be involved in prenatal and early neurodevelopment, a vulnerable period for developmental disorders such as SCZ.
Collapse
Affiliation(s)
- Shan Jiang
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Daizhan Zhou
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yin-Ying Wang
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Peilin Jia
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Chunling Wan
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Xingwang Li
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Guang He
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Dongmei Cao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Xiaoqian Jiang
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Kenneth S Kendler
- Virginia Institute of Psychiatric and Behavioral Genetics, Medical College of Virginia and Virginia Commonwealth University, Richmond, VA, 23298, USA
| | - Ming Tsuang
- Department of Psychiatry, University of California at San Diego, San Diego, CA, 92093, USA
| | - Travis Mize
- Department of Ecology and Evolutionary Biology, University of Colorado Boulder, Boulder, CO, 80309, USA
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, 80309, USA
| | - Jain-Shing Wu
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Yimei Lu
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Lin He
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong University, Shanghai, China.
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
- Institute of Neuropsychiatric Science and Systems Biological Medicine, Shanghai Jiao Tong University, Shanghai, China.
| | - Jingchun Chen
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
- MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, 77030, USA.
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
| | | |
Collapse
|
81
|
Simon H, Huttley G. Quantifying Influences on Intragenomic Mutation Rate. G3 (BETHESDA, MD.) 2020; 10:2641-2652. [PMID: 32527747 PMCID: PMC7407452 DOI: 10.1534/g3.120.401335] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 05/28/2020] [Indexed: 12/14/2022]
Abstract
We report work to quantify the impact on the probability of human genome polymorphism both of recombination and of sequence context at different scales. We use population-based analyses of data on human genetic variants obtained from the public Ensembl database. For recombination, we calculate the variance due to recombination and the probability that a recombination event causes a mutation. We employ novel statistical procedures to take account of the spatial auto-correlation of recombination and mutation rates along the genome. Our results support the view that genomic diversity in recombination hotspots arises largely from a direct effect of recombination on mutation rather than predominantly from the effect of selective sweeps. We also use the statistic of variance due to context to compare the effect on the probability of polymorphism of contexts of various sizes. We find that when the 12 point mutations are considered separately, variance due to context increases significantly as we move from 3-mer to 5-mer and from 5-mer to 7-mer contexts. However, when all mutations are considered in aggregate, these differences are outweighed by the effect of interaction between the central base and its immediate neighbors. This interaction is itself dominated by the transition mutations, including, but not limited to, the CpG effect. We also demonstrate strand-asymmetry of contextual influence in intronic regions, which is hypothesized to be a result of transcription coupled DNA repair. We consider the extent to which the measures we have used can be used to meaningfully compare the relative magnitudes of the impact of recombination and context on mutation.
Collapse
Affiliation(s)
- Helmut Simon
- Research School of Biology, the Australian National University
| | - Gavin Huttley
- Research School of Biology, the Australian National University
| |
Collapse
|
82
|
Singh VK, Rastogi A, Hu X, Wang Y, De S. Mutational signature SBS8 predominantly arises due to late replication errors in cancer. Commun Biol 2020; 3:421. [PMID: 32747711 PMCID: PMC7400754 DOI: 10.1038/s42003-020-01119-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2020] [Accepted: 07/02/2020] [Indexed: 02/08/2023] Open
Abstract
Although a majority of somatic mutations in cancer are passengers, their mutational signatures provide mechanistic insights into mutagenesis and DNA repair processes. Mutational signature SBS8 is common in most cancers, but its etiology is debated. Incorporating genomic, epigenomic, and cellular process features for multiple cell-types we develop genome-wide composite epigenomic context-maps relevant for mutagenesis and DNA repair. Analyzing somatic mutation data from multiple cancer types in their epigenomic contexts, we show that SBS8 preferentially occurs in gene-poor, lamina-proximal, late replicating heterochromatin domains. While SBS8 is uncommon among mutations in non-malignant tissues, in tumor genomes its proportions increase with replication timing and speed, and checkpoint defects further promote this signature - suggesting that SBS8 probably arises due to uncorrected late replication errors during cancer progression. Our observations offer a potential reconciliation among different perspectives in the debate about the etiology of SBS8 and its relationship with other mutational signatures.
Collapse
Affiliation(s)
- Vinod Kumar Singh
- Rutgers Cancer Institute, Rutgers the State University of New Jersey, New Brunswick, NJ, 08901, USA
| | - Arnav Rastogi
- Rutgers Cancer Institute, Rutgers the State University of New Jersey, New Brunswick, NJ, 08901, USA
| | - Xiaoju Hu
- Rutgers Cancer Institute, Rutgers the State University of New Jersey, New Brunswick, NJ, 08901, USA
| | - Yaqun Wang
- Rutgers Cancer Institute, Rutgers the State University of New Jersey, New Brunswick, NJ, 08901, USA
| | - Subhajyoti De
- Rutgers Cancer Institute, Rutgers the State University of New Jersey, New Brunswick, NJ, 08901, USA.
| |
Collapse
|
83
|
Gao F, Pan X, Dodd-Eaton EB, Recio CV, Montierth MD, Bojadzieva J, Mai PL, Zelley K, Johnson VE, Braun D, Nichols KE, Garber JE, Savage SA, Strong LC, Wang W. A pedigree-based prediction model identifies carriers of deleterious de novo mutations in families with Li-Fraumeni syndrome. Genome Res 2020; 30:1170-1180. [PMID: 32817165 PMCID: PMC7462073 DOI: 10.1101/gr.249599.119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Accepted: 06/25/2020] [Indexed: 01/14/2023]
Abstract
De novo mutations (DNMs) are increasingly recognized as rare disease causal factors. Identifying DNM carriers will allow researchers to study the likely distinct molecular mechanisms of DNMs. We developed Famdenovo to predict DNM status (DNM or familial mutation [FM]) of deleterious autosomal dominant germline mutations for any syndrome. We introduce Famdenovo.TP53 for Li-Fraumeni syndrome (LFS) and analyze 324 LFS family pedigrees from four US cohorts: a validation set of 186 pedigrees and a discovery set of 138 pedigrees. The concordance index for Famdenovo.TP53 prediction was 0.95 (95% CI: [0.92, 0.98]). Forty individuals (95% CI: [30, 50]) were predicted as DNM carriers, increasing the total number from 42 to 82. We compared clinical and biological features of FM versus DNM carriers: (1) cancer and mutation spectra along with parental ages were similarly distributed; (2) ascertainment criteria like early-onset breast cancer (age 20-35 yr) provides a condition for an unbiased estimate of the DNM rate: 48% (23 DNMs vs. 25 FMs); and (3) hotspot mutation R248W was not observed in DNMs, although it was as prevalent as hotspot mutation R248Q in FMs. Furthermore, we introduce Famdenovo.BRCA for hereditary breast and ovarian cancer syndrome and apply it to a small set of family data from the Cancer Genetics Network. In summary, we introduce a novel statistical approach to systematically evaluate deleterious DNMs in inherited cancer syndromes. Our approach may serve as a foundation for future studies evaluating how new deleterious mutations can be established in the germline, such as those in TP53.
Collapse
Affiliation(s)
- Fan Gao
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Department of Statistics, Rice University, Houston, Texas 77005, USA
| | - Xuedong Pan
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Department of Statistics, Texas A&M University, College Station, Texas 77843, USA
| | - Elissa B Dodd-Eaton
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Carlos Vera Recio
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Matthew D Montierth
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Jasmina Bojadzieva
- Department of Genetics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Phuong L Mai
- Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Kristin Zelley
- Division of Oncology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Valen E Johnson
- Department of Statistics, Texas A&M University, College Station, Texas 77843, USA
| | - Danielle Braun
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| | - Kim E Nichols
- Division of Oncology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Judy E Garber
- Dana-Farber Cancer Institute and Harvard Medical School, Boston, Massachusetts 02215, USA
| | - Sharon A Savage
- Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Louise C Strong
- Department of Genetics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | - Wenyi Wang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| |
Collapse
|
84
|
Chiu FPC, Doolan BJ, McGrath JA, Onoufriadis A. A decade of next-generation sequencing in genodermatoses: the impact on gene discovery and clinical diagnostics. Br J Dermatol 2020; 184:606-616. [PMID: 32628274 DOI: 10.1111/bjd.19384] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/01/2020] [Indexed: 12/11/2022]
Abstract
BACKGROUND Discovering the genetic basis of inherited skin diseases is fundamental to improving diagnostic accuracy and genetic counselling. In the 1990s and 2000s, genetic linkage and candidate gene approaches led to the molecular characterization of several dozen genodermatoses, but over the past decade the advent of next-generation sequencing (NGS) technologies has accelerated diagnostic discovery and precision. OBJECTIVES This review examines the application of NGS technologies from 2009 to 2019 that have (i) led to the initial discovery of gene mutations in known or new genodermatoses and (ii) identified involvement of more than one contributing pathogenic gene in individuals with complex Mendelian skin disorder phenotypes. METHODS A comprehensive review of the PubMed database and dermatology conference abstracts was undertaken between January 2009 and December 2019. The results were collated and cross-referenced with OMIM. RESULTS We identified 166 new disease-gene associations in inherited skin diseases discovered by NGS. Of these, 131 were previously recognized, while 35 were brand new disorders. Eighty-five were autosomal dominant (with 43 of 85 mutations occurring de novo), 78 were autosomal recessive and three were X-linked. We also identified 63 cases harbouring multiple pathogenic mutations, either involving two coexisting genodermatoses (n = 13) or an inherited skin disorder in conjunction with other organ system phenotypes (n = 50). CONCLUSIONS NGS technologies have accelerated disease-gene discoveries in dermatology over the last decade. Moreover, the era of NGS has enabled clinicians to split complex Mendelian phenotypes into separate diseases. These genetic data improve diagnostic precision and make feasible accurate prenatal testing and better-targeted translational research.
Collapse
Affiliation(s)
- F P-C Chiu
- St John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - B J Doolan
- St John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - J A McGrath
- St John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - A Onoufriadis
- St John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| |
Collapse
|
85
|
Germline de novo mutation rates on exons versus introns in humans. Nat Commun 2020; 11:3304. [PMID: 32620809 PMCID: PMC7334200 DOI: 10.1038/s41467-020-17162-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 06/02/2020] [Indexed: 02/06/2023] Open
Abstract
A main assumption of molecular population genetics is that genomic mutation rate does not depend on sequence function. Challenging this assumption, a recent study has found a reduction in the mutation rate in exons compared to introns in somatic cells, ascribed to an enhanced exonic mismatch repair system activity. If this reduction happens also in the germline, it can compromise studies of population genomics, including the detection of selection when using introns as proxies for neutrality. Here we compile and analyze published germline de novo mutation data to test if the exonic mutation rate is also reduced in germ cells. After controlling for sampling bias in datasets with diseased probands and extended nucleotide context dependency, we find no reduction in the mutation rate in exons compared to introns in the germline. Therefore, there is no evidence that enhanced exonic mismatch repair activity determines the mutation rate in germline cells. Evidence that somatic mutation rates in introns exceed those in exons challenges the molecular evolution tenet that mutation rate and sequence function are independent. Here, authors analyze germline de novo mutations and reveal no evidence for mutation rate differences between exons and introns.
Collapse
|
86
|
Vierstra J, Lazar J, Sandstrom R, Halow J, Lee K, Bates D, Diegel M, Dunn D, Neri F, Haugen E, Rynes E, Reynolds A, Nelson J, Johnson A, Frerker M, Buckley M, Kaul R, Meuleman W, Stamatoyannopoulos JA. Global reference mapping of human transcription factor footprints. Nature 2020; 583:729-736. [PMID: 32728250 PMCID: PMC7410829 DOI: 10.1038/s41586-020-2528-x] [Citation(s) in RCA: 162] [Impact Index Per Article: 40.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 06/25/2020] [Indexed: 11/09/2022]
Abstract
Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.
Collapse
Affiliation(s)
- Jeff Vierstra
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
| | - John Lazar
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | - Jessica Halow
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Kristen Lee
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Daniel Bates
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Morgan Diegel
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Douglas Dunn
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Fidencio Neri
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Eric Haugen
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Eric Rynes
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Alex Reynolds
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Jemma Nelson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Audra Johnson
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | - Mark Frerker
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | - Rajinder Kaul
- Altius Institute for Biomedical Sciences, Seattle, WA, USA
| | | | - John A Stamatoyannopoulos
- Altius Institute for Biomedical Sciences, Seattle, WA, USA.
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Division of Oncology, Department of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
87
|
Saini N, Gordenin DA. Hypermutation in single-stranded DNA. DNA Repair (Amst) 2020; 91-92:102868. [PMID: 32438271 PMCID: PMC7234795 DOI: 10.1016/j.dnarep.2020.102868] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 05/02/2020] [Accepted: 05/04/2020] [Indexed: 12/15/2022]
Abstract
Regions of genomic DNA can become single-stranded in the course of normal replication and transcription as well as during DNA repair. Abnormal repair and replication intermediates can contain large stretches of persistent single-stranded DNA, which is extremely vulnerable to DNA damaging agents and hypermutation. Since such single-stranded DNA spans only a fraction of the genome at a given instance, hypermutation in these regions leads to tightly-spaced mutation clusters. This phenomenon of hypermutation in single-stranded DNA has been documented in several experimental models as well as in cancer genomes. Recently, hypermutated single-stranded RNA viral genomes also have been documented. Moreover, indications of hypermutation in single-stranded DNA may also be found in the human germline. This review will summarize key current knowledge and the recent developments in understanding the diverse mechanisms and sources of ssDNA hypermutation.
Collapse
Affiliation(s)
- Natalie Saini
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Research Triangle Park, NC, USA
| | - Dmitry A Gordenin
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Research Triangle Park, NC, USA.
| |
Collapse
|
88
|
Lopera Maya EA, van der Graaf A, Lanting P, van der Geest M, Fu J, Swertz M, Franke L, Wijmenga C, Deelen P, Zhernakova A, Sanna S. Lack of Association Between Genetic Variants at ACE2 and TMPRSS2 Genes Involved in SARS-CoV-2 Infection and Human Quantitative Phenotypes. Front Genet 2020; 11:613. [PMID: 32582302 PMCID: PMC7295011 DOI: 10.3389/fgene.2020.00613] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 05/19/2020] [Indexed: 12/12/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19) shows a wide variation in expression and severity of symptoms, from very mild or no symptoms, to flu-like symptoms, and in more severe cases, to pneumonia, acute respiratory distress syndrome, and even death. Large differences in outcome have also been observed between males and females. The causes for this variability are likely to be multifactorial, and to include genetics. The SARS-CoV-2 virus responsible for the infection depends on two human genes: the human receptor angiotensin converting enzyme 2 (ACE2) for cell invasion, and the serine protease TMPRSS2 for S protein priming. Genetic variation in these two genes may thus modulate an individual's genetic predisposition to infection and virus clearance. While genetic data on COVID-19 patients is being gathered, we carried out a phenome-wide association scan (PheWAS) to investigate the role of these genes in other human phenotypes in the general population. We examined 178 quantitative phenotypes including cytokines and cardio-metabolic biomarkers, as well as usage of 58 medications in 36,339 volunteers from the Lifelines population cohort, in relation to 1,273 genetic variants located in or near ACE2 and TMPRSS2. While none reached our threshold for significance, we observed several interesting suggestive associations. For example, single nucleotide polymorphisms (SNPs) near the TMPRSS2 genes were associated with thrombocytes count (p = 1.8 × 10-5). SNPs within the ACE2 gene were associated with (1) the use of angiotensin II receptor blockers (ARBs) combination therapies (p = 5.7 × 10-4), an association that is significantly stronger in females (p dif f = 0.01), and (2) with the use of non-steroid anti-inflammatory and antirheumatic products (p = 5.5 × 10-4). While these associations need to be confirmed in larger sample sizes, they suggest that these variants could play a role in diseases such as thrombocytopenia, hypertension, and chronic inflammation that are often observed in the more severe COVID-19 cases. Further investigation of these genetic variants in the context of COVID-19 is thus promising for better understanding of disease variability. Full results are available at https://covid19research.nl.
Collapse
Affiliation(s)
- Esteban A. Lopera Maya
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Adriaan van der Graaf
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Pauline Lanting
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Marije van der Geest
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Jingyuan Fu
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
- Department of Pediatrics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Morris Swertz
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Lude Franke
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Cisca Wijmenga
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Patrick Deelen
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
- Department of Genetics, University Medical Centre Utrecht, Utrecht, Netherlands
| | - Alexandra Zhernakova
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Serena Sanna
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
- Institute for Genetics and Biomedical Research (IRGB), Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy
| | | |
Collapse
|
89
|
Rong S, Buerer L, Rhine CL, Wang J, Cygan KJ, Fairbrother WG. Mutational bias and the protein code shape the evolution of splicing enhancers. Nat Commun 2020; 11:2845. [PMID: 32504065 PMCID: PMC7275064 DOI: 10.1038/s41467-020-16673-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 04/28/2020] [Indexed: 02/06/2023] Open
Abstract
Exonic splicing enhancers (ESEs) are enriched in exons relative to introns and bind splicing activators. This study considers a fundamental question of co-evolution: How did ESE motifs become enriched in exons prior to the evolution of ESE recognition? We hypothesize that the high exon to intron motif ratios necessary for ESE function were created by mutational bias coupled with purifying selection on the protein code. These two forces retain certain coding motifs in exons while passively depleting them from introns. Through the use of simulations, genomic analyses, and high throughput splicing assays, we confirm the key predictions of this hypothesis, including an overlap between protein and splicing information in ESEs. We discuss the implications of mutational bias as an evolutionary driver in other cis-regulatory systems. Splicing is regulated by cis-acting elements in pre-mRNAs such as exonic or intronic splicing enhancers and silencers. Here the authors show that exonic splicing enhancers are enriched in exons compared to introns due to mutational bias coupled with purifying selection on the protein code.
Collapse
Affiliation(s)
- Stephen Rong
- Center for Computational Molecular Biology, Brown University, Providence, RI, 02912, USA.,Ecology and Evolutionary Biology, Brown University, Providence, RI, 02912, USA
| | - Luke Buerer
- Center for Computational Molecular Biology, Brown University, Providence, RI, 02912, USA
| | - Christy L Rhine
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI, 02912, USA
| | - Jing Wang
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI, 02912, USA
| | - Kamil J Cygan
- Center for Computational Molecular Biology, Brown University, Providence, RI, 02912, USA.,Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI, 02912, USA
| | - William G Fairbrother
- Center for Computational Molecular Biology, Brown University, Providence, RI, 02912, USA. .,Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI, 02912, USA. .,Hassenfeld Child Health Innovation Institute of Brown University, Providence, RI, 02912, USA.
| |
Collapse
|
90
|
Wang RJ, Thomas GWC, Raveendran M, Harris RA, Doddapaneni H, Muzny DM, Capitanio JP, Radivojac P, Rogers J, Hahn MW. Paternal age in rhesus macaques is positively associated with germline mutation accumulation but not with measures of offspring sociability. Genome Res 2020; 30:826-834. [PMID: 32461224 PMCID: PMC7370888 DOI: 10.1101/gr.255174.119] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 05/21/2020] [Indexed: 01/26/2023]
Abstract
Mutation is the ultimate source of all genetic novelty and the cause of heritable genetic disorders. Mutational burden has been linked to complex disease, including neurodevelopmental disorders such as schizophrenia and autism. The rate of mutation is a fundamental genomic parameter and direct estimates of this parameter have been enabled by accurate comparisons of whole-genome sequences between parents and offspring. Studies in humans have revealed that the paternal age at conception explains most of the variation in mutation rate: Each additional year of paternal age in humans leads to approximately 1.5 additional inherited mutations. Here, we present an estimate of the de novo mutation rate in the rhesus macaque (Macaca mulatta) using whole-genome sequence data from 32 individuals in four large pedigrees. We estimated an average mutation rate of 0.58 × 10−8 per base pair per generation (at an average parental age of 7.5 yr), much lower than found in direct estimates from great apes. As in humans, older macaque fathers transmit more mutations to their offspring, increasing the per generation mutation rate by 4.27 × 10−10 per base pair per year. We found that the rate of mutation accumulation after puberty is similar between macaques and humans, but that a smaller number of mutations accumulate before puberty in macaques. We additionally investigated the role of paternal age on offspring sociability, a proxy for normal neurodevelopment, by studying 203 male macaques in large social groups.
Collapse
Affiliation(s)
- Richard J Wang
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Gregg W C Thomas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.,Department of Computer Science, Indiana University, Bloomington, Indiana 47405, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - R Alan Harris
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Harshavardhan Doddapaneni
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - John P Capitanio
- California National Primate Research Center, University of California-Davis, Davis, California 95616, USA
| | - Predrag Radivojac
- Department of Computer Science, Indiana University, Bloomington, Indiana 47405, USA.,Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts 02115, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.,Department of Computer Science, Indiana University, Bloomington, Indiana 47405, USA
| |
Collapse
|
91
|
Wang Q, Pierce-Hoffman E, Cummings BB, Alföldi J, Francioli LC, Gauthier LD, Hill AJ, O'Donnell-Luria AH, Karczewski KJ, MacArthur DG. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes. Nat Commun 2020; 11:2539. [PMID: 32461613 PMCID: PMC7253413 DOI: 10.1038/s41467-019-12438-5] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open
Abstract
Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2 bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms - CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions - on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs.
Collapse
Affiliation(s)
- Qingbo Wang
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
| | - Emma Pierce-Hoffman
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Beryl B Cummings
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Program in Biomedical and Biological Sciences, Harvard Medical School, Boston, MA, 02115, USA
| | - Jessica Alföldi
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Laurent C Francioli
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Laura D Gauthier
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Andrew J Hill
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Anne H O'Donnell-Luria
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Konrad J Karczewski
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA.
- Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia.
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.
| |
Collapse
|
92
|
Lodato MA, Walsh CA. Genome aging: somatic mutation in the brain links age-related decline with disease and nominates pathogenic mechanisms. Hum Mol Genet 2020; 28:R197-R206. [PMID: 31578549 DOI: 10.1093/hmg/ddz191] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 07/25/2019] [Accepted: 07/26/2019] [Indexed: 12/19/2022] Open
Abstract
Aging is a mysterious process, not only controlled genetically but also subject to random damage that can accumulate over time. While DNA damage and subsequent mutation in somatic cells were first proposed as drivers of aging more than 60 years ago, whether and to what degree these processes shape the neuronal genome in the human brain could not be tested until recent technological breakthroughs related to single-cell whole-genome sequencing. Indeed, somatic single-nucleotide variants (SNVs) increase with age in the human brain, in a somewhat stochastic process that may nonetheless be controlled by underlying genetic programs. Evidence from the literature suggests that in addition to demonstrated increases in somatic SNVs during aging in normal brains, somatic mutation may also play a role in late-onset, sporadic neurodegenerative diseases, such as Alzheimer's disease and Parkinson's disease. In this review, we will discuss somatic mutation in the human brain, mechanisms by which somatic mutations occur and can be controlled, and how this process can impact human health.
Collapse
Affiliation(s)
- Michael A Lodato
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Howard Hughes Medical Institute, Boston, MA, USA.,Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Molecular, Cell, and Cancer Biology, University of Massachusetts Medical School, Worcester, MA, USA
| | - Christopher A Walsh
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Howard Hughes Medical Institute, Boston, MA, USA.,Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
93
|
Hsieh A, Morton SU, Willcox JAL, Gorham JM, Tai AC, Qi H, DePalma S, McKean D, Griffin E, Manheimer KB, Bernstein D, Kim RW, Newburger JW, Porter GA, Srivastava D, Tristani-Firouzi M, Brueckner M, Lifton RP, Goldmuntz E, Gelb BD, Chung WK, Seidman CE, Seidman JG, Shen Y. EM-mosaic detects mosaic point mutations that contribute to congenital heart disease. Genome Med 2020; 12:42. [PMID: 32349777 PMCID: PMC7189690 DOI: 10.1186/s13073-020-00738-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 04/09/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The contribution of somatic mosaicism, or genetic mutations arising after oocyte fertilization, to congenital heart disease (CHD) is not well understood. Further, the relationship between mosaicism in blood and cardiovascular tissue has not been determined. METHODS We developed a new computational method, EM-mosaic (Expectation-Maximization-based detection of mosaicism), to analyze mosaicism in exome sequences derived primarily from blood DNA of 2530 CHD proband-parent trios. To optimize this method, we measured mosaic detection power as a function of sequencing depth. In parallel, we analyzed our cohort using MosaicHunter, a Bayesian genotyping algorithm-based mosaic detection tool, and compared the two methods. The accuracy of these mosaic variant detection algorithms was assessed using an independent resequencing method. We then applied both methods to detect mosaicism in cardiac tissue-derived exome sequences of 66 participants for which matched blood and heart tissue was available. RESULTS EM-mosaic detected 326 mosaic mutations in blood and/or cardiac tissue DNA. Of the 309 detected in blood DNA, 85/97 (88%) tested were independently confirmed, while 7/17 (41%) candidates of 17 detected in cardiac tissue were confirmed. MosaicHunter detected an additional 64 mosaics, of which 23/46 (50%) among 58 candidates from blood and 4/6 (67%) of 6 candidates from cardiac tissue confirmed. Twenty-five mosaic variants altered CHD-risk genes, affecting 1% of our cohort. Of these 25, 22/22 candidates tested were confirmed. Variants predicted as damaging had higher variant allele fraction than benign variants, suggesting a role in CHD. The estimated true frequency of mosaic variants above 10% mosaicism was 0.14/person in blood and 0.21/person in cardiac tissue. Analysis of 66 individuals with matched cardiac tissue available revealed both tissue-specific and shared mosaicism, with shared mosaics generally having higher allele fraction. CONCLUSIONS We estimate that ~ 1% of CHD probands have a mosaic variant detectable in blood that could contribute to cardiac malformations, particularly those damaging variants with relatively higher allele fraction. Although blood is a readily available DNA source, cardiac tissues analyzed contributed ~ 5% of somatic mosaic variants identified, indicating the value of tissue mosaicism analyses.
Collapse
Affiliation(s)
- Alexander Hsieh
- Columbia University Medical Center, 1130 St Nicholas Ave, New York, NY, 10032, USA
| | - Sarah U Morton
- Boston Children's Hospital, Boston, MA, USA.,Harvard Medical School, Boston, MA, USA
| | | | | | | | - Hongjian Qi
- Columbia University Medical Center, 1130 St Nicholas Ave, New York, NY, 10032, USA
| | | | | | - Emily Griffin
- Columbia University Medical Center, 1130 St Nicholas Ave, New York, NY, 10032, USA
| | | | | | - Richard W Kim
- Children's Hospital Los Angeles, Los Angeles, CA, USA
| | | | | | - Deepak Srivastava
- Gladstone Institutes and University of California San Francisco, San Francisco, CA, USA
| | | | | | | | | | - Bruce D Gelb
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Wendy K Chung
- Columbia University Medical Center, 1130 St Nicholas Ave, New York, NY, 10032, USA
| | - Christine E Seidman
- Harvard Medical School, Boston, MA, USA.,Brigham and Women's Hospital, Boston, MA, USA.,Howard Hughes Medical Institute, Harvard University, Boston, MA, USA
| | | | - Yufeng Shen
- Columbia University Medical Center, 1130 St Nicholas Ave, New York, NY, 10032, USA.
| |
Collapse
|
94
|
Abstract
Developmental and epileptic encephalopathies (DEEs) can be primarily attributed to genetic causes. The genetic landscape of DEEs has been largely shaped by the rise of high-throughput sequencing, which led to the discovery of new DEE-associated genes and helped identify de novo pathogenic variants. We discuss briefly the contribution of de novo variants to DEE and also focus on alternative inheritance models that contribute to DEE. First, autosomal recessive inheritance in outbred populations may have a larger contribution than previously appreciated, accounting for up to 13% of DEEs. A small subset of genes that typically harbor de novo variants have been associated with recessive inheritance, and often these individuals have more severe clinical presentations. Additionally, pathogenic variants in X-linked genes have been identified in both affected males and females, possibly due to a lack of X-chromosome inactivation skewing. Collectively, exome sequencing has resulted in a molecular diagnosis for many individuals with DEE, but this still leaves many cases unsolved. Multiple factors contribute to the missing etiology, including nonexonic variants, mosaicism, epigenetics, and oligogenic inheritance. Here, we focus on the first 2 factors. We discuss the promises and challenges of genome sequencing, which allows for a more comprehensive analysis of the genome, including interpretation of structural and noncoding variants and also yields a high number of de novo variants for interpretation. We also consider the contribution of genetic mosaicism, both what it means for a molecular diagnosis in mosaic individuals and the important implications for genetic counseling.
Collapse
Affiliation(s)
- Hannah C Happ
- Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Gemma L Carvill
- Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| |
Collapse
|
95
|
Li C, Luscombe NM. Nucleosome positioning stability is a modulator of germline mutation rate variation across the human genome. Nat Commun 2020; 11:1363. [PMID: 32170069 PMCID: PMC7070026 DOI: 10.1038/s41467-020-15185-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 02/23/2020] [Indexed: 02/08/2023] Open
Abstract
Nucleosome organization has been suggested to affect local mutation rates in the genome. However, the lack of de novo mutation and high-resolution nucleosome data has limited the investigation of this hypothesis. Additionally, analyses using indirect mutation rate measurements have yielded contradictory and potentially confounding results. Here, we combine data on >300,000 human de novo mutations with high-resolution nucleosome maps and find substantially elevated mutation rates around translationally stable (‘strong’) nucleosomes. We show that the mutational mechanisms affected by strong nucleosomes are low-fidelity replication, insufficient mismatch repair and increased double-strand breaks. Strong nucleosomes preferentially locate within young SINE/LINE transposons, suggesting that when subject to increased mutation rates, transposons are then more rapidly inactivated. Depletion of strong nucleosomes in older transposons suggests frequent positioning changes during evolution. The findings have important implications for human genetics and genome evolution. Nucleosome organization has been suggested to affect local mutation rates in the genome. Here, the authors analyse data on >300,000 human de novo mutations and high-resolution nucleosome maps and provide evidence that nucleosome positioning stability modulates germline mutation rate variation across the human genome.
Collapse
Affiliation(s)
- Cai Li
- The Francis Crick Institute, London, NW1 1AT, UK. .,School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
| | - Nicholas M Luscombe
- The Francis Crick Institute, London, NW1 1AT, UK.,Okinawa Institute of Science & Technology Graduate University, Okinawa, 904-0495, Japan.,UCL Genetics Institute, University College London, London, WC1E 6BT, UK
| |
Collapse
|
96
|
Tomoiaga D, Aguiar-Pulido V, Shrestha S, Feinstein P, Levy SE, Mason CE, Rosenfeld JA. Single-cell sperm transcriptomes and variants from fathers of children with and without autism spectrum disorder. NPJ Genom Med 2020; 5:14. [PMID: 32133155 PMCID: PMC7035312 DOI: 10.1038/s41525-020-0117-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Accepted: 01/02/2020] [Indexed: 11/17/2022] Open
Abstract
The human sperm is one of the smallest cells in the body, but also one of the most important, as it serves as the entire paternal genetic contribution to a child. Investigating RNA and mutations in sperm is especially relevant for diseases such as autism spectrum disorders (ASD), which have been correlated with advanced paternal age. Historically, studies have focused on the assessment of bulk sperm, wherein millions of individual sperm are present and only high-frequency variants can be detected. Using 10× Chromium single-cell sequencing technology, we assessed the transcriptome from >65,000 single spermatozoa across six sperm donors (scSperm-RNA-seq), including two who fathered multiple children with ASD and four fathers of neurotypical children. Using RNA-seq methods for differential expression and variant analysis, we found clusters of sperm mutations in each donor that are indicative of the sperm being produced by different stem cell pools. Finally, we have shown that genetic variations can be found in single sperm.
Collapse
Affiliation(s)
- Delia Tomoiaga
- 1Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY USA
| | - Vanessa Aguiar-Pulido
- 2The Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY USA
| | | | - Paul Feinstein
- 4Hunter College, City University of New York, New York, NY USA
| | - Shawn E Levy
- 3Hudson Alpha Institute for Biotechnology, Huntsville, AL USA
| | - Christopher E Mason
- 1Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY USA.,2The Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY USA.,5The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY USA.,6The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY USA
| | - Jeffrey A Rosenfeld
- 7Rutgers Cancer Institute of New Jersey, New Brunswick, NJ USA.,8Department of Pathology, Robert Wood Johnson Medical School, New Brunswick, NJ USA
| |
Collapse
|
97
|
Cytosine Methylation Affects the Mutability of Neighboring Nucleotides in Germline and Soma. Genetics 2020; 214:809-823. [PMID: 32079595 DOI: 10.1534/genetics.120.303028] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 02/12/2020] [Indexed: 02/07/2023] Open
Abstract
Methylated cytosines deaminate at higher rates than unmethylated cytosines, and the lesions they produce are repaired less efficiently. As a result, methylated cytosines are mutational hotspots. Here, combining rare polymorphism and base-resolution methylation data in humans, Arabidopsis thaliana, and rice (Oryza sativa), we present evidence that methylation state affects mutation dynamics not only at the focal cytosine but also at neighboring nucleotides. In humans, contrary to prior suggestions, we find that nucleotides in the close vicinity (±3 bp) of methylated cytosines mutate less frequently. Reduced mutability around methylated CpGs is also observed in cancer genomes, considering single nucleotide variants alongside tissue-of-origin-matched methylation data. In contrast, methylation is associated with increased neighborhood mutation risk in A. thaliana and rice. The difference in neighborhood mutation risk is less pronounced further away from the focal CpG and modulated by regional GC content. Our results are consistent with a model where altered risk at neighboring bases is linked to lesion formation at the focal CpG and subsequent long-patch repair. Our findings indicate that cytosine methylation has a broader mutational footprint than is commonly assumed.
Collapse
|
98
|
Woerner AE, Veeramah KR, Watkins JC, Hammer MF. The Role of Phylogenetically Conserved Elements in Shaping Patterns of Human Genomic Diversity. Mol Biol Evol 2020; 35:2284-2295. [PMID: 30113695 DOI: 10.1093/molbev/msy145] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Evolutionary genetic studies have shown a positive correlation between levels of nucleotide diversity and either rates of recombination or genetic distance to genes. Both positive-directional and purifying selection have been offered as the source of these correlations via genetic hitchhiking and background selection, respectively. Phylogenetically conserved elements (CEs) are short (∼100 bp), widely distributed (comprising ∼5% of genome), sequences that are often found far from genes. While the function of many CEs is unknown, CEs also are associated with reduced diversity at linked sites. Using high coverage (>80×) whole genome data from two human populations, the Yoruba and the CEU, we perform fine scale evaluations of diversity, rates of recombination, and linkage to genes. We find that the local rate of recombination has a stronger effect on levels of diversity than linkage to genes, and that these effects of recombination persist even in regions far from genes. Our whole genome modeling demonstrates that, rather than recombination or GC-biased gene conversion, selection on sites within or linked to CEs better explains the observed genomic diversity patterns. A major implication is that very few sites in the human genome are predicted to be free of the effects of selection. These sites, which we refer to as the human "neutralome," comprise only 1.2% of the autosomes and 5.1% of the X chromosome. Demographic analysis of the neutralome reveals larger population sizes and lower rates of growth for ancestral human populations than inferred by previous analyses.
Collapse
Affiliation(s)
- August E Woerner
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ.,Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX
| | - Krishna R Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY
| | | | - Michael F Hammer
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ
| |
Collapse
|
99
|
Mohanty AK, Vuzman D, Francioli L, Cassa C, Toth-Petroczy A, Sunyaev S. novoCaller: a Bayesian network approach for de novo variant calling from pedigree and population sequence data. Bioinformatics 2020; 35:1174-1180. [PMID: 30169785 PMCID: PMC6449753 DOI: 10.1093/bioinformatics/bty749] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 06/19/2018] [Accepted: 08/29/2018] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION De novo mutations (i.e. newly occurring mutations) are a pre-dominant cause of sporadic dominant monogenic diseases and play a significant role in the genetics of complex disorders. De novo mutation studies also inform population genetics models and shed light on the biology of DNA replication and repair. Despite the broad interest, there is room for improvement with regard to the accuracy of de novo mutation calling. RESULTS We designed novoCaller, a Bayesian variant calling algorithm that uses information from read-level data both in the pedigree and in unrelated samples. The method was extensively tested using large trio-sequencing studies, and it consistently achieved over 97% sensitivity. We applied the algorithm to 48 trio cases of suspected rare Mendelian disorders as part of the Brigham Genomic Medicine gene discovery initiative. Its application resulted in a significant reduction in the resources required for manual inspection and experimental validation of the calls. Three de novo variants were found in known genes associated with rare disorders, leading to rapid genetic diagnosis of the probands. Another 14 variants were found in genes that are likely to explain the phenotype, and could lead to novel disease-gene discovery. AVAILABILITY AND IMPLEMENTATION Source code implemented in C++ and Python can be downloaded from https://github.com/bgm-cwg/novoCaller. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anwoy Kumar Mohanty
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Dana Vuzman
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Laurent Francioli
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Christopher Cassa
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | | | | | | | - Agnes Toth-Petroczy
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Shamil Sunyaev
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
100
|
Weghorn D, Balick DJ, Cassa C, Kosmicki JA, Daly MJ, Beier DR, Sunyaev SR. Applicability of the Mutation-Selection Balance Model to Population Genetics of Heterozygous Protein-Truncating Variants in Humans. Mol Biol Evol 2020; 36:1701-1710. [PMID: 31004148 DOI: 10.1093/molbev/msz092] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The fate of alleles in the human population is believed to be highly affected by the stochastic force of genetic drift. Estimation of the strength of natural selection in humans generally necessitates a careful modeling of drift including complex effects of the population history and structure. Protein-truncating variants (PTVs) are expected to evolve under strong purifying selection and to have a relatively high per-gene mutation rate. Thus, it is appealing to model the population genetics of PTVs under a simple deterministic mutation-selection balance, as has been proposed earlier (Cassa et al. 2017). Here, we investigated the limits of this approximation using both computer simulations and data-driven approaches. Our simulations rely on a model of demographic history estimated from 33,370 individual exomes of the Non-Finnish European subset of the ExAC data set (Lek et al. 2016). Additionally, we compared the African and European subset of the ExAC study and analyzed de novo PTVs. We show that the mutation-selection balance model is applicable to the majority of human genes, but not to genes under the weakest selection.
Collapse
Affiliation(s)
- Donate Weghorn
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA.,Centre for Genomic Regulation and Universitat Pompeu Fabra, Barcelona, Spain
| | - Daniel J Balick
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Christopher Cassa
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Jack A Kosmicki
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA
| | - Mark J Daly
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA
| | - David R Beier
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA.,Department of Pediatrics, University of Washington School of Medicine, Seattle, WA
| | - Shamil R Sunyaev
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| |
Collapse
|