1
|
Pérez EM, Torbay AG, López MS, de la Cámara RCM, Jiménez CR, Álvarez MÁM, Blanco JN, Gianelli C, Hijón CC, Granados EL, Pena RR, Del Pozo Mate Á, García-Morato MB. Genetics of inborn errors of immunity: Diagnostic strategies and new approaches to CNV detection. Eur J Clin Invest 2024; 54:e14191. [PMID: 38440843 DOI: 10.1111/eci.14191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/20/2024] [Accepted: 02/25/2024] [Indexed: 03/06/2024]
Abstract
BACKGROUND Genetic diagnosis of inborn errors of immunity (IEI) is complex due to the large number of genes involved and their molecular features. Missense variants have been reported as the most common cause of IEI. However, the frequency of copy number variants (CNVs) may be underestimated since their detection requires specific quantitative techniques. At this point, the use of Next Generation Sequencing (NGS) is acquiring relevance. METHODS In this article, we present our experience in the genetic diagnosis of IEI based on three diagnostic algorithms that allowed the detection of single nucleotide variants (SNVs) and CNVs. Following this approximation, 703 index cases were evaluated between 2014 and 2021. Sanger sequencing, MLPA, CGH array, breakpoint spanning PCR or a customized NGS-based multigene-targeted panel were performed. RESULTS A genetic diagnosis was reached in 142 of the 703 index cases (20%), 19 of them presented deletions as causal variants. Deletions were also detected in 5 affected relatives and 16 healthy carriers during the family studies. Additionally, we compile, characterize and present all the CNVs detected by our diagnostic algorithms, representing the largest cohort of deletions related to IEI to date. Furthermore, three bioinformatic tools (LACONv, XHMM, VarSeq™) based on NGS data were evaluated. VarSeq™ was the most sensitive and specific bioinformatic tool; detecting 21/23 (91%) deletions located in captured regions. CONCLUSION Based on our results, we propose a strategy to guide the molecular diagnosis that can be followed by expert and non-expert centres in the field of IEI.
Collapse
Affiliation(s)
| | | | - Mario Solis López
- Bioinformatics Section, Institute of Medical and Molecular Genetics (INGEMM)-IdiPAZ, La Paz University Hospital, UAM, Madrid, Spain
| | | | - Carmen Rodríguez Jiménez
- Metabolic Disease Section, Institute of Medical and Molecular Genetics (INGEMM)-IdiPAZ, La Paz University Hospital, Madrid, Spain
- Group of Dislipemias of genetic origin and metabolic diseases, IdiPAZ, La Paz University Hospital, Madrid, Spain
| | - Mari Ángeles Mori Álvarez
- Functional and Structural Genomics Section, Institute of Medical and Molecular Genetics (INGEMM)-IdiPAZ, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases, Madrid, Spain
| | - Julián Nevado Blanco
- Functional and Structural Genomics Section, Institute of Medical and Molecular Genetics (INGEMM)-IdiPAZ, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases, Madrid, Spain
| | - Carla Gianelli
- Immunology Department, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER U767), Madrid, Spain
- La Paz Institute of Biomedical Research, Madrid, Spain
| | | | - Eduardo López Granados
- Immunology Department, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER U767), Madrid, Spain
- La Paz Institute of Biomedical Research, Madrid, Spain
| | - Rebeca Rodríguez Pena
- Immunology Department, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER U767), Madrid, Spain
- La Paz Institute of Biomedical Research, Madrid, Spain
| | - Ángela Del Pozo Mate
- Bioinformatics Section, Institute of Medical and Molecular Genetics (INGEMM)-IdiPAZ, La Paz University Hospital, UAM, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases, Madrid, Spain
- ERN-ITHACA, La Paz University Hospital, Madrid, Spain
| | - María Bravo García-Morato
- Immunology Department, La Paz University Hospital, Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER U767), Madrid, Spain
- La Paz Institute of Biomedical Research, Madrid, Spain
| |
Collapse
|
2
|
Mishra S, Srivastava AK, Khan AW, Tran LSP, Nguyen HT. The era of panomics-driven gene discovery in plants. TRENDS IN PLANT SCIENCE 2024:S1360-1385(24)00063-3. [PMID: 38658292 DOI: 10.1016/j.tplants.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 03/01/2024] [Accepted: 03/08/2024] [Indexed: 04/26/2024]
Abstract
Panomics is an approach to integrate multiple 'omics' datasets, generated using different individuals or natural variations. Considering their diverse phenotypic spectrum, the phenome is inherently associated with panomics-based science, which is further combined with genomics, transcriptomics, metabolomics, and other omics techniques, either independently or collectively. Panomics has been accelerated through recent technological advancements in the field of genomics that enable the detection of population-wide structural variations (SVs) and hence offer unprecedented insights into the genetic variations contributing to important agronomic traits. The present review provides the recent trends of panomics-driven gene discovery toward various traits related to plant development, stress tolerance, accumulation of specialized metabolites, and domestication/dedomestication. In addition, the success stories are highlighted in the broader context of enhancing crop productivity.
Collapse
Affiliation(s)
- Shefali Mishra
- Nuclear Agriculture and Biotechnology Division, Bhabha Atomic Research Centre, Mumbai, Maharashtra 400085, India
| | - Ashish Kumar Srivastava
- Nuclear Agriculture and Biotechnology Division, Bhabha Atomic Research Centre, Mumbai, Maharashtra 400085, India; Homi Bhabha National Institute, Mumbai 400094, India.
| | - Aamir W Khan
- Division of Plant Science and Technology and National Center for Soybean Biotechnology, University of Missouri-Columbia, Columbia, MO 65211, USA
| | - Lam-Son Phan Tran
- Institute of Genomics for Crop Abiotic Stress Tolerance, Department of Plant and Soil Science, Texas Tech University, Lubbock, TX, USA
| | - Henry T Nguyen
- Division of Plant Science and Technology and National Center for Soybean Biotechnology, University of Missouri-Columbia, Columbia, MO 65211, USA.
| |
Collapse
|
3
|
Kumar S, Talluri S, Zhao J, Liao C, Potluri LB, Buon L, Mu S, Shi J, Chakraborty C, Tai YT, Samur MK, Munshi NC, Shammas MA. ABL1 kinase plays an important role in spontaneous and chemotherapy-induced genomic instability in multiple myeloma. Blood 2024; 143:996-1005. [PMID: 37992230 DOI: 10.1182/blood.2023021225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 11/02/2023] [Accepted: 11/03/2023] [Indexed: 11/24/2023] Open
Abstract
ABSTRACT Genomic instability contributes to cancer progression and is at least partly due to dysregulated homologous recombination (HR). Here, we show that an elevated level of ABL1 kinase overactivates the HR pathway and causes genomic instability in multiple myeloma (MM) cells. Inhibiting ABL1 with either short hairpin RNA or a pharmacological inhibitor (nilotinib) inhibits HR activity, reduces genomic instability, and slows MM cell growth. Moreover, inhibiting ABL1 reduces the HR activity and genomic instability caused by melphalan, a chemotherapeutic agent used in MM treatment, and increases melphalan's efficacy and cytotoxicity in vivo in a subcutaneous tumor model. In these tumors, nilotinib inhibits endogenous as well as melphalan-induced HR activity. These data demonstrate that inhibiting ABL1 using the clinically approved drug nilotinib reduces MM cell growth, reduces genomic instability in live cell fraction, increases the cytotoxicity of melphalan (and similar chemotherapeutic agents), and can potentially prevent or delay progression in patients with MM.
Collapse
Affiliation(s)
- Subodh Kumar
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Srikanth Talluri
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Jiangning Zhao
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Chengcheng Liao
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Lakshmi B Potluri
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Leutz Buon
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
| | - Shidai Mu
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| | - Jialan Shi
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Chandraditya Chakraborty
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
| | - Yu-Tzu Tai
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
| | - Mehmet K Samur
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
| | - Nikhil C Munshi
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Masood A Shammas
- The Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, Boston, MA
| |
Collapse
|
4
|
Song B, Buckler ES, Stitzer MC. New whole-genome alignment tools are needed for tapping into plant diversity. TRENDS IN PLANT SCIENCE 2024; 29:355-369. [PMID: 37749022 DOI: 10.1016/j.tplants.2023.08.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/19/2023] [Accepted: 08/23/2023] [Indexed: 09/27/2023]
Abstract
Genome alignment is one of the most foundational methods for genome sequence studies. With rapid advances in sequencing and assembly technologies, these newly assembled genomes present challenges for alignment tools to meet the increased complexity and scale. Plant genome alignment is technologically challenging because of frequent whole-genome duplications (WGDs) as well as chromosome rearrangements and fractionation, high nucleotide diversity, widespread structural variation, and high transposable element (TE) activity causing large proportions of repeat elements. We summarize classical pairwise and multiple genome alignment (MGA) methods, and highlight techniques that are widely used or are being developed by the plant research community. We also outline the remaining challenges for precise genome alignment and the interpretation of alignment results in plants.
Collapse
Affiliation(s)
- Baoxing Song
- National Key Laboratory of Wheat Improvement, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agriculture Sciences in Weifang, Weifang, Shandong 261325, China; Key Laboratory of Maize Biology and Genetic Breeding in Arid Area of Northwest Region of the Ministry of Agriculture, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China.
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853, USA; Agricultural Research Service, United States Department of Agriculture, Ithaca, NY 14853, USA
| | - Michelle C Stitzer
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
5
|
Gaitán N, Duitama J. A graph clustering algorithm for detection and genotyping of structural variants from long reads. Gigascience 2024; 13:giad112. [PMID: 38206589 PMCID: PMC10783151 DOI: 10.1093/gigascience/giad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 08/02/2023] [Accepted: 12/08/2023] [Indexed: 01/12/2024] Open
Abstract
BACKGROUND Structural variants (SVs) are genomic polymorphisms defined by their length (>50 bp). The usual types of SVs are deletions, insertions, translocations, inversions, and copy number variants. SV detection and genotyping is fundamental given the role of SVs in phenomena such as phenotypic variation and evolutionary events. Thus, methods to identify SVs using long-read sequencing data have been recently developed. FINDINGS We present an accurate and efficient algorithm to predict germline SVs from long-read sequencing data. The algorithm starts collecting evidence (signatures) of SVs from read alignments. Then, signatures are clustered based on a Euclidean graph with coordinates calculated from lengths and genomic positions. Clustering is performed by the DBSCAN algorithm, which provides the advantage of delimiting clusters with high resolution. Clusters are transformed into SVs and a Bayesian model allows to precisely genotype SVs based on their supporting evidence. This algorithm is integrated into the single sample variants detector of the Next Generation Sequencing Experience Platform, which facilitates the integration with other functionalities for genomics analysis. We performed multiple benchmark experiments, including simulation and real data, representing different genome profiles, sequencing technologies (PacBio HiFi, ONT), and read depths. CONCLUSION The results show that our approach outperformed state-of-the-art tools on germline SV calling and genotyping, especially at low depths, and in error-prone repetitive regions. We believe this work significantly contributes to the development of bioinformatic strategies to maximize the use of long-read sequencing technologies.
Collapse
Affiliation(s)
- Nicolás Gaitán
- Systems and Computing Engineering Department, Universidad de Los Andes, Bogotá 111711, Colombia
| | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de Los Andes, Bogotá 111711, Colombia
| |
Collapse
|
6
|
Choo ZN, Behr JM, Deshpande A, Hadi K, Yao X, Tian H, Takai K, Zakusilo G, Rosiene J, Da Cruz Paula A, Weigelt B, Setton J, Riaz N, Powell SN, Busam K, Shoushtari AN, Ariyan C, Reis-Filho J, de Lange T, Imieliński M. Most large structural variants in cancer genomes can be detected without long reads. Nat Genet 2023; 55:2139-2148. [PMID: 37945902 PMCID: PMC10703688 DOI: 10.1038/s41588-023-01540-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 09/19/2023] [Indexed: 11/12/2023]
Abstract
Short-read sequencing is the workhorse of cancer genomics yet is thought to miss many structural variants (SVs), particularly large chromosomal alterations. To characterize missing SVs in short-read whole genomes, we analyzed 'loose ends'-local violations of mass balance between adjacent DNA segments. In the landscape of loose ends across 1,330 high-purity cancer whole genomes, most large (>10-kb) clonal SVs were fully resolved by short reads in the 87% of the human genome where copy number could be reliably measured. Some loose ends represent neotelomeres, which we propose as a hallmark of the alternative lengthening of telomeres phenotype. These pan-cancer findings were confirmed by long-molecule profiles of 38 breast cancer and melanoma cases. Our results indicate that aberrant homologous recombination is unlikely to drive the majority of large cancer SVs. Furthermore, analysis of mass balance in short-read whole genome data provides a surprisingly complete picture of cancer chromosomal structure.
Collapse
Affiliation(s)
- Zi-Ning Choo
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Tri-institutional MD PhD Program, Weill Cornell Medicine, New York, NY, USA
- Physiology and Biophysics PhD Program, Weill Cornell Medicine, New York, NY, USA
| | - Julie M Behr
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Tri-institutional PhD Program in Computational Biology and Medicine, New York, NY, USA
| | - Aditya Deshpande
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Tri-institutional PhD Program in Computational Biology and Medicine, New York, NY, USA
| | - Kevin Hadi
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Physiology and Biophysics PhD Program, Weill Cornell Medicine, New York, NY, USA
| | - Xiaotong Yao
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Tri-institutional PhD Program in Computational Biology and Medicine, New York, NY, USA
| | - Huasong Tian
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Grossman School of Medicine, New York, NY, USA
| | - Kaori Takai
- Laboratory of Cell Biology and Genetics, Rockefeller University, New York, NY, USA
| | - George Zakusilo
- Laboratory of Cell Biology and Genetics, Rockefeller University, New York, NY, USA
| | - Joel Rosiene
- New York Genome Center, New York, NY, USA
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
| | | | - Britta Weigelt
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Jeremy Setton
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Nadeem Riaz
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Simon N Powell
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Klaus Busam
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | | | | | - Titia de Lange
- Laboratory of Cell Biology and Genetics, Rockefeller University, New York, NY, USA
| | - Marcin Imieliński
- New York Genome Center, New York, NY, USA.
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA.
- Perlmutter Cancer Center, NYU Grossman School of Medicine, New York, NY, USA.
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA.
| |
Collapse
|
7
|
Tayebi N, Lichtenberg J, Hertz E, Sidransky E. Is Gauchian genotyping of GBA1 variants reliable? MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.26.23297627. [PMID: 37986861 PMCID: PMC10659459 DOI: 10.1101/2023.10.26.23297627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
Biallelic mutations in GBA1 result in Gaucher disease (GD), the inherited deficiency of glucocerebrosidase. Variants in GBA1 are also a common genetic risk factor for Parkinson disease (PD). Currently, some PD centers screen for mutant GBA1 alleles to stratify patients who may ultimately benefit from GBA1-targeted therapeutics. However, accurately detecting variants, especially recombinant alleles resulting from a crossover between GBA1 and its pseudogene, is challenging, impacting studies of both GD and GBA1-associated parkinsonism. Recently, the software tool Gauchian was introduced to identify GBA1 variants from whole genome sequencing. We evaluated Gauchian in 90 Sanger-sequenced patients with GD and five GBA1 heterozygotes. While Gauchian genotyped most patients correctly, it missed some rare or de novo mutations due to its limited internal database and over-reliance on intergenic structural variants. This resulted in misreported homozygosity, incomplete genotypes, and undetected recombination events, limiting Gauchian's utility in variant screening and precluding its use in diagnostics.
Collapse
Affiliation(s)
- Nahid Tayebi
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, United States
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815
| | - Jens Lichtenberg
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, United States
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815
| | - Ellen Hertz
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, United States
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815
| | - Ellen Sidransky
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, United States
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815
| |
Collapse
|
8
|
Laufer VA, Glover TW, Wilson TE. Applications of advanced technologies for detecting genomic structural variation. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]
Abstract
Chromosomal structural variation (SV) encompasses a heterogenous class of genetic variants that exerts strong influences on human health and disease. Despite their importance, many structural variants (SVs) have remained poorly characterized at even a basic level, a discrepancy predicated upon the technical limitations of prior genomic assays. However, recent advances in genomic technology can identify and localize SVs accurately, opening new questions regarding SV risk factors and their impacts in humans. Here, we first define and classify human SVs and their generative mechanisms, highlighting characteristics leveraged by various SV assays. We next examine the first-ever gapless assembly of the human genome and the technical process of assembling it, which required third-generation sequencing technologies to resolve structurally complex loci. The new portions of that "telomere-to-telomere" and subsequent pangenome assemblies highlight aspects of SV biology likely to develop in the near-term. We consider the strengths and limitations of the most promising new SV technologies and when they or longstanding approaches are best suited to meeting salient goals in the study of human SV in population-scale genomics research, clinical, and public health contexts. It is a watershed time in our understanding of human SV when new approaches are expected to fundamentally change genomic applications.
Collapse
Affiliation(s)
- Vincent A Laufer
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas W Glover
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas E Wilson
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| |
Collapse
|
9
|
Saito-Adachi M, Hama N, Totoki Y, Nakamura H, Arai Y, Hosoda F, Rokutan H, Yachida S, Kato M, Fukagawa A, Shibata T. Oncogenic structural aberration landscape in gastric cancer genomes. Nat Commun 2023; 14:3688. [PMID: 37349325 PMCID: PMC10287692 DOI: 10.1038/s41467-023-39263-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 06/05/2023] [Indexed: 06/24/2023] Open
Abstract
Structural variants (SVs) are responsible for driver events in gastric cancer (GC); however, their patterns and processes remain poorly understood. Here, we examine 170 GC whole genomes to unravel the oncogenic structural aberration landscape in GC genomes and identify six rearrangement signatures (RSs). Non-random combinations of RSs elucidate distinctive GC subtypes comprising one or a few dominant RS that are associated with specific driver events (BRCA1/2 defects, mismatch repair deficiency, and TP53 mutation) and epidemiological backgrounds. Twenty-seven SV hotspots are identified as GC driver candidates. SV hotspots frequently constitute complexly clustered SVs involved in driver gene amplification, such as ERBB2, CCNE1, and FGFR2. Further deconstruction of the locally clustered SVs uncovers amplicon-generating profiles characterized by super-large SVs and intensive segmental amplifications, contributing to the extensive amplification of GC oncogenes. Comprehensive analyses using adjusted SV allele frequencies indicate the significant involvement of extra-chromosomal DNA in processes linked to specific RSs.
Collapse
Affiliation(s)
- Mihoko Saito-Adachi
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Natsuko Hama
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Yasushi Totoki
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
- Department of Cancer Genome Informatics, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Hiromi Nakamura
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Yasuhito Arai
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Fumie Hosoda
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Hirofumi Rokutan
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
- Department of Pathology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Shinichi Yachida
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
- Department of Cancer Genome Informatics, Graduate School of Medicine, Osaka University, Osaka, Japan
| | - Mamoru Kato
- Division of Bioinformatics, National Cancer Center Research Institute, Tokyo, Japan
| | - Akihiko Fukagawa
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Tatsuhiro Shibata
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan.
- Laboratory of Molecular Medicine, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
10
|
Shi J, Tian Z, Lai J, Huang X. Plant pan-genomics and its applications. MOLECULAR PLANT 2023; 16:168-186. [PMID: 36523157 DOI: 10.1016/j.molp.2022.12.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 12/07/2022] [Accepted: 12/12/2022] [Indexed: 06/17/2023]
Abstract
Plant genomes are so highly diverse that a substantial proportion of genomic sequences are not shared among individuals. The variable DNA sequences, along with the conserved core sequences, compose the more sophisticated pan-genome that represents the collection of all non-redundant DNA in a species. With rapid progress in genome sequencing technologies, pan-genome research in plants is now accelerating. Here we review recent advances in plant pan-genomics, including major driving forces of structural variations that constitute the variable sequences, methodological innovations for representing the pan-genome, and major successes in constructing plant pan-genomes. We also summarize recent efforts toward decoding the remaining dark matter in telomere-to-telomere or gapless plant genomes. These new genome resources, which have remarkable advantages over numerous previously assembled less-than-perfect genomes, are expected to become new references for genetic studies and plant breeding.
Collapse
Affiliation(s)
- Junpeng Shi
- State Key Laboratory of Biocontrol, School of Agriculture, Sun Yat-sen University, Shenzhen 518107, China.
| | - Zhixi Tian
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Jinsheng Lai
- State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, China
| | - Xuehui Huang
- Shanghai Key Laboratory of Plant Molecular Sciences, College of Life Sciences, Shanghai Normal University, Shanghai 200234, China.
| |
Collapse
|
11
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Collapse
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Rory J. Craig
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom;,California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Eniolaye J. Balogun
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada;,Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
| | - Peter D. Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| |
Collapse
|
12
|
A Novel Frameshift CHD4 Variant Leading to Sifrim-Hitz-Weiss Syndrome in a Proband with a Subclinical Familial t(17;19) and a Large dup(2)(q14.3q21.1). Biomedicines 2022; 11:biomedicines11010012. [PMID: 36672520 PMCID: PMC9855399 DOI: 10.3390/biomedicines11010012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/14/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
The genetic complexity of neurodevelopmental disorders (NDD), combined with a heterogeneous clinical presentation, makes accurate assessment of their molecular bases and pathogenic mechanisms challenging. Our purpose is to reveal the pathogenic variant underlying a complex NDD through identification of the "full" spectrum of structural genomic and genetic variants. Therefore, clinical phenotyping and identification of variants by genome and exome sequencing, together with comprehensive assessment of these and affected candidate genes, were carried out. A maternally-inherited familial translocation [t(17;19)(p13.1;p13.3)mat] disrupting the GSG1 like 2 gene (GSG1L2), a 3.2 Mb dup(2)(q14.3q21.1) encompassing the autosomal dominant OMIM phenotype-associated PROC and HS6ST1 gene, and a novel frameshift c.4442del, p.(Gly1481Valfs*21) variant within exon 30 of the Chromodomain helicase DNA binding protein 4 (CHD4) have been identified. Considering the pathogenic potential of each variant and the proband's phenotype, we conclude that this case basically fits the Sifrim-Hitz-Weiss syndrome or CHD4-associated neurodevelopmental phenotype. Finally, our data highlight the need for identification of the "full" spectrum of structural genomic and genetic variants and of reverse comparative phenotyping, including unrelated patients with variants in same genes, for improved genomic healthcare of patients with NDD.
Collapse
|
13
|
Xie HB, Yan C, Adeola AC, Wang K, Huang CP, Xu MM, Qiu Q, Yin X, Fan CY, Ma YF, Yin TT, Gao Y, Deng JK, Okeyoyin AO, Oluwole OO, Omotosho O, Okoro VMO, Omitogun OG, Dawuda PM, Olaogun SC, Nneji LM, Ayoola AO, Sanke OJ, Luka PD, Okoth E, Lekolool I, Mijele D, Bishop RP, Han J, Wang W, Peng MS, Zhang YP. African Suid Genomes Provide Insights into the Local Adaptation to Diverse African Environments. Mol Biol Evol 2022; 39:6840307. [PMID: 36413509 PMCID: PMC9733430 DOI: 10.1093/molbev/msac256] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 10/21/2022] [Accepted: 11/17/2022] [Indexed: 11/23/2022] Open
Abstract
African wild suids consist of several endemic species that represent ancient members of the family Suidae and have colonized diverse habitats on the African continent. However, limited genomic resources for African wild suids hinder our understanding of their evolution and genetic diversity. In this study, we assembled high-quality genomes of a common warthog (Phacochoerus africanus), a red river hog (Potamochoerus porcus), as well as an East Asian Diannan small-ear pig (Sus scrofa). Phylogenetic analysis showed that common warthog and red river hog diverged from their common ancestor around the Miocene/Pliocene boundary, putatively predating their entry into Africa. We detected species-specific selective signals associated with sensory perception and interferon signaling pathways in common warthog and red river hog, respectively, which contributed to their local adaptation to savannah and tropical rainforest environments, respectively. The structural variation and evolving signals in genes involved in T-cell immunity, viral infection, and lymphoid development were identified in their ancestral lineage. Our results provide new insights into the evolutionary histories and divergent genetic adaptations of African suids.
Collapse
Affiliation(s)
| | | | | | | | | | - Ming-Min Xu
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Qiang Qiu
- School of Ecology and Environment, Northwestern Polytechnical University, Xi’an 710129, China
| | - Xue Yin
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Chen-Yu Fan
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Yun-Fei Ma
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Ting-Ting Yin
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Yun Gao
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Jia-Kun Deng
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Agboola O Okeyoyin
- National Park Service Headquarter, Federal Capital Territory, Abuja 900108, Nigeria
| | - Olufunke O Oluwole
- Institute of Agricultural Research and Training, Obafemi Awolowo University, Ibadan, Nigeria
| | - Oladipo Omotosho
- Department of Veterinary Medicine, University of Ibadan, Ibadan 200005, Nigeria
| | - Victor M O Okoro
- Department of Animal Science and Technology, School of Agriculture and Agricultural Technology, Federal University of Technology, Owerri 460114, Nigeria
| | - Ofelia G Omitogun
- Department of Animal Sciences, Obafemi Awolowo University, Ile-Ife 220282, Nigeria
| | - Philip M Dawuda
- Department of Veterinary Surgery and Theriogenology, College of Veterinary Medicine, University of Agriculture Makurdi, Makurdi 970001, Nigeria
| | - Sunday C Olaogun
- Department of Veterinary Medicine, University of Ibadan, Ibadan 200005, Nigeria
| | - Lotanna M Nneji
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Kunming 650204, China
| | - Adeola O Ayoola
- State Key Laboratory of Genetic Resources and Evolution & Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Kunming 650204, China
| | - Oscar J Sanke
- Taraba State Ministry of Agriculture and Natural Resources, Jalingo 660213, Nigeria
| | - Pam D Luka
- National Veterinary Research Institute, Vom 930103, Nigeria
| | - Edward Okoth
- International Livestock Research Institute (ILRI), Nairobi 00100, Kenya
| | | | | | - Richard P Bishop
- International Livestock Research Institute (ILRI), Nairobi 00100, Kenya
| | | | - Wen Wang
- Corresponding authors: E-mails: ; ; ;
| | | | | |
Collapse
|
14
|
Zattera ML, Bruschi DP. Transposable Elements as a Source of Novel Repetitive DNA in the Eukaryote Genome. Cells 2022; 11:cells11213373. [PMID: 36359770 PMCID: PMC9659126 DOI: 10.3390/cells11213373] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/23/2022] [Accepted: 09/26/2022] [Indexed: 12/02/2022] Open
Abstract
The impact of transposable elements (TEs) on the evolution of the eukaryote genome has been observed in a number of biological processes, such as the recruitment of the host’s gene expression network or the rearrangement of genome structure. However, TEs may also provide a substrate for the emergence of novel repetitive elements, which contribute to the generation of new genomic components during the course of the evolutionary process. In this review, we examine published descriptions of TEs that give rise to tandem sequences in an attempt to comprehend the relationship between TEs and the emergence of de novo satellite DNA families in eukaryotic organisms. We evaluated the intragenomic behavior of the TEs, the role of their molecular structure, and the chromosomal distribution of the paralogous copies that generate arrays of repeats as a substrate for the emergence of new repetitive elements in the genome. We highlight the involvement and importance of TEs in the eukaryote genome and its remodeling processes.
Collapse
Affiliation(s)
- Michelle Louise Zattera
- Departamento de Genética, Programa de Pós-Graduação em Genética, Setor de Ciências Biológicas, Universidade Federal do Paraná, Curitiba 81530-000, PR, Brazil
| | - Daniel Pacheco Bruschi
- Departamento de Genética, Laboratorio de Citogenética Evolutiva e Conservação Animal, Setor de Ciências Biológicas, Universidade Federal do Paraná, Curitiba 81530-000, PR, Brazil
- Correspondence:
| |
Collapse
|
15
|
Pascarella G, Hon CC, Hashimoto K, Busch A, Luginbühl J, Parr C, Hin Yip W, Abe K, Kratz A, Bonetti A, Agostini F, Severin J, Murayama S, Suzuki Y, Gustincich S, Frith M, Carninci P. Recombination of repeat elements generates somatic complexity in human genomes. Cell 2022; 185:3025-3040.e6. [DOI: 10.1016/j.cell.2022.06.032] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 03/30/2022] [Accepted: 06/16/2022] [Indexed: 12/14/2022]
|
16
|
Batista RL, Mendonca BB. The Molecular Basis of 5α-Reductase Type 2 Deficiency. Sex Dev 2022; 16:171-183. [PMID: 35793650 DOI: 10.1159/000525119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 05/13/2022] [Indexed: 11/19/2022] Open
Abstract
The 5α-reductase type 2 enzyme catalyzes the conversion of testosterone into dihydrotestosterone, playing a crucial role in male development. This enzyme is encoded by the SRD5A2 gene, which maps to chromosome 2 (2p23), consists of 5 exons and 4 introns, and encodes a 254 amino acid protein. Disruptions in this gene are the molecular etiology of a subgroup of differences of sex development (DSD) in 46,XY patients. Affected individuals present a large range of external genitalia undervirilization, ranging from almost typically female external genitalia to predominantly typically male external genitalia with minimal undervirilization, including isolated micropenis. This is an updated review of the implication of the SRD5A2 gene in 5α-reductase type 2 enzyme deficiency. For that, we identified 451 cases from 48 countries of this particular 46,XY DSD from the literature with reported variants in the SRD5A2 gene. Herein, we present the SRD5A2 mutational profile, the SRD5A2 polymorphisms, and the functional studies related to SRD5A2 variants to detail the molecular etiology of this condition.
Collapse
Affiliation(s)
- Rafael L Batista
- Unidade de Endocrinologia do Desenvolvimento, Laboratório de Hormônios e Genética Molecular/LIM42, Hospital das Clínicas, Disciplina de Endocrinologia, do Departamento de Clínica Médica, Faculdade de Medicina da Universidade de São Paulo, São Paulo, Brazil.,Endocrine Oncology Unit, Instituto do Câncer do Estado de São Paulo, ICESP, Faculdade de Medicina da Universidade de São Paulo, São Paulo, Brazil
| | - Berenice B Mendonca
- Unidade de Endocrinologia do Desenvolvimento, Laboratório de Hormônios e Genética Molecular/LIM42, Hospital das Clínicas, Disciplina de Endocrinologia, do Departamento de Clínica Médica, Faculdade de Medicina da Universidade de São Paulo, São Paulo, Brazil
| |
Collapse
|
17
|
Repeated translocation of a supergene underlying rapid sex chromosome turnover in Takifugu pufferfish. Proc Natl Acad Sci U S A 2022; 119:e2121469119. [PMID: 35658077 DOI: 10.1073/pnas.2121469119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
SignificanceAlthough turnover of sex chromosomes is very common in many vertebrate lineages, the transition process is still elusive. We studied the sex-determining region (SDR) of 12 congeneric fish species of Takifugu fish that compose an important model for the study of genomics and sex determination. We found that while nine species retained their ancestral SDR, three species had acquired derived SDRs. Although the derived SDRs resided in three different chromosomes, they harbored a shared supergene flanked by two putative transposable elements. The results highlight the underestimated role of a mobile supergene in turnover of sex chromosomes in vertebrates.
Collapse
|
18
|
Vicari MR, Bruschi DP, Cabral-de-Mello DC, Nogaroto V. Telomere organization and the interstitial telomeric sites involvement in insects and vertebrates chromosome evolution. Genet Mol Biol 2022; 45:e20220071. [DOI: 10.1590/1678-4685-gmb-2022-0071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 09/24/2022] [Indexed: 11/16/2022] Open
|
19
|
Yanovsky-Dagan S, Frumkin A, Lupski JR, Harel T. CRISPR/Cas9-induced gene conversion between ATAD3 paralogs. HGG ADVANCES 2022; 3:100092. [PMID: 35199044 PMCID: PMC8844715 DOI: 10.1016/j.xhgg.2022.100092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 01/19/2022] [Indexed: 11/23/2022] Open
Abstract
Paralogs and pseudogenes are abundant within the human genome, and can mediate non-allelic homologous recombination (NAHR) or gene conversion events. The ATAD3 locus contains three paralogs situated in tandem, and is therefore prone to NAHR-mediated deletions and duplications associated with severe neurological phenotypes. To study this locus further, we aimed to generate biallelic loss-of-function variants in ATAD3A by CRISPR/Cas9 genome editing. Unexpectedly, two of the generated clones underwent gene conversion, as evidenced by replacement of the targeted sequence of ATAD3A by a donor sequence from its paralog ATAD3B. We highlight the complexity of CRISPR/Cas9 design, end-product formation, and recombination repair mechanisms for CRISPR/Cas9 delivery as a nucleic acid molecular therapy when targeting genes that have paralogs or pseudogenes, and advocate meticulous evaluation of resultant clones in model organisms. In addition, we suggest that endogenous gene conversion may be used to repair missense variants in genes with paralogs or pseudogenes.
Collapse
Affiliation(s)
| | - Ayala Frumkin
- Department of Genetics, Hadassah Medical Organization, Jerusalem, Israel
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Hospital, Houston, TX, USA
| | - Tamar Harel
- Department of Genetics, Hadassah Medical Organization, Jerusalem, Israel
- Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
- Corresponding author
| |
Collapse
|
20
|
Long-read technologies identify a hidden inverted duplication in a family with choroideremia. HGG ADVANCES 2021; 2:100046. [PMID: 35047838 PMCID: PMC8756506 DOI: 10.1016/j.xhgg.2021.100046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 07/01/2021] [Indexed: 12/03/2022] Open
Abstract
The lack of molecular diagnoses in rare genetic diseases can be explained by limitations of current standard genomic technologies. Upcoming long-read techniques have complementary strengths to overcome these limitations, with a particular strength in identifying structural variants. By using optical genome mapping and long-read sequencing, we aimed to identify the pathogenic variant in a large family with X-linked choroideremia. In this family, aberrant splicing of exon 12 of the choroideremia gene CHM was detected in 2003, but the underlying genomic defect remained elusive. Optical genome mapping and long-read sequencing approaches now revealed an intragenic 1,752 bp inverted duplication including exon 12 and surrounding regions, located downstream of the wild-type copy of exon 12. Both breakpoint junctions were confirmed with Sanger sequencing and segregate with the X-linked inheritance in the family. The breakpoint junctions displayed sequence microhomology suggestive for an erroneous replication mechanism as the origin of the structural variant. The inverted duplication is predicted to result in a hairpin formation of the pre-mRNA with the wild-type exon 12, leading to exon skipping in the mature mRNA. The identified inverted duplication is deemed the hidden pathogenic cause of disease in this family. Our study shows that optical genome mapping and long-read sequencing have significant potential for the identification of (hidden) structural variants in rare genetic diseases.
Collapse
|
21
|
Talavera-López C, Messenger LA, Lewis MD, Yeo M, Reis-Cunha JL, Matos GM, Bartholomeu DC, Calzada JE, Saldaña A, Ramírez JD, Guhl F, Ocaña-Mayorga S, Costales JA, Gorchakov R, Jones K, Nolan MS, Teixeira SMR, Carrasco HJ, Bottazzi ME, Hotez PJ, Murray KO, Grijalva MJ, Burleigh B, Grisard EC, Miles MA, Andersson B. Repeat-Driven Generation of Antigenic Diversity in a Major Human Pathogen, Trypanosoma cruzi. Front Cell Infect Microbiol 2021; 11:614665. [PMID: 33747978 PMCID: PMC7966520 DOI: 10.3389/fcimb.2021.614665] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 01/22/2021] [Indexed: 12/18/2022] Open
Abstract
Trypanosoma cruzi, a zoonotic kinetoplastid protozoan parasite, is the causative agent of American trypanosomiasis (Chagas disease). Having a very plastic, repetitive and complex genome, the parasite displays a highly diverse repertoire of surface molecules, with pivotal roles in cell invasion, immune evasion and pathogenesis. Before 2016, the complexity of the genomic regions containing these genes impaired the assembly of a genome at chromosomal level, making it impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here describe the genome assembly of the Sylvio X10/1 genome sequence, which since 2016 has been used as a reference genome sequence for T. cruzi clade I (TcI), produced using high coverage PacBio single-molecule sequencing. It was used to analyze deep Illumina sequence data from 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene distribution showed the unusual duality in the organization of the parasite genome, a synteny of the core genomic region with related protozoa flanked by unique and highly plastic multigene family clusters encoding surface antigens. The presence of abundant interspersed retrotransposons in these multigene family clusters suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response on these TcI strains. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.
Collapse
Affiliation(s)
- Carlos Talavera-López
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
- European Bioinformatics Institute, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Louisa A. Messenger
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Michael D. Lewis
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Matthew Yeo
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - João Luís Reis-Cunha
- Departamento de Parasitologia, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Gabriel Machado Matos
- Departamento de Biologia Celular, Embriologia e Genética, Universidade Federal Santa Catarina, Florianópolis, Brazil
| | | | - José E. Calzada
- Departamento de Parasitología, Instituto Conmemorativo Gorgas de Estudios de la Salud, Ciudad de Panamá, Panama
| | - Azael Saldaña
- Departamento de Parasitología, Instituto Conmemorativo Gorgas de Estudios de la Salud, Ciudad de Panamá, Panama
| | - Juan David Ramírez
- Grupo de Investigaciones Microbiológicas-UR (GIMUR), Departamento de Biología, Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Felipe Guhl
- Grupo de Investigaciones en Microbiología y Parasitología Tropical (CIMPAT), Tropical Parasitology Research Center, Universidad de Los Andes, Bogotá, Colombia
| | - Sofía Ocaña-Mayorga
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
| | - Jaime A. Costales
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
| | - Rodion Gorchakov
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Kathryn Jones
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Melissa S. Nolan
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Santuza M. R. Teixeira
- Departamento de Bioquímica e Imunologia, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Hernán José Carrasco
- Laboratorio de Biología Molecular de Protozoarios, Instituto de Medicina Tropical, Facultad de Medicina, Universidad Central de Venezuela, Caracas, Venezuela
| | - Maria Elena Bottazzi
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Peter J. Hotez
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Kristy O. Murray
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Mario J. Grijalva
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
- Department of Biomedical Sciences, Heritage College of Osteopathic Medicine, Infectious and Tropical Disease Institute, Ohio University, Athens, OH, United States
| | - Barbara Burleigh
- Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States
| | - Edmundo C. Grisard
- Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal Santa Catarina, Florianópolis, Brazil
| | - Michael A. Miles
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Björn Andersson
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
22
|
van den Akker J, Hon L, Ondov A, Mahkovec Z, O'Connor R, Chan RC, Lock J, Zimmer AD, Rostamianfar A, Ginsberg J, Leon A, Topper S. Intronic Breakpoint Signatures Enhance Detection and Characterization of Clinically Relevant Germline Structural Variants. J Mol Diagn 2021; 23:612-629. [PMID: 33621668 DOI: 10.1016/j.jmoldx.2021.01.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 12/14/2020] [Accepted: 01/27/2021] [Indexed: 12/16/2022] Open
Abstract
The relevance of large copy number variants (CNVs) to hereditary disorders has been long recognized, and population sequencing efforts have chronicled many common structural variants (SVs). However, limited data are available on the clinical contribution of rare germline SVs. Here, a detailed characterization of SVs identified using targeted next-generation sequencing was performed. Across 50 genes associated with hereditary cancer and cardiovascular disorders, a minimum of 828 unique SVs were reported, including 584 fully characterized SVs. Almost 40% of CNVs were <5 kb, with one in three deletions impacting a single exon. Additionally, 36 mid-range deletions/duplications (50 to 250 bp), 21 mobile element insertions, 6 inversions, and 27 complex rearrangements were detected. This data set was used to model SV detection in a bioinformatics pipeline solely relying on read depth, which revealed that genome sequencing (30×) allows detection of 71%, a 500× panel only targeting coding regions 53%, and exome sequencing (100×) <20% of characterized SVs. SVs accounted for 14.1% of all unique pathogenic variants, supporting the importance of SVs in hereditary disorders. Robust SV detection requires an ensemble of variant-calling algorithms that utilize sequencing of intronic regions. These algorithms should use distinct data features representative of each class of mutational mechanism, including recombination between two sequences sharing high similarity, covariants inserted between CNV breakpoints, and complex rearrangements containing inverted sequences.
Collapse
|
23
|
Han T, Jing X, Bao J, Zhao L, Zhang A, Miao R, Guo H, Zhou B, Zhang S, Sun J, Shi J. H. pylori infection alters repair of DNA double-strand breaks via SNHG17. J Clin Invest 2021; 130:3901-3918. [PMID: 32538894 DOI: 10.1172/jci125581] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 04/14/2020] [Indexed: 12/15/2022] Open
Abstract
Chronic infections can lead to carcinogenesis through inflammation-related mechanisms. Chronic infection of the human gastric mucosa with Helicobacter pylori is a well-known risk factor for gastric cancer. However, the mechanisms underlying H. pylori-induced gastric carcinogenesis are incompletely defined. We aimed to screen and clarify the functions of long noncoding RNAs (lncRNAs) that are differentially expressed in H. pylori-related gastric cancer. We found that lncRNA SNHG17 was upregulated by H. pylori infection and markedly increased the levels of double-strand breaks (DSBs). SNHG17 overexpression correlated with poor overall survival in patients with gastric cancer. The recruitment of NONO by overabundant nuclear SNHG17, along with the role of cytoplasmic SNHG17 as a decoy for miR-3909, which regulates Rad51 expression, shifted the DSB repair balance from homologous recombination toward nonhomologous end joining. Notably, during chronic H. pylori infection, SNHG17 knockdown inhibited chromosomal aberrations. Our findings suggest that spatially independent deregulation of the SNHG17/NONO and SNHG17/miR-3909/RING1/Rad51 pathways upon H. pylori infection promotes tumorigenesis in gastric cancer by altering the DNA repair system, which is critical for the maintenance of genomic stability. Upregulation of SNHG17 by H. pylori infection might be an undefined link between cancer and inflammation.
Collapse
Affiliation(s)
- Taotao Han
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xiaohui Jing
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jiayu Bao
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Lianmei Zhao
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Research Center, Fourth Affiliated Hospital of Hebei Medical University, Shijiazhuang, China
| | - Aidong Zhang
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Renling Miao
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hui Guo
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Baoguo Zhou
- Department of General Surgery, First Affiliated Hospital of Harbin Medical University, Heilongjiang, China
| | - Shang Zhang
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jiazeng Sun
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Juan Shi
- Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
24
|
Wang WJ, Li LY, Cui JW. Chromosome structural variation in tumorigenesis: mechanisms of formation and carcinogenesis. Epigenetics Chromatin 2020; 13:49. [PMID: 33168103 PMCID: PMC7654176 DOI: 10.1186/s13072-020-00371-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 10/29/2020] [Indexed: 12/23/2022] Open
Abstract
With the rapid development of next-generation sequencing technology, chromosome structural variation has gradually gained increased clinical significance in tumorigenesis. However, the molecular mechanism(s) underlying this structural variation remain poorly understood. A search of the literature shows that a three-dimensional chromatin state plays a vital role in inducing structural variation and in the gene expression profiles in tumorigenesis. Structural variants may result in changes in copy number or deletions of coding sequences, as well as the perturbation of structural chromatin features, especially topological domains, and disruption of interactions between genes and their regulatory elements. This review focuses recent work aiming at elucidating how structural variations develop and misregulate oncogenes and tumor suppressors, to provide general insights into tumor formation mechanisms and to provide potential targets for future anticancer therapies.
Collapse
Affiliation(s)
- Wen-Jun Wang
- Cancer Center, The First Hospital of Jilin University, Jilin University, Changchun, 130021, Jilin, China
| | - Ling-Yu Li
- Cancer Center, The First Hospital of Jilin University, Jilin University, Changchun, 130021, Jilin, China
| | - Jiu-Wei Cui
- Cancer Center, The First Hospital of Jilin University, Jilin University, Changchun, 130021, Jilin, China.
| |
Collapse
|
25
|
Zhuang X, Ye R, So MT, Lam WY, Karim A, Yu M, Ngo ND, Cherny SS, Tam PKH, Garcia-Barcelo MM, Tang CSM, Sham PC. A random forest-based framework for genotyping and accuracy assessment of copy number variations. NAR Genom Bioinform 2020; 2:lqaa071. [PMID: 33575619 PMCID: PMC7671382 DOI: 10.1093/nargab/lqaa071] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 08/18/2020] [Accepted: 08/26/2020] [Indexed: 12/24/2022] Open
Abstract
Detection of copy number variations (CNVs) is essential for uncovering genetic factors underlying human diseases. However, CNV detection by current methods is prone to error, and precisely identifying CNVs from paired-end whole genome sequencing (WGS) data is still challenging. Here, we present a framework, CNV-JACG, for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. CNV-JACG is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. Using the data from the 1000 Genomes Project, Genome in a Bottle Consortium, the Human Genome Structural Variation Consortium and in-house technical replicates, we show that CNV-JACG has superior sensitivity over the latest genotyping method, SV2, particularly for the small CNVs (≤1 kb). We also demonstrate that CNV-JACG outperforms SV2 in terms of Mendelian inconsistency in trios and concordance between technical replicates. Our study suggests that CNV-JACG would be a useful tool in assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs.
Collapse
Affiliation(s)
- Xuehan Zhuang
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Rui Ye
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Man-Ting So
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Wai-Yee Lam
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Anwarul Karim
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Michelle Yu
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Ngoc Diem Ngo
- National Hospital of Pediatrics, Ha Noi 100000, Vietnam
| | - Stacey S Cherny
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Paul Kwong-Hang Tam
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | | | - Clara Sze-Man Tang
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Pak Chung Sham
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| |
Collapse
|
26
|
Garimella KV, Iqbal Z, Krause MA, Campino S, Kekre M, Drury E, Kwiatkowski D, Sá JM, Wellems TE, McVean G. Detection of simple and complex de novo mutations with multiple reference sequences. Genome Res 2020; 30:1154-1169. [PMID: 32817236 PMCID: PMC7462078 DOI: 10.1101/gr.255505.119] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 07/17/2020] [Indexed: 12/25/2022]
Abstract
The characterization of de novo mutations in regions of high sequence and structural diversity from whole-genome sequencing data remains highly challenging. Complex structural variants tend to arise in regions of high repetitiveness and low complexity, challenging both de novo assembly, in which short reads do not capture the long-range context required for resolution, and mapping approaches, in which improper alignment of reads to a reference genome that is highly diverged from that of the sample can lead to false or partial calls. Long-read technologies can potentially solve such problems but are currently unfeasible to use at scale. Here we present Corticall, a graph-based method that combines the advantages of multiple technologies and prior data sources to detect arbitrary classes of genetic variant. We construct multisample, colored de Bruijn graphs from short-read data for all samples, align long-read–derived haplotypes and multiple reference data sources to restore graph connectivity information, and call variants using graph path-finding algorithms and a model for simultaneous alignment and recombination. We validate and evaluate the approach using extensive simulations and use it to characterize the rate and spectrum of de novo mutation events in 119 progeny from four Plasmodium falciparum experimental crosses, using long-read data on the parents to inform reconstructions of the progeny and to detect several known and novel nonallelic homologous recombination events.
Collapse
Affiliation(s)
- Kiran V Garimella
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA.,Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, Oxfordshire, OX3 7BN, United Kingdom.,Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, Oxfordshire, OX3 7LF, United Kingdom
| | - Zamin Iqbal
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, Oxfordshire, OX3 7BN, United Kingdom.,European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, United Kingdom
| | - Michael A Krause
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, Oxfordshire, OX3 7BN, United Kingdom.,The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.,Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Susana Campino
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom
| | - Mihir Kekre
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom
| | - Eleanor Drury
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom
| | - Dominic Kwiatkowski
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, Oxfordshire, OX3 7LF, United Kingdom.,The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom
| | - Juliana M Sá
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Thomas E Wellems
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Gil McVean
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, Oxfordshire, OX3 7BN, United Kingdom.,Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, Oxfordshire, OX3 7LF, United Kingdom
| |
Collapse
|
27
|
The Role of Alpha-Synuclein and Other Parkinson's Genes in Neurodevelopmental and Neurodegenerative Disorders. Int J Mol Sci 2020; 21:ijms21165724. [PMID: 32785033 PMCID: PMC7460874 DOI: 10.3390/ijms21165724] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 07/29/2020] [Accepted: 08/08/2020] [Indexed: 12/13/2022] Open
Abstract
Neurodevelopmental and late-onset neurodegenerative disorders present as separate entities that are clinically and neuropathologically quite distinct. However, recent evidence has highlighted surprising commonalities and converging features at the clinical, genomic, and molecular level between these two disease spectra. This is particularly striking in the context of autism spectrum disorder (ASD) and Parkinson's disease (PD). Genetic causes and risk factors play a central role in disease pathophysiology and enable the identification of overlapping mechanisms and pathways. Here, we focus on clinico-genetic studies of causal variants and overlapping clinical and cellular features of ASD and PD. Several genes and genomic regions were selected for our review, including SNCA (alpha-synuclein), PARK2 (parkin RBR E3 ubiquitin protein ligase), chromosome 22q11 deletion/DiGeorge region, and FMR1 (fragile X mental retardation 1) repeat expansion, which influence the development of both ASD and PD, with converging features related to synaptic function and neurogenesis. Both PD and ASD display alterations and impairments at the synaptic level, representing early and key disease phenotypes, which support the hypothesis of converging mechanisms between the two types of diseases. Therefore, understanding the underlying molecular mechanisms might inform on common targets and therapeutic approaches. We propose to re-conceptualize how we understand these disorders and provide a new angle into disease targets and mechanisms linking neurodevelopmental disorders and neurodegeneration.
Collapse
|
28
|
Roychowdhury T, Abyzov A. Chromatin organization modulates the origin of heritable structural variations in human genome. Nucleic Acids Res 2019; 47:2766-2777. [PMID: 30773596 PMCID: PMC6451188 DOI: 10.1093/nar/gkz103] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 02/06/2019] [Accepted: 02/14/2019] [Indexed: 12/11/2022] Open
Abstract
Structural variations (SVs) in the human genome originate from different mechanisms related to DNA repair, replication errors, and retrotransposition. Our analyses of 26 927 SVs from the 1000 Genomes Project revealed differential distributions and consequences of SVs of different origin, e.g. deletions from non-allelic homologous recombination (NAHR) are more prone to disrupt chromatin organization while processed pseudogenes can create accessible chromatin. Spontaneous double stranded breaks (DSBs) are the best predictor of enrichment of NAHR deletions in open chromatin. This evidence, along with strong physical interaction of NAHR breakpoints belonging to the same deletion suggests that majority of NAHR deletions are non-meiotic i.e. originate from errors during homology directed repair (HDR) of spontaneous DSBs. In turn, the origin of the spontaneous DSBs is associated with transcription factor binding in accessible chromatin revealing the vulnerability of functional, open chromatin. The chromatin itself is enriched with repeats, particularly fixed Alu elements that provide the homology required to maintain stability via HDR. Through co-localization of fixed Alus and NAHR deletions in open chromatin we hypothesize that old Alu expansion had a stabilizing role on the human genome.
Collapse
Affiliation(s)
- Tanmoy Roychowdhury
- Mayo Clinic, Department of Health Sciences Research, Center for Individualized Medicine, Rochester, MN 55905, USA
| | - Alexej Abyzov
- Mayo Clinic, Department of Health Sciences Research, Center for Individualized Medicine, Rochester, MN 55905, USA
| |
Collapse
|
29
|
Dukić M, Berner D, Haag CR, Ebert D. How clonal are clones? A quest for loss of heterozygosity during asexual reproduction in Daphnia magna. J Evol Biol 2019; 32:619-628. [PMID: 30888725 PMCID: PMC6850383 DOI: 10.1111/jeb.13443] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 03/04/2019] [Indexed: 12/25/2022]
Abstract
Due to the lack of recombination, asexual organisms are predicted to accumulate mutations and show high levels of within‐individual allelic divergence (heterozygosity); however, empirical evidence for this prediction is largely missing. Instead, evidence of genome homogenization during asexual reproduction is accumulating. Ameiotic crossover recombination is a mechanism that could lead to long genomic stretches of loss of heterozygosity (LOH) and unmasking of mutations that have little or no effect in heterozygous state. Therefore, LOH might be an important force for inducing variation among asexual offspring and may contribute to the limited longevity of asexual lineages. To investigate the genetic consequences of asexuality, here we used high‐throughput sequencing of Daphnia magna for assessing the rate of LOH over a single generation of asexual reproduction. Comparing parthenogenetic daughters with their mothers at several thousand genetic markers generated by restriction site‐associated DNA (RAD) sequencing resulted in high LOH rate estimation that largely overlapped with our estimates for the error rate. To distinguish these two, we Sanger re‐sequenced the top 17 candidate RAD‐loci for LOH, and all of them proved to be false positives. Hence, even though we cannot exclude the possibility that short stretches of LOH occur in genomic regions not covered by our markers, we conclude that LOH does not occur frequently during asexual reproduction in D. magna and ameiotic crossovers are very rare or absent. This finding suggests that clonal lineages of D. magna will remain genetically homogeneous at least over time periods typically relevant for experimental work.
Collapse
Affiliation(s)
- Marinela Dukić
- Zoological Institute, University of Basel, Basel, Switzerland.,Department of Cell and Developmental Biology, John Innes Centre, Norwich Research Park, Norwich, UK
| | - Daniel Berner
- Zoological Institute, University of Basel, Basel, Switzerland
| | - Christoph R Haag
- Centre d'Ecologie Fonctionnelle et Evolutive-CEFE UMR 5175, CNRS-Université de Montpellier-Université Paul-Valéry Montpellier-EPHE, campus CNRS, Montpellier, France.,Department of Biology, Ecology and Evolution, University of Fribourg, Fribourg, Switzerland
| | - Dieter Ebert
- Zoological Institute, University of Basel, Basel, Switzerland
| |
Collapse
|
30
|
Krasileva KV. The role of transposable elements and DNA damage repair mechanisms in gene duplications and gene fusions in plant genomes. CURRENT OPINION IN PLANT BIOLOGY 2019; 48:18-25. [PMID: 30849712 DOI: 10.1016/j.pbi.2019.01.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 01/16/2019] [Accepted: 01/29/2019] [Indexed: 05/02/2023]
Abstract
Plant genomes are shaped by structural variation. Gene-size insertions and among most prominent events and can have significant effects on amplification of gene families as well as facilitate new gene fusions. Transposable elements as well as plant DNA repair machinery have overlapping contributions to these events, and often work in synergy. Activity of transposable elements is often lineage specific and can preferentially affect specific gene families, such as disease resistance genes. Once duplicated, genes themselves can serve templates for additional variation that can arise from non-allelic homologous recombination. Non-homologous DNA repair mechanisms contribute to additional variation and diversify the mechanisms of gene movement, such as through ligation of extra-chromosomal DNA fragments. Genomic processes that generate structural variation can be induced by stress and, therefore, can provide adaptive potential. This review describes mechanisms that contribute to gene-size structural variation in plants, result in gene duplication and generation of new plant genes through gene fusion.
Collapse
Affiliation(s)
- Ksenia V Krasileva
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, United States.
| |
Collapse
|
31
|
Parks MM, Kurylo CM, Batchelder JE, Theresa Vincent C, Blanchard SC. Implications of sequence variation on the evolution of rRNA. Chromosome Res 2019; 27:89-93. [PMID: 30719681 PMCID: PMC6505490 DOI: 10.1007/s10577-018-09602-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 12/19/2018] [Accepted: 12/26/2018] [Indexed: 12/22/2022]
Abstract
The evolution of the multi-copy family of ribosomal RNA (rRNA) genes is unique in regard to its genetics and genome evolution. Paradoxically, rRNA genes are highly homogenized within and between individuals, yet they are globally distinct between species. Here, we discuss the implications for models of rRNA gene evolution in light of our recent discoveries that ribosomes bearing rRNA sequence variants can affect gene expression and physiology and that intra-individual rRNA alleles exhibit both context- and tissue-specific expression.
Collapse
Affiliation(s)
- Matthew M Parks
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Chad M Kurylo
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | | | - C Theresa Vincent
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.
| | - Scott C Blanchard
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.
- Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
32
|
A Megabase-Scale Deletion is Associated with Phenotypic Variation of Multiple Traits in Maize. Genetics 2018; 211:305-316. [PMID: 30389804 PMCID: PMC6325712 DOI: 10.1534/genetics.118.301567] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 10/26/2018] [Indexed: 11/18/2022] Open
Abstract
Genomic deletions are pervasive in the maize (Zea mays L.) genome, and play important roles in phenotypic variation and adaptive evolution. However, little is known about the biological functions of these genomic deletions. Here, we report the biological function of a megabase-scale deletion, which we identified by position-based cloning of the multi-trait weakened (muw) mutant, which is inherited as a single recessive locus. MUW was mapped to a 5.16-Mb region on chromosome 2. The 5.16-Mb deletion in the muw mutant led to the loss of 48 genes and was responsible for a set of phenotypic abnormities, including wilting leaves, poor yield performance, reduced plant height, increased stomatal density, and rapid water loss. While muw appears to have resulted from double-stranded break repair that was not dependent on intragenomic DNA homology, extensive duplication of maize genes may have mitigated its effects and facilitated its survival.
Collapse
|
33
|
Parks MM, Raphael BJ, Lawrence CE. Using controls to limit false discovery in the era of big data. BMC Bioinformatics 2018; 19:323. [PMID: 30217148 PMCID: PMC6137876 DOI: 10.1186/s12859-018-2356-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 09/03/2018] [Indexed: 12/04/2022] Open
Abstract
Background Procedures for controlling the false discovery rate (FDR) are widely applied as a solution to the multiple comparisons problem of high-dimensional statistics. Current FDR-controlling procedures require accurately calculated p-values and rely on extrapolation into the unknown and unobserved tails of the null distribution. Both of these intermediate steps are challenging and can compromise the reliability of the results. Results We present a general method for controlling the FDR that capitalizes on the large amount of control data often found in big data studies to avoid these frequently problematic intermediate steps. The method utilizes control data to empirically construct the distribution of the test statistic under the null hypothesis and directly compares this distribution to the empirical distribution of the test data. By not relying on p-values, our control data-based empirical FDR procedure more closely follows the foundational principles of the scientific method: that inference is drawn by comparing test data to control data. The method is demonstrated through application to a problem in structural genomics. Conclusions The method described here provides a general statistical framework for controlling the FDR that is specifically tailored for the big data setting. By relying on empirically constructed distributions and control data, it forgoes potentially problematic modeling steps and extrapolation into the unknown tails of the null distribution. This procedure is broadly applicable insofar as controlled experiments or internal negative controls are available, as is increasingly common in the big data setting. Electronic supplementary material The online version of this article (10.1186/s12859-018-2356-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Matthew M Parks
- Department of Physiology and Biophysics, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ, 08540, USA
| | - Charles E Lawrence
- Center for Computational Molecular Biology, Brown University, 115 Waterman Street, Providence, RI, 02912, USA. .,Division of Applied Mathematics, Brown University, 182 George Street, Providence, RI, 02912, USA.
| |
Collapse
|
34
|
Alves I, Houle AA, Hussin JG, Awadalla P. The impact of recombination on human mutation load and disease. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160465. [PMID: 29109227 PMCID: PMC5698626 DOI: 10.1098/rstb.2016.0465] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2017] [Indexed: 12/21/2022] Open
Abstract
Recombination promotes genomic integrity among cells and tissues through double-strand break repair, and is critical for gamete formation and fertility through a strict regulation of the molecular mechanisms associated with proper chromosomal disjunction. In humans, congenital defects and recurrent structural abnormalities can be attributed to aberrant meiotic recombination. Moreover, mutations affecting genes involved in recombination pathways are directly linked to pathologies including infertility and cancer. Recombination is among the most prominent mechanism shaping genome variation, and is associated with not only the structuring of genomic variability, but is also tightly linked with the purging of deleterious mutations from populations. Together, these observations highlight the multiple roles of recombination in human genetics: its ability to act as a major force of evolution, its molecular potential to maintain genome repair and integrity in cell division and its mutagenic cost impacting disease evolution.This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Collapse
Affiliation(s)
- Isabel Alves
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
| | - Armande Ang Houle
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
- Department of Molecular Genetics, University of Toronto, Medical Science Building, 1 King's College Circle, Toronto, Ontario, Canada M5S 1A8
| | - Julie G Hussin
- Montreal Heart Institute, Department of Medicine, University of Montreal, 5000 Rue Bélanger, Montréal, Quebec, Canada H1T 1C8
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Philip Awadalla
- Ontario Institute of Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, Canada M5G 0A3
- Department of Molecular Genetics, University of Toronto, Medical Science Building, 1 King's College Circle, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
35
|
McKerrow WH, Savva YA, Rezaei A, Reenan RA, Lawrence CE. Predicting RNA hyper-editing with a novel tool when unambiguous alignment is impossible. BMC Genomics 2017; 18:522. [PMID: 28693467 PMCID: PMC5502491 DOI: 10.1186/s12864-017-3898-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 06/22/2017] [Indexed: 12/13/2022] Open
Abstract
Background Repetitive elements are now known to have relevant cellular functions, including self-complementary sequences that form double stranded (ds) RNA. There are numerous pathways that determine the fate of endogenous dsRNA, and misregulation of endogenous dsRNA is a driver of autoimmune disease, particularly in the brain. Unfortunately, the alignment of high-throughput, short-read sequences to repeat elements poses a dilemma: Such sequences may align equally well to multiple genomic locations. In order to differentiate repeat elements, current alignment methods depend on sequence variation in the reference genome. Reads are discarded when no such variations are present. However, RNA hyper-editing, a possible fate for dsRNA, introduces enough variation to distinguish between repeats that are otherwise identical. Results To take advantage of this variation, we developed a new algorithm, RepProfile, that simultaneously aligns reads and predicts novel variations. RepProfile accurately aligns hyper-edited reads that other methods discard. In particular we predict hyper-editing of Drosophila melanogaster repeat elements in vivo at levels previously described only in vitro, and provide validation by Sanger sequencing sixty-two individual cloned sequences. We find that hyper-editing is concentrated in genes involved in cell-cell communication at the synapse, including some that are associated with neurodegeneration. We also find that hyper-editing tends to occur in short runs. Conclusions Previous studies of RNA hyper-editing discarded ambiguously aligned reads, ignoring hyper-editing in long, perfect dsRNA – the perfect substrate for hyper-editing. We provide a method that simulation and Sanger validation show accurately predicts such RNA editing, yielding a superior picture of hyper-editing. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3898-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wilson H McKerrow
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA.
| | - Yiannis A Savva
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Ali Rezaei
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Robert A Reenan
- Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, 02912, RI, USA
| | - Charles E Lawrence
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA
| |
Collapse
|