1
|
Barington M, Bak M, Kjartansdóttir KR, Hansen TVO, Birkedal U, Østergaard E, Hove HB. Novel Alu insertion in the ZEB2 gene causing Mowat-Wilson syndrome. Am J Med Genet A 2024; 194:e63581. [PMID: 38600862 DOI: 10.1002/ajmg.a.63581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Revised: 02/06/2024] [Accepted: 02/21/2024] [Indexed: 04/12/2024]
Abstract
Alu elements are short, interspersed elements located throughout the genome, playing a role in human diversity, and occasionally causing genetic diseases. Here, we report a novel Alu insertion causing Mowat-Wilson syndrome, a rare neurodevelopmental disorder, in an 8-year-old boy displaying the typical clinical features for Mowat-Wilson syndrome. The variant was not initially detected in genome sequencing data, but through deep phenotyping, which pointed to only one plausible candidate gene, manual inspection of genome sequencing alignment data enabled us to identify a de novo heterozygous Alu insertion in exon 8 of the ZEB2 gene. Nanopore long-read sequencing confirmed the Alu insertion, leading to the formation of a premature stop codon and likely haploinsufficiency of ZEB2. This underscores the importance of deep phenotyping and mobile element insertion analysis in uncovering genetic causes of monogenic disorders as these elements might be overlooked in standard next-generation sequencing protocols.
Collapse
Affiliation(s)
- Maria Barington
- Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
| | - Mads Bak
- Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
| | | | - Thomas van Overeem Hansen
- Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Ulf Birkedal
- Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
| | - Elsebet Østergaard
- Department of Clinical Genetics, Copenhagen University Hospital, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Hanne Buciek Hove
- Center for Rare Diseases, Department of Paediatrics and Adolescent Medicine, Copenhagen University Hospital, Copenhagen, Denmark
| |
Collapse
|
2
|
Karkas R, Abdullah KSA, Kaizer L, Ürmös Á, Raya M, Tiszlavicz L, Pankotai T, Nagy I, Mátés L, Sükösd F. LINE-1 ORF1p is a Promising Biomarker in Cervical Intraepithelial Neoplasia Degree Assessment. Int J Gynecol Pathol 2024:00004347-990000000-00166. [PMID: 38920137 DOI: 10.1097/pgp.0000000000001035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/27/2024]
Abstract
Cervical intraepithelial neoplasia (CIN) represents a spectrum of preinvasive squamous lesions within the cervical epithelium, whose identification is a diagnostic challenge due to subtle histomorphological differences among its categories. This study explores ORF1p, a nucleic acid-binding protein derived from long interspersed nuclear element-1 (LINE-1), as a potential biomarker for enhancing CIN diagnosis. A comprehensive analysis of 143 cervical specimens, encompassing CIN I (n=20), CIN II (n=46), CIN III (n=14), invasive cancer (n=32), and nondysplastic cases (normal cervical epithelia (n=24) and atrophy (n=7) were conducted. ORF1p, Ki67, and p16 expressions were evaluated using immunohistochemistry. ORF1p immunopositivity was detected in the vast majority [110/112 (98.2%)] of dysplastic and neoplastic (CIN and invasive cancer) specimens, whereas 19/24 (79.2%) of normal cervical specimens lacked ORF1p expression. The observed pattern of ORF1p expression showed a progressively increasing extent and intensity with advancing CIN grades. CIN I exhibited mild ORF1p expression in the lower one or two-thirds of the cervical epithelium [14/16 (87.5%)], whereas CIN II demonstrated moderate to strong ORF1p expression spanning the lower two-thirds [29/46 (63.0%)]. Pronounced transepithelial ORF1p immunopositivity characterized CIN III cases [13/14 (92.8%)] and cervical cancer [30/32 (93.8%)]. These findings propose ORF1p as a valuable indicator even for detecting CIN I, effectively discerning them from normal cervical tissue (p < 0.0001). Our findings underscore the potential of ORF1p as an early diagnostic marker for cervical neoplasia.
Collapse
Affiliation(s)
- Réka Karkas
- Laboratory of Cancer Genome Research, Institute of Genetics, HUN-REN Biological Research Centre, Szeged, Hungary
- Doctoral School of Multidisciplinary Medical Sciences, University of Szeged, Albert Szent-Györgyi Medical School, Szeged, Hungary
| | - Khaldoon Sadiq Ahmed Abdullah
- Laboratory of Cancer Genome Research, Institute of Genetics, HUN-REN Biological Research Centre, Szeged, Hungary
- Doctoral School of Multidisciplinary Medical Sciences, University of Szeged, Albert Szent-Györgyi Medical School, Szeged, Hungary
| | - László Kaizer
- Department of Pathology, Albert Szent-Györgyi Health Centre, University of Szeged, Szeged, Hungary
| | - Ádám Ürmös
- Genome Integrity and DNA Repair Core Group, Hungarian Centre of Excellence for Molecular Medicine, Szeged, Hungary
| | - May Raya
- Laboratory of Cancer Genome Research, Institute of Genetics, HUN-REN Biological Research Centre, Szeged, Hungary
- Doctoral School of Multidisciplinary Medical Sciences, University of Szeged, Albert Szent-Györgyi Medical School, Szeged, Hungary
| | - Lilla Tiszlavicz
- Department of Pediatrics and Pediatric Health Centre, Albert Szent-Györgyi Health Centre, University of Szeged, Szeged, Hungary
| | - Tibor Pankotai
- Department of Pathology, Albert Szent-Györgyi Health Centre, University of Szeged, Szeged, Hungary
- Genome Integrity and DNA Repair Core Group, Hungarian Centre of Excellence for Molecular Medicine, Szeged, Hungary
- Competence Centre of the Life Sciences Cluster of the Centre of Excellence for Interdisciplinary Research, Development and Innovation, University of Szeged, Hungary
| | - István Nagy
- Seqomics Biotechnology Ltd, Mórahalom, Hungary
- Sequencing Platform, Institute of Biochemistry, HUN-REN Biological Research Centre, Szeged, Hungary
| | - Lajos Mátés
- Laboratory of Cancer Genome Research, Institute of Genetics, HUN-REN Biological Research Centre, Szeged, Hungary
| | - Farkas Sükösd
- Department of Pathology, Albert Szent-Györgyi Health Centre, University of Szeged, Szeged, Hungary
| |
Collapse
|
3
|
Caporale LH. Evolutionary feedback from the environment shapes mechanisms that generate genome variation. J Physiol 2024; 602:2601-2614. [PMID: 38194279 DOI: 10.1113/jp284411] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 12/14/2023] [Indexed: 01/10/2024] Open
Abstract
Darwin recognized that 'a grand and almost untrodden field of inquiry will be opened, on the causes and laws of variation.' However, because the Modern Synthesis assumes that the intrinsic probability of any individual mutation is unrelated to that mutation's potential adaptive value, attention has been focused on selection rather than on the intrinsic generation of variation. Yet many examples illustrate that the term 'random' mutation, as widely understood, is inaccurate. The probabilities of distinct classes of variation are neither evenly distributed across a genome nor invariant over time, nor unrelated to their potential adaptive value. Because selection acts upon variation, multiple biochemical mechanisms can and have evolved that increase the relative probability of adaptive mutations. In effect, the generation of heritable variation is in a feedback loop with selection, such that those mechanisms that tend to generate variants that survive recurring challenges in the environment would be captured by this survival and thus inherited and accumulated within lineages of genomes. Moreover, because genome variation is affected by a wide range of biochemical processes, genome variation can be regulated. Biochemical mechanisms that sense stress, from lack of nutrients to DNA damage, can increase the probability of specific classes of variation. A deeper understanding of evolution involves attention to the evolution of, and environmental influences upon, the intrinsic variation generated in gametes, in other words upon the biochemical mechanisms that generate variation across generations. These concepts have profound implications for the types of questions that can and should be asked, as omics databases become more comprehensive, detection methods more sensitive, and computation and experimental analyses even more high throughput and thus capable of revealing the intrinsic generation of variation in individual gametes. These concepts also have profound implications for evolutionary theory, which, upon reflection it will be argued, predicts that selection would increase the probability of generating adaptive mutations, in other words, predicts that the ability to evolve itself evolves.
Collapse
|
4
|
D'Ordine AM, Jogl G, Sedivy JM. Identification and characterization of small molecule inhibitors of the LINE-1 retrotransposon endonuclease. Nat Commun 2024; 15:3883. [PMID: 38719805 PMCID: PMC11078990 DOI: 10.1038/s41467-024-48066-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 04/18/2024] [Indexed: 05/12/2024] Open
Abstract
The long interspersed nuclear element-1 (LINE-1 or L1) retrotransposon is the only active autonomously replicating retrotransposon in the human genome. L1 harms the cell by inserting new copies, generating DNA damage, and triggering inflammation. Therefore, L1 inhibition could be used to treat many diseases associated with these processes. Previous research has focused on inhibition of the L1 reverse transcriptase due to the prevalence of well-characterized inhibitors of related viral enzymes. Here we present the L1 endonuclease as another target for reducing L1 activity. We characterize structurally diverse small molecule endonuclease inhibitors using computational, biochemical, and biophysical methods. We also show that these inhibitors reduce L1 retrotransposition, L1-induced DNA damage, and inflammation reinforced by L1 in senescent cells. These inhibitors could be used for further pharmacological development and as tools to better understand the life cycle of this element and its impact on disease processes.
Collapse
Affiliation(s)
- Alexandra M D'Ordine
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI, USA
- Center on the Biology of Aging, Brown University, Providence, RI, USA
| | - Gerwald Jogl
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI, USA.
- Center on the Biology of Aging, Brown University, Providence, RI, USA.
| | - John M Sedivy
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI, USA.
- Center on the Biology of Aging, Brown University, Providence, RI, USA.
| |
Collapse
|
5
|
Wijngaard R, Demidov G, O'Gorman L, Corominas-Galbany J, Yaldiz B, Steyaert W, de Boer E, Vissers LELM, Kamsteeg EJ, Pfundt R, Swinkels H, den Ouden A, Te Paske IBAW, de Voer RM, Faivre L, Denommé-Pichon AS, Duffourd Y, Vitobello A, Chevarin M, Straub V, Töpf A, van der Kooi AJ, Magrinelli F, Rocca C, Hanna MG, Vandrovcova J, Ossowski S, Laurie S, Gilissen C. Mobile element insertions in rare diseases: a comparative benchmark and reanalysis of 60,000 exome samples. Eur J Hum Genet 2024; 32:200-208. [PMID: 37853102 PMCID: PMC10853235 DOI: 10.1038/s41431-023-01478-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 08/29/2023] [Accepted: 10/04/2023] [Indexed: 10/20/2023] Open
Abstract
Mobile element insertions (MEIs) are a known cause of genetic disease but have been underexplored due to technical limitations of genetic testing methods. Various bioinformatic tools have been developed to identify MEIs in Next Generation Sequencing data. However, most tools have been developed specifically for genome sequencing (GS) data rather than exome sequencing (ES) data, which remains more widely used for routine diagnostic testing. In this study, we benchmarked six MEI detection tools (ERVcaller, MELT, Mobster, SCRAMble, TEMP2 and xTea) on ES data and on GS data from publicly available genomic samples (HG002, NA12878). For all the tools we evaluated sensitivity and precision of different filtering strategies. Results show that there were substantial differences in tool performance between ES and GS data. MELT performed best with ES data and its combination with SCRAMble increased substantially the detection rate of MEIs. By applying both tools to 10,890 ES samples from Solve-RD and 52,624 samples from Radboudumc we were able to diagnose 10 patients who had remained undiagnosed by conventional ES analysis until now. Our study shows that MELT and SCRAMble can be used reliably to identify clinically relevant MEIs in ES data. This may lead to an additional diagnosis for 1 in 3000 to 4000 patients in routine clinical ES.
Collapse
Affiliation(s)
- Robin Wijngaard
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - German Demidov
- Universitätsklinikum Tübingen - Institut für Medizinische Genetik und angewandte Genomik, Tübingen, Germany
| | - Luke O'Gorman
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | | | - Burcu Yaldiz
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Wouter Steyaert
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Elke de Boer
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Lisenka E L M Vissers
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Erik-Jan Kamsteeg
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Rolph Pfundt
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Hilde Swinkels
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Amber den Ouden
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Iris B A W Te Paske
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Richarda M de Voer
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Laurence Faivre
- Centre de Référence Maladies Rares "Anomalies du développement et syndromes malformatifs", Centre de Génétique, FHU-TRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Anne-Sophie Denommé-Pichon
- UMR1231-Inserm, Génétique des Anomalies du développement, Université de Bourgogne Franche-Comté, Dijon, France
- Laboratoire de Génétique chromosomique et moléculaire, UF6254 Innovation en diagnostic génomique des maladies rares, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Yannis Duffourd
- UMR1231-Inserm, Génétique des Anomalies du développement, Université de Bourgogne Franche-Comté, Dijon, France
- Laboratoire de Génétique chromosomique et moléculaire, UF6254 Innovation en diagnostic génomique des maladies rares, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Antonio Vitobello
- UMR1231-Inserm, Génétique des Anomalies du développement, Université de Bourgogne Franche-Comté, Dijon, France
- Laboratoire de Génétique chromosomique et moléculaire, UF6254 Innovation en diagnostic génomique des maladies rares, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Martin Chevarin
- UMR1231-Inserm, Génétique des Anomalies du développement, Université de Bourgogne Franche-Comté, Dijon, France
- Laboratoire de Génétique chromosomique et moléculaire, UF6254 Innovation en diagnostic génomique des maladies rares, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Volker Straub
- John Walton Muscular Dystrophy Research Centre, Translational and Clinical Research Institute, Newcastle University and Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Ana Töpf
- John Walton Muscular Dystrophy Research Centre, Translational and Clinical Research Institute, Newcastle University and Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Anneke J van der Kooi
- Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | - Francesca Magrinelli
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Clarissa Rocca
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
- Clinical Pharmacology, William Harvey Research Institute, School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Michael G Hanna
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
| | - Jana Vandrovcova
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
| | - Stephan Ossowski
- Universitätsklinikum Tübingen - Institut für Medizinische Genetik und angewandte Genomik, Tübingen, Germany
| | - Steven Laurie
- Centro Nacional de Análisis Genómico (CNAG), Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Christian Gilissen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands.
| |
Collapse
|
6
|
He Z, Chen O, Phillips N, Pasquesi GIM, Sabunciyan S, Florea L. Predicting Alu exonization in the human genome with a deep learning model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.03.574099. [PMID: 38260329 PMCID: PMC10802380 DOI: 10.1101/2024.01.03.574099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Alu exonization, or the recruitment of intronic Alu elements into gene sequences, has contributed to functional diversification; however, its extent and the ways in which it influences gene regulation are not fully understood. We developed an unbiased approach to predict Alu exonization events from genomic sequences implemented in a deep learning model, eXAlu, that overcomes the limitations of tissue or condition specificity and the computational burden of RNA-seq analysis. The model captures previously reported characteristics of exonized Alu sequences and can predict sequence elements important for Alu exonization. Using eXAlu, we estimate the number of Alu elements in the human genome undergoing exonization to be between 55-110K, 11-21 fold more than represented in the GENCODE gene database. Using RT-PCR we were able to validate selected predicted Alu exonization events, supporting the accuracy of our method. Lastly, we highlight a potential application of our method to identify polymorphic Alu insertion exonizations in individuals and in the population from whole genome sequencing data.
Collapse
Affiliation(s)
- Zitong He
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21205
| | - Ou Chen
- Department of Pediatrics, Johns Hopkins School of Medicine, Baltimore, MD 21205
| | - Noelani Phillips
- School of Kinesiology, University of Michigan, Ann Arbor, MI 48109
| | - Giulia Irene Maria Pasquesi
- BioFrontiers Institute and Department of Molecular, Cellular and Developmental Biology, University of Colorado Boulder, Boulder, CO 80309 and Crnic Institute Boulder Branch, BioFrontiers Institute, University of Colorado Boulder, Boulder, CO 80303
| | - Sarven Sabunciyan
- Department of Pediatrics, Johns Hopkins School of Medicine, Baltimore, MD 21205
| | - Liliana Florea
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21205
- Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD 21205
| |
Collapse
|
7
|
Yang L, Metzger GA, Padilla Del Valle R, Delgadillo Rubalcaba D, McLaughlin RN. Evolutionary insights from profiling LINE-1 activity at allelic resolution in a single human genome. EMBO J 2024; 43:112-131. [PMID: 38177314 PMCID: PMC10883270 DOI: 10.1038/s44318-023-00007-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/18/2023] [Accepted: 11/10/2023] [Indexed: 01/06/2024] Open
Abstract
Transposable elements have created the majority of the sequence in many genomes. In mammals, LINE-1 retrotransposons have been expanding for more than 100 million years as distinct, consecutive lineages; however, the drivers of this recurrent lineage emergence and disappearance are unknown. Most human genome assemblies provide a record of this ancient evolution, but fail to resolve ongoing LINE-1 retrotranspositions. Utilizing the human CHM1 long-read-based haploid assembly, we identified and cloned all full-length, intact LINE-1s, and found 29 LINE-1s with measurable in vitro retrotransposition activity. Among individuals, these LINE-1s varied in their presence, their allelic sequences, and their activity. We found that recently retrotransposed LINE-1s tend to be active in vitro and polymorphic in the population relative to more ancient LINE-1s. However, some rare allelic forms of old LINE-1s retain activity, suggesting older lineages can persist longer than expected. Finally, in LINE-1s with in vitro activity and in vivo fitness, we identified mutations that may have increased replication in ancient genomes and may prove promising candidates for mechanistic investigations of the drivers of LINE-1 evolution and which LINE-1 sequences contribute to human disease.
Collapse
Affiliation(s)
- Lei Yang
- Pacific Northwest Research Institute, Seattle, WA, USA
| | | | - Ricky Padilla Del Valle
- Pacific Northwest Research Institute, Seattle, WA, USA
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, WA, USA
| | | | - Richard N McLaughlin
- Pacific Northwest Research Institute, Seattle, WA, USA.
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, WA, USA.
| |
Collapse
|
8
|
Wen X, Li J, Yang F, Zhang X, Li Y. Exploring the Effect of High-Energy Heavy Ion Beam on Rice Genome: Transposon Activation. Genes (Basel) 2023; 14:2178. [PMID: 38137000 PMCID: PMC10742395 DOI: 10.3390/genes14122178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 11/25/2023] [Accepted: 11/29/2023] [Indexed: 12/24/2023] Open
Abstract
High-energy heavy ion beams are a new type of physical mutagen that can produce a wide range of phenotypic variations. In order to understand the mechanism of high-energy heavy ion beams, we resequenced the whole genome of individual plants with obvious phenotypic variations in rice. The sequence alignment results revealed a large number of SNPs and InDels, as well as genetic variations related to grain type and heading date. The distribution of SNP and InDel on chromosomes is random, but they often occur in the up/downstream regions and the intergenic region. Mutagenesis can cause changes in transposons such as Dasheng, mPing, Osr13 and RIRE2, affecting the stability of the genome. This study obtained the major gene mutation types, discovered differentially active transposons, screened out gene variants related to phenotype, and explored the mechanism of high-energy heavy ion beam radiation on rice genes.
Collapse
Affiliation(s)
- Xiaoting Wen
- Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China; (X.W.); (F.Y.); (X.Z.); (Y.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jingpeng Li
- Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China; (X.W.); (F.Y.); (X.Z.); (Y.L.)
- Jilin Provincial Laboratory of Crop Germplasm Resources, Changchun 130299, China
| | - Fu Yang
- Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China; (X.W.); (F.Y.); (X.Z.); (Y.L.)
| | - Xin Zhang
- Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China; (X.W.); (F.Y.); (X.Z.); (Y.L.)
| | - Yiwei Li
- Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China; (X.W.); (F.Y.); (X.Z.); (Y.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
9
|
Luqman-Fatah A, Miyoshi T. Human LINE-1 retrotransposons: impacts on the genome and regulation by host factors. Genes Genet Syst 2023; 98:121-154. [PMID: 36436935 DOI: 10.1266/ggs.22-00038] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Genome sequencing revealed that nearly half of the human genome is comprised of transposable elements. Although most of these elements have been rendered inactive due to mutations, full-length intact long interspersed element-1 (LINE-1 or L1) copies retain the ability to mobilize through RNA intermediates by a so-called "copy-and-paste" mechanism, termed retrotransposition. L1 is the only known autonomous mobile genetic element in the genome, and its retrotransposition contributes to inter- or intra-individual genetic variation within the human population. However, L1 retrotransposition also poses a threat to genome integrity due to gene disruption and chromosomal instability. Moreover, recent studies suggest that aberrant L1 expression can impact human health by causing diseases such as cancer and chronic inflammation that might lead to autoimmune disorders. To counteract these adverse effects, the host cells have evolved multiple layers of defense mechanisms at the epigenetic, RNA and protein levels. Intriguingly, several host factors have also been reported to facilitate L1 retrotransposition, suggesting that there is competition between negative and positive regulation of L1 by host factors. Here, we summarize the known host proteins that regulate L1 activity at different stages of the replication cycle and discuss how these factors modulate disease-associated phenotypes caused by L1.
Collapse
Affiliation(s)
- Ahmad Luqman-Fatah
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University
- Department of Stress Response, Radiation Biology Center, Graduate School of Biostudies, Kyoto University
| | - Tomoichiro Miyoshi
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University
- Department of Stress Response, Radiation Biology Center, Graduate School of Biostudies, Kyoto University
| |
Collapse
|
10
|
Rybacki K, Xia M, Ahsan MU, Xing J, Wang K. Assessing the Expression of Long INterspersed Elements (LINEs) via Long-Read Sequencing in Diverse Human Tissues and Cell Lines. Genes (Basel) 2023; 14:1893. [PMID: 37895242 PMCID: PMC10606529 DOI: 10.3390/genes14101893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 09/21/2023] [Accepted: 09/27/2023] [Indexed: 10/29/2023] Open
Abstract
Transposable elements, such as Long INterspersed Elements (LINEs), are DNA sequences that can replicate within genomes. LINEs replicate using an RNA intermediate followed by reverse transcription and are typically a few kilobases in length. LINE activity creates genomic structural variants in human populations and leads to somatic alterations in cancer genomes. Long-read RNA sequencing technologies, including Oxford Nanopore and PacBio, can directly sequence relatively long transcripts, thus providing the opportunity to examine full-length LINE transcripts. This study focuses on the development of a new bioinformatics pipeline for the identification and quantification of active, full-length LINE transcripts in diverse human tissues and cell lines. In our pipeline, we utilized RepeatMasker to identify LINE-1 (L1) transcripts from long-read transcriptome data and incorporated several criteria, such as transcript start position, divergence, and length, to remove likely false positives. Comparisons between cancerous and normal cell lines, as well as human tissue samples, revealed elevated expression levels of young LINEs in cancer, particularly at intact L1 loci. By employing bioinformatics methodologies on long-read transcriptome data, this study demonstrates the landscape of L1 expression in tissues and cell lines.
Collapse
Affiliation(s)
- Karleena Rybacki
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA; (K.R.); (M.X.)
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA;
| | - Mingyi Xia
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA; (K.R.); (M.X.)
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA;
| | - Mian Umair Ahsan
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA;
| | - Jinchuan Xing
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Kai Wang
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA; (K.R.); (M.X.)
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA;
| |
Collapse
|
11
|
Helm BM, Smith AM, Schmit K, Landis BJ, Vatta M, Ware SM. Disruption of FBN1 by an Alu element insertion: A novel genetic cause of Marfan syndrome. Eur J Med Genet 2023; 66:104775. [PMID: 37264881 DOI: 10.1016/j.ejmg.2023.104775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 02/17/2023] [Accepted: 04/29/2023] [Indexed: 06/03/2023]
Abstract
Alu elements are retrotransposons with ubiquitous presence in the human genome that have contributed to human genomic diversity and health. These approximately 300-bp sequences can cause or mediate disease by disrupting coding/splicing regions in the germline, by insertional mutagenesis in somatic cells, and in promoting formation of copy-number variants. Alu elements may also disrupt epigenetic regulation by affecting non-coding regulatory regions. There are increasing reports of apparently sporadic and inherited genetic disorders caused by Alu-related gene disruption, but Marfan syndrome resulting from Alu element insertion has not been previously described. We report a family with classic features of Marfan syndrome whose previous FBN1 genetic testing was inconclusive. Using contemporary next-generation sequencing and bioinformatics analysis, a pathogenic/disruptive Alu insertion occurring in the coding region of the FBN1 gene was identified (c.6564_6565insAlu; p. Glu2189fs) and was confirmed and specified further with Sanger sequencing. This identified the molecular basis of disease in the family that was missed using previous genetic testing technologies and highlights a novel pathogenic mechanism for Marfan syndrome. This case adds to the growing literature of Mendelian diseases caused by Alu retrotransposition, and it also shows the growing capability of genomic technologies for detecting atypical mutation events.
Collapse
Affiliation(s)
- Benjamin M Helm
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA; Department of Epidemiology, Indiana University Fairbanks School of Public Health, Indianapolis, IN, USA.
| | - Amanda M Smith
- Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kelly Schmit
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA.
| | - Benjamin J Landis
- Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA.
| | | | - Stephanie M Ware
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA; Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA.
| |
Collapse
|
12
|
Stamidis N, Żylicz JJ. RNA-mediated heterochromatin formation at repetitive elements in mammals. EMBO J 2023; 42:e111717. [PMID: 36847618 PMCID: PMC10106986 DOI: 10.15252/embj.2022111717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 12/12/2022] [Accepted: 02/07/2023] [Indexed: 03/01/2023] Open
Abstract
The failure to repress transcription of repetitive genomic elements can lead to catastrophic genome instability and is associated with various human diseases. As such, multiple parallel mechanisms cooperate to ensure repression and heterochromatinization of these elements, especially during germline development and early embryogenesis. A vital question in the field is how specificity in establishing heterochromatin at repetitive elements is achieved. Apart from trans-acting protein factors, recent evidence points to a role of different RNA species in targeting repressive histone marks and DNA methylation to these sites in mammals. Here, we review recent discoveries on this topic and predominantly focus on the role of RNA methylation, piRNAs, and other localized satellite RNAs.
Collapse
Affiliation(s)
- Nikolaos Stamidis
- Novo Nordisk Foundation Center for Stem Cell Medicine, reNEW, University of Copenhagen, Copenhagen, Denmark
| | - Jan Jakub Żylicz
- Novo Nordisk Foundation Center for Stem Cell Medicine, reNEW, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
13
|
Lee H, Min JW, Mun S, Han K. Human Retrotransposons and Effective Computational Detection Methods for Next-Generation Sequencing Data. LIFE (BASEL, SWITZERLAND) 2022; 12:life12101583. [PMID: 36295018 PMCID: PMC9605557 DOI: 10.3390/life12101583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/03/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022]
Abstract
Transposable elements (TEs) are classified into two classes according to their mobilization mechanism. Compared to DNA transposons that move by the "cut and paste" mechanism, retrotransposons mobilize via the "copy and paste" method. They have been an essential research topic because some of the active elements, such as Long interspersed element 1 (LINE-1), Alu, and SVA elements, have contributed to the genetic diversity of primates beyond humans. In addition, they can cause genetic disorders by altering gene expression and generating structural variations (SVs). The development and rapid technological advances in next-generation sequencing (NGS) have led to new perspectives on detecting retrotransposon-mediated SVs, especially insertions. Moreover, various computational methods have been developed based on NGS data to precisely detect the insertions and deletions in the human genome. Therefore, this review discusses details about the recently studied and utilized NGS technologies and the effective computational approaches for discovering retrotransposons through it. The final part covers a diverse range of computational methods for detecting retrotransposon insertions with human NGS data. This review will give researchers insights into understanding the TEs and how to investigate them and find connections with research interests.
Collapse
Affiliation(s)
- Haeun Lee
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
| | - Jun Won Min
- Department of Surgery, Dankook University College of Medicine, Cheonan 31116, Korea
| | - Seyoung Mun
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- Correspondence: (S.M.); (K.H.)
| | - Kyudong Han
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- HuNbiome Co., Ltd., R&D Center, Seoul 08507, Korea
- Correspondence: (S.M.); (K.H.)
| |
Collapse
|
14
|
Paulat NS, McGuire E, Subramanian K, Osmanski AB, Moreno-Santillán DD, Ray DA, Xing J. Transposable Elements in Bats Show Differential Accumulation Patterns Determined by Class and Functionality. LIFE (BASEL, SWITZERLAND) 2022; 12:life12081190. [PMID: 36013369 PMCID: PMC9409754 DOI: 10.3390/life12081190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 08/01/2022] [Accepted: 08/02/2022] [Indexed: 11/16/2022]
Abstract
Bat genomes are characterized by a diverse transposable element (TE) repertoire. In particular, the genomes of members of the family Vespertilionidae contain both active retrotransposons and active DNA transposons. Each TE type is characterized by a distinct pattern of accumulation over the past ~40 million years. Each also exhibits its own target site preferences (sometimes shared with other TEs) that impact where they are likely to insert when mobilizing. Therefore, bats provide a great resource for understanding the diversity of TE insertion patterns. To gain insight into how these diverse TEs impact genome structure, we performed comparative spatial analyses between different TE classes and genomic features, including genic regions and CpG islands. Our results showed a depletion of all TEs in the coding sequence and revealed patterns of species- and element-specific attraction in the transcript. Trends of attraction in the distance tests also suggested significant TE activity in regions adjacent to genes. In particular, the enrichment of small, non-autonomous TE insertions in introns and near coding regions supports the hypothesis that the genomic distribution of TEs is the product of a balance of the TE insertion preference in open chromatin regions and the purifying selection against TEs within genes.
Collapse
Affiliation(s)
- Nicole S. Paulat
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
| | - Erin McGuire
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Krishnamurthy Subramanian
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Austin B. Osmanski
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
| | | | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
- Correspondence: (D.A.R.); (J.X.)
| | - Jinchuan Xing
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Correspondence: (D.A.R.); (J.X.)
| |
Collapse
|
15
|
Baar T, Dümcke S, Gressel S, Schwalb B, Dilthey A, Cramer P, Tresch A. RNA transcription and degradation of Alu retrotransposons depends on sequence features and evolutionary history. G3 GENES|GENOMES|GENETICS 2022; 12:6543614. [PMID: 35253846 PMCID: PMC9073682 DOI: 10.1093/g3journal/jkac054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 02/25/2022] [Indexed: 11/16/2022]
Abstract
Alu elements are one of the most successful groups of RNA retrotransposons and make up 11% of the human genome with over 1 million individual loci. They are linked to genetic defects, increases in sequence diversity, and influence transcriptional activity. Still, their RNA metabolism is poorly understood yet. It is even unclear whether Alu elements are mostly transcribed by RNA Polymerase II or III. We have conducted a transcription shutoff experiment by α-amanitin and metabolic RNA labeling by 4-thiouridine combined with RNA fragmentation (TT-seq) and RNA-seq to shed further light on the origin and life cycle of Alu transcripts. We find that Alu RNAs are more stable than previously thought and seem to originate in part from RNA Polymerase II activity, as previous reports suggest. Their expression however seems to be independent of the transcriptional activity of adjacent genes. Furthermore, we have developed a novel statistical test for detecting the expression of quantitative trait loci in Alu elements that relies on the de Bruijn graph representation of all Alu sequences. It controls for both statistical significance and biological relevance using a tuned k-mer representation, discovering influential sequence features missed by regular motif search. In addition, we discover several point mutations using a generalized linear model, and motifs of interest, which also match transcription factor-binding motifs.
Collapse
Affiliation(s)
- Till Baar
- Institute of Medical Statistics and Computational Biology, Faculty of Medicine, University of Cologne, Cologne 50937, Germany
| | | | - Saskia Gressel
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen 37077, Germany
| | - Björn Schwalb
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen 37077, Germany
| | - Alexander Dilthey
- Institute of Medical Microbiology and Hospital Hygiene, Medical Faculty, Heinrich-Heine-University Düsseldorf, Düsseldorf 40225, Germany
| | - Patrick Cramer
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen 37077, Germany
| | - Achim Tresch
- Institute of Medical Statistics and Computational Biology, Faculty of Medicine, University of Cologne, Cologne 50937, Germany
- CECAD, University of Cologne, Cologne 50931, Germany
- Center for Data and Simulation Science, University of Cologne, Cologne 50923, Germany
| |
Collapse
|
16
|
Pfaff AL, Singleton LM, Kõks S. Mechanisms of disease-associated SINE-VNTR-Alus. Exp Biol Med (Maywood) 2022; 247:756-764. [PMID: 35387528 DOI: 10.1177/15353702221082612] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
SINE-VNTR-Alus (SVAs) are the youngest retrotransposon family in the human genome. Their ongoing mobilization has generated genetic variation within the human population. At least 24 insertions to date, detailed in this review, have been associated with disease. The predominant mechanisms through which this occurs are alterations to normal splicing patterns, exonic insertions causing loss-of-function mutations, and large genomic deletions. Dissecting the functional impact of these SVAs and the mechanism through which they cause disease provides insight into the consequences of their presence in the genome and how these elements could influence phenotypes. Many of these disease-associated SVAs have been difficult to characterize and would not have been identified through routine analyses. However, the number identified has increased in recent years as DNA and RNA sequencing data became more widely available. Therefore, as the search for complex structural variation in disease continues, it is likely to yield further disease-causing SVA insertions.
Collapse
Affiliation(s)
- Abigail L Pfaff
- Perron Institute for Neurological and Translational Science, Perth, WA 6009, Australia.,Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA 6150, Australia
| | - Lewis M Singleton
- Perron Institute for Neurological and Translational Science, Perth, WA 6009, Australia
| | - Sulev Kõks
- Perron Institute for Neurological and Translational Science, Perth, WA 6009, Australia.,Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA 6150, Australia
| |
Collapse
|
17
|
Hoyt SJ, Storer JM, Hartley GA, Grady PGS, Gershman A, de Lima LG, Limouse C, Halabian R, Wojenski L, Rodriguez M, Altemose N, Rhie A, Core LJ, Gerton JL, Makalowski W, Olson D, Rosen J, Smit AFA, Straight AF, Vollger MR, Wheeler TJ, Schatz MC, Eichler EE, Phillippy AM, Timp W, Miga KH, O’Neill RJ. From telomere to telomere: The transcriptional and epigenetic state of human repeat elements. Science 2022; 376:eabk3112. [PMID: 35357925 PMCID: PMC9301658 DOI: 10.1126/science.abk3112] [Citation(s) in RCA: 118] [Impact Index Per Article: 59.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Mobile elements and repetitive genomic regions are sources of lineage-specific genomic innovation and uniquely fingerprint individual genomes. Comprehensive analyses of such repeat elements, including those found in more complex regions of the genome, require a complete, linear genome assembly. We present a de novo repeat discovery and annotation of the T2T-CHM13 human reference genome. We identified previously unknown satellite arrays, expanded the catalog of variants and families for repeats and mobile elements, characterized classes of complex composite repeats, and located retroelement transduction events. We detected nascent transcription and delineated CpG methylation profiles to define the structure of transcriptionally active retroelements in humans, including those in centromeres. These data expand our insight into the diversity, distribution, and evolution of repetitive regions that have shaped the human genome.
Collapse
Affiliation(s)
- Savannah J. Hoyt
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | | | - Gabrielle A. Hartley
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Patrick G. S. Grady
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
| | | | - Charles Limouse
- Department of Biochemistry, Stanford University, Stanford, CA, USA
| | - Reza Halabian
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Luke Wojenski
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Matias Rodriguez
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Nicolas Altemose
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Leighton J. Core
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | | | - Wojciech Makalowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Daniel Olson
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Jeb Rosen
- Institute for Systems Biology, Seattle, WA, USA
| | | | | | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Michael C. Schatz
- Department of Computer Science and Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Rachel J. O’Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Genetics and Genome Sciences, UConn Health, Farmington, CT, USA
| |
Collapse
|
18
|
Borges-Monroy R, Chu C, Dias C, Choi J, Lee S, Gao Y, Shin T, Park PJ, Walsh CA, Lee EA. Whole-genome analysis reveals the contribution of non-coding de novo transposon insertions to autism spectrum disorder. Mob DNA 2021; 12:28. [PMID: 34838103 PMCID: PMC8627061 DOI: 10.1186/s13100-021-00256-w] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 11/02/2021] [Indexed: 12/30/2022] Open
Abstract
Background Retrotransposons have been implicated as causes of Mendelian disease, but their role in autism spectrum disorder (ASD) has not been systematically defined, because they are only called with adequate sensitivity from whole genome sequencing (WGS) data and a large enough cohort for this analysis has only recently become available. Results We analyzed WGS data from a cohort of 2288 ASD families from the Simons Simplex Collection by establishing a scalable computational pipeline for retrotransposon insertion detection. We report 86,154 polymorphic retrotransposon insertions—including > 60% not previously reported—and 158 de novo retrotransposition events. The overall burden of de novo events was similar between ASD individuals and unaffected siblings, with 1 de novo insertion per 29, 117, and 206 births for Alu, L1, and SVA respectively, and 1 de novo insertion per 21 births total. However, ASD cases showed more de novo L1 insertions than expected in ASD genes. Additionally, we observed exonic insertions in loss-of-function intolerant genes, including a likely pathogenic exonic insertion in CSDE1, only in ASD individuals. Conclusions These findings suggest a modest, but important, impact of intronic and exonic retrotransposon insertions in ASD, show the importance of WGS for their analysis, and highlight the utility of specific bioinformatic tools for high-throughput detection of retrotransposon insertions. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-021-00256-w.
Collapse
Affiliation(s)
- Rebeca Borges-Monroy
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chong Chu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Caroline Dias
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Division of Developmental Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Jaejoon Choi
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Genetics, Harvard Medical School, MA, Boston, USA
| | - Soohyun Lee
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Yue Gao
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Pediatrics, Harvard Medical School, MA, Boston, USA
| | - Taehwan Shin
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Pediatrics, Harvard Medical School, MA, Boston, USA
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Christopher A Walsh
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. .,Department of Pediatrics, Harvard Medical School, MA, Boston, USA. .,Department of Neurology, Harvard Medical School, Boston, MA, USA. .,Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA.
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. .,Department of Pediatrics, Harvard Medical School, MA, Boston, USA.
| |
Collapse
|
19
|
Vlachogiannis NI, Sachse M, Georgiopoulos G, Zormpas E, Bampatsias D, Delialis D, Bonini F, Galyfos G, Sigala F, Stamatelopoulos K, Gatsiou A, Stellos K. Adenosine-to-inosine Alu RNA editing controls the stability of the pro-inflammatory long noncoding RNA NEAT1 in atherosclerotic cardiovascular disease. J Mol Cell Cardiol 2021; 160:111-120. [PMID: 34302813 PMCID: PMC8585018 DOI: 10.1016/j.yjmcc.2021.07.005] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 06/21/2021] [Accepted: 07/16/2021] [Indexed: 12/24/2022]
Abstract
Long non-coding RNAs (lncRNAs) have emerged as critical regulators in human disease including atherosclerosis. However, the mechanisms involved in the post-transcriptional regulation of the expression of disease-associated lncRNAs are not fully understood. Gene expression studies revealed that Nuclear Paraspeckle Assembly Transcript 1 (NEAT1) lncRNA expression was increased by >2-fold in peripheral blood mononuclear cells (PBMCs) derived from patients with coronary artery disease (CAD) or in carotid artery atherosclerotic plaques. We observed a linear association between NEAT1 lncRNA expression and prevalence of CAD which was independent of age, sex, cardiovascular traditional risk factors and renal function. NEAT1 expression was induced by TNF-α, while silencing of NEAT1 profoundly attenuated the TNF-α-induced vascular endothelial cell pro-inflammatory response as defined by the expression of CXCL8, CCL2, VCAM1 and ICAM1. Overexpression of the RNA editing enzyme adenosine deaminase acting on RNA-1 (ADAR1), but not of its editing-deficient mutant, upregulated NEAT1 levels. Conversely, silencing of ADAR1 suppressed the basal levels and the TNF-α-induced increase of NEAT1. NEAT1 lncRNA expression was strongly associated with ADAR1 in CAD and peripheral arterial vascular disease. RNA editing mapping studies revealed the presence of several inosines in close proximity to AU-rich elements within the AluSx3+/AluJo- double-stranded RNA complex. Silencing of the stabilizing RNA-binding protein AUF1 reduced NEAT1 levels while silencing of ADAR1 profoundly affected the binding capacity of AUF1 to NEAT1. Together, our findings propose a mechanism by which ADAR1-catalyzed A-to-I RNA editing controls NEAT1 lncRNA stability in ASCVD.
Collapse
Affiliation(s)
- Nikolaos I Vlachogiannis
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK; Department of Cardiology, Freeman Hospital, Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Marco Sachse
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK; Vascular Inflammation and RNA Metabolism Laboratory, Institute for Vascular Signalling, JW Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Georgios Georgiopoulos
- Department of Clinical Therapeutics, Alexandra Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - Eleftherios Zormpas
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK
| | - Dimitrios Bampatsias
- Department of Clinical Therapeutics, Alexandra Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - Dimitrios Delialis
- Department of Clinical Therapeutics, Alexandra Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - Francesca Bonini
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK; Vascular Inflammation and RNA Metabolism Laboratory, Institute for Vascular Signalling, JW Goethe University Frankfurt, Frankfurt am Main, Germany
| | - George Galyfos
- First Propaedeutic Department of Surgery, National and Kapodistrian University of Athens, Hippocration Hospital, Athens, Greece
| | - Fragiska Sigala
- First Propaedeutic Department of Surgery, National and Kapodistrian University of Athens, Hippocration Hospital, Athens, Greece
| | - Kimon Stamatelopoulos
- Department of Clinical Therapeutics, Alexandra Hospital, National and Kapodistrian University of Athens, Athens, Greece
| | - Aikaterini Gatsiou
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK.
| | - Konstantinos Stellos
- Biosciences Institute, Vascular Biology and Medicine Theme, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK; Department of Cardiology, Freeman Hospital, Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK; Vascular Inflammation and RNA Metabolism Laboratory, Institute for Vascular Signalling, JW Goethe University Frankfurt, Frankfurt am Main, Germany.
| |
Collapse
|
20
|
Wang Y, Zhao B, Choi J, Lee EA. Genomic approaches to trace the history of human brain evolution with an emerging opportunity for transposon profiling of ancient humans. Mob DNA 2021; 12:22. [PMID: 34663455 PMCID: PMC8525043 DOI: 10.1186/s13100-021-00250-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 09/27/2021] [Indexed: 12/17/2022] Open
Abstract
Transposable elements (TEs) significantly contribute to shaping the diversity of the human genome, and lines of evidence suggest TEs as one of driving forces of human brain evolution. Existing computational approaches, including cross-species comparative genomics and population genetic modeling, can be adapted for the study of the role of TEs in evolution. In particular, diverse ancient and archaic human genome sequences are increasingly available, allowing reconstruction of past human migration events and holding the promise of identifying and tracking TEs among other evolutionarily important genetic variants at an unprecedented spatiotemporal resolution. However, highly degraded short DNA templates and other unique challenges presented by ancient human DNA call for major changes in current experimental and computational procedures to enable the identification of evolutionarily important TEs. Ancient human genomes are valuable resources for investigating TEs in the evolutionary context, and efforts to explore ancient human genomes will potentially provide a novel perspective on the genetic mechanism of human brain evolution and inspire a variety of technological and methodological advances. In this review, we summarize computational and experimental approaches that can be adapted to identify and validate evolutionarily important TEs, especially for human brain evolution. We also highlight strategies that leverage ancient genomic data and discuss unique challenges in ancient transposon genomics.
Collapse
Affiliation(s)
- Yilan Wang
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Boxun Zhao
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
| | - Jaejoon Choi
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
| |
Collapse
|
21
|
Factors Regulating the Activity of LINE1 Retrotransposons. Genes (Basel) 2021; 12:genes12101562. [PMID: 34680956 PMCID: PMC8535693 DOI: 10.3390/genes12101562] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 09/21/2021] [Accepted: 09/22/2021] [Indexed: 12/15/2022] Open
Abstract
LINE-1 (L1) is a class of autonomous mobile genetic elements that form somatic mosaicisms in various tissues of the organism. The activity of L1 retrotransposons is strictly controlled by many factors in somatic and germ cells at all stages of ontogenesis. Alteration of L1 activity was noted in a number of diseases: in neuropsychiatric and autoimmune diseases, as well as in various forms of cancer. Altered activity of L1 retrotransposons for some pathologies is associated with epigenetic changes and defects in the genes involved in their repression. This review discusses the molecular genetic mechanisms of the retrotransposition and regulation of the activity of L1 elements. The contribution of various factors controlling the expression and distribution of L1 elements in the genome occurs at all stages of the retrotransposition. The regulation of L1 elements at the transcriptional, post-transcriptional and integration into the genome stages is described in detail. Finally, this review also focuses on the evolutionary aspects of L1 accumulation and their interplay with the host regulation system.
Collapse
|
22
|
Florea L, Payer L, Antonescu C, Yang G, Burns K. Detection of Alu Exonization Events in Human Frontal Cortex From RNA-Seq Data. Front Mol Biosci 2021; 8:727537. [PMID: 34568430 PMCID: PMC8460874 DOI: 10.3389/fmolb.2021.727537] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 08/30/2021] [Indexed: 11/15/2022] Open
Abstract
Alu exonization events functionally diversify the transcriptome, creating alternative mRNA isoforms and accounting for an estimated 5% of the alternatively spliced (skipped) exons in the human genome. We developed computational methods, implemented into a software called Alubaster, for detecting incorporation of Alu sequences in mRNA transcripts from large scale RNA-seq data sets. The approach detects Alu sequences derived from both fixed and polymorphic Alu elements, including Alu insertions missing from the reference genome. We applied our methods to 117 GTEx human frontal cortex samples to build and characterize a collection of Alu-containing mRNAs. In particular, we detected and characterized Alu exonizations occurring at 870 fixed Alu loci, of which 237 were novel, as well as hundreds of putative events involving Alu elements that are polymorphic variants or rare alleles not present in the reference genome. These methods and annotations represent a unique and valuable resource that can be used to understand the characteristics of Alu-containing mRNAs and their tissue-specific expression patterns.
Collapse
Affiliation(s)
- Liliana Florea
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States.,Department of Computer Science, Johns Hopkins University, Baltimore, MD, United States
| | - Lindsay Payer
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States.,Department of Pathology, Johns Hopkins School of Medicine, Baltimore, MD, United States
| | - Corina Antonescu
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States
| | - Guangyu Yang
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States.,Department of Computer Science, Johns Hopkins University, Baltimore, MD, United States
| | - Kathleen Burns
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States.,Department of Pathology, Johns Hopkins School of Medicine, Baltimore, MD, United States.,Department of Pathology, Dana-Farber Cancer Institute, Boston, MA, United States.,Harvard Medical School, Boston, MA, United States
| |
Collapse
|
23
|
Jeon S, Kim S, Oh MH, Liang P, Tang W, Han K. A comprehensive analysis of gorilla-specific LINE-1 retrotransposons. Genes Genomics 2021; 43:1133-1141. [PMID: 34406591 DOI: 10.1007/s13258-021-01146-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 07/29/2021] [Indexed: 11/29/2022]
Abstract
BACKGROUND Long interspersed element-1 (LINE-1 or L1) is the most abundant retrotransposons in the primate genome. They have approximately 520,000 copies and make up ~ 17% of the primate genome. Full-length L1s can mobilize to a new genomic location using their enzymatic machinery. Gorilla is the second closest species to humans after the chimpanzee, and human-gorilla split 7-12 million years ago. The gorilla genome provides an opportunity to explore primate origins and evolution. OBJECTIVE L1s have contributed to genome diversity and variations during primate evolution. This study aimed to identify gorilla-specific L1s using a more recent version of the gorilla reference genome (Mar. 2016 GSMRT3/gorGor5). METHODS We collected gorilla-specific L1 candidates through computational analysis and manual inspection. L1Xplorer was used to identify whether full-length gorilla-specific L1s were intact. In addition, to determine the level of sequence conservation between intact fulllength gorilla-specific L1s, two ORFs of intact L1s were aligned with the L1PA2 consensus sequence. RESULTS 2002 gorilla-specific L1 candidates were identified through computational analysis. Among them, we manually inspected 1,883 gorilla-specific L1s, among which most of them belong to the L1PA2 subfamily and 12 were intact L1s that could influence genomic variations in the gorilla genome. Interestingly, the 12 intact full-length gorilla-specific L1s have 14 highly conserved nonsynonymous mutations, including 6 mutations and 8 mutations in ORF1 and ORF2, respectively. In comparison to the intact full-length chimpanzee-specific L1s and human-specific hot-L1s, two of these in ORF1 (L256F and E293G) were shown as gorilla-specific nonsynonymous mutations. CONCLUSION The gorilla-specific L1s may have had significantly affected the gorilla genome to compose a genome different form that of other primates during primate evolution.
Collapse
Affiliation(s)
- Soyeon Jeon
- Department of Microbiology, College of Science and Technology, Dankook University, Cheonan, 31116, Republic of Korea
| | - Songmi Kim
- Department of Microbiology, College of Science and Technology, Dankook University, Cheonan, 31116, Republic of Korea.,Center for Bio-Medical Engineering Core Facility, Dankook University, Cheonan, 31116, Republic of Korea
| | - Man Hwan Oh
- Department of Microbiology, College of Science and Technology, Dankook University, Cheonan, 31116, Republic of Korea
| | - Ping Liang
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada.,Centre of Biotechnologies, Brock University, St. Catharines, ON, L2S 3A1, Canada
| | - Wanxiangfu Tang
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada
| | - Kyudong Han
- Department of Microbiology, College of Science and Technology, Dankook University, Cheonan, 31116, Republic of Korea. .,Center for Bio-Medical Engineering Core Facility, Dankook University, Cheonan, 31116, Republic of Korea.
| |
Collapse
|
24
|
Anania C, Lupiáñez DG. Order and disorder: abnormal 3D chromatin organization in human disease. Brief Funct Genomics 2021; 19:128-138. [PMID: 32025693 PMCID: PMC7115703 DOI: 10.1093/bfgp/elz028] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Revised: 07/23/2019] [Accepted: 09/20/2019] [Indexed: 02/06/2023] Open
Abstract
A precise three-dimensional (3D) organization of chromatin is central to achieve the intricate transcriptional patterns that are required to form complex organisms. Growing evidence supports an important role of 3D chromatin architecture in development and delineates its alterations as prominent causes of disease. In this review, we discuss emerging concepts on the fundamental forces shaping genomes in space and on how their disruption can lead to pathogenic phenotypes. We describe the molecular mechanisms underlying a wide range of diseases, from the systemic effects of coding mutations on 3D architectural factors, to the more tissue-specific phenotypes resulting from genetic and epigenetic modifications at specific loci. Understanding the connection between the 3D organization of the genome and its underlying biological function will allow a better interpretation of human pathogenesis.
Collapse
Affiliation(s)
- Chiara Anania
- Epigenetics and Sex Development Group, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Darío G Lupiáñez
- Epigenetics and Sex Development Group, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany
| |
Collapse
|
25
|
Stow EC, Kaul T, deHaro DL, Dem MR, Beletsky AG, Morales ME, Du Q, LaRosa AJ, Yang H, Smither E, Baddoo M, Ungerleider N, Deininger P, Belancio VP. Organ-, sex- and age-dependent patterns of endogenous L1 mRNA expression at a single locus resolution. Nucleic Acids Res 2021; 49:5813-5831. [PMID: 34023901 PMCID: PMC8191783 DOI: 10.1093/nar/gkab369] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 04/21/2021] [Accepted: 04/28/2021] [Indexed: 11/13/2022] Open
Abstract
Expression of L1 mRNA, the first step in the L1 copy-and-paste amplification cycle, is a prerequisite for L1-associated genomic instability. We used a reported stringent bioinformatics method to parse L1 mRNA transcripts and measure the level of L1 mRNA expressed in mouse and rat organs at a locus-specific resolution. This analysis determined that mRNA expression of L1 loci in rodents exhibits striking organ specificity with less than 0.8% of loci shared between organs of the same organism. This organ specificity in L1 mRNA expression is preserved in male and female mice and across age groups. We discovered notable differences in L1 mRNA expression between sexes with only 5% of expressed L1 loci shared between male and female mice. Moreover, we report that the levels of total L1 mRNA expression and the number and spectrum of expressed L1 loci fluctuate with age as independent variables, demonstrating different patterns in different organs and sexes. Overall, our comparisons between organs and sexes and across ages ranging from 2 to 22 months establish previously unforeseen dynamic changes in L1 mRNA expression in vivo. These findings establish the beginning of an atlas of endogenous L1 mRNA expression across a broad range of biological variables that will guide future studies.
Collapse
Affiliation(s)
- Emily C Stow
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Tiffany Kaul
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Dawn L deHaro
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Madeleine R Dem
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Anna G Beletsky
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Maria E Morales
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Qianhui Du
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Alexis J LaRosa
- Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Hanlin Yang
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Emily Smither
- Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Melody Baddoo
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Nathan Ungerleider
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Prescott Deininger
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Victoria P Belancio
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| |
Collapse
|
26
|
Steely CJ, Russell KL, Feusier JE, Qiao Y, Tavtigian SV, Marth G, Jorde LB. Mobile element insertions and associated structural variants in longitudinal breast cancer samples. Sci Rep 2021; 11:13020. [PMID: 34158539 PMCID: PMC8219704 DOI: 10.1038/s41598-021-92444-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/07/2021] [Indexed: 02/05/2023] Open
Abstract
While mobile elements are largely inactive in healthy somatic tissues, increased activity has been found in cancer tissues, with significant variation among different cancer types. In addition to insertion events, mobile elements have also been found to mediate many structural variation events in the genome. Here, to better understand the timing and impact of mobile element insertions and associated structural variants in cancer, we examined their activity in longitudinal samples of four metastatic breast cancer patients. We identified 11 mobile element insertions or associated structural variants and found that the majority of these occurred early in tumor progression. Most of the variants impact intergenic regions; however, we identified a translocation interrupting MAP2K4 involving Alu elements and a deletion in YTHDF2 involving mobile elements that likely inactivate reported tumor suppressor genes. The high variant allele fraction of the translocation, the loss of the other copy of MAP2K4, the recurrent loss-of-function mutations found in this gene in other cancers, and the important function of MAP2K4 indicate that this translocation is potentially a driver mutation. Overall, using a unique longitudinal dataset, we find that most variants are likely passenger mutations in the four patients we examined, but some variants impact tumor progression.
Collapse
Affiliation(s)
- Cody J Steely
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA.
| | - Kristi L Russell
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Julie E Feusier
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Yi Qiao
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Sean V Tavtigian
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA
| | - Gabor Marth
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| |
Collapse
|
27
|
A study of transposable element-associated structural variations (TASVs) using a de novo-assembled Korean genome. Exp Mol Med 2021; 53:615-630. [PMID: 33833373 PMCID: PMC8102501 DOI: 10.1038/s12276-021-00586-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Revised: 01/26/2021] [Accepted: 01/27/2021] [Indexed: 12/13/2022] Open
Abstract
Advances in next-generation sequencing (NGS) technology have made personal genome sequencing possible, and indeed, many individual human genomes have now been sequenced. Comparisons of these individual genomes have revealed substantial genomic differences between human populations as well as between individuals from closely related ethnic groups. Transposable elements (TEs) are known to be one of the major sources of these variations and act through various mechanisms, including de novo insertion, insertion-mediated deletion, and TE–TE recombination-mediated deletion. In this study, we carried out de novo whole-genome sequencing of one Korean individual (KPGP9) via multiple insert-size libraries. The de novo whole-genome assembly resulted in 31,305 scaffolds with a scaffold N50 size of 13.23 Mb. Furthermore, through computational data analysis and experimental verification, we revealed that 182 TE-associated structural variation (TASV) insertions and 89 TASV deletions contributed 64,232 bp in sequence gain and 82,772 bp in sequence loss, respectively, in the KPGP9 genome relative to the hg19 reference genome. We also verified structural differences associated with TASVs by comparative analysis with TASVs in recent genomes (AK1 and TCGA genomes) and reported their details. Here, we constructed a new Korean de novo whole-genome assembly and provide the first study, to our knowledge, focused on the identification of TASVs in an individual Korean genome. Our findings again highlight the role of TEs as a major driver of structural variations in human individual genomes. A novel strategy for genome analysis offers insights into the distribution and impact on genome variation of transposable elements, DNA sequences that can replicate and relocate themselves at different chromosomal regions. These sequences, also known as ‘jumping genes’, comprise up to 50% of the genome, but it has proven challenging to map them with existing techniques. Seyoung Mun of Dankook University, Cheonan, South Korea, and coworkers have developed a sequencing and computational analysis strategy that allowed them to accurately map transposable elements across the genome of a Korean individual. These data revealed hundreds of insertion and deletion events relative to an existing reference map of the genome, showing significant alterations in the chromosomal structure. The authors speculate that such widespread transposition events could potentially contribute to individual differences in gene expression and risk of disease.
Collapse
|
28
|
Hiatt SM, Lawlor JM, Handley LH, Ramaker RC, Rogers BB, Partridge EC, Boston LB, Williams M, Plott CB, Jenkins J, Gray DE, Holt JM, Bowling KM, Bebin EM, Grimwood J, Schmutz J, Cooper GM. Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders. HGG ADVANCES 2021; 2:100023. [PMID: 33937879 PMCID: PMC8087252 DOI: 10.1016/j.xhgg.2021.100023] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 01/07/2021] [Indexed: 02/07/2023] Open
Abstract
Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic variants are difficult or impossible to detect through typical short-read sequencing approaches. Here, we describe a genomic analysis using Pacific Biosciences circular consensus sequencing (CCS) reads, which are both long (>10 kb) and accurate (>99% bp accuracy). We used CCS on six proband-parent trios with NDDs that were unexplained despite extensive testing, including genome sequencing with short reads. We identified variants and created de novo assemblies in each trio, with global metrics indicating these datasets are more accurate and comprehensive than those provided by short-read data. In one proband, we identified a likely pathogenic (LP), de novo L1-mediated insertion in CDKL5 that results in duplication of exon 3, leading to a frameshift. In a second proband, we identified multiple large de novo structural variants, including insertion-translocations affecting DGKB and MLLT3, which we show disrupt MLLT3 transcript levels. We consider this extensive structural variation likely pathogenic. The breadth and quality of variant detection, coupled to finding variants of clinical and research interest in two of six probands with unexplained NDDs, support the hypothesis that long-read genome sequencing can substantially improve rare disease genetic discovery rates.
Collapse
Affiliation(s)
- Susan M. Hiatt
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | | - Lori H. Handley
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Ryne C. Ramaker
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Brianne B. Rogers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
- Department of Genetics, University of Alabama at Birmingham, Birmingham, AL 35924, USA
| | | | - Lori Beth Boston
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Melissa Williams
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - David E. Gray
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - James M. Holt
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Kevin M. Bowling
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - E. Martina Bebin
- Department of Neurology, University of Alabama at Birmingham, Birmingham, AL 35924, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | |
Collapse
|
29
|
Stenz L. The L1-dependant and Pol III transcribed Alu retrotransposon, from its discovery to innate immunity. Mol Biol Rep 2021; 48:2775-2789. [PMID: 33725281 PMCID: PMC7960883 DOI: 10.1007/s11033-021-06258-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 02/26/2021] [Indexed: 02/07/2023]
Abstract
The 300 bp dimeric repeats digestible by AluI were discovered in 1979. Since then, Alu were involved in the most fundamental epigenetic mechanisms, namely reprogramming, pluripotency, imprinting and mosaicism. These Alu encode a family of retrotransposons transcribed by the RNA Pol III machinery, notably when the cytosines that constitute their sequences are de-methylated. Then, Alu hijack the functions of ORF2 encoded by another transposons named L1 during reverse transcription and integration into new sites. That mechanism functions as a complex genetic parasite able to copy-paste Alu sequences. Doing that, Alu have modified even the size of the human genome, as well as of other primate genomes, during 65 million years of co-evolution. Actually, one germline retro-transposition still occurs each 20 births. Thus, Alu continue to modify our human genome nowadays and were implicated in de novo mutation causing diseases including deletions, duplications and rearrangements. Most recently, retrotransposons were found to trigger neuronal diversity by inducing mosaicism in the brain. Finally, boosted during viral infections, Alu clearly interact with the innate immune system. The purpose of that review is to give a condensed overview of all these major findings that concern the fascinating physiology of Alu from their discovery up to the current knowledge.
Collapse
Affiliation(s)
- Ludwig Stenz
- Department of Genetic Medicine and Development, Faculty of Medicine, Geneva University, Geneva, Switzerland. .,Swiss Centre for Applied Human Toxicology, University of Basel, Basel, Switzerland.
| |
Collapse
|
30
|
Zhang C, Xiao X, Li T, Li M. Translational genomics and beyond in bipolar disorder. Mol Psychiatry 2021; 26:186-202. [PMID: 32424235 DOI: 10.1038/s41380-020-0782-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Revised: 05/05/2020] [Accepted: 05/07/2020] [Indexed: 02/08/2023]
Abstract
Genome-wide association studies (GWAS) have revealed multiple genomic loci conferring risk of bipolar disorder (BD), providing hints for its underlying pathobiology. However, there are still remaining questions to answer. For example, discordance exists between BD heritability estimated with earlier epidemiological evidence and that calculated based on common GWAS variations. Where is the "missing heritability"? How can we explain the biology of the disease based on genetic findings? In this review, we summarize the accomplishments and limitations of current BD GWAS, and discuss potential reasons for the "missing heritability." In addition, progresses of research for the biological mechanisms underlying BD genetic risk using brain tissues, reprogrammed cells, and model animals are reviewed. While our knowledge of BD genetic basis is significantly promoted by these efforts, the complexities of gene regulation in the genome, the spatial-temporal heterogeneity during brain development, and the limitations of different experimental models should always be considered. Notably, several genes have been widely studied given their relatively well-characterized involvement in BD (e.g., CACAN1C and ANK3), and findings of these genes are summarized to both outline possible biological mechanisms of BD and describe examples of translating GWAS discoveries into the pathophysiology.
Collapse
Affiliation(s)
- Chen Zhang
- Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China.,Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Xiao Xiao
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Tao Li
- Mental Health Center and Psychiatric Laboratory, State Key Laboratory of Biotherapy, West China Hospital of Sichuan University, Chengdu, Sichuan, China. .,West China Brain Research Center, West China Hospital of Sichuan University, Chengdu, Sichuan, China.
| | - Ming Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China.
| |
Collapse
|
31
|
Barrow TM, Wong Doo N, Milne RL, Giles GG, Willmore E, Strathdee G, Byun HM. Analysis of retrotransposon subfamily DNA methylation reveals novel early epigenetic changes in chronic lymphocytic leukemia. Haematologica 2021; 106:98-110. [PMID: 31919093 PMCID: PMC7776340 DOI: 10.3324/haematol.2019.228478] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 01/07/2020] [Indexed: 11/30/2022] Open
Abstract
Retrotransposons such as LINE-1 and Alu comprise >25% of the human genome. While global hypomethylation of these elements has been widely reported in solid tumours, their epigenetic dysregulation is yet to be characterised in chronic lymphocytic leukemia (CLL), and there has been scant consideration of their evolutionary history that mediates sensitivity to hypomethylation. Here, we developed an approach for locus- and evolutionary subfamily-specific analysis of retrotransposons using the Illumina Infinium Human Methylation 450K microarray platform, which we applied to publicly-available datasets from CLL and other haematological malignancies. We identified 9,797 microarray probes mapping to 117 LINE-1 subfamilies and 13,130 mapping to 37 Alu subfamilies. Of these, 10,782 were differentially methylated (PFDR<0.05) in CLL patients (n=139) compared with healthy individuals (n=14), with enrichment at enhancers (P=0.002). Differential methylation was associated with evolutionary age of LINE-1 (r2=0.31, P=0.003) and Alu (r2=0.74, P=0.002) elements, with greater hypomethylation of older subfamilies (L1M, AluJ). Locus-specific hypomethylation was associated with differential expression of proximal genes, including DCLK2, HK1, ILRUN, TANK, TBCD, TNFRSF1B and TXNRD2, with higher expression of DCLK2 and TNFRSF1B associated with reduced patient survival. Hypomethylation at nine loci was highly frequent in CLL (>90% patients) but not observed in healthy individuals or other leukaemias, and was detectable in blood samples taken prior to CLL diagnosis in 9 of 82 individuals from the Melbourne Collaborative Cohort Study. Our results demonstrate differential methylation of retrotransposons in CLL by their evolutionary heritage that modulates expression of proximal genes.
Collapse
Affiliation(s)
- Timothy M Barrow
- Faculty of Health Sciences and Wellbeing, University of Sunderland, Sunderland, United Kingdom
| | - Nicole Wong Doo
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Australia
| | - Roger L Milne
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Australia
| | - Graham G Giles
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Australia
| | - Elaine Willmore
- Northern Institute for Cancer Research, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Gordon Strathdee
- Northern Institute for Cancer Research, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Hyang-Min Byun
- Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, United Kingdom
| |
Collapse
|
32
|
Yamamoto G, Miyabe I, Tanaka K, Kakuta M, Watanabe M, Kawakami S, Ishida H, Akagi K. SVA retrotransposon insertion in exon of MMR genes results in aberrant RNA splicing and causes Lynch syndrome. Eur J Hum Genet 2020; 29:680-686. [PMID: 33293698 DOI: 10.1038/s41431-020-00779-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 10/30/2020] [Accepted: 11/17/2020] [Indexed: 11/10/2022] Open
Abstract
Lynch syndrome is an autosomal dominant hereditary cancer syndrome in which many cancers develop, the main one being colorectal cancer. Germline pathogenic variants in one of four mismatch repair (MMR) genes are known to be causative of this disease. Accurate diagnosis using genetic testing can greatly benefit the health of those affected. Recently, owing to the improvement of sequence techniques, complicated variants affecting the functions of MMR genes were discovered. In this study, we analyzed insertions of a retrotransposon-like sequence in exon 5 of the MSH6 gene and exon 3 of the MSH2 gene found in Japanese families suspected of having Lynch syndrome. Both of these insertions induced aberrant splicing, and these variants were successfully identified by mRNA sequencing or visual observation of mapping results, although a standard DNA-seq analysis pipeline failed to detect them. The insertion sequences were ~2.5 kbp in length and were found to have the structure of an SVA retrotransposon (SVA). One SVA sequence was not present in the hg19 or hg38 reference genome, but was in a Japanese-specific reference sequence (JRGv2). Our study illustrates the difficulties of identifying SVA insertions in disease genes, and that the possibility of polymorphic insertions should be considered when analyzing mobile elements.
Collapse
Affiliation(s)
- Gou Yamamoto
- Department of Molecular Diagnosis and Cancer Prevention, Saitama Cancer Center, 780 Komuro, Ina-machi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Izumi Miyabe
- Department of Molecular Diagnosis and Cancer Prevention, Saitama Cancer Center, 780 Komuro, Ina-machi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Keisuke Tanaka
- Department of Molecular Diagnosis and Cancer Prevention, Saitama Cancer Center, 780 Komuro, Ina-machi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Miho Kakuta
- Department of Molecular Diagnosis and Cancer Prevention, Saitama Cancer Center, 780 Komuro, Ina-machi, Kitaadachi-gun, Saitama, 362-0806, Japan
| | - Motoko Watanabe
- Department of Clinical Genetics, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Satoru Kawakami
- Department of Urology, Saitama Medical Center, Saitama Medical University, Kawagoe, Saitama, Japan
| | - Hideyuki Ishida
- Department of Digestive Tract and General Surgery, Saitama Medical Center, Saitama Medical University, Kawagoe, Saitama, Japan
| | - Kiwamu Akagi
- Department of Molecular Diagnosis and Cancer Prevention, Saitama Cancer Center, 780 Komuro, Ina-machi, Kitaadachi-gun, Saitama, 362-0806, Japan.
| |
Collapse
|
33
|
A SINE-VNTR- Alu in the LRIG2 Promoter Is Associated with Gene Expression at the Locus. Int J Mol Sci 2020; 21:ijms21228486. [PMID: 33187279 PMCID: PMC7697779 DOI: 10.3390/ijms21228486] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 11/06/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open
Abstract
The hominid SINE-VNTR-Alu (SVA) retrotransposons represent a repertoire of genomic variation which could have significant effects on genome function. A human-specific SVA in the promoter region of the gene leucine-rich repeats and immunoglobulin-like domains 2 (LRIG2), which we termed SVA_LRIG2, is a common retrotransposon insertion polymorphism (RIP), defined as an element which is polymorphic for its presence or absence in the genome. We hypothesised that this RIP might be associated with differential levels of expression of LRIG2. The RIP genotype of SVA_LRIG2 was determined in a subset of frontal cortex DNA samples from the North American Brain Expression Consortium (NABEC) cohort and was imputed for a larger set of that cohort. Utilising available frontal cortex total RNA-seq and CpG methylation data for this cohort, we observed that increased allele dosage of SVA_LRIG2 was non-significantly associated with a decrease in transcription from the region and significantly associated with increased methylation of the CpG probe nearest to SVA_LRIG2, i.e., SVA_LRIG2 is a significant methylation quantitative trait loci (mQTL) at the LRIG2 locus. These data are consistent with SVA_LRIG2 being a transcriptional regulator, which in part may involve epigenetic modulation.
Collapse
|
34
|
Tong YR, Geng C, Guan YZ, Zhao YH, Ren HT, Yao FX, Ling C, Wang DC, Chen L, Cui LY, Zhang SY, Dai Y. A Comprehensive Analysis of 2013 Dystrophinopathies in China: A Report From National Rare Disease Center. Front Neurol 2020; 11:572006. [PMID: 33101180 PMCID: PMC7554367 DOI: 10.3389/fneur.2020.572006] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 08/24/2020] [Indexed: 12/17/2022] Open
Abstract
Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy (BMD) are X-linked recessive neuromuscular disorders caused by mutations in DMD. A high-quality database of DMD/BMD is essential not only for clinical practice but also for fundamental research. Here, we aimed to build the largest Chinese national dystrophinopathy database using the National Rare Diseases Registry System of China. Peking Union Medical College Hospital (PUMCH) was the National Rare Diseases Center of China. This research involved 2013 patients with dystrophinopathies, whose diagnoses were confirmed; they were registered and followed up at PUMCH from March 2011 to December 2018. Family history, clinical signs, and treatment data were reported for patients with DMD and BMD at different rates. All six serum biochemical indexes could accurately distinguish between DMD and BMD patients. Copy number variations were the most frequent mutation type (79.2% in DMD and 84.3% in BMD), of which large deletions accounted for 88.4 and 88.6%, large duplications accounted for 11.6 and 11.4% in DMD and BMD, respectively. An exon deletion hotspot, located in exons 45-54, was observed in DMD, and intron 44 was the most frequent deletion starting point (26.5%). Duplication and single nucleotide variations appeared to be uniformly distributed among all exons. Eleven patients were identified to have ultrarare mutation types. Eleven other patients suffered from two separate mutations simultaneously, some of which may have taken place via dependent mechanisms. Thus, we have established the largest hospital-based Chinese dystrophinopathy database via the National Rare Diseases Registry System. This study provides valuable information for further diagnostic and therapeutic studies of dystrophinopathy.
Collapse
Affiliation(s)
- Yuan-Ren Tong
- Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Chang Geng
- Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Yu-Zhou Guan
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Yan-Huan Zhao
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Hai-Tao Ren
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Feng-Xia Yao
- Laboratory of Clinical Genetics, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Chao Ling
- Laboratory of Clinical Genetics, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Dan-Chen Wang
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Lin Chen
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Li-Ying Cui
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Shu-Yang Zhang
- Department of Cardiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Yi Dai
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| |
Collapse
|
35
|
Cao X, Zhang Y, Payer LM, Lords H, Steranka JP, Burns KH, Xing J. Polymorphic mobile element insertions contribute to gene expression and alternative splicing in human tissues. Genome Biol 2020; 21:185. [PMID: 32718348 PMCID: PMC7385971 DOI: 10.1186/s13059-020-02101-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 07/14/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Mobile elements are a major source of structural variants in the human genome, and some mobile elements can regulate gene expression and transcript splicing. However, the impact of polymorphic mobile element insertions (pMEIs) on gene expression and splicing in diverse human tissues has not been thoroughly studied. The multi-tissue gene expression and whole genome sequencing data generated by the Genotype-Tissue Expression (GTEx) project provide a great opportunity to systematically evaluate the role of pMEIs in regulating gene expression in human tissues. RESULTS Using the GTEx whole genome sequencing data, we identify 20,545 high-quality pMEIs from 639 individuals. Coupling pMEI genotypes with gene expression profiles, we identify pMEI-associated expression quantitative trait loci (eQTLs) and splicing quantitative trait loci (sQTLs) in 48 tissues. Using joint analyses of pMEIs and other genomic variants, pMEIs are predicted to be the potential causal variant for 3522 eQTLs and 3717 sQTLs. The pMEI-associated eQTLs and sQTLs show a high level of tissue specificity, and these pMEIs are enriched in the proximity of affected genes and in regulatory elements. Using reporter assays, we confirm that several pMEIs associated with eQTLs and sQTLs can alter gene expression levels and isoform proportions, respectively. CONCLUSION Overall, our study shows that pMEIs are associated with thousands of gene expression and splicing variations, indicating that pMEIs could have a significant role in regulating tissue-specific gene expression and transcript splicing. Detailed mechanisms for the role of pMEIs in gene regulation in different tissues will be an important direction for future studies.
Collapse
Affiliation(s)
- Xiaolong Cao
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Yeting Zhang
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
- Human Genetic Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Hannah Lords
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Jared P Steranka
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Jinchuan Xing
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
- Human Genetic Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
| |
Collapse
|
36
|
Damert A. LINE-1 ORF1p does not determine substrate preference for human/orangutan SVA and gibbon LAVA. Mob DNA 2020; 11:27. [PMID: 32676128 PMCID: PMC7353768 DOI: 10.1186/s13100-020-00222-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 07/06/2020] [Indexed: 12/28/2022] Open
Abstract
Background Non-autonomous VNTR (Variable Number of Tandem Repeats) composite retrotransposons – SVA (SINE-R-VNTR-Alu) and LAVA (L1-Alu-VNTR-Alu) – are specific to hominoid primates. SVA expanded in great apes, LAVA in gibbon. Both SVA and LAVA have been shown to be mobilized by the autonomous LINE-1 (L1)-encoded protein machinery in a cell-based assay in trans. The efficiency of human SVA retrotransposition in vitro has, however, been considerably lower than would be expected based on recent pedigree-based in vivo estimates. The VNTR composite elements across hominoids – gibbon LAVA, orangutan SVA_A descendants and hominine SVA_D descendants – display characteristic structures of the 5′ Alu-like domain and the VNTR. Different partner L1 subfamilies are currently active in each of the lineages. The possibility that the lineage-specific types of VNTR composites evolved in response to evolutionary changes in their autonomous partners, particularly in the nucleic acid binding L1 ORF1-encoded protein, has not been addressed. Results Here I report the identification and functional characterization of a highly active human SVA element using an improved mneo retrotransposition reporter cassette. The modified cassette (mneoM) minimizes splicing between the VNTR of human SVAs and the neomycin phosphotransferase stop codon. SVA deletion analysis provides evidence that key elements determining its mobilization efficiency reside in the VNTR and 5′ hexameric repeats. Simultaneous removal of the 5′ hexameric repeats and part of the VNTR has an additive negative effect on mobilization rates. Taking advantage of the modified reporter cassette that facilitates robust cross-species comparison of SVA/LAVA retrotransposition, I show that the ORF1-encoded proteins of the L1 subfamilies currently active in gibbon, orangutan and human do not display substrate preference for gibbon LAVA versus orangutan SVA versus human SVA. Finally, I demonstrate that an orangutan-derived ORF1p supports only limited retrotransposition of SVA/LAVA in trans, despite being fully functional in L1 mobilization in cis. Conclusions Overall, the analysis confirms SVA as a highly active human retrotransposon and preferred substrate of the L1-encoded protein machinery. Based on the results obtained in human cells coevolution of L1 ORF1p and VNTR composites does not appear very likely. The changes in orangutan L1 ORF1p that markedly reduce its mobilization capacity in trans might explain the different SVA insertion rates in the orangutan and hominine lineages, respectively.
Collapse
Affiliation(s)
- Annette Damert
- Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
| |
Collapse
|
37
|
Jönsson ME, Garza R, Johansson PA, Jakobsson J. Transposable Elements: A Common Feature of Neurodevelopmental and Neurodegenerative Disorders. Trends Genet 2020; 36:610-623. [PMID: 32499105 DOI: 10.1016/j.tig.2020.05.004] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 05/11/2020] [Accepted: 05/13/2020] [Indexed: 12/30/2022]
Abstract
The etiology of most neurological disorders is poorly understood and current treatments are largely ineffective. New ideas and concepts are therefore vitally important for future research in this area. This review explores the concept that dysregulation of transposable elements (TEs) contributes to the appearance and pathology of neurodevelopmental and neurodegenerative disorders. Despite TEs making up at least half of the human genome, they are vastly understudied in relation to brain disorders. However, recent advances in sequencing technologies and gene editing approaches are now starting to unravel the pathological role of TEs. Aberrant activation of TEs has been found in many neurological disorders; the resulting pathogenic effects, which include alterations of gene expression, neuroinflammation, and direct neurotoxicity, are starting to be resolved. An increased understanding of the relationship between TEs and pathological processes in the brain improves the potential for novel diagnostics and interventions for brain disorders.
Collapse
Affiliation(s)
- Marie E Jönsson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Raquel Garza
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Pia A Johansson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Johan Jakobsson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden.
| |
Collapse
|
38
|
Burns KH. Our Conflict with Transposable Elements and Its Implications for Human Disease. ANNUAL REVIEW OF PATHOLOGY-MECHANISMS OF DISEASE 2020; 15:51-70. [PMID: 31977294 DOI: 10.1146/annurev-pathmechdis-012419-032633] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Our genome is a historic record of successive invasions of mobile genetic elements. Like other eukaryotes, we have evolved mechanisms to limit their propagation and minimize the functional impact of new insertions. Although these mechanisms are vitally important, they are imperfect, and a handful of retroelement families remain active in modern humans. This review introduces the intrinsic functions of transposons, the tactics employed in their restraint, and the relevance of this conflict to human pathology. The most straightforward examples of disease-causing transposable elements are germline insertions that disrupt a gene and result in a monogenic disease allele. More enigmatic are the abnormal patterns of transposable element expression in disease states. Changes in transposon regulation and cellular responses to their expression have implicated these sequences in diseases as diverse as cancer, autoimmunity, and neurodegeneration. Distinguishing their epiphenomenal from their pathogenic effects may provide wholly new perspectives on our understanding of disease.
Collapse
Affiliation(s)
- Kathleen H Burns
- Department of Pathology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA;
| |
Collapse
|
39
|
Goubert C, Thomas J, Payer LM, Kidd JM, Feusier J, Watkins WS, Burns KH, Jorde LB, Feschotte C. TypeTE: a tool to genotype mobile element insertions from whole genome resequencing data. Nucleic Acids Res 2020; 48:e36. [PMID: 32067044 PMCID: PMC7102983 DOI: 10.1093/nar/gkaa074] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 01/08/2020] [Accepted: 02/11/2020] [Indexed: 12/12/2022] Open
Abstract
Alu retrotransposons account for more than 10% of the human genome, and insertions of these elements create structural variants segregating in human populations. Such polymorphic Alus are powerful markers to understand population structure, and they represent variants that can greatly impact genome function, including gene expression. Accurate genotyping of Alus and other mobile elements has been challenging. Indeed, we found that Alu genotypes previously called for the 1000 Genomes Project are sometimes erroneous, which poses significant problems for phasing these insertions with other variants that comprise the haplotype. To ameliorate this issue, we introduce a new pipeline - TypeTE - which genotypes Alu insertions from whole-genome sequencing data. Starting from a list of polymorphic Alus, TypeTE identifies the hallmarks (poly-A tail and target site duplication) and orientation of Alu insertions using local re-assembly to reconstruct presence and absence alleles. Genotype likelihoods are then computed after re-mapping sequencing reads to the reconstructed alleles. Using a high-quality set of PCR-based genotyping of >200 loci, we show that TypeTE improves genotype accuracy from 83% to 92% in the 1000 Genomes dataset. TypeTE can be readily adapted to other retrotransposon families and brings a valuable toolbox addition for population genomics.
Collapse
Affiliation(s)
- Clément Goubert
- Department of Molecular Biology and Genetics, 215 Tower Rd, Cornell University, Ithaca, NY 14853, USA
| | - Jainy Thomas
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Julie Feusier
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - W Scott Watkins
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, 215 Tower Rd, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
40
|
Zhou W, Emery SB, Flasch DA, Wang Y, Kwan KY, Kidd JM, Moran JV, Mills RE. Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology. Nucleic Acids Res 2020; 48:1146-1163. [PMID: 31853540 PMCID: PMC7026601 DOI: 10.1093/nar/gkz1173] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 11/14/2019] [Accepted: 12/05/2019] [Indexed: 11/13/2022] Open
Abstract
Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.
Collapse
Affiliation(s)
- Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Sarah B Emery
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Diane A Flasch
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Yifan Wang
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Kenneth Y Kwan
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Molecular and Behavioral Neuroscience Institute, University of Michigan Medical School, 109 Zina Pitcher Place, Ann Arbor, MI 48109, USA
| | - Jeffrey M Kidd
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Department of Internal Medicine, University of Michigan, 1500 East Medical Center Drive, Ann Arbor, MI 48109, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| |
Collapse
|
41
|
Loh JW, Ha H, Lin T, Sun N, Burns KH, Xing J. Integrated Mobile Element Scanning (ME-Scan) method for identifying multiple types of polymorphic mobile element insertions. Mob DNA 2020; 11:12. [PMID: 32110248 PMCID: PMC7035633 DOI: 10.1186/s13100-020-00207-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Accepted: 02/14/2020] [Indexed: 01/29/2023] Open
Abstract
Background Mobile elements are ubiquitous components of mammalian genomes and constitute more than half of the human genome. Polymorphic mobile element insertions (pMEIs) are a major source of human genomic variation and are gaining research interest because of their involvement in gene expression regulation, genome integrity, and disease. Results Building on our previous Mobile Element Scanning (ME-Scan) protocols, we developed an integrated ME-Scan protocol to identify three major active families of human mobile elements, AluYb, L1HS, and SVA. This approach selectively amplifies insertion sites of currently active retrotransposons for Illumina sequencing. By pooling the libraries together, we can identify pMEIs from all three mobile element families in one sequencing run. To demonstrate the utility of the new ME-Scan protocol, we sequenced 12 human parent-offspring trios. Our results showed high sensitivity (> 90%) and accuracy (> 95%) of the protocol for identifying pMEIs in the human genome. In addition, we also tested the feasibility of identifying somatic insertions using the protocol. Conclusions The integrated ME-Scan protocol is a cost-effective way to identify novel pMEIs in the human genome. In addition, by developing the protocol to detect three mobile element families, we demonstrate the flexibility of the ME-Scan protocol. We present instructions for the library design, a sequencing protocol, and a computational pipeline for downstream analyses as a complete framework that will allow researchers to easily adapt the ME-Scan protocol to their own projects in other genomes.
Collapse
Affiliation(s)
- Jui Wan Loh
- 1Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ 08854 USA
| | - Hongseok Ha
- 1Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ 08854 USA.,2Human Genetic Institute of New Jersey, Rutgers, the State University of New Jersey, Piscataway, 08854 NJ USA
| | - Timothy Lin
- 1Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ 08854 USA
| | - Nawei Sun
- 1Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ 08854 USA.,2Human Genetic Institute of New Jersey, Rutgers, the State University of New Jersey, Piscataway, 08854 NJ USA
| | - Kathleen H Burns
- 3Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, 21205 MD USA
| | - Jinchuan Xing
- 1Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ 08854 USA.,2Human Genetic Institute of New Jersey, Rutgers, the State University of New Jersey, Piscataway, 08854 NJ USA
| |
Collapse
|
42
|
McKerrow W, Tang Z, Steranka JP, Payer LM, Boeke JD, Keefe D, Fenyö D, Burns KH, Liu C. Human transposon insertion profiling by sequencing (TIPseq) to map LINE-1 insertions in single cells. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190335. [PMID: 32075555 PMCID: PMC7061987 DOI: 10.1098/rstb.2019.0335] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Long interspersed element-1 (LINE-1, L1) sequences, which comprise about 17% of human genome, are the product of one of the most active types of mobile DNAs in modern humans. LINE-1 insertion alleles can cause inherited and de novo genetic diseases, and LINE-1-encoded proteins are highly expressed in some cancers. Genome-wide LINE-1 mapping in single cells could be useful for defining somatic and germline retrotransposition rates, and for enabling studies to characterize tumour heterogeneity, relate insertions to transcriptional and epigenetic effects at the cellular level, or describe cellular phylogenies in development. Our laboratories have reported a genome-wide LINE-1 insertion site mapping method for bulk DNA, named transposon insertion profiling by sequencing (TIPseq). There have been significant barriers applying LINE-1 mapping to single cells, owing to the chimeric artefacts and features of repetitive sequences. Here, we optimize a modified TIPseq protocol and show its utility for LINE-1 mapping in single lymphoblastoid cells. Results from single-cell TIPseq experiments compare well to known LINE-1 insertions found by whole-genome sequencing and TIPseq on bulk DNA. Among the several approaches we tested, whole-genome amplification by multiple displacement amplification followed by restriction enzyme digestion, vectorette ligation and LINE-1-targeted PCR had the best assay performance. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.
Collapse
Affiliation(s)
- Wilson McKerrow
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, USA
| | - Zuojian Tang
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, USA
| | - Jared P Steranka
- Department of Pathology, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA
| | - Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA
| | - Jef D Boeke
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, USA
| | - David Keefe
- Department of Obstetrics and Gynecology, New York University Langone School of Medicine, 462 First Avenue, New York, NY 10016, USA.,Department of Cell Biology, New York University Langone School of Medicine, 462 First Avenue, New York, NY 10016, USA
| | - David Fenyö
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA.,High Throughput (HiT) Biology Center, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA.,Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, 401N Broadway, Baltimore, MD 21231, USA
| | - Chunhong Liu
- Department of Pathology, Johns Hopkins University School of Medicine, 733N Broadway, Baltimore, MD 21205, USA
| |
Collapse
|
43
|
Lou C, Goodier JL, Qiang R. A potential new mechanism for pregnancy loss: considering the role of LINE-1 retrotransposons in early spontaneous miscarriage. Reprod Biol Endocrinol 2020; 18:6. [PMID: 31964400 PMCID: PMC6971995 DOI: 10.1186/s12958-020-0564-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 01/07/2020] [Indexed: 12/14/2022] Open
Abstract
LINE1 retrotransposons are mobile DNA elements that copy and paste themselves into new sites in the genome. To ensure their evolutionary success, heritable new LINE-1 insertions accumulate in cells that can transmit genetic information to the next generation (i.e., germ cells and embryonic stem cells). It is our hypothesis that LINE1 retrotransposons, insertional mutagens that affect expression of genes, may be causal agents of early miscarriage in humans. The cell has evolved various defenses restricting retrotransposition-caused mutation, but these are occasionally relaxed in certain somatic cell types, including those of the early embryo. We predict that reduced suppression of L1s in germ cells or early-stage embryos may lead to excessive genome mutation by retrotransposon insertion, or to the induction of an inflammatory response or apoptosis due to increased expression of L1-derived nucleic acids and proteins, and so disrupt gene function important for embryogenesis. If correct, a novel threat to normal human development is revealed, and reverse transcriptase therapy could be one future strategy for controlling this cause of embryonic damage in patients with recurrent miscarriages.
Collapse
Affiliation(s)
- Chao Lou
- Department of Genetics, Northwest Women’s and Children’s Hospital, 1616 Yanxiang Road, Xi’an, Shaanxi Province People’s Republic of China
| | - John L. Goodier
- 0000 0001 2171 9311grid.21107.35McKusick-Nathans Deartment of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD USA
| | - Rong Qiang
- Department of Genetics, Northwest Women’s and Children’s Hospital, 1616 Yanxiang Road, Xi’an, Shaanxi Province People’s Republic of China
| |
Collapse
|
44
|
Li C, Jiang Y, Li S. LEMON: a method to construct the local strains at horizontal gene transfer sites in gut metagenomics. BMC Bioinformatics 2019; 20:702. [PMID: 31881904 PMCID: PMC6933643 DOI: 10.1186/s12859-019-3301-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 12/02/2019] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Horizontal Gene Transfer (HGT) refers to the transfer of genetic materials between organisms through mechanisms other than parent-offspring inheritance. HGTs may affect human health through a large number of microorganisms, especially the gut microbiomes which the human body harbors. The transferred segments may lead to complicated local genome structural variations. Details of the local genome structure can elucidate the effects of the HGTs. RESULTS In this work, we propose a graph-based method to reconstruct the local strains from the gut metagenomics data at the HGT sites. The method is implemented in a package named LEMON. The simulated results indicate that the method can identify transferred segments accurately on reference sequences of the microbiome. Simulation results illustrate that LEMON could recover local strains with complicated structure variation. Furthermore, the gene fusion points detected in real data near HGT breakpoints validate the accuracy of LEMON. Some strains reconstructed by LEMON have a replication time profile with lower standard error, which demonstrates HGT events recovered by LEMON is reliable. CONCLUSIONS Through LEMON we could reconstruct the sequence structure of bacteria, which harbors HGT events. This helps us to study gene flow among different microbial species.
Collapse
Affiliation(s)
- Chen Li
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong SAR, HongKong, China
| | - Yiqi Jiang
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong SAR, HongKong, China
| | - Shuaicheng Li
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong SAR, HongKong, China
| |
Collapse
|
45
|
Pre-mRNA structures forming circular RNAs. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2019; 1862:194410. [DOI: 10.1016/j.bbagrm.2019.194410] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 07/16/2019] [Accepted: 07/17/2019] [Indexed: 12/25/2022]
|
46
|
Feusier J, Watkins WS, Thomas J, Farrell A, Witherspoon DJ, Baird L, Ha H, Xing J, Jorde LB. Pedigree-based estimation of human mobile element retrotransposition rates. Genome Res 2019; 29:1567-1577. [PMID: 31575651 PMCID: PMC6771411 DOI: 10.1101/gr.247965.118] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 08/14/2019] [Indexed: 12/26/2022]
Abstract
Germline mutation rates in humans have been estimated for a variety of mutation types, including single-nucleotide and large structural variants. Here, we directly measure the germline retrotransposition rate for the three active retrotransposon elements: L1, Alu, and SVA. We used three tools for calling mobile element insertions (MEIs) (MELT, RUFUS, and TranSurVeyor) on blood-derived whole-genome sequence (WGS) data from 599 CEPH individuals, comprising 33 three-generation pedigrees. We identified 26 de novo MEIs in 437 births. The retrotransposition rate estimates for Alu elements, one in 40 births, is roughly half the rate estimated using phylogenetic analyses, a difference in magnitude similar to that observed for single-nucleotide variants. The L1 retrotransposition rate is one in 63 births and is within range of previous estimates (1:20-1:200 births). The SVA retrotransposition rate, one in 63 births, is much higher than the previous estimate of one in 900 births. Our large, three-generation pedigrees allowed us to assess parent-of-origin effects and the timing of insertion events in either gametogenesis or early embryonic development. We find a statistically significant paternal bias in Alu retrotransposition. Our study represents the first in-depth analysis of the rate and dynamics of human retrotransposition from WGS data in three-generation human pedigrees.
Collapse
Affiliation(s)
- Julie Feusier
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| | - W Scott Watkins
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| | - Jainy Thomas
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| | - Andrew Farrell
- USTAR Center for Genetic Discovery, Salt Lake City, Utah 84112, USA
| | - David J Witherspoon
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| | - Lisa Baird
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| | - Hongseok Ha
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Jinchuan Xing
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84112, USA
| |
Collapse
|
47
|
Puurand T, Kukuškina V, Pajuste FD, Remm M. AluMine: alignment-free method for the discovery of polymorphic Alu element insertions. Mob DNA 2019; 10:31. [PMID: 31360240 PMCID: PMC6639938 DOI: 10.1186/s13100-019-0174-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Accepted: 07/12/2019] [Indexed: 01/09/2023] Open
Abstract
Background Recently, alignment-free sequence analysis methods have gained popularity in the field of personal genomics. These methods are based on counting frequencies of short k-mer sequences, thus allowing faster and more robust analysis compared to traditional alignment-based methods. Results We have created a fast alignment-free method, AluMine, to analyze polymorphic insertions of Alu elements in the human genome. We tested the method on 2,241 individuals from the Estonian Genome Project and identified 28,962 potential polymorphic Alu element insertions. Each tested individual had on average 1,574 Alu element insertions that were different from those in the reference genome. In addition, we propose an alignment-free genotyping method that uses the frequency of insertion/deletion-specific 32-mer pairs to call the genotype directly from raw sequencing reads. Using this method, the concordance between the predicted and experimentally observed genotypes was 98.7%. The running time of the discovery pipeline is approximately 2 h per individual. The genotyping of potential polymorphic insertions takes between 0.4 and 4 h per individual, depending on the hardware configuration. Conclusions AluMine provides tools that allow discovery of novel Alu element insertions and/or genotyping of known Alu element insertions from personal genomes within few hours.
Collapse
Affiliation(s)
- Tarmo Puurand
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Viktoria Kukuškina
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | | | - Maido Remm
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| |
Collapse
|
48
|
Chatron N, Cassinari K, Quenez O, Baert-Desurmont S, Bardel C, Buisine MP, Calpena E, Capri Y, Corominas Galbany J, Diguet F, Edery P, Isidor B, Labalme A, Le Caignec C, Lévy J, Lecoquierre F, Lindenbaum P, Pichon O, Rollat-Farnier PA, Simonet T, Saugier-Veber P, Tabet AC, Toutain A, Wilkie AOM, Lesca G, Sanlaville D, Nicolas G, Schluth-Bolard C. Identification of mobile retrocopies during genetic testing: Consequences for routine diagnosis. Hum Mutat 2019; 40:1993-2000. [PMID: 31230393 DOI: 10.1002/humu.23845] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 05/29/2019] [Accepted: 06/17/2019] [Indexed: 12/24/2022]
Abstract
Human retrocopies, that is messenger RNA transcripts benefitting from the long interspersed element 1 machinery for retrotransposition, may have specific consequences for genomic testing. Next genetration sequencing (NGS) techniques allow the detection of such mobile elements but they may be misinterpreted as genomic duplications or be totally overlooked. We report eight observations of retrocopies detected during diagnostic NGS analyses of targeted gene panels, exome, or genome sequencing. For seven cases, while an exons-only copy number gain was called, read alignment inspection revealed a depth of coverage shift at every exon-intron junction where indels were also systematically called. Moreover, aberrant chimeric read pairs spanned entire introns or were paired with another locus for terminal exons. The 8th retrocopy was present in the reference genome and thus showed a normal NGS profile. We emphasize the existence of retrocopies and strategies to accurately detect them at a glance during genetic testing and discuss pitfalls for genetic testing.
Collapse
Affiliation(s)
- Nicolas Chatron
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Kevin Cassinari
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Olivier Quenez
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Stéphanie Baert-Desurmont
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Claire Bardel
- Bioinformatics group of the Lyon University Hospital NGS facility, Groupement Hospitalier Est, Lyon, France.,Biostatistics and Bioinformatics Department, HCL, Lyon, France
| | - Marie-Pierre Buisine
- Department of Biochemistry and Molecular Biology, JPA Research Center, Inserm UMR-S 1172, Lille University, Lille University Hospital, Lille, France
| | - Eduardo Calpena
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Yline Capri
- Genetics Department, Clinical Genetics Unit, Hôpital Universitaire Robert Debré, Paris, France
| | | | - Flavie Diguet
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Patrick Edery
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | | | - Audrey Labalme
- Genetics Department, Hospices Civils de Lyon, Lyon, France
| | - Cedric Le Caignec
- Genetics Department, CHU Nantes, Nantes, France.,INSERM UMR_S915, Institut du thorax, Nantes University, Nantes, France
| | - Jonathan Lévy
- Genetics Department, Cytogenetics Unit, Hôpital Universitaire Robert Debré, Paris, France
| | - François Lecoquierre
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Pierre Lindenbaum
- INSERM, UMR_S1087, Institut du thorax, Nantes, France.,CNRS, UMR 6291, Nantes, France
| | | | - Pierre-Antoine Rollat-Farnier
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,Bioinformatics group of the Lyon University Hospital NGS facility, Groupement Hospitalier Est, Lyon, France
| | - Thomas Simonet
- Cellular Biotechnology Center, Hospices Civils de Lyon, Lyon, France.,Nerve-Muscle Interactions Team, Institut NeuroMyoGène CNRS UMR 5310-INSERM U1217-Université Claude Bernard Lyon 1, Lyon, France
| | - Pascale Saugier-Veber
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Anne-Claude Tabet
- Genetics Department, Cytogenetics Unit, Hôpital Universitaire Robert Debré, Paris, France.,Neuroscience Department, Human Genetics and Cognitive Function Unit, Institut Pasteur, Paris, France
| | - Annick Toutain
- Genetics Department, Hôpital Bretonneau, CHU, Tours, France.,UMR 1253, iBrain, Tours University, Inserm, Tours, France
| | - Andrew O M Wilkie
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Gaetan Lesca
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Damien Sanlaville
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Gaël Nicolas
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Caroline Schluth-Bolard
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| |
Collapse
|
49
|
Li W, Yang L, Harris RS, Lin L, Olson TL, Hamele CE, Feith DJ, Loughran TP, Poss M. Retrovirus insertion site analysis of LGL leukemia patient genomes. BMC Med Genomics 2019; 12:88. [PMID: 31208405 PMCID: PMC6580525 DOI: 10.1186/s12920-019-0549-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 06/06/2019] [Indexed: 02/08/2023] Open
Abstract
Background Large granular lymphocyte (LGL) leukemia is an uncommon cancer characterized by sustained clonal proliferation of LGL cells. Antibodies reactive to retroviruses have been documented in the serum of patients with LGL leukemia. Culture or molecular approaches have to date not been successful in identifying a retrovirus. Methods Because a retrovirus must integrate into the genome of an infected cell, we focused our efforts on detecting a novel retrovirus integration site in the clonally expanded LGL cells. We present a new computational tool that uses long-insert mate pair sequence data to search the genome of LGL leukemia cells for retrovirus integration sites. We also utilize recently published methods to interrogate the status of polymorphic human endogenous retrovirus type K (HERV-K) provirus in patient genomes. Results Our data show that there are no new retrovirus insertions in LGL genomes of LGL leukemia patients. However, our insertion call tool did detect four HERV-K provirus integration sites that are polymorphic in the human population but absent from the human reference genome, hg19. To determine if the prevalence of these or other polymorphic proviral HERV-Ks differed between LGL leukemia patients and the general population, we used a recently developed tool that reports sites in the human genome occupied by a known proviral HERV-K. We report that there are significant differences in the number of polymorphic HERV-Ks in the genomes of LGL leukemia patients of European origin compared to individuals with European ancestry in the 1000 genomes (KGP) data. Conclusions Our study confirms that the clonal expansion of LGL cells in LGL leukemia is not driven by the integration of a new infectious or endogenous retrovirus, although we do not rule out that these cells are responding to retroviral antigens produced in other cell types. However, our computational analyses revealed that the genomes of LGL leukemia patients carry a higher burden of polymorphic HERV-K proviruses compare to individuals from KGP of European ancestry. Our research emphasizes the merits of comprehensive genomic assessment of HERV-K in cancer samples and suggests that further analyses to determine contributions of HERV-K to LGL leukemia are warranted. Electronic supplementary material The online version of this article (10.1186/s12920-019-0549-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Weiling Li
- The School of Electrical Engineering and Computer Science, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Lei Yang
- Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA.,Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Robert S Harris
- Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Lin Lin
- Department of Statistics, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Thomas L Olson
- University of Virginia Cancer Center and Department of Medicine, Division of Hematology & Oncology, University of Virginia, Charlottesville, Virginia, 22908, USA
| | - Cait E Hamele
- University of Virginia Cancer Center and Department of Medicine, Division of Hematology & Oncology, University of Virginia, Charlottesville, Virginia, 22908, USA
| | - David J Feith
- University of Virginia Cancer Center and Department of Medicine, Division of Hematology & Oncology, University of Virginia, Charlottesville, Virginia, 22908, USA
| | - Thomas P Loughran
- University of Virginia Cancer Center and Department of Medicine, Division of Hematology & Oncology, University of Virginia, Charlottesville, Virginia, 22908, USA
| | - Mary Poss
- Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA. .,Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. .,University of Virginia Cancer Center and Department of Medicine, Division of Hematology & Oncology, University of Virginia, Charlottesville, Virginia, 22908, USA.
| |
Collapse
|
50
|
Jamil MA, Sharma A, Nuesgen N, Pezeshkpoor B, Heimbach A, Pavlova A, Oldenburg J, El-Maarri O. F8 Inversions at Xq28 Causing Hemophilia A Are Associated With Specific Methylation Changes: Implication for Molecular Epigenetic Diagnosis. Front Genet 2019; 10:508. [PMID: 31191618 PMCID: PMC6548806 DOI: 10.3389/fgene.2019.00508] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Accepted: 05/09/2019] [Indexed: 12/27/2022] Open
Abstract
Diverse DNA structural variations (SVs) in human cancers and several other diseases are well documented. For genomic inversions in particular, the disease causing mechanism may not be clear, especially if the inversion border does not cross a coding sequence. Understanding about the molecular processes of these inverted genomic sequences, in a mainly epigenetic context, may provide additional information regarding sequence-specific regulation of gene expression in human diseases. Herein, we study one such inversion hotspot at Xq28, which leads to the disruption of F8 gene and results in hemophilia A phenotype. To determine the epigenetic consequence of this rearrangement, we evaluated DNA methylation levels of 12 CpG rich regions with the coverage of 550 kb by using bisulfite-pyrosequencing and next-generation sequencing (NGS)-based bisulfite re-sequencing enrichment assay. Our results show that this inversion prone area harbors widespread methylation changes at the studied regions. However, only 5/12 regions showed significant methylation changes, specifically in case of intron 1 inversion (two regions), intron 22 inversion (two regions) and one common region in both inversions. Interestingly, these aberrant methylated regions were found to be overlapping with the inversion proximities. In addition, two CpG sites reached 100% sensitivity and specificity to discriminate wild type from intron 22 and intron 1 inversion samples. While we found age to be an influencing factor on methylation levels at some regions, covariate analysis still confirms the differential methylation induced by inversion, regardless of age. The hemophilia A methylation inversion "HAMI" assay provides an advantage over conventional PCR-based methods, which may not detect novel rare genomic rearrangements. Taken together, we showed that genomic inversions in the F8 (Xq28) region are associated with detectable changes in methylation levels and can be used as an epigenetic diagnostic marker.
Collapse
Affiliation(s)
- Muhammad Ahmer Jamil
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - Amit Sharma
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - Nicole Nuesgen
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - Behnaz Pezeshkpoor
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - André Heimbach
- Institute of Human Genetics, School of Medicine, University of Bonn - University Hospital Bonn, Bonn, Germany
| | - Anne Pavlova
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - Johannes Oldenburg
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| | - Osman El-Maarri
- Institute of Experimental Hematology and Transfusion Medicine, University of Bonn, Bonn, Germany
| |
Collapse
|