101
|
Wang C, Liang C. The insertion and dysregulation of transposable elements in osteosarcoma and their association with patient event-free survival. Sci Rep 2022; 12:377. [PMID: 35013466 PMCID: PMC8748539 DOI: 10.1038/s41598-021-04208-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 11/23/2021] [Indexed: 12/11/2022] Open
Abstract
The dysregulation of transposable elements (TEs) has been explored in a variety of cancers. However, TE activities in osteosarcoma (OS) have not been extensively studied yet. By integrative analysis of RNA-seq, whole-genome sequencing (WGS), and methylation data, we showed aberrant TE activities associated with dysregulations of TEs in OS tumors. Specifically, expression levels of LINE-1 and Alu of different evolutionary ages, as well as subfamilies of SVA and HERV-K, were significantly up-regulated in OS tumors, accompanied by enhanced DNA repair responses. We verified the characteristics of LINE-1 mediated TE insertions, including target site duplication (TSD) length (centered around 15 bp) and preferential insertions into intergenic and AT-rich regions as well as intronic regions of longer genes. By filtering polymorphic TE insertions reported in 1000 genome project (1KGP), besides 148 tumor-specific somatic TE insertions, we found most OS patient-specific TE insertions (3175 out of 3326) are germline insertions, which are associated with genes involved in neuronal processes or with transcription factors important for cancer development. In addition to 68 TE-affected cancer genes, we found recurrent germline TE insertions in 72 non-cancer genes with high frequencies among patients. We also found that +/− 500 bps flanking regions of transcription start sites (TSS) of LINE-1 (young) and Alu showed lower methylation levels in OS tumor samples than controls. Interestingly, by incorporating patient clinical data and focusing on TE activities in OS tumors, our data analysis suggested that higher TE insertions in OS tumors are associated with a longer event-free survival time.
Collapse
Affiliation(s)
- Chao Wang
- Department of Biology, Miami University, Oxford, Ohio, 45056, USA.
| | - Chun Liang
- Department of Biology, Miami University, Oxford, Ohio, 45056, USA.
| |
Collapse
|
102
|
Bowles H, Kabiljo R, Al Khleifat A, Jones A, Quinn JP, Dobson RJB, Swanson CM, Al-Chalabi A, Iacoangeli A. An assessment of bioinformatics tools for the detection of human endogenous retroviral insertions in short-read genome sequencing data. FRONTIERS IN BIOINFORMATICS 2022; 2:1062328. [PMID: 36845320 PMCID: PMC9945273 DOI: 10.3389/fbinf.2022.1062328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 12/12/2022] [Indexed: 02/10/2023] Open
Abstract
There is a growing interest in the study of human endogenous retroviruses (HERVs) given the substantial body of evidence that implicates them in many human diseases. Although their genomic characterization presents numerous technical challenges, next-generation sequencing (NGS) has shown potential to detect HERV insertions and their polymorphisms in humans. Currently, a number of computational tools to detect them in short-read NGS data exist. In order to design optimal analysis pipelines, an independent evaluation of the available tools is required. We evaluated the performance of a set of such tools using a variety of experimental designs and datasets. These included 50 human short-read whole-genome sequencing samples, matching long and short-read sequencing data, and simulated short-read NGS data. Our results highlight a great performance variability of the tools across the datasets and suggest that different tools might be suitable for different study designs. However, specialized tools designed to detect exclusively human endogenous retroviruses consistently outperformed generalist tools that detect a wider range of transposable elements. We suggest that, if sufficient computing resources are available, using multiple HERV detection tools to obtain a consensus set of insertion loci may be ideal. Furthermore, given that the false positive discovery rate of the tools varied between 8% and 55% across tools and datasets, we recommend the wet lab validation of predicted insertions if DNA samples are available.
Collapse
Affiliation(s)
- Harry Bowles
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
| | - Renata Kabiljo
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
- Department of Biostatistics and Health Informatics, King’s College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
| | - Ahmad Al Khleifat
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
| | - Ashley Jones
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
| | - John P. Quinn
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Richard J. B. Dobson
- Department of Biostatistics and Health Informatics, King’s College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
- NIHR Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London, London, United Kingdom
- Institute of Health Informatics, University College London, London, United Kingdom
- NIHR Biomedical Research Centre, University College London Hospitals NHS Foundation Trust, London, United Kingdom
| | - Chad M. Swanson
- Department of Infectious Diseases, School of Immunology and Microbial Sciences, King’s College London, London, United Kingdom
| | - Ammar Al-Chalabi
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
- Department of Neurology, King’s College Hospital, London, United Kingdom
| | - Alfredo Iacoangeli
- Department of Basic and Clinical Neuroscience, King’s College London, Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
- Department of Biostatistics and Health Informatics, King’s College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
- NIHR Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London, London, United Kingdom
- *Correspondence: Alfredo Iacoangeli,
| |
Collapse
|
103
|
Buttler CA, Chuong EB. Emerging roles for endogenous retroviruses in immune epigenetic regulation. Immunol Rev 2022; 305:165-178. [PMID: 34816452 PMCID: PMC8766910 DOI: 10.1111/imr.13042] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 10/21/2021] [Accepted: 11/12/2021] [Indexed: 01/03/2023]
Abstract
In recent years, there has been significant progress toward understanding the transcriptional networks underlying mammalian immune responses, fueled by advances in regulatory genomic technologies. Epigenomic studies profiling immune cells have generated detailed genome-wide maps of regulatory elements that will be key to deciphering the regulatory networks underlying cellular immune responses and autoimmune disorders. Unbiased analyses of these genomic maps have uncovered endogenous retroviruses as an unexpected ally in the regulation of human immune systems. Despite their parasitic origins, studies are finding an increasing number of examples of retroviral sequences having been co-opted for beneficial immune function and regulation by the host cell. Here, we review how endogenous retroviruses have given rise to numerous regulatory elements that shape the epigenetic landscape of host immune responses. We will discuss the implications of these elements on the function, dysfunction, and evolution of innate immunity.
Collapse
|
104
|
Zverinova S, Guryev V. Variant calling: Considerations, practices, and developments. Hum Mutat 2021; 43:976-985. [PMID: 34882898 PMCID: PMC9545713 DOI: 10.1002/humu.24311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 11/02/2021] [Accepted: 12/03/2021] [Indexed: 11/10/2022]
Abstract
The success of many clinical, association, or population genetics studies critically relies on properly performed variant calling step. The variety of modern genomics protocols, techniques, and platforms makes our choices of methods and algorithms difficult and there is no "one size fits all" solution for study design and data analysis. In this review, we discuss considerations that need to be taken into account while designing the study and preparing for the experiments. We outline the variety of variant types that can be detected using sequencing approaches and highlight some specific requirements and basic principles of their detection. Finally, we cover interesting developments that enable variant calling for a broad range of applications in the genomics field. We conclude by discussing technological and algorithmic advances that have the potential to change the ways of calling DNA variants in the nearest future.
Collapse
Affiliation(s)
- Stepanka Zverinova
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| |
Collapse
|
105
|
Chuang NT, Gardner EJ, Terry DM, Crabtree J, Mahurkar AA, Rivell GL, Hong CC, Perry JA, Devine SE. Mutagenesis of human genomes by endogenous mobile elements on a population scale. Genome Res 2021; 31:2225-2235. [PMID: 34772701 PMCID: PMC8647825 DOI: 10.1101/gr.275323.121] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 09/16/2021] [Indexed: 01/22/2023]
Abstract
Several large-scale Illumina whole-genome sequencing (WGS) and whole-exome sequencing (WES) projects have emerged recently that have provided exceptional opportunities to discover mobile element insertions (MEIs) and study the impact of these MEIs on human genomes. However, these projects also have presented major challenges with respect to the scalability and computational costs associated with performing MEI discovery on tens or even hundreds of thousands of samples. To meet these challenges, we have developed a more efficient and scalable version of our mobile element locator tool (MELT) called CloudMELT. We then used MELT and CloudMELT to perform MEI discovery in 57,919 human genomes and exomes, leading to the discovery of 104,350 nonredundant MEIs. We leveraged this collection (1) to examine potentially active L1 source elements that drive the mobilization of new Alu, L1, and SVA MEIs in humans; (2) to examine the population distributions and subfamilies of these MEIs; and (3) to examine the mutagenesis of GENCODE genes, ENCODE-annotated features, and disease genes by these MEIs. Our study provides new insights on the L1 source elements that drive MEI mutagenesis and brings forth a better understanding of how this mutagenesis impacts human genomes.
Collapse
Affiliation(s)
- Nelson T Chuang
- Graduate Program in Molecular Medicine, University of Maryland, Baltimore, Baltimore, Maryland 21201, USA
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
- Division of Gastroenterology, Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Eugene J Gardner
- Graduate Program in Molecular Medicine, University of Maryland, Baltimore, Baltimore, Maryland 21201, USA
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Diane M Terry
- Graduate Program in Molecular Medicine, University of Maryland, Baltimore, Baltimore, Maryland 21201, USA
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Jonathan Crabtree
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Anup A Mahurkar
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Guillermo L Rivell
- Greenebaum Comprehensive Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Charles C Hong
- Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - James A Perry
- Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Scott E Devine
- Graduate Program in Molecular Medicine, University of Maryland, Baltimore, Baltimore, Maryland 21201, USA
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
- Greenebaum Comprehensive Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
- Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| |
Collapse
|
106
|
Payer LM, Steranka JP, Kryatova MS, Grillo G, Lupien M, Rocha PP, Burns KH. Alu insertion variants alter gene transcript levels. Genome Res 2021; 31:2236-2248. [PMID: 34799402 PMCID: PMC8647820 DOI: 10.1101/gr.261305.120] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 09/23/2021] [Indexed: 12/23/2022]
Abstract
Alu are high copy number interspersed repeats that have accumulated near genes during primate and human evolution. They are a pervasive source of structural variation in modern humans. Impacts that Alu insertions may have on gene expression are not well understood, although some have been associated with expression quantitative trait loci (eQTLs). Here, we directly test regulatory effects of polymorphic Alu insertions in isolation of other variants on the same haplotype. To screen insertion variants for those with such effects, we used ectopic luciferase reporter assays and evaluated 110 Alu insertion variants, including more than 40 with a potential role in disease risk. We observed a continuum of effects with significant outliers that up- or down-regulate luciferase activity. Using a series of reporter constructs, which included genomic context surrounding the Alu, we can distinguish between instances in which the Alu disrupts another regulator and those in which the Alu introduces new regulatory sequence. We next focused on three polymorphic Alu loci associated with breast cancer that display significant effects in the reporter assay. We used CRISPR to modify the endogenous sequences, establishing cell lines varying in the Alu genotype. Our findings indicate that Alu genotype can alter expression of genes implicated in cancer risk, including PTHLH, RANBP9, and MYC These data show that commonly occurring polymorphic Alu elements can alter transcript levels and potentially contribute to disease risk.
Collapse
Affiliation(s)
- Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Jared P Steranka
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Maria S Kryatova
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Giacomo Grillo
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 1L7, Canada
| | - Mathieu Lupien
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 1L7, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7, Canada
- Ontario Institute for Cancer Research, Toronto, Ontario M5G 0A3, Canada
| | - Pedro P Rocha
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, Maryland 20892-4340, USA
- National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
- McKusick-Nathans Institute of Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| |
Collapse
|
107
|
Scott AJ, Chiang C, Hall IM. Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes. Genome Res 2021; 31:2249-2257. [PMID: 34544830 PMCID: PMC8647827 DOI: 10.1101/gr.275488.121] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 09/14/2021] [Indexed: 11/29/2022]
Abstract
Structural variants (SVs) are an important source of human genome diversity, but their functional effects are poorly understood. We mapped 61,668 SVs in 613 individuals from the GTEx project and measured their effects on gene expression. We estimate that common SVs are causal at 2.66% of eQTLs, a 10.5-fold enrichment relative to their abundance in the genome. Duplications and deletions were the most impactful variant types, whereas the contribution of mobile element insertions was small (0.12% of eQTLs, 1.9-fold enriched). Multitissue analysis of eQTLs revealed that gene-altering SVs show more constitutive effects than other variant types, with 62.09% of coding SV-eQTLs active in all tissues with eQTL activity compared with 23.08% of coding SNV- and indel-eQTLs. Noncoding SVs, SNVs and indels show broadly similar patterns. We also identified 539 rare SVs associated with nearby gene expression outliers. Of these, 62.34% are noncoding SVs that affect gene expression but have modest enrichment at regulatory elements, showing that rare noncoding SVs are a major source of gene expression differences but remain difficult to predict from current annotations. Both common and rare SVs often affect the expression of multiple genes: SV-eQTLs affect an average of 1.82 nearby genes, whereas SNV- and indel-eQTLs affect an average of 1.09 genes, and 21.34% of rare expression-altering SVs show effects on two to nine different genes. We also observe significant effects on rare gene expression changes extending 1 Mb from the SV. This provides a mechanism by which individual SVs may have strong or pleiotropic effects on phenotypic variation.
Collapse
Affiliation(s)
- Alexandra J Scott
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Colby Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Ira M Hall
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06510, USA
| |
Collapse
|
108
|
Borges-Monroy R, Chu C, Dias C, Choi J, Lee S, Gao Y, Shin T, Park PJ, Walsh CA, Lee EA. Whole-genome analysis reveals the contribution of non-coding de novo transposon insertions to autism spectrum disorder. Mob DNA 2021; 12:28. [PMID: 34838103 PMCID: PMC8627061 DOI: 10.1186/s13100-021-00256-w] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 11/02/2021] [Indexed: 12/30/2022] Open
Abstract
Background Retrotransposons have been implicated as causes of Mendelian disease, but their role in autism spectrum disorder (ASD) has not been systematically defined, because they are only called with adequate sensitivity from whole genome sequencing (WGS) data and a large enough cohort for this analysis has only recently become available. Results We analyzed WGS data from a cohort of 2288 ASD families from the Simons Simplex Collection by establishing a scalable computational pipeline for retrotransposon insertion detection. We report 86,154 polymorphic retrotransposon insertions—including > 60% not previously reported—and 158 de novo retrotransposition events. The overall burden of de novo events was similar between ASD individuals and unaffected siblings, with 1 de novo insertion per 29, 117, and 206 births for Alu, L1, and SVA respectively, and 1 de novo insertion per 21 births total. However, ASD cases showed more de novo L1 insertions than expected in ASD genes. Additionally, we observed exonic insertions in loss-of-function intolerant genes, including a likely pathogenic exonic insertion in CSDE1, only in ASD individuals. Conclusions These findings suggest a modest, but important, impact of intronic and exonic retrotransposon insertions in ASD, show the importance of WGS for their analysis, and highlight the utility of specific bioinformatic tools for high-throughput detection of retrotransposon insertions. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-021-00256-w.
Collapse
Affiliation(s)
- Rebeca Borges-Monroy
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chong Chu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Caroline Dias
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Division of Developmental Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Jaejoon Choi
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Genetics, Harvard Medical School, MA, Boston, USA
| | - Soohyun Lee
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Yue Gao
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Pediatrics, Harvard Medical School, MA, Boston, USA
| | - Taehwan Shin
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Department of Pediatrics, Harvard Medical School, MA, Boston, USA
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Christopher A Walsh
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. .,Department of Pediatrics, Harvard Medical School, MA, Boston, USA. .,Department of Neurology, Harvard Medical School, Boston, MA, USA. .,Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, USA.
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Manton Center for Orphan Disease, Boston Children's Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. .,Department of Pediatrics, Harvard Medical School, MA, Boston, USA.
| |
Collapse
|
109
|
Petersen M, Winter S, Coimbra R, J de Jong M, Kapitonov VV, Nilsson MA. Population analysis of retrotransposons in giraffe genomes supports RTE decline and widespread LINE1 activity in Giraffidae. Mob DNA 2021; 12:27. [PMID: 34836553 PMCID: PMC8620236 DOI: 10.1186/s13100-021-00254-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 10/25/2021] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND The majority of structural variation in genomes is caused by insertions of transposable elements (TEs). In mammalian genomes, the main TE fraction is made up of autonomous and non-autonomous non-LTR retrotransposons commonly known as LINEs and SINEs (Long and Short Interspersed Nuclear Elements). Here we present one of the first population-level analysis of TE insertions in a non-model organism, the giraffe. Giraffes are ruminant artiodactyls, one of the few mammalian groups with genomes that are colonized by putatively active LINEs of two different clades of non-LTR retrotransposons, namely the LINE1 and RTE/BovB LINEs as well as their associated SINEs. We analyzed TE insertions of both types, and their associated SINEs in three giraffe genome assemblies, as well as across a population level sampling of 48 individuals covering all extant giraffe species. RESULTS The comparative genome screen identified 139,525 recent LINE1 and RTE insertions in the sampled giraffe population. The analysis revealed a drastically reduced RTE activity in giraffes, whereas LINE1 is still actively propagating in the genomes of extant (sub)-species. In concert with the extremely low activity of the giraffe RTE, we also found that RTE-dependent SINEs, namely Bov-tA and Bov-A2, have been virtually immobile in the last 2 million years. Despite the high current activity of the giraffe LINE1, we did not find evidence for the presence of currently active LINE1-dependent SINEs. TE insertion heterozygosity rates differ among the different (sub)-species, likely due to divergent population histories. CONCLUSIONS The horizontally transferred RTE/BovB and its derived SINEs appear to be close to inactivation and subsequent extinction in the genomes of extant giraffe species. This is the first time that the decline of a TE family has been meticulously analyzed from a population genetics perspective. Our study shows how detailed information about past and present TE activity can be obtained by analyzing large-scale population-level genomic data sets.
Collapse
Affiliation(s)
- Malte Petersen
- Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, 79108, Freiburg, Germany
| | - Sven Winter
- Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325, Frankfurt am Main, Germany
| | - Raphael Coimbra
- Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325, Frankfurt am Main, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Straße 13, 60438, Frankfurt am Main, Germany
| | - Menno J de Jong
- Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325, Frankfurt am Main, Germany
| | - Vladimir V Kapitonov
- Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325, Frankfurt am Main, Germany
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt am Main, Germany
| | - Maria A Nilsson
- Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325, Frankfurt am Main, Germany.
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt am Main, Germany.
| |
Collapse
|
110
|
Evaluation of copy number variants for genetic hearing loss: a review of current approaches and recent findings. Hum Genet 2021; 141:387-400. [PMID: 34811589 DOI: 10.1007/s00439-021-02365-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 09/02/2021] [Indexed: 01/22/2023]
Abstract
Structural variation includes a change in copy number, orientation, or location of a part of the genome. Copy number variants (CNVs) are a common cause of genetic hearing loss, comprising nearly 20% of diagnosed cases. While large deletions involving the gene STRC are the most common pathogenic CNVs, a significant proportion of known hearing loss genes also contain pathogenic CNVs. In this review, we provide an overview of currently used methods for detection of CNVs in genes known to cause hearing loss including molecular techniques such as multiplex ligation probe amplification (MLPA) and digital droplet polymerase chain reaction (ddPCR), array-CGH and single-nucleotide polymorphism (SNP) arrays, as well as techniques for detection of CNVs using next-generation sequencing data analysis including targeted gene panel, exome, and genome sequencing data. In addition, in this review, we compile published data on pathogenic hearing loss CNVs to provide an up-to-date overview. We show that CNVs have been identified in 29 different non-syndromic hearing loss genes. An understanding of the contribution of CNVs to genetic hearing loss is critical to the current diagnosis of hearing loss and is crucial for future gene therapies. Thus, evaluation for CNVs is required in any modern pipeline for genetic diagnosis of hearing loss.
Collapse
|
111
|
Wehbi SS, Zu Dohna H. A comparative analysis of L1 retrotransposition activities in human genomes suggests an ongoing increase in L1 number despite an evolutionary trend towards lower activity. Mob DNA 2021; 12:26. [PMID: 34782009 PMCID: PMC8594186 DOI: 10.1186/s13100-021-00255-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 10/26/2021] [Indexed: 11/18/2022] Open
Abstract
Background LINE-1 (Long Interspersed Nuclear Elements, L1) retrotransposons are the only autonomously active transposable elements in the human genome. The evolution of L1 retrotransposition rates and its implications for L1 dynamics are poorly understood. Retrotransposition rates are commonly measured in cell culture-based assays, but it is unclear how well these measurements provide insight into L1 population dynamics. This study applied comparative methods to estimate parameters for the evolution of retrotransposition rates, and infer L1 dynamics from these estimates. Results Our results show that the rates at which new L1s emerge in the human population correlate positively to cell-culture based retrotransposition activities, that there is an evolutionary trend towards lower retrotransposition activity, and that this evolutionary trend is not sufficient to counter-balance the increase in active L1s resulting from continuing retrotransposition. Conclusions Together, these findings support a model of the population-level L1 retrotransposition dynamics that is consistent with prior expectations and indicate the remaining gaps in the understanding of L1 dynamics in human genomes.
Collapse
Affiliation(s)
- Sawsan Sami Wehbi
- Department of Biology, American University of Beirut, Beirut, Lebanon
| | - Heinrich Zu Dohna
- Department of Biology, American University of Beirut, Beirut, Lebanon.
| |
Collapse
|
112
|
Gardner EJ, Sifrim A, Lindsay SJ, Prigmore E, Rajan D, Danecek P, Gallone G, Eberhardt RY, Martin HC, Wright CF, FitzPatrick DR, Firth HV, Hurles ME. Detecting cryptic clinically relevant structural variation in exome-sequencing data increases diagnostic yield for developmental disorders. Am J Hum Genet 2021; 108:2186-2194. [PMID: 34626536 PMCID: PMC8595893 DOI: 10.1016/j.ajhg.2021.09.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/15/2021] [Indexed: 11/29/2022] Open
Abstract
Structural variation (SV) describes a broad class of genetic variation greater than 50 bp in size. SVs can cause a wide range of genetic diseases and are prevalent in rare developmental disorders (DDs). Individuals presenting with DDs are often referred for diagnostic testing with chromosomal microarrays (CMAs) to identify large copy-number variants (CNVs) and/or with single-gene, gene-panel, or exome sequencing (ES) to identify single-nucleotide variants, small insertions/deletions, and CNVs. However, individuals with pathogenic SVs undetectable by conventional analysis often remain undiagnosed. Consequently, we have developed the tool InDelible, which interrogates short-read sequencing data for split-read clusters characteristic of SV breakpoints. We applied InDelible to 13,438 probands with severe DDs recruited as part of the Deciphering Developmental Disorders (DDD) study and discovered 63 rare, damaging variants in genes previously associated with DDs missed by standard SNV, indel, or CNV discovery approaches. Clinical review of these 63 variants determined that about half (30/63) were plausibly pathogenic. InDelible was particularly effective at ascertaining variants between 21 and 500 bp in size and increased the total number of potentially pathogenic variants identified by DDD in this size range by 42.9%. Of particular interest were seven confirmed de novo variants in MECP2, which represent 35.0% of all de novo protein-truncating variants in MECP2 among DDD study participants. InDelible provides a framework for the discovery of pathogenic SVs that are most likely missed by standard analytical workflows and has the potential to improve the diagnostic yield of ES across a broad range of genetic diseases.
Collapse
Affiliation(s)
- Eugene J Gardner
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Alejandro Sifrim
- Department of Human Genetics, KU Leuven, Herestraat 49, Box 602, Leuven 3000, Belgium
| | - Sarah J Lindsay
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Elena Prigmore
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Diana Rajan
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Petr Danecek
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Giuseppe Gallone
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Ruth Y Eberhardt
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Hilary C Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK
| | - Caroline F Wright
- University of Exeter Medical School, Institute of Biomedical and Clinical Science, Royal Devon and Exeter Hospital, Exeter EX2 5DW, UK
| | - David R FitzPatrick
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, WGH, Edinburgh EH4 2SP, UK
| | - Helen V Firth
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK; East Anglian Medical Genetics Service, Box 134, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge CB2 0QQ, UK
| | - Matthew E Hurles
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton CB10 1SA, UK.
| |
Collapse
|
113
|
Autio MI, Bin Amin T, Perrin A, Wong JY, Foo RSY, Prabhakar S. Transposable elements that have recently been mobile in the human genome. BMC Genomics 2021; 22:789. [PMID: 34732136 PMCID: PMC8567694 DOI: 10.1186/s12864-021-08085-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 10/14/2021] [Indexed: 11/29/2022] Open
Abstract
Background Transposable elements (TE) comprise nearly half of the human genome and their insertions have profound effects to human genetic diversification and as well as disease. Despite their abovementioned significance, there is no consensus on the TE subfamilies that remain active in the human genome. In this study, we therefore developed a novel statistical test for recently mobile subfamilies (RMSs), based on patterns of overlap with > 100,000 polymorphic indels. Results Our analysis produced a catalogue of 20 high-confidence RMSs, which excludes many false positives in public databases. Intriguingly though, it includes HERV-K, an LTR subfamily previously thought to be extinct. The RMS catalogue is strongly enriched for contributions to germline genetic disorders (P = 1.1e-10), and thus constitutes a valuable resource for diagnosing disorders of unknown aetiology using targeted TE-insertion screens. Remarkably, RMSs are also highly enriched for somatic insertions in diverse cancers (P = 2.8e-17), thus indicating strong correlations between germline and somatic TE mobility. Using CRISPR/Cas9 deletion, we show that an RMS-derived polymorphic TE insertion increased the expression of RPL17, a gene associated with lower survival in liver cancer. More broadly, polymorphic TE insertions from RMSs were enriched near genes with allele-specific expression, suggesting widespread effects on gene regulation. Conclusions By using a novel statistical test we have defined a catalogue of 20 recently mobile transposable element subfamilies. We illustrate the gene regulatory potential of RMS-derived polymorphic TE insertions, using CRISPR/Cas9 deletion in vitro on a specific candidate, as well as by genome wide analysis of allele-specific expression. Our study presents novel insights into TE mobility and regulatory potential and provides a key resource for human disease genetics and population history studies. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08085-0.
Collapse
Affiliation(s)
- Matias I Autio
- Laboratory of Epigenomics and Chromatin Organization, Genome Institute of Singapore, A*STAR, Singapore, 138672, Singapore.,Cardiovascular Research Institute, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117599, Singapore
| | - Talal Bin Amin
- Spatial and Single Cell Systems, Genome Institute of Singapore, A*STAR, 60 Biopolis St, Genome #02-01, Singapore, 138672, Singapore
| | - Arnaud Perrin
- Laboratory of Epigenomics and Chromatin Organization, Genome Institute of Singapore, A*STAR, Singapore, 138672, Singapore.,Cardiovascular Research Institute, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117599, Singapore
| | - Jen Yi Wong
- Spatial and Single Cell Systems, Genome Institute of Singapore, A*STAR, 60 Biopolis St, Genome #02-01, Singapore, 138672, Singapore
| | - Roger S-Y Foo
- Laboratory of Epigenomics and Chromatin Organization, Genome Institute of Singapore, A*STAR, Singapore, 138672, Singapore.,Cardiovascular Research Institute, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117599, Singapore
| | - Shyam Prabhakar
- Spatial and Single Cell Systems, Genome Institute of Singapore, A*STAR, 60 Biopolis St, Genome #02-01, Singapore, 138672, Singapore.
| |
Collapse
|
114
|
Wang Y, Zhao B, Choi J, Lee EA. Genomic approaches to trace the history of human brain evolution with an emerging opportunity for transposon profiling of ancient humans. Mob DNA 2021; 12:22. [PMID: 34663455 PMCID: PMC8525043 DOI: 10.1186/s13100-021-00250-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 09/27/2021] [Indexed: 12/17/2022] Open
Abstract
Transposable elements (TEs) significantly contribute to shaping the diversity of the human genome, and lines of evidence suggest TEs as one of driving forces of human brain evolution. Existing computational approaches, including cross-species comparative genomics and population genetic modeling, can be adapted for the study of the role of TEs in evolution. In particular, diverse ancient and archaic human genome sequences are increasingly available, allowing reconstruction of past human migration events and holding the promise of identifying and tracking TEs among other evolutionarily important genetic variants at an unprecedented spatiotemporal resolution. However, highly degraded short DNA templates and other unique challenges presented by ancient human DNA call for major changes in current experimental and computational procedures to enable the identification of evolutionarily important TEs. Ancient human genomes are valuable resources for investigating TEs in the evolutionary context, and efforts to explore ancient human genomes will potentially provide a novel perspective on the genetic mechanism of human brain evolution and inspire a variety of technological and methodological advances. In this review, we summarize computational and experimental approaches that can be adapted to identify and validate evolutionarily important TEs, especially for human brain evolution. We also highlight strategies that leverage ancient genomic data and discuss unique challenges in ancient transposon genomics.
Collapse
Affiliation(s)
- Yilan Wang
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Boxun Zhao
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
| | - Jaejoon Choi
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
| |
Collapse
|
115
|
Factors Regulating the Activity of LINE1 Retrotransposons. Genes (Basel) 2021; 12:genes12101562. [PMID: 34680956 PMCID: PMC8535693 DOI: 10.3390/genes12101562] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 09/21/2021] [Accepted: 09/22/2021] [Indexed: 12/15/2022] Open
Abstract
LINE-1 (L1) is a class of autonomous mobile genetic elements that form somatic mosaicisms in various tissues of the organism. The activity of L1 retrotransposons is strictly controlled by many factors in somatic and germ cells at all stages of ontogenesis. Alteration of L1 activity was noted in a number of diseases: in neuropsychiatric and autoimmune diseases, as well as in various forms of cancer. Altered activity of L1 retrotransposons for some pathologies is associated with epigenetic changes and defects in the genes involved in their repression. This review discusses the molecular genetic mechanisms of the retrotransposition and regulation of the activity of L1 elements. The contribution of various factors controlling the expression and distribution of L1 elements in the genome occurs at all stages of the retrotransposition. The regulation of L1 elements at the transcriptional, post-transcriptional and integration into the genome stages is described in detail. Finally, this review also focuses on the evolutionary aspects of L1 accumulation and their interplay with the host regulation system.
Collapse
|
116
|
Wong JS, Jadhav T, Young E, Wang Y, Xiao M. Characterization of full-length LINE-1 insertions in 154 genomes. Genomics 2021; 113:3804-3810. [PMID: 34534648 DOI: 10.1016/j.ygeno.2021.09.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 08/18/2021] [Accepted: 09/11/2021] [Indexed: 10/20/2022]
Abstract
Long interspersed nuclear elements (LINEs) are retrotransposons that contribute to genetic variation in the human genome. LINE-1 elements in larger-scale studies are challenging to identify using sequencing technologies due to cost and scalability. We developed an approach using optical mapping for detection of full-length LINE-1 insertions and 10× sequencing for confirmation. We found 51 true positive full-length LINE-1 insertions, of which 4 are novel insertions, in NA12878. Repeating our analysis on a larger sample set representing 26 populations, we identified 329 full-length LINE-1 elements, of which 123 are novel. 24.8% of these 329 LINE-1 insertions were shared amongst all 5 superpopulations (AFR, AMR, EUR, EAS, SAS). The African superpopulation has a higher percentage of population-specific LINE-1 insertions than any other superpopulation. These data indicate that our approach can provide high-speed, cost-effective, and increased accuracy for LINE-1 detection. These data also provide an insight into variations of LINE-1 elements between different populations.
Collapse
Affiliation(s)
- Jessica S Wong
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Tanaya Jadhav
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Eleanor Young
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Yilin Wang
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America
| | - Ming Xiao
- School of Biomedical Engineering, Drexel University, Philadelphia, PA, United States of America; Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, United States of America.
| |
Collapse
|
117
|
Watkins WS, Feusier JE, Thomas J, Goubert C, Mallick S, Jorde LB. The Simons Genome Diversity Project: A Global Analysis of Mobile Element Diversity. Genome Biol Evol 2021; 12:779-794. [PMID: 32359137 PMCID: PMC7290288 DOI: 10.1093/gbe/evaa086] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/24/2020] [Indexed: 12/30/2022] Open
Abstract
Ongoing retrotransposition of Alu, LINE-1, and SINE–VNTR–Alu elements generates diversity and variation among human populations. Previous analyses investigating the population genetics of mobile element insertions (MEIs) have been limited by population ascertainment bias or by relatively small numbers of populations and low sequencing coverage. Here, we use 296 individuals representing 142 global populations from the Simons Genome Diversity Project (SGDP) to discover and characterize MEI diversity from deeply sequenced whole-genome data. We report 5,742 MEIs not originally reported by the 1000 Genomes Project and show that high sampling diversity leads to a 4- to 7-fold increase in MEI discovery rates over the original 1000 Genomes Project data. As a result of negative selection, nonreference polymorphic MEIs are underrepresented within genes, and MEIs within genes are often found in the transcriptional orientation opposite that of the gene. Globally, 80% of Alu subfamilies predate the expansion of modern humans from Africa. Polymorphic MEIs show heterozygosity gradients that decrease from Africa to Eurasia to the Americas, and the number of MEIs found uniquely in a single individual are also distributed in this general pattern. The maximum fraction of MEI diversity partitioned among the seven major SGDP population groups (FST) is 7.4%, similar to, but slightly lower than, previous estimates and likely attributable to the diverse sampling strategy of the SGDP. Finally, we utilize these MEIs to extrapolate the primary Native American shared ancestry component to back to Asia and provide new evidence from genome-wide identical-by-descent genetic markers that add additional support for a southeastern Siberian origin for most Native Americans.
Collapse
Affiliation(s)
| | | | - Jainy Thomas
- Department of Human Genetics, University of Utah
| | - Clement Goubert
- Department of Molecular Biology and Genetics, Cornell University
| | - Swapon Mallick
- Department of Genetics, Harvard Medical School, Boston, Massachusetts
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah
| |
Collapse
|
118
|
In Silico identification of a common mobile element insertion in exon 4 of RP1. Sci Rep 2021; 11:13381. [PMID: 34183725 PMCID: PMC8238996 DOI: 10.1038/s41598-021-92834-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 06/14/2021] [Indexed: 11/18/2022] Open
Abstract
Mobile element insertions (MEIs) typically exceed the read lengths of short-read sequencing technologies and are therefore frequently missed. Recently, a founder Alu insertion in exon 4 of RP1 has been detected in Japanese patients with macular dystrophy by PCR and gel electrophoresis. We aimed to develop a grep search program for the detection of the Alu insertion in exon 4 of RP1 using unprocessed short reads. Among 494 unrelated Korean patients with inherited eye diseases, 273 patients with specific retinal phenotypes who were previously genotyped by targeted panel or whole exome sequencing were selected. Five probands had a single heterozygous truncating RP1 variant, and one of their unaffected parents also carry this variant. To find a hidden genetic variant, whole genome sequencing was performed in two patients, and it revealed AluY c.4052_4053ins328/p.(Tyr1352Alafs*9) insertion in RP1 exon 4. This AluY insertion was additionally identified in other 3 families, which was confirmed by PCR and gel electrophoresis. We developed simplified grep search program to detect this AluY insertion in RP1 exon 4. The simple grep search revealed a median variant allele frequency of 0.282 (interquartile range, 0.232–0.383), with no false-positive results using 120 control samples. The MEI in RP1 exon 4 was a common founder mutation in Korean, occurring in 1.8% of our cohort. The RP1-Alu grep program efficiently detected the AluY insertion, without the preprocessing of raw data or complex installation processes.
Collapse
|
119
|
Steely CJ, Russell KL, Feusier JE, Qiao Y, Tavtigian SV, Marth G, Jorde LB. Mobile element insertions and associated structural variants in longitudinal breast cancer samples. Sci Rep 2021; 11:13020. [PMID: 34158539 PMCID: PMC8219704 DOI: 10.1038/s41598-021-92444-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/07/2021] [Indexed: 02/05/2023] Open
Abstract
While mobile elements are largely inactive in healthy somatic tissues, increased activity has been found in cancer tissues, with significant variation among different cancer types. In addition to insertion events, mobile elements have also been found to mediate many structural variation events in the genome. Here, to better understand the timing and impact of mobile element insertions and associated structural variants in cancer, we examined their activity in longitudinal samples of four metastatic breast cancer patients. We identified 11 mobile element insertions or associated structural variants and found that the majority of these occurred early in tumor progression. Most of the variants impact intergenic regions; however, we identified a translocation interrupting MAP2K4 involving Alu elements and a deletion in YTHDF2 involving mobile elements that likely inactivate reported tumor suppressor genes. The high variant allele fraction of the translocation, the loss of the other copy of MAP2K4, the recurrent loss-of-function mutations found in this gene in other cancers, and the important function of MAP2K4 indicate that this translocation is potentially a driver mutation. Overall, using a unique longitudinal dataset, we find that most variants are likely passenger mutations in the four patients we examined, but some variants impact tumor progression.
Collapse
Affiliation(s)
- Cody J Steely
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA.
| | - Kristi L Russell
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Julie E Feusier
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
| | - Yi Qiao
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Sean V Tavtigian
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA
| | - Gabor Marth
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, 15 N. 2030 E. Rm 5100, Salt Lake City, UT, 84112, USA
- Utah Center for Genetic Discovery, Salt Lake City, UT, 84112, USA
| |
Collapse
|
120
|
Comprehensive identification of transposable element insertions using multiple sequencing technologies. Nat Commun 2021; 12:3836. [PMID: 34158502 PMCID: PMC8219666 DOI: 10.1038/s41467-021-24041-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 05/27/2021] [Indexed: 02/05/2023] Open
Abstract
Transposable elements (TEs) help shape the structure and function of the human genome. When inserted into some locations, TEs may disrupt gene regulation and cause diseases. Here, we present xTea (x-Transposable element analyzer), a tool for identifying TE insertions in whole-genome sequencing data. Whereas existing methods are mostly designed for short-read data, xTea can be applied to both short-read and long-read data. Our analysis shows that xTea outperforms other short read-based methods for both germline and somatic TE insertion discovery. With long-read data, we created a catalogue of polymorphic insertions with full assembly and annotation of insertional sequences for various types of retroelements, including pseudogenes and endogenous retroviruses. Notably, we find that individual genomes have an average of nine groups of full-length L1s in centromeres, suggesting that centromeres and other highly repetitive regions such as telomeres are a significant yet unexplored source of active L1s. xTea is available at https://github.com/parklab/xTea .
Collapse
|
121
|
Yu T, Huang X, Dou S, Tang X, Luo S, Theurkauf WE, Lu J, Weng Z. A benchmark and an algorithm for detecting germline transposon insertions and measuring de novo transposon insertion frequencies. Nucleic Acids Res 2021; 49:e44. [PMID: 33511407 PMCID: PMC8096211 DOI: 10.1093/nar/gkab010] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 12/28/2020] [Accepted: 01/06/2021] [Indexed: 02/01/2023] Open
Abstract
Transposons are genomic parasites, and their new insertions can cause instability and spur the evolution of their host genomes. Rapid accumulation of short-read whole-genome sequencing data provides a great opportunity for studying new transposon insertions and their impacts on the host genome. Although many algorithms are available for detecting transposon insertions, the task remains challenging and existing tools are not designed for identifying de novo insertions. Here, we present a new benchmark fly dataset based on PacBio long-read sequencing and a new method TEMP2 for detecting germline insertions and measuring de novo ‘singleton’ insertion frequencies in eukaryotic genomes. TEMP2 achieves high sensitivity and precision for detecting germline insertions when compared with existing tools using both simulated data in fly and experimental data in fly and human. Furthermore, TEMP2 can accurately assess the frequencies of de novo transposon insertions even with high levels of chimeric reads in simulated datasets; such chimeric reads often occur during the construction of short-read sequencing libraries. By applying TEMP2 to published data on hybrid dysgenic flies inflicted by de-repressed P-elements, we confirmed the continuous new insertions of P-elements in dysgenic offspring before they regain piRNAs for P-element repression. TEMP2 is freely available at Github: https://github.com/weng-lab/TEMP2.
Collapse
Affiliation(s)
- Tianxiong Yu
- Department of Thoracic Surgery, Clinical Translational Research Center, Shanghai Pulmonary Hospital, The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China.,Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Xiao Huang
- Department of Thoracic Surgery, Clinical Translational Research Center, Shanghai Pulmonary Hospital, The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Shengqian Dou
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Xiaolu Tang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Shiqi Luo
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - William E Theurkauf
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Zhiping Weng
- Department of Thoracic Surgery, Clinical Translational Research Center, Shanghai Pulmonary Hospital, The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China.,Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| |
Collapse
|
122
|
Expression Quantitative Trait Loci (eQTLs) Associated with Retrotransposons Demonstrate their Modulatory Effect on the Transcriptome. Int J Mol Sci 2021; 22:ijms22126319. [PMID: 34204806 PMCID: PMC8231655 DOI: 10.3390/ijms22126319] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Revised: 06/03/2021] [Accepted: 06/10/2021] [Indexed: 12/26/2022] Open
Abstract
Transposable elements (TEs) are repetitive elements that belong to a variety of functional classes and have an important role in shaping genome evolution. Around 50% of the human genome contains TEs, and they have been termed the "dark matter" of the genome because relatively little is known about their function. While TEs have been shown to participate in aberrant gene regulation and the pathogenesis of diseases, only a few studies have explored the systemic effect of TEs on gene expression. In the present study, we analysed whole genome sequences and blood whole transcriptome data from 570 individuals within the Parkinson's Progressive Markers Initiative (PPMI) cohort to identify expression quantitative trait loci (eQTL) regulating genome-wide gene expression associated with TEs. We identified 2132 reference TEs that were polymorphic for their presence or absence in our study cohort. The presence or absence of the TE element could change the expression of the gene or gene clusters from zero to tens of thousands of copies of RNA. The main finding is that many TEs possess very strong regulatory effects, and they have the potential to modulate large genetic networks with hundreds of target genes over the genome. We illustrate the plethora of regulatory mechanisms using examples of their action at the HLA gene cluster and data showing different TEs' convergence to modulate WFS1 gene expression. In conclusion, the presence or absence of polymorphisms of TEs has an eminent genome-wide regulatory function with large effect size at the level of the whole transcriptome. The role of TEs in explaining, in part, the missing heritability for complex traits is convincing and should be considered.
Collapse
|
123
|
McDonald TL, Zhou W, Castro CP, Mumm C, Switzenberg JA, Mills RE, Boyle AP. Cas9 targeted enrichment of mobile elements using nanopore sequencing. Nat Commun 2021; 12:3586. [PMID: 34117247 PMCID: PMC8196195 DOI: 10.1038/s41467-021-23918-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 05/25/2021] [Indexed: 02/05/2023] Open
Abstract
Mobile element insertions (MEIs) are repetitive genomic sequences that contribute to genetic variation and can lead to genetic disorders. Targeted and whole-genome approaches using short-read sequencing have been developed to identify reference and non-reference MEIs; however, the read length hampers detection of these elements in complex genomic regions. Here, we pair Cas9-targeted nanopore sequencing with computational methodologies to capture active MEIs in human genomes. We demonstrate parallel enrichment for distinct classes of MEIs, averaging 44% of reads on-targeted signals and exhibiting a 13.4-54x enrichment over whole-genome approaches. We show an individual flow cell can recover most MEIs (97% L1Hs, 93% AluYb, 51% AluYa, 99% SVA_F, and 65% SVA_E). We identify seventeen non-reference MEIs in GM12878 overlooked by modern, long-read analysis pipelines, primarily in repetitive genomic regions. This work introduces the utility of nanopore sequencing for MEI enrichment and lays the foundation for rapid discovery of elusive, repetitive genetic elements.
Collapse
Affiliation(s)
- Torrin L McDonald
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Christopher P Castro
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Camille Mumm
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Jessica A Switzenberg
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Ryan E Mills
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | - Alan P Boyle
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
124
|
Jain D, Chu C, Alver BH, Lee S, Lee EA, Park PJ. HiTea: a computational pipeline to identify non-reference transposable element insertions in Hi-C data. Bioinformatics 2021; 37:1045-1051. [PMID: 33136153 DOI: 10.1093/bioinformatics/btaa923] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 09/14/2020] [Accepted: 10/17/2020] [Indexed: 11/13/2022] Open
Abstract
Hi-C is a common technique for assessing 3D chromatin conformation. Recent studies have shown that long-range interaction information in Hi-C data can be used to generate chromosome-length genome assemblies and identify large-scale structural variations. Here, we demonstrate the use of Hi-C data in detecting mobile transposable element (TE) insertions genome-wide. Our pipeline Hi-C-based TE analyzer (HiTea) capitalizes on clipped Hi-C reads and is aided by a high proportion of discordant read pairs in Hi-C data to detect insertions of three major families of active human TEs. Despite the uneven genome coverage in Hi-C data, HiTea is competitive with the existing callers based on whole-genome sequencing (WGS) data and can supplement the WGS-based characterization of the TE-insertion landscape. We employ the pipeline to identify TE-insertions from human cell-line Hi-C samples. AVAILABILITY AND IMPLEMENTATION HiTea is available at https://github.com/parklab/HiTea and as a Docker image. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dhawal Jain
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Chong Chu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Burak Han Alver
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Soohyun Lee
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA 02115, USA.,Broad Institute of MIT and Harvard University, Cambridge, MA 02142, USA
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
125
|
Watson CM, Crinnion LA, Simmonds J, Camm N, Adlard J, Bonthron DT. Long-read nanopore sequencing enables accurate confirmation of a recurrent PMS2 insertion-deletion variant located in a region of complex genomic architecture. Cancer Genet 2021; 256-257:122-126. [PMID: 34116445 DOI: 10.1016/j.cancergen.2021.05.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 04/08/2021] [Accepted: 05/24/2021] [Indexed: 10/21/2022]
Abstract
Targeted next generation sequencing (NGS) is the predominant methodology for the molecular genetic diagnosis of inherited conditions. In many laboratories, NGS-identified variants are routinely validated using a different method, to minimize the risk of a false-positive diagnosis. This can be particularly important when pathogenic variants are located in complex genomic regions. In this situation, new long-read sequencing technologies have potential advantages over existing alternatives. However, practical examples of their utility for diagnostic purposes remain scant. Here, we report the use of nanopore sequencing to validate a PMS2 mutation refractory to conventional methods. In a patient who presented with colorectal cancer and loss of PMS2 immunostaining, short-read NGS of Lynch syndrome-associated genes identified the recurrent PMS2 insertion-deletion variant, c.736_741delinsTGTGTGTGAAG (p.Pro246Cysfs*3). Confirmation of this variant using bidirectional Sanger sequencing was impeded by an upstream intron 6 poly(T) tract. Using a locus-specific amplicon template, we undertook nanopore long-read sequencing in order to assess the diagnostic accuracy of this platform. Pairwise comparison between a curated benchmark allele (derived from short-read NGS and unidirectional Sanger sequencing) and the consensus nanopore dataset revealed 100% sequence identity. Our experience provides insight into the robustness and ease of deployment of "third-generation" sequencing for accurate characterisation of pathogenic variants.
Collapse
Affiliation(s)
- Christopher M Watson
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds LS9 7TF, United Kingdom; Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds LS9 7TF, United Kingdom.
| | - Laura A Crinnion
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds LS9 7TF, United Kingdom; Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds LS9 7TF, United Kingdom
| | - Jennifer Simmonds
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds LS9 7TF, United Kingdom
| | - Nick Camm
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds LS9 7TF, United Kingdom
| | - Julian Adlard
- The Clinical Genetics Department, Chapel Allerton Hospital, Leeds LS7 4SA, United Kingdom
| | - David T Bonthron
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds LS9 7TF, United Kingdom; The Clinical Genetics Department, Chapel Allerton Hospital, Leeds LS7 4SA, United Kingdom
| |
Collapse
|
126
|
Reference SVA insertion polymorphisms are associated with Parkinson's Disease progression and differential gene expression. NPJ Parkinsons Dis 2021; 7:44. [PMID: 34035310 PMCID: PMC8149882 DOI: 10.1038/s41531-021-00189-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 04/23/2021] [Indexed: 12/20/2022] Open
Abstract
The development of Parkinson's disease (PD) involves a complex interaction of genetic and environmental factors. Genome-wide association studies using extensive single nucleotide polymorphism datasets have identified many loci involved in disease. However much of the heritability of Parkinson's disease is still to be identified and the functional elements associated with the risk to be determined and understood. To investigate the component of PD that may involve complex genetic variants we characterised the hominid specific retrotransposon SINE-VNTR-Alus (SVAs) in the Parkinson's Progression Markers Initiative cohort utilising whole genome sequencing. We identified 81 reference SVAs polymorphic for their presence/absence, seven of which were associated with the progression of the disease and with differential gene expression in whole blood RNA sequencing data. This study highlights the importance of addressing SVA variants and potentially other types of retrotransposons in PD genetics, furthermore, these SVA elements should be considered as regulatory domains that could play a role in disease progression.
Collapse
|
127
|
Chu C, Zhao B, Park PJ, Lee EA. Identification and Genotyping of Transposable Element Insertions From Genome Sequencing Data. ACTA ACUST UNITED AC 2021; 107:e102. [PMID: 32662945 DOI: 10.1002/cphg.102] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Transposable element (TE) mobilization is a significant source of genomic variation and has been associated with various human diseases. The exponential growth of population-scale whole-genome sequencing and rapid innovations in long-read sequencing technologies provide unprecedented opportunities to study TE insertions and their functional impact in human health and disease. Identifying TE insertions, however, is challenging due to the repetitive nature of the TE sequences. Here, we review computational approaches to detecting and genotyping TE insertions using short- and long-read sequencing and discuss the strengths and weaknesses of different approaches. © 2020 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Chong Chu
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
| | - Boxun Zhao
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, Massachusetts.,Department of Pediatrics, Harvard Medical School, Boston, Massachusetts.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, Massachusetts.,Department of Pediatrics, Harvard Medical School, Boston, Massachusetts.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| |
Collapse
|
128
|
Hytönen MK, Niskanen JE, Arumilli M, Brookhart-Knox CA, Donner J, Lohi H. Missense variant in LOXHD1 is associated with canine nonsyndromic hearing loss. Hum Genet 2021; 140:1611-1618. [PMID: 33983508 PMCID: PMC8521602 DOI: 10.1007/s00439-021-02286-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 04/20/2021] [Indexed: 11/19/2022]
Abstract
Hearing loss is a common sensory deficit in both humans and dogs. In canines, the genetic basis is largely unknown, as genetic variants have only been identified for a syndromic form of hearing impairment. We observed a congenital or early-onset sensorineural hearing loss in a Rottweiler litter. Assuming an autosomal recessive inheritance, we used a combined approach of homozygosity mapping and genome sequencing to dissect the genetic background of the disorder. We identified a fully segregating missense variant in LOXHD1, a gene that is known to be essential for cochlear hair cell function and associated with nonsyndromic hearing loss in humans and mice. The canine LOXHD1 variant was specific to the Rottweiler breed in our study cohorts of pure-bred dogs. However, it also was present in some mixed-breed dogs, of which the majority showed Rottweiler ancestry. Low allele frequencies in these populations, 2.6% and 0.04%, indicate a rare variant. To summarize, our study describes the first genetic variant for canine nonsyndromic hearing loss, which is clinically and genetically similar to human LOXHD1-related hearing disorder, and therefore, provides a new large animal model for hearing loss. Equally important, the affected breed will benefit from a genetic test to eradicate this LOXHD1-related hearing disorder from the population.
Collapse
Affiliation(s)
- Marjo K Hytönen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Julia E Niskanen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Meharji Arumilli
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | | | | | - Hannes Lohi
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland. .,Folkhälsan Research Center, Helsinki, Finland. .,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
129
|
Abe‐Hatano C, Iida A, Kosugi S, Momozawa Y, Terao C, Ishikawa K, Okubo M, Hachiya Y, Nishida H, Nakamura K, Miyata R, Murakami C, Takahashi K, Hoshino K, Sakamoto H, Ohta S, Kubota M, Takeshita E, Ishiyama A, Nakagawa E, Sasaki M, Kato M, Matsumoto N, Kamatani Y, Kubo M, Takahashi Y, Natsume J, Inoue K, Goto Y. Whole genome sequencing of 45 Japanese patients with intellectual disability. Am J Med Genet A 2021; 185:1468-1480. [PMID: 33624935 PMCID: PMC8247954 DOI: 10.1002/ajmg.a.62138] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 12/23/2020] [Accepted: 02/06/2021] [Indexed: 02/06/2023]
Abstract
Intellectual disability (ID) is characterized by significant limitations in both intellectual functioning and adaptive behaviors, originating before the age of 18 years. However, the genetic etiologies of ID are still incompletely elucidated due to the wide range of clinical and genetic heterogeneity. Whole genome sequencing (WGS) has been applied as a single-step clinical diagnostic tool for ID because it detects genetic variations with a wide range of resolution from single nucleotide variants (SNVs) to structural variants (SVs). To explore the causative genes for ID, we employed WGS in 45 patients from 44 unrelated Japanese families and performed a stepwise screening approach focusing on the coding variants in the genes. Here, we report 12 pathogenic and likely pathogenic variants: seven heterozygous variants of ADNP, SATB2, ANKRD11, PTEN, TCF4, SPAST, and KCNA2, three hemizygous variants of SMS, SLC6A8, and IQSEC2, and one homozygous variant in AGTPBP1. Of these, four were considered novel. Furthermore, a novel 76 kb deletion containing exons 1 and 2 in DYRK1A was identified. We confirmed the clinical and genetic heterogeneity and high frequency of de novo causative variants (8/12, 66.7%). This is the first report of WGS analysis in Japanese patients with ID. Our results would provide insight into the correlation between novel variants and expanded phenotypes of the disease.
Collapse
Affiliation(s)
- Chihiro Abe‐Hatano
- Department of Mental Retardation and Birth Defect ResearchNational Institute of Neuroscience, National Center of Neurology and PsychiatryTokyoJapan
- Department of PediatricsNagoya University Graduate School of MedicineAichiJapan
| | - Aritoshi Iida
- Medical Genome CenterNational Center of Neurology and PsychiatryTokyoJapan
| | - Shunichi Kosugi
- Laboratory for Statistical and Translational GeneticsRIKEN Center for Integrative Medical SciencesKanagawaJapan
| | - Yukihide Momozawa
- Laboratory for Genotyping DevelopmentRIKEN Center for Integrative Medical SciencesKanagawaJapan
| | - Chikashi Terao
- Laboratory for Statistical and Translational GeneticsRIKEN Center for Integrative Medical SciencesKanagawaJapan
- Clinical Research CenterShizuoka General HospitalShizuokaJapan
- The Department of Applied GeneticsThe School of Pharmaceutical Sciences, University of ShizuokaShizuokaJapan
| | - Keiko Ishikawa
- Medical Genome CenterNational Center of Neurology and PsychiatryTokyoJapan
| | - Mariko Okubo
- Department of Child NeurologyNational Center Hospital, National Center of Neurology and PsychiatryTokyoJapan
| | - Yasuo Hachiya
- Department of NeuropediatricsTokyo Metropolitan Neurological HospitalTokyoJapan
| | - Hiroya Nishida
- Department of NeuropediatricsTokyo Metropolitan Neurological HospitalTokyoJapan
| | - Kazuyuki Nakamura
- Department of PediatricsYamagata University Faculty of MedicineYamagataJapan
| | - Rie Miyata
- Department of PediatricsTokyo‐Kita Medical CenterTokyoJapan
| | - Chie Murakami
- Department of PediatricsKitakyusyu Children's Rehabilitation CenterFukuokaJapan
| | - Kan Takahashi
- Department of PediatricsOme Municipal General HospitalTokyoJapan
| | - Kyoko Hoshino
- Department of PediatricsMinami Wakayama Medical CenterWakayamaJapan
| | - Haruko Sakamoto
- Department of NeonatologyJapanese Red Cross Osaka HospitalOsakaJapan
| | - Sayaka Ohta
- Division of NeurologyNational Center for Child Health and DevelopmentTokyoJapan
| | - Masaya Kubota
- Division of NeurologyNational Center for Child Health and DevelopmentTokyoJapan
| | - Eri Takeshita
- Department of Child NeurologyNational Center Hospital, National Center of Neurology and PsychiatryTokyoJapan
| | - Akihiko Ishiyama
- Department of Child NeurologyNational Center Hospital, National Center of Neurology and PsychiatryTokyoJapan
| | - Eiji Nakagawa
- Department of Child NeurologyNational Center Hospital, National Center of Neurology and PsychiatryTokyoJapan
| | - Masayuki Sasaki
- Department of Child NeurologyNational Center Hospital, National Center of Neurology and PsychiatryTokyoJapan
| | - Mitsuhiro Kato
- Department of PediatricsYamagata University Faculty of MedicineYamagataJapan
- Department of PediatricsShowa University School of MedicineTokyoJapan
| | - Naomichi Matsumoto
- Department of Human GeneticsYokohama City University Graduate School of MedicineKanagawaJapan
| | - Yoichiro Kamatani
- Laboratory for Statistical and Translational GeneticsRIKEN Center for Integrative Medical SciencesKanagawaJapan
- Department of Computational Biology and Medical SciencesGraduate School of Frontier Sciences, The University of TokyoTokyoJapan
| | - Michiaki Kubo
- Laboratory for Genotyping DevelopmentRIKEN Center for Integrative Medical SciencesKanagawaJapan
| | - Yoshiyuki Takahashi
- Department of PediatricsNagoya University Graduate School of MedicineAichiJapan
| | - Jun Natsume
- Department of PediatricsNagoya University Graduate School of MedicineAichiJapan
| | - Ken Inoue
- Department of Mental Retardation and Birth Defect ResearchNational Institute of Neuroscience, National Center of Neurology and PsychiatryTokyoJapan
| | - Yu‐Ichi Goto
- Department of Mental Retardation and Birth Defect ResearchNational Institute of Neuroscience, National Center of Neurology and PsychiatryTokyoJapan
- Medical Genome CenterNational Center of Neurology and PsychiatryTokyoJapan
| |
Collapse
|
130
|
de Goede OM, Nachun DC, Ferraro NM, Gloudemans MJ, Rao AS, Smail C, Eulalio TY, Aguet F, Ng B, Xu J, Barbeira AN, Castel SE, Kim-Hellmuth S, Park Y, Scott AJ, Strober BJ, Brown CD, Wen X, Hall IM, Battle A, Lappalainen T, Im HK, Ardlie KG, Mostafavi S, Quertermous T, Kirkegaard K, Montgomery SB. Population-scale tissue transcriptomics maps long non-coding RNAs to complex disease. Cell 2021; 184:2633-2648.e19. [PMID: 33864768 DOI: 10.1016/j.cell.2021.03.050] [Citation(s) in RCA: 81] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 10/16/2020] [Accepted: 03/24/2021] [Indexed: 02/07/2023]
Abstract
Long non-coding RNA (lncRNA) genes have well-established and important impacts on molecular and cellular functions. However, among the thousands of lncRNA genes, it is still a major challenge to identify the subset with disease or trait relevance. To systematically characterize these lncRNA genes, we used Genotype Tissue Expression (GTEx) project v8 genetic and multi-tissue transcriptomic data to profile the expression, genetic regulation, cellular contexts, and trait associations of 14,100 lncRNA genes across 49 tissues for 101 distinct complex genetic traits. Using these approaches, we identified 1,432 lncRNA gene-trait associations, 800 of which were not explained by stronger effects of neighboring protein-coding genes. This included associations between lncRNA quantitative trait loci and inflammatory bowel disease, type 1 and type 2 diabetes, and coronary artery disease, as well as rare variant associations to body mass index.
Collapse
Affiliation(s)
- Olivia M de Goede
- Department of Genetics, Stanford University, Stanford, CA 94305, USA.
| | - Daniel C Nachun
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | - Nicole M Ferraro
- Biomedical Informatics Training Program, Stanford University, Stanford, CA 94305, USA
| | - Michael J Gloudemans
- Biomedical Informatics Training Program, Stanford University, Stanford, CA 94305, USA
| | - Abhiram S Rao
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Craig Smail
- Biomedical Informatics Training Program, Stanford University, Stanford, CA 94305, USA; Genomic Medicine Center, Children's Mercy Research Institute, Kansas City, MO 64108, USA
| | - Tiffany Y Eulalio
- Biomedical Informatics Training Program, Stanford University, Stanford, CA 94305, USA
| | - François Aguet
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bernard Ng
- Department of Statistics, University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Centre for Molecular Medicine and Therapeutics, Vancouver, BC V5Z 4H4, Canada
| | - Jishu Xu
- Rush Alzheimer's Disease Center, Rush University, Chicago, Illinois 60612, USA
| | - Alvaro N Barbeira
- Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL 60637, USA
| | - Stephane E Castel
- New York Genome Center, New York, NY 10013, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Sarah Kim-Hellmuth
- New York Genome Center, New York, NY 10013, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA; Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital LMU Munich, Munich 80337, Germany
| | - YoSon Park
- Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Alexandra J Scott
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Benjamin J Strober
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Christopher D Brown
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Xiaoquan Wen
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ira M Hall
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Alexis Battle
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA; Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Tuuli Lappalainen
- New York Genome Center, New York, NY 10013, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL 60637, USA
| | - Kristin G Ardlie
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Sara Mostafavi
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98195, USA
| | - Thomas Quertermous
- Division of Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA 94305, USA
| | - Karla Kirkegaard
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University, Stanford, CA 94305, USA
| | - Stephen B Montgomery
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Pathology, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
131
|
Hytönen MK, Sarviaho R, Jackson CB, Syrjä P, Jokinen T, Matiasek K, Rosati M, Dallabona C, Baruffini E, Quintero I, Arumilli M, Monteuuis G, Donner J, Anttila M, Suomalainen A, Bindoff LA, Lohi H. In-frame deletion in canine PITRM1 is associated with a severe early-onset epilepsy, mitochondrial dysfunction and neurodegeneration. Hum Genet 2021; 140:1593-1609. [PMID: 33835239 PMCID: PMC8519929 DOI: 10.1007/s00439-021-02279-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 03/27/2021] [Indexed: 11/30/2022]
Abstract
We investigated the clinical, genetic, and pathological characteristics of a previously unknown severe juvenile brain disorder in several litters of Parson Russel Terriers. The disease started with epileptic seizures at 6–12 weeks of age and progressed rapidly to status epilepticus and death or euthanasia. Histopathological changes at autopsy were restricted to the brain. There was severe acute neuronal degeneration and necrosis diffusely affecting the grey matter throughout the brain with extensive intraneuronal mitochondrial crowding and accumulation of amyloid-β (Aβ). Combined homozygosity mapping and genome sequencing revealed an in-frame 6-bp deletion in the nuclear-encoded pitrilysin metallopeptidase 1 (PITRM1) encoding for a mitochondrial protease involved in mitochondrial targeting sequence processing and degradation. The 6-bp deletion results in the loss of two amino acid residues in the N-terminal part of PITRM1, potentially affecting protein folding and function. Assessment of the mitochondrial function in the affected brain tissue showed a significant deficiency in respiratory chain function. The functional consequences of the mutation were modeled in yeast and showed impaired growth in permissive conditions and an impaired respiration capacity. Loss-of-function variants in human PITRM1 result in a childhood-onset progressive amyloidotic neurological syndrome characterized by spinocerebellar ataxia with behavioral, psychiatric and cognitive abnormalities. Homozygous Pitrm1-knockout mice are embryonic lethal, while heterozygotes show a progressive, neurodegenerative phenotype characterized by impairment in motor coordination and Aβ deposits. Our study describes a novel early-onset PITRM1-related neurodegenerative canine brain disorder with mitochondrial dysfunction, Aβ accumulation, and lethal epilepsy. The findings highlight the essential role of PITRM1 in neuronal survival and strengthen the connection between mitochondrial dysfunction and neurodegeneration.
Collapse
Affiliation(s)
- Marjo K Hytönen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Riika Sarviaho
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Christopher B Jackson
- Department of Biochemistry and Developmental Biology, University of Helsinki, Helsinki, Finland
| | - Pernilla Syrjä
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Tarja Jokinen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Department of Equine and Small Animal Medicine, University of Helsinki, Helsinki, Finland
| | - Kaspar Matiasek
- Faculty of Veterinary Medicine, Centre for Clinical Veterinary Medicine, LMU-Munich, Veterinärstrasse 13, 80539, Munich, Germany
| | - Marco Rosati
- Faculty of Veterinary Medicine, Centre for Clinical Veterinary Medicine, LMU-Munich, Veterinärstrasse 13, 80539, Munich, Germany
| | - Cristina Dallabona
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy
| | - Enrico Baruffini
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy
| | - Ileana Quintero
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Meharji Arumilli
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Geoffray Monteuuis
- Department of Biochemistry and Developmental Biology, University of Helsinki, Helsinki, Finland
| | - Jonas Donner
- Wisdom Health (Genoscoper Laboratories), Helsinki, Finland
| | | | - Anu Suomalainen
- Research Programs Unit, Stem Cells and Metabolism Research Program, University of Helsinki, Helsinki, Finland
| | - Laurence A Bindoff
- Department of Clinical Medicine (K1), University of Bergen, Bergen, Norway.,Department of Neurology, Neuro-SysMed, Haukeland University Hospital, Bergen, Norway
| | - Hannes Lohi
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland. .,Folkhälsan Research Center, Helsinki, Finland. .,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
132
|
A study of transposable element-associated structural variations (TASVs) using a de novo-assembled Korean genome. Exp Mol Med 2021; 53:615-630. [PMID: 33833373 PMCID: PMC8102501 DOI: 10.1038/s12276-021-00586-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Revised: 01/26/2021] [Accepted: 01/27/2021] [Indexed: 12/13/2022] Open
Abstract
Advances in next-generation sequencing (NGS) technology have made personal genome sequencing possible, and indeed, many individual human genomes have now been sequenced. Comparisons of these individual genomes have revealed substantial genomic differences between human populations as well as between individuals from closely related ethnic groups. Transposable elements (TEs) are known to be one of the major sources of these variations and act through various mechanisms, including de novo insertion, insertion-mediated deletion, and TE–TE recombination-mediated deletion. In this study, we carried out de novo whole-genome sequencing of one Korean individual (KPGP9) via multiple insert-size libraries. The de novo whole-genome assembly resulted in 31,305 scaffolds with a scaffold N50 size of 13.23 Mb. Furthermore, through computational data analysis and experimental verification, we revealed that 182 TE-associated structural variation (TASV) insertions and 89 TASV deletions contributed 64,232 bp in sequence gain and 82,772 bp in sequence loss, respectively, in the KPGP9 genome relative to the hg19 reference genome. We also verified structural differences associated with TASVs by comparative analysis with TASVs in recent genomes (AK1 and TCGA genomes) and reported their details. Here, we constructed a new Korean de novo whole-genome assembly and provide the first study, to our knowledge, focused on the identification of TASVs in an individual Korean genome. Our findings again highlight the role of TEs as a major driver of structural variations in human individual genomes. A novel strategy for genome analysis offers insights into the distribution and impact on genome variation of transposable elements, DNA sequences that can replicate and relocate themselves at different chromosomal regions. These sequences, also known as ‘jumping genes’, comprise up to 50% of the genome, but it has proven challenging to map them with existing techniques. Seyoung Mun of Dankook University, Cheonan, South Korea, and coworkers have developed a sequencing and computational analysis strategy that allowed them to accurately map transposable elements across the genome of a Korean individual. These data revealed hundreds of insertion and deletion events relative to an existing reference map of the genome, showing significant alterations in the chromosomal structure. The authors speculate that such widespread transposition events could potentially contribute to individual differences in gene expression and risk of disease.
Collapse
|
133
|
Hiatt SM, Lawlor JM, Handley LH, Ramaker RC, Rogers BB, Partridge EC, Boston LB, Williams M, Plott CB, Jenkins J, Gray DE, Holt JM, Bowling KM, Bebin EM, Grimwood J, Schmutz J, Cooper GM. Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders. HGG ADVANCES 2021; 2:100023. [PMID: 33937879 PMCID: PMC8087252 DOI: 10.1016/j.xhgg.2021.100023] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 01/07/2021] [Indexed: 02/07/2023] Open
Abstract
Exome and genome sequencing have proven to be effective tools for the diagnosis of neurodevelopmental disorders (NDDs), but large fractions of NDDs cannot be attributed to currently detectable genetic variation. This is likely, at least in part, a result of the fact that many genetic variants are difficult or impossible to detect through typical short-read sequencing approaches. Here, we describe a genomic analysis using Pacific Biosciences circular consensus sequencing (CCS) reads, which are both long (>10 kb) and accurate (>99% bp accuracy). We used CCS on six proband-parent trios with NDDs that were unexplained despite extensive testing, including genome sequencing with short reads. We identified variants and created de novo assemblies in each trio, with global metrics indicating these datasets are more accurate and comprehensive than those provided by short-read data. In one proband, we identified a likely pathogenic (LP), de novo L1-mediated insertion in CDKL5 that results in duplication of exon 3, leading to a frameshift. In a second proband, we identified multiple large de novo structural variants, including insertion-translocations affecting DGKB and MLLT3, which we show disrupt MLLT3 transcript levels. We consider this extensive structural variation likely pathogenic. The breadth and quality of variant detection, coupled to finding variants of clinical and research interest in two of six probands with unexplained NDDs, support the hypothesis that long-read genome sequencing can substantially improve rare disease genetic discovery rates.
Collapse
Affiliation(s)
- Susan M. Hiatt
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | | - Lori H. Handley
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Ryne C. Ramaker
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Brianne B. Rogers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
- Department of Genetics, University of Alabama at Birmingham, Birmingham, AL 35924, USA
| | | | - Lori Beth Boston
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Melissa Williams
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - David E. Gray
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - James M. Holt
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Kevin M. Bowling
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - E. Martina Bebin
- Department of Neurology, University of Alabama at Birmingham, Birmingham, AL 35924, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | | |
Collapse
|
134
|
Ebert P, Audano PA, Zhu Q, Rodriguez-Martin B, Porubsky D, Bonder MJ, Sulovari A, Ebler J, Zhou W, Serra Mari R, Yilmaz F, Zhao X, Hsieh P, Lee J, Kumar S, Lin J, Rausch T, Chen Y, Ren J, Santamarina M, Höps W, Ashraf H, Chuang NT, Yang X, Munson KM, Lewis AP, Fairley S, Tallon LJ, Clarke WE, Basile AO, Byrska-Bishop M, Corvelo A, Evani US, Lu TY, Chaisson MJP, Chen J, Li C, Brand H, Wenger AM, Ghareghani M, Harvey WT, Raeder B, Hasenfeld P, Regier AA, Abel HJ, Hall IM, Flicek P, Stegle O, Gerstein MB, Tubio JMC, Mu Z, Li YI, Shi X, Hastie AR, Ye K, Chong Z, Sanders AD, Zody MC, Talkowski ME, Mills RE, Devine SE, Lee C, Korbel JO, Marschall T, Eichler EE. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science 2021; 372:eabf7117. [PMID: 33632895 PMCID: PMC8026704 DOI: 10.1126/science.abf7117] [Citation(s) in RCA: 307] [Impact Index Per Article: 102.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 02/09/2021] [Indexed: 12/14/2022]
Abstract
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.
Collapse
Affiliation(s)
- Peter Ebert
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Peter A Audano
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Bernardo Rodriguez-Martin
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Marc Jan Bonder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Jana Ebler
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Rebecca Serra Mari
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Xuefang Zhao
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Joyce Lee
- Bionano Genomics, San Diego, CA 92121, USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Yu Chen
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Jingwen Ren
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Martin Santamarina
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Hufsah Ashraf
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Nelson T Chuang
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Xiaofei Yang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Luke J Tallon
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | | | | | | | | | | | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Junjie Chen
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Harrison Brand
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Aaron M Wenger
- Pacific Biosciences of California, Menlo Park, CA 94025, USA
| | - Maryam Ghareghani
- Max Planck Institute for Informatics, Saarland Informatics Campus E1.4, 66123 Saarbrücken, Germany
- Saarbrücken Graduate School of Computer Science, Saarland University, Saarland Informatics Campus E1.3, 66123 Saarbrücken, Germany
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Benjamin Raeder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Allison A Regier
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Haley J Abel
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Ira M Hall
- Department of Genetics, Yale School of Medicine, 333 Cedar Street, New Haven, CT 06510, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oliver Stegle
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Mark B Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jose M C Tubio
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Zepeng Mu
- Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | | | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Zechen Chong
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Ashley D Sanders
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | | | - Michael E Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Scott E Devine
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA.
- Precision Medicine Center, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China
- Department of Graduate Studies-Life Sciences, Ewha Womans University, Ewhayeodae-gil, Seodaemun-gu, Seoul 120-750, South Korea
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tobias Marschall
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
135
|
Belyeu JR, Brand H, Wang H, Zhao X, Pedersen BS, Feusier J, Gupta M, Nicholas TJ, Brown J, Baird L, Devlin B, Sanders SJ, Jorde LB, Talkowski ME, Quinlan AR. De novo structural mutation rates and gamete-of-origin biases revealed through genome sequencing of 2,396 families. Am J Hum Genet 2021; 108:597-607. [PMID: 33675682 PMCID: PMC8059337 DOI: 10.1016/j.ajhg.2021.02.012] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 02/12/2021] [Indexed: 01/05/2023] Open
Abstract
Each human genome includes de novo mutations that arose during gametogenesis. While these germline mutations represent a fundamental source of new genetic diversity, they can also create deleterious alleles that impact fitness. Whereas the rate and patterns of point mutations in the human germline are now well understood, far less is known about the frequency and features that impact de novo structural variants (dnSVs). We report a family-based study of germline mutations among 9,599 human genomes from 33 multigenerational CEPH-Utah families and 2,384 families from the Simons Foundation Autism Research Initiative. We find that de novo structural mutations detected by alignment-based, short-read WGS occur at an overall rate of at least 0.160 events per genome in unaffected individuals, and we observe a significantly higher rate (0.206 per genome) in ASD-affected individuals. In both probands and unaffected samples, nearly 73% of de novo structural mutations arose in paternal gametes, and we predict most de novo structural mutations to be caused by mutational mechanisms that do not require sequence homology. After multiple testing correction, we did not observe a statistically significant correlation between parental age and the rate of de novo structural variation in offspring. These results highlight that a spectrum of mutational mechanisms contribute to germline structural mutations and that these mechanisms most likely have markedly different rates and selective pressures than those leading to point mutations.
Collapse
Affiliation(s)
- Jonathan R Belyeu
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Harrison Brand
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02114, USA
| | - Harold Wang
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02114, USA
| | - Xuefang Zhao
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02114, USA
| | - Brent S Pedersen
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Julie Feusier
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT 84112, USA
| | - Meenal Gupta
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Thomas J Nicholas
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Joseph Brown
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Lisa Baird
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Bernie Devlin
- Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Stephan J Sanders
- Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94143, USA; Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA; Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA
| | - Michael E Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02114, USA.
| | - Aaron R Quinlan
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA; Department of Biomedical Informatics, University of Utah, Salt Lake City, UT 84112, USA; Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA.
| |
Collapse
|
136
|
Kovaka S, Fan Y, Ni B, Timp W, Schatz MC. Targeted nanopore sequencing by real-time mapping of raw electrical signal with UNCALLED. Nat Biotechnol 2021; 39:431-441. [PMID: 33257863 PMCID: PMC8567335 DOI: 10.1038/s41587-020-0731-9] [Citation(s) in RCA: 116] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Accepted: 10/07/2020] [Indexed: 02/07/2023]
Abstract
Conventional targeted sequencing methods eliminate many of the benefits of nanopore sequencing, such as the ability to accurately detect structural variants or epigenetic modifications. The ReadUntil method allows nanopore devices to selectively eject reads from pores in real time, which could enable purely computational targeted sequencing. However, this requires rapid identification of on-target reads while most mapping methods require computationally intensive basecalling. We present UNCALLED ( https://github.com/skovaka/UNCALLED ), an open source mapper that rapidly matches streaming of nanopore current signals to a reference sequence. UNCALLED probabilistically considers k-mers that could be represented by the signal and then prunes the candidates based on the reference encoded within a Ferragina-Manzini index. We used UNCALLED to deplete sequencing of known bacterial genomes within a metagenomics community, enriching the remaining species 4.46-fold. UNCALLED also enriched 148 human genes associated with hereditary cancers to 29.6× coverage using one MinION flowcell, enabling accurate detection of single-nucleotide polymorphisms, insertions and deletions, structural variants and methylation in these genes.
Collapse
Affiliation(s)
- Sam Kovaka
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
| | - Yunfan Fan
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Bohan Ni
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Winston Timp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| |
Collapse
|
137
|
Watson CM, Crinnion LA, Lindsay H, Mitchell R, Camm N, Robinson R, Joyce C, Tanteles GA, Halloran DJO, Pena SDJ, Carr IM, Bonthron DT. Assessing the utility of long-read nanopore sequencing for rapid and efficient characterization of mobile element insertions. J Transl Med 2021; 101:442-449. [PMID: 32989232 DOI: 10.1038/s41374-020-00489-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 09/09/2020] [Accepted: 09/10/2020] [Indexed: 12/16/2022] Open
Abstract
Short-read next generation sequencing (NGS) has become the predominant first-line technique used to diagnose patients with rare genetic conditions. Inherent limitations of short-read technology, notably for the detection and characterization of complex insertion-containing variants, are offset by the ability to concurrently screen many disease genes. "Third-generation" long-read sequencers are increasingly being deployed as an orthogonal adjunct technology, but their full potential for molecular genetic diagnosis has yet to be exploited. Here, we describe three diagnostic cases in which pathogenic mobile element insertions were refractory to characterization by short-read sequencing. To validate the accuracy of the long-read technology, we first used Sanger sequencing to confirm the integration sites and derive curated benchmark sequences of the variant-containing alleles. Long-read nanopore sequencing was then performed on locus-specific amplicons. Pairwise comparison between these data and the previously determined benchmark alleles revealed 100% identity of the variant-containing sequences. We demonstrate a number of technical advantages over existing wet-laboratory approaches, including in silico size selection of a mixed pool of amplification products, and the relative ease with which an automated informatics workflow can be established. Our findings add to a growing body of literature describing the diagnostic utility of long-read sequencing.
Collapse
Affiliation(s)
- Christopher M Watson
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK.
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, LS9 7TF, UK.
| | - Laura A Crinnion
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - Helen Lindsay
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - Rowena Mitchell
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - Nick Camm
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - Rachel Robinson
- Yorkshire and North East Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - Caroline Joyce
- Department of Endocrinology, Cork University Hospital, Wilton, Cork, Ireland
| | - George A Tanteles
- Department of Clinical Genetics, The Cyprus Institute of Neurology and Genetics, 6 International Airport Avenue, PO Box 23462, CY1683, Nicosia, Cyprus
| | | | | | - Ian M Carr
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, LS9 7TF, UK
| | - David T Bonthron
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, LS9 7TF, UK
| |
Collapse
|
138
|
Alkailani M, Palidwor G, Poulin A, Mohan R, Pepin D, Vanderhyden B, Gibbings D. A genome-wide strategy to identify causes and consequences of retrotransposon expression finds activation by BRCA1 in ovarian cancer. NAR Cancer 2021; 3:zcaa040. [PMID: 33447827 PMCID: PMC7787265 DOI: 10.1093/narcan/zcaa040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 11/20/2020] [Accepted: 11/30/2020] [Indexed: 12/03/2022] Open
Abstract
It is challenging to identify the causes and consequences of retrotransposon expression in human disease due to the hundreds of active genomic copies and their poor conservation across species. We profiled genomic insertions of retrotransposons in ovarian cancer. In addition, in ovarian and breast cancer we analyzed RNAs exhibiting Bayesian correlation with retrotransposon RNA to identify causes and consequences of retrotransposon expression. This strategy finds divergent inflammatory responses associated with retrotransposon expression in ovarian and breast cancer and identifies new factors inducing expression of endogenous retrotransposons including anti-viral responses and the common tumor suppressor BRCA1. In cell lines, mouse ovarian epithelial cells and patient-derived tumor spheroids, BRCA1 promotes accumulation of retrotransposon RNA. BRCA1 promotes transcription of active families of retrotransposons and their insertion into the genome. Intriguingly, elevated retrotransposon expression predicts survival in ovarian cancer patients. Retrotransposons are part of a complex regulatory network in ovarian cancer including BRCA1 that contributes to patient survival. The described strategy can be used to identify the regulators and impacts of retrotransposons in various contexts of biology and disease in humans.
Collapse
Affiliation(s)
- Maisa Alkailani
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
| | - Gareth Palidwor
- Ottawa Institute for Systems Biology, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
- Bioinformatics, Ottawa Hospital Research Institute, Ottawa, Ontario, K1H 8L6, Canada
| | - Ariane Poulin
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
| | - Raghav Mohan
- Pediatrics Surgical Research Laboratories, Massachusetts General Hospital, Boston, MA 021145, USA
| | - David Pepin
- Pediatrics Surgical Research Laboratories, Massachusetts General Hospital, Boston, MA 021145, USA
- Department of Surgery, Harvard Medical School, Boston, MA 021156, USA
| | - Barbara Vanderhyden
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
- Cancer Therapeutics Program, Ottawa Hospital Research Institute, Ottawa, Ontario, K1H 8L6, Canada
| | - Derrick Gibbings
- Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
| |
Collapse
|
139
|
Kaukonen M, Pettinen IT, Wickström K, Arumilli M, Donner J, Juhola IJ, Holopainen S, Turunen JA, Yoshihara M, Kere J, Lohi H. A missense variant in IFT122 associated with a canine model of retinitis pigmentosa. Hum Genet 2021; 140:1569-1579. [PMID: 33606121 PMCID: PMC8519925 DOI: 10.1007/s00439-021-02266-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 02/10/2021] [Indexed: 11/28/2022]
Abstract
Retinitis pigmentosa (RP) is a blinding eye disease affecting nearly two million people worldwide. Dogs are affected with a similar illness termed progressive retinal atrophy (PRA). Lapponian herders (LHs) are affected with several types of inherited retinal dystrophies, and variants in PRCD and BEST1 genes have been associated with generalized PRA and canine multifocal retinopathy 3 (cmr3), respectively. However, all retinal dystrophy cases in LHs are not explained by these variants, indicating additional genetic causes of disease in the breed. We collected DNA samples from 10 PRA affected LHs, with known PRCD and BEST1 variants excluded, and 34 unaffected LHs. A genome-wide association study identified a locus on CFA20 (praw = 2.4 × 10-7, pBonf = 0.035), and subsequent whole-genome sequencing of an affected LH revealed a missense variant, c.3176G>A, in the intraflagellar transport 122 (IFT122) gene. The variant was also found in Finnish Lapphunds, in which its clinical relevancy needs to be studied further. The variant interrupts a highly conserved residue, p.(R1059H), in IFT122 and likely impairs its function. Variants in IFT122 have not been associated with retinal degeneration in mammals, but the loss of ift122 in zebrafish larvae impaired opsin transport and resulted in progressive photoreceptor degeneration. Our study establishes a new spontaneous dog model to study the role of IFT122 in RP biology, while the affected breed will benefit from a genetic test for a recessive condition.
Collapse
Affiliation(s)
- Maria Kaukonen
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Inka-Tuulevi Pettinen
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | | | - Meharji Arumilli
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Jonas Donner
- Genoscoper Laboratories Ltd (Wisdom Health), Helsinki, Finland
| | - Ida-Julia Juhola
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Saila Holopainen
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of Equine and Small Animal Medicine, University of Helsinki, Helsinki, Finland
| | - Joni A Turunen
- Folkhälsan Research Center, Helsinki, Finland.,Department of Ophthalmology, University of Helsinki, Helsinki University Hospital, Helsinki, Finland
| | - Masahito Yoshihara
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
| | - Juha Kere
- Folkhälsan Research Center, Helsinki, Finland.,Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden.,Stem Cells and Metabolism Research Program STEMM, University of Helsinki, 00014, Helsinki, Finland
| | - Hannes Lohi
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland. .,Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland. .,Folkhälsan Research Center, Helsinki, Finland.
| |
Collapse
|
140
|
Mapping of susceptible variants for cold medicine-related Stevens-Johnson syndrome by whole-genome resequencing. NPJ Genom Med 2021; 6:9. [PMID: 33574277 PMCID: PMC7878485 DOI: 10.1038/s41525-021-00171-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 01/06/2021] [Indexed: 11/16/2022] Open
Abstract
Stevens–Johnson syndrome (SJS) and its severe condition with extensive skin detachment and a poor prognosis, toxic epidermal necrolysis (TEN), are immunologically mediated severe cutaneous reactions of the skin and mucous membranes such as the ocular surface. Genetic variations on the HLA-A and other autosomal genes have been identified as risk factors for cold medicine-related SJS/TEN with severe ocular complications (CM-SJS/TEN with SOC). Using a whole-genome sequencing (WGS) approach, we explored other susceptible variants of CM-SJS/TEN with SOC, especially among rare variants and structural variants (SVs). WGS was performed on samples from 133 patients with CM-SJS/TEN with SOC and 418 healthy controls to obtain single nucleotide polymorphisms (SNPs) and SVs. Genome-wide association tests were performed with these variants. Our genome-wide association test reproduced the associations of the common variants of HLA-A and loci on chromosome 16q12.1. We also identified novel associations of SVs on these loci and an aggregation of rare coding variants on the TPRM8 gene. In silico gene expression analysis on the HLA-A locus revealed that the SNP (rs12202296), which was significantly associated with susceptibility to CM-SJS/TEN with SOC, was correlated to an increase in HLA-A expression levels in the whole blood (P = 2.9 × 10−17), from the GTEx database. The majority of variants that were significantly associated with CM-SJS/TEN with SOC were found in non-coding regions, indicating the regulatory role of genetic variations in the pathogenesis of CM-SJS/TEN with SOC.
Collapse
|
141
|
Kyöstilä K, Niskanen JE, Arumilli M, Donner J, Hytönen MK, Lohi H. Intronic variant in POU1F1 associated with canine pituitary dwarfism. Hum Genet 2021; 140:1553-1562. [PMID: 33550451 PMCID: PMC8519942 DOI: 10.1007/s00439-021-02259-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 01/25/2021] [Indexed: 01/20/2023]
Abstract
The anterior pituitary gland secretes several endocrine hormones, essential for growth, reproduction and other basic physiological functions. Abnormal development or function of the pituitary gland leads to isolated or combined pituitary hormone deficiency (CPHD). At least 30 genes have been associated with human CPHD, including many transcription factors, such as POU1F1. CPHD occurs spontaneously also in mice and dogs. Two affected breeds have been reported in dogs: German Shepherds with a splice defect in the LHX3 gene and Karelian Bear Dogs (KBD) with an unknown genetic cause. We obtained samples from five KBDs presenting dwarfism and abnormal coats. A combined analysis of genome-wide association and next-generation sequencing mapped the disease to a region in chromosome 31 and identified a homozygous intronic variant in the fourth exon of the POU1F1 gene in the affected dogs. The identified variant, c.605-3C>A, resided in the splice region and was predicted to affect splicing. The variant's screening in three new prospective cases, related breeds, and ~ 8000 dogs from 207 breeds indicated complete segregation in KBDs with a carrier frequency of 8%, and high breed-specificity as carriers were found at a low frequency only in Lapponian Herders, a related breed. Our study establishes a novel canine model for CPHD with a candidate POU1F1 defect.
Collapse
Affiliation(s)
- Kaisa Kyöstilä
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Julia E Niskanen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Meharji Arumilli
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Jonas Donner
- Genoscoper Laboratories Ltd (Wisdom Health), Helsinki, Finland
| | - Marjo K Hytönen
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland
| | - Hannes Lohi
- Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland. .,Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland. .,Folkhälsan Research Center, Helsinki, Finland.
| |
Collapse
|
142
|
Sohrab V, López-Díaz C, Di Pietro A, Ma LJ, Ayhan DH. TEfinder: A Bioinformatics Pipeline for Detecting New Transposable Element Insertion Events in Next-Generation Sequencing Data. Genes (Basel) 2021; 12:genes12020224. [PMID: 33557410 PMCID: PMC7914406 DOI: 10.3390/genes12020224] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 01/30/2021] [Accepted: 01/31/2021] [Indexed: 11/16/2022] Open
Abstract
Transposable elements (TEs) are mobile elements capable of introducing genetic changes rapidly. Their importance has been documented in many biological processes, such as introducing genetic instability, altering patterns of gene expression, and accelerating genome evolution. Increasing appreciation of TEs has resulted in a growing number of bioinformatics software to identify insertion events. However, the application of existing tools is limited by either narrow-focused design of the package, too many dependencies on other tools, or prior knowledge required as input files that may not be readily available to all users. Here, we reported a simple pipeline, TEfinder, developed for the detection of new TE insertions with minimal software and input file dependencies. The external software requirements are BEDTools, SAMtools, and Picard. Necessary input files include the reference genome sequence in FASTA format, an alignment file from paired-end reads, existing TEs in GTF format, and a text file of TE names. We tested TEfinder among several evolving populations of Fusarium oxysporum generated through a short-term adaptation study. Our results demonstrate that this easy-to-use tool can effectively detect new TE insertion events, making it accessible and practical for TE analysis.
Collapse
Affiliation(s)
- Vista Sohrab
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA; (V.S.); (L.-J.M.)
| | - Cristina López-Díaz
- Departamento de Genética, Universidad de Córdoba, 14071 Córdoba, Spain; (C.L.-D.); (A.D.P.)
| | - Antonio Di Pietro
- Departamento de Genética, Universidad de Córdoba, 14071 Córdoba, Spain; (C.L.-D.); (A.D.P.)
| | - Li-Jun Ma
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA; (V.S.); (L.-J.M.)
- Molecular and Cellular Biology Graduate Program, University of Massachusetts Amherst, Amherst, MA 01003, USA
| | - Dilay Hazal Ayhan
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA; (V.S.); (L.-J.M.)
- Molecular and Cellular Biology Graduate Program, University of Massachusetts Amherst, Amherst, MA 01003, USA
- Correspondence:
| |
Collapse
|
143
|
Oleksyk TK, Wolfsberger WW, Weber AM, Shchubelka K, Oleksyk OT, Levchuk O, Patrus A, Lazar N, Castro-Marquez SO, Hasynets Y, Boldyzhar P, Neymet M, Urbanovych A, Stakhovska V, Malyar K, Chervyakova S, Podoroha O, Kovalchuk N, Rodriguez-Flores JL, Zhou W, Medley S, Battistuzzi F, Liu R, Hou Y, Chen S, Yang H, Yeager M, Dean M, Mills RE, Smolanka V. Genome diversity in Ukraine. Gigascience 2021; 10:6079618. [PMID: 33438729 PMCID: PMC7804371 DOI: 10.1093/gigascience/giaa159] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 08/21/2020] [Accepted: 12/15/2020] [Indexed: 01/21/2023] Open
Abstract
Background The main goal of this collaborative effort is to provide genome-wide data for the previously underrepresented population in Eastern Europe, and to provide cross-validation of the data from genome sequences and genotypes of the same individuals acquired by different technologies. We collected 97 genome-grade DNA samples from consented individuals representing major regions of Ukraine that were consented for public data release. BGISEQ-500 sequence data and genotypes by an Illumina GWAS chip were cross-validated on multiple samples and additionally referenced to 1 sample that has been resequenced by Illumina NovaSeq6000 S4 at high coverage. Results The genome data have been searched for genomic variation represented in this population, and a number of variants have been reported: large structural variants, indels, copy number variations, single-nucletide polymorphisms, and microsatellites. To our knowledge, this study provides the largest to-date survey of genetic variation in Ukraine, creating a public reference resource aiming to provide data for medical research in a large understudied population. Conclusions Our results indicate that the genetic diversity of the Ukrainian population is uniquely shaped by evolutionary and demographic forces and cannot be ignored in future genetic and biomedical studies. These data will contribute a wealth of new information bringing forth a wealth of novel, endemic and medically related alleles.
Collapse
Affiliation(s)
- Taras K Oleksyk
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Walter W Wolfsberger
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Alexandra M Weber
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Khrystyna Shchubelka
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA.,Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Olga T Oleksyk
- A. Novak Transcarpathian Regional Clinical Hospital, Uzhhorod 88000, Ukraine
| | | | | | | | - Stephanie O Castro-Marquez
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Yaroslava Hasynets
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine
| | - Patricia Boldyzhar
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Mikhailo Neymet
- Velyka Kopanya Family Hospital, Transcarpatia 90330, Ukraine
| | | | | | - Kateryna Malyar
- I.I.Mechnikov Dnipro Regional Clinical Hospital, Dnipro 49000, Ukraine
| | | | | | - Natalia Kovalchuk
- Rivne Regional Specialized Hospital of Radiation Protection, Rivne 33028, Ukraine
| | | | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sarah Medley
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Fabia Battistuzzi
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Ryan Liu
- BGI Shenzhen, Shenzhen, 518083, China
| | - Yong Hou
- BGI Shenzhen, Shenzhen, 518083, China
| | - Siru Chen
- BGI Shenzhen, Shenzhen, 518083, China
| | | | - Meredith Yeager
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Michael Dean
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Volodymyr Smolanka
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| |
Collapse
|
144
|
Kim J, Light N, Subasri V, Young EL, Wegman-Ostrosky T, Barkauskas DA, Hall D, Lupo PJ, Patidar R, Maese LD, Jones K, Wang M, Tavtigian SV, Wu D, Shlien A, Telfer F, Goldenberg A, Skapek SX, Wei JS, Wen X, Catchpoole D, Hawkins DS, Schiffman JD, Khan J, Malkin D, Stewart DR. Pathogenic Germline Variants in Cancer Susceptibility Genes in Children and Young Adults With Rhabdomyosarcoma. JCO Precis Oncol 2021; 5:PO.20.00218. [PMID: 34095712 PMCID: PMC8169077 DOI: 10.1200/po.20.00218] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 09/10/2020] [Accepted: 11/06/2020] [Indexed: 12/30/2022] Open
Abstract
Rhabdomyosarcoma (RMS) is the most common pediatric soft-tissue sarcoma and accounts for 3% of all pediatric cancer. In this study, we investigated germline sequence and structural variation in a broad set of genes in two large, independent RMS cohorts. MATERIALS AND METHODS Genome sequencing of the discovery cohort (n = 273) and exome sequencing of the secondary cohort (n = 121) were conducted on germline DNA. Analyses were performed on 130 cancer susceptibility genes (CSG). Pathogenic or likely pathogenic (P/LP) variants were predicted using the American College of Medical Genetics and Genomics (ACMG) criteria. Structural variation and survival analyses were performed on the discovery cohort. RESULTS We found that 6.6%-7.7% of patients with RMS harbored P/LP variants in dominant-acting CSG. An additional approximately 1% have structural variants (ATM, CDKN1C) in CSGs. CSG variants did not influence survival, although there was a significant correlation with an earlier age of tumor onset. There was a nonsignificant excess of P/LP variants in dominant inheritance genes in the patients with FOXO1 fusion-negative RMS patients versus the patients with FOXO1 fusion-positive RMS. We identified pathogenic germline variants in CSGs previously (TP53, NF1, DICER1, mismatch repair genes), rarely (BRCA2, CBL, CHEK2, SMARCA4), or never (FGFR4) reported in RMS. Numerous genes (TP53, BRCA2, mismatch repair) were on the ACMG Secondary Findings 2.0 list. CONCLUSION In two cohorts of patients with RMS, we identified pathogenic germline variants for which gene-specific therapies and surveillance guidelines may be beneficial. In families with a proband with an RMS-risk P/LP variant, genetic counseling and cascade testing should be considered, especially for ACMG Secondary Findings genes and/or with gene-specific surveillance guidelines.
Collapse
Affiliation(s)
- Jung Kim
- Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD
| | - Nicholas Light
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, ON, Canada
- Institute of Medical Science, University of Toronto, Toronto, ON, Canada
| | - Vallijah Subasri
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, ON, Canada
- Vector Institute of Artificial Intelligence, Toronto, ON, Canada
| | - Erin L. Young
- Department of Pediatrics, University of Utah, Salt Lake City, UT
| | - Talia Wegman-Ostrosky
- Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD
- Basic Research Subdirection, Instituto Nacional de Cancerología (INCan), Mexico City, Mexico
| | - Donald A. Barkauskas
- QuadW-COG Childhood Sarcoma Biostatistics and Annotation Office, Children's Oncology Group, Monrovia, CA
- Department of Preventive Medicine, Keck School of Medicine of the University of Southern California, Los Angeles, CA
| | - David Hall
- QuadW-COG Childhood Sarcoma Biostatistics and Annotation Office, Children's Oncology Group, Monrovia, CA
| | - Philip J. Lupo
- Department of Pediatrics, Hematology-Oncology Section, Baylor College of Medicine, Houston, TX
| | - Rajesh Patidar
- Genetics Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD
| | - Luke D. Maese
- Department of Pediatrics, University of Utah, Salt Lake City, UT
| | - Kristine Jones
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD
| | - Mingyi Wang
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD
| | - Sean V. Tavtigian
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT
| | - Dongjing Wu
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD
| | - Adam Shlien
- Department of Pediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Laboratory Medicine and Pathology, University of Toronto, Toronto, ON, Canada
| | - Frank Telfer
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, ON, Canada
| | - Anna Goldenberg
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, ON, Canada
- Vector Institute of Artificial Intelligence, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | | | - Jun S. Wei
- Genetics Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD
| | - Xinyu Wen
- Genetics Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD
| | - Daniel Catchpoole
- The Tumour Bank, Children's Cancer Research Unit, Kids Research Institute, The Children's Hospital at Westmead, Westmead, NSW, Australia
| | - Douglas S. Hawkins
- Division of Hematology/Oncology, Seattle Children's Hospital, University of Washington, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Joshua D. Schiffman
- Department of Pediatrics, University of Utah, Salt Lake City, UT
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT
| | - Javed Khan
- Genetics Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD
| | - David Malkin
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, ON, Canada
- Institute of Medical Science, University of Toronto, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, ON, Canada
- Division of Hematology-Oncology, The Hospital for Sick Children, Department of Pediatrics, University of Toronto, Toronto, ON, Canada
| | - Douglas R. Stewart
- Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD
| |
Collapse
|
145
|
Meshkov A, Ershova A, Kiseleva A, Zotova E, Sotnikova E, Petukhova A, Zharikova A, Malyshev P, Rozhkova T, Blokhina A, Limonova A, Ramensky V, Divashuk M, Khasanova Z, Bukaeva A, Kurilova O, Skirko O, Pokrovskaya M, Mikova V, Snigir E, Akinshina A, Mitrofanov S, Kashtanova D, Makarov V, Kukharchuk V, Boytsov S, Yudin S, Drapkina O. The LDLR, APOB, and PCSK9 Variants of Index Patients with Familial Hypercholesterolemia in Russia. Genes (Basel) 2021; 12:66. [PMID: 33418990 PMCID: PMC7825309 DOI: 10.3390/genes12010066] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 12/25/2020] [Accepted: 12/30/2020] [Indexed: 01/12/2023] Open
Abstract
Familial hypercholesterolemia (FH) is a common autosomal codominant disorder, characterized by elevated low-density lipoprotein cholesterol levels causing premature atherosclerotic cardiovascular disease. About 2900 variants of LDLR, APOB, and PCSK9 genes potentially associated with FH have been described earlier. Nevertheless, the genetics of FH in a Russian population is poorly understood. The aim of this study is to present data on the spectrum of LDLR, APOB, and PCSK9 gene variants in a cohort of 595 index Russian patients with FH, as well as an additional systematic analysis of the literature for the period of 1995-2020 on LDLR, APOB and PCSK9 gene variants described in Russian patients with FH. We used targeted and whole genome sequencing to search for variants. Accordingly, when combining our novel data and the data of a systematic literature review, we described 224 variants: 187 variants in LDLR, 14 variants in APOB, and 23 variants in PCSK9. A significant proportion of variants, 81 of 224 (36.1%), were not described earlier in FH patients in other populations and may be specific for Russia. Thus, this study significantly supplements knowledge about the spectrum of variants causing FH in Russia and may contribute to a wider implementation of genetic diagnostics in FH patients in Russia.
Collapse
Affiliation(s)
- Alexey Meshkov
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Alexandra Ershova
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Anna Kiseleva
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Evgenia Zotova
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Evgeniia Sotnikova
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Anna Petukhova
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Anastasia Zharikova
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory, 1-73, 119991 Moscow, Russia
| | - Pavel Malyshev
- National Medical Research Center for Cardiology, 3-ya Cherepkovskaya Street, 15A, 121552 Moscow, Russia; (P.M.); (T.R.); (Z.K.); (V.K.); (S.B.)
| | - Tatyana Rozhkova
- National Medical Research Center for Cardiology, 3-ya Cherepkovskaya Street, 15A, 121552 Moscow, Russia; (P.M.); (T.R.); (Z.K.); (V.K.); (S.B.)
| | - Anastasia Blokhina
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Alena Limonova
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Vasily Ramensky
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory, 1-73, 119991 Moscow, Russia
| | - Mikhail Divashuk
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Zukhra Khasanova
- National Medical Research Center for Cardiology, 3-ya Cherepkovskaya Street, 15A, 121552 Moscow, Russia; (P.M.); (T.R.); (Z.K.); (V.K.); (S.B.)
| | - Anna Bukaeva
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Olga Kurilova
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Olga Skirko
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Maria Pokrovskaya
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| | - Valeriya Mikova
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Ekaterina Snigir
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Alexsandra Akinshina
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Sergey Mitrofanov
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Daria Kashtanova
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Valentin Makarov
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Valeriy Kukharchuk
- National Medical Research Center for Cardiology, 3-ya Cherepkovskaya Street, 15A, 121552 Moscow, Russia; (P.M.); (T.R.); (Z.K.); (V.K.); (S.B.)
| | - Sergey Boytsov
- National Medical Research Center for Cardiology, 3-ya Cherepkovskaya Street, 15A, 121552 Moscow, Russia; (P.M.); (T.R.); (Z.K.); (V.K.); (S.B.)
| | - Sergey Yudin
- Centre for Strategic Planning of FMBA of Russia, Pogodinskaya Street, 10, bld. 1, 119121 Moscow, Russia; (E.Z.); (A.P.); (A.B.); (V.M.); (E.S.); (A.A.); (S.M.); (D.K.); (V.M.); (S.Y.)
| | - Oxana Drapkina
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, bld. 3, 101000 Moscow, Russia; (A.E.); (A.K.); (E.S.); (A.Z.); (A.B.); (A.L.); (V.R.); (M.D.); (O.K.); (O.S.); (M.P.); (O.D.)
| |
Collapse
|
146
|
Striking heterogeneity of somatic L1 retrotransposition in single normal and cancerous gastrointestinal cells. Proc Natl Acad Sci U S A 2020; 117:32215-32222. [PMID: 33277430 DOI: 10.1073/pnas.2019450117] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Somatic LINE-1 (L1) retrotransposition has been detected in early embryos, adult brains, and the gastrointestinal (GI) tract, and many cancers, including epithelial GI tumors. We previously found numerous somatic L1 insertions in paired normal and GI cancerous tissues. Here, using a modified method of single-cell analysis for somatic L1 insertions, we studied adenocarcinomas of colon, pancreas, and stomach, and found a variable number of somatic L1 insertions in tumors of the same type from patient to patient. We detected no somatic L1 insertions in single cells of 5 of 10 tumors studied. In three tumors, aneuploid cells were detected by FACS. In one pancreatic tumor, there were many more L1 insertions in aneuploid than in euploid tumor cells. In one gastric cancer, both aneuploid and euploid cells contained large numbers of likely clonal insertions. However, in a second gastric cancer with aneuploid cells, no somatic L1 insertions were found. We suggest that when the cellular environment is favorable to retrotransposition, aneuploidy predisposes tumor cells to L1 insertions, and retrotransposition may occur at the transition from euploidy to aneuploidy. Seventeen percent of insertions were also present in normal cells, similar to findings in genomic DNA from normal tissues of GI tumor patients. We provide evidence that: 1) The number of L1 insertions in tumors of the same type is highly variable, 2) most somatic L1 insertions in GI cancer tissues are absent from normal tissues, and 3) under certain conditions, somatic L1 retrotransposition exhibits a propensity for occurring in aneuploid cells.
Collapse
|
147
|
Savage AL, Lopez AI, Iacoangeli A, Bubb VJ, Smith B, Troakes C, Alahmady N, Koks S, Schumann GG, Al-Chalabi A, Quinn JP. Frequency and methylation status of selected retrotransposition competent L1 loci in amyotrophic lateral sclerosis. Mol Brain 2020; 13:154. [PMID: 33187550 PMCID: PMC7666467 DOI: 10.1186/s13041-020-00694-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 11/03/2020] [Indexed: 12/11/2022] Open
Abstract
Long interspersed element-1 (LINE-1/L1) is the only autonomous transposable element in the human genome that currently mobilises in both germline and somatic tissues. Recent studies have identified correlations between altered retrotransposon expression and the fatal neurodegenerative disease amyotrophic lateral sclerosis (ALS) in a subset of patients. The risk of an individual developing ALS is dependent on an interaction of genetic variants and subsequent modifiers during life. These modifiers could include environmental factors, which can lead to epigenetic and genomic changes, such as somatic mutations, occurring in the neuronal cells that degenerate as the disease develops. There are more than 1 million L1 copies in the human genome today, but only 80-100 L1 loci in the reference genome are considered to be retrotransposition-competent (RC) and an even smaller number of these RC-L1s loci are highly active. We hypothesise that RC-L1s could affect normal cellular function through their mutagenic potential conferred by their ability to retrotranspose in neuronal cells and through DNA damage caused by the endonuclease activity of the L1-encoded ORF2 protein. To investigate whether either an increase in the genomic burden of RC-L1s or epigenetic changes to RC-L1s altering their expression, could play a role in disease development, we chose a set of seven well characterised genomic RC-L1 loci that were reported earlier to be highly active in a cellular L1 retrotransposition reporter assay or serve as major source elements for germline and/or somatic retrotransposition events. Analysis of the insertion allele frequency of five polymorphic RC-L1s, out of the set of seven, for their presence or absence, did not identify an increased number individually or when combined in individuals with the disease. However, we did identify reduced levels of methylation of RC-L1s in the motor cortex of those individuals with both familial and sporadic ALS compared to control brains. The changes to the regulation of the loci encompassing these RC-L1s demonstrated tissue specificity and could be related to the disease process.
Collapse
Affiliation(s)
- Abigail L Savage
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Ana Illera Lopez
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Alfredo Iacoangeli
- Maurice Wohl Clinical Neuroscience Institute, King's College London, London, UK
- Department of Biostatistics and Health Informatics, King's College London, London, UK
| | - Vivien J Bubb
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Bradley Smith
- Maurice Wohl Clinical Neuroscience Institute, King's College London, London, UK
| | - Claire Troakes
- London Neurodegenerative Diseases Brain Bank, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, London, UK
| | - Nada Alahmady
- Maurice Wohl Clinical Neuroscience Institute, King's College London, London, UK
- Department of Biology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
| | - Sulev Koks
- Perron Institute for Neurological and Translational Science, Perth, WA, Australia
- Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA, Australia
| | - Gerald G Schumann
- Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen, Germany
| | - Ammar Al-Chalabi
- Maurice Wohl Clinical Neuroscience Institute, King's College London, London, UK
| | - John P Quinn
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK.
| |
Collapse
|
148
|
Lee YG, Lee JY, Kim J, Kim YJ. Insertion variants missing in the human reference genome are widespread among human populations. BMC Biol 2020; 18:167. [PMID: 33187521 PMCID: PMC7666470 DOI: 10.1186/s12915-020-00894-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 10/09/2020] [Indexed: 01/07/2023] Open
Abstract
Background Structural variants comprise diverse genomic arrangements including deletions, insertions, inversions, and translocations, which can generally be detected in humans through sequence comparison to the reference genome. Among structural variants, insertions are the least frequently identified variants, mainly due to ascertainment bias in the reference genome, lack of previous sequence knowledge, and low complexity of typical insertion sequences. Though recent developments in long-read sequencing deliver promise in annotating individual non-reference insertions, population-level catalogues on non-reference insertion variants have not been identified and the possible functional roles of these hidden variants remain elusive. Results To detect non-reference insertion variants, we developed a pipeline, InserTag, which generates non-reference contigs by local de novo assembly and then infers the full-sequence of insertion variants by tracing contigs from non-human primates and other human genome assemblies. Application of the pipeline to data from 2535 individuals of the 1000 Genomes Project helped identify 1696 non-reference insertion variants and re-classify the variants as retention of ancestral sequences or novel sequence insertions based on the ancestral state. Genotyping of the variants showed that individuals had, on average, 0.92-Mbp sequences missing from the reference genome, 92% of the variants were common (allele frequency > 5%) among human populations, and more than half of the variants were major alleles. Among human populations, African populations were the most divergent and had the most non-reference sequences, which was attributed to the greater prevalence of high-frequency insertion variants. The subsets of insertion variants were in high linkage disequilibrium with phenotype-associated SNPs and showed signals of recent continent-specific selection. Conclusions Non-reference insertion variants represent an important type of genetic variation in the human population, and our developed pipeline, InserTag, provides the frameworks for the detection and genotyping of non-reference sequences missing from human populations. Supplementary information Supplementary information accompanies this paper at 10.1186/s12915-020-00894-1.
Collapse
Affiliation(s)
- Young-Gun Lee
- Department of Integrated Omics for Biomedical Science, WCU Graduate School, Yonsei University, Seoul, Republic of Korea
| | - Jin-Young Lee
- Department of Biochemistry, College of Life Science and Technology, Yonsei University, Seoul, Republic of Korea
| | - Junhyong Kim
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Young-Joon Kim
- Department of Integrated Omics for Biomedical Science, WCU Graduate School, Yonsei University, Seoul, Republic of Korea. .,Department of Biochemistry, College of Life Science and Technology, Yonsei University, Seoul, Republic of Korea.
| |
Collapse
|
149
|
Ewing AD, Smits N, Sanchez-Luque FJ, Faivre J, Brennan PM, Richardson SR, Cheetham SW, Faulkner GJ. Nanopore Sequencing Enables Comprehensive Transposable Element Epigenomic Profiling. Mol Cell 2020; 80:915-928.e5. [PMID: 33186547 DOI: 10.1016/j.molcel.2020.10.024] [Citation(s) in RCA: 95] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 10/14/2020] [Accepted: 10/15/2020] [Indexed: 12/12/2022]
Abstract
Transposable elements (TEs) drive genome evolution and are a notable source of pathogenesis, including cancer. While CpG methylation regulates TE activity, the locus-specific methylation landscape of mobile human TEs has to date proven largely inaccessible. Here, we apply new computational tools and long-read nanopore sequencing to directly infer CpG methylation of novel and extant TE insertions in hippocampus, heart, and liver, as well as paired tumor and non-tumor liver. As opposed to an indiscriminate stochastic process, we find pronounced demethylation of young long interspersed element 1 (LINE-1) retrotransposons in cancer, often distinct to the adjacent genome and other TEs. SINE-VNTR-Alu (SVA) retrotransposons, including their internal tandem repeat-associated CpG island, are near-universally methylated. We encounter allele-specific TE methylation and demethylation of aberrantly expressed young LINE-1s in normal tissues. Finally, we recover the complete sequences of tumor-specific LINE-1 insertions and their retrotransposition hallmarks, demonstrating how long-read sequencing can simultaneously survey the epigenome and detect somatic TE mobilization.
Collapse
Affiliation(s)
- Adam D Ewing
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia.
| | - Nathan Smits
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia
| | - Francisco J Sanchez-Luque
- GENYO, Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS Granada 18016, Spain; MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Jamila Faivre
- INSERM, U1193, Paul-Brousse University Hospital, Hepatobiliary Centre, Villejuif 94800, France
| | - Paul M Brennan
- Translational Neurosurgery, Centre for Clinical Brain Sciences, Edinburgh EH16 4SB, UK
| | - Sandra R Richardson
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia
| | - Seth W Cheetham
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia.
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, St. Lucia, QLD 4067, Australia.
| |
Collapse
|
150
|
The tumor suppressor microRNA let-7 inhibits human LINE-1 retrotransposition. Nat Commun 2020; 11:5712. [PMID: 33177501 PMCID: PMC7658363 DOI: 10.1038/s41467-020-19430-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 10/03/2020] [Indexed: 12/13/2022] Open
Abstract
Nearly half of the human genome is made of transposable elements (TEs) whose activity continues to impact its structure and function. Among them, Long INterspersed Element class 1 (LINE-1 or L1) elements are the only autonomously active TEs in humans. L1s are expressed and mobilized in different cancers, generating mutagenic insertions that could affect tumor malignancy. Tumor suppressor microRNAs are ∼22nt RNAs that post-transcriptionally regulate oncogene expression and are frequently downregulated in cancer. Here we explore whether they also influence L1 mobilization. We show that downregulation of let-7 correlates with accumulation of L1 insertions in human lung cancer. Furthermore, we demonstrate that let-7 binds to the L1 mRNA and impairs the translation of the second L1-encoded protein, ORF2p, reducing its mobilization. Overall, our data reveals that let-7, one of the most relevant microRNAs, maintains somatic genome integrity by restricting L1 retrotransposition. Human Long INterspersed Element class 1 (LINE-1) elements are expressed and mobilized in many types of cancer, contributing to malignancy. Here the authors show that the tumor suppressor microRNA let-7 targets the LINE-1 mRNA and reduces LINE-1 mobilization.
Collapse
|