1
|
Vervoort L, Dierckxsens N, Santos MS, Meynants S, Souche E, Cools R, Heung T, Devriendt K, Peeters H, McDonald-McGinn DM, Swillen A, Breckpot J, Emanuel BS, Van Esch H, Bassett AS, Vermeesch JR. Multiple paralogues and recombination mechanisms drive the high incidence of 22q11.2 Deletion Syndrome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.14.585046. [PMID: 38562770 PMCID: PMC10983858 DOI: 10.1101/2024.03.14.585046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
The 22q11.2 deletion syndrome (22q11.2DS) is the most common microdeletion disorder. Why the incidence of 22q11.2DS is much greater than that of other genomic disorders remains unknown. Short read sequencing cannot resolve the complex segmental duplicon structure to provide direct confirmation of the hypothesis that the rearrangements are caused by non-allelic homologous recombination between the low copy repeats on chromosome 22 (LCR22s). To enable haplotype-specific assembly and rearrangement mapping in LCR22 regions, we combined fiber-FISH optical mapping with whole genome (ultra-)long read sequencing or rearrangement-specific long-range PCR on 24 duos (22q11.2DS patient and parent-of-origin) comprising several different LCR22-mediated rearrangements. Unexpectedly, we demonstrate that not only different paralogous segmental duplicon but also palindromic AT-rich repeats (PATRR) are driving 22q11.2 rearrangements. In addition, we show the existence of two different inversion polymorphisms preceding rearrangement, and somatic mosaicism. The existence of different recombination sites and mechanisms in paralogues and PATRRs which are copy number expanding in the human population are a likely explanation for the high 22q11.2DS incidence.
Collapse
|
2
|
Martin R, Espinoza CY, Large CRL, Rosswork J, Van Bruinisse C, Miller AW, Sanchez JC, Miller M, Paskvan S, Alvino GM, Dunham MJ, Raghuraman MK, Brewer BJ. Template switching between the leading and lagging strands at replication forks generates inverted copy number variants through hairpin-capped extrachromosomal DNA. PLoS Genet 2024; 20:e1010850. [PMID: 38175823 PMCID: PMC10766183 DOI: 10.1371/journal.pgen.1010850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 10/23/2023] [Indexed: 01/06/2024] Open
Abstract
Inherited and germ-line de novo copy number variants (CNVs) are increasingly found to be correlated with human developmental and cancerous phenotypes. Several models for template switching during replication have been proposed to explain the generation of these gross chromosomal rearrangements. We proposed a model of template switching (ODIRA-origin dependent inverted repeat amplification) in which simultaneous ligation of the leading and lagging strands at diverging replication forks could generate segmental inverted triplications through an extrachromosomal inverted circular intermediate. Here, we created a genetic assay using split-ura3 cassettes to trap the proposed inverted intermediate. However, instead of recovering circular inverted intermediates, we found inverted linear chromosomal fragments ending in native telomeres-suggesting that a template switch had occurred at the centromere-proximal fork of a replication bubble. As telomeric inverted hairpin fragments can also be created through double strand breaks we tested whether replication errors or repair of double stranded DNA breaks were the most likely initiating event. The results from CRISPR/Cas9 cleavage experiments and growth in the replication inhibitor hydroxyurea indicate that it is a replication error, not a double stranded break that creates the inverted junctions. Since inverted amplicons of the SUL1 gene occur during long-term growth in sulfate-limited chemostats, we sequenced evolved populations to look for evidence of linear intermediates formed by an error in replication. All of the data are compatible with a two-step version of the ODIRA model in which sequential template switching at short inverted repeats between the leading and lagging strands at a replication fork, followed by integration via homologous recombination, generates inverted interstitial triplications.
Collapse
Affiliation(s)
- Rebecca Martin
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Claudia Y. Espinoza
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Christopher R. L. Large
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Joshua Rosswork
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Cole Van Bruinisse
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Aaron W. Miller
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Joseph C. Sanchez
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Madison Miller
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Samantha Paskvan
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Gina M. Alvino
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Maitreya J. Dunham
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - M. K. Raghuraman
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Bonita J. Brewer
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
3
|
Reeve J, Butlin RK, Koch EL, Stankowski S, Faria R. Chromosomal inversion polymorphisms are widespread across the species ranges of rough periwinkles (Littorina saxatilis and L. arcana). Mol Ecol 2023. [PMID: 37843465 DOI: 10.1111/mec.17160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 09/13/2023] [Accepted: 09/20/2023] [Indexed: 10/17/2023]
Abstract
Inversions are thought to play a key role in adaptation and speciation, suppressing recombination between diverging populations. Genes influencing adaptive traits cluster in inversions, and changes in inversion frequencies are associated with environmental differences. However, in many organisms, it is unclear if inversions are geographically and taxonomically widespread. The intertidal snail, Littorina saxatilis, is one such example. Strong associations between putative polymorphic inversions and phenotypic differences have been demonstrated between two ecotypes of L. saxatilis in Sweden and inferred elsewhere, but no direct evidence for inversion polymorphism currently exists across the species range. Using whole genome data from 107 snails, most inversion polymorphisms were found to be widespread across the species range. The frequencies of some inversion arrangements were significantly different among ecotypes, suggesting a parallel adaptive role. Many inversions were also polymorphic in the sister species, L. arcana, hinting at an ancient origin.
Collapse
Affiliation(s)
- James Reeve
- Tjärnö Marine Laboratory, University of Gothenburg, Strömstad, Sweden
| | - Roger K Butlin
- Tjärnö Marine Laboratory, University of Gothenburg, Strömstad, Sweden
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
| | - Eva L Koch
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
- Department of Zoology, University of Cambridge, Cambridge, UK
| | - Sean Stankowski
- Institute of Science and Technology - Austria, Klosterneuburg, Austria
| | - Rui Faria
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
- Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO, Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
| |
Collapse
|
4
|
Abstract
DNA sequencing has revolutionized medicine over recent decades. However, analysis of large structural variation and repetitive DNA, a hallmark of human genomes, has been limited by short-read technology, with read lengths of 100-300 bp. Long-read sequencing (LRS) permits routine sequencing of human DNA fragments tens to hundreds of kilobase pairs in size, using both real-time sequencing by synthesis and nanopore-based direct electronic sequencing. LRS permits analysis of large structural variation and haplotypic phasing in human genomes and has enabled the discovery and characterization of rare pathogenic structural variants and repeat expansions. It has also recently enabled the assembly of a complete, gapless human genome that includes previously intractable regions, such as highly repetitive centromeres and homologous acrocentric short arms. With the addition of protocols for targeted enrichment, direct epigenetic DNA modification detection, and long-range chromatin profiling, LRS promises to launch a new era of understanding of genetic diversity and pathogenic mutations in human populations.
Collapse
Affiliation(s)
- Peter E Warburton
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert P Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
5
|
Soto DC, Uribe-Salazar JM, Shew CJ, Sekar A, McGinty S, Dennis MY. Genomic structural variation: A complex but important driver of human evolution. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2023; 181 Suppl 76:118-144. [PMID: 36794631 PMCID: PMC10329998 DOI: 10.1002/ajpa.24713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 01/21/2023] [Accepted: 02/05/2023] [Indexed: 02/17/2023]
Abstract
Structural variants (SVs)-including duplications, deletions, and inversions of DNA-can have significant genomic and functional impacts but are technically difficult to identify and assay compared with single-nucleotide variants. With the aid of new genomic technologies, it has become clear that SVs account for significant differences across and within species. This phenomenon is particularly well-documented for humans and other primates due to the wealth of sequence data available. In great apes, SVs affect a larger number of nucleotides than single-nucleotide variants, with many identified SVs exhibiting population and species specificity. In this review, we highlight the importance of SVs in human evolution by (1) how they have shaped great ape genomes resulting in sensitized regions associated with traits and diseases, (2) their impact on gene functions and regulation, which subsequently has played a role in natural selection, and (3) the role of gene duplications in human brain evolution. We further discuss how to incorporate SVs in research, including the strengths and limitations of various genomic approaches. Finally, we propose future considerations in integrating existing data and biospecimens with the ever-expanding SV compendium propelled by biotechnology advancements.
Collapse
Affiliation(s)
- Daniela C. Soto
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - José M. Uribe-Salazar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Colin J. Shew
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Aarthi Sekar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Sean McGinty
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| |
Collapse
|
6
|
Porubsky D, Harvey WT, Rozanski AN, Ebler J, Höps W, Ashraf H, Hasenfeld P, Paten B, Sanders AD, Marschall T, Korbel JO, Eichler EE. Inversion polymorphism in a complete human genome assembly. Genome Biol 2023; 24:100. [PMID: 37122002 PMCID: PMC10150506 DOI: 10.1186/s13059-023-02919-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 03/31/2023] [Indexed: 05/02/2023] Open
Abstract
The telomere-to-telomere (T2T) complete human reference has significantly improved our ability to characterize genome structural variation. To understand its impact on inversion polymorphisms, we remapped data from 41 genomes against the T2T reference genome and compared it to the GRCh38 reference. We find a ~ 21% increase in sensitivity improving mapping of 63 inversions on the T2T reference. We identify 26 misorientations within GRCh38 and show that the T2T reference is three times more likely to represent the correct orientation of the major human allele. Analysis of 10 additional samples reveals novel rare inversions at chromosomes 15q25.2, 16p11.2, 16q22.1-23.1, and 22q11.21.
Collapse
Affiliation(s)
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Hufsah Ashraf
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Ashley D Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Helmholtz Association, 10115, Berlin, Germany
- Berlin Institute of Health (BIH), 10178, Berlin, Germany
- Charité-Universitätsmedizin, 10117, Berlin, Germany
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
| |
Collapse
|
7
|
Shieh JTC. Genomic technologies to improve variation identification in undiagnosed diseases. Pediatr Neonatol 2023; 64 Suppl 1:S18-S21. [PMID: 36428199 DOI: 10.1016/j.pedneo.2022.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 10/12/2022] [Indexed: 11/10/2022] Open
Abstract
Human genome variation has increasingly posed challenges and opportunities for patients, medical providers, and an increasing group of stakeholders including advocacy groups, disadvantaged communities, public health experts, and scientists. Here, advances in genomic sequencing and mapping technologies are discussed with particular attention to the increasing ability to detect personal and population genome variation and the potential for accurate integration of variation into health and disease-related care. Genome mapping, one technique used to create genome map scaffolds, has now been combined with long read sequencing. New technologies have led to improved variation detection, including cryptic structural variation and diverse variants with different degrees of disease association. Combined with advances in automated and medical interpretations, variation detection is increasingly being applied in healthcare. These advances promise to make disease diagnostics more rapid, and potentially more accessible, to those with medical needs. Consequentially, the need for medical genetics and genomics experts is increasing. Here, the opportunities and potential challenges for application of genome-scale variation detection in disease are examined. (<300 words).
Collapse
Affiliation(s)
- Joseph T C Shieh
- University of California San Francisco, Division of Medical Genetics, Department of Pediatrics, Institute for Human Genetics, Benioff Children's Hospital, University of California San Francisco, California, USA.
| |
Collapse
|