1
Guitart X, Porubsky D, Yoo D, Dougherty ML, Dishuck PC, Munson KM, Lewis AP, Hoekzema K, Knuth J, Chang S, Pastinen T, Eichler EE. Independent expansion, selection and hypervariability of the TBC1D3 gene family in humans. bioRxiv 2024:2024.03.12.584650. [PMID: 38654825 PMCID: PMC11037872 DOI: 10.1101/2024.03.12.584650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/26/2024]
Abstract
TBC1D3 is a primate-specific gene family that has expanded in the human lineage and has been implicated in neuronal progenitor proliferation and expansion of the frontal cortex. The gene family and its expression have been challenging to investigate because it is embedded in high-identity and highly variable segmental duplications. We sequenced and assembled the gene family using long-read sequencing data from 34 humans and 11 nonhuman primate species. Our analysis shows that this particular gene family has independently duplicated in at least five primate lineages, and the duplicated loci are enriched at sites of large-scale chromosomal rearrangements on chromosome 17. We find that most humans vary along two TBC1D3 clusters where human haplotypes are highly variable in copy number, differing by as many as 20 copies, and structure (structural heterozygosity 90%). We also show evidence of positive selection, as well as a significant change in the predicted human TBC1D3 protein sequence. Lastly, we find that, despite multiple duplications, human TBC1D3 expression is limited to a subset of copies and, most notably, from a single paralog group: TBC1D3-CDKL. These observations may help explain why a gene potentially important in cortical development can be so variable in the human population.
Affiliation(s)
- Xavi Guitart
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - DongAhn Yoo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Max L. Dougherty
- Tisch Cancer Institute, Division of Hematology and Medical Oncology, The Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Philip C. Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M. Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P. Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jordan Knuth
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Stephen Chang
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
- Department of Medicine, Division of Cardiovascular Medicine, Stanford University, Stanford, CA, USA
| | - Tomi Pastinen
- Department of Pediatrics, Genomic Medicine Center, Children’s Mercy Kansas City, Kansas City, MO, USA
- Department of Pediatrics, School of Medicine, University of Missouri Kansas City, Kansas City, MO, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
2
Porubsky D, Harvey WT, Rozanski AN, Ebler J, Höps W, Ashraf H, Hasenfeld P, Paten B, Sanders AD, Marschall T, Korbel JO, Eichler EE. Inversion polymorphism in a complete human genome assembly. Genome Biol 2023; 24:100. [PMID: 37122002 PMCID: PMC10150506 DOI: 10.1186/s13059-023-02919-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 10/05/2022] [Accepted: 03/31/2023] [Indexed: 05/02/2023] Open
Abstract
The telomere-to-telomere (T2T) complete human reference has significantly improved our ability to characterize genome structural variation. To understand its impact on inversion polymorphisms, we remapped data from 41 genomes against the T2T reference genome and compared it to the GRCh38 reference. We find a ~21% increase in sensitivity improving mapping of 63 inversions on the T2T reference. We identify 26 misorientations within GRCh38 and show that the T2T reference is three times more likely to represent the correct orientation of the major human allele. Analysis of 10 additional samples reveals novel rare inversions at chromosomes 15q25.2, 16p11.2, 16q22.1-23.1, and 22q11.21.
Affiliation(s)
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Hufsah Ashraf
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Ashley D Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Helmholtz Association, 10115, Berlin, Germany
- Berlin Institute of Health (BIH), 10178, Berlin, Germany
- Charité-Universitätsmedizin, 10117, Berlin, Germany
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Moorenstraße 5, 40225, Düsseldorf, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA
3
Wang X, Yue F. HiCLift: A fast and efficient tool for converting chromatin interaction data between genome assemblies. bioRxiv 2023:2023.01.17.524475. [PMID: 36712087 PMCID: PMC9882170 DOI: 10.1101/2023.01.17.524475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/22/2023]
Abstract
Motivation: With the continuous effort to improve the quality of human reference genome and the generation of more and more personal genomes, the conversion of genomic coordinates between genome assemblies is critical in many integrative and comparative studies. While tools have been developed for such task for linear genome signals such as ChIP-Seq, no tool exists to convert genome assemblies for chromatin interaction data, despite the importance of three-dimensional (3D) genome organization in gene regulation and disease. Results: Here, we present HiCLift, a fast and efficient tool that can convert the genomic coordinates of chromatin contacts such as Hi-C and Micro-C from one assembly to another, including the latest T2T genome. Comparing with the strategy of directly re-mapping raw reads to a different genome, HiCLift runs on average 42 times faster (hours vs. days), while outputs nearly identical contact matrices. More importantly, as HiCLift does not need to re-map the raw reads, it can directly convert human patient sample data, where the raw sequencing reads are sometimes hard to acquire or not available. Availability: HiCLift is publicly available at https://github.com/XiaoTaoWang/HiCLift .
4
Gupta C, Chandrashekar P, Jin T, He C, Khullar S, Chang Q, Wang D. Bringing machine learning to research on intellectual and developmental disabilities: taking inspiration from neurological diseases. J Neurodev Disord 2022; 14:28. [PMID: 35501679 PMCID: PMC9059371 DOI: 10.1186/s11689-022-09438-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/19/2021] [Accepted: 04/07/2022] [Indexed: 12/31/2022] Open
Abstract
Intellectual and Developmental Disabilities (IDDs), such as Down syndrome, Fragile X syndrome, Rett syndrome, and autism spectrum disorder, usually manifest at birth or early childhood. IDDs are characterized by significant impairment in intellectual and adaptive functioning, and both genetic and environmental factors underpin IDD biology. Molecular and genetic stratification of IDDs remain challenging mainly due to overlapping factors and comorbidity. Advances in high throughput sequencing, imaging, and tools to record behavioral data at scale have greatly enhanced our understanding of the molecular, cellular, structural, and environmental basis of some IDDs. Fueled by the "big data" revolution, artificial intelligence (AI) and machine learning (ML) technologies have brought a whole new paradigm shift in computational biology. Evidently, the ML-driven approach to clinical diagnoses has the potential to augment classical methods that use symptoms and external observations, hoping to push the personalized treatment plan forward. Therefore, integrative analyses and applications of ML technology have a direct bearing on discoveries in IDDs. The application of ML to IDDs can potentially improve screening and early diagnosis, advance our understanding of the complexity of comorbidity, and accelerate the identification of biomarkers for clinical research and drug development. For more than five decades, the IDDRC network has supported a nexus of investigators at centers across the USA, all striving to understand the interplay between various factors underlying IDDs. In this review, we introduced fast-increasing multi-modal data types, highlighted example studies that employed ML technologies to illuminate factors and biological mechanisms underlying IDDs, as well as recent advances in ML technologies and their applications to IDDs and other neurological diseases. 
We discussed various molecular, clinical, and environmental data collection modes, including genetic, imaging, phenotypical, and behavioral data types, along with multiple repositories that store and share such data. Furthermore, we outlined some fundamental concepts of machine learning algorithms and presented our opinion on specific gaps that will need to be filled to accomplish, for example, reliable implementation of ML-based diagnosis technology in IDD clinics. We anticipate that this review will guide researchers to formulate AI and ML-based approaches to investigate IDDs and related conditions.
Affiliation(s)
- Chirag Gupta
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Pramod Chandrashekar
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Ting Jin
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Chenfeng He
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Saniya Khullar
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Qiang Chang
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Medical Genetics, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Neurology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Daifeng Wang
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
5
Lasser M, Bolduc J, Murphy L, O'Brien C, Lee S, Girirajan S, Lowery LA. 16p12.1 Deletion Orthologs are Expressed in Motile Neural Crest Cells and are Important for Regulating Craniofacial Development in Xenopus laevis. Front Genet 2022; 13:833083. [PMID: 35401697 PMCID: PMC8987115 DOI: 10.3389/fgene.2022.833083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/10/2021] [Accepted: 03/09/2022] [Indexed: 12/03/2022] Open
Abstract
Copy number variants (CNVs) associated with neurodevelopmental disorders are characterized by extensive phenotypic heterogeneity. In particular, one CNV was identified in a subset of children clinically diagnosed with intellectual disabilities (ID) that results in a hemizygous deletion of multiple genes at chromosome 16p12.1. In addition to ID, individuals with this deletion display a variety of symptoms including microcephaly, seizures, cardiac defects, and growth retardation. Moreover, patients also manifest severe craniofacial abnormalities, such as micrognathia, cartilage malformation of the ears and nose, and facial asymmetries; however, the function of the genes within the 16p12.1 region have not been studied in the context of vertebrate craniofacial development. The craniofacial tissues affected in patients with this deletion all derive from the same embryonic precursor, the cranial neural crest, leading to the hypothesis that one or more of the 16p12.1 genes may be involved in regulating neural crest cell (NCC)-related processes. To examine this, we characterized the developmental role of the 16p12.1-affected gene orthologs, polr3e, mosmo, uqcrc2, and cdr2, during craniofacial morphogenesis in the vertebrate model system, Xenopus laevis. While the currently-known cellular functions of these genes are diverse, we find that they share similar expression patterns along the neural tube, pharyngeal arches, and later craniofacial structures. As these genes show co-expression in the pharyngeal arches where NCCs reside, we sought to elucidate the effect of individual gene depletion on craniofacial development and NCC migration. We find that reduction of several 16p12.1 genes significantly disrupts craniofacial and cartilage formation, pharyngeal arch migration, as well as NCC specification and motility. 
Thus, we have determined that some of these genes play an essential role during vertebrate craniofacial patterning by regulating specific processes during NCC development, which may be an underlying mechanism contributing to the craniofacial defects associated with the 16p12.1 deletion.
Affiliation(s)
- Micaela Lasser
- Department of Biology, Boston College, Chestnut Hill, MA, United States
| | - Jessica Bolduc
- Department of Biology, Boston College, Chestnut Hill, MA, United States
| | - Luke Murphy
- Department of Biology, Boston College, Chestnut Hill, MA, United States
| | - Caroline O'Brien
- Department of Biology, Boston College, Chestnut Hill, MA, United States
| | - Sangmook Lee
- Department of Biology, Boston College, Chestnut Hill, MA, United States
| | - Santhosh Girirajan
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, State College, PA, United States
| | - Laura Anne Lowery
- Alfred B. Nobel Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA, United States
- *Correspondence: Laura Anne Lowery
6
Išerić H, Alkan C, Hach F, Numanagić I. Fast characterization of segmental duplication structure in multiple genome assemblies. Algorithms Mol Biol 2022; 17:4. [PMID: 35303886 PMCID: PMC8932185 DOI: 10.1186/s13015-022-00210-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 11/16/2021] [Accepted: 02/08/2022] [Indexed: 11/29/2022] Open
Abstract
MOTIVATION: The increasing availability of high-quality genome assemblies raised interest in the characterization of genomic architecture. Major architectural elements, such as common repeats and segmental duplications (SDs), increase genome plasticity that stimulates further evolution by changing the genomic structure and inventing new genes. Optimal computation of SDs within a genome requires quadratic-time local alignment algorithms that are impractical due to the size of most genomes. Additionally, to perform evolutionary analysis, one needs to characterize SDs in multiple genomes and find relations between those SDs and unique (non-duplicated) segments in other genomes. A naïve approach consisting of multiple sequence alignment would make the optimal solution to this problem even more impractical. Thus there is a need for fast and accurate algorithms to characterize SD structure in multiple genome assemblies to better understand the evolutionary forces that shaped the genomes of today. RESULTS: Here we introduce a new approach, BISER, to quickly detect SDs in multiple genomes and identify elementary SDs and core duplicons that drive the formation of such SDs. BISER improves earlier tools by (i) scaling the detection of SDs with low homology to multiple genomes while introducing further 7-33× speed-ups over the existing tools, and by (ii) characterizing elementary SDs and detecting core duplicons to help trace the evolutionary history of duplications to as far as 300 million years. AVAILABILITY AND IMPLEMENTATION: BISER is implemented in Seq programming language and is publicly available at https://github.com/0xTCG/biser .
Affiliation(s)
- Hamza Išerić
- Department of Computer Science, University of Victoria, Victoria, BC, V8P 5C2, Canada
| | - Can Alkan
- Department of Computer Engineering, Bilkent University, 06800, Ankara, Turkey
| | - Faraz Hach
- Vancouver Prostate Centre, Vancouver, BC, V6H 3Z6, Canada
- Department of Urologic Sciences, University of British Columbia, Vancouver, BC, V5Z 1M9, Canada
| | - Ibrahim Numanagić
- Department of Computer Science, University of Victoria, Victoria, BC, V8P 5C2, Canada
7
A functional genomics pipeline identifies pleiotropy and cross-tissue effects within obesity-associated GWAS loci. Nat Commun 2021; 12:5253. [PMID: 34489471 PMCID: PMC8421397 DOI: 10.1038/s41467-021-25614-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 09/04/2020] [Accepted: 08/20/2021] [Indexed: 02/07/2023] Open
Abstract
Genome-wide association studies (GWAS) have identified many disease-associated variants, yet mechanisms underlying these associations remain unclear. To understand obesity-associated variants, we generate gene regulatory annotations in adipocytes and hypothalamic neurons across cellular differentiation stages. We then test variants in 97 obesity-associated loci using a massively parallel reporter assay and identify putatively causal variants that display cell type specific or cross-tissue enhancer-modulating properties. Integrating these variants with gene regulatory information suggests genes that underlie obesity GWAS associations. We also investigate a complex genomic interval on 16p11.2 where two independent loci exhibit megabase-range, cross-locus chromatin interactions. We demonstrate that variants within these two loci regulate a shared gene set. Together, our data support a model where GWAS loci contain variants that alter enhancer activity across tissues, potentially with temporally restricted effects, to impact the expression of multiple genes. This complex model has broad implications for ongoing efforts to understand GWAS.
8
Mostovoy Y, Yilmaz F, Chow SK, Chu C, Lin C, Geiger EA, Meeks NJL, Chatfield KC, Coughlin CR, Surti U, Kwok PY, Shaikh TH. Genomic regions associated with microdeletion/microduplication syndromes exhibit extreme diversity of structural variation. Genetics 2021; 217:6066166. [PMID: 33724415 DOI: 10.1093/genetics/iyaa038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/05/2020] [Accepted: 12/18/2020] [Indexed: 11/12/2022] Open
Abstract
Segmental duplications (SDs) are a class of long, repetitive DNA elements whose paralogs share a high level of sequence similarity with each other. SDs mediate chromosomal rearrangements that lead to structural variation in the general population as well as genomic disorders associated with multiple congenital anomalies, including the 7q11.23 (Williams-Beuren Syndrome, WBS), 15q13.3, and 16p12.2 microdeletion syndromes. Population-level characterization of SDs has generally been lacking because most techniques used for analyzing these complex regions are both labor and cost intensive. In this study, we have used a high-throughput technique to genotype complex structural variation with a single molecule, long-range optical mapping approach. We characterized SDs and identified novel structural variants (SVs) at 7q11.23, 15q13.3, and 16p12.2 using optical mapping data from 154 phenotypically normal individuals from 26 populations comprising five super-populations. We detected several novel SVs for each locus, some of which had significantly different prevalence between populations. Additionally, we localized the microdeletion breakpoints to specific paralogous duplicons located within complex SDs in two patients with WBS, one patient with 15q13.3, and one patient with 16p12.2 microdeletion syndromes. The population-level data presented here highlights the extreme diversity of large and complex SVs within SD-containing regions. The approach we outline will greatly facilitate the investigation of the role of inter-SD structural variation as a driver of chromosomal rearrangements and genomic disorders.
Affiliation(s)
- Yulia Mostovoy
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Feyza Yilmaz
- Department of Integrative Biology, University of Colorado Denver, Denver, CO 80204, USA
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Stephen K Chow
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Catherine Chu
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Chin Lin
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Elizabeth A Geiger
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Naomi J L Meeks
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Kathryn C Chatfield
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Department of Pediatrics, Section of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Curtis R Coughlin
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
- Department of Dermatology, UCSF School of Medicine, San Francisco, CA 94143, USA
- Institute for Human Genetics, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Tamim H Shaikh
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
9
Laurent S, Gehrig C, Nouspikel T, Amr SS, Oza A, Murphy E, Vannier A, Béna FS, Carminho-Rodrigues MT, Blouin JL, Cao Van H, Abramowicz M, Paoloni-Giacobino A, Guipponi M. Molecular characterization of pathogenic OTOA gene conversions in hearing loss patients. Hum Mutat 2021; 42:373-377. [PMID: 33492714 DOI: 10.1002/humu.24167] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 09/16/2020] [Revised: 12/02/2020] [Accepted: 12/16/2020] [Indexed: 11/11/2022]
Abstract
Bi-allelic loss-of-function variants of OTOA are a well-known cause of moderate-to-severe hearing loss. Whereas non-allelic homologous recombination-mediated deletions of the gene are well known, gene conversions to pseudogene OTOAP1 have been reported in the literature but never fully described nor their pathogenicity assessed. Here, we report two unrelated patients with moderate hearing-loss, who were compound heterozygotes for a converted allele and a deletion of OTOA. The conversions were initially detected through sequencing depths anomalies at the OTOA locus after exome sequencing, then confirmed with long range polymerase chain reactions. Both conversions lead to loss-of-function by introducing a premature stop codon in exon 22 (p.Glu787*). Using genomic alignments and long read nanopore sequencing, we found that the two probands carry stretches of converted DNA of widely different lengths (at least 9 kbp and around 900 bp, respectively).
Affiliation(s)
- Sacha Laurent
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
| | - Corinne Gehrig
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
| | - Thierry Nouspikel
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
| | - Sami S Amr
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, Massachusetts, USA
| | - Andrea Oza
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, Massachusetts, USA
| | - Elissa Murphy
- Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, Massachusetts, USA
| | - Anne Vannier
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
| | - Frédérique Sloan Béna
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | | | - Jean-Louis Blouin
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Hélène Cao Van
- Department of Otorhinolaryngology, Head and Neck Surgery, Pediatric Otolaryngology Unit, University Hospitals of Geneva, Geneva, Switzerland
| | - Marc Abramowicz
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Ariane Paoloni-Giacobino
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Michel Guipponi
- Department of Diagnostic, Genetic Medicine Unit, University Hospitals of Geneva, Geneva, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
10
Single-cell strand sequencing of a macaque genome reveals multiple nested inversions and breakpoint reuse during primate evolution. Genome Res 2020; 30:1680-1693. [PMID: 33093070 PMCID: PMC7605249 DOI: 10.1101/gr.265322.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 04/30/2020] [Accepted: 09/02/2020] [Indexed: 12/14/2022]
Abstract
Rhesus macaque is an Old World monkey that shared a common ancestor with human ∼25 Myr ago and is an important animal model for human disease studies. A deep understanding of its genetics is therefore required for both biomedical and evolutionary studies. Among structural variants, inversions represent a driving force in speciation and play an important role in disease predisposition. Here we generated a genome-wide map of inversions between human and macaque, combining single-cell strand sequencing with cytogenetics. We identified 375 total inversions between 859 bp and 92 Mbp, increasing by eightfold the number of previously reported inversions. Among these, 19 inversions flanked by segmental duplications overlap with recurrent copy number variants associated with neurocognitive disorders. Evolutionary analyses show that in 17 out of 19 cases, the Hominidae orientation of these disease-associated regions is always derived. This suggests that duplicated sequences likely played a fundamental role in generating inversions in humans and great apes, creating architectures that nowadays predispose these regions to disease-associated genetic instability. Finally, we identified 861 genes mapping at 156 inversions breakpoints, with some showing evidence of differential expression in human and macaque cell lines, thus highlighting candidates that might have contributed to the evolution of species-specific features. This study depicts the most accurate fine-scale map of inversions between human and macaque using a two-pronged integrative approach, such as single-cell strand sequencing and cytogenetics, and represents a valuable resource toward understanding of the biology and evolution of primate species.
11
Cantsilieris S, Sunkin SM, Johnson ME, Anaclerio F, Huddleston J, Baker C, Dougherty ML, Underwood JG, Sulovari A, Hsieh P, Mao Y, Catacchio CR, Malig M, Welch AE, Sorensen M, Munson KM, Jiang W, Girirajan S, Ventura M, Lamb BT, Conlon RA, Eichler EE. An evolutionary driver of interspersed segmental duplications in primates. Genome Biol 2020; 21:202. [PMID: 32778141 PMCID: PMC7419210 DOI: 10.1186/s13059-020-02074-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 12/03/2019] [Accepted: 06/08/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The complex interspersed pattern of segmental duplications in humans is responsible for rearrangements associated with neurodevelopmental disease, including the emergence of novel genes important in human brain evolution. We investigate the evolution of LCR16a, a putative driver of this phenomenon that encodes one of the most rapidly evolving human-ape gene families, nuclear pore interacting protein (NPIP). RESULTS Comparative analysis shows that LCR16a has independently expanded in five primate lineages over the last 35 million years of primate evolution. The expansions are associated with independent lineage-specific segmental duplications flanking LCR16a leading to the emergence of large interspersed duplication blocks at non-orthologous chromosomal locations in each primate lineage. The intron-exon structure of the NPIP gene family has changed dramatically throughout primate evolution with different branches showing characteristic gene models yet maintaining an open reading frame. In the African ape lineage, we detect signatures of positive selection that occurred after a transition to more ubiquitous expression among great ape tissues when compared to Old World and New World monkeys. Mouse transgenic experiments from baboon and human genomic loci confirm these expression differences and suggest that the broader ape expression pattern arose due to mutational changes that emerged in cis. CONCLUSIONS LCR16a promotes serial interspersed duplications and creates hotspots of genomic instability that appear to be an ancient property of primate genomes. Dramatic changes to NPIP gene structure and altered tissue expression preceded major bouts of positive selection in the African ape lineage, suggestive of a gene undergoing strong adaptive evolution.
Affiliation(s)
- Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Present Address: Centre for Eye Research Australia, Department of Surgery (Ophthalmology), University of Melbourne, Royal Victorian Eye and Ear Hospital, East Melbourne, VIC, 3002, Australia
| | | | - Matthew E Johnson
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Fabio Anaclerio
- Department of Biology-Genetics, University of Bari, Bari, Italy
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, 98195, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Max L Dougherty
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Jason G Underwood
- Pacific Biosciences (PacBio) of California, Incorporated, Menlo Park, CA, 94025, USA
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Yafei Mao
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | | | - Maika Malig
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Present Address: Department of Molecular and Cellular Biology, University of California, Davis, CA, 95616, USA
- Present Address: Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, 95616, USA
| | - AnneMarie E Welch
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Present Address: Brain and Mitochondrial Research, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC, Australia
| | - Melanie Sorensen
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Weihong Jiang
- Case Transgenic and Targeting Facility, Department of Genetics and Genome Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Santhosh Girirajan
- Department of Biochemistry and Molecular Biology, Department of Anthropology, Pennsylvania State University, University Park, PA, 16802, USA
| | - Mario Ventura
- Department of Biology-Genetics, University of Bari, Bari, Italy
| | - Bruce T Lamb
- Stark Neurosciences Research Institute, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Ronald A Conlon
- Case Transgenic and Targeting Facility, Department of Genetics and Genome Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA.
- Howard Hughes Medical Institute, University of Washington School of Medicine, 3720 15th Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA.
|
12
|
Lengyel A, Pinti É, Pikó H, Jávorszky E, David D, Tihanyi M, Gönczi É, Kiss E, Tóth Z, Tory K, Fekete G, Haltrich I. Clinical and genetic findings in Hungarian pediatric patients carrying chromosome 16p copy number variants and a review of the literature. Eur J Med Genet 2020; 63:104027. [PMID: 32758661 DOI: 10.1016/j.ejmg.2020.104027] [Received: 01/07/2020] [Revised: 07/10/2020] [Accepted: 07/25/2020]
Abstract
The short arm of chromosome 16 (16p) is enriched for segmental duplications, making it susceptible to recurrent, reciprocal rearrangements implicated in the etiology of several phenotypes, including intellectual disability, speech disorders, developmental coordination disorder, autism spectrum disorders, attention deficit hyperactivity disorders, obesity and congenital skeletal disorders. In our clinical study 73 patients were analyzed by chromosomal microarray, and results were confirmed by fluorescence in situ hybridization or polymerase chain reaction. All patients underwent detailed clinical evaluation, with special emphasis on behavioral symptoms. 16p rearrangements were identified in 10 individuals. We found six pathogenic deletions and duplications of the recurrent regions within 16p11.2: one patient had a deletion of the distal 16p11.2 region associated with obesity, while four individuals had duplications, and one patient a deletion of the proximal 16p11.2 region. The other four patients carried 16p variations as second-site genomic alterations, acting as possible modifying genetic factors. We present the phenotypic and genotypic results of our patients and discuss our findings in relation to the available literature.
Affiliation(s)
- Anna Lengyel
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary.
| | - Éva Pinti
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Henriett Pikó
- I Department of Internal Medicine, Semmelweis University, Budapest, Hungary
| | - Eszter Jávorszky
- I Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Dezső David
- Department of Human Genetics, National Health Institute Dr. Ricardo Jorge, Lisbon, Portugal
| | - Mariann Tihanyi
- Department of Genetics, Zala County Hospital, Zalaegerszeg, Hungary
| | - Éva Gönczi
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Eszter Kiss
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Zsuzsa Tóth
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Kálmán Tory
- I Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - György Fekete
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Irén Haltrich
- II Department of Pediatrics, Semmelweis University, Budapest, Hungary
|
13
|
Optical mapping of the 22q11.2DS region reveals complex repeat structures and preferred locations for non-allelic homologous recombination (NAHR). Sci Rep 2020; 10:12235. [PMID: 32699385 PMCID: PMC7376033 DOI: 10.1038/s41598-020-69134-4] [Received: 05/06/2020] [Accepted: 07/03/2020]
Abstract
The most prevalent microdeletion in humans occurs at 22q11.2, a region rich in chromosome-specific low copy repeats (LCR22s). The structure of this region has defied elucidation due to its size, regional complexity, and haplotype diversity, and is not well represented in the human genome reference. Most individuals with 22q11.2 deletion syndrome (22q11.2DS) carry a de novo hemizygous deletion of ~3 Mbp occurring by non-allelic homologous recombination (NAHR) mediated by LCR22s. In this study, optical mapping has been used to elucidate LCR22 structure and variation in 88 individuals in thirty 22q11.2DS families to uncover potential risk factors for germline rearrangements leading to 22q11.2DS offspring. Families were optically mapped to characterize LCR22 structures, NAHR locations, and genomic signatures associated with the deletion. Bioinformatics analyses revealed clear delineations between LCR22 structures in normal and deletion-containing haplotypes. Despite no explicit whole-haplotype predisposing configurations being identified, all NAHR events contain a segmental duplication encompassing FAM230 gene members suggesting preferred recombination sequences. Analysis of deletion breakpoints indicates that preferred recombinations occur between FAM230 and specific segmental duplication orientations within LCR22A and LCR22D, ultimately leading to NAHR. This work represents the most comprehensive analysis of 22q11.2DS NAHR events demonstrating completely contiguous LCR22 structures surrounding and within deletion breakpoints.
|
14
|
Hsieh P, Vollger MR, Dang V, Porubsky D, Baker C, Cantsilieris S, Hoekzema K, Lewis AP, Munson KM, Sorensen M, Kronenberg ZN, Murali S, Nelson BJ, Chiatante G, Maggiolini FAM, Blanché H, Underwood JG, Antonacci F, Deleuze JF, Eichler EE. Adaptive archaic introgression of copy number variants and the discovery of previously unknown human genes. Science 2019; 366:eaax2083. [PMID: 31624180 DOI: 10.1126/science.aax2083] [Received: 03/03/2019] [Revised: 07/05/2019] [Accepted: 09/12/2019]
Abstract
Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.
Affiliation(s)
- PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Vy Dang
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Melanie Sorensen
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Zev N Kronenberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Shwetha Murali
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Giorgia Chiatante
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari, Italy
| | | | - Hélène Blanché
- Fondation Jean Dausset-Centre d'Etude du Polymorphisme Humain, Paris, France
| | - Jason G Underwood
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.,Pacific Biosciences (PacBio) of California, Inc., Menlo Park, CA, USA
| | - Francesca Antonacci
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari, Italy
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
|
15
|
Tassano E, Ronchetto P, Calcagno A, Fiorio P, Gimelli G, Capra V, Scala M. 'Distal 16p12.2 microdeletion' in a patient with autosomal recessive deafness-22. J Genet 2019; 98:56. [PMID: 31204719]
Abstract
The 16p12.2 chromosome band contains three large segmental duplications: BP1, BP2 and BP3, providing a substrate for recombination and recurrent chromosomal rearrangements. The '16p12.2 microdeletion' is a recurrent deletion comprised between BP2 and BP3, associated with variable clinical findings. We identified a heterozygous 16p12.2 microdeletion spanning between BP1 and BP2 in a child evaluated for short stature and mild dyslexia. Unexpectedly, the mother carried the same deletion in the homozygous state and suffered from severe hearing loss. Detailed family history revealed consanguinity of the maternal grandparents. The 16p12.2 microdeletion is a rare condition and contains only three genes: METTL9, IGSF6 and OTOA of which the OTOA is considered responsible for DFNB22 hearing loss (MIM: 607039) under its homozygous condition. A number of OTOA mutations have been described, whereas very few cases of a 16p12.2 microdeletion similar to that observed in our family have been reported. In conclusion, we describe a rare 'distal 16p12.2 microdeletion' widening the phenotypic spectrum associated with the recurrent 16p12.2 microdeletion and support the causative role of OTOA microdeletion in hearing impairment.
Affiliation(s)
- Elisa Tassano
- Laboratorio di Citogenetica, Istituto Giannina Gaslini, 16147 Genova, Italy.
|
16
|
|
17
|
Maggiolini FAM, Cantsilieris S, D’Addabbo P, Manganelli M, Coe BP, Dumont BL, Sanders AD, Pang AWC, Vollger MR, Palumbo O, Palumbo P, Accadia M, Carella M, Eichler EE, Antonacci F. Genomic inversions and GOLGA core duplicons underlie disease instability at the 15q25 locus. PLoS Genet 2019; 15:e1008075. [PMID: 30917130 PMCID: PMC6436712 DOI: 10.1371/journal.pgen.1008075] [Received: 10/10/2018] [Accepted: 03/07/2019]
Abstract
Human chromosome 15q25 is involved in several disease-associated structural rearrangements, including microdeletions and chromosomal markers with inverted duplications. Using comparative fluorescence in situ hybridization, strand-sequencing, single-molecule, real-time sequencing and Bionano optical mapping analyses, we investigated the organization of the 15q25 region in human and nonhuman primates. We found that two independent inversions occurred in this region after the fission event that gave rise to phylogenetic chromosomes XIV and XV in humans and great apes. One of these inversions is still polymorphic in the human population today and may confer differential susceptibility to 15q25 microdeletions and inverted duplications. The inversion breakpoints map within segmental duplications containing core duplicons of the GOLGA gene family and correspond to the site of an ancestral centromere, which became inactivated about 25 million years ago. The inactivation of this centromere likely released segmental duplications from recombination repression typical of centromeric regions. We hypothesize that this increased the frequency of ectopic recombination creating a hotspot of hominid inversions where dispersed GOLGA core elements now predispose this region to recurrent genomic rearrangements associated with disease.
Affiliation(s)
| | - Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, United States of America
| | - Pietro D’Addabbo
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Michele Manganelli
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Bradley P. Coe
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, United States of America
| | - Beth L. Dumont
- The Jackson Laboratory, Bar Harbor, ME, United States of America
| | - Ashley D. Sanders
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, Heidelberg, Germany
| | | | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, United States of America
| | - Orazio Palumbo
- Medical Genetics Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo (FG), Italy
| | - Pietro Palumbo
- Medical Genetics Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo (FG), Italy
| | - Maria Accadia
- Medical Genetics Service, Hospital “Cardinale G. Panico”, Via San Pio X n°4, Tricase, LE, Italy
| | - Massimo Carella
- Medical Genetics Unit, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo (FG), Italy
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, United States of America
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, United States of America
| | - Francesca Antonacci
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
|
18
|
Liu J, Zhou Y, Liu S, Song X, Yang XZ, Fan Y, Chen W, Akdemir ZC, Yan Z, Zuo Y, Du R, Liu Z, Yuan B, Zhao S, Liu G, Chen Y, Zhao Y, Lin M, Zhu Q, Niu Y, Liu P, Ikegawa S, Song YQ, Posey JE, Qiu G, Zhang F, Wu Z, Lupski JR, Wu N. The coexistence of copy number variations (CNVs) and single nucleotide polymorphisms (SNPs) at a locus can result in distorted calculations of the significance in associating SNPs to disease. Hum Genet 2018; 137:553-567. [PMID: 30019117 DOI: 10.1007/s00439-018-1910-3] [Received: 03/18/2018] [Accepted: 07/07/2018]
Abstract
With the recent advance in genome-wide association studies (GWAS), disease-associated single nucleotide polymorphisms (SNPs) and copy number variants (CNVs) have been extensively reported. Accordingly, the issue of incorrect identification of recombination events that can induce the distortion of multi-allelic or hemizygous variants has received more attention. However, the potential distorted calculation bias or significance of a detected association in a GWAS due to the coexistence of CNVs and SNPs in the same genomic region may remain under-recognized. Here we performed the association study within a congenital scoliosis (CS) cohort whose genetic etiology was recently elucidated as a compound inheritance model, including mostly one rare variant deletion CNV null allele and one common variant non-coding hypomorphic haplotype of the TBX6 gene. We demonstrated that the existence of a deletion in TBX6 led to an overestimation of the contribution of the SNPs on the hypomorphic allele. Furthermore, we generalized a model to explain the calculation bias, or distorted significance calculation for an association study, that can be 'induced' by CNVs at a locus. Meanwhile, overlapping between the disease-associated SNPs from published GWAS and common CNVs (overlap 10%) and pathogenic/likely pathogenic CNVs (overlap 99.69%) was significantly higher than the random distribution (p < 1 × 10⁻⁶ and p = 0.034, respectively), indicating that such co-existence of CNV and SNV alleles might generally influence data interpretation and potential outcomes of a GWAS. We also verified and assessed the influence of colocalizing CNVs to the detection sensitivity of disease-associated SNP variant alleles in another adolescent idiopathic scoliosis (AIS) genome-wide association study. We proposed that detecting co-existent CNVs when evaluating the association signals between SNPs and disease traits could improve genetic model analyses and better integrate GWAS with robust Mendelian principles.
Affiliation(s)
- Jiaqi Liu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Breast Surgical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
| | - Yangzhong Zhou
- Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Internal Medicine, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Sen Liu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Xiaofei Song
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Xin-Zhuang Yang
- Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Yanhui Fan
- School of Biomedical Sciences, The University of Hong Kong, Hong Kong, China
| | - Weisheng Chen
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Zeynep Coban Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Zihui Yan
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Yuzhi Zuo
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Renqian Du
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Zhenlei Liu
- Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China
| | - Bo Yuan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Sen Zhao
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Gang Liu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Yixin Chen
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Yanxue Zhao
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Mao Lin
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Qiankun Zhu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Yuchen Niu
- Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China.,Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Shiro Ikegawa
- Laboratory of Bone and Joint Diseases, Center for Integrative Medical Sciences, RIKEN, Tokyo, 108-8639, Japan
| | - You-Qiang Song
- School of Biomedical Sciences, The University of Hong Kong, Hong Kong, China
| | - Jennifer E Posey
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Guixing Qiu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
| | | | - Feng Zhang
- Obstetrics and Gynecology Hospital, Institute of Reproduction and Development, Fudan University, Shanghai, 200433, China.,Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Zhihong Wu
- Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China.,Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA.,Texas Children's Hospital, Houston, TX, 77030, USA
| | - Nan Wu
- Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
|
19
|
Catacchio CR, Maggiolini FAM, D'Addabbo P, Bitonto M, Capozzi O, Lepore Signorile M, Miroballo M, Archidiacono N, Eichler EE, Ventura M, Antonacci F. Inversion variants in human and primate genomes. Genome Res 2018; 28:910-920. [PMID: 29776991 PMCID: PMC5991517 DOI: 10.1101/gr.234831.118] [Received: 01/19/2018] [Accepted: 04/26/2018]
Abstract
For many years, inversions have been proposed to be a direct driving force in speciation since they suppress recombination when heterozygous. Inversions are the most common large-scale differences among humans and great apes. Nevertheless, they represent large events easily distinguishable by classical cytogenetics, whose resolution, however, is limited. Here, we performed a genome-wide comparison between human, great ape, and macaque genomes using the net alignments for the most recent releases of genome assemblies. We identified a total of 156 putative inversions, between 103 kb and 91 Mb, corresponding to 136 human loci. Combining literature, sequence, and experimental analyses, we analyzed 109 of these loci and found 67 regions inverted in one or multiple primates, including 28 newly identified inversions. These events overlap with 81 human genes at their breakpoints, and seven correspond to sites of recurrent rearrangements associated with human disease. This work doubles the number of validated primate inversions larger than 100 kb, beyond what was previously documented. We identified 74 sites of errors, where the sequence has been assembled in the wrong orientation, in the reference genomes analyzed. Our data serve two purposes: First, we generated a map of evolutionary inversions in these genomes representing a resource for interrogating differences among these species at a functional level; second, we provide a list of misassembled regions in these primate genomes, involving over 300 Mb of DNA and 1978 human genes. Accurately annotating these regions in the genome references has immediate applications for evolutionary and biomedical studies on primates.
Affiliation(s)
- Pietro D'Addabbo
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
- Miriana Bitonto
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
- Oronzo Capozzi
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
- Mattia Miroballo
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
- Mario Ventura
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
- Francesca Antonacci
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari 70125, Italy
20
Hillmer M, Summerer A, Mautner VF, Högel J, Cooper DN, Kehrer-Sawatzki H. Consideration of the haplotype diversity at nonallelic homologous recombination hotspots improves the precision of rearrangement breakpoint identification. Hum Mutat 2017; 38:1711-1722. [PMID: 28862369 DOI: 10.1002/humu.23319] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Received: 06/26/2017] [Revised: 08/07/2017] [Accepted: 08/26/2017] [Indexed: 01/30/2023]
Abstract
Precise characterization of nonallelic homologous recombination (NAHR) breakpoints is key to identifying those features that influence NAHR frequency. Until now, analysis of NAHR-mediated rearrangements has generally been performed by comparison of the breakpoint-spanning sequences with the human genome reference sequence. We show here that the haplotype diversity of NAHR hotspots may interfere with breakpoint-mapping. We studied the transmitting parents of individuals with germline type-1 NF1 deletions mediated by NAHR within the paralogous recombination site 1 (PRS1) or paralogous recombination site 2 (PRS2) hotspots. Several parental wild-type PRS1 and PRS2 haplotypes were identified that exhibited considerable sequence differences with respect to the reference sequence, which also affected the number of predicted PRDM9-binding sites. Sequence comparisons between the parental wild-type PRS1 or PRS2 haplotypes and the deletion breakpoint-spanning sequences from the patients (method #2) turned out to be an accurate means to assign NF1 deletion breakpoints and proved superior to crude reference sequence comparisons that neglect to consider haplotype diversity (method #1). The mean length of the deletion breakpoint regions assigned by method #2 was 269-bp in contrast to 502-bp by method #1. Our findings imply that paralog-specific haplotype diversity of NAHR hotspots (such as PRS2) and population-specific haplotype diversity must be taken into account in order to accurately ascertain NAHR-mediated rearrangement breakpoints.
Affiliation(s)
- Morten Hillmer
- Institute of Human Genetics, University of Ulm, Ulm, Germany
- Anna Summerer
- Institute of Human Genetics, University of Ulm, Ulm, Germany
- Victor-Felix Mautner
- Department of Neurology, University Hospital Hamburg Eppendorf, Hamburg, Germany
- Josef Högel
- Institute of Human Genetics, University of Ulm, Ulm, Germany
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
21
Chromosomal contacts connect loci associated with autism, BMI and head circumference phenotypes. Mol Psychiatry 2017; 22:836-849. [PMID: 27240531 PMCID: PMC5508252 DOI: 10.1038/mp.2016.84] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Received: 11/04/2015] [Revised: 03/18/2016] [Accepted: 04/18/2016] [Indexed: 12/20/2022]
Abstract
Copy number variants (CNVs) are major contributors to genomic imbalance disorders. Phenotyping of 137 unrelated deletion and reciprocal duplication carriers of the distal 16p11.2 220 kb BP2-BP3 interval showed that these rearrangements are associated with autism spectrum disorders and mirror phenotypes of obesity/underweight and macrocephaly/microcephaly. Such phenotypes were previously associated with rearrangements of the non-overlapping proximal 16p11.2 600 kb BP4-BP5 interval. These two CNV-prone regions at 16p11.2 are reciprocally engaged in complex chromatin looping, as successfully confirmed by 4C-seq, fluorescence in situ hybridization and Hi-C, as well as coordinated expression and regulation of encompassed genes. We observed that genes differentially expressed in 16p11.2 BP4-BP5 CNV carriers are concomitantly modified in their chromatin interactions, suggesting that disruption of chromatin interplays could participate in the observed phenotypes. We also identified cis- and trans-acting chromatin contacts to other genomic regions previously associated with analogous phenotypes. For example, we uncovered that individuals with reciprocal rearrangements of the trans-contacted 2p15 locus similarly display mirror phenotypes on head circumference and weight. Our results indicate that chromosomal contacts' maps could uncover functionally and clinically related genes.
22
Dougherty ML, Nuttle X, Penn O, Nelson BJ, Huddleston J, Baker C, Harshman L, Duyzend MH, Ventura M, Antonacci F, Sandstrom R, Dennis MY, Eichler EE. The birth of a human-specific neural gene by incomplete duplication and gene fusion. Genome Biol 2017; 18:49. [PMID: 28279197 PMCID: PMC5345166 DOI: 10.1186/s13059-017-1163-9] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Received: 10/18/2016] [Accepted: 01/27/2017] [Indexed: 01/13/2023]
Abstract
BACKGROUND: Gene innovation by duplication is a fundamental evolutionary process but is difficult to study in humans due to the large size, high sequence identity, and mosaic nature of segmental duplication blocks. The human-specific gene hydrocephalus-inducing 2, HYDIN2, was generated by a 364 kbp duplication of 79 internal exons of the large ciliary gene HYDIN from chromosome 16q22.2 to chromosome 1q21.1. Because the HYDIN2 locus lacks the ancestral promoter and seven terminal exons of the progenitor gene, we sought to characterize transcription at this locus by coupling reverse transcription polymerase chain reaction and long-read sequencing. RESULTS: 5' RACE indicates a transcription start site for HYDIN2 outside of the duplication and we observe fusion transcripts spanning both the 5' and 3' breakpoints. We observe extensive splicing diversity leading to the formation of altered open reading frames (ORFs) that appear to be under relaxed selection. We show that HYDIN2 adopted a new promoter that drives an altered pattern of expression, with highest levels in neural tissues. We estimate that the HYDIN duplication occurred ~3.2 million years ago and find that it is nearly fixed (99.9%) for diploid copy number in contemporary humans. Examination of 73 chromosome 1q21 rearrangement patients reveals that HYDIN2 is deleted or duplicated in most cases. CONCLUSIONS: Together, these data support a model of rapid gene innovation by fusion of incomplete segmental duplications, altered tissue expression, and potential subfunctionalization or neofunctionalization of HYDIN2 early in the evolution of the Homo lineage.
Affiliation(s)
- Max L Dougherty
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Xander Nuttle
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Osnat Penn
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- John Huddleston
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA
- Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Lana Harshman
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Michael H Duyzend
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Mario Ventura
- Department of Biology, University of Bari, Bari, 70121, Italy
- Megan Y Dennis
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, 95616, CA, USA
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15 Ave NE, S413C, Box 355065, Seattle, WA, 98195-5065, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
23
Dennis MY, Harshman L, Nelson BJ, Penn O, Cantsilieris S, Huddleston J, Antonacci F, Penewit K, Denman L, Raja A, Baker C, Mark K, Malig M, Janke N, Espinoza C, Stessman HAF, Nuttle X, Hoekzema K, Lindsay-Graves TA, Wilson RK, Eichler EE. The evolution and population diversity of human-specific segmental duplications. Nat Ecol Evol 2017; 1:69. [PMID: 28580430 PMCID: PMC5450946 DOI: 10.1038/s41559-016-0069] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Indexed: 12/21/2022]
Abstract
Segmental duplications contribute to human evolution, adaptation and genomic instability but are often poorly characterized. We investigate the evolution, genetic variation and coding potential of human-specific segmental duplications (HSDs). We identify 218 HSDs based on analysis of 322 deeply sequenced archaic and contemporary hominid genomes. We sequence 550 human and nonhuman primate genomic clones to reconstruct the evolution of the largest, most complex regions with protein-coding potential (n=80 genes/33 gene families). We show that HSDs are non-randomly organized, associate preferentially with ancestral ape duplications termed “core duplicons”, and evolved primarily in an interspersed inverted orientation. In addition to Homo sapiens-specific gene expansions (e.g., TCAF1/2), we highlight ten gene families (e.g., ARHGAP11B and SRGAP2C) where copy number never returns to the ancestral state, there is evidence of mRNA splicing, and no common gene-disruptive mutations are observed in the general population. Such duplicates are candidates for the evolution of human-specific adaptive traits.
Affiliation(s)
- Megan Y Dennis
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA 95616, USA; Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Lana Harshman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Osnat Penn
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- John Huddleston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
- Francesca Antonacci
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro", Bari 70125, Italy
- Kelsi Penewit
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Laura Denman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Archana Raja
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
- Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Kenneth Mark
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Maika Malig
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Nicolette Janke
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Claudia Espinoza
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Holly A F Stessman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Xander Nuttle
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Tina A Lindsay-Graves
- McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, MO 63108, USA
- Richard K Wilson
- McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, MO 63108, USA
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
24
Eslami Rasekh M, Chiatante G, Miroballo M, Tang J, Ventura M, Amemiya CT, Eichler EE, Antonacci F, Alkan C. Discovery of large genomic inversions using long range information. BMC Genomics 2017; 18:65. [PMID: 28073353 PMCID: PMC5223412 DOI: 10.1186/s12864-016-3444-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 09/20/2016] [Accepted: 12/19/2016] [Indexed: 12/20/2022]
Abstract
BACKGROUND: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. RESULTS: Here we propose a novel algorithm, VALOR, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of VALOR using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of VALOR against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. CONCLUSIONS: In this paper, we show that VALOR is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using VALOR, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. VALOR is available at https://github.com/BilkentCompGen/VALOR.
Affiliation(s)
- Marzieh Eslami Rasekh
- Department of Computer Engineering, Bilkent University, Bilkent, 06800, Ankara, Turkey
- Giorgia Chiatante
- Department of Biology, University of Bari, Via Orabona 4, 70125, Bari, Italy
- Mattia Miroballo
- Department of Biology, University of Bari, Via Orabona 4, 70125, Bari, Italy
- Joyce Tang
- Benaroya Research Institute, 1201 Ninth Avenue, 98101, Seattle, WA, USA
- Mario Ventura
- Department of Biology, University of Bari, Via Orabona 4, 70125, Bari, Italy
- Chris T Amemiya
- Benaroya Research Institute, 1201 Ninth Avenue, 98101, Seattle, WA, USA
- Evan E Eichler
- Department of Genome Sciences and Howard Hughes Medical Institute, University of Washington, 3720 15th Avenue NE, 98195, Seattle, WA, USA
- Francesca Antonacci
- Department of Biology, University of Bari, Via Orabona 4, 70125, Bari, Italy.
- Can Alkan
- Department of Computer Engineering, Bilkent University, Bilkent, 06800, Ankara, Turkey.
25
An exploratory study of predisposing genetic factors for DiGeorge/velocardiofacial syndrome. Sci Rep 2017; 7:40031. [PMID: 28059126 PMCID: PMC5216377 DOI: 10.1038/srep40031] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Received: 06/22/2016] [Accepted: 12/01/2016] [Indexed: 12/13/2022]
Abstract
DiGeorge/velocardiofacial syndrome (DGS/VCFS) is a disorder caused by a 22q11.2 deletion mediated by non-allelic homologous recombination (NAHR) between low-copy repeats (LCRs). We have evaluated the role of LCR22 genomic architecture and PRDM9 variants as DGS/VCFS predisposing factors. We applied FISH using fosmid probes on chromatin fibers to analyze the number of tandem repeat blocks in LCR22 in two DGS/VCFS fathers-of-origin with proven 22q11.2 NAHR susceptibility. Results revealed copy number variations (CNVs) of L9 and K3 fosmids in these individuals compared to controls. The total number of L9 and K3 copies was also characterized using droplet digital PCR (ddPCR). Although we were unable to confirm variations, we detected an additional L9 amplicon corresponding to a pseudogene. Moreover, none of the eight DGS/VCFS parents-of-origin was heterozygous for the inv(22)(q11.2) haplotype. PRDM9 sequencing showed equivalent allelic distributions between DGS/VCFS parents-of-origin and controls, although a new PRDM9 allele (L50) was identified in one case. Our results support the hypothesis that LCR22 variations influence 22q11.2 NAHR events; however, further studies are needed to confirm this association and clarify the contribution of pseudogenes and rare PRDM9 alleles to NAHR susceptibility.
26
Dennis MY, Eichler EE. Human adaptation and evolution by segmental duplication. Curr Opin Genet Dev 2016; 41:44-52. [PMID: 27584858 PMCID: PMC5161654 DOI: 10.1016/j.gde.2016.08.001] [Citation(s) in RCA: 114] [Impact Index Per Article: 14.3] [Received: 05/09/2016] [Revised: 07/02/2016] [Accepted: 08/02/2016] [Indexed: 12/29/2022]
Abstract
Duplications are the primary force by which new gene functions arise and provide a substrate for large-scale structural variation. Analysis of thousands of genomes shows that humans and great apes have more genetic differences in content and structure over recent segmental duplications than any other euchromatic region. Novel human-specific duplicated genes, ARHGAP11B and SRGAP2C, have recently been described with a potential role in neocortical expansion and increased neuronal spine density. Large segmental duplications and the structural variants they promote are also frequently stratified between human populations with a subset being subjected to positive selection. The impact of recent duplications on human evolution and adaptation is only beginning to be realized as new technologies enhance their discovery and accurate genotyping.
Affiliation(s)
- Megan Y Dennis
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA 95616, USA
- Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
27
Pizzo L, Andrieux J, Amor DJ, Girirajan S. Clinical utility gene card for: 16p12.2 microdeletion. Eur J Hum Genet 2016; 25:ejhg2016158. [PMID: 27848943 DOI: 10.1038/ejhg.2016.158] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 07/07/2016] [Revised: 09/20/2016] [Accepted: 10/11/2016] [Indexed: 11/09/2022]
Affiliation(s)
- Lucilla Pizzo
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
- Joris Andrieux
- Institut de Génétique Médicale, CHRU de Lille, Lille, France
- David J Amor
- Department of Paediatrics, Murdoch Childrens Research Institute, University of Melbourne, Royal Children's Hospital, Melbourne, Victoria, Australia
- Santhosh Girirajan
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA; Department of Anthropology, The Pennsylvania State University, University Park, PA, USA
28
Wang T, Guo H, Xiong B, Stessman HAF, Wu H, Coe BP, Turner TN, Liu Y, Zhao W, Hoekzema K, Vives L, Xia L, Tang M, Ou J, Chen B, Shen Y, Xun G, Long M, Lin J, Kronenberg ZN, Peng Y, Bai T, Li H, Ke X, Hu Z, Zhao J, Zou X, Xia K, Eichler EE. De novo genic mutations among a Chinese autism spectrum disorder cohort. Nat Commun 2016; 7:13316. [PMID: 27824329 PMCID: PMC5105161 DOI: 10.1038/ncomms13316] [Citation(s) in RCA: 249] [Impact Index Per Article: 31.1] [Received: 02/20/2016] [Accepted: 09/22/2016] [Indexed: 12/27/2022]
Abstract
Recurrent de novo (DN) and likely gene-disruptive (LGD) mutations contribute significantly to autism spectrum disorders (ASDs) but have been primarily investigated in European cohorts. Here, we sequence 189 risk genes in 1,543 Chinese ASD probands (1,045 from trios). We report an 11-fold increase in the odds of DN LGD mutations compared with expectation under an exome-wide neutral model of mutation. In aggregate, ∼4% of ASD patients carry a DN mutation in one of just 29 autism risk genes. The most prevalent gene for recurrent DN mutations is SCN2A (1.1% of patients) followed by CHD8, DSCAM, MECP2, POGZ, WDFY3 and ASH1L. We identify novel DN LGD recurrences (GIGYF2, MYT1L, CUL3, DOCK8 and ZNF292) and DN mutations in previous ASD candidates (ARHGAP32, NCOR1, PHIP, STXBP1, CDKL5 and SHANK1). Phenotypic follow-up confirms potential subtypes and highlights how large global cohorts might be leveraged to prove the pathogenic significance of individually rare mutations. Recurrent sporadic mutations are important risk factors for autism spectrum disorders (ASDs) but have been primarily investigated in European cohorts. Here, Eichler, Xia and colleagues analyse risk genes in a large Chinese ASD cohort and find novel recurrences of potential pathogenic significance.
Affiliation(s)
- Tianyun Wang
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Hui Guo
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China; Mental Health Institute, the Second Xiangya Hospital, Central South University, Changsha, Hunan 410011, China
- Bo Xiong
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Holly A F Stessman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Huidan Wu
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Bradley P Coe
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Tychele N Turner
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Yanling Liu
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Wenjing Zhao
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Laura Vives
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Lu Xia
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Meina Tang
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Jianjun Ou
- Mental Health Institute, the Second Xiangya Hospital, Central South University, Changsha, Hunan 410011, China
- Biyuan Chen
- Children's Development Behavior Center, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Yidong Shen
- Mental Health Institute, the Second Xiangya Hospital, Central South University, Changsha, Hunan 410011, China
- Guanglei Xun
- Mental Health Center of Shandong Province, Jinan, Shandong 250014, China
- Min Long
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Janice Lin
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Zev N Kronenberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
- Yu Peng
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Ting Bai
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Honghui Li
- Child Healthcare Department, Liuzhou Maternity and Child Healthcare Hospital, Liuzhou, Guangxi 545000, China
- Xiaoyan Ke
- Child Mental Health Research Center, Nanjing Brain Hospital Affiliated of Nanjing Medical University, Nanjing, Jiangsu 210029, China
- Zhengmao Hu
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Jingping Zhao
- Mental Health Institute, the Second Xiangya Hospital, Central South University, Changsha, Hunan 410011, China
- Xiaobing Zou
- Children's Development Behavior Center, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Kun Xia
- The State Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China; Collaborative Innovation Center for Genetics and Development, Shanghai 200433, China; Key Laboratory of Medical Information Research, Central South University, Changsha, Hunan 410013, China
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
29
Mohajeri K, Cantsilieris S, Huddleston J, Nelson BJ, Coe BP, Campbell CD, Baker C, Harshman L, Munson KM, Kronenberg ZN, Kremitzki M, Raja A, Catacchio CR, Graves TA, Wilson RK, Ventura M, Eichler EE. Interchromosomal core duplicons drive both evolutionary instability and disease susceptibility of the Chromosome 8p23.1 region. Genome Res 2016; 26:1453-1467. [PMID: 27803192 PMCID: PMC5088589 DOI: 10.1101/gr.211284.116] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Received: 06/11/2016] [Accepted: 09/12/2016] [Indexed: 12/13/2022]
Abstract
Recurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation. Using comparative sequence-based approaches, we generated a high-quality 6.3-Mbp alternate reference assembly of an inverted Chromosome 8p23.1 haplotype. Comparison with nonhuman primates reveals a 746-kbp duplicative transposition and two separate inversion events that arose in the last million years of human evolution. The breakpoints associated with these rearrangements map to an ape-specific interchromosomal core duplicon that clusters at sites of evolutionary inversion (P = 7.8 × 10⁻⁵). Refinement of microdeletion breakpoints identifies a subgroup of patients that map to the same interchromosomal core involved in the evolutionary formation of the duplication blocks. Our results define a higher-order genomic instability element that has shaped the structure of specific chromosomes during primate evolution contributing to rearrangements associated with inversion and disease.
Affiliation(s)
- Kiana Mohajeri
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - John Huddleston
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
| | - Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Bradley P Coe
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Catarina D Campbell
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Lana Harshman
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Zev N Kronenberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Milinn Kremitzki
- The McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Archana Raja
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
| | - Tina A Graves
- The McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Richard K Wilson
- The McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Mario Ventura
- Dipartimento di Biologia, Università degli Studi di Bari Aldo Moro, Bari 70125, Italy
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
|
30
|
Li Y, Zhou S, Schwartz DC, Ma J. Allele-Specific Quantification of Structural Variations in Cancer Genomes. Cell Syst 2016; 3:21-34. [PMID: 27453446 PMCID: PMC4965314 DOI: 10.1016/j.cels.2016.05.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Revised: 05/13/2016] [Accepted: 05/24/2016] [Indexed: 12/21/2022]
Abstract
Aneuploidy and structural variations (SVs) generate cancer genomes containing a mixture of rearranged genomic segments with extensive somatic copy number alterations. However, existing methods can identify either SVs or allele-specific copy number alterations, but not both simultaneously, which provides a limited view of cancer genome structure. Here we introduce Weaver, an algorithm for the quantification and analysis of allele-specific copy numbers of SVs. Weaver uses a Markov Random Field to estimate joint probabilities of allele-specific copy number of SVs and their inter-connectivity based on paired-end whole-genome sequencing data. Weaver also predicts the timing of SVs relative to chromosome amplifications. We demonstrate the accuracy of Weaver using simulations and findings from whole-genome Optical Mapping. We apply Weaver to generate allele-specific copy numbers of SVs for MCF-7 and HeLa cell lines, and identify recurrent SV patterns in 44 TCGA ovarian cancer whole-genome sequencing datasets. Our approach provides a more complete assessment of the complex genomic architectures inherent to many cancer genomes.
Affiliation(s)
- Yang Li
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - David C Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jian Ma
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
|
31
|
Glassford MR, Rosenfeld JA, Freedman AA, Zwick ME, Mulle JG. Novel features of 3q29 deletion syndrome: Results from the 3q29 registry. Am J Med Genet A 2016; 170A:999-1006. [PMID: 26738761 PMCID: PMC4849199 DOI: 10.1002/ajmg.a.37537] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Accepted: 12/09/2015] [Indexed: 11/30/2022]
Abstract
3q29 deletion syndrome is caused by a recurrent, typically de novo heterozygous 1.6 Mb deletion, but because incidence of the deletion is rare (1 in 30,000 births) the phenotype is not well described. To characterize the range of phenotypic manifestations associated with 3q29 deletion syndrome, we have developed an online registry (3q29deletion.org) for ascertainment of study subjects and phenotypic data collection via Internet‐based survey instruments. We report here on data collected during the first 18 months of registry operation, from 44 patients. This is the largest cohort of 3q29 deletion carriers ever assembled and surveyed in a systematic way. Our data reveal that 28% of registry participants report neuropsychiatric phenotypes, including anxiety disorder, panic attacks, depression, bipolar disorder, and schizophrenia. Other novel findings include a high prevalence (64%) of feeding problems in infancy and reduced weight at birth for 3q29 deletion carriers (average reduction 13.9 oz (394 g), adjusted for gestational age and sex, P = 6.5e‐07). We further report on the frequency of heart defects, autism, recurrent ear infections, gastrointestinal phenotypes, and dental phenotypes, among others. We also report on the expected timing of delayed developmental milestones. This is the most comprehensive description of the 3q29 deletion phenotype to date. These results are clinically actionable toward improving patient care for 3q29 deletion carriers, and can guide the expectations of physicians and parents. These data also demonstrate the value of patient‐reported outcomes to reveal the full phenotypic spectrum of rare genomic disorders. © 2016 The Authors. American Journal of Medical Genetics Part A Published by Wiley Periodicals, Inc.
Affiliation(s)
- Megan R Glassford
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia
| | - Jill A Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | - Alexa A Freedman
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, Georgia
| | - Michael E Zwick
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia.,Department of Pediatrics, Emory University School of Medicine, Atlanta, Georgia
| | - Jennifer G Mulle
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia.,Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, Georgia
|
32
|
Teo ASM, Verzotto D, Yao F, Nagarajan N, Hillmer AM. Single-molecule optical genome mapping of a human HapMap and a colorectal cancer cell line. Gigascience 2015; 4:65. [PMID: 26719794 PMCID: PMC4696294 DOI: 10.1186/s13742-015-0106-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 12/17/2015] [Indexed: 11/21/2022] Open
Abstract
Background Next-generation sequencing (NGS) technologies have changed our understanding of the variability of the human genome. However, the identification of genome structural variations based on NGS approaches with read lengths of 35–300 bases remains a challenge. Single-molecule optical mapping technologies allow the analysis of DNA molecules of up to 2 Mb and as such are suitable for the identification of large-scale genome structural variations, and for de novo genome assemblies when combined with short-read NGS data. Here we present optical mapping data for two human genomes: the HapMap cell line GM12878 and the colorectal cancer cell line HCT116. Findings High molecular weight DNA was obtained by embedding GM12878 and HCT116 cells, respectively, in agarose plugs, followed by DNA extraction under mild conditions. Genomic DNA was digested with KpnI and 310,000 and 296,000 DNA molecules (≥150 kb and 10 restriction fragments), respectively, were analyzed per cell line using the Argus optical mapping system. Maps were aligned to the human reference by OPTIMA, a new glocal alignment method. Genome coverage of 6.8× and 5.7× was obtained, respectively; 2.9× and 1.7× more than the coverage obtained with previously available software. Conclusions Optical mapping allows the resolution of large-scale structural variations of the genome, and the scaffold extension of NGS-based de novo assemblies. OPTIMA is an efficient new alignment method; our optical mapping data provide a resource for genome structure analyses of the human HapMap reference cell line GM12878, and the colorectal cancer cell line HCT116.
Affiliation(s)
- Audrey S M Teo
- Cancer Therapeutics and Stratified Oncology, Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672 Singapore
| | - Davide Verzotto
- Computational and Systems Biology, Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672 Singapore
| | - Fei Yao
- Cancer Therapeutics and Stratified Oncology, Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672 Singapore
| | - Niranjan Nagarajan
- Computational and Systems Biology, Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672 Singapore
| | - Axel M Hillmer
- Cancer Therapeutics and Stratified Oncology, Genome Institute of Singapore, 60 Biopolis Street, Singapore, 138672 Singapore
|
33
|
Yuan B, Liu P, Gupta A, Beck CR, Tejomurtula A, Campbell IM, Gambin T, Simmons AD, Withers MA, Harris RA, Rogers J, Schwartz DC, Lupski JR. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates. PLoS Genet 2015; 11:e1005686. [PMID: 26641089 PMCID: PMC4671654 DOI: 10.1371/journal.pgen.1005686] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Accepted: 10/29/2015] [Indexed: 11/30/2022] Open
Abstract
Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. Genomic instability due to the intrinsic sequence architecture of the genome, such as low copy repeats (LCRs), is a major contributor to de novo mutations that can occur in the process of human genome evolution. 
LCRs can mediate genomic rearrangements associated with genomic disorders by acting as substrates for nonallelic homologous recombination. Juvenile-onset nephronophthisis 1 is the most frequent genetic cause of renal failure in children. An LCR-mediated, homozygous common recurrent deletion encompassing NPHP1 is found in the majority of affected subjects, while heterozygous deletion representing the nephronophthisis 1 recessive carrier state is frequently observed amongst world populations. Interestingly, the human NPHP1 locus is located proximal to the head-to-head fusion site of two ancestral chromosomes that occurred in the great apes, which resulted in a reduction of chromosome number from 48 in nonhuman primates to the current 46 in humans. In this study, we characterized and provided evidence for the diverse genomic architecture at the NPHP1 locus and potential structural variant haplotypes in the human population. Furthermore, our analyses of primate genomes shed light on the massive changes of genomic architecture at the human NPHP1 locus and delineated a model for the emergence of the LCRs during primate evolution.
Affiliation(s)
- Bo Yuan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Aditya Gupta
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and The UW-Biotechnology Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Christine R. Beck
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Anusha Tejomurtula
- Graduate Program in Diagnostic Genetics, School of Health Professions, University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America
| | - Ian M. Campbell
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Tomasz Gambin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Alexandra D. Simmons
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Marjorie A. Withers
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - R. Alan Harris
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
| | - Jeffrey Rogers
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
| | - David C. Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and The UW-Biotechnology Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas, United States of America
- Texas Children’s Hospital, Houston, Texas, United States of America
|
34
|
Mendelowitz LM, Schwartz DC, Pop M. Maligner: a fast ordered restriction map aligner. Bioinformatics 2015; 32:1016-22. [PMID: 26637292 DOI: 10.1093/bioinformatics/btv711] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Accepted: 12/01/2015] [Indexed: 12/28/2022] Open
Abstract
MOTIVATION The Optical Mapping System discovers structural variants and potentiates sequence assembly of genomes via scaffolding and comparisons that globally validate or correct sequence assemblies. Despite its utility, there are few publicly available tools for aligning optical mapping datasets. RESULTS Here we present software, named 'Maligner', for the alignment of both single molecule restriction maps (Rmaps) and in silico restriction maps of sequence contigs to a reference. Maligner provides two modes of alignment: an efficient, sensitive dynamic programming implementation that scales to large eukaryotic genomes, and a faster index-based implementation for finding alignments with unmatched sites in the reference but not the query. We compare our software to other publicly available tools on Rmap datasets and show that Maligner finds more correct alignments in comparable runtime. Lastly, we introduce the M-Score statistic for normalizing alignment scores across restriction maps and demonstrate its utility for selecting high quality alignments. AVAILABILITY AND IMPLEMENTATION The Maligner software is written in C++ and is available at https://github.com/LeeMendelowitz/maligner under the GNU General Public License. CONTACT mpop@umiacs.umd.edu.
Affiliation(s)
- Lee M Mendelowitz
- Center for Bioinformatics and Computational Biology, Applied Math & Statistics, and Scientific Computation
| | - David C Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, USA and the UW-Biotechnology Center, University of Wisconsin-Madison, WI 53706, USA
| | - Mihai Pop
- Center for Bioinformatics and Computational Biology, Applied Math & Statistics, and Scientific Computation, Department of Computer Science, University of Maryland, College Park, MD 20742, USA
|
35
|
Abstract
Optical Mapping is an established single-molecule, whole-genome analysis system, which has been used to gain a comprehensive understanding of genomic structure and to study structural variation of complex genomes. A critical component of the Optical Mapping system is the image processing module, which extracts single molecule restriction maps from image datasets of immobilized, restriction digested and fluorescently stained large DNA molecules. In this review, we describe robust and efficient image processing techniques to process these massive datasets and extract accurate restriction maps in the presence of noise, ambiguity and confounding artifacts. We also highlight a few applications of the Optical Mapping system.
Affiliation(s)
- Prabu Ravindran
- Laboratory of Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, USA
| | - Aditya Gupta
- Laboratory of Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, USA
|
36
|
Ramaswamy K, Yik WY, Wang XM, Oliphant EN, Lu W, Shibata D, Ryder OA, Hacia JG. Derivation of induced pluripotent stem cells from orangutan skin fibroblasts. BMC Res Notes 2015; 8:577. [PMID: 26475477 PMCID: PMC4609060 DOI: 10.1186/s13104-015-1567-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 10/07/2015] [Indexed: 01/08/2023] Open
Abstract
Background Orangutans are an endangered species whose natural habitats are restricted to the Southeast Asian islands of Borneo and Sumatra. Along with the African great apes, orangutans are among the closest living relatives to humans. For potential species conservation and functional genomics studies, we derived induced pluripotent stem cells (iPSCs) from cryopreserved somatic cells obtained from captive orangutans. Results Primary skin fibroblasts from two Sumatran orangutans were transduced with retroviral vectors expressing the human OCT4, SOX2, KLF4, and c-MYC factors. Candidate orangutan iPSCs were characterized by global gene expression and DNA copy number analysis. All were consistent with pluripotency and provided no evidence of large genomic insertions or deletions. In addition, orangutan iPSCs were capable of producing cells derived from all three germ layers in vitro through embryoid body differentiation assays and in vivo through teratoma formation in immune-compromised mice. Conclusions We demonstrate that orangutan skin fibroblasts are capable of being reprogrammed into iPSCs with hallmark molecular signatures and differentiation potential. We suggest that reprogramming orangutan somatic cells in genome resource banks could provide new opportunities for advancing assisted reproductive technologies relevant for species conservation efforts. Furthermore, orangutan iPSCs could have applications for investigating the phenotypic relevance of genomic changes that occurred in the human, African great ape, and/or orangutan lineages. This provides opportunities for orangutan cell culture models that would otherwise be impossible to develop from living donors due to the invasive nature of the procedures required for obtaining primary cells.
Affiliation(s)
- Krishna Ramaswamy
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
| | - Wing Yan Yik
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
| | - Xiao-Ming Wang
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
| | - Erin N Oliphant
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
| | - Wange Lu
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
| | - Darryl Shibata
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USA.
| | - Oliver A Ryder
- San Diego Zoo Institute for Conservation Research, San Diego Zoo Global, San Diego, CA, USA.
| | - Joseph G Hacia
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA, USA.
|
37
|
Weckselblatt B, Rudd MK. Human Structural Variation: Mechanisms of Chromosome Rearrangements. Trends Genet 2015; 31:587-599. [PMID: 26209074 PMCID: PMC4600437 DOI: 10.1016/j.tig.2015.05.010] [Citation(s) in RCA: 158] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 05/26/2015] [Accepted: 05/27/2015] [Indexed: 01/05/2023]
Abstract
Chromosome structural variation (SV) is a normal part of variation in the human genome, but some classes of SV can cause neurodevelopmental disorders. Analysis of the DNA sequence at SV breakpoints can reveal mutational mechanisms and risk factors for chromosome rearrangement. Large-scale SV breakpoint studies have become possible recently owing to advances in next-generation sequencing (NGS) including whole-genome sequencing (WGS). These findings have shed light on complex forms of SV such as triplications, inverted duplications, insertional translocations, and chromothripsis. Sequence-level breakpoint data resolve SV structure and determine how genes are disrupted, fused, and/or misregulated by breakpoints. Recent improvements in breakpoint sequencing have also revealed non-allelic homologous recombination (NAHR) between paralogous long interspersed nuclear element (LINE) or human endogenous retrovirus (HERV) repeats as a cause of deletions, duplications, and translocations. This review covers the genomic organization of simple and complex constitutional SVs, as well as the molecular mechanisms of their formation.
Affiliation(s)
- Brooke Weckselblatt
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - M Katharine Rudd
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA.
|
38
|
Single-molecule analysis reveals widespread structural variation in multiple myeloma. Proc Natl Acad Sci U S A 2015; 112:7689-94. [PMID: 26056298 DOI: 10.1073/pnas.1418577112] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Multiple myeloma (MM), a malignancy of plasma cells, is characterized by widespread genomic heterogeneity and, consequently, differences in disease progression and drug response. Although recent large-scale sequencing studies have greatly improved our understanding of MM genomes, our knowledge about genomic structural variation in MM is attenuated due to the limitations of commonly used sequencing approaches. In this study, we present the application of optical mapping, a single-molecule, whole-genome analysis system, to discover new structural variants in a primary MM genome. Through our analysis, we have identified and characterized widespread structural variation in this tumor genome. Additionally, we describe our efforts toward comprehensive characterization of genome structure and variation by integrating our findings from optical mapping with those from DNA sequencing-based genomic analysis. Finally, by studying this MM genome at two time points during tumor progression, we have demonstrated an increase in mutational burden with tumor progression at all length scales of variation.
|
39
|
Iyer J, Girirajan S. Gene discovery and functional assessment of rare copy-number variants in neurodevelopmental disorders. Brief Funct Genomics 2015; 14:315-28. [DOI: 10.1093/bfgp/elv018] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
|
40
|
Geng J, Picker J, Zheng Z, Zhang X, Wang J, Hisama F, Brown DW, Mullen MP, Harris D, Stoler J, Seman A, Miller DT, Fu Q, Roberts AE, Shen Y. Chromosome microarray testing for patients with congenital heart defects reveals novel disease causing loci and high diagnostic yield. BMC Genomics 2014; 15:1127. [PMID: 25516202 PMCID: PMC4378009 DOI: 10.1186/1471-2164-15-1127] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2014] [Accepted: 12/11/2014] [Indexed: 11/29/2022] Open
Abstract
Background Congenital heart defects (CHD), as the most common congenital anomaly, have been reported to be frequently associated with pathogenic copy number variants (CNVs). Currently, patients with CHD are routinely offered chromosomal microarray (CMA) testing, but the diagnostic yield of CMA on CHD patients has not been extensively evaluated based on a large patient cohort. In this study, we retrospectively assessed the detected CNVs in a total of 514 CHD cases (a 422-case clinical cohort from Boston Children's Hospital (BCH) and a 92-case research cohort from Shanghai Children’s Medical Center (SCMC)) and conducted a genotype-phenotype analysis. Furthermore, genes encompassed in pathogenic/likely pathogenic CNVs were prioritized by integrating several tools and public data sources for novel CHD candidate gene identification. Results Based on the BCH cohort, the overall diagnostic yield of CMA testing for CHD patients was 12.8% (pathogenic CNVs) to 18.5% (pathogenic and likely pathogenic CNVs). The diagnostic yield of CMA for syndromic CHD was 14.1-20.6% (excluding aneuploidy cases), whereas the diagnostic yield for isolated CHD was 4.3-9.3%. Four recurrent genomic loci (4q terminal region, 15q11.2, 16p12.2 and Yp11.2) were more significantly enriched in cases than in controls. These regions are considered novel CHD loci. We further identified 20 genes as the most likely novel CHD candidate genes through gene prioritization analysis. Conclusion The high clinical diagnostic yield of CMA in this study provides supportive evidence for CMA as the first-line genetic diagnostic tool for CHD patients. The CNVs detected in our study suggest a number of CHD candidate genes that warrant further investigation.
Affiliation(s)
- Qihua Fu
- Department of Laboratory Medicine, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai 200127, China.
|
41
|
Vergés L, Molina O, Geán E, Vidal F, Blanco J. Deletions and duplications of the 22q11.2 region in spermatozoa from DiGeorge/velocardiofacial fathers. Mol Cytogenet 2014; 7:86. [PMID: 25435913 PMCID: PMC4247602 DOI: 10.1186/s13039-014-0086-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2014] [Accepted: 11/04/2014] [Indexed: 11/12/2022] Open
Abstract
Background DiGeorge/velocardiofacial syndrome (DGS/VCFS) is the most common deletion syndrome in humans. Low copy repeats flanking the 22q11.2 region confer a substrate for non-allelic homologous recombination (NAHR) events leading to rearrangements. This study sought to identify DGS/VCFS fathers with increased susceptibility to deletions and duplications at the 22q11.2 region in spermatozoa and to assess the particular contribution of intra-chromatid and/or inter-chromatid NAHR. Semen samples from nine DGS/VCFS fathers were analyzed by triple-color FISH using a probe combination that discriminated between normal, deleted and duplicated genotypes. Microsatellite analyses were performed in the parents and the affected children to determine the parental origin of the deleted chromosome 22. Results A significant increase in 22q11.2 deletions was observed in the sperm of two out of nine DGS/VCFS fathers (odds ratio 2.03-fold, P < 0.01), and in both cases the deletion in the offspring was transmitted by the father. Patients with significant increases in sperm anomalies presented a disturbed deletion:duplication 1:1 ratio (P < 0.01). Conclusions Altogether, the results support intra-chromatid NAHR as the mechanism responsible for the higher rate of sperm deletions, which is directly related to the transmission of the deleted chromosome 22 to offspring. Accordingly, the screening of sperm anomalies in the 22q11.2 region should be taken into account in the genetic counseling of DGS/VCFS families.
Affiliation(s)
- Laia Vergés
- Unitat de Biologia Cellular (Facultat de Biociències). Universitat Autònoma de Barcelona, 08193-Bellaterra, Cerdanyola del Vallès, Spain
| | - Oscar Molina
- Unitat de Biologia Cellular (Facultat de Biociències). Universitat Autònoma de Barcelona, 08193-Bellaterra, Cerdanyola del Vallès, Spain; Current address: Wellcome Trust Center for Cell Biology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Esther Geán
- Secció de Genètica Clínica. Hospital Universitari Sant Joan de Déu, 08950-Esplugues de Llobregat, Barcelona, Spain
| | - Francesca Vidal
- Unitat de Biologia Cellular (Facultat de Biociències). Universitat Autònoma de Barcelona, 08193-Bellaterra, Cerdanyola del Vallès, Spain
| | - Joan Blanco
- Unitat de Biologia Cellular (Facultat de Biociències). Universitat Autònoma de Barcelona, 08193-Bellaterra, Cerdanyola del Vallès, Spain
| |
|
42
|
Abstract
PURPOSE OF REVIEW Recent studies clearly demonstrate that copy number variations (CNVs) are widespread in our genome and play an important role in human genetic variation, accounting for both human population diversity and human genetic disease. This review will discuss the most current knowledge regarding our understanding of the biology of CNVs in relation to human genetic disease. RECENT FINDINGS CNVs associated with human genetic disease can be either recurrent, with a common size and breakpoint clustering, or nonrecurrent, with different sizes and variable breakpoints. Two types of recurrent CNVs have been distinguished, including the syndromic forms in which the phenotypic features are relatively consistent, and those in which the same recurrent CNV can be associated with a diverse set of diagnoses. Recently, the 'Two-hit model' was used to explain the phenotypic variability associated with the latter group of recurrent CNVs. Nonrecurrent CNVs, on the contrary, occur at a relatively lower frequency at the individual locus level but collectively they are as common as recurrent CNVs. Finally, the study of CNV burden in different diseases demonstrated a clear trend of an increasing CNV burden in diseases with more severe phenotypes. SUMMARY In spite of the advances in the study of the CNV landscape associated with human genetic disease, there still remain many unexplored questions especially regarding the role of CNVs in the pathogenesis of complex human genetic diseases.
|
43
|
Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability. Nat Genet 2014; 46:1293-302. [PMID: 25326701 PMCID: PMC4244265 DOI: 10.1038/ng.3120] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Received: 03/22/2014] [Accepted: 09/25/2014] [Indexed: 12/14/2022]
Abstract
Recurrent deletions of chromosome 15q13.3 associate with intellectual disability, schizophrenia, autism and epilepsy. To gain insight into its instability, we sequenced the region in patients, normal individuals and nonhuman primates. We discovered five structural configurations of the human chromosome 15q13.3 region ranging in size from 2 to 3 Mbp. These configurations arose recently (~0.5–0.9 million years ago) as a result of human-specific expansions of segmental duplications and two independent inversion events. All inversion breakpoints map near GOLGA8 core duplicons—a ~14 kbp primate-specific chromosome 15 repeat that became organized into larger palindromic structures. GOLGA8-flanked palindromes also demarcate the breakpoints of recurrent 15q13.3 microdeletions, the expansion of chromosome 15 segmental duplications in the human lineage, and independent structural changes in apes. The significant clustering (p=0.002) of breakpoints provides mechanistic evidence for the role of this core duplicon and its palindromic architecture in promoting evolutionary and disease-related instability of chromosome 15.
|
44
|
Giannuzzi G, Migliavacca E, Reymond A. Novel H3K4me3 marks are enriched at human- and chimpanzee-specific cytogenetic structures. Genome Res 2014; 24:1455-68. [PMID: 24916972 PMCID: PMC4158755 DOI: 10.1101/gr.167742.113] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 01/26/2023]
Abstract
Human and chimpanzee genomes are 98.8% identical within comparable sequences. However, they differ structurally in nine pericentric inversions, one fusion that originated human chromosome 2, and content and localization of heterochromatin and lineage-specific segmental duplications. The possible functional consequences of these cytogenetic and structural differences are not fully understood and their possible involvement in speciation remains unclear. We show that subtelomeric regions—regions that have a species-specific organization, are more divergent in sequence, and are enriched in genes and recombination hotspots—are significantly enriched for species-specific histone modifications that decorate transcription start sites in different tissues in both human and chimpanzee. The human lineage-specific chromosome 2 fusion point and ancestral centromere locus as well as chromosome 1 and 18 pericentric inversion breakpoints showed enrichment of human-specific H3K4me3 peaks in the prefrontal cortex. Our results reveal an association between plastic regions and potential novel regulatory elements.
Affiliation(s)
- Giuliana Giannuzzi
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland;
| | - Eugenia Migliavacca
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland;
| |
|
45
|
Maya I, Vinkler C, Konen O, Kornreich L, Steinberg T, Yeshaya J, Latarowski V, Shohat M, Lev D, Baris HN. Abnormal brain magnetic resonance imaging in two patients with Smith-Magenis syndrome. Am J Med Genet A 2014; 164A:1940-6. [PMID: 24788350 DOI: 10.1002/ajmg.a.36583] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Received: 03/05/2014] [Accepted: 03/28/2014] [Indexed: 11/06/2022]
Abstract
Smith-Magenis syndrome (SMS) is a clinically recognizable contiguous gene syndrome ascribed to an interstitial deletion in chromosome 17p11.2. Seventy percent of SMS patients have a common deletion interval spanning 3.5 megabases (Mb). Clinical features of SMS include characteristic mild dysmorphic features, ocular anomalies, short stature, brachydactyly, and hypotonia. SMS patients have a unique neurobehavioral phenotype that includes intellectual disability, self-injurious behavior and severe sleep disturbance. Little has been reported in the medical literature about anatomical brain anomalies in patients with SMS. Here we describe two patients with SMS caused by the common deletion in 17p11.2 diagnosed using chromosomal microarray (CMA). Both patients had a typical clinical presentation and abnormal brain magnetic resonance imaging (MRI) findings. One patient had subependymal periventricular gray matter heterotopia, and the second had a thin corpus callosum, a thin brain stem and hypoplasia of the cerebellar vermis. This report discusses the possible abnormal MRI images in SMS and reviews the literature on brain malformations in SMS. Finally, although structural brain malformations in SMS patients are not a common feature, we suggest baseline routine brain imaging in patients with SMS in particular, and in patients with chromosomal microdeletion/microduplication syndromes in general. Structural brain malformations in these patients may affect the decision-making process regarding their management.
Affiliation(s)
- Idit Maya
- The Raphael Recanati Genetic Institute, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel
| |
|
46
|
Watson CT, Marques-Bonet T, Sharp AJ, Mefford HC. The genetics of microdeletion and microduplication syndromes: an update. Annu Rev Genomics Hum Genet 2014; 15:215-244. [PMID: 24773319 DOI: 10.1146/annurev-genom-091212-153408] [Citation(s) in RCA: 115] [Impact Index Per Article: 11.5] [Indexed: 12/19/2022]
Abstract
Chromosomal abnormalities, including microdeletions and microduplications, have long been associated with abnormal developmental outcomes. Early discoveries relied on a common clinical presentation and the ability to detect chromosomal abnormalities by standard karyotype analysis or specific assays such as fluorescence in situ hybridization. Over the past decade, the development of novel genomic technologies has allowed more comprehensive, unbiased discovery of microdeletions and microduplications throughout the human genome. The ability to quickly interrogate large cohorts using chromosome microarrays and, more recently, next-generation sequencing has led to the rapid discovery of novel microdeletions and microduplications associated with disease, including very rare but clinically significant rearrangements. In addition, the observation that some microdeletions are associated with risk for several neurodevelopmental disorders contributes to our understanding of shared genetic susceptibility for such disorders. Here, we review current knowledge of microdeletion/duplication syndromes, with a particular focus on recurrent rearrangement syndromes.
Affiliation(s)
- Corey T Watson
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Tomas Marques-Bonet
- Institut de Biologia Evolutiva, Universitat Pompeu Fabra/CSIC, 08003 Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain.,Centro Nacional de Análisis Genómico, 08023 Barcelona, Spain
| | - Andrew J Sharp
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Heather C Mefford
- Department of Pediatrics, University of Washington, Seattle, Washington 98195
| |
|
47
|
Rees E, Walters JT, Chambert KD, O'Dushlaine C, Szatkiewicz J, Richards AL, Georgieva L, Mahoney-Davies G, Legge SE, Moran JL, Genovese G, Levinson D, Morris DW, Cormican P, Kendler KS, O'Neill FA, Riley B, Gill M, Corvin A, Sklar P, Hultman C, Pato C, Pato M, Sullivan PF, Gejman PV, McCarroll SA, O'Donovan MC, Owen MJ, Kirov G. CNV analysis in a large schizophrenia sample implicates deletions at 16p12.1 and SLC1A1 and duplications at 1p36.33 and CGNL1. Hum Mol Genet 2014; 23:1669-76. [PMID: 24163246 PMCID: PMC3929090 DOI: 10.1093/hmg/ddt540] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Received: 07/26/2013] [Revised: 09/26/2013] [Accepted: 10/24/2013] [Indexed: 12/29/2022] Open
Abstract
Large and rare copy number variants (CNVs) at several loci have been shown to increase risk for schizophrenia. Aiming to discover novel susceptibility CNV loci, we analyzed 6882 cases and 11 255 controls genotyped on Illumina arrays, most of which have not been used for this purpose before. We identified genes enriched for rare exonic CNVs among cases, and then attempted to replicate the findings in additional 14 568 cases and 15 274 controls. In a combined analysis of all samples, 12 distinct loci were enriched among cases with nominal levels of significance (P < 0.05); however, none would survive correction for multiple testing. These loci include recurrent deletions at 16p12.1, a locus previously associated with neurodevelopmental disorders (P = 0.0084 in the discovery sample and P = 0.023 in the replication sample). Other plausible candidates include non-recurrent deletions at the glutamate transporter gene SLC1A1, a CNV locus recently suggested to be involved in schizophrenia through linkage analysis, and duplications at 1p36.33 and CGNL1. A burden analysis of large (>500 kb), rare CNVs showed a 1.2% excess in cases after excluding known schizophrenia-associated loci, suggesting that additional susceptibility loci exist. However, even larger samples are required for their discovery.
Affiliation(s)
- Elliott Rees
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - James T.R. Walters
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Kimberly D. Chambert
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA,
| | - Colm O'Dushlaine
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA,
| | - Jin Szatkiewicz
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA,
| | - Alexander L. Richards
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Lyudmila Georgieva
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Gerwyn Mahoney-Davies
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Sophie E. Legge
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Jennifer L. Moran
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA,
| | - Giulio Genovese
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA,
| | - Douglas Levinson
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, USA,
| | - Derek W. Morris
- Department of Psychiatry and Neuropsychiatric Genetics Research Group, Institute of Molecular Medicine, Trinity College Dublin, Dublin 2, Ireland,
| | - Paul Cormican
- Department of Psychiatry and Neuropsychiatric Genetics Research Group, Institute of Molecular Medicine, Trinity College Dublin, Dublin 2, Ireland,
| | - Kenneth S. Kendler
- Department of Psychiatry and Human Genetics, Virginia Institute of Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA,
| | - Francis A. O'Neill
- Department of Psychiatry, Queen's University, Belfast BT7 1NN, Northern Ireland,
| | - Brien Riley
- Department of Psychiatry and Human Genetics, Virginia Institute of Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA,
| | - Michael Gill
- Department of Psychiatry and Neuropsychiatric Genetics Research Group, Institute of Molecular Medicine, Trinity College Dublin, Dublin 2, Ireland,
| | - Aiden Corvin
- Department of Psychiatry and Neuropsychiatric Genetics Research Group, Institute of Molecular Medicine, Trinity College Dublin, Dublin 2, Ireland,
| | - Pamela Sklar
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, NY, USA,
| | - Christina Hultman
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,
| | - Carlos Pato
- Department of Psychiatry and Behavioral Science, Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, CA 90033-0121, USA,
| | - Michele Pato
- Department of Psychiatry and Behavioral Science, Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, CA 90033-0121, USA,
| | - Patrick F. Sullivan
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA,
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA,
| | - Pablo V. Gejman
- Department of Psychiatry and Behavioral Sciences, NorthShore University HealthSystem, Evanston, IL 60201, USA and
- Department of Psychiatry and Behavioral Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Steven A. McCarroll
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA,
| | - Michael C. O'Donovan
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - Michael J. Owen
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| | - George Kirov
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff CF24 4HQ, UK,
| |
|
48
|
Male infertility and copy number variants (CNVs) in the dog: a two-pronged approach using Computer Assisted Sperm Analysis (CASA) and Fluorescent In Situ Hybridization (FISH). BMC Genomics 2013; 14:921. [PMID: 24373333 PMCID: PMC3922845 DOI: 10.1186/1471-2164-14-921] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Received: 07/17/2013] [Accepted: 12/10/2013] [Indexed: 12/31/2022] Open
Abstract
Background Infertility affects ~10-15% of couples trying to have children, in which the rate of male fertility problems is approximately 30-50%. Copy number variations (CNVs) are DNA sequences greater than or equal to 1 kb in length sharing a high level of similarity, and present at a variable number of copies in the genome; in our study, we used the canine species as an animal model to detect CNVs responsible for male infertility. We aim to identify CNVs associated with male infertility in the dog genome with a two-pronged approach: we performed a sperm analysis using the CASA system and a cytogenetic-targeted analysis on genes involved in male gonad development and spermatogenesis with fluorescence in situ hybridization (FISH), using dog-specific clones. This analysis was carried out to evaluate possible correlations between CNVs on targeted genes and spermatogenesis impairments or infertility factors. Results We identified two genomic regions hybridized by BACs CH82-321J09 and CH82-509B23 showing duplication patterns in all samples except for an azoospermic dog. These two regions harbor two important genes for spermatogenesis: DNM2 and TEKT1. The genomic region encompassed by the BAC clone CH82-324I01 showed a single-copy pattern in all samples except for one dog, assessed with low-quality sperm, displaying a marked duplication pattern. This genomic region harbors SOX8, a key gene for testis development. Conclusion We present the first study involving functional and genetic analyses in male infertility. We set up an extremely reliable analysis on dog sperm cells with a highly consistent statistical significance, and we succeeded in conducting FISH experiments on sperm cells using BAC clones as probes. We found copy number differences in infertile compared with fertile dogs for genomic regions encompassing TEKT1, DNM2, and SOX8, suggesting those genes could have a role if deleted or duplicated with respect to the reference copy number in fertility biology.
This method is of particular interest in the dog due to the recognized role of this species as an animal model for the study of human genetic diseases and could be useful for other species of economic interest and for endangered animal species.
|
49
|
Kounovsky-Shafer KL, Hernández-Ortiz JP, Jo K, Odijk T, de Pablo JJ, Schwartz DC. Presentation of large DNA molecules for analysis as nanoconfined dumbbells. Macromolecules 2013; 46:8356-8368. [PMID: 24683272 PMCID: PMC3964590 DOI: 10.1021/ma400926h] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Indexed: 02/07/2023]
Abstract
The analysis of very large DNA molecules intrinsically supports long-range, phased sequence information, but requires new approaches for their effective presentation as part of any genome analysis platform. Using a multi-pronged approach that marshaled molecular confinement, ionic environment, and DNA elastic properties, buttressed by molecular simulations, we have developed an efficient and scalable approach for presentation of large DNA molecules within nanoscale slits. Our approach relies on the formation of DNA dumbbells, where large segments of the molecules remain outside the nanoslits used to confine them. The low ionic environment, synergizing other features of our approach, enables DNA molecules to adopt a fully stretched conformation, comparable to the contour length, thereby facilitating analysis by optical microscopy. Accordingly, a molecular model is proposed to describe the conformation and dynamics of the DNA molecules within the nanoslits; a Langevin description of the polymer dynamics is adopted in which hydrodynamic effects are included through a Green's function formalism. Our simulations reveal that a delicate balance between electrostatic and hydrodynamic interactions is responsible for the observed molecular conformations. We demonstrate and further confirm that the "Odijk regime" does indeed start when the confinement dimensions are of the same order of magnitude as the persistence length of the molecule. We also summarize current theories concerning dumbbell dynamics.
Affiliation(s)
- Kristy L. Kounovsky-Shafer
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and UW-Biotechnology Center, University of Wisconsin-Madison, Madison, WI 53706-1580
| | - Juan P. Hernández-Ortiz
- Departamento de Materiales, Universidad Nacional de Colombia, Sede Medellín, Kra 80 # 65-223, Bloque M3-050, Medellín, Colombia
| | - Kyubong Jo
- Department of Chemistry, Sogang University, Seoul, Korea
| | - Theo Odijk
- Lorentz Institute for Theoretical Physics, University of Leiden, The Netherlands
| | - Juan J. de Pablo
- Institute for Molecular Engineering, University of Chicago, Chicago, IL 60637
| | - David C. Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and UW-Biotechnology Center, University of Wisconsin-Madison, Madison, WI 53706-1580
| |
|
50
|
Dumont BL, Eichler EE. Signals of historical interlocus gene conversion in human segmental duplications. PLoS One 2013; 8:e75949. [PMID: 24124524 PMCID: PMC3790853 DOI: 10.1371/journal.pone.0075949] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Received: 06/11/2013] [Accepted: 08/17/2013] [Indexed: 12/04/2022] Open
Abstract
Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i) a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii) the alignment-based method implemented in the GENECONV program. One-quarter (25.4%) of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.
Affiliation(s)
- Beth L. Dumont
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
| |
|