1
|
Guarracino A, Buonaiuto S, de Lima LG, Potapova T, Rhie A, Koren S, Rubinstein B, Fischer C, Gerton JL, Phillippy AM, Colonna V, Garrison E. Recombination between heterologous human acrocentric chromosomes. Nature 2023; 617:335-343. [PMID: 37165241 PMCID: PMC10172130 DOI: 10.1038/s41586-023-05976-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 03/17/2023] [Indexed: 05/12/2023]
Abstract
The short arms of the human acrocentric chromosomes 13, 14, 15, 21 and 22 (SAACs) share large homologous regions, including ribosomal DNA repeats and extended segmental duplications1,2. Although the resolution of these regions in the first complete assembly of a human genome-the Telomere-to-Telomere Consortium's CHM13 assembly (T2T-CHM13)-provided a model of their homology3, it remained unclear whether these patterns were ancestral or maintained by ongoing recombination exchange. Here we show that acrocentric chromosomes contain pseudo-homologous regions (PHRs) indicative of recombination between non-homologous sequences. Utilizing an all-to-all comparison of the human pangenome from the Human Pangenome Reference Consortium4 (HPRC), we find that contigs from all of the SAACs form a community. A variation graph5 constructed from centromere-spanning acrocentric contigs indicates the presence of regions in which most contigs appear nearly identical between heterologous acrocentric chromosomes in T2T-CHM13. Except on chromosome 15, we observe faster decay of linkage disequilibrium in the pseudo-homologous regions than in the corresponding short and long arms, indicating higher rates of recombination6,7. The pseudo-homologous regions include sequences that have previously been shown to lie at the breakpoint of Robertsonian translocations8, and their arrangement is compatible with crossover in inverted duplications on chromosomes 13, 14 and 21. The ubiquity of signals of recombination between heterologous acrocentric chromosomes seen in the HPRC draft pangenome suggests that these shared sequences form the basis for recurrent Robertsonian translocations, providing sequence and population-based confirmation of hypotheses first developed from cytogenetic studies 50 years ago9.
Collapse
Affiliation(s)
- Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
| | - Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | | | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | | | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Vincenza Colonna
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA.
| |
Collapse
|
2
|
Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, Vollger MR, Altemose N, Uralsky L, Gershman A, Aganezov S, Hoyt SJ, Diekhans M, Logsdon GA, Alonge M, Antonarakis SE, Borchers M, Bouffard GG, Brooks SY, Caldas GV, Chen NC, Cheng H, Chin CS, Chow W, de Lima LG, Dishuck PC, Durbin R, Dvorkina T, Fiddes IT, Formenti G, Fulton RS, Fungtammasan A, Garrison E, Grady PG, Graves-Lindsay TA, Hall IM, Hansen NF, Hartley GA, Haukness M, Howe K, Hunkapiller MW, Jain C, Jain M, Jarvis ED, Kerpedjiev P, Kirsche M, Kolmogorov M, Korlach J, Kremitzki M, Li H, Maduro VV, Marschall T, McCartney AM, McDaniel J, Miller DE, Mullikin JC, Myers EW, Olson ND, Paten B, Peluso P, Pevzner PA, Porubsky D, Potapova T, Rogaev EI, Rosenfeld JA, Salzberg SL, Schneider VA, Sedlazeck FJ, Shafin K, Shew CJ, Shumate A, Sims Y, Smit AFA, Soto DC, Sović I, Storer JM, Streets A, Sullivan BA, Thibaud-Nissen F, Torrance J, Wagner J, Walenz BP, Wenger A, Wood JMD, Xiao C, Yan SM, Young AC, Zarate S, Surti U, McCoy RC, Dennis MY, Alexandrov IA, Gerton JL, O’Neill RJ, Timp W, Zook JM, Schatz MC, Eichler EE, Miga KH, Phillippy AM. The complete sequence of a human genome. Science 2022; 376:44-53. [PMID: 35357919 PMCID: PMC9186530 DOI: 10.1126/science.abj6987] [Citation(s) in RCA: 1143] [Impact Index Per Article: 571.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion-base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, corrects errors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies.
Collapse
Affiliation(s)
- Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Andrey V. Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego; La Jolla, CA, USA
| | - Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
| | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Nicolas Altemose
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA, USA
| | - Lev Uralsky
- Sirius University of Science and Technology; Sochi, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
| | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University; Baltimore, MD, USA
| | - Sergey Aganezov
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Savannah J. Hoyt
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Glennis A. Logsdon
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Michael Alonge
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | | | | | - Gerard G. Bouffard
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Shelise Y. Brooks
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Gina V. Caldas
- Department of Molecular and Cell Biology, University of California, Berkeley; Berkeley, CA, USA
| | - Nae-Chyun Chen
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute; Boston, MA
- Department of Biomedical Informatics, Harvard Medical School; Boston, MA
| | | | | | | | - Philip C. Dishuck
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Richard Durbin
- Wellcome Sanger Institute; Cambridge, UK
- Department of Genetics, University of Cambridge; Cambridge, UK
| | - Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
| | | | - Giulio Formenti
- Laboratory of Neurogenetics of Language and The Vertebrate Genome Lab, The Rockefeller University; New York, NY, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | - Robert S. Fulton
- Department of Genetics, Washington University School of Medicine; St. Louis, MO, USA
| | | | - Erik Garrison
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
- University of Tennessee Health Science Center; Memphis, TN, USA
| | - Patrick G.S. Grady
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | | | - Ira M. Hall
- Department of Genetics, Yale University School of Medicine; New Haven, CT, USA
| | - Nancy F. Hansen
- Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Gabrielle A. Hartley
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Marina Haukness
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | | | | | - Chirag Jain
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
- Department of Computational and Data Sciences, Indian Institute of Science; Bangalore KA, India
| | - Miten Jain
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Erich D. Jarvis
- Laboratory of Neurogenetics of Language and The Vertebrate Genome Lab, The Rockefeller University; New York, NY, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | | | - Melanie Kirsche
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Mikhail Kolmogorov
- Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA, USA
| | | | - Milinn Kremitzki
- McDonnell Genome Institute, Washington University in St. Louis; St. Louis, MO, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute; Boston, MA
- Department of Biomedical Informatics, Harvard Medical School; Boston, MA
| | - Valerie V. Maduro
- Undiagnosed Diseases Program, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Tobias Marschall
- Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics; Düsseldorf, Germany
| | - Ann M. McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Jennifer McDaniel
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Danny E. Miller
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
- Department of Pediatrics, Division of Genetic Medicine, University of Washington and Seattle Children’s Hospital; Seattle, WA, USA
| | - James C. Mullikin
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
- Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Eugene W. Myers
- Max-Planck Institute of Molecular Cell Biology and Genetics; Dresden, Germany
| | - Nathan D. Olson
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | | | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Tamara Potapova
- Stowers Institute for Medical Research; Kansas City, MO, USA
| | - Evgeny I. Rogaev
- Sirius University of Science and Technology; Sochi, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
- Department of Psychiatry, University of Massachusetts Medical School; Worcester, MA, USA
- Faculty of Biology, Lomonosov Moscow State University; Moscow, Russia
| | | | - Steven L. Salzberg
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | - Fritz J. Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine; Houston TX, USA
| | - Kishwar Shafin
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Colin J. Shew
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; CA, USA
| | - Alaina Shumate
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Ying Sims
- Wellcome Sanger Institute; Cambridge, UK
| | | | - Daniela C. Soto
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; CA, USA
| | - Ivan Sović
- Pacific Biosciences; Menlo Park, CA, USA
- Digital BioLogic d.o.o.; Ivanić-Grad, Croatia
| | | | - Aaron Streets
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA, USA
- Chan Zuckerberg Biohub; San Francisco, CA, USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine; Durham, NC, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | | | - Justin Wagner
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Brian P. Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | | | | | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | - Stephanie M. Yan
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Alice C. Young
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Samantha Zarate
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh; Pittsburgh, PA, USA
| | - Rajiv C. McCoy
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; CA, USA
| | - Ivan A. Alexandrov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
- Research Center of Biotechnology of the Russian Academy of Sciences; Moscow, Russia
| | - Jennifer L. Gerton
- Stowers Institute for Medical Research; Kansas City, MO, USA
- Department of Biochemistry and Molecular Biology, University of Kansas Medical School; Kansas City, MO, USA
| | - Rachel J. O’Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University; Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Justin M. Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| |
Collapse
|
3
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
4
|
Miga KH. Centromere studies in the era of 'telomere-to-telomere' genomics. Exp Cell Res 2020; 394:112127. [PMID: 32504677 DOI: 10.1016/j.yexcr.2020.112127] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 05/23/2020] [Accepted: 05/30/2020] [Indexed: 12/17/2022]
Abstract
We are entering into an exciting era of genomics where truly complete, high-quality assemblies of human chromosomes are available end-to-end, or from 'telomere-to-telomere' (T2T). This technological advance offers a new opportunity to include endogenous human centromeric regions in high-resolution, sequence-based studies. These emerging reference maps are expected to reveal a new functional landscape in the human genome, where centromere proteins, transcriptional regulation, and spatial organization can be examined with base-level resolution across different stages of development and disease. Such studies will depend on innovative assembly methods of extremely long tandem repeats (ETRs), or satellite DNAs, paired with the development of new, orthogonal validation methods to ensure accuracy and completeness. This review reflects the progress in centromere genomics, credited by recent advancements in long-read sequencing and assembly methods. In doing so, I will discuss the challenges that remain and the promise for a new period of scientific discovery for satellite DNA biology and centromere function.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, CA, 95064, USA.
| |
Collapse
|
5
|
Centromere Repeats: Hidden Gems of the Genome. Genes (Basel) 2019; 10:genes10030223. [PMID: 30884847 PMCID: PMC6471113 DOI: 10.3390/genes10030223] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 03/07/2019] [Accepted: 03/11/2019] [Indexed: 01/08/2023] Open
Abstract
Satellite DNAs are now regarded as powerful and active contributors to genomic and chromosomal evolution. Paired with mobile transposable elements, these repetitive sequences provide a dynamic mechanism through which novel karyotypic modifications and chromosomal rearrangements may occur. In this review, we discuss the regulatory activity of satellite DNA and their neighboring transposable elements in a chromosomal context with a particular emphasis on the integral role of both in centromere function. In addition, we discuss the varied mechanisms by which centromeric repeats have endured evolutionary processes, producing a novel, species-specific centromeric landscape despite sharing a ubiquitously conserved function. Finally, we highlight the role these repetitive elements play in the establishment and functionality of de novo centromeres and chromosomal breakpoints that underpin karyotypic variation. By emphasizing these unique activities of satellite DNAs and transposable elements, we hope to disparage the conventional exemplification of repetitive DNA in the historically-associated context of ‘junk’.
Collapse
|
6
|
McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res 2018; 26:115-138. [PMID: 29974361 DOI: 10.1007/s10577-018-9582-3] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/14/2018] [Indexed: 02/05/2023]
Abstract
Repetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.
Collapse
Affiliation(s)
- Shannon M McNulty
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA. .,Division of Human Genetics, Duke University Medical Center, Durham, NC, 27710, USA.
| |
Collapse
|
7
|
Sujiwattanarat P, Thapana W, Srikulnath K, Hirai Y, Hirai H, Koga A. Higher-order repeat structure in alpha satellite DNA occurs in New World monkeys and is not confined to hominoids. Sci Rep 2015; 5:10315. [PMID: 25974220 PMCID: PMC4431391 DOI: 10.1038/srep10315] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 03/25/2015] [Indexed: 11/17/2022] Open
Abstract
Centromeres usually contain large amounts of tandem repeat DNA. Alpha satellite DNA (AS) is the most abundant tandem repeat DNA found in the centromeres of simian primates. The AS of humans contains sequences organized into higher-order repeat (HOR) structures, which are tandem arrays of larger repeat units consisting of multiple basic repeat units. HOR-carrying AS also occurs in other hominoids, but results reported to date for phylogenetically more remote taxa have been negative. Here we show direct evidence for clear HOR structures in AS of the owl monkey and common marmoset. These monkeys are New World monkey species that are located phylogenetically outside of hominoids. It is currently postulated that the presence of HOR structures in AS is unique to hominoids. Our results suggest that this view must be modified. A plausible explanation is that generation of HOR structures is a general event that occurs occasionally or frequently in primate centromeres, and that, in humans, HOR-carrying AS became predominant in the central region of the centromere. It is often difficult to assemble sequence reads of tandem repeat DNAs into accurate contig sequences; our careful sequencing strategy allowed us to overcome this problem.
Collapse
Affiliation(s)
- Penporn Sujiwattanarat
- 1] Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan [2] Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
| | - Watcharaporn Thapana
- 1] Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan [2] Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
| | | | - Yuriko Hirai
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| | - Hirohisa Hirai
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| | - Akihiko Koga
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| |
Collapse
|
8
|
Paar V, Basar I, Rosandić M, Glunčić M. Consensus higher order repeats and frequency of string distributions in human genome. Curr Genomics 2007; 8:93-111. [PMID: 18660848 PMCID: PMC2435359 DOI: 10.2174/138920207780368169] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 01/26/2007] [Accepted: 01/30/2007] [Indexed: 02/01/2023] Open
Abstract
Key string algorithm (KSA) could be viewed as robust computational generalization of restriction enzyme method. KSA enables robust and effective identification and structural analyzes of any given genomic sequences, like in the case of NCBI assembly for human genome. We have developed a method, using total frequency distribution of all r-bp key strings in dependence on the fragment length l, to determine the exact size of all repeats within the given genomic sequence, both of monomeric and HOR type. Subsequently, for particular fragment lengths equal to each of these repeat sizes we compute the partial frequency distribution of r-bp key strings; the key string with highest frequency is a dominant key string, optimal for segmentation of a given genomic sequence into repeat units. We illustrate how a wide class of 3-bp key strings leads to a key-string-dependent periodic cell which enables a simple identification and consensus length determinations of HORs, or any other highly convergent repeat of monomeric or HOR type, both tandem or dispersed. We illustrated KSA application for HORs in human genome and determined consensus HORs in the Build 35.1 assembly. In the next step we compute suprachromosomal family classification and CENP-B box / pJalpha distributions for HORs. In the case of less convergent repeats, like for example monomeric alpha satellite (20-40% divergence), we searched for optimal compact key string using frequency method and developed a concept of composite key string (GAAAC--CTTTG) or flexible relaxation (28 bp key string) which provides both monomeric alpha satellites as well as alpha monomer segmentation of internal HOR structure. This method is convenient also for study of R-strand (direct) / S-strand (reverse complement) alpha monomer alternations. Using KSA we identified 16 alternating regions of R-strand and S-strand monomers in one contig in choromosome 7. Use of CENP-B box and/or pJalpha motif as key string is suitable both for identification of HORs and monomeric pattern as well as for studies of CENP-B box / pJalpha distribution. As an example of application of KSA to sequences outside of HOR regions we present our finding of a tandem with highly convergent 3434-bp Long monomer in chromosome 5 (divergence less then 0.3%).
Collapse
Affiliation(s)
- Vladimir Paar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Ivan Basar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Marija Rosandić
- Department of Internal Medicine,
University Hospital Rebro, Kišpatićeva 12, 10000 Zagreb, Croatia
| | - Matko Glunčić
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| |
Collapse
|
9
|
Pellestor F. In situ aneuploidy assessment in human sperm: the use of primed in situ and peptide nucleic acid-fluorescence in situ hybridization techniques. Asian J Androl 2006; 8:387-92. [PMID: 16763713 DOI: 10.1111/j.1745-7262.2006.00137.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
Both the primed in situ (PRINS) and the peptide nucleic acid-fluorescence in situ hybridization (PNA-FISH) techniques constitute alternatives to the conventional (fluorescence in situ hybridization, FISH) procedure for chromosomal investigations. The PRINS reaction is based on the use of a DNA polymerase and labeled nucleotide in an in situ primer extension reaction. Peptide nucleic acid probes are synthetic DNA analogs with uncharged polyamide backbones. The two procedures present several advantages (specificity, rapidity and discriminating ability) that make them very attractive for cytogenetic purposes. Their adaptation to human spermatozoa has allowed the development of new and fast procedures for the chromosomal screening of male gametes and has provided efficient complements to FISH for in situ assessment of aneuploidy in male gametes.
Collapse
Affiliation(s)
- Franck Pellestor
- CNRS UPR 1142, Institute of Human Genetics, Montpellier Cedex 5, France.
| |
Collapse
|
10
|
Kolas NK, Svetlanov A, Lenzi ML, Macaluso FP, Lipkin SM, Liskay RM, Greally J, Edelmann W, Cohen PE. Localization of MMR proteins on meiotic chromosomes in mice indicates distinct functions during prophase I. ACTA ACUST UNITED AC 2005; 171:447-58. [PMID: 16260499 PMCID: PMC2171243 DOI: 10.1083/jcb.200506170] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Mammalian MutL homologues function in DNA mismatch repair (MMR) after replication errors and in meiotic recombination. Both functions are initiated by a heterodimer of MutS homologues specific to either MMR (MSH2-MSH3 or MSH2-MSH6) or crossing over (MSH4-MSH5). Mutations of three of the four MutL homologues (Mlh1, Mlh3, and Pms2) result in meiotic defects. We show herein that two distinct complexes involving MLH3 are formed during murine meiosis. The first is a stable association between MLH3 and MLH1 and is involved in promoting crossing over in conjunction with MSH4-MSH5. The second complex involves MLH3 together with MSH2-MSH3 and localizes to repetitive sequences at centromeres and the Y chromosome. This complex is up-regulated in Pms2-/- males, but not females, providing an explanation for the sexual dimorphism seen in Pms2-/- mice. The association of MLH3 with repetitive DNA sequences is coincident with MSH2-MSH3 and is decreased in Msh2-/- and Msh3-/- mice, suggesting a novel role for the MMR family in the maintenance of repeat unit integrity during mammalian meiosis.
Collapse
Affiliation(s)
- Nadine K Kolas
- Department of Molecular Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Pon J, Juan C, Petitpierre E. Higher-order organization and compartmentalization of satellite DNA PIM357 in species of the coleopteran genus Pimelia. Chromosome Res 2003; 10:597-606. [PMID: 12498348 DOI: 10.1023/a:1020918803675] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The PIM357 satellite DNA family is present in 26 Pimelia taxa (Tenebrionidae, Coleoptera) with endemic congeneric species from the Canary Islands showing higher interrepeat variability than continental ones. In this paper, we compare the repetitive DNA sequences of a Canarian species that has distinct subfamilies of repeat units, P. radula ascendens, with another without such subfamilies, P. sparsa sparsa. The chromosomal localization of the repeat units and the comparison of the variability of randomly cloned monomers to the one estimated by comparing repeat units from dimers and trimers suggest the absence of satellite subfamilies in P. sparsa sparsa. Hence, the repeat units of this species seem to be uniformly and randomly distributed throughout all chromosomes out of one chromosomal pair. On the contrary, P. radula ascendens shows four divergent subfamilies of repeat units supported by several diagnostic nucleotide substitutions. These subfamilies seem to form four distinct repeat units: monomer subfamily 1, monomer subfamily 4 and two higher-order units (dimer linking subfamily 1 and 4, and dimer linking subfamily 2 and 3). Moreover, monomers of subfamily 1 are present in three chromosomal pairs only. We discuss the effect of different potential factors acting in the concerted evolution and the genomic organization of stDNA sequences in these taxa.
Collapse
Affiliation(s)
- Joan Pon
- Laboratori de Genètica, Departament de Biologia, Universitat de les Illes Balears, 07071 Palma de Mallorca, Spain
| | | | | |
Collapse
|
12
|
Schindelhauer D, Schwarz T. Evidence for a fast, intrachromosomal conversion mechanism from mapping of nucleotide variants within a homogeneous alpha-satellite DNA array. Genome Res 2002; 12:1815-26. [PMID: 12466285 PMCID: PMC187568 DOI: 10.1101/gr.451502] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Assuming that patterns of sequence variants within highly homogeneous centromeric tandem repeat arrays can tell us which molecular turnover mechanisms are presently at work, we analyzed the alpha-satellite tandem repeat array DXZ1 of one human X chromosome. Here we present accurate snapshots from this dark matter of the genome. We demonstrate stable and representative cloning of the array in a P1 artificial chromosome (PAC) library, use samples of higher-order repeats subcloned from five unmapped PACs (120-160 kb) to identify common variants, and show that such variants are presently in a fixed transition state. To characterize patterns of variant spread throughout homogeneous array segments, we use a novel partial restriction and pulsed-field gel electrophoresis mapping approach. We find an older large-scale (35-50 kb) duplication event supporting the evolutionarily important unequal crossing-over hypothesis, but generally find independent variant occurrence and a paucity of potential de novo mutations within segments of highest homogeneity (99.1%-99.3%). Within such segments, a highly nonrandom variant clustering within adjacent higher-order repeats was found in the absence of haplotypic repeats. Such variant clusters are hardly explained by interchromosomal, fixation-driving mechanisms and likely reflect a fast, localized, intrachromosomal sequence conversion mechanism.
Collapse
Affiliation(s)
- Dirk Schindelhauer
- Institute of Human Genetics, Technical University of Munich, Munich, Germany.
| | | |
Collapse
|
13
|
Horvath JE, Schwartz S, Eichler EE. The mosaic structure of human pericentromeric DNA: a strategy for characterizing complex regions of the human genome. Genome Res 2000; 10:839-52. [PMID: 10854415 PMCID: PMC310890 DOI: 10.1101/gr.10.6.839] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The pericentromeric regions of human chromosomes pose particular problems for both mapping and sequencing. These difficulties are due, in large part, to the presence of duplicated genomic segments that are distributed among multiple human chromosomes. To ensure contiguity of genomic sequence in these regions, we designed a sequence-based strategy to characterize different pericentromeric regions using a single (162 kb) 2p11 seed sequence as a point of reference. Molecular and cytogenetic techniques were first used to construct a paralogy map that delineated the interchromosomal distribution of duplicated segments throughout the human genome. Monochromosomal hybrid DNAs were PCR amplified by primer pairs designed to the 2p11 reference sequence. The PCR products were directly sequenced and used to develop a catalog of sequence tags for each duplicon for each chromosome. A total of 685 paralogous sequence variants were generated by sequencing 34.7 kb of paralogous pericentromeric sequence. Using PCR products as hybridization probes, we were able to identify 702 human BAC clones, of which a subset, 107 clones, were analyzed at the sequence level. We used diagnostic paralogous sequence variants to assign 65 of these BACs to at least 9 chromosomal pericentromeric regions: 1q12, 2p11, 9p11/q12, 10p11, 14q11, 15q11, 16p11, 17p11, and 22q11. Comparisons with existing sequence and physical maps for the human genome suggest that many of these BACs map to regions of the genome with sequence gaps. Our analysis indicates that large portions of pericentromeric DNA are virtually devoid of unique sequences. Instead, they consist of a mosaic of different genomic segments that have had different propensities for duplication. These biologic properties may be exploited for the rapid characterization of, not only pericentromeric DNA, but also other complex paralogous regions of the human genome.
Collapse
Affiliation(s)
- J E Horvath
- Department of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106 USA
| | | | | |
Collapse
|
14
|
Crovella S, Del Pero M, Marziliano N, Garagna S, Pecile V, Morgutti M, Boniotto M, Amoroso A, Montagnon D, Campa CC, Lamberti L, Bigatti MP, Ardito G. MFASAT: a new alphoid DNA sequence isolated from Macaca fascicularis (Cercopithecidae, Primates). Genome 1999; 42:1066-70. [PMID: 10659771 DOI: 10.1139/g99-054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A new highly repeated DNA fragment isolated from Macaca fascicularis (MFASAT) is described. Our findings obtained by sequencing, Southern blot analysis, and fluorescent in situ hybridization (FISH) on metaphasic chromosomes strongly suggest that MFASAT can be considered as a member of the alphoid DNA family characteristic of Old World monkeys. The chromosomal localization of MFASAT, obtained by FISH, showed that this alphoid DNA is present in the peri-centromeric area of all the chromosomes. MFASAT showed a high degree of conservation when compared, by sequence alignment, to other Macaca species and Papio papio as expected for species with considerable genome conservation. A low degree of homology has been found comparing M. fascicularis alphoid DNA with a more distantly related Cercopithecidae species such as Cercopithecus aethiops.
Collapse
Affiliation(s)
- S Crovella
- Cattedra di Genetica e Servizio di Genetica, IRCCS Burlo-Garofolo, Trieste, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Kato M, Kato A, Shimizu N. A method for evaluating phylogenetic relationship of alpha-satellite DNA suprachromosomal family by nucleotide frequency calculation. Mol Phylogenet Evol 1999; 13:329-35. [PMID: 10603261 DOI: 10.1006/mpev.1999.0665] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The sequence similarity among chromosome-specific alpha-satellite DNA was quantitatively evaluated by a novel procedure: nucleotide frequency calculation. Tandem-arrayed repetitive DNA segments were aligned with unit length repeat, and the nucleotide frequency at each position was used to estimate the phylogenetic distance between repetitive DNA segments. The calculations for human and chimpanzee X chromosome alpha-satellites showed that the results were consistent with the known relationships of primates, indicating that the nucleotide frequency calculation worked effectively to estimate the distances between satellite arrays. Human chromosome-specific alpha-satellites had been grouped into three suprachromosomal families (I, II, and III), and in the current work the nucleotide frequency analysis has defined the quantitative distances between the chromosome-specific alpha-satellite DNA.
Collapse
Affiliation(s)
- M Kato
- College of the Integrated Arts and Sciences, Osaka Prefecture University, 1-1 Gakuencho, Sakai, 599-8531, Japan.
| | | | | |
Collapse
|
16
|
Kato M. Structural bistability of repetitive DNA elements featuring CA/TG dinucleotide steps and mode of evolution of satellite DNA. EUROPEAN JOURNAL OF BIOCHEMISTRY 1999; 265:204-9. [PMID: 10491175 DOI: 10.1046/j.1432-1327.1999.00714.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Satellite DNA sequences are known to be important components required for the construction of centromeres and are common to all higher eukaryotes. Nevertheless, their nucleotide sequences vary significantly, even in evolutionarily related species. In order to elucidate how the nucleotide sequences define the conformational character of centromeric satellite DNA, an evolutionary path toward repetitive units has been hypothesized. In that context, the DNA conformation of fish satellite DNA was evaluated in two ways: the organization of subrepeats and sequence characteristics were compared, and the differences in stacking energies between A-helix and B-helix and the sequence-dependent bendability of the helices were evaluated. Our findings suggest that the monomeric units making up currently observed repetitive sequences have evolved through stepwise amplification of shorter, ancestral sequences by increasing the length of the units. In addition, we suggest that potentially key sequences required for DNA amplification comprise highly flexible structures. Thus flexibility of the DNA structure may be a primary prerequisite for DNA amplification.
Collapse
Affiliation(s)
- M Kato
- Department of Life Sciences, College of Integrated Arts and Sciences, Osaka Prefecture University, Sakai, Japan.
| |
Collapse
|
17
|
Maratou K, Siddique Y, Kessling AM, Davies GE. Novel methodology for the detection of chromosome 21-specific alpha-satellite DNA sequences. Genomics 1999; 57:429-32. [PMID: 10329010 DOI: 10.1006/geno.1999.5786] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We present a novel method, based on the hybridization of allele-specific oligonucleotide probes, that allows the specific detection of chromosome 21 alpha-satellite sequences. Absence of informative polymorphic markers from the centromeric region of chromosome 21 has constituted one of the difficulties in studying the centromere of this chromosome. The alpha-satellite subfamilies from chromosomes 21 and 13 are almost identical in sequence and thus cannot be distinguished using conventional hybridization techniques. Analysis using nuclear families showed that the centromeric polymorphism, detected using our specific probe and pulsed-field gel restriction analysis, segregates in a Mendelian fashion and exhibits a high degree of polymorphism among unrelated individuals. The alphoid DNA of chromosome 21 is highly polymorphic, useful not only as a definitive anchor for the genetic map, but also for studies of chromosome 21 nondisjunction, including the unequivocal assignment of meiotic origin.
Collapse
Affiliation(s)
- K Maratou
- Department of Medical and Community Genetics, Imperial College of Science, Technology and Medicine, London, HA1 3UJ, United Kingdom
| | | | | | | |
Collapse
|
18
|
Kibar Z, Lafrenière RG, Chakravarti A, Wang JC, Chevrette M, Der Kaloustian VM, Rouleau GA. A radiation hybrid map of 48 loci including the clouston hidrotic ectodermal dysplasia locus in the pericentromeric region of chromosome 13q. Genomics 1999; 56:127-30. [PMID: 10036193 DOI: 10.1006/geno.1998.5698] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
To facilitate the identification of the gene responsible for Clouston hidrotic ectodermal dysplasia (HED), we used a chromosome 13-specific radiation hybrid panel to map 54 loci in the HED candidate region. The marker retention data were analyzed using RHMAP version 3. The 54 markers have an average retention frequency of 31.6% with decreasing retention as a function of distance from the centromere. Two-point analysis identified three linkage groups with a threshold lod score of 4.00; one linkage group consisted of 49 loci including the centromeric marker D13Z1 and the telomeric flanking marker for the HED candidate region D13S143. Assuming a centromeric retention model, multipoint maximum likelihood analysis of these 49 loci except D13Z1 provided a 1000:1 framework map ordering 29 loci with 21 unique map positions and approximately 2000 times more likely than the next order. Loci that could not be ordered with this level of support were positioned within a range of adjacent intervals. This map spans 347 cR9000, has an average resolution of 17.3 cR9000, and includes 3 genes (TUBA2, GJbeta2, and FGF-9), 18 ESTs, 19 polymorphic loci, and 8 single-copy DNA segments. Comparison of our RH map to a YAC contig showed an inconsistency in order involving a reversed interval of 6 loci. Fiber-FISH and FISH on interphase nuclei analyses with PACs isolated from this region supported our order. We also describe the isolation of 8 new chromosome 13q polymorphic (CA)n markers that have an average PIC value of 0.67. These data and mapping reagents will facilitate the isolation of disease genes from this region.
Collapse
Affiliation(s)
- Z Kibar
- Centre for Research in Neurosciences, Montreal General Hospital Research Institute, Montreal, Quebec, H3G 1A4, Canada
| | | | | | | | | | | | | |
Collapse
|
19
|
Laurent AM, Puechberty J, Prades C, Gimenez S, Roizès G. Site-specific retrotransposition of L1 elements within human alphoid satellite sequences. Genomics 1997; 46:127-32. [PMID: 9403067 DOI: 10.1006/geno.1997.4987] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
In the course of a search for microsatellites as centromeric polymorphic markers at the 3' ends of Alu or L1 elements, we observed a much higher frequency of L1 than Alu elements embedded within alpha satellite DNA. By sequence analysis of the L1 elements at their alphoid locus of insertion, we found that the insertion site was specific, with the consensus being (Py)2-10/ (Pu)3-7. All potential sites within the consensus alphoid 171-bp repeat are occupied by such elements. This confirms the finding by Feng et al. (1996; Human retrotransposon encodes a conserved endonuclease required for retrotransposition, Cell 87:905-916) that the progenitor L1 elements encode a site-specific endonuclease and that they generate copies that are inserted at these specific sites. The analysis of retrotransposed L1 elements within the alphoid domains of the acrocentric chromosomes showed that a number of loci are shared among all five acrocentrics. This sheds light on the manner in which centromeric regions of these chromosomes are exchanging information during evolution.
Collapse
Affiliation(s)
- A M Laurent
- Centre National de la Recherche Scientifique, ERS 155, Montpellier, France
| | | | | | | | | |
Collapse
|
20
|
Nilsson M, Krejci K, Koch J, Kwiatkowski M, Gustavsson P, Landegren U. Padlock probes reveal single-nucleotide differences, parent of origin and in situ distribution of centromeric sequences in human chromosomes 13 and 21. Nat Genet 1997; 16:252-5. [PMID: 9207789 DOI: 10.1038/ng0797-252] [Citation(s) in RCA: 128] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Chromosome centromeres, composed of repeated DNA sequences, orchestrate the correct segregation of chromatids in cell division. We have examined the centromeres of human chromosomes 13 and 21 by studying the distribution, in situ, of two alpha satellite sequences that differ in a single nucleotide position. This was possible using padlock probes, oligo-nucleotides that can be ligated into circles upon target recognition. The segregation of individual 13 and 21 homologues in a family was followed by monitoring of the signals from two differentially labelled probes, specific for either sequence variant. A characteristic arrangement of the repeat motifs in three separate spots, oriented transverse to the length axis of the metaphase chromosomes and bilaterally symmetric, indicates that only parts of the detected regions are involved in the centromeric region, joining the sister chromatids before anaphase.
Collapse
Affiliation(s)
- M Nilsson
- Beijer Laboratory, Department of Medical Genetics, Uppsala, Sweden.
| | | | | | | | | | | |
Collapse
|
21
|
Schmid CW. Alu: structure, origin, evolution, significance and function of one-tenth of human DNA. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 1996; 53:283-319. [PMID: 8650306 DOI: 10.1016/s0079-6603(08)60148-8] [Citation(s) in RCA: 139] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Affiliation(s)
- C W Schmid
- Section of Molecular and Cellular Biology, University of California, Davis 95616, USA
| |
Collapse
|
22
|
Warburton PE, Willard HF. Interhomologue sequence variation of alpha satellite DNA from human chromosome 17: evidence for concerted evolution along haplotypic lineages. J Mol Evol 1995; 41:1006-15. [PMID: 8587099 DOI: 10.1007/bf00173182] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Alpha satellite DNA is a family of tandemly repeated DNA found at the centromeres of all primate chromosomes. Different human chromosomes 17 in the population are characterized by distinct alpha satellite haplotypes, distinguished by the presence of variant repeat forms that have precise monomeric deletions. Pair-wise comparisons of sequence diversity between variant repeat units from each haplotype show that they are closely related in sequence. Direct sequencing of PCR-amplified alpha satellite reveals heterogeneous positions between the repeat units on a chromosome as two bands at the same position on a sequencing ladder. No variation was detected in the sequence and location of these heterogeneous positions between chromosomes 17 from the same haplotype, but distinct patterns of variation were detected between chromosomes from different haplotypes. Subsequent sequence analysis of individual repeats from each haplotype confirmed the presence of extensive haplotype-specific sequence variation. Phylogenetic inference yielded a tree that suggests these chromosome 17 repeat units evolve principally along haplotypic lineages. These studies allow insight into the relative rates and/or timing of genetic turnover processes that lead to the homogenization of tandem DNA families.
Collapse
Affiliation(s)
- P E Warburton
- Department of Genetics, Stanford University, CA 94305, USA
| | | |
Collapse
|