1
|
Hallast P, Ebert P, Loftus M, Yilmaz F, Audano PA, Logsdon GA, Bonder MJ, Zhou W, Höps W, Kim K, Li C, Hoyt SJ, Dishuck PC, Porubsky D, Tsetsos F, Kwon JY, Zhu Q, Munson KM, Hasenfeld P, Harvey WT, Lewis AP, Kordosky J, Hoekzema K, O'Neill RJ, Korbel JO, Tyler-Smith C, Eichler EE, Shi X, Beck CR, Marschall T, Konkel MK, Lee C. Assembly of 43 human Y chromosomes reveals extensive complexity and variation. Nature 2023; 621:355-364. [PMID: 37612510 PMCID: PMC10726138 DOI: 10.1038/s41586-023-06425-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 07/11/2023] [Indexed: 08/25/2023]
Abstract
The prevalence of highly repetitive sequences within the human Y chromosome has prevented its complete assembly to date1 and led to its systematic omission from genomic analyses. Here we present de novo assemblies of 43 Y chromosomes spanning 182,900 years of human evolution and report considerable diversity in size and structure. Half of the male-specific euchromatic region is subject to large inversions with a greater than twofold higher recurrence rate compared with all other chromosomes2. Ampliconic sequences associated with these inversions show differing mutation rates that are sequence context dependent, and some ampliconic genes exhibit evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, Yq12, is composed of alternating repeat arrays that show extensive variation in the number, size and distribution, but retain a 1:1 copy-number ratio. Finally, our data suggest that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kb away from the currently established1 boundary. The availability of fully sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of traits with specific Y-chromosomal variants and garnering insights into the evolution and function of complex regions of the human genome.
Collapse
Affiliation(s)
- Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter Ebert
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Mark Loftus
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Marc Jan Bonder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Wolfram Höps
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Kwondo Kim
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Savannah J Hoyt
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fotios Tsetsos
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jee Young Kwon
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Patrick Hasenfeld
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Rachel J O'Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Jan O Korbel
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Miriam K Konkel
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|
2
|
Cechova M, Miga KH. Satellite DNAs and human sex chromosome variation. Semin Cell Dev Biol 2022; 128:15-25. [PMID: 35644878 PMCID: PMC9233459 DOI: 10.1016/j.semcdb.2022.04.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 11/17/2022]
Abstract
Satellite DNAs are present on every chromosome in the cell and are typically enriched in repetitive, heterochromatic parts of the human genome. Sex chromosomes represent a unique genomic and epigenetic context. In this review, we first report what is known about satellite DNA biology on human X and Y chromosomes, including repeat content and organization, as well as satellite variation in typical euploid individuals. Then, we review sex chromosome aneuploidies that are among the most common types of aneuploidies in the general population, and are better tolerated than autosomal aneuploidies. This is demonstrated also by the fact that aging is associated with the loss of the X, and especially the Y chromosome. In addition, supernumerary sex chromosomes enable us to study general processes in a cell, such as analyzing heterochromatin dosage (i.e. additional Barr bodies and long heterochromatin arrays on Yq) and their downstream consequences. Finally, genomic and epigenetic organization and regulation of satellite DNA could influence chromosome stability and lead to aneuploidy. In this review, we argue that the complete annotation of satellite DNA on sex chromosomes in human, and especially in centromeric regions, will aid in explaining the prevalence and the consequences of sex chromosome aneuploidies.
Collapse
Affiliation(s)
- Monika Cechova
- Faculty of Informatics, Masaryk University, Czech Republic
| | - Karen H Miga
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA; UC Santa Cruz Genomics Institute, University of California Santa Cruz, CA 95064, USA
| |
Collapse
|
3
|
de Lima LG, Howe E, Singh VP, Potapova T, Li H, Xu B, Castle J, Crozier S, Harrison CJ, Clifford SC, Miga KH, Ryan SL, Gerton JL. PCR amplicons identify widespread copy number variation in human centromeric arrays and instability in cancer. CELL GENOMICS 2021; 1:100064. [PMID: 34993501 PMCID: PMC8730464 DOI: 10.1016/j.xgen.2021.100064] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 07/13/2021] [Accepted: 08/24/2021] [Indexed: 12/13/2022]
Abstract
Centromeric α-satellite repeats represent ~6% of the human genome, but their length and repetitive nature make sequencing and analysis of those regions challenging. However, centromeres are essential for the stable propagation of chromosomes, so tools are urgently needed to monitor centromere copy number and how it influences chromosome transmission and genome stability. We developed and benchmarked droplet digital PCR (ddPCR) assays that measure copy number for five human centromeric arrays. We applied them to characterize natural variation in centromeric array size, analyzing normal tissue from 37 individuals from China and 39 individuals from the US and UK. Each chromosome-specific array varies in size up to 10-fold across individuals and up to 50-fold across chromosomes, indicating a unique complement of arrays in each individual. We also used the ddPCR assays to analyze centromere copy number in 76 matched tumor-normal samples across four cancer types, representing the most-comprehensive quantitative analysis of centromeric array stability in cancer to date. In contrast to stable transmission in cultured cells, centromeric arrays show gain and loss events in each of the cancer types, suggesting centromeric α-satellite DNA represents a new category of genome instability in cancer. Our methodology for measuring human centromeric-array copy number will advance research on centromeres and genome integrity in normal and disease states.
Collapse
Affiliation(s)
| | - Edmund Howe
- The Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | - Tamara Potapova
- The Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Hua Li
- The Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Baoshan Xu
- Hospital of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Guanghua School of Stomatology, Institute of Stomatological Research, Sun Yat-sen University, Guangzhou, Guangdong Province, China
| | - Jemma Castle
- Newcastle University Centre for Cancer, Newcastle upon Tyne, UK
| | - Steve Crozier
- Newcastle University Centre for Cancer, Newcastle upon Tyne, UK
| | | | | | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Sarra L. Ryan
- Newcastle University Centre for Cancer, Newcastle upon Tyne, UK
| | - Jennifer L. Gerton
- The Stowers Institute for Medical Research, Kansas City, MO, USA
- University of Kansas Medical Center, Kansas City, KS, USA
| |
Collapse
|
4
|
Suzuki Y, Morishita S. The time is ripe to investigate human centromeres by long-read sequencing†. DNA Res 2021; 28:6381569. [PMID: 34609504 PMCID: PMC8502840 DOI: 10.1093/dnares/dsab021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 09/28/2021] [Indexed: 01/05/2023] Open
Abstract
The complete sequencing of human centromeres, which are filled with highly repetitive elements, has long been challenging. In human centromeres, α-satellite monomers of about 171 bp in length are the basic repeating units, but α-satellite monomers constitute the higher-order repeat (HOR) units, and thousands of copies of highly homologous HOR units form large arrays, which have hampered sequence assembly of human centromeres. Because most HOR unit occurrences are covered by long reads of about 10 kb, the recent availability of much longer reads is expected to enable observation of individual HOR occurrences in terms of their single-nucleotide or structural variants. The time has come to examine the complete sequence of human centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8568, Japan
| | - Shinichi Morishita
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8568, Japan
| |
Collapse
|
5
|
Suzuki Y, Myers EW, Morishita S. Rapid and ongoing evolution of repetitive sequence structures in human centromeres. SCIENCE ADVANCES 2020; 6:6/50/eabd9230. [PMID: 33310858 PMCID: PMC7732198 DOI: 10.1126/sciadv.abd9230] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 10/30/2020] [Indexed: 06/12/2023]
Abstract
Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Shinichi Morishita
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| |
Collapse
|
6
|
Miga KH. Centromere studies in the era of 'telomere-to-telomere' genomics. Exp Cell Res 2020; 394:112127. [PMID: 32504677 DOI: 10.1016/j.yexcr.2020.112127] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 05/23/2020] [Accepted: 05/30/2020] [Indexed: 12/17/2022]
Abstract
We are entering into an exciting era of genomics where truly complete, high-quality assemblies of human chromosomes are available end-to-end, or from 'telomere-to-telomere' (T2T). This technological advance offers a new opportunity to include endogenous human centromeric regions in high-resolution, sequence-based studies. These emerging reference maps are expected to reveal a new functional landscape in the human genome, where centromere proteins, transcriptional regulation, and spatial organization can be examined with base-level resolution across different stages of development and disease. Such studies will depend on innovative assembly methods of extremely long tandem repeats (ETRs), or satellite DNAs, paired with the development of new, orthogonal validation methods to ensure accuracy and completeness. This review reflects the progress in centromere genomics, credited by recent advancements in long-read sequencing and assembly methods. In doing so, I will discuss the challenges that remain and the promise for a new period of scientific discovery for satellite DNA biology and centromere function.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, CA, 95064, USA.
| |
Collapse
|
7
|
Miga KH. Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population. Genes (Basel) 2019; 10:E352. [PMID: 31072070 PMCID: PMC6562703 DOI: 10.3390/genes10050352] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 05/03/2019] [Accepted: 05/03/2019] [Indexed: 12/30/2022] Open
Abstract
The central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are enriched in long arrays of tandem repeats, or satellite DNAs, that are known to vary extensively in copy number and repeat structure in the human population. Satellite sequence variation in the human genome is often so large that it is detected cytogenetically, yet due to the lack of a reference assembly and informatics tools to measure this variability, contemporary high-resolution disease association studies are unable to detect causal variants in these regions. Nevertheless, recently uncovered associations between satellite DNA variation and human disease support that these regions present a substantial and biologically important fraction of human sequence variation. Therefore, there is a pressing and unmet need to detect and incorporate this uncharacterized sequence variation into broad studies of human evolution and medical genomics. Here I discuss the current knowledge of satellite DNA variation in the human genome, focusing on centromeric satellites and their potential implications for disease.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, CA 95064, USA.
| |
Collapse
|
8
|
Jain M, Olsen HE, Turner DJ, Stoddart D, Bulazel KV, Paten B, Haussler D, Willard HF, Akeson M, Miga KH. Linear assembly of a human centromere on the Y chromosome. Nat Biotechnol 2018; 36:321-323. [PMID: 29553574 PMCID: PMC5886786 DOI: 10.1038/nbt.4109] [Citation(s) in RCA: 167] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 02/22/2018] [Indexed: 01/21/2023]
Abstract
The human genome reference sequence remains incomplete owing to the challenge of assembling long tracts of near-identical tandem repeats in centromeres. We implemented a nanopore sequencing strategy to generate high-quality reads that span hundreds of kilobases of highly repetitive DNA in a human Y chromosome centromere. Combining these data with short-read variant validation, we assembled and characterized the centromeric region of a human Y chromosome.
Collapse
Affiliation(s)
- Miten Jain
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
| | - Hugh E Olsen
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
| | | | | | - Kira V Bulazel
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
| | - Huntington F Willard
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina USA
- Geisinger National, Bethesda, Maryland USA
| | - Mark Akeson
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California USA
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina USA
| |
Collapse
|
9
|
Human Y chromosome copy number variation in the next generation sequencing era and beyond. Hum Genet 2017; 136:591-603. [PMID: 28378101 PMCID: PMC5418319 DOI: 10.1007/s00439-017-1788-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 03/25/2017] [Indexed: 11/16/2022]
Abstract
The human Y chromosome provides a fertile ground for structural rearrangements owing to its haploidy and high content of repeated sequences. The methodologies used for copy number variation (CNV) studies have developed over the years. Low-throughput techniques based on direct observation of rearrangements were developed early on, and are still used, often to complement array-based or sequencing approaches which have limited power in regions with high repeat content and specifically in the presence of long, identical repeats, such as those found in human sex chromosomes. Some specific rearrangements have been investigated for decades; because of their effects on fertility, or their outstanding evolutionary features, the interest in these has not diminished. However, following the flourishing of large-scale genomics, several studies have investigated CNVs across the whole chromosome. These studies sometimes employ data generated within large genomic projects such as the DDD study or the 1000 Genomes Project, and often survey large samples of healthy individuals without any prior selection. Novel technologies based on sequencing long molecules and combinations of technologies, promise to stimulate the study of Y-CNVs in the immediate future.
Collapse
|
10
|
Miga KH. The Promises and Challenges of Genomic Studies of Human Centromeres. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:285-304. [PMID: 28840242 DOI: 10.1007/978-3-319-58592-5_12] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Human centromeres are genomic regions that act as sites of kinetochore assembly to ensure proper chromosome segregation during mitosis and meiosis. Although the biological importance of centromeres in genome stability, and ultimately, cell viability are well understood, the complete sequence content and organization in these multi-megabase-sized regions remains unknown. The lack of a high-resolution reference assembly inhibits standard bioinformatics protocols, and as a result, sequence-based studies involving human centromeres lag far behind the advances made for the non-repetitive sequences in the human genome. In this chapter, I introduce what is known about the genomic organization in the highly repetitive regions spanning human centromeres, and discuss the challenges these sequences pose for assembly, alignment, and data interpretation. Overcoming these obstacles is expected to issue a new era for centromere genomics, which will offer new discoveries in basic cell biology and human biomedical research.
Collapse
Affiliation(s)
- Karen H Miga
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA, USA.
| |
Collapse
|
11
|
Catacchio CR, Ragone R, Chiatante G, Ventura M. Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches. Sci Rep 2015; 5:14189. [PMID: 26387916 PMCID: PMC4585704 DOI: 10.1038/srep14189] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 08/18/2015] [Indexed: 11/09/2022] Open
Abstract
The centromere/kinetochore interaction is responsible for the pairing and segregation of replicated chromosomes in eukaryotes. Centromere DNA is portrayed as scarcely conserved, repetitive in nature, quickly evolving and protein-binding competent. Among primates, the major class of centromeric DNA is the pancentromeric α-satellite, made of arrays of 171 bp monomers, repeated in a head-to-tail pattern. α-satellite sequences can either form tandem heterogeneous monomeric arrays or assemble in higher-order repeats (HORs). Gorilla centromere DNA has barely been characterized, and data are mainly based on hybridizations of human alphoid sequences. We isolated and finely characterized gorilla α-satellite sequences and revealed relevant structure and chromosomal distribution similarities with other great apes as well as gorilla-specific features, such as the uniquely octameric structure of the suprachromosomal family-2 (SF2). We demonstrated for the first time the orthologous localization of alphoid suprachromosomal families-1 and −2 (SF1 and SF2) between human and gorilla in contrast to chimpanzee centromeres. Finally, the discovery of a new 189 bp monomer type in gorilla centromeres unravels clues to the role of the centromere protein B, paving the way to solve the significance of the centromere DNA’s essential repetitive nature in association with its function and the peculiar evolution of the α-satellite sequence.
Collapse
Affiliation(s)
- C R Catacchio
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - R Ragone
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - G Chiatante
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - M Ventura
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| |
Collapse
|
12
|
Espinosa JRF, Ayub Q, Chen Y, Xue Y, Tyler-Smith C. Structural variation on the human Y chromosome from population-scale resequencing. Croat Med J 2015; 56:194-207. [PMID: 26088844 PMCID: PMC4500966 DOI: 10.3325/cmj.2015.56.194] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2014] [Accepted: 05/24/2015] [Indexed: 11/05/2022] Open
Abstract
AIM To investigate the information about Y-structural variants (SVs) in the general population that could be obtained by low-coverage whole-genome sequencing. METHODS We investigated SVs on the male-specific portion of the Y chromosome in the 70 individuals from Africa, Europe, or East Asia sequenced as part of the 1000 Genomes Pilot project, using data from this project and from additional studies on the same samples. We applied a combination of read-depth and read-pair methods to discover candidate Y-SVs, followed by validation using information from the literature, independent sequence and single nucleotide polymorphism-chip data sets, and polymerase chain reaction experiments. RESULTS We validated 19 Y-SVs, 2 of which were novel. Non-reference allele counts ranged from 1 to 64. The regions richest in variation were the heterochromatic segments near the centromere or the DYZ19 locus, followed by the ampliconic regions, but some Y-SVs were also present in the X-transposed and X-degenerate regions. In all, 5 of the 27 protein-coding gene families on the Y chromosome varied in copy number. CONCLUSIONS We confirmed that Y-SVs were readily detected from low-coverage sequence data and were abundant on the chromosome. We also reported both common and rare Y-SVs that are novel.
Collapse
Affiliation(s)
| | | | | | | | - Chris Tyler-Smith
- Chris Tyler-Smith,The Wellcome Trust Sanger Institute, Hinxton, Cambs. CB10 1SA, UK,
| |
Collapse
|
13
|
Wei W, Fitzgerald TW, Fitzgerald T, Ayub Q, Massaia A, Smith BH, Smith BB, Dominiczak AF, Dominiczak AA, Morris AD, Morris AA, Porteous DJ, Porteous DD, Hurles ME, Tyler-Smith C, Xue Y. Copy number variation in the human Y chromosome in the UK population. Hum Genet 2015; 134:789-800. [PMID: 25957587 PMCID: PMC4460274 DOI: 10.1007/s00439-015-1562-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2015] [Accepted: 04/28/2015] [Indexed: 11/25/2022]
Abstract
We have assessed copy number variation (CNV) in the male-specific part of the human Y chromosome discovered by array comparative genomic hybridization (array-CGH) in 411 apparently healthy UK males, and validated the findings using SNP genotype intensity data available for 149 of them. After manual curation taking account of the complex duplicated structure of Y-chromosomal sequences, we discovered 22 curated CNV events considered validated or likely, mean 0.93 (range 0–4) per individual. 16 of these were novel. Curated CNV events ranged in size from <1 kb to >3 Mb, and in frequency from 1/411 to 107/411. Of the 24 protein-coding genes or gene families tested, nine showed CNV. These included a large duplication encompassing the AMELY and TBL1Y genes that probably has no phenotypic effect, partial deletions of the TSPY cluster and AZFc region that may influence spermatogenesis, and other variants with unknown functional implications, including abundant variation in the number of RBMY genes and/or pseudogenes, and a novel complex duplication of two segments overlapping the AZFa region and including the 3′ end of the UTY gene.
Collapse
Affiliation(s)
- Wei Wei
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function.
Collapse
Affiliation(s)
- Megan E. Aldrup-MacDonald
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; E-Mail:
- Division of Human Genetics, Duke University, Durham, NC 27710, USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; E-Mail:
- Division of Human Genetics, Duke University, Durham, NC 27710, USA
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +1-919-684-9038
| |
Collapse
|
15
|
Miga KH, Newton Y, Jain M, Altemose N, Willard HF, Kent WJ. Centromere reference models for human chromosomes X and Y satellite arrays. Genome Res 2014; 24:697-707. [PMID: 24501022 PMCID: PMC3975068 DOI: 10.1101/gr.159624.113] [Citation(s) in RCA: 165] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The human genome sequence remains incomplete, with multimegabase-sized gaps representing the endogenous centromeres and other heterochromatic regions. Available sequence-based studies within these sites in the genome have demonstrated a role in centromere function and chromosome pairing, necessary to ensure proper chromosome segregation during cell division. A common genomic feature of these regions is the enrichment of long arrays of near-identical tandem repeats, known as satellite DNAs, which offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homogeneity challenges available assembly strategies and, as a result, centromeric regions are omitted from ongoing genomic studies. To address this problem, we utilize monomer sequence and ordering information obtained from whole-genome shotgun reads to model two haploid human satellite arrays on chromosomes X and Y, resulting in an initial characterization of 3.83 Mb of centromeric DNA within an individual genome. To further expand the utility of each centromeric reference sequence model, we evaluate sites within the arrays for short-read mappability and chromosome specificity. Because satellite DNAs evolve in a concerted manner, we use these centromeric assemblies to assess the extent of sequence variation among 366 individuals from distinct human populations. We thus identify two satellite array variants in both X and Y centromeres, as determined by array length and sequence composition. This study provides an initial sequence characterization of a regional centromere and establishes a foundation to extend genomic characterization to these sites as well as to other repeat-rich regions within complex genomes.
Collapse
Affiliation(s)
- Karen H Miga
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | | | | | | | |
Collapse
|
16
|
Natural Selection on Human Y Chromosomes. J Genet Genomics 2014; 41:47-52. [DOI: 10.1016/j.jgg.2014.01.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Revised: 01/23/2014] [Accepted: 01/23/2014] [Indexed: 12/24/2022]
|
17
|
Graham AN, Kalitsis P. Chromosome Y centromere array deletion leads to impaired centromere function. PLoS One 2014; 9:e86875. [PMID: 24466276 PMCID: PMC3899357 DOI: 10.1371/journal.pone.0086875] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 12/17/2013] [Indexed: 11/18/2022] Open
Abstract
The centromere is an essential chromosomal structure that is required for the faithful distribution of replicated chromosomes to daughter cells. Defects in the centromere can compromise the stability of chromosomes resulting in segregation errors. We have characterised the centromeric structure of the spontaneous mutant mouse strain, BALB/cWt, which exhibits a high rate of Y chromosome instability. The Y centromere DNA array shows a de novo interstitial deletion and a reduction in the level of the foundation centromere protein, CENP-A, when compared to the non-deleted centromere array in the progenitor strain. These results suggest there is a lower threshold limit of centromere size that ensures full kinetochore function during cell division.
Collapse
Affiliation(s)
- Alison N. Graham
- Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
| | - Paul Kalitsis
- Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Melbourne, Victoria, Australia
- * E-mail:
| |
Collapse
|
18
|
Abstract
Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance.
Collapse
|
19
|
Abstract
Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
Collapse
|
20
|
Sullivan LL, Boivin CD, Mravinac B, Song IY, Sullivan BA. Genomic size of CENP-A domain is proportional to total alpha satellite array size at human centromeres and expands in cancer cells. Chromosome Res 2011; 19:457-70. [PMID: 21484447 DOI: 10.1007/s10577-011-9208-5] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2011] [Revised: 03/26/2011] [Accepted: 03/29/2011] [Indexed: 12/13/2022]
Abstract
Human centromeres contain multi-megabase-sized arrays of alpha satellite DNA, a family of satellite DNA repeats based on a tandemly arranged 171 bp monomer. The centromere-specific histone protein CENP-A is assembled on alpha satellite DNA within the primary constriction, but does not extend along its entire length. CENP-A domains have been estimated to extend over 2,500 kb of alpha satellite DNA. However, these estimates do not take into account inter-individual variation in alpha satellite array sizes on homologous chromosomes and among different chromosomes. We defined the genomic distance of CENP-A chromatin on human chromosomes X and Y from different individuals. CENP-A chromatin occupied different genomic intervals on different chromosomes, but despite inter-chromosomal and inter-individual array size variation, the ratio of CENP-A to total alpha satellite DNA size remained consistent. Changes in the ratio of alpha satellite array size to CENP-A domain size were observed when CENP-A was overexpressed and when primary cells were transformed by disrupting interactions between the tumor suppressor protein Rb and chromatin. Our data support a model for centromeric domain organization in which the genomic limits of CENP-A chromatin varies on different human chromosomes, and imply that alpha satellite array size may be a more prominent predictor of CENP-A incorporation than chromosome size. In addition, our results also suggest that cancer transformation and amounts of centromeric heterochromatin have notable effects on the amount of alpha satellite that is associated with CENP-A chromatin.
Collapse
Affiliation(s)
- Lori L Sullivan
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
| | | | | | | | | |
Collapse
|
21
|
Xue Y, Tyler-Smith C. An Exceptional Gene: Evolution of the TSPY Gene Family in Humans and Other Great Apes. Genes (Basel) 2011; 2:36-47. [PMID: 24710137 PMCID: PMC3924835 DOI: 10.3390/genes2010036] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2010] [Revised: 12/24/2010] [Accepted: 12/28/2010] [Indexed: 11/16/2022] Open
Abstract
The TSPY gene stands out from all other human protein-coding genes because of its high copy number and tandemly-repeated organization. Here, we review its evolutionary history in great apes in order to assess whether these unusual properties are more likely to result from a relaxation of constraint or an unusual functional role. Detailed comparisons with chimpanzee are possible because a finished sequence of the chimpanzee Y chromosome is available, together with more limited data from other apes. These comparisons suggest that the human-chimpanzee ancestral Y chromosome carried a tandem array of TSPY genes which expanded on the human lineage while undergoing multiple duplication events followed by pseudogene formation on the chimpanzee lineage. The protein coding region is the most highly conserved of the multi-copy Y genes in human-chimpanzee comparisons, and the analysis of the dN/dS ratio indicates that TSPY is evolutionarily highly constrained, but may have experienced positive selection after the human-chimpanzee split. We therefore conclude that the exceptionally high copy number in humans is most likely due to a human-specific but unknown functional role, possibly involving rapid production of a large amount of TSPY protein at some stage during spermatogenesis.
Collapse
Affiliation(s)
- Yali Xue
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambs. CB10 1SA, UK.
| | - Chris Tyler-Smith
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambs. CB10 1SA, UK.
| |
Collapse
|
22
|
Paar V, Glunčić M, Basar I, Rosandić M, Paar P, Cvitković M. Large Tandem, Higher Order Repeats and Regularly Dispersed Repeat Units Contribute Substantially to Divergence Between Human and Chimpanzee Y Chromosomes. J Mol Evol 2010; 72:34-55. [DOI: 10.1007/s00239-010-9401-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2010] [Accepted: 10/25/2010] [Indexed: 10/18/2022]
|
23
|
Giachini C, Nuti F, Turner DJ, Laface I, Xue Y, Daguin F, Forti G, Tyler-Smith C, Krausz C. TSPY1 copy number variation influences spermatogenesis and shows differences among Y lineages. J Clin Endocrinol Metab 2009; 94:4016-22. [PMID: 19773397 PMCID: PMC3330747 DOI: 10.1210/jc.2009-1029] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
CONTEXT TSPY1 is a tandemly-repeated gene on the human Y chromosome forming an array of approximately 21-35 copies. The testicular expression pattern and the inferred function of the TSPY1 protein suggest possible involvement in spermatogenesis. However, data are scarce on TSPY1 copy number variation in different Y lineages and its role in spermatogenesis. OBJECTIVES We sought to define: 1) the extent of TSPY1 copy number variation within and among Y chromosome haplogroups; and 2) the role of TSPY1 dosage in spermatogenic efficiency. MATERIALS AND METHODS A total of 154 idiopathic infertile men and 130 normozoospermic controls from Central Italy were analyzed. We used a quantitative PCR assay to measure TSPY1 copy number and also defined Y haplogroups in all subjects. RESULTS We provide evidence that TSPY1 copy number shows substantial variation among Y haplogroups and thus that population stratification does represent a potential bias in case-control association studies. We also found: 1) a significant positive correlation between TSPY1 copy number and sperm count (P < 0.001); 2) a significant difference in mean TSPY1 copy number between patients and controls (28.4 +/- 8.3 vs. 33.9 +/- 10.7; P < 0.001); and 3) a 1.5-fold increased risk of abnormal sperm parameters in men with less than 33 copies (P < 0.001). CONCLUSIONS TSPY copy number variation significantly influences spermatogenic efficiency. Low TSPY1 copy number is a new risk factor for male infertility with potential clinical consequences.
Collapse
Affiliation(s)
- Claudia Giachini
- Andrology Unit, Department of Clinical Physiopathology, University of Florence, Florence 50139, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Abstract
By 1959 it was recognized that the gene (or genes) responsible for initiating the human male phenotype were carried on the Y chromosome. But in subsequent years, few phenotypes were associated with the Y chromosome. Recently, using molecular techniques combined with classical genetics, the Y chromosome has been the focus of intensive and productive investigation. Some of the findings are unexpected and have extended our understanding of the functions of the human Y chromosome. The notion that the Y chromosome is largely devoid of genes is changing. At the present, over 20 Y chromosome genes or pseudogenes have been identified or cloned, a number that is rapidly increasing. A high proportion of Y chromosome sequences have been found to be related to X chromosome sequences: the assembly of a complete physical map of the Y chromosome euchromatic region (believed to carry all of the genes) has shown 25% of the region studied to have homology to the X chromosome.3 Several X-homologous genes are located in the X and Y chromosome pairing regions, an area predicted to have shared homology. Surprisingly, some of the Y-encoded genes that lie outside of the X and Y pairing region share high sequence similarity, and in at least one case, functional identity, with genes on the X chromosome.
Collapse
|
25
|
|
26
|
Jobling MA, Lo ICC, Turner DJ, Bowden GR, Lee AC, Xue Y, Carvalho-Silva D, Hurles ME, Adams SM, Chang YM, Kraaijenbrink T, Henke J, Guanti G, McKeown B, van Oorschot RAH, Mitchell RJ, de Knijff P, Tyler-Smith C, Parkin EJ. Structural variation on the short arm of the human Y chromosome: recurrent multigene deletions encompassing Amelogenin Y. Hum Mol Genet 2006; 16:307-16. [PMID: 17189292 PMCID: PMC2590852 DOI: 10.1093/hmg/ddl465] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Structural polymorphism is increasingly recognized as a major form of human genome variation, and is particularly prevalent on the Y chromosome. Assay of the Amelogenin Y gene (AMELY) on Yp is widely used in DNA-based sex testing, and sometimes reveals males who have interstitial deletions. In a collection of 45 deletion males from 12 populations, we used a combination of sequence-tagged site mapping, and binary-marker and Y-short tandem repeat haplotyping to understand the structural basis of this variation. Of the 45 deletion males, 41 carry indistinguishable deletions, 3.0-3.8 Mb in size. Breakpoint mapping strongly implicates a mechanism of non-allelic homologous recombination between the proximal major array of TSPY gene-containing repeats, and a single distal copy of TSPY; this is supported by the estimation of TSPY copy number in deleted and non-deleted males. The remaining four males carry three distinct non-recurrent deletions (2.5-4.0 Mb), which may be due to non-homologous mechanisms. Haplotyping shows that TSPY-mediated deletions have arisen seven times independently in the sample. One instance, represented by 30 chromosomes mostly of Indian origin within haplogroup J2e1*/M241, has a time-to-most-recent-common-ancestor of approximately 7700+/-1300 years. In addition to AMELY, deletion males all lack the genes PRKY and TBL1Y, and the rarer deletion classes also lack PCDH11Y. The persistence and expansion of deletion lineages, together with direct phenotypic evidence, suggests that absence of these genes has no major deleterious effects.
Collapse
Affiliation(s)
- Mark A Jobling
- Department of Genetics, University of Leicester, University Road, Leicester LE1 7RH, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Lee HR, Neumann P, Macas J, Jiang J. Transcription and Evolutionary Dynamics of the Centromeric Satellite Repeat CentO in Rice. Mol Biol Evol 2006; 23:2505-20. [PMID: 16987952 DOI: 10.1093/molbev/msl127] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Satellite DNA is a major component of centromeric heterochromatin in most multicellular eukaryotes, where it is typically organized into megabase-sized tandem arrays. It has recently been demonstrated that small interfering RNAs (siRNAs) processed from centromeric satellite repeats can be involved in epigenetic chromatin modifications which appear to underpin centromere function. However, the structural organization and evolution of the centromeric satellite DNA is still poorly understood. We analyzed the centromeric satellite repeat arrays from rice chromosomes 1 and 8 and identified higher order structures and local homogenization of the CentO repeats in these 2 centromeres. We also cloned the CentO repeats from the CENH3-associated nucleosomes by a chromatin immunoprecipitation (ChIP)-based method. Sequence variability analysis of the ChIPed CentO repeats revealed a single variable domain within the repeat. We detected transcripts derived from both strands of the CentO repeats. The CentO transcripts are processed into siRNA, suggesting a potential role of this satellite repeat family in epigenetic chromatin modification.
Collapse
Affiliation(s)
- Hye-Ran Lee
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, USA
| | | | | | | |
Collapse
|
28
|
Yan H, Ito H, Nobuta K, Ouyang S, Jin W, Tian S, Lu C, Venu RC, Wang GL, Green PJ, Wing RA, Buell CR, Meyers BC, Jiang J. Genomic and genetic characterization of rice Cen3 reveals extensive transcription and evolutionary implications of a complex centromere. THE PLANT CELL 2006; 18:2123-33. [PMID: 16877494 PMCID: PMC1560911 DOI: 10.1105/tpc.106.043794] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The centromere is the chromosomal site for assembly of the kinetochore where spindle fibers attach during cell division. In most multicellular eukaryotes, centromeres are composed of long tracts of satellite repeats that are recalcitrant to sequencing and fine-scale genetic mapping. Here, we report the genomic and genetic characterization of the complete centromere of rice (Oryza sativa) chromosome 3. Using a DNA fiber-fluorescence in situ hybridization approach, we demonstrated that the centromere of chromosome 3 (Cen3) contains approximately 441 kb of the centromeric satellite repeat CentO. Cen3 includes an approximately 1,881-kb domain associated with the centromeric histone CENH3. This CENH3-associated chromatin domain is embedded within a 3,113-kb region that lacks genetic recombination. Extensive transcription was detected within the CENH3 binding domain based on comprehensive annotation of protein-coding genes coupled with empirical measurements of mRNA levels using RT-PCR and massively parallel signature sequencing. Genes <10 kb from the CentO satellite array were expressed in several rice tissues and displayed histone modification patterns consistent with euchromatin, suggesting that rice centromeric chromatin accommodates normal gene expression. These results support the hypothesis that centromeres can evolve from gene-containing genomic regions.
Collapse
Affiliation(s)
- Huihuang Yan
- Department of Horticulture, University of Wisconsin, Madison, 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Ewis AA, Lee J, Naroda T, Sano T, Kagawa S, Iwamoto T, Shinka T, Shinohara Y, Ishikawa M, Baba Y, Nakahori Y. Prostate cancer incidence varies among males from different Y-chromosome lineages. Prostate Cancer Prostatic Dis 2006; 9:303-9. [PMID: 16683011 DOI: 10.1038/sj.pcan.4500876] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
The incidence rate of prostate cancer in African-American males is two times higher than Caucasian men and ten times higher than Japanese men. The geographical specificity of Y haplogroups implies that males from different ethnic groups undoubtedly have various Y lineages with different Y-chromosomal characteristics that may affect their susceptibility or resistance to such a male-specific cancer. To confirm this hypothesis we studied the Y-chromosomal haplogroups of 92 Japanese prostate cancer patients comparing them with randomly selected 109 unrelated healthy Japanese male controls who were confirmed to be residents of the same geographical area. Males could be classified using three binary Y-chromosome markers (sex-determining region Y (SRY), YAP, 47z) into four haplogroups DE, O2b(*), O2b1, and untagged group. Our results confirmed that prostate cancer incidence varies among males from different Y-chromosome lineages. Males from DE and the untagged haplogroups are at a significantly higher risk to develop prostate cancer than O2b(*) and O2b1 haplogroups (P=0.01), odds ratio 2.17 and 95% confidence interval (1.16-4.07). Males from haplogroup DE are over-represented in the patient group showing a percentage of 41.3%. The underlying possible causes of susceptibility variations of different Y lineages for such a male-specific cancer tumorigenesis are discussed. These findings explain the lower incidence of prostate cancer in Japanese and other South East Asian males than other populations. To our knowledge, this is the first reliable study examining the association between prostate cancer and Y-chromosomal haplogroups, comparing prostate cancer patients with carefully selected matched controls.
Collapse
Affiliation(s)
- A A Ewis
- Health Technology Research Center, National Institute of Advanced Industrial Science and Technology, Hayashi-cho 2217-14, Takamatsu, Japan.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Bedoya G, Montoya P, García J, Soto I, Bourgeois S, Carvajal L, Labuda D, Alvarez V, Ospina J, Hedrick PW, Ruiz-Linares A. Admixture dynamics in Hispanics: a shift in the nuclear genetic ancestry of a South American population isolate. Proc Natl Acad Sci U S A 2006; 103:7234-9. [PMID: 16648268 PMCID: PMC1464326 DOI: 10.1073/pnas.0508716103] [Citation(s) in RCA: 153] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Although it is well established that Hispanics generally have a mixed Native American, African, and European ancestry, the dynamics of admixture at the foundation of Hispanic populations is heterogeneous and poorly documented. Genetic analyses are potentially very informative for probing the early demographic history of these populations. Here we evaluate the genetic structure and admixture dynamics of a province in northwest Colombia (Antioquia), which prior analyses indicate was founded mostly by Spanish men and native women. We examined surname, Y chromosome, and mtDNA diversity in a geographically structured sample of the region and obtained admixture estimates with highly informative autosomal and X chromosome markers. We found evidence of reduced surname diversity and support for the introduction of several common surnames by single founders, consistent with the isolation of Antioquia after the colonial period. Y chromosome and mtDNA data indicate little population substructure among founder Antioquian municipalities. Interestingly, despite a nearly complete Native American mtDNA background, Antioquia has a markedly predominant European ancestry at the autosomal and X chromosome level, which suggests that, after foundation, continuing admixture with Spanish men (but not with native women) increased the European nuclear ancestry of Antioquia. This scenario is consistent with historical information and with results from population genetics theory.
Collapse
Affiliation(s)
| | | | - Jenny García
- Psiquiatria, Universidad de Antioquia, Apartado Aéreo 1226, Medellín, Colombia
| | - Ivan Soto
- Laboratorio de Genética Molecular and Departamentos de
| | | | - Luis Carvajal
- The Galton Laboratory, Department of Biology, University College London, London NW1 2HE, United Kingdom; and
| | - Damian Labuda
- Université de Montreal, Montreal, QC, Canada H3C 3J7
| | | | - Jorge Ospina
- Psiquiatria, Universidad de Antioquia, Apartado Aéreo 1226, Medellín, Colombia
| | | | - Andrés Ruiz-Linares
- Laboratorio de Genética Molecular and Departamentos de
- The Galton Laboratory, Department of Biology, University College London, London NW1 2HE, United Kingdom; and
- To whom correspondence should be addressed at:
Department of Biology, Wolfson House, University College London, 4 Stephenson Way, London NW1 2HE, United Kingdom. E-mail:
| |
Collapse
|
31
|
Wolf U, Schempp W, Scherer G. Molecular biology of the human Y chromosome. Rev Physiol Biochem Pharmacol 2005; 121:147-213. [PMID: 1485072 DOI: 10.1007/bfb0033195] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- U Wolf
- Institut für Humangenetik und Anthropologie der Universität, Freiburg, FRG
| | | | | |
Collapse
|
32
|
Mudge JM, Jackson MS. Evolutionary implications of pericentromeric gene expression in humans. Cytogenet Genome Res 2005; 108:47-57. [PMID: 15545715 DOI: 10.1159/000080801] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2003] [Accepted: 02/09/2004] [Indexed: 11/19/2022] Open
Abstract
Human pericentromeric sequences are enriched for recent sequence duplications. The continual creation and shuffling of these duplications can create novel intron-exon structures and it has been suggested that these regions have a function as gene nurseries. However, these sequences are also rich in satellite repeats which can repress transcription, and analyses of chromosomes 10 and 21 have suggested that they are transcript poor. Here, we investigate the relationship between pericentromeric duplication and transcription by analyzing the in silico transcriptional profiles within the proximal 1.5 Mb of genomic sequence on all human chromosome arms in relation to duplication status. We identify an approximately 5x excess of transcripts specific to cancer and/or testis in pericentromeric duplications compared to surrounding single copy sequence, with the expression of >50% of all transcripts in duplications being restricted to these tissues. We also identify an approximately 5x excess of transcripts in duplications which contain large quantities of interspersed repeats. These results indicate that the transcriptional profiles of duplicated and single copy sequences within pericentromeric DNA are distinct, suggesting that pericentromeric instability is unlikely to represent a common route for gene creation but may have a disproportionate effect upon genes whose function is restricted to the germ line.
Collapse
Affiliation(s)
- J M Mudge
- The Institute of Human Genetics, The International Centre For Life, University of Newcastle Upon Tyne, UK
| | | |
Collapse
|
33
|
|
34
|
Vogt PH. AZF deletions and Y chromosomal haplogroups: history and update based on sequence. Hum Reprod Update 2005; 11:319-36. [PMID: 15890785 DOI: 10.1093/humupd/dmi017] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
AZF deletions are genomic deletions in the euchromatic part of the long arm of the human Y chromosome (Yq11) associated with azoospermia or severe oligozoospermia. Consequently, it can be assumed that these deletions remove Y chromosomal genes required for spermatogenesis. However, these 'classical' or 'complete' AZF deletions, AZFa, AZFb and AZFc, represent only a subset of rearrangements in Yq11. With the benefit of the Y chromosome sequence, more rearrangements (deletions, duplications, inversions) inside and outside the classical AZF deletion intervals have been elucidated and intra-chromosomal non-allelic homologous recombinations (NAHRs) of repetitive sequence blocks have been identified as their major cause. These include duplications in AZFa, AZFb and AZFc and the partial AZFb and AZFc deletions of which some were summarized under the pseudonym 'gr/gr' deletions. At least some of these rearrangements are associated with distinct Y chromosomal haplogroups and are present with similar frequencies in fertile and infertile men. This suggests a functional redundancy of the AZFb/AZFc multi-copy genes. Alternatively, the functional contribution(s) of these genes to human spermatogenesis might be different in men of different Y haplogroups. That raises the question whether, the frequency of Y haplogroups with different AZF gene contents in distinct human populations leads to a male fertility status that varies between populations or whether, the presence of the multiple Y haplogroups implies a balancing selection via genomic deletion/amplification mechanisms.
Collapse
Affiliation(s)
- Peter H Vogt
- Section of Molecular Genetics & Infertility, Department of Gynecological Endocrinology & Reproductive Medicine, University of Heidelberg, Heidelberg, Germany.
| |
Collapse
|
35
|
Ito H, Nasuda S, Endo TR. A direct repeat sequence associated with the centromeric retrotransposons in wheat. Genome 2005; 47:747-56. [PMID: 15284880 DOI: 10.1139/g04-034] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A high-density BAC filter of Triticum monococcum was screened for the presence of a centromeric retrotransposon using the integrase region as a probe. Southern hybridization to the BAC digests using total genomic DNA probes of Triticum monococcum, Triticum aestivum, and Hordeum vulgare detected differentially hybridizing restriction fragments between wheat and barley. The fragments that hybridized to genomic DNA of wheat but not to that of barley were subcloned. Fluorescence in situ hybridization (FISH) analysis indicated that the clone pHind258 hybridized strongly to centromeric regions in wheat and rye and weakly to those in barley. The sequence of pHind258 was homologous to integrase and long terminal repeats of centromeric Ty3-gypsy retrotransposons of cereal species. Additionally, pHind258 has a pair of 192-bp direct repeats. FISH analysis indicated that the 192-bp repeat probe hybridized to centromeres of wheat and rye but not to those of barley. We found differential FISH signal intensities among wheat chromosomes using the 192-bp probe. In general, the A-genome chromosomes possess strong FISH signals, the B-genome chromosomes possess moderate signals, and the D-genome chromosomes possess weak signals. This was consistent with the estimated copy numbers of the 192-bp repeats in the ancestral species of hexaploid wheat.
Collapse
Affiliation(s)
- Hidetaka Ito
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kitashirakawaoiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan
| | | | | |
Collapse
|
36
|
Amor DJ, Bentley K, Ryan J, Perry J, Wong L, Slater H, Choo KHA. Human centromere repositioning "in progress". Proc Natl Acad Sci U S A 2004; 101:6542-7. [PMID: 15084747 PMCID: PMC404081 DOI: 10.1073/pnas.0308637101] [Citation(s) in RCA: 168] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2003] [Accepted: 03/12/2004] [Indexed: 01/31/2023] Open
Abstract
Centromere repositioning provides a potentially powerful evolutionary force for reproductive isolation and speciation, but the underlying mechanisms remain ill-defined. An attractive model is through the simultaneous inactivation of a normal centromere and the formation of a new centromere at a hitherto noncentromeric chromosomal location with minimal detrimental effect. We report a two-generation family in which the centromeric activity of one chromosome 4 has been relocated to a euchromatic site at 4q21.3 through the epigenetic formation of a neocentromere in otherwise cytogenetically normal and mitotically stable karyotypes. Strong epigenetic inactivation of the original centromere is suggested by retention of 1.3 megabases of centromeric alpha-satellite DNA, absence of detectable molecular alteration in chromosome 4-centromereproximal p- and q-arm sequences, and failure of the inactive centromere to be reactivated through extensive culturing or treatment with histone deacetylase inhibitor trichostatin A. The neocentromere binds functionally essential centromere proteins (CENP-A, CENP-C, CENP-E, CENP-I, BUB1, and HP1), although a moderate reduction in CENP-A binding and sister-chromatid cohesion compared with the typical centromeres suggests possible underlying structural/functional differences. The stable mitotic and meiotic transmissibility of this pseudodicentric-neocentric chromosome in healthy individuals and the ability of the neocentric activity to form in a euchromatic site in preference to a preexisting alphoid domain provide direct evidence for an inherent mechanism of human centromere repositioning and karyotype evolution "in progress." We discuss the wider implication of such a mechanism for meiotic drive and the evolution of primate and other species.
Collapse
Affiliation(s)
- David J Amor
- Murdoch Children's Research Institute and Department of Paediatrics, Genetic Health Services Victoria, Royal Children's Hospital, Flemington Road, Victoria 3052, Australia
| | | | | | | | | | | | | |
Collapse
|
37
|
Kouprina N, Ebersole T, Koriabine M, Pak E, Rogozin IB, Katoh M, Oshimura M, Ogi K, Peredelchuk M, Solomon G, Brown W, Barrett JC, Larionov V. Cloning of human centromeres by transformation-associated recombination in yeast and generation of functional human artificial chromosomes. Nucleic Acids Res 2003; 31:922-34. [PMID: 12560488 PMCID: PMC149202 DOI: 10.1093/nar/gkg182] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2002] [Revised: 12/05/2002] [Accepted: 12/05/2002] [Indexed: 11/12/2022] Open
Abstract
Human centromeres remain poorly characterized regions of the human genome despite their importance for the maintenance of chromosomes. In part this is due to the difficulty of cloning of highly repetitive DNA fragments and distinguishing chromosome-specific clones in a genomic library. In this work we report the highly selective isolation of human centromeric DNA using transformation-associated recombination (TAR) cloning. A TAR vector with alphoid DNA monomers as targeting sequences was used to isolate large centromeric regions of human chromosomes 2, 5, 8, 11, 15, 19, 21 and 22 from human cells as well as monochromosomal hybrid cells. The alphoid DNA array was also isolated from the 12 Mb human mini-chromosome DeltaYq74 that contained the minimum amount of alphoid DNA required for proper chromosome segregation. Preliminary results of the structural analyses of different centromeres are reported in this paper. The ability of the cloned human centromeric regions to support human artificial chromosome (HAC) formation was assessed by transfection into human HT1080 cells. Centromeric clones from DeltaYq74 did not support the formation of HACs, indicating that the requirements for the existence of a functional centromere on an endogenous chromosome and those for forming a de novo centromere may be distinct. A construct with an alphoid DNA array from chromosome 22 with no detectable CENP-B motifs formed mitotically stable HACs in the absence of drug selection without detectable acquisition of host DNAs. In summary, our results demonstrated that TAR cloning is a useful tool for investigating human centromere organization and the structural requirements for formation of HAC vectors that might have a potential for therapeutic applications.
Collapse
Affiliation(s)
- N Kouprina
- Laboratory of Biosystems and Cancer, Center for Cancer Research, National Cancer Institute, NIH, Building 37, Room 5032, Bethesda, MD 20892, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Spence JM, Critcher R, Ebersole TA, Valdivia MM, Earnshaw WC, Fukagawa T, Farr CJ. Co-localization of centromere activity, proteins and topoisomerase II within a subdomain of the major human X alpha-satellite array. EMBO J 2002; 21:5269-80. [PMID: 12356743 PMCID: PMC129033 DOI: 10.1093/emboj/cdf511] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Dissection of human centromeres is difficult because of the lack of landmarks within highly repeated DNA. We have systematically manipulated a single human X centromere generating a large series of deletion derivatives, which have been examined at four levels: linear DNA structure; the distribution of constitutive centromere proteins; topoisomerase IIalpha cleavage activity; and mitotic stability. We have determined that the human X major alpha-satellite locus, DXZ1, is asymmetrically organized with an active subdomain anchored approximately 150 kb in from the Xp-edge. We demonstrate a major site of topoisomerase II cleavage within this domain that can shift if juxtaposed with a telomere, suggesting that this enzyme recognizes an epigenetic determinant within the DXZ1 chromatin. The observation that the only part of the DXZ1 locus shared by all deletion derivatives is a highly restricted region of <50 kb, which coincides with the topo isomerase II cleavage site, together with the high levels of cleavage detected, identify topoisomerase II as a major player in centromere biology.
Collapse
Affiliation(s)
| | | | - Thomas A. Ebersole
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH,
Wellcome Trust Centre for Cell Biology, Institute of Cell and Molecular Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh EH9 3JR, UK, Laboratory of Biosystems and Cancer Genome Structure and Function Section, National Cancer Institute, NIH, Building 49, Room 4A56, Bethesda, MD 20892-4471, USA, Department of Biochemistry and Molecular Biology, University of Cadiz, 11510 Puerto Real, Cadiz, Spain and PRESTO of the Japan Science and Technology Corporation, National Institute of Genetics and Graduate University for Advanced Studies, Mishima, Shizuoka 411-8540, Japan Corresponding author e-mail:
| | - Manuel M. Valdivia
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH,
Wellcome Trust Centre for Cell Biology, Institute of Cell and Molecular Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh EH9 3JR, UK, Laboratory of Biosystems and Cancer Genome Structure and Function Section, National Cancer Institute, NIH, Building 49, Room 4A56, Bethesda, MD 20892-4471, USA, Department of Biochemistry and Molecular Biology, University of Cadiz, 11510 Puerto Real, Cadiz, Spain and PRESTO of the Japan Science and Technology Corporation, National Institute of Genetics and Graduate University for Advanced Studies, Mishima, Shizuoka 411-8540, Japan Corresponding author e-mail:
| | - William C. Earnshaw
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH,
Wellcome Trust Centre for Cell Biology, Institute of Cell and Molecular Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh EH9 3JR, UK, Laboratory of Biosystems and Cancer Genome Structure and Function Section, National Cancer Institute, NIH, Building 49, Room 4A56, Bethesda, MD 20892-4471, USA, Department of Biochemistry and Molecular Biology, University of Cadiz, 11510 Puerto Real, Cadiz, Spain and PRESTO of the Japan Science and Technology Corporation, National Institute of Genetics and Graduate University for Advanced Studies, Mishima, Shizuoka 411-8540, Japan Corresponding author e-mail:
| | - Tatsuo Fukagawa
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH,
Wellcome Trust Centre for Cell Biology, Institute of Cell and Molecular Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh EH9 3JR, UK, Laboratory of Biosystems and Cancer Genome Structure and Function Section, National Cancer Institute, NIH, Building 49, Room 4A56, Bethesda, MD 20892-4471, USA, Department of Biochemistry and Molecular Biology, University of Cadiz, 11510 Puerto Real, Cadiz, Spain and PRESTO of the Japan Science and Technology Corporation, National Institute of Genetics and Graduate University for Advanced Studies, Mishima, Shizuoka 411-8540, Japan Corresponding author e-mail:
| | - Christine J. Farr
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH,
Wellcome Trust Centre for Cell Biology, Institute of Cell and Molecular Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh EH9 3JR, UK, Laboratory of Biosystems and Cancer Genome Structure and Function Section, National Cancer Institute, NIH, Building 49, Room 4A56, Bethesda, MD 20892-4471, USA, Department of Biochemistry and Molecular Biology, University of Cadiz, 11510 Puerto Real, Cadiz, Spain and PRESTO of the Japan Science and Technology Corporation, National Institute of Genetics and Graduate University for Advanced Studies, Mishima, Shizuoka 411-8540, Japan Corresponding author e-mail:
| |
Collapse
|
39
|
Bennett LB, Shriver MD, Bowcock AM. Markers and methods for reconstructing modern human history. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 2001; 8:329-41. [PMID: 10993603 DOI: 10.3109/10425179809034077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
- L B Bennett
- Dept. of Pediatrics, University of Texas Southwestern Medical Center, Dallas 75235-8591, USA
| | | | | |
Collapse
|
40
|
Floridia G, Zatterale A, Zuffardi O, Tyler-Smith C. Mapping of a human centromere onto the DNA by topoisomerase II cleavage. EMBO Rep 2000; 1:489-93. [PMID: 11263492 PMCID: PMC1083782 DOI: 10.1093/embo-reports/kvd110] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have mapped the positions of topoisomerase II binding sites at the centromere of the human Y chromosome using etoposide-mediated DNA cleavage. A single region of cleavage is seen at normal centromeres, spanning approximately 50 kb within the centromeric alphoid array, but this pattern is abolished at two inactive centromeres. It therefore provides a marker for the position of the active centromere. Although the underlying centromeric DNA structure is variable, the position of the centromere measured in this way is fixed relative to the Yp edge of the array, and has retained the same position for >100,000 years.
Collapse
Affiliation(s)
- G Floridia
- Department of Biochemistry, University of Oxford, UK
| | | | | | | |
Collapse
|
41
|
Abstract
Recent discoveries of many new genes have made it clear that there is more to the human Y chromosome than a heap of evolutionary debris, hooked up to a sequence that happens to endow its bearer with testes. Coupled with the recent development of new polymorphic markers on the Y, making it the best-characterized haplotypic system in the genome, this gives us new opportunities to assess its role in disease and selection, through association studies with phenotypes such as infertility and cancers. However, the peculiar genetics of this bizarre chromosome means that we should interpret such studies particularly cautiously.
Collapse
Affiliation(s)
- M A Jobling
- Department of Genetics, University of Leicester, UK.
| | | |
Collapse
|
42
|
Kittles RA, Long JC, Bergen AW, Eggert M, Virkkunen M, Linnoila M, Goldman D. Cladistic association analysis of Y chromosome effects on alcohol dependence and related personality traits. Proc Natl Acad Sci U S A 1999; 96:4204-9. [PMID: 10097188 PMCID: PMC22445 DOI: 10.1073/pnas.96.7.4204] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Association between Y chromosome haplotype variation and alcohol dependence and related personality traits was investigated in a large sample of psychiatrically diagnosed Finnish males. Haplotypes were constructed for 359 individuals using alleles at eight loci (seven microsatellite loci and a nucleotide substitution in the DYZ3 alphoid satellite locus). A cladogram linking the 102 observed haplotype configurations was constructed by using parsimony with a single-step mutation model. Then, a series of contingency tables nested according to the cladogram hierarchy were used to test for association between Y haplotype and alcohol dependence. Finally, using only alcohol-dependent subjects, we tested for association between Y haplotype and personality variables postulated to define subtypes of alcoholism-antisocial personality disorder, novelty seeking, harm avoidance, and reward dependence. Significant association with alcohol dependence was observed at three Y haplotype clades, with significance levels of P = 0.002, P = 0.020, and P = 0.010. Within alcohol-dependent subjects, no relationship was revealed between Y haplotype and antisocial personality disorder, novelty seeking, harm avoidance, or reward dependence. These results demonstrate, by using a fully objective association design, that differences among Y chromosomes contribute to variation in vulnerability to alcohol dependence. However, they do not demonstrate an association between Y haplotype and the personality variables thought to underlie the subtypes of alcoholism.
Collapse
Affiliation(s)
- R A Kittles
- Section on Population Genetics and Linkage, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Rockville, MD. 20852, USA
| | | | | | | | | | | | | |
Collapse
|
43
|
Ali S, Azfer MA, Bashamboo A, Mathur PK, Malik PK, Mathur VB, Raha AK, Ansari S. Characterization of a species-specific repetitive DNA from a highly endangered wild animal, Rhinoceros unicornis, and assessment of genetic polymorphism by microsatellite associated sequence amplification (MASA). Gene X 1999; 228:33-42. [PMID: 10072756 DOI: 10.1016/s0378-1119(99)00015-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
We have cloned and sequenced a 906bp EcoRI repeat DNA fraction from Rhinoceros unicornis genome. The contig pSS(R)2 is AT rich with 340 A (37.53%), 187 C (20.64%), 173 G (19.09%) and 206 T (22.74%). The sequence contains MALT box, NF-E1, Poly-A signal, lariat consensus sequences, TATA box, translational initiation sequences and several stop codons. Translation of the contig showed seven different types of protein motifs, among which, EGF-like domain cysteine pattern signatures and Bowman-Birk serine protease inhibitor family signatures were prominent. The presence of eukaryotic transcriptional elements, protein signatures and analysis of subset sequences in the 5' region from 1 to 165nt indicating coding potential (test code value=0.97) suggest possible regulatory and/or functional role(s) of these sequences in the rhino genome. Translation of the complementary strand from 906 to 706nt and 190 to 2nt showed proteins of more than 7kDa rich in non-polar residues. This suggests that pSS(R)2 is either a part of, or adjacent to, a functional gene. The contig contains mostly non-consecutive simple repeat units from 2 to 17nt with varying frequencies, of which four base motifs were found to be predominant. Zoo-blot hybridization revealed that pSS(R)2 sequences are unique to R. unicornis genome because they do not cross-hybridize, even with the genomic DNA of South African black rhino Diceros bicornis. Southern blot analysis of R. unicornis genomic DNA with pSS(R)2 and other synthetic oligo probes revealed a high level of genetic homogeneity, which was also substantiated by microsatellite associated sequence amplification (MASA). Owing to its uniqueness, the pSS(R)2 probe has a potential application in the area of conservation biology for unequivocal identification of horn or other body tissues of R. unicornis. The evolutionary aspect of this repeat fraction in the context of comparative genome analysis is discussed.
Collapse
Affiliation(s)
- S Ali
- Molecular Genetics Laboratory, National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110 067, India.
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Pilgrim D. CeRep25B forms chromosome-specific minisatellite arrays in Caenorhabditis elegans. Genome Res 1998; 8:1192-201. [PMID: 9847081 PMCID: PMC310793 DOI: 10.1101/gr.8.11.1192] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
With the completion of the Genome Sequencing Project, it is now possible to rapidly and accurately determine the frequency and position of a particular repeat sequence in the Caenorhabditis elegans genome. Several repeat sequences with a variety of characteristics have been examined and with few exceptions they show a near-random distribution throughout the genome. We characterized several genes near the left end of Chromosome III in the C. elegans genome, and found a 24-bp minisatellite repeat sequence present in the introns of two unrelated genes. This prompted a search of the databank for other occurrences of this sequence. Multiple copy arrays of this repeat are all located on the same autosome and fall in two clusters: one near the left end, and one in the central region separated by approximately 10 Mb. There are >200 copies of this repeat on the chromosome. This euchromatic repeat sequence seems unrelated to gene expression, is absent from homologous sites in a related species, is unstable in Escherichia coli, and is polymorphic between different wild isolates of C. elegans. Most CeRep25B units in the array match the consensus sequence very well, suggesting that either this repeat originated quite recently or its sequence is functionally constrained. Although chromosome-specific repeat sequences have been reported previously in many organisms, such sequences are usually structural and heterochromatic (e.g., centromeric alpha-satellite) or on the mammalian sex chromosomes. This report describes the first confirmed instance from a whole genome sequencing project of an autosomal euchromatic chromosome-specific minisatellite repeat.
Collapse
Affiliation(s)
- D Pilgrim
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada T6G 2E9.
| |
Collapse
|
45
|
Laurent AM, Puechberty J, Prades C, Roizès G. Informative genetic polymorphic markers within the centromeric regions of human chromosomes 17 (D17S2205) and 11 (D11S4975). Genomics 1998; 52:166-72. [PMID: 9782082 DOI: 10.1006/geno.1998.5428] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have taken advantage of the presence of retrotransposed L1 elements within the centromeric alphoid sequences of the human genome to characterize polymorphic markers at the centromeres of human chromosomes 17 and 11 (D17S2205 and D11S4975, respectively). They correspond to microsatellites found at the 3' ends of L1 elements inserted within the alpha satellite sequences of the two chromosomes. They were detected after PCR by direct analysis in sequencing gels. Eight and five alleles, respectively, were found with heterozygosities of 0.67 and 0.68. They were converted into STSs by designing primers specific for each. D17S2205 and D11S4975 can be used as genuine anchor-informative genetic points for chromosomes 17 and 11. Both markers have been placed on the available genetic maps of their centromeric regions. The alphoid domain within which D17S2205 is embedded is ancestral to the canonical ones on chromosome 17 that exhibit several haplotypes in present-day human populations.
Collapse
MESH Headings
- Centromere/genetics
- Chromosomes, Human, Pair 11/genetics
- Chromosomes, Human, Pair 17/genetics
- DNA, Satellite/analysis
- DNA, Satellite/chemistry
- DNA, Satellite/genetics
- Electrophoresis, Gel, Pulsed-Field
- Humans
- Microsatellite Repeats
- Molecular Sequence Data
- Pedigree
- Polymorphism, Genetic
Collapse
Affiliation(s)
- A M Laurent
- Séquences Répétées et Centromères Humains, CNRS ERS 155, Institut de Biologie, 4 Boulevard Henri IV, Montpellier Cedex, 34060, France
| | | | | | | |
Collapse
|
46
|
Mahtani MM, Willard HF. Physical and genetic mapping of the human X chromosome centromere: repression of recombination. Genome Res 1998; 8:100-10. [PMID: 9477338 DOI: 10.1101/gr.8.2.100] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Classical genetic studies in Drosophila and yeast have shown that chromosome centromeres have a cis-acting ability to repress meiotic exchange in adjacent DNA. To determine whether a similar phenomenon exists at human centromeres, we measured the rate of meiotic recombination across the centromere of the human X chromosome. We have constructed a long-range physical map of centromeric alpha-satellite DNA (DXZ1) by pulsed-field gel analysis, as well as detailed meiotic maps of the pericentromeric region of the X chromosome in the CEPH family panel. By comparing these two maps, we determined that, in the proximal region of the X chromosome, a genetic distance of 0.57 cM exists between markers that span the centromere and are separated by at least the average 3600 kb physical distance mapped across the DXZ1 array. Therefore, the rate of meiotic exchange across the X chromosome centromere is <1 cM/6300 kb (and perhaps as low as 1 cM/17,000 kb on the basis of other physical mapping data), at least eightfold lower than the average rate of female recombination on the X chromosome and one of the lowest rates of exchange yet observed in the human genome.
Collapse
Affiliation(s)
- M M Mahtani
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120, USA
| | | |
Collapse
|
47
|
Poloni ES, Semino O, Passarino G, Santachiara-Benerecetti AS, Dupanloup I, Langaney A, Excoffier L. Human genetic affinities for Y-chromosome P49a,f/TaqI haplotypes show strong correspondence with linguistics. Am J Hum Genet 1997; 61:1015-35. [PMID: 9346874 PMCID: PMC1716025 DOI: 10.1086/301602] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Numerous population samples from around the world have been tested for Y chromosome-specific p49a,f/TaqI restriction polymorphisms. Here we review the literature as well as unpublished data on Y-chromosome p49a,f/TaqI haplotypes and provide a new nomenclature unifying the notations used by different laboratories. We use this large data set to study worldwide genetic variability of human populations for this paternally transmitted chromosome segment. We observe, for the Y chromosome, an important level of population genetics structure among human populations (FST = .230, P < .001), mainly due to genetic differences among distinct linguistic groups of populations (FCT = .246, P < .001). A multivariate analysis based on genetic distances between populations shows that human population structure inferred from the Y chromosome corresponds broadly to language families (r = .567, P < .001), in agreement with autosomal and mitochondrial data. Times of divergence of linguistic families, estimated from their internal level of genetic differentiation, are fairly concordant with current archaeological and linguistic hypotheses. Variability of the p49a,f/TaqI polymorphic marker is also significantly correlated with the geographic location of the populations (r = .613, P < .001), reflecting the fact that distinct linguistic groups generally also occupy distinct geographic areas. Comparison of Y-chromosome and mtDNA RFLPs in a restricted set of populations shows a globally high level of congruence, but it also allows identification of unequal maternal and paternal contributions to the gene pool of several populations.
Collapse
Affiliation(s)
- E S Poloni
- Département d'Anthropologie et Ecologie, Université de Genève, Carouge, Switzerland.
| | | | | | | | | | | | | |
Collapse
|
48
|
Mitchell RJ, Earl L, Fricke B. Y-chromosome specific alleles and haplotypes in European and Asian populations: linkage disequilibrium and geographic diversity. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 1997; 104:167-76. [PMID: 9386824 DOI: 10.1002/(sici)1096-8644(199710)104:2<167::aid-ajpa3>3.0.co;2-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Variation on the Y chromosome may permit our understanding the evolution of the human paternal lineage and male gene flow. This study reports upon the distribution and non random association of alleles at four Y-chromosome specific loci in four populations, three Caucasoid (Italian, Greek and Slav) and one Asian. The markers include insertion/deletion (p12f), point mutation (92R7 and pY alpha I), and repeat sequence (p21A1) polymorphisms. Our data confirm that the p12f/TaqI 8 kb allele is a Caucasoid marker and that Asians are monomorphic at three of the loci (p12f, 92R7, and pY alpha I). The alleles at 92R7 and pY alpha I were found to be in complete disequilibrium in Europeans. Y-haplotype diversity was highly significant between Asians and all three European groups (P < 0.001), but the Greeks and Italians were also significantly different with respect to some alleles and haplotypes (P < 0.02). We find strong evidence that the p12f/TaqI 8 kb allele may have arisen only once, as a deletion event, and, additionally, that the present-day frequency distribution of Y chromosomes carrying the p12f/8 kb allele suggests that it may have been spread by colonising sea-faring peoples from the Near East, possibly the Phoenicians, rather than by expansion of Neolithic farmers into continental Europe. The p12f deletion is the key marker of a unique Y chromosome, found only in Caucasians to date, labelled 'Mediterranean' and this further increases the level of Y-chromosome diversity seen among Caucasoids when compared to the other major population groups.
Collapse
Affiliation(s)
- R J Mitchell
- School of Genetics and Human Variation, La Trobe University, Melbourne, Victoria, Australia.
| | | | | |
Collapse
|
49
|
Abstract
The past two years have seen the increased study of Y-chromosome polymorphisms and their relationship to human evolution and variation. Low Y-chromosome sequence diversity indicates that the common ancestor of all extant Y chromosomes lived relatively recently and the consensus of estimates of time to the most recent common ancestor concur with estimates of the mitochondrial DNA ancestor; but we do not know where this 'Adam' lived. Though the reason for low nucleotide diversity on the Y-chromosome remains unresolved, some of the mutations are proving highly informative in tracing human prehistoric migrations and are generating new hypotheses on human colonizations and migrations. The recent discovery of highly polymorphic microsatellites on the Y offers new possibilities for the investigation of more recent human evolutionary events, including the identification of male founders.
Collapse
Affiliation(s)
- R J Mitchell
- La Trobe University, School of Genetics & Human Variation, Bundoora, Victoria, Australia.
| | | |
Collapse
|
50
|
Ruiz Linares A, Nayar K, Goldstein DB, Hebert JM, Seielstad MT, Underhill PA, Lin AA, Feldman MW, Cavalli Sforza LL. Geographic clustering of human Y-chromosome haplotypes. Ann Hum Genet 1996; 60:401-8. [PMID: 8912793 DOI: 10.1111/j.1469-1809.1996.tb00438.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Five polymorphic markers on the Y-chromosome (mostly microsatellites) were typed in 121 individuals from 13 populations around the world. With these markers 78 different haplotypes were detected. Haplotypes present more than once tend to be shared by individuals from the same population or continent. A reconstruction of haplotype phylogeny also indicates significant geographic structure in the data. Based on the similarity of the haplotypes, population relationships were examined and found to be largely concordant with those obtained with other markers. Even though the sample size and the number of markers are small, there is very signficant clustering of the haplotypes by continent of origin.
Collapse
Affiliation(s)
- A Ruiz Linares
- Department of Genetics, Stanford University, CA 94305, USA
| | | | | | | | | | | | | | | | | |
Collapse
|