1
|
Using de novo assembly to identify structural variation of eight complex immune system gene regions. PLoS Comput Biol 2021; 17:e1009254. [PMID: 34343164 PMCID: PMC8363018 DOI: 10.1371/journal.pcbi.1009254] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 08/13/2021] [Accepted: 07/06/2021] [Indexed: 12/11/2022] Open
Abstract
Driven by the necessity to survive environmental pathogens, the human immune system has evolved exceptional diversity and plasticity, to which several factors contribute including inheritable structural polymorphism of the underlying genes. Characterizing this variation is challenging due to the complexity of these loci, which contain extensive regions of paralogy, segmental duplication and high copy-number repeats, but recent progress in long-read sequencing and optical mapping techniques suggests this problem may now be tractable. Here we assess this by using long-read sequencing platforms from PacBio and Oxford Nanopore, supplemented with short-read sequencing and Bionano optical mapping, to sequence DNA extracted from CD14+ monocytes and peripheral blood mononuclear cells from a single European individual identified as HV31. We use this data to build a de novo assembly of eight genomic regions encoding four key components of the immune system, namely the human leukocyte antigen, immunoglobulins, T cell receptors, and killer-cell immunoglobulin-like receptors. Validation of our assembly using k-mer based and alignment approaches suggests that it has high accuracy, with estimated base-level error rates below 1 in 10 kb, although we identify a small number of remaining structural errors. We use the assembly to identify heterozygous and homozygous structural variation in comparison to GRCh38. Despite analyzing only a single individual, we find multiple large structural variants affecting core genes at all three immunoglobulin regions and at two of the three T cell receptor regions. Several of these variants are not accurately callable using current algorithms, implying that further methodological improvements are needed. Our results demonstrate that assessing haplotype variation in these regions is possible given sufficiently accurate long-read and associated data. Continued reductions in the cost of these technologies will enable application of these methods to larger samples and provide a broader catalogue of germline structural variation at these loci, an important step toward making these regions accessible to large-scale genetic association studies. The human immune system is incredibly versatile underlying its capacity to defend the body against thousands of pathogens. At a molecular level, it recognizes pathogens using large libraries of antibodies and related protein receptors. These molecules are encoded by gene families that are particularly difficult to analyze due to their unusually complex patterns of similarities and differences between genes and individuals. To overcome this, we applied several sequencing methods to DNA from a single individual and developed methods to reconstruct the underlying sequence at eight of the immune-associated regions. Importantly, we used DNA extracted from monocytes to avoid capturing the further rearrangements that occur in active immune cells. We generated accurate assemblies by integrating multiple complementary data types, although we noted a small subset of locations that remain challenging. Moreover, we found that this individual contains multiple structural differences between the two inherited chromosomes and compared to previously analyzed genomes, affecting the copy number of immune system genes. Application of these methods in larger numbers of individuals will clearly uncover much more variation than is currently known, and might lead to new understanding of the effect of genetic variation on the broad range of human diseases determined by the immune response.
Collapse
|
2
|
Foley JF, Elgart B, Alex Merrick B, Phadke DP, Cook ME, Malphurs JA, Solomon GG, Shah RR, Fessler MB, Miller FW, Gerrish KE. Whole genome sequencing of low input circulating cell-free DNA obtained from normal human subjects. Physiol Rep 2021; 9:e14993. [PMID: 34350716 PMCID: PMC8339531 DOI: 10.14814/phy2.14993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 07/13/2021] [Accepted: 07/14/2021] [Indexed: 11/29/2022] Open
Abstract
Cell-free DNA circulates in plasma at low levels as a normal by-product of cellular apoptosis. Multiple clinical pathologies, as well as environmental stressors can lead to increased circulating cell-free DNA (ccfDNA) levels. Plasma DNA studies frequently employ targeted amplicon deep sequencing platforms due to limited concentrations (ng/ml) of ccfDNA in the blood. Here, we report whole genome sequencing (WGS) and read distribution across chromosomes of ccfDNA extracted from two human plasma samples from normal, healthy subjects, representative of limited clinical samples at <1 ml. Amplification was sufficiently robust with ~90% of the reference genome (GRCh38.p2) exhibiting 10X coverage. Chromosome read coverage was uniform and directly proportional to the number of reads for each chromosome across both samples. Almost 99% of the identified genomic sequence variants were known annotated dbSNP variants in the hg38 reference genome. A high prevalence of C>T and T>C mutations was present along with a strong concordance of variants shared between the germline genome databases; gnomAD (81.1%) and the 1000 Genome Project (93.6%). This study demonstrates isolation and amplification procedures from low input ccfDNA samples that can detect sequence variants across the whole genome from amplified human plasma ccfDNA that can translate to multiple clinical research disciplines.
Collapse
Affiliation(s)
- Julie F. Foley
- Division of National Toxicology ProgramNIEHSDurhamNorth CarolinaUSA
| | | | - B. Alex Merrick
- Division of National Toxicology ProgramNIEHSDurhamNorth CarolinaUSA
| | | | - Molly E. Cook
- Division of Intramural ResearchNIEHSDurhamNorth CarolinaUSA
| | | | | | | | | | | | | |
Collapse
|
3
|
Contreras-Galindo R, Fischer S, Saha AK, Lundy JD, Cervantes PW, Mourad M, Wang C, Qian B, Dai M, Meng F, Chinnaiyan A, Omenn GS, Kaplan MH, Markovitz DM. Rapid molecular assays to study human centromere genomics. Genome Res 2017; 27:2040-2049. [PMID: 29141960 PMCID: PMC5741061 DOI: 10.1101/gr.219709.116] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 10/27/2017] [Indexed: 01/16/2023]
Abstract
The centromere is the structural unit responsible for the faithful segregation of chromosomes. Although regulation of centromeric function by epigenetic factors has been well-studied, the contributions of the underlying DNA sequences have been much less well defined, and existing methodologies for studying centromere genomics in biology are laborious. We have identified specific markers in the centromere of 23 of the 24 human chromosomes that allow for rapid PCR assays capable of capturing the genomic landscape of human centromeres at a given time. Use of this genetic strategy can also delineate which specific centromere arrays in each chromosome drive the recruitment of epigenetic modulators. We further show that, surprisingly, loss and rearrangement of DNA in centromere 21 is associated with trisomy 21. This new approach can thus be used to rapidly take a snapshot of the genetics and epigenetics of each specific human centromere in nondisjunction disorders and other biological settings.
Collapse
Affiliation(s)
| | - Sabrina Fischer
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Laboratory of Molecular Virology, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay 11400
| | - Anjan K Saha
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Medical Scientist Training Program, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - John D Lundy
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Patrick W Cervantes
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mohamad Mourad
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Claire Wang
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Brian Qian
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Manhong Dai
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Fan Meng
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Psychiatry, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Arul Chinnaiyan
- Michigan Center for Translational Pathology and Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA.,Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA
| | - Gilbert S Omenn
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Human Genetics.,Departments of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mark H Kaplan
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - David M Markovitz
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Immunology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cellular and Molecular Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
4
|
Kumar P, Jain M, Kalsi AK, Halder A. Molecular characterisation of a case of dicentric Y presented as nonobstructive azoospermia with testicular early maturation arrest. Andrologia 2017; 50. [PMID: 28836280 DOI: 10.1111/and.12886] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/10/2017] [Indexed: 12/28/2022] Open
Abstract
The dicentric Y chromosome is the most common cytogenetically visible structural abnormality of Y chromosome. The sites of break and fusion of dicentric Y are variable, but break and fusion at Yq12 (proximal to the pseudoautosomal region 2/PAR 2) is very rare. Dicentric Y chromosome is unstable during cell division and likely to generate chromosomal mosaicism. Here, we report a case of infertile male with nonmosaic 46,XY where chromosome Y was dicentric with break and fusion at Yq12 (proximal to PAR 2). Clinical presentation of the case was nonobstructive azoospermia due to early maturation arrest at the primary spermatocyte stage. Various molecular techniques such as FISH, STS-PCR and DNA microarray were carried out to characterise genetic defect leading to testicular maturation arrest in the patient. The break and fusion was found at Yq12 (proximal to PAR 2) and resulted in near total duplication of Y chromosome (excluding PAR 2). The reason for maturation arrest seems due to CNVs of PARs (gain in PAR 1 and loss of PAR 2) and azoospermia factors (gain).
Collapse
Affiliation(s)
- P Kumar
- Department of Reproductive Biology, All India Institute of Medical Sciences, New Delhi, India
| | - M Jain
- Department of Reproductive Biology, All India Institute of Medical Sciences, New Delhi, India
| | - A K Kalsi
- Department of Reproductive Biology, All India Institute of Medical Sciences, New Delhi, India
| | - A Halder
- Department of Reproductive Biology, All India Institute of Medical Sciences, New Delhi, India
| |
Collapse
|
5
|
Miga KH. The Promises and Challenges of Genomic Studies of Human Centromeres. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:285-304. [PMID: 28840242 DOI: 10.1007/978-3-319-58592-5_12] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Human centromeres are genomic regions that act as sites of kinetochore assembly to ensure proper chromosome segregation during mitosis and meiosis. Although the biological importance of centromeres in genome stability, and ultimately, cell viability are well understood, the complete sequence content and organization in these multi-megabase-sized regions remains unknown. The lack of a high-resolution reference assembly inhibits standard bioinformatics protocols, and as a result, sequence-based studies involving human centromeres lag far behind the advances made for the non-repetitive sequences in the human genome. In this chapter, I introduce what is known about the genomic organization in the highly repetitive regions spanning human centromeres, and discuss the challenges these sequences pose for assembly, alignment, and data interpretation. Overcoming these obstacles is expected to issue a new era for centromere genomics, which will offer new discoveries in basic cell biology and human biomedical research.
Collapse
Affiliation(s)
- Karen H Miga
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA, USA.
| |
Collapse
|
6
|
Altemose N, Miga KH, Maggioni M, Willard HF. Genomic characterization of large heterochromatic gaps in the human genome assembly. PLoS Comput Biol 2014; 10:e1003628. [PMID: 24831296 PMCID: PMC4022460 DOI: 10.1371/journal.pcbi.1003628] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 03/26/2014] [Indexed: 01/24/2023] Open
Abstract
The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome shotgun read datasets. Utilizing this approach, we classify HSat2,3 sequences into fourteen subfamilies and predict their chromosomal distributions, resulting in a comprehensive satellite reference database to further enable genomic studies of heterochromatic regions. We also identify 1.3 Mb of non-repetitive sequence interspersed with HSat2,3 across 17 unmapped assembly scaffolds, including eight annotated gene predictions. Finally, we apply our satellite reference database to high-throughput sequence data from 396 males to estimate array size variation of the predominant HSat3 array on the Y chromosome, confirming that satellite array sizes can vary between individuals over an order of magnitude (7 to 98 Mb) and further demonstrating that array sizes are distributed differently within distinct Y haplogroups. In summary, we present a novel framework for generating initial reference databases for unassembled genomic regions enriched with complex satellite DNA, and we further demonstrate the utility of these reference databases for studying patterns of sequence variation within human populations. At least 5–10% of the human genome remains unassembled, unmapped, and poorly characterized. The reference assembly annotates these missing regions as multi-megabase heterochromatic gaps, found primarily near centromeres and on the short arms of the acrocentric chromosomes. This missing fraction of the genome consists predominantly of long arrays of near-identical tandem repeats called satellite DNA. Due to the repetitive nature of satellite DNA, sequence assembly algorithms cannot uniquely align overlapping sequence reads, and thus satellite-rich domains have been omitted from the reference assembly and from most genome-wide studies of variation and function. Existing methods for analyzing some satellite DNAs cannot be easily extended to a large portion of satellites whose repeat structures are complex and largely uncharacterized, such as Human Satellites 2 and 3 (HSat2,3). Here we characterize HSat2,3 using a novel approach that does not depend on having a well-defined repeat structure. By classifying genome-wide HSat2,3 sequences into subfamilies and localizing them to chromosomes, we have generated an initial HSat2,3 genomic reference, which serves as a critical foundation for future studies of variation and function in these regions. This approach should be generally applicable to other classes of satellite DNA, in both the human genome and other complex genomes.
Collapse
Affiliation(s)
- Nicolas Altemose
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Karen H. Miga
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- * E-mail:
| | - Mauro Maggioni
- Department of Mathematics, Duke University, Durham, North Carolina, United States of America
| | - Huntington F. Willard
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| |
Collapse
|
7
|
Carvalho AB, Clark AG. Efficient identification of Y chromosome sequences in the human and Drosophila genomes. Genome Res 2013; 23:1894-907. [PMID: 23921660 PMCID: PMC3814889 DOI: 10.1101/gr.156034.113] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Accepted: 07/25/2013] [Indexed: 12/25/2022]
Abstract
Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
Collapse
Affiliation(s)
- Antonio Bernardo Carvalho
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Caixa Postal 68011, CEP 21941-971, Rio de Janeiro, Brazil
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Andrew G. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
8
|
Genovese G, Handsaker RE, Li H, Altemose N, Lindgren AM, Chambert K, Pasaniuc B, Price AL, Reich D, Morton CC, Pollak MR, Wilson JG, McCarroll SA. Using population admixture to help complete maps of the human genome. Nat Genet 2013; 45:406-14, 414e1-2. [PMID: 23435088 PMCID: PMC3683849 DOI: 10.1038/ng.2565] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 01/31/2013] [Indexed: 12/16/2022]
Abstract
Tens of millions of base pairs of euchromatic human genome sequence, including many protein-coding genes, have no known location in the human genome. We describe an approach for localizing the human genome's missing pieces using the patterns of genome sequence variation created by population admixture. We mapped the locations of 70 scaffolds spanning 4 million base pairs of the human genome's unplaced euchromatic sequence, including more than a dozen protein-coding genes, and identified 8 new large interchromosomal segmental duplications. We find that most of these sequences are hidden in the genome's heterochromatin, particularly its pericentromeric regions. Many cryptic, pericentromeric genes are expressed at the RNA level and have been maintained intact for millions of years while their expression patterns diverged from those of paralogous genes elsewhere in the genome. We describe how knowledge of the locations of these sequences can inform disease association and genome biology studies.
Collapse
Affiliation(s)
- Giulio Genovese
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
Collapse
|
10
|
Lee JR, Kim HS. Radiation-induced retroelement-mediated genomic instability. BIOTECHNOL BIOPROC E 2012. [DOI: 10.1007/s12257-012-0008-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
11
|
Paria N, Raudsepp T, Pearks Wilkerson AJ, O'Brien PCM, Ferguson-Smith MA, Love CC, Arnold C, Rakestraw P, Murphy WJ, Chowdhary BP. A gene catalogue of the euchromatic male-specific region of the horse Y chromosome: comparison with human and other mammals. PLoS One 2011; 6:e21374. [PMID: 21799735 PMCID: PMC3143126 DOI: 10.1371/journal.pone.0021374] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 05/27/2011] [Indexed: 11/30/2022] Open
Abstract
Studies of the Y chromosome in primates, rodents and carnivores provide compelling evidence that the male specific region of Y (MSY) contains functional genes, many of which have specialized roles in spermatogenesis and male-fertility. Little similarity, however, has been found between the gene content and sequence of MSY in different species. This hinders the discovery of species-specific male fertility genes and limits our understanding about MSY evolution in mammals. Here, a detailed MSY gene catalogue was developed for the horse – an odd-toed ungulate. Using direct cDNA selection from horse testis, and sequence analysis of Y-specific BAC clones, 37 horse MSY genes/transcripts were identified. The genes were mapped to the MSY BAC contig map, characterized for copy number, analyzed for transcriptional profiles by RT-PCR, examined for the presence of ORFs, and compared to other mammalian orthologs. We demonstrate that the horse MSY harbors 20 X-degenerate genes with known orthologs in other eutherian species. The remaining 17 genes are acquired or novel and have so far been identified only in the horse or donkey Y chromosomes. Notably, 3 transcripts were found in the heterochromatic part of the Y. We show that despite substantial differences between the sequence, gene content and organization of horse and other mammalian Y chromosomes, the functions of MSY genes are predominantly related to testis and spermatogenesis. Altogether, 10 multicopy genes with testis-specific expression were identified in the horse MSY, and considered likely candidate genes for stallion fertility. The findings establish an important foundation for the study of Y-linked genetic factors governing fertility in stallions, and improve our knowledge about the evolutionary processes that have shaped Y chromosomes in different mammalian lineages.
Collapse
Affiliation(s)
- Nandina Paria
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America
| | - Terje Raudsepp
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America
- * E-mail: (BPC); (TR)
| | - Alison J. Pearks Wilkerson
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America
| | | | | | - Charles C. Love
- Department of Large Animal Clinical Sciences, Texas A&M University, College Station, Texas, United States of America
| | - Carolyn Arnold
- Department of Large Animal Clinical Sciences, Texas A&M University, College Station, Texas, United States of America
| | - Peter Rakestraw
- Department of Large Animal Clinical Sciences, Texas A&M University, College Station, Texas, United States of America
| | - William J. Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America
| | - Bhanu P. Chowdhary
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America
- * E-mail: (BPC); (TR)
| |
Collapse
|
12
|
Gabrieli P, Gomulski LM, Bonomi A, Siciliano P, Scolari F, Franz G, Jessup A, Malacrida AR, Gasperi G. Interchromosomal duplications on the Bactrocera oleae Y chromosome imply a distinct evolutionary origin of the sex chromosomes compared to Drosophila. PLoS One 2011; 6:e17747. [PMID: 21408187 PMCID: PMC3049792 DOI: 10.1371/journal.pone.0017747] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2010] [Accepted: 02/11/2011] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Diptera have an extraordinary variety of sex determination mechanisms, and Drosophila melanogaster is the paradigm for this group. However, the Drosophila sex determination pathway is only partially conserved and the family Tephritidae affords an interesting example. The tephritid Y chromosome is postulated to be necessary to determine male development. Characterization of Y sequences, apart from elucidating the nature of the male determining factor, is also important to understand the evolutionary history of sex chromosomes within the Tephritidae. We studied the Y sequences from the olive fly, Bactrocera oleae. Its Y chromosome is minute and highly heterochromatic, and displays high heteromorphism with the X chromosome. METHODOLOGY/PRINCIPAL FINDINGS A combined Representational Difference Analysis (RDA) and fluorescence in-situ hybridization (FISH) approach was used to investigate the Y chromosome to derive information on its sequence content. The Y chromosome is strewn with repetitive DNA sequences, the majority of which are also interdispersed in the pericentromeric regions of the autosomes. The Y chromosome appears to have accumulated small and large repetitive interchromosomal duplications. The large interchromosomal duplications harbour an importin-4-like gene fragment. Apart from these importin-4-like sequences, the other Y repetitive sequences are not shared with the X chromosome, suggesting molecular differentiation of these two chromosomes. Moreover, as the identified Y sequences were not detected on the Y chromosomes of closely related tephritids, we can infer divergence in the repetitive nature of their sequence contents. CONCLUSIONS/SIGNIFICANCE The identification of Y-linked sequences may tell us much about the repetitive nature, the origin and the evolution of Y chromosomes. We hypothesize how these repetitive sequences accumulated and were maintained on the Y chromosome during its evolutionary history. Our data reinforce the idea that the sex chromosomes of the Tephritidae may have distinct evolutionary origins with respect to those of the Drosophilidae and other Dipteran families.
Collapse
Affiliation(s)
- Paolo Gabrieli
- Department of Animal Biology, University of Pavia, Pavia, Italy
| | | | - Angelica Bonomi
- Department of Animal Biology, University of Pavia, Pavia, Italy
| | - Paolo Siciliano
- Department of Animal Biology, University of Pavia, Pavia, Italy
| | | | - Gerald Franz
- Entomology Unit, FAO/IAEA Agriculture and Biotechnology Laboratory, Joint FAO/IAEA Programme, International Atomic Energy Agency, Vienna, Austria
| | - Andrew Jessup
- Entomology Unit, FAO/IAEA Agriculture and Biotechnology Laboratory, Joint FAO/IAEA Programme, International Atomic Energy Agency, Vienna, Austria
| | | | | |
Collapse
|
13
|
Ansseau E, Laoudj-Chenivesse D, Marcowycz A, Tassin A, Vanderplanck C, Sauvage S, Barro M, Mahieu I, Leroy A, Leclercq I, Mainfroid V, Figlewicz D, Mouly V, Butler-Browne G, Belayew A, Coppée F. DUX4c is up-regulated in FSHD. It induces the MYF5 protein and human myoblast proliferation. PLoS One 2009; 4:e7482. [PMID: 19829708 PMCID: PMC2759506 DOI: 10.1371/journal.pone.0007482] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2009] [Accepted: 09/17/2009] [Indexed: 12/21/2022] Open
Abstract
Facioscapulohumeral muscular dystrophy (FSHD) is a dominant disease linked to contractions of the D4Z4 repeat array in 4q35. We have previously identified a double homeobox gene (DUX4) within each D4Z4 unit that encodes a transcription factor expressed in FSHD but not control myoblasts. DUX4 and its target genes contribute to the global dysregulation of gene expression observed in FSHD. We have now characterized the homologous DUX4c gene mapped 42 kb centromeric of the D4Z4 repeat array. It encodes a 47-kDa protein with a double homeodomain identical to DUX4 but divergent in the carboxyl-terminal region. DUX4c was detected in primary myoblast extracts by Western blot with a specific antiserum, and was induced upon differentiation. The protein was increased about 2-fold in FSHD versus control myotubes but reached 2-10-fold induction in FSHD muscle biopsies. We have shown by Western blot and by a DNA-binding assay that DUX4c over-expression induced the MYF5 myogenic regulator and its DNA-binding activity. DUX4c might stabilize the MYF5 protein as we detected their interaction by co-immunoprecipitation. In keeping with the known role of Myf5 in myoblast accumulation during mouse muscle regeneration DUX4c over-expression activated proliferation of human primary myoblasts and inhibited their differentiation. Altogether, these results suggested that DUX4c could be involved in muscle regeneration and that changes in its expression could contribute to the FSHD pathology.
Collapse
Affiliation(s)
- Eugénie Ansseau
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | | | - Aline Marcowycz
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Alexandra Tassin
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Céline Vanderplanck
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Sébastien Sauvage
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Marietta Barro
- INSERM ERI 25 Muscle et Pathologies, CHU A. de Villeneuve, Montpellier, France
| | - Isabelle Mahieu
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Axelle Leroy
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - India Leclercq
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | | | - Denise Figlewicz
- Department of Neurology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Vincent Mouly
- Institute of Myology, Platform for human cell culture, Paris, France
| | | | - Alexandra Belayew
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
| | - Frédérique Coppée
- Laboratory of Molecular Biology, University of Mons-Hainaut, 6, Mons, Belgium
- * E-mail:
| |
Collapse
|
14
|
Isodicentric Y chromosomes and sex disorders as byproducts of homologous recombination that maintains palindromes. Cell 2009; 138:855-69. [PMID: 19737515 DOI: 10.1016/j.cell.2009.07.042] [Citation(s) in RCA: 166] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 07/12/2009] [Accepted: 07/22/2009] [Indexed: 11/22/2022]
Abstract
Massive palindromes in the human Y chromosome harbor mirror-image gene pairs essential for spermatogenesis. During evolution, these gene pairs have been maintained by intrapalindrome, arm-to-arm recombination. The mechanism of intrapalindrome recombination and risk of harmful effects are unknown. We report 51 patients with isodicentric Y (idicY) chromosomes formed by homologous crossing over between opposing arms of palindromes on sister chromatids. These ectopic recombination events occur at nearly all Y-linked palindromes. Based on our findings, we propose that intrapalindrome sequence identity is maintained via noncrossover pathways of homologous recombination. DNA double-strand breaks that initiate these pathways can be alternatively resolved by crossing over between sister chromatids to form idicY chromosomes, with clinical consequences ranging from spermatogenic failure to sex reversal and Turner syndrome. Our observations imply that crossover and noncrossover pathways are active in nearly all Y-linked palindromes, exposing an Achilles' heel in the mechanism that preserves palindrome-borne genes.
Collapse
|
15
|
Schmidt J, Kirsch S, Rappold GA, Schempp W. Complex evolution of a Y-chromosomal double homeobox 4 (DUX4)-related gene family in hominoids. PLoS One 2009; 4:e5288. [PMID: 19404400 PMCID: PMC2671837 DOI: 10.1371/journal.pone.0005288] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Accepted: 03/24/2009] [Indexed: 12/21/2022] Open
Abstract
The human Y chromosome carries four human Y-chromosomal euchromatin/heterochromatin transition regions, all of which are characterized by the presence of interchromosomal segmental duplications. The Yq11.1/Yq11.21 transition region harbours a peculiar segment composed of an imperfectly organized tandem-repeat structure encoding four members of the double homeobox (DUX) gene family. By comparative fluorescence in situ hybridization (FISH) analysis we have documented the primary appearance of Y-chromosomal DUX genes (DUXY) on the gibbon Y chromosome. The major amplification and dispersal of DUXY paralogs occurred after the gibbon and hominid lineages had diverged. Orthologous DUXY loci of human and chimpanzee show a highly similar structural organization. Sequence alignment survey, phylogenetic reconstruction and recombination detection analyses of human and chimpanzee DUXY genes revealed the existence of all copies in a common ancestor. Comparative analysis of the circumjacent beta-satellites indicated that DUXY genes and beta-satellites evolved in concert. However, evolutionary forces acting on DUXY genes may have induced amino acid sequence differences in the orthologous chimpanzee and human DUXY open reading frames (ORFs). The acquisition of complete ORFs in human copies might relate to evolutionary advantageous functions indicating neo-functionalization. We propose an evolutionary scenario in which an ancestral tandem array DUX gene cassette transposed to the hominoid Y chromosome followed by lineage-specific chromosomal rearrangements paved the way for a species-specific evolution of the Y-chromosomal members of a large highly diverged homeobox gene family.
Collapse
Affiliation(s)
- Julia Schmidt
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Gudrun A. Rappold
- Institute of Human Genetics, University of Heidelberg, Heidelberg, Germany
| | - Werner Schempp
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
- * E-mail:
| |
Collapse
|
16
|
Münch C, Kirsch S, Fernandes AMG, Schempp W. Evolutionary analysis of the highly dynamic CHEK2 duplicon in anthropoids. BMC Evol Biol 2008; 8:269. [PMID: 18831734 PMCID: PMC2566985 DOI: 10.1186/1471-2148-8-269] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Accepted: 10/02/2008] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Segmental duplications (SDs) are euchromatic portions of genomic DNA (> or = 1 kb) that occur at more than one site within the genome, and typically share a high level of sequence identity (>90%). Approximately 5% of the human genome is composed of such duplicated sequences. Here we report the detailed investigation of CHEK2 duplications. CHEK2 is a multiorgan cancer susceptibility gene encoding a cell cycle checkpoint kinase acting in the DNA-damage response signalling pathway. The continuous presence of the CHEK2 gene in all eukaryotes and its important role in maintaining genome stability prompted us to investigate the duplicative evolution and phylogeny of CHEK2 and its paralogs during anthropoid evolution. RESULTS To study CHEK2 duplicon evolution in anthropoids we applied a combination of comparative FISH and in silico analyses. Our comparative FISH results with a CHEK2 fosmid probe revealed the single-copy status of CHEK2 in New World monkeys, Old World monkeys and gibbons. Whereas a single CHEK2 duplication was detected in orangutan, a multi-site signal pattern indicated a burst of duplication in African great apes and human. Phylogenetic analysis of paralogous and ancestral CHEK2 sequences in human, chimpanzee and rhesus macaque confirmed this burst of duplication, which occurred after the radiation of orangutan and African great apes. In addition, we used inter-species quantitative PCR to determine CHEK2 copy numbers. An amplification of CHEK2 was detected in African great apes and the highest CHEK2 copy number of all analysed species was observed in the human genome. Furthermore, we detected variation in CHEK2 copy numbers within the analysed set of human samples. CONCLUSION Our detailed analysis revealed the highly dynamic nature of CHEK2 duplication during anthropoid evolution. We determined a burst of CHEK2 duplication after the radiation of orangutan and African great apes and identified the highest CHEK2 copy number in human. In conclusion, our analysis of CHEK2 duplicon evolution revealed that SDs contribute to inter-species variation. Furthermore, our qPCR analysis led us to presume CHEK2 copy number variation in human, and molecular diagnostics of the cancer susceptibility gene CHEK2 inside the duplicated region might be hampered by the individual-specific set of duplicons.
Collapse
Affiliation(s)
- Claudia Münch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - Stefan Kirsch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - António MG Fernandes
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - Werner Schempp
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| |
Collapse
|
17
|
Kirsch S, Münch C, Jiang Z, Cheng Z, Chen L, Batz C, Eichler EE, Schempp W. Evolutionary dynamics of segmental duplications from human Y-chromosomal euchromatin/heterochromatin transition regions. Genome Res 2008; 18:1030-42. [PMID: 18445620 DOI: 10.1101/gr.076711.108] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Human chromosomal regions enriched in segmental duplications are subject to extensive genomic reorganization. Such regions are particularly informative for illuminating the evolutionary history of a given chromosome. We have analyzed 866 kb of Y-chromosomal non-palindromic segmental duplications delineating four euchromatin/heterochromatin transition regions (Yp11.2/Yp11.1, Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2). Several computational methods were applied to decipher the segmental duplication architecture and identify the ancestral origin of the 41 different duplicons. Combining computational and comparative FISH analysis, we reconstruct the evolutionary history of these regions. Our analysis indicates a continuous process of transposition of duplicated sequences onto the evolving higher primate Y chromosome, providing unique insights into the development of species-specific Y-chromosomal and autosomal duplicons. Phylogenetic sequence comparisons show that duplicons of the human Yp11.2/Yp11.1 region were already present in the macaque-human ancestor as multiple paralogs located predominantly in subtelomeric regions. In contrast, duplicons from the Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2 regions show no evidence of duplication in rhesus macaque, but map to the pericentromeric regions in chimpanzee and human. This suggests an evolutionary shift in the direction of duplicative transposition events from subtelomeric in Old World monkeys to pericentromeric in the human/ape lineage. Extensive chromosomal relocation of autosomal-duplicated sequences from euchromatin/heterochromatin transition regions to interstitial regions as demonstrated on the pygmy chimpanzee Y chromosome support a model in which substantial reorganization and amplification of duplicated sequences may contribute to speciation.
Collapse
Affiliation(s)
- Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, 79106 Freiburg, Germany
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Lange J, Skaletsky H, Bell GW, Page DC. MSY Breakpoint Mapper, a database of sequence-tagged sites useful in defining naturally occurring deletions in the human Y chromosome. Nucleic Acids Res 2007; 36:D809-14. [PMID: 17965095 PMCID: PMC2238960 DOI: 10.1093/nar/gkm849] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Y chromosome deletions arise frequently in human populations, where they cause sex reversal and Turner syndrome and predispose individuals to infertility and germ cell cancer. Knowledge of the nucleotide sequence of the male-specific region of the Y chromosome (MSY) makes it possible to precisely demarcate such deletions and the repertoires of genes lost, offering insights into mechanisms of deletion and the molecular etiologies of associated phenotypes. Such deletion mapping is usually conducted using polymerase chain reaction (PCR) assays for the presence or absence of a series of Y-chromosomal DNA markers, or sequence-tagged sites (STSs). In the course of mapping intact and aberrant Y chromosomes during the past two decades, we and our colleagues have developed robust PCR assays for 1287 Y-specific STSs. These PCR assays amplify 1698 loci at an average spacing of <14 kb across the MSY euchromatin. To facilitate mapping of deletions, we have compiled a database of these STSs, MSY Breakpoint Mapper (http://breakpointmapper.wi.mit.edu/). When queried, this online database provides regionally targeted catalogs of STSs and nearby genes. MSY Breakpoint Mapper is useful for efficiently and systematically defining the breakpoint(s) of virtually any naturally occurring Y chromosome deletion.
Collapse
Affiliation(s)
- Julian Lange
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | | | | | | |
Collapse
|
19
|
Qi L, Friebe B, Gill BS. Complex genome rearrangements reveal evolutionary dynamics of pericentromeric regions in the Triticeae. Genome 2007; 49:1628-39. [PMID: 17426778 DOI: 10.1139/g06-123] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Most pericentromeric regions of eukaryotic chromosomes are heterochromatic and are the most rapidly evolving regions of complex genomes. The closely related genomes within hexaploid wheat (Triticum aestivum L., 2n=6x=42, AABBDD), as well as in the related Triticeae taxa, share large conserved chromosome segments and provide a good model for the study of the evolution of pericentromeric regions. Here we report on the comparative analysis of pericentric inversions in the Triticeae, including Triticum aestivum, Aegilops speltoides, Ae. longissima, Ae. searsii, Hordeum vulgare, Secale cereale, and Agropyron elongatum. Previously, 4 pericentric inversions were identified in the hexaploid wheat cultivar 'Chinese Spring' ('CS') involving chromosomes 2B, 4A, 4B, and 5A. In the present study, 2 additional pericentric inversions were detected in chromosomes 3B and 6B of 'CS' wheat. Only the 3B inversion pre-existed in chromosome 3S, 3Sl, and 3Ss of Aegilops species of the Sitopsis section, the remaining inversions occurring after wheat polyploidization. The translocation T2BS/6BS previously reported in 'CS' was detected in the hexaploid variety 'Wichita' but not in other species of the Triticeae. It appears that the B genome is more prone to genome rearrangements than are the A and D genomes. Five different pericentric inversions were detected in rye chromosomes 3R and 4R, 4Sl of Ae. longissima, 4H of barley, and 6E of Ag. elongatum. This indicates that pericentric regions in the Triticeae, especially those of group 4 chromosomes, are undergoing rapid and recurrent rearrangements.
Collapse
Affiliation(s)
- Lili Qi
- Wheat Genetic and Genomic Resources Center, Department of Plant Pathology, Throckmorton Hall, Kansas State University, Manhattan, KS 66506-5502, USA
| | | | | |
Collapse
|
20
|
Bailey JA, Eichler EE. Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet 2006; 7:552-64. [PMID: 16770338 DOI: 10.1038/nrg1895] [Citation(s) in RCA: 441] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Compared with other mammals, the genomes of humans and other primates show an enrichment of large, interspersed segmental duplications (SDs) with high levels of sequence identity. Recent evidence has begun to shed light on the origin of primate SDs, pointing to a complex interplay of mechanisms and indicating that distinct waves of duplication took place during primate evolution. There is also evidence for a strong association between duplication, genomic instability and large-scale chromosomal rearrangements. Exciting new findings suggest that SDs have not only created novel primate gene families, but might have also influenced current human genic and phenotypic variation on a previously unappreciated scale. A growing number of examples link natural human genetic variation of these regions to susceptibility to common disease.
Collapse
Affiliation(s)
- Jeffrey A Bailey
- Department of Pathology, Case Western University School of Medicine and University Hospitals of Cleveland, Ohio 44106, USA
| | | |
Collapse
|
21
|
Graves JAM, Koina E, Sankovic N. How the gene content of human sex chromosomes evolved. Curr Opin Genet Dev 2006; 16:219-24. [PMID: 16650758 DOI: 10.1016/j.gde.2006.04.007] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2006] [Accepted: 04/18/2006] [Indexed: 11/24/2022]
Abstract
The X and Y chromosomes of humans and other mammals both have very atypical gene contents. The degenerate Y bears only a handful of genes that are specialized for male sex and reproduction. Now it seems that the X over-represents genes controlling reproductive traits and intelligence. This is hard to explain in terms of function but makes excellent sense in terms of evolution. Comparisons between the gene content of the X and Y in humans, distantly related mammals, and other vertebrates, define the evolutionary past of our sex chromosomes and suggest how special selective forces act on the X and Y.
Collapse
|
22
|
Ross MT, Bentley DR, Tyler-Smith C. The sequences of the human sex chromosomes. Curr Opin Genet Dev 2006; 16:213-8. [PMID: 16650760 DOI: 10.1016/j.gde.2006.04.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2006] [Accepted: 04/18/2006] [Indexed: 10/24/2022]
Abstract
The sequences of both of the human sex chromosomes and of a substantial part of the chimpanzee Y chromosome have now been determined, and most of the protein-coding genes have been identified. The X chromosome codes for more than 800 proteins but the Y chromosome for only approximately 60, illustrating their very different evolutionary histories since their origin from an autosomal pair approximately 300 million years ago and explaining their differential importance in disease. These sequences have provided the basis for understanding normal patterns of variation, such as the distribution of SNPs, and patterns of linkage disequilibrium. In addition, they have been useful for identifying variants associated with simple Mendelian disorders such as microphthalmia or mental retardation, and more complex disorders such as osteoporosis.
Collapse
Affiliation(s)
- Mark T Ross
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | |
Collapse
|
23
|
Murphy WJ, Pearks Wilkerson AJ, Raudsepp T, Agarwala R, Schäffer AA, Stanyon R, Chowdhary BP. Novel gene acquisition on carnivore Y chromosomes. PLoS Genet 2006; 2:e43. [PMID: 16596168 PMCID: PMC1420679 DOI: 10.1371/journal.pgen.0020043] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2005] [Accepted: 02/08/2006] [Indexed: 11/19/2022] Open
Abstract
Despite its importance in harboring genes critical for spermatogenesis and male-specific functions, the Y chromosome has been largely excluded as a priority in recent mammalian genome sequencing projects. Only the human and chimpanzee Y chromosomes have been well characterized at the sequence level. This is primarily due to the presumed low overall gene content and highly repetitive nature of the Y chromosome and the ensuing difficulties using a shotgun sequence approach for assembly. Here we used direct cDNA selection to isolate and evaluate the extent of novel Y chromosome gene acquisition in the genome of the domestic cat, a species from a different mammalian superorder than human, chimpanzee, and mouse (currently being sequenced). We discovered four novel Y chromosome genes that do not have functional copies in the finished human male-specific region of the Y or on other mammalian Y chromosomes explored thus far. Two genes are derived from putative autosomal progenitors, and the other two have X chromosome homologs from different evolutionary strata. All four genes were shown to be multicopy and expressed predominantly or exclusively in testes, suggesting that their duplication and specialization for testis function were selected for because they enhance spermatogenesis. Two of these genes have testis-expressed, Y-borne copies in the dog genome as well. The absence of the four newly described genes on other characterized mammalian Y chromosomes demonstrates the gene novelty on this chromosome between mammalian orders, suggesting it harbors many lineage-specific genes that may go undetected by traditional comparative genomic approaches. Specific plans to identify the male-specific genes encoded in the Y chromosome of mammals should be a priority. Y chromosomes are typically gene poor and enriched with repetitive elements, making them difficult to sequence by standard methods. Hence, the Y chromosome gene repertoire in mammalian species other than human has not been explored until very recently. Here the authors used a directed approach to isolate Y chromosome genes of the domestic cat, an evolutionary divergent species from human and mouse. They found that the feline Y chromosome harbors its own unique set of genes that are expressed specifically in the testes, presumably where they play an important role in spermatogenesis. Paralleling the discoveries seen from the full human Y chromosome sequence, the feline Y chromosome has acquired and remodeled some genes from autosomes, while other genes have a shared ancestry with the X chromosome. However, none of the four new genes are found on the Y chromosomes of human or mouse, although two are shared with the canine Y chromosome. This work highlights the Y chromosome as a source of potential gene novelty in different species and suggests that more directed efforts at characterizing this hitherto understudied chromosome will further enrich our understanding of the types of genes found there and the roles they may play in mammalian spermatogenesis.
Collapse
Affiliation(s)
- William J Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, United States of America.
| | | | | | | | | | | | | |
Collapse
|
24
|
de Carvalho CMB, Santos FR. Human Y-chromosome variation and male dysfunction. J Mol Genet Med 2005; 1:63-75. [PMID: 19565015 PMCID: PMC2702067 DOI: 10.4172/1747-0862.1000014] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2005] [Revised: 09/30/2005] [Accepted: 10/05/2005] [Indexed: 11/09/2022] Open
Abstract
The Y-chromosome is responsible for sex determination in mammals, which is triggered by the expression of the SRY gene, a testis-determining factor. This particular gene, as well as other genes related to male fertility, are located in the non-recombining portion of the Y (NRY), a specific region that encompasses 95% of the human Y-chromosome. The other 5% is composed of the pseudo-autosomal regions (PARs) at the tips of Yp and Yq, a X-chromosome homologous region used during male meiosis for the correct pairing of sexual chromosomes. Despite of the large size of the human NRY (about 60 Mb), only a few active genes are found in this region, most of which are related to fertility. Recently, several male fertility dysfunctions were associated to microdeletions by STS mapping. Now that the complete genetic map of the human Y-chromosome is available, the role of particular NRY genes in fertility dysfunctions is being investigated. Besides, along with the description of several nucleotide and structural variations in the Y-chromosome, the association between phenotype and genotype is being addressed more precisely. Particularly, several research groups are investigating the association between Y-chromosome types and susceptibility to certain male dysfunctions in different population backgrounds. New insights on the role of the Y-chromosome and maleness are being envisaged by this approach.
Collapse
Affiliation(s)
- Cláudia Márcia Benedetto de Carvalho
- Departamento de Bioquímica e Imunologia, and Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | | |
Collapse
|
25
|
Vogt PH. AZF deletions and Y chromosomal haplogroups: history and update based on sequence. Hum Reprod Update 2005; 11:319-36. [PMID: 15890785 DOI: 10.1093/humupd/dmi017] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
AZF deletions are genomic deletions in the euchromatic part of the long arm of the human Y chromosome (Yq11) associated with azoospermia or severe oligozoospermia. Consequently, it can be assumed that these deletions remove Y chromosomal genes required for spermatogenesis. However, these 'classical' or 'complete' AZF deletions, AZFa, AZFb and AZFc, represent only a subset of rearrangements in Yq11. With the benefit of the Y chromosome sequence, more rearrangements (deletions, duplications, inversions) inside and outside the classical AZF deletion intervals have been elucidated and intra-chromosomal non-allelic homologous recombinations (NAHRs) of repetitive sequence blocks have been identified as their major cause. These include duplications in AZFa, AZFb and AZFc and the partial AZFb and AZFc deletions of which some were summarized under the pseudonym 'gr/gr' deletions. At least some of these rearrangements are associated with distinct Y chromosomal haplogroups and are present with similar frequencies in fertile and infertile men. This suggests a functional redundancy of the AZFb/AZFc multi-copy genes. Alternatively, the functional contribution(s) of these genes to human spermatogenesis might be different in men of different Y haplogroups. That raises the question whether, the frequency of Y haplogroups with different AZF gene contents in distinct human populations leads to a male fertility status that varies between populations or whether, the presence of the multiple Y haplogroups implies a balancing selection via genomic deletion/amplification mechanisms.
Collapse
Affiliation(s)
- Peter H Vogt
- Section of Molecular Genetics & Infertility, Department of Gynecological Endocrinology & Reproductive Medicine, University of Heidelberg, Heidelberg, Germany.
| |
Collapse
|
26
|
Y chromosome reveals hidden sequence. Nature 2005. [DOI: 10.1038/news050117-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|