1
|
Makova KD, Pickett BD, Harris RS, Hartley GA, Cechova M, Pal K, Nurk S, Yoo D, Li Q, Hebbar P, McGrath BC, Antonacci F, Aubel M, Biddanda A, Borchers M, Bomberg E, Bouffard GG, Brooks SY, Carbone L, Carrel L, Carroll A, Chang PC, Chin CS, Cook DE, Craig SJ, de Gennaro L, Diekhans M, Dutra A, Garcia GH, Grady PG, Green RE, Haddad D, Hallast P, Harvey WT, Hickey G, Hillis DA, Hoyt SJ, Jeong H, Kamali K, Kosakovsky Pond SL, LaPolice TM, Lee C, Lewis AP, Loh YHE, Masterson P, McCoy RC, Medvedev P, Miga KH, Munson KM, Pak E, Paten B, Pinto BJ, Potapova T, Rhie A, Rocha JL, Ryabov F, Ryder OA, Sacco S, Shafin K, Shepelev VA, Slon V, Solar SJ, Storer JM, Sudmant PH, Sweetalana, Sweeten A, Tassia MG, Thibaud-Nissen F, Ventura M, Wilson MA, Young AC, Zeng H, Zhang X, Szpiech ZA, Huber CD, Gerton JL, Yi SV, Schatz MC, Alexandrov IA, Koren S, O’Neill RJ, Eichler E, Phillippy AM. The Complete Sequence and Comparative Analysis of Ape Sex Chromosomes. bioRxiv 2023:2023.11.30.569198. [PMID: 38077089 PMCID: PMC10705393 DOI: 10.1101/2023.11.30.569198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
Apes possess two sex chromosomes-the male-specific Y and the X shared by males and females. The Y chromosome is crucial for male reproduction, with deletions linked to infertility. The X chromosome carries genes vital for reproduction and cognition. Variation in mating patterns and brain function among great apes suggests corresponding differences in their sex chromosome structure and evolution. However, due to their highly repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the state-of-the-art experimental and computational methods developed for the telomere-to-telomere (T2T) human genome, we produced gapless, complete assemblies of the X and Y chromosomes for five great apes (chimpanzee, bonobo, gorilla, Bornean and Sumatran orangutans) and a lesser ape, the siamang gibbon. These assemblies completely resolved ampliconic, palindromic, and satellite sequences, including the entire centromeres, allowing us to untangle the intricacies of ape sex chromosome evolution. We found that, compared to the X, ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements. This divergence on the Y arises from the accumulation of lineage-specific ampliconic regions and palindromes (which are shared more broadly among species on the X) and from the abundance of transposable elements and satellites (which have a lower representation on the X). Our analysis of Y chromosome genes revealed lineage-specific expansions of multi-copy gene families and signatures of purifying selection. In summary, the Y exhibits dynamic evolution, while the X is more stable. Finally, mapping short-read sequencing data from >100 great ape individuals revealed the patterns of diversity and selection on their sex chromosomes, demonstrating the utility of these reference assemblies for studies of great ape evolution. These complete sex chromosome assemblies are expected to further inform conservation genetics of nonhuman apes, all of which are endangered species.
Collapse
Affiliation(s)
| | - Brandon D. Pickett
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Monika Cechova
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Karol Pal
- Penn State University, University Park, PA, USA
| | - Sergey Nurk
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - DongAhn Yoo
- University of Washington School of Medicine, Seattle, WA, USA
| | - Qiuhui Li
- Johns Hopkins University, Baltimore, MD, USA
| | - Prajna Hebbar
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | | | | | - Erich Bomberg
- University of Münster, Münster, Germany
- MPI for Developmental Biology, Tübingen, Germany
| | - Gerard G. Bouffard
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shelise Y. Brooks
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Lucia Carbone
- Oregon Health & Science University, Portland, OR, USA
- Oregon National Primate Research Center, Hillsboro, OR, USA
| | - Laura Carrel
- Penn State University School of Medicine, Hershey, PA, USA
| | | | | | - Chen-Shan Chin
- Foundation of Biological Data Sciences, Belmont, CA, USA
| | | | | | | | - Mark Diekhans
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Amalia Dutra
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Gage H. Garcia
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Diana Haddad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Glenn Hickey
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - David A. Hillis
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | - Hyeonsoo Jeong
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | | | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Karen H. Miga
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Evgenia Pak
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Benedict Paten
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | - Arang Rhie
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Fedor Ryabov
- Masters Program in National Research University Higher School of Economics, Moscow, Russia
| | | | - Samuel Sacco
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | - Steven J. Solar
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Sweetalana
- Penn State University, University Park, PA, USA
| | - Alex Sweeten
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Alice C. Young
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Xinru Zhang
- Penn State University, University Park, PA, USA
| | | | | | | | - Soojin V. Yi
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | | | - Sergey Koren
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Evan Eichler
- University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Adam M. Phillippy
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
2
|
Hallast P, Ebert P, Loftus M, Yilmaz F, Audano PA, Logsdon GA, Bonder MJ, Zhou W, Höps W, Kim K, Li C, Hoyt SJ, Dishuck PC, Porubsky D, Tsetsos F, Kwon JY, Zhu Q, Munson KM, Hasenfeld P, Harvey WT, Lewis AP, Kordosky J, Hoekzema K, O'Neill RJ, Korbel JO, Tyler-Smith C, Eichler EE, Shi X, Beck CR, Marschall T, Konkel MK, Lee C. Assembly of 43 human Y chromosomes reveals extensive complexity and variation. Nature 2023; 621:355-364. [PMID: 37612510 PMCID: PMC10726138 DOI: 10.1038/s41586-023-06425-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 07/11/2023] [Indexed: 08/25/2023]
Abstract
The prevalence of highly repetitive sequences within the human Y chromosome has prevented its complete assembly to date1 and led to its systematic omission from genomic analyses. Here we present de novo assemblies of 43 Y chromosomes spanning 182,900 years of human evolution and report considerable diversity in size and structure. Half of the male-specific euchromatic region is subject to large inversions with a greater than twofold higher recurrence rate compared with all other chromosomes2. Ampliconic sequences associated with these inversions show differing mutation rates that are sequence context dependent, and some ampliconic genes exhibit evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, Yq12, is composed of alternating repeat arrays that show extensive variation in the number, size and distribution, but retain a 1:1 copy-number ratio. Finally, our data suggest that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kb away from the currently established1 boundary. The availability of fully sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of traits with specific Y-chromosomal variants and garnering insights into the evolution and function of complex regions of the human genome.
Collapse
Affiliation(s)
- Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter Ebert
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Mark Loftus
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Marc Jan Bonder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Wolfram Höps
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Kwondo Kim
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Savannah J Hoyt
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fotios Tsetsos
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jee Young Kwon
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Patrick Hasenfeld
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Rachel J O'Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Jan O Korbel
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Miriam K Konkel
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|
3
|
Yilmaz F, Gurusamy U, Mosley TJ, Hallast P, Kim K, Mostovoy Y, Purcell RH, Shaikh TH, Zwick ME, Kwok PY, Lee C, Mulle JG. High level of complexity and global diversity of the 3q29 locus revealed by optical mapping and long-read sequencing. Genome Med 2023; 15:35. [PMID: 37165454 PMCID: PMC10170684 DOI: 10.1186/s13073-023-01184-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 04/20/2023] [Indexed: 05/12/2023] Open
Abstract
BACKGROUND High sequence identity between segmental duplications (SDs) can facilitate copy number variants (CNVs) via non-allelic homologous recombination (NAHR). These CNVs are one of the fundamental causes of genomic disorders such as the 3q29 deletion syndrome (del3q29S). There are 21 protein-coding genes lost or gained as a result of such recurrent 1.6-Mbp deletions or duplications, respectively, in the 3q29 locus. While NAHR plays a role in CNV occurrence, the factors that increase the risk of NAHR at this particular locus are not well understood. METHODS We employed an optical genome mapping technique to characterize the 3q29 locus in 161 unaffected individuals, 16 probands with del3q29S and their parents, and 2 probands with the 3q29 duplication syndrome (dup3q29S). Long-read sequencing-based haplotype resolved de novo assemblies from 44 unaffected individuals, and 1 trio was used for orthogonal validation of haplotypes and deletion breakpoints. RESULTS In total, we discovered 34 haplotypes, of which 19 were novel haplotypes. Among these 19 novel haplotypes, 18 were detected in unaffected individuals, while 1 novel haplotype was detected on the parent-of-origin chromosome of a proband with the del3q29S. Phased assemblies from 44 unaffected individuals enabled the orthogonal validation of 20 haplotypes. In 89% (16/18) of the probands, breakpoints were confined to paralogous copies of a 20-kbp segment within the 3q29 SDs. In one del3q29S proband, the breakpoint was confined to a 374-bp region using long-read sequencing. Furthermore, we categorized del3q29S cases into three classes and dup3q29S cases into two classes based on breakpoints. Finally, we found no evidence of inversions in parent-of-origin chromosomes. CONCLUSIONS We have generated the most comprehensive haplotype map for the 3q29 locus using unaffected individuals, probands with del3q29S or dup3q29S, and available parents, and also determined the deletion breakpoint to be within a 374-bp region in one proband with del3q29S. These results should provide a better understanding of the underlying genetic architecture that contributes to the etiology of del3q29S and dup3q29S.
Collapse
Affiliation(s)
- Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Umamaheswaran Gurusamy
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
| | - Trenell J Mosley
- Graduate Program in Genetics and Molecular Biology, Laney Graduate School, Emory University, 201 Dowman Drive, Atlanta, GA, 30322, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Kwondo Kim
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Yulia Mostovoy
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
| | - Ryan H Purcell
- Laboratory of Translational Cell Biology, Department of Cell Biology, Emory University School of Medicine, 100 Woodruff Circle, Atlanta, GA, 30322, USA
| | - Tamim H Shaikh
- Department of Pediatrics, Section of Genetics and Metabolism, University of Colorado School of Medicine, 13123 E 16Th Ave, Aurora, CO, 80045, USA
| | - Michael E Zwick
- Department of Genetics, Rutgers University-New Brunswick, Rutgers University, Piscataway, New Brunswick, NJ, 08901, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
- Department of Dermatology, UCSF School of Medicine, 1701 Divisadero Street, San Francisco, CA, 94115, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA.
| | - Jennifer G Mulle
- Department of Psychiatry, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, Rutgers University, 671 Hoes Lane, New Brunswick, NJ, 08901, USA.
| |
Collapse
|
4
|
Usluer S, Hallast P, Crepaldi L, Zhou Y, Urgo K, Dincer C, Su J, Noell G, Alasoo K, El Garwany O, Gerety SS, Newman B, Dovey OM, Parts L. Optimized whole-genome CRISPR interference screens identify ARID1A-dependent growth regulators in human induced pluripotent stem cells. Stem Cell Reports 2023; 18:1061-1074. [PMID: 37028423 DOI: 10.1016/j.stemcr.2023.03.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 03/07/2023] [Accepted: 03/13/2023] [Indexed: 04/09/2023] Open
Abstract
Perturbing expression is a powerful way to understand the role of individual genes, but can be challenging in important models. CRISPR-Cas screens in human induced pluripotent stem cells (iPSCs) are of limited efficiency due to DNA break-induced stress, while the less stressful silencing with an inactive Cas9 has been considered less effective so far. Here, we developed the dCas9-KRAB-MeCP2 fusion protein for screening in iPSCs from multiple donors. We found silencing in a 200 bp window around the transcription start site in polyclonal pools to be as effective as using wild-type Cas9 for identifying essential genes, but with much reduced cell numbers. Whole-genome screens to identify ARID1A-dependent dosage sensitivity revealed the PSMB2 gene, and enrichment of proteasome genes among the hits. This selective dependency was replicated with a proteasome inhibitor, indicating a targetable drug-gene interaction. Many more plausible targets in challenging cell models can be efficiently identified with our approach.
Collapse
Affiliation(s)
| | | | | | - Yan Zhou
- Wellcome Sanger Institute, Cambridge, UK
| | - Katie Urgo
- Wellcome Sanger Institute, Cambridge, UK
| | | | - Jing Su
- Wellcome Sanger Institute, Cambridge, UK
| | | | - Kaur Alasoo
- Department of Computer Science, University of Tartu, Tartu, Estonia
| | | | | | - Ben Newman
- Wellcome Sanger Institute, Cambridge, UK
| | | | - Leopold Parts
- Wellcome Sanger Institute, Cambridge, UK; Department of Computer Science, University of Tartu, Tartu, Estonia.
| |
Collapse
|
5
|
Porubsky D, Höps W, Ashraf H, Hsieh P, Rodriguez-Martin B, Yilmaz F, Ebler J, Hallast P, Maria Maggiolini FA, Harvey WT, Henning B, Audano PA, Gordon DS, Ebert P, Hasenfeld P, Benito E, Zhu Q, Lee C, Antonacci F, Steinrücken M, Beck CR, Sanders AD, Marschall T, Eichler EE, Korbel JO. Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders. Cell 2022; 185:1986-2005.e26. [PMID: 35525246 PMCID: PMC9563103 DOI: 10.1016/j.cell.2022.04.017] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 02/14/2022] [Accepted: 04/08/2022] [Indexed: 12/13/2022]
Abstract
Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.
Collapse
|
6
|
Abstract
Joint phylogenetic analysis of ancient DNA (aDNA) with modern phylogenies is hampered by low sequence coverage and post-mortem deamination, often resulting in overconservative or incorrect assignment. We provide a new efficient likelihood-based workflow, pathPhynder, that takes advantage of all the polymorphic sites in the target sequence. This effectively evaluates the number of ancestral and derived alleles present on each branch and reports the most likely placement of an ancient sample in the phylogeny and a haplogroup assignment, together with alternatives and supporting evidence. To illustrate the application of pathPhynder, we show improved Y chromosome assignments for published aDNA sequences, using a newly compiled Y variation data set (120,908 markers from 2,014 samples) that significantly enhances Y haplogroup assignment for low coverage samples. We apply the method to all published male aDNA samples from Africa, giving new insights into ancient migrations and the relationships between ancient and modern populations. The same software can be used to place samples with large amounts of missing data into other large non-recombining phylogenies such as the mitochondrial tree.
Collapse
Affiliation(s)
- Rui Martiniano
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- School of Biological and Environmental Sciences, Liverpool John Moores University, Liverpool, United Kingdom
| | - Bianca De Sanctis
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Pille Hallast
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
7
|
Almarri MA, Haber M, Lootah RA, Hallast P, Al Turki S, Martin HC, Xue Y, Tyler-Smith C. The genomic history of the Middle East. Cell 2021; 184:4612-4625.e14. [PMID: 34352227 PMCID: PMC8445022 DOI: 10.1016/j.cell.2021.07.013] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 05/17/2021] [Accepted: 07/09/2021] [Indexed: 11/22/2022]
Abstract
The Middle East region is important to understand human evolution and migrations but is underrepresented in genomic studies. Here, we generated 137 high-coverage physically phased genome sequences from eight Middle Eastern populations using linked-read sequencing. We found no genetic traces of early expansions out-of-Africa in present-day populations but found Arabians have elevated Basal Eurasian ancestry that dilutes their Neanderthal ancestry. Population sizes within the region started diverging 15–20 kya, when Levantines expanded while Arabians maintained smaller populations that derived ancestry from local hunter-gatherers. Arabians suffered a population bottleneck around the aridification of Arabia 6 kya, while Levantines had a distinct bottleneck overlapping the 4.2 kya aridification event. We found an association between movement and admixture of populations in the region and the spread of Semitic languages. Finally, we identify variants that show evidence of selection, including polygenic selection. Our results provide detailed insights into the genomic and selective histories of the Middle East. Middle Easterners do not have ancestry from an early out-of-Africa expansion Basal Eurasian and African ancestry in Arabians deplete their Neanderthal ancestry Populations experienced bottlenecks overlapping aridification events Identification of recent single and polygenic signals of selection in Arabia
Collapse
Affiliation(s)
- Mohamed A Almarri
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK; Department of Forensic Science and Criminology, Dubai Police GHQ, Dubai, United Arab Emirates.
| | - Marc Haber
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham B15 2TT, UK; Centre for Computational Biology, University of Birmingham, Birmingham B15 2TT, UK.
| | - Reem A Lootah
- Department of Forensic Science and Criminology, Dubai Police GHQ, Dubai, United Arab Emirates
| | - Pille Hallast
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK; Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Saeed Al Turki
- Translational Pathology, Department of Pathology and Laboratory Medicine, King Abdulaziz Medical City, Ministry of National Guard-Health Affairs, Riyadh, Saudi Arabia; Department of Genetics & Genomics, College of Medicine and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Hilary C Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Yali Xue
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Chris Tyler-Smith
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| |
Collapse
|
8
|
Hallast P, Kibena L, Punab M, Arciero E, Rootsi S, Grigorova M, Flores R, Jobling MA, Poolamets O, Pomm K, Korrovits P, Rull K, Xue Y, Tyler-Smith C, Laan M. A common 1.6 mb Y-chromosomal inversion predisposes to subsequent deletions and severe spermatogenic failure in humans. eLife 2021; 10:65420. [PMID: 33781384 PMCID: PMC8009663 DOI: 10.7554/elife.65420] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 03/15/2021] [Indexed: 12/19/2022] Open
Abstract
Male infertility is a prevalent condition, affecting 5–10% of men. So far, few genetic factors have been described as contributors to spermatogenic failure. Here, we report the first re-sequencing study of the Y-chromosomal Azoospermia Factor c (AZFc) region, combined with gene dosage analysis of the multicopy DAZ, BPY2, and CDYgenes and Y-haplogroup determination. In analysing 2324 Estonian men, we uncovered a novel structural variant as a high-penetrance risk factor for male infertility. The Y lineage R1a1-M458, reported at >20% frequency in several European populations, carries a fixed ~1.6 Mb r2/r3 inversion, destabilizing the AZFc region and predisposing to large recurrent microdeletions. Such complex rearrangements were significantly enriched among severe oligozoospermia cases. The carrier vs non-carrier risk for spermatogenic failure was increased 8.6-fold (p=6.0×10−4). This finding contributes to improved molecular diagnostics and clinical management of infertility. Carrier identification at young age will facilitate timely counselling and reproductive decision-making.
Collapse
Affiliation(s)
- Pille Hallast
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia.,Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Laura Kibena
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
| | - Margus Punab
- Andrology Unit, Tartu University Hospital, Tartu, Estonia.,Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
| | - Elena Arciero
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Siiri Rootsi
- Institute of Genomics, Estonian Biocentre, University of Tartu, Tartu, Estonia
| | - Marina Grigorova
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
| | - Rodrigo Flores
- Institute of Genomics, Estonian Biocentre, University of Tartu, Tartu, Estonia
| | - Mark A Jobling
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Olev Poolamets
- Andrology Unit, Tartu University Hospital, Tartu, Estonia
| | - Kristjan Pomm
- Andrology Unit, Tartu University Hospital, Tartu, Estonia
| | - Paul Korrovits
- Andrology Unit, Tartu University Hospital, Tartu, Estonia
| | - Kristiina Rull
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia.,Institute of Clinical Medicine, University of Tartu, Tartu, Estonia.,Women's Clinic, Tartu University Hospital, Tartu, Estonia
| | - Yali Xue
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Chris Tyler-Smith
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Maris Laan
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
| |
Collapse
|
9
|
Lall GM, Larmuseau MHD, Wetton JH, Batini C, Hallast P, Huszar TI, Zadik D, Aase S, Baker T, Balaresque P, Bodmer W, Børglum AD, de Knijff P, Dunn H, Harding SE, Løvvik H, Dupuy BM, Pamjav H, Tillmar AO, Tomaszewski M, Tyler-Smith C, Verdugo MP, Winney B, Vohra P, Story J, King TE, Jobling MA. Subdividing Y-chromosome haplogroup R1a1 reveals Norse Viking dispersal lineages in Britain. Eur J Hum Genet 2020; 29:512-523. [PMID: 33139852 PMCID: PMC7940619 DOI: 10.1038/s41431-020-00747-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 09/08/2020] [Accepted: 10/07/2020] [Indexed: 12/16/2022] Open
Abstract
The influence of Viking-Age migrants to the British Isles is obvious in archaeological and place-names evidence, but their demographic impact has been unclear. Autosomal genetic analyses support Norse Viking contributions to parts of Britain, but show no signal corresponding to the Danelaw, the region under Scandinavian administrative control from the ninth to eleventh centuries. Y-chromosome haplogroup R1a1 has been considered as a possible marker for Viking migrations because of its high frequency in peninsular Scandinavia (Norway and Sweden). Here we select ten Y-SNPs to discriminate informatively among hg R1a1 sub-haplogroups in Europe, analyse these in 619 hg R1a1 Y chromosomes including 163 from the British Isles, and also type 23 short-tandem repeats (Y-STRs) to assess internal diversity. We find three specifically Western-European sub-haplogroups, two of which predominate in Norway and Sweden, and are also found in Britain; star-like features in the STR networks of these lineages indicate histories of expansion. We ask whether geographical distributions of hg R1a1 overall, and of the two sub-lineages in particular, correlate with regions of Scandinavian influence within Britain. Neither shows any frequency difference between regions that have higher (≥10%) or lower autosomal contributions from Norway and Sweden, but both are significantly overrepresented in the region corresponding to the Danelaw. These differences between autosomal and Y-chromosomal histories suggest either male-specific contribution, or the influence of patrilocality. Comparison of modern DNA with recently available ancient DNA data supports the interpretation that two sub-lineages of hg R1a1 spread with the Vikings from peninsular Scandinavia.
Collapse
Affiliation(s)
| | - Maarten H D Larmuseau
- Department of Human Genetics, KU Leuven-University of Leuven, Leuven, Belgium.,Laboratory of Socioecology and Social Evolution, KU Leuven-University of Leuven, Leuven, Belgium.,Histories vzw, Zoutwerf 5, 2800, Mechelen, Belgium
| | - Jon H Wetton
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,School of History, Politics and International Relations, University of Leicester, Leicester, UK
| | - Chiara Batini
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,Department of Health Sciences, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | - Pille Hallast
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,Wellcome Sanger Institute, Hinxton, Cambridge, UK.,Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411, Estonia
| | - Tunde I Huszar
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK
| | - Daniel Zadik
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham, UK
| | | | - Tina Baker
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,MRC Human Genetics Unit, MRC IGMM, University of Edinburgh, Western General Hospital, Edinburgh, UK
| | - Patricia Balaresque
- UMR5288, Laboratoire d'Anthropologie Moléculaire et Imagerie de Synthèse, Université Paul Sabatier, Toulouse, France
| | - Walter Bodmer
- Department of Oncology, University of Oxford, Oxford, UK
| | - Anders D Børglum
- Department of Biomedicine & Centre for Integrative Sequencing, Aarhus University, Aarhus, Denmark
| | - Peter de Knijff
- Department of Human Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Hayley Dunn
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,School of Archaeology and Ancient History, University of Leicester, Leicester, UK
| | - Stephen E Harding
- National Centre for Macromolecular Hydrodynamics, University of Nottingham, Sutton Bonington Campus, Loughborough, UK.,Museum of Cultural History, University of Oslo, Oslo, Norway
| | | | - Berit Myhre Dupuy
- Division of Forensic Sciences, Norwegian Institute of Public Health, Oslo, Norway
| | - Horolma Pamjav
- Hungarian Institute for Forensic Sciences, Institute of Forensic Genetics, Budapest, Hungary
| | - Andreas O Tillmar
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden
| | - Maciej Tomaszewski
- Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.,Division of Medicine and Manchester Academic Health Science Centre, Manchester University NHS Foundation Trust Manchester, Manchester, UK
| | | | - Marta Pereira Verdugo
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.,Smurfit Institute of Genetics, Trinity College, Dublin 2, Ireland
| | - Bruce Winney
- Department of Oncology, University of Oxford, Oxford, UK
| | - Pragya Vohra
- School of History, Politics and International Relations, University of Leicester, Leicester, UK.,Department of History, University of York, Heslington, York, UK
| | - Joanna Story
- School of History, Politics and International Relations, University of Leicester, Leicester, UK
| | - Turi E King
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.
| | - Mark A Jobling
- Department of Genetics & Genome Biology, University of Leicester, Leicester, UK.
| |
Collapse
|
10
|
Hallast P, Agdzhoyan A, Balanovsky O, Xue Y, Tyler-Smith C. A Southeast Asian origin for present-day non-African human Y chromosomes. Hum Genet 2020; 140:299-307. [PMID: 32666166 PMCID: PMC7864842 DOI: 10.1007/s00439-020-02204-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 07/02/2020] [Indexed: 12/17/2022]
Abstract
The genomes of present-day humans outside Africa originated almost entirely from a single out-migration ~ 50,000–70,000 years ago, followed by mixture with Neanderthals contributing ~ 2% to all non-Africans. However, the details of this initial migration remain poorly understood because no ancient DNA analyses are available from this key time period, and interpretation of present-day autosomal data is complicated due to subsequent population movements/reshaping. One locus, however, does retain male-specific information from this early period: the Y chromosome, where a detailed calibrated phylogeny has been constructed. Three present-day Y lineages were carried by the initial migration: the rare haplogroup D, the moderately rare C, and the very common FT lineage which now dominates most non-African populations. Here, we show that phylogenetic analyses of haplogroup C, D and FT sequences, including very rare deep-rooting lineages, together with phylogeographic analyses of ancient and present-day non-African Y chromosomes, all point to East/Southeast Asia as the origin 50,000–55,000 years ago of all known surviving non-African male lineages (apart from recent migrants). This observation contrasts with the expectation of a West Eurasian origin predicted by a simple model of expansion from a source near Africa, and can be interpreted as resulting from extensive genetic drift in the initial population or replacement of early western Y lineages from the east, thus informing and constraining models of the initial expansion.
Collapse
Affiliation(s)
- Pille Hallast
- Institute of Biomedicine and Translational Medicine, University of Tartu, 50411, Tartu, Estonia. .,Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Anastasia Agdzhoyan
- Vavilov Institute of General Genetics, Moscow, 119991, Russia.,Research Centre for Medical Genetics, Moscow, 115522, Russia
| | - Oleg Balanovsky
- Vavilov Institute of General Genetics, Moscow, 119991, Russia.,Research Centre for Medical Genetics, Moscow, 115522, Russia.,Biobank of North Eurasia, Moscow, 115201, Russia
| | - Yali Xue
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Chris Tyler-Smith
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| |
Collapse
|
11
|
Bergström A, McCarthy SA, Hui R, Almarri MA, Ayub Q, Danecek P, Chen Y, Felkel S, Hallast P, Kamm J, Blanché H, Deleuze JF, Cann H, Mallick S, Reich D, Sandhu MS, Skoglund P, Scally A, Xue Y, Durbin R, Tyler-Smith C. Insights into human genetic variation and population history from 929 diverse genomes. Science 2020; 367:eaay5012. [PMID: 32193295 PMCID: PMC7115999 DOI: 10.1126/science.aay5012] [Citation(s) in RCA: 341] [Impact Index Per Article: 85.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 02/04/2020] [Indexed: 12/17/2022]
Abstract
Genome sequences from diverse human groups are needed to understand the structure of genetic variation in our species and the history of, and relationships between, different populations. We present 929 high-coverage genome sequences from 54 diverse human populations, 26 of which are physically phased using linked-read sequencing. Analyses of these genomes reveal an excess of previously undocumented common genetic variation private to southern Africa, central Africa, Oceania, and the Americas, but an absence of such variants fixed between major geographical regions. We also find deep and gradual population separations within Africa, contrasting population size histories between hunter-gatherer and agriculturalist groups in the past 10,000 years, and a contrast between single Neanderthal but multiple Denisovan source populations contributing to present-day human populations.
Collapse
Affiliation(s)
- Anders Bergström
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK.
- The Francis Crick Institute, London NW1 1AT, UK
| | - Shane A McCarthy
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Ruoyun Hui
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
- McDonald Institute for Archaeological Research, University of Cambridge, Cambridge CB2 3ER, UK
| | | | - Qasim Ayub
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Monash University Malaysia Genomics Facility, Tropical Medicine and Biology Multidisciplinary Platform, 47500 Bandar Sunway, Malaysia
- School of Science, Monash University Malaysia, 47500 Bandar Sunway, Malaysia
| | | | - Yuan Chen
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
| | - Sabine Felkel
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna 1210, Austria
| | - Pille Hallast
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Jack Kamm
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| | - Hélène Blanché
- Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset, 75010 Paris, France
- GENMED Labex, ANR-10-LABX-0013 Paris, France
| | - Jean-François Deleuze
- Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset, 75010 Paris, France
- GENMED Labex, ANR-10-LABX-0013 Paris, France
| | - Howard Cann
- Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset, 75010 Paris, France
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Manjinder S Sandhu
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Department of Medicine, University of Cambridge, Cambridge CB2 0QQ, UK
| | | | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Yali Xue
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
| | - Richard Durbin
- Wellcome Sanger Institute, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | | |
Collapse
|
12
|
Shi W, Massaia A, Louzada S, Handsaker J, Chow W, McCarthy S, Collins J, Hallast P, Howe K, Church DM, Yang F, Xue Y, Tyler-Smith C. Birth, expansion, and death of VCY-containing palindromes on the human Y chromosome. Genome Biol 2019; 20:207. [PMID: 31610793 PMCID: PMC6790999 DOI: 10.1186/s13059-019-1816-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 09/04/2019] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Large palindromes (inverted repeats) make up substantial proportions of mammalian sex chromosomes, often contain genes, and have high rates of structural variation arising via ectopic recombination. As a result, they underlie many genomic disorders. Maintenance of the palindromic structure by gene conversion between the arms has been documented, but over longer time periods, palindromes are remarkably labile. Mechanisms of origin and loss of palindromes have, however, received little attention. RESULTS Here, we use fiber-FISH, 10x Genomics Linked-Read sequencing, and breakpoint PCR sequencing to characterize the structural variation of the P8 palindrome on the human Y chromosome, which contains two copies of the VCY (Variable Charge Y) gene. We find a deletion of almost an entire arm of the palindrome, leading to death of the palindrome, a size increase by recruitment of adjacent sequence, and other complex changes including the formation of an entire new palindrome nearby. Together, these changes are found in ~ 1% of men, and we can assign likely molecular mechanisms to these mutational events. As a result, healthy men can have 1-4 copies of VCY. CONCLUSIONS Gross changes, especially duplications, in palindrome structure can be relatively frequent and facilitate the evolution of sex chromosomes in humans, and potentially also in other mammalian species.
Collapse
Affiliation(s)
- Wentao Shi
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Andrea Massaia
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
- Present address: National Heart and Lung Institute, Imperial College London, London, SW7 2AZ, UK
| | - Sandra Louzada
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Juliet Handsaker
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - William Chow
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Shane McCarthy
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
- Present address: Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Joanna Collins
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Pille Hallast
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
- Institute of Biomedicine and Translational Medicine, University of Tartu, 51011, Tartu, Estonia
| | - Kerstin Howe
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Deanna M Church
- 10x Genomics, 7068 Koll Center Parkway, Suite 401, Pleasanton, CA, 94566, USA
- Present address: Inscripta Inc., 5500 Central Avenue #220, Boulder, CO, 80301, USA
| | - Fengtang Yang
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Yali Xue
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK.
| | - Chris Tyler-Smith
- The Wellcome Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK.
| |
Collapse
|
13
|
Eales JM, Maan AA, Xu X, Michoel T, Hallast P, Batini C, Zadik D, Prestes PR, Molina E, Denniff M, Schroeder J, Bjorkegren JLM, Thompson J, Maffia P, Guzik TJ, Keavney B, Jobling MA, Samani NJ, Charchar FJ, Tomaszewski M. Human Y Chromosome Exerts Pleiotropic Effects on Susceptibility to Atherosclerosis. Arterioscler Thromb Vasc Biol 2019; 39:2386-2401. [PMID: 31644355 PMCID: PMC6818981 DOI: 10.1161/atvbaha.119.312405] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Supplemental Digital Content is available in the text. The male-specific region of the Y chromosome (MSY) remains one of the most unexplored regions of the genome. We sought to examine how the genetic variants of the MSY influence male susceptibility to coronary artery disease (CAD) and atherosclerosis.
Collapse
Affiliation(s)
- James M Eales
- From the Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, United Kingdom (J.M.E., A.A.M., X.X., B.K., M.T.)
| | - Akhlaq A Maan
- From the Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, United Kingdom (J.M.E., A.A.M., X.X., B.K., M.T.)
| | - Xiaoguang Xu
- From the Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, United Kingdom (J.M.E., A.A.M., X.X., B.K., M.T.)
| | - Tom Michoel
- The Roslin Institute, The University of Edinburgh, United Kingdom (T.M.).,Computational Biology Unit and Department of Informatics, University of Bergen, Norway (T.M.)
| | - Pille Hallast
- Institute of Biomedicine and Translational Medicine, University of Tartu, Estonia (P.H.)
| | - Chiara Batini
- Department of Health Sciences (C.B., J.T.), University of Leicester, United Kingdom
| | - Daniel Zadik
- Department of Genetics and Genome Biology (D.Z., M.A.J.), University of Leicester, United Kingdom
| | - Priscilla R Prestes
- School of Health and Life Sciences, Federation University Australia, Ballarat, Victoria (P.R.P., E.M., F.J.C.)
| | - Elsa Molina
- School of Health and Life Sciences, Federation University Australia, Ballarat, Victoria (P.R.P., E.M., F.J.C.)
| | - Matthew Denniff
- Department of Cardiovascular Sciences (M.D., N.J.S., F.J.C.), University of Leicester, United Kingdom
| | - Juliane Schroeder
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation (J.S., P.M.), College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom.,Institute of Cardiovascular and Medical Sciences (J.S., P.M., T.J.G.), College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - Johan L M Bjorkegren
- Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY (J.L.M.B.)
| | - John Thompson
- Department of Health Sciences (C.B., J.T.), University of Leicester, United Kingdom
| | - Pasquale Maffia
- Centre for Immunobiology, Institute of Infection, Immunity and Inflammation (J.S., P.M.), College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom.,Institute of Cardiovascular and Medical Sciences (J.S., P.M., T.J.G.), College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom.,Department of Pharmacy, University of Naples Federico II, Italy (P.M.)
| | - Tomasz J Guzik
- Institute of Cardiovascular and Medical Sciences (J.S., P.M., T.J.G.), College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom.,Jagiellonian University College of Medicine, Kraków, Poland (T.J.G.)
| | - Bernard Keavney
- From the Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, United Kingdom (J.M.E., A.A.M., X.X., B.K., M.T.).,Division of Medicine, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, United Kingdom (B.K., M.T.)
| | - Mark A Jobling
- Department of Genetics and Genome Biology (D.Z., M.A.J.), University of Leicester, United Kingdom
| | - Nilesh J Samani
- Department of Cardiovascular Sciences (M.D., N.J.S., F.J.C.), University of Leicester, United Kingdom.,NIHR Leicester Biomedical Research Centre, United Kingdom (N.J.S.)
| | - Fadi J Charchar
- Department of Cardiovascular Sciences (M.D., N.J.S., F.J.C.), University of Leicester, United Kingdom.,School of Health and Life Sciences, Federation University Australia, Ballarat, Victoria (P.R.P., E.M., F.J.C.).,Department of Physiology, University of Melbourne, Parkville, Victoria, Australia (F.J.C.)
| | - Maciej Tomaszewski
- From the Division of Cardiovascular Sciences, Faculty of Biology, Medicine and Health, University of Manchester, United Kingdom (J.M.E., A.A.M., X.X., B.K., M.T.).,Division of Medicine, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, United Kingdom (B.K., M.T.)
| |
Collapse
|
14
|
Shi W, Louzada S, Grigorova M, Massaia A, Arciero E, Kibena L, Ge XJ, Chen Y, Ayub Q, Poolamets O, Tyler-Smith C, Punab M, Laan M, Yang F, Hallast P, Xue Y. Evolutionary and functional analysis of RBMY1 gene copy number variation on the human Y chromosome. Hum Mol Genet 2019; 28:2785-2798. [PMID: 31108506 PMCID: PMC6687947 DOI: 10.1093/hmg/ddz101] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Revised: 05/10/2019] [Accepted: 05/11/2019] [Indexed: 01/17/2023] Open
Abstract
Human RBMY1 genes are located in four variable-sized clusters on the Y chromosome, expressed in male germ cells and possibly associated with sperm motility. We have re-investigated the mutational background and evolutionary history of the RBMY1 copy number distribution in worldwide samples and its relevance to sperm parameters in an Estonian cohort of idiopathic male factor infertility subjects. We estimated approximate RBMY1 copy numbers in 1218 1000 Genomes Project phase 3 males from sequencing read-depth, then chose 14 for valid ation by multicolour fibre-FISH. These fibre-FISH samples provided accurate calibration standards for the entire panel and led to detailed insights into population variation and mutational mechanisms. RBMY1 copy number worldwide ranged from 3 to 13 with a mode of 8. The two larger proximal clusters were the most variable, and additional duplications, deletions and inversions were detected. Placing the copy number estimates onto the published Y-SNP-based phylogeny of the same samples suggested a minimum of 562 mutational changes, translating to a mutation rate of 2.20 × 10-3 (95% CI 1.94 × 10-3 to 2.48 × 10-3) per father-to-son Y-transmission, higher than many short tandem repeat (Y-STRs), and showed no evidence for selection for increased or decreased copy number, but possible copy number stabilizing selection. An analysis of RBMY1 copy numbers among 376 infertility subjects failed to replicate a previously reported association with sperm motility and showed no significant effect on sperm count and concentration, serum follicle stimulating hormone (FSH), luteinizing hormone (LH) and testosterone levels or testicular and semen volume. These results provide the first in-depth insights into the structural rearrangements underlying RBMY1 copy number variation across diverse human lineages.
Collapse
Affiliation(s)
- Wentao Shi
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Sandra Louzada
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Marina Grigorova
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Andrea Massaia
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- National Heart and Lung Institute, Imperial College London, London SW7 2AZ, UK
| | - Elena Arciero
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Laura Kibena
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Xiangyu Jack Ge
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Faculty of Biology, Medicine and Health, School of Biological Science, Division of Musculoskeletal and Dermatological Science, University of Manchester, Manchester M13 9PL, UK
| | - Yuan Chen
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Qasim Ayub
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Monash University Malaysia Genomics Facility, Tropical Medicine and Biology Multidisciplinary Platform, Bandar Sunway, Selangor Darul Ehsan 47500, Malaysia
- School of Science, Monash University Malaysia, Bandar Sunway, Selangor Darul Ehsan 47500, Malaysia
| | - Olev Poolamets
- Andrology Unit, Tartu University Hospital, Tartu 50406, Estonia
| | - Chris Tyler-Smith
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Margus Punab
- Andrology Unit, Tartu University Hospital, Tartu 50406, Estonia
| | - Maris Laan
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Fengtang Yang
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Pille Hallast
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu 50411, Estonia
| | - Yali Xue
- Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| |
Collapse
|
15
|
Shi W, Massaia A, Louzada S, Banerjee R, Hallast P, Chen Y, Bergström A, Gu Y, Leonard S, Quail MA, Ayub Q, Yang F, Tyler-Smith C, Xue Y. Copy number variation arising from gene conversion on the human Y chromosome. Hum Genet 2017; 137:73-83. [PMID: 29209947 DOI: 10.1007/s00439-017-1857-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 11/28/2017] [Indexed: 11/27/2022]
Abstract
We describe the variation in copy number of a ~ 10 kb region overlapping the long intergenic noncoding RNA (lincRNA) gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with 0-3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies, 943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint PCR, fibre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000 Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established by the Project identified at least 20 mutational events, and investigation of flanking paralogous sequence variants showed that the mutations involved flanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene conversion or double crossover between misaligned sister chromatids could formally explain the 0-2 copy events, gene conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far. Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of 420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large size of the conversion events that can occur.
Collapse
Affiliation(s)
- Wentao Shi
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 30070, China
| | - Andrea Massaia
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- National Heart and Lung Institute, Imperial College London, London, SW7 2AZ, UK
| | - Sandra Louzada
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Ruby Banerjee
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Pille Hallast
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, 51010, Estonia
| | - Yuan Chen
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Anders Bergström
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Yong Gu
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Steven Leonard
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Michael A Quail
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Qasim Ayub
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- School of Science, Monash University Malaysia, Jalan Lagoon Selantan, Bandar Sunway, 47500, Subang Jaya, Selangor Darul Ehsan, Malaysia
| | - Fengtang Yang
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Yali Xue
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| |
Collapse
|
16
|
Maisano Delser P, Neumann R, Ballereau S, Hallast P, Batini C, Zadik D, Jobling MA. Signatures of human European Palaeolithic expansion shown by resequencing of non-recombining X-chromosome segments. Eur J Hum Genet 2017; 25:485-492. [PMID: 28120839 PMCID: PMC5386427 DOI: 10.1038/ejhg.2016.207] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 11/07/2016] [Accepted: 12/14/2016] [Indexed: 11/09/2022] Open
Abstract
Human genetic diversity in Europe has been extensively studied using uniparentally inherited sequences (mitochondrial DNA (mtDNA) and the Y chromosome), which reveal very different patterns indicating sex-specific demographic histories. The X chromosome, haploid in males and inherited twice as often from mothers as from fathers, could provide insights into past female behaviours, but has not been extensively investigated. Here, we use HapMap single-nucleotide polymorphism data to identify genome-wide segments of the X chromosome in which recombination is historically absent and mutations are likely to be the only source of genetic variation, referring to these as phylogeographically informative haplotypes on autosomes and X chromosome (PHAXs). Three such sequences on the X chromosome spanning a total of ~49 kb were resequenced in 240 males from Europe, the Middle East and Africa at an average coverage of 181 ×. These PHAXs were confirmed to be essentially non-recombining across European samples. All three loci show highly homogeneous patterns across Europe and are highly differentiated from the African sample. Star-like structures of European-specific haplotypes in median-joining networks indicate past population expansions. Bayesian skyline plots and time-to-most-recent-common-ancestor estimates suggest expansions pre-dating the Neolithic transition, a finding that is more compatible with data on mtDNA than the Y chromosome, and with the female bias of X-chromosomal inheritance. This study demonstrates the potential of the use of X-chromosomal haplotype blocks, and the utility of the accurate ascertainment of rare variants for inferring human demographic history.
Collapse
Affiliation(s)
| | - Rita Neumann
- Department of Genetics, University of Leicester, Leicester, UK
| | | | - Pille Hallast
- Department of Genetics, University of Leicester, Leicester, UK
| | - Chiara Batini
- Department of Genetics, University of Leicester, Leicester, UK
| | - Daniel Zadik
- Department of Genetics, University of Leicester, Leicester, UK
| | - Mark A Jobling
- Department of Genetics, University of Leicester, Leicester, UK
| |
Collapse
|
17
|
Haber M, Mezzavilla M, Bergström A, Prado-Martinez J, Hallast P, Saif-Ali R, Al-Habori M, Dedoussis G, Zeggini E, Blue-Smith J, Wells R, Xue Y, Zalloua P, Tyler-Smith C. Chad Genetic Diversity Reveals an African History Marked by Multiple Holocene Eurasian Migrations. Am J Hum Genet 2016; 99:1316-1324. [PMID: 27889059 PMCID: PMC5142112 DOI: 10.1016/j.ajhg.2016.10.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 10/24/2016] [Indexed: 12/02/2022] Open
Abstract
Understanding human genetic diversity in Africa is important for interpreting the evolution of all humans, yet vast regions in Africa, such as Chad, remain genetically poorly investigated. Here, we use genotype data from 480 samples from Chad, the Near East, and southern Europe, as well as whole-genome sequencing from 19 of them, to show that many populations today derive their genomes from ancient African-Eurasian admixtures. We found evidence of early Eurasian backflow to Africa in people speaking the unclassified isolate Laal language in southern Chad and estimate from linkage-disequilibrium decay that this occurred 4,750–7,200 years ago. It brought to Africa a Y chromosome lineage (R1b-V88) whose closest relatives are widespread in present-day Eurasia; we estimate from sequence data that the Chad R1b-V88 Y chromosomes coalesced 5,700–7,300 years ago. This migration could thus have originated among Near Eastern farmers during the African Humid Period. We also found that the previously documented Eurasian backflow into Africa, which occurred ∼3,000 years ago and was thought to be mostly limited to East Africa, had a more westward impact affecting populations in northern Chad, such as the Toubou, who have 20%–30% Eurasian ancestry today. We observed a decline in heterozygosity in admixed Africans and found that the Eurasian admixture can bias inferences on their coalescent history and confound genetic signals from adaptation and archaic introgression.
Collapse
|
18
|
de Manuel M, Kuhlwilm M, Frandsen P, Sousa VC, Desai T, Prado-Martinez J, Hernandez-Rodriguez J, Dupanloup I, Lao O, Hallast P, Schmidt JM, Heredia-Genestar JM, Benazzo A, Barbujani G, Peter BM, Kuderna LFK, Casals F, Angedakin S, Arandjelovic M, Boesch C, Kühl H, Vigilant L, Langergraber K, Novembre J, Gut M, Gut I, Navarro A, Carlsen F, Andrés AM, Siegismund HR, Scally A, Excoffier L, Tyler-Smith C, Castellano S, Xue Y, Hvilsom C, Marques-Bonet T. Chimpanzee genomic diversity reveals ancient admixture with bonobos. Science 2016; 354:477-481. [PMID: 27789843 DOI: 10.1126/science.aag2602] [Citation(s) in RCA: 153] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Accepted: 09/09/2016] [Indexed: 12/13/2022]
Abstract
Our closest living relatives, chimpanzees and bonobos, have a complex demographic history. We analyzed the high-coverage whole genomes of 75 wild-born chimpanzees and bonobos from 10 countries in Africa. We found that chimpanzee population substructure makes genetic information a good predictor of geographic origin at country and regional scales. Multiple lines of evidence suggest that gene flow occurred from bonobos into the ancestors of central and eastern chimpanzees between 200,000 and 550,000 years ago, probably with subsequent spread into Nigeria-Cameroon chimpanzees. Together with another, possibly more recent contact (after 200,000 years ago), bonobos contributed less than 1% to the central chimpanzee genomes. Admixture thus appears to have been widespread during hominid evolution.
Collapse
Affiliation(s)
- Marc de Manuel
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Martin Kuhlwilm
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Peter Frandsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark. Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark
| | - Vitor C Sousa
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Tariq Desai
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Javier Prado-Martinez
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Jessica Hernandez-Rodriguez
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Isabelle Dupanloup
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Oscar Lao
- National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Pille Hallast
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK. Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia
| | - Joshua M Schmidt
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
| | - José María Heredia-Genestar
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Andrea Benazzo
- Department of Life Sciences and Biotechnology, University of Ferrara, 44121 Ferrara, Italy
| | - Guido Barbujani
- Department of Life Sciences and Biotechnology, University of Ferrara, 44121 Ferrara, Italy
| | - Benjamin M Peter
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Lukas F K Kuderna
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Ferran Casals
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
| | - Samuel Angedakin
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Mimi Arandjelovic
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Christophe Boesch
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Hjalmar Kühl
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Linda Vigilant
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Kevin Langergraber
- School of Human Evolution and Social Change and Institute of Human Origins, Arizona State University, Tempe, AZ 85287, USA
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Marta Gut
- National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Ivo Gut
- National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Arcadi Navarro
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia 08010, Spain
| | - Frands Carlsen
- Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark
| | - Aida M Andrés
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
| | - Hans R Siegismund
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Laurent Excoffier
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Sergi Castellano
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
| | - Yali Xue
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Christina Hvilsom
- Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark.
| | - Tomas Marques-Bonet
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia 08010, Spain.
| |
Collapse
|
19
|
Hallast P, Maisano Delser P, Batini C, Zadik D, Rocchi M, Schempp W, Tyler-Smith C, Jobling MA. Great ape Y Chromosome and mitochondrial DNA phylogenies reflect subspecies structure and patterns of mating and dispersal. Genome Res 2016; 26:427-39. [PMID: 26883546 PMCID: PMC4817767 DOI: 10.1101/gr.198754.115] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 01/25/2016] [Indexed: 12/30/2022]
Abstract
The distribution of genetic diversity in great ape species is likely to have been affected by patterns of dispersal and mating. This has previously been investigated by sequencing autosomal and mitochondrial DNA (mtDNA), but large-scale sequence analysis of the male-specific region of the Y Chromosome (MSY) has not yet been undertaken. Here, we use the human MSY reference sequence as a basis for sequence capture and read mapping in 19 great ape males, combining the data with sequences extracted from the published whole genomes of 24 additional males to yield a total sample of 19 chimpanzees, four bonobos, 14 gorillas, and six orangutans, in which interpretable MSY sequence ranges from 2.61 to 3.80 Mb. This analysis reveals thousands of novel MSY variants and defines unbiased phylogenies. We compare these with mtDNA-based trees in the same individuals, estimating time-to-most-recent common ancestor (TMRCA) for key nodes in both cases. The two loci show high topological concordance and are consistent with accepted (sub)species definitions, but time depths differ enormously between loci and (sub)species, likely reflecting different dispersal and mating patterns. Gorillas and chimpanzees/bonobos present generally low and high MSY diversity, respectively, reflecting polygyny versus multimale–multifemale mating. However, particularly marked differences exist among chimpanzee subspecies: The western chimpanzee MSY phylogeny has a TMRCA of only 13.2 (10.8–15.8) thousand years, but that for central chimpanzees exceeds 1 million years. Cross-species comparison within a single MSY phylogeny emphasizes the low human diversity, and reveals species-specific branch length variation that may reflect differences in long-term generation times.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom; Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia
| | | | - Chiara Batini
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| | - Daniel Zadik
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| | - Mariano Rocchi
- Department of Biology, University of Bari, 70124 Bari, Italy
| | - Werner Schempp
- Institute of Human Genetics, University of Freiburg, 79106 Freiburg, Germany
| | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Mark A Jobling
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| |
Collapse
|
20
|
Hallast P, Batini C, Zadik D, Maisano Delser P, Wetton JH, Arroyo-Pardo E, Cavalleri GL, de Knijff P, Destro Bisol G, Dupuy BM, Eriksen HA, Jorde LB, King TE, Larmuseau MH, López de Munain A, López-Parra AM, Loutradis A, Milasin J, Novelletto A, Pamjav H, Sajantila A, Schempp W, Sears M, Tolun A, Tyler-Smith C, Van Geystelen A, Watkins S, Winney B, Jobling MA. The Y-chromosome tree bursts into leaf: 13,000 high-confidence SNPs covering the majority of known clades. Mol Biol Evol 2014; 32:661-73. [PMID: 25468874 PMCID: PMC4327154 DOI: 10.1093/molbev/msu327] [Citation(s) in RCA: 111] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Many studies of human populations have used the male-specific region of the Y chromosome (MSY) as a marker, but MSY sequence variants have traditionally been subject to ascertainment bias. Also, dating of haplogroups has relied on Y-specific short tandem repeats (STRs), involving problems of mutation rate choice, and possible long-term mutation saturation. Next-generation sequencing can ascertain single nucleotide polymorphisms (SNPs) in an unbiased way, leading to phylogenies in which branch-lengths are proportional to time, and allowing the times-to-most-recent-common-ancestor (TMRCAs) of nodes to be estimated directly. Here we describe the sequencing of 3.7 Mb of MSY in each of 448 human males at a mean coverage of 51×, yielding 13,261 high-confidence SNPs, 65.9% of which are previously unreported. The resulting phylogeny covers the majority of the known clades, provides date estimates of nodes, and constitutes a robust evolutionary framework for analyzing the history of other classes of mutation. Different clades within the tree show subtle but significant differences in branch lengths to the root. We also apply a set of 23 Y-STRs to the same samples, allowing SNP- and STR-based diversity and TMRCA estimates to be systematically compared. Ongoing purifying selection is suggested by our analysis of the phylogenetic distribution of nonsynonymous variants in 15 MSY single-copy genes.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Chiara Batini
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Daniel Zadik
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | | | - Jon H Wetton
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Eduardo Arroyo-Pardo
- Laboratory of Forensic and Population Genetics, Department of Toxicology and Health Legislation, Faculty of Medicine, Complutense University, Madrid, Spain
| | - Gianpiero L Cavalleri
- Molecular and Cellular Therapeutics, The Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Peter de Knijff
- Department of Human Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Giovanni Destro Bisol
- Istituto Italiano di Antropologia, Rome, Italy Department of Environmental Biology, Sapienza University of Rome, Rome, Italy
| | - Berit Myhre Dupuy
- Division of Forensic Sciences, Norwegian Institute of Public Health, Oslo, Norway
| | - Heidi A Eriksen
- Centre of Arctic Medicine, Thule Institute, University of Oulu, Oulu, Finland Utsjoki Health Care Centre, Utsjoki, Finland
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT
| | - Turi E King
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Maarten H Larmuseau
- Laboratory of Forensic Genetics and Molecular Archaeology, KU Leuven, Leuven, Belgium Department of Imaging & Pathology, Biomedical Forensic Sciences, KU Leuven, Leuven, Belgium Laboratory of Biodiversity and Evolutionary Genomics, Department of Biology, KU Leuven, Leuven, Belgium
| | | | - Ana M López-Parra
- Laboratory of Forensic and Population Genetics, Department of Toxicology and Health Legislation, Faculty of Medicine, Complutense University, Madrid, Spain
| | | | - Jelena Milasin
- School of Dental Medicine, Institute of Human Genetics, University of Belgrade, Belgrade, Serbia
| | | | - Horolma Pamjav
- Network of Forensic Science Institutes, Institute of Forensic Medicine, Budapest, Hungary
| | - Antti Sajantila
- Department of Forensic Medicine, Hjelt Institute, University of Helsinki, Helsinki, Finland Department of Molecular and Medical Genetics, Institute of Applied Genetics, University of North Texas Health Science Center, Fort Worth, Texas
| | - Werner Schempp
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Matt Sears
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Aslıhan Tolun
- Department of Molecular Biology and Genetics, Boğaziçi University, Istanbul, Turkey
| | | | - Anneleen Van Geystelen
- Laboratory of Socioecology and Social Evolution, Department of Biology, KU Leuven, Leuven, Belgium
| | - Scott Watkins
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT
| | - Bruce Winney
- Department of Oncology, University of Oxford, Oxford, United Kingdom
| | - Mark A Jobling
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
21
|
Hallast P, Balaresque P, Bowden GR, Ballereau S, Jobling MA. Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions. PLoS Genet 2013; 9:e1003666. [PMID: 23935520 PMCID: PMC3723533 DOI: 10.1371/journal.pgen.1003666] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 06/07/2013] [Indexed: 11/19/2022] Open
Abstract
The male-specific region of the human Y chromosome (MSY) includes eight large inverted repeats (palindromes) in which arm-to-arm similarity exceeds 99.9%, due to gene conversion activity. Here, we studied one of these palindromes, P6, in order to illuminate the dynamics of the gene conversion process. We genotyped ten paralogous sequence variants (PSVs) within the arms of P6 in 378 Y chromosomes whose evolutionary relationships within the SNP-defined Y phylogeny are known. This allowed the identification of 146 historical gene conversion events involving individual PSVs, occurring at a rate of 2.9–8.4×10−4 events per generation. A consideration of the nature of nucleotide change and the ancestral state of each PSV showed that the conversion process was significantly biased towards the fixation of G or C nucleotides (GC-biased), and also towards the ancestral state. Determination of haplotypes by long-PCR allowed likely co-conversion of PSVs to be identified, and suggested that conversion tract lengths are large, with a mean of 2068 bp, and a maximum in excess of 9 kb. Despite the frequent formation of recombination intermediates implied by the rapid observed gene conversion activity, resolution via crossover is rare: only three inversions within P6 were detected in the sample. An analysis of chimpanzee and gorilla P6 orthologs showed that the ancestral state bias has existed in all three species, and comparison of human and chimpanzee sequences with the gorilla outgroup confirmed that GC bias of the conversion process has apparently been active in both the human and chimpanzee lineages. The sex-determining role of the human Y chromosome makes it male-specific, and always present in only a single copy. This solo lifestyle has endowed it with some bizarre features, among which are eight large DNA units constituting about a quarter of the chromosome's length, and containing many genes important for sperm production. These units are called palindromes, since, taking into account the polarity of the DNA strands, the sequence is the same read from either end of the unit. We investigated the details of a process (gene conversion) that transfers sequence variants in one half of a palindrome into the other, thereby maintaining >99.9% similarity between the halves. We analysed patterns of sequence variants within one palindrome in a set of Y chromosomes whose evolutionary relationships are known. This allowed us to identify past gene conversion events, and to demonstrate a bias towards events that eliminate new variants, and retain old ones. Gene conversion has therefore acted during human evolution to retard sequence change in these regions. Analysis of the chimpanzee and gorilla versions of the palindrome shows that the dynamic processes we see in human Y chromosomes have a deep evolutionary history.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | | | - Georgina R. Bowden
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Stéphane Ballereau
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Mark A. Jobling
- Department of Genetics, University of Leicester, Leicester, United Kingdom
- * E-mail:
| |
Collapse
|
22
|
Putku M, Kepp K, Org E, Sõber S, Comas D, Viigimaa M, Veldre G, Juhanson P, Hallast P, Tõnisson N, Shaw-Hawkins S, Caulfield MJ, Khusnutdinova E, Kožich V, Munroe PB, Laan M. Novel polymorphic AluYb8 insertion in the WNK1 gene is associated with blood pressure variation in Europeans. Hum Mutat 2011; 32:806-14. [PMID: 21520334 PMCID: PMC3298642 DOI: 10.1002/humu.21508] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2010] [Accepted: 03/28/2011] [Indexed: 01/16/2023]
Abstract
Mutations in WNK1 and WNK4 cause familial hypertension, the Gordon syndrome. WNK1 and WNK4 conserved noncoding regions were targeted to polymorphism screening using DHPLC and DGGE. The scan identified an undescribed polymorphic AluYb8 insertion in WNK1 intron 10. Screening in primates revealed that this Alu-insertion has probably occurred in human lineage. Genotyping in 18 populations from Europe, Asia, and Africa (n = 854) indicated an expansion of the WNK1 AluYb8 bearing chromosomes out of Africa. The allele frequency in Sub-Saharan Africa was ∼3.3 times lower than in other populations (4.8 vs. 15.8%; P = 9.7 × 10−9). Meta-analysis across three European sample sets (n = 3,494; HYPEST, Estonians; BRIGHT, the British; CADCZ, Czech) detected significant association of the WNK1 AluYb8 insertion with blood pressure (BP; systolic BP, P = 4.03 × 10−3, effect 1.12; diastolic BP, P = 1.21 × 10−2, effect 0.67). Gender-stratified analysis revealed that this effect might be female-specific (n = 2,088; SBP, P = 1.99 × 10−3, effect 1.59; DBP P = 3.64 × 10−4, effect 1.23; resistant to Bonferroni correction), whereas no statistical support was identified for the association with male BP (n = 1,406). In leucocytes, the expressional proportions of the full-length WNK1 transcript and the splice-form skipping exon 11 were significantly shifted in AluYb8 carriers compared to noncarriers. The WNK1 AluYb8 insertion might affect human BP via altering the profile of alternatively spliced transcripts. Hum Mutat 32:1–9, 2011. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Margus Putku
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Katrin Kepp
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Elin Org
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Siim Sõber
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - David Comas
- Institute of Evolutionary Biology (UPF-CSIC), CEXS-UPF-PRBB, Universitat Pompeu FabraBarcelona, Spain
| | - Margus Viigimaa
- Centre of Cardiology, North Estonia Medical CentreTallinn, Estonia
- Tallinn University of Technology, Department of Biomedical Engineering, Chair of Medical PhysicsTallinn, Estonia
| | - Gudrun Veldre
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
- Department of Cardiology, University of TartuTartu, Estonia
| | - Peeter Juhanson
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Pille Hallast
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Neeme Tõnisson
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
| | - Sue Shaw-Hawkins
- Clinical Pharmacology and The Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of LondonLondon EC1M 6BQ, United Kingdom
| | - Mark J Caulfield
- Clinical Pharmacology and The Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of LondonLondon EC1M 6BQ, United Kingdom
| | - Elza Khusnutdinova
- Institute of Biochemistry and Genetics, Ufa Science Center, Russian Academy of SciencesUfa, Bashkortostan, Russia
| | - Viktor Kožich
- Institute of Inherited Metabolic Diseases, Charles University—First Faculty of MedicinePrague, Czech Republic
| | - Patricia B Munroe
- Clinical Pharmacology and The Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of LondonLondon EC1M 6BQ, United Kingdom
| | - Maris Laan
- Human Molecular Genetics Research Group, Institute of Molecular and Cell Biology, University of TartuTartu, Estonia
- *Correspondence to: Maris Laan, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia. E-mail:
| |
Collapse
|
23
|
Nagirnaja L, Rull K, Uusküla L, Hallast P, Grigorova M, Laan M. Genomics and genetics of gonadotropin beta-subunit genes: Unique FSHB and duplicated LHB/CGB loci. Mol Cell Endocrinol 2010; 329:4-16. [PMID: 20488225 PMCID: PMC2954307 DOI: 10.1016/j.mce.2010.04.024] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/05/2010] [Revised: 04/13/2010] [Accepted: 04/26/2010] [Indexed: 01/28/2023]
Abstract
The follicle stimulating hormone (FSH), luteinizing hormone (LH) and chorionic gonadotropin (HCG) play a critical role in human reproduction. Despite the common evolutionary ancestry and functional relatedness of the gonadotropin hormone beta (GtHB) genes, the single-copy FSHB (at 11p13) and the multi-copy LHB/CGB genes (at 19q13.32) exhibit locus-specific differences regarding their genomic context, evolution, genetic variation and expressional profile. FSHB represents a conservative vertebrate gene with a unique function and it is located in a structurally stable gene-poor region. In contrast, the primate-specific LHB/CGB gene cluster is located in a gene-rich genomic context and demonstrates an example of evolutionary young and unstable genomic region. The gene cluster is shaped by a constant balance between selection that acts on specific functions of the loci and frequent gene conversion events among duplicons. As the transcription of the GtHB genes is rate-limiting in the assembly of respective hormones, the genomic and genetic context of the FSHB and the LHB/CGB genes largely affects the profile of the hormone production.
Collapse
Affiliation(s)
- Liina Nagirnaja
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
| | - Kristiina Rull
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
- Department of Obstetrics and Gynecology, University of Tartu, Puusepa 8 G2, 51014 Tartu, Estonia
- Estonian Biocentre, Riia St. 23b, 51010 Tartu, Estonia
| | - Liis Uusküla
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
| | - Pille Hallast
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
| | - Marina Grigorova
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
- Estonian Biocentre, Riia St. 23b, 51010 Tartu, Estonia
| | - Maris Laan
- Institute of Molecular and Cell Biology, University of Tartu, Riia St. 23, 51010 Tartu, Estonia
- Corresponding author. Tel.: +372 7375008; fax: +372 7420286.
| |
Collapse
|
24
|
Hallast P, Saarela J, Palotie A, Laan M. High divergence in primate-specific duplicated regions: human and chimpanzee chorionic gonadotropin beta genes. BMC Evol Biol 2008; 8:195. [PMID: 18606016 PMCID: PMC2478647 DOI: 10.1186/1471-2148-8-195] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2007] [Accepted: 07/07/2008] [Indexed: 11/17/2022] Open
Abstract
Background Low nucleotide divergence between human and chimpanzee does not sufficiently explain the species-specific morphological, physiological and behavioral traits. As gene duplication is a major prerequisite for the emergence of new genes and novel biological processes, comparative studies of human and chimpanzee duplicated genes may assist in understanding the mechanisms behind primate evolution. We addressed the divergence between human and chimpanzee duplicated genomic regions by using Luteinizing Hormone Beta (LHB)/Chorionic Gonadotropin Beta (CGB) gene cluster as a model. The placental CGB genes that are essential for implantation have evolved from an ancestral pituitary LHB gene by duplications in the primate lineage. Results We shotgun sequenced and compared the human (45,165 bp) and chimpanzee (39,876 bp) LHB/CGB regions and hereby present evidence for structural variation resulting in discordant number of CGB genes (6 in human, 5 in chimpanzee). The scenario of species-specific parallel duplications was supported (i) as the most parsimonious solution requiring the least rearrangement events to explain the interspecies structural differences; (ii) by the phylogenetic trees constructed with fragments of intergenic regions; (iii) by the sequence similarity calculations. Across the orthologous regions of LHB/CGB cluster, substitutions and indels contributed approximately equally to the interspecies divergence and the distribution of nucleotide identity was correlated with the regional repeat content. Intraspecies gene conversion may have shaped the LHB/CGB gene cluster. The substitution divergence (1.8–2.59%) exceeded two-three fold the estimates for single-copy loci and the fraction of transversional mutations was increased compared to the unique sequences (43% versus ~30%). Despite the high sequence identity among LHB/CGB genes, there are signs of functional differentiation among the gene copies. Estimates for dn/ds rate ratio suggested a purifying selection on LHB and CGB8, and a positive evolution of CGB1. Conclusion If generalized, our data suggests that in addition to species-specific deletions and duplications, parallel duplication events may have contributed to genetic differences separating humans from their closest relatives. Compared to unique genomic segments, duplicated regions are characterized by high divergence promoted by intraspecies gene conversion and species-specific chromosomal rearrangements, including the alterations in gene copy number.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia.
| | | | | | | |
Collapse
|
25
|
Rull K, Hallast P, Uusküla L, Jackson J, Punab M, Salumets A, Campbell RK, Laan M. Fine-scale quantification of HCG beta gene transcription in human trophoblastic and non-malignant non-trophoblastic tissues. Mol Hum Reprod 2008; 14:23-31. [PMID: 18048458 PMCID: PMC2628200 DOI: 10.1093/molehr/gam082] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Human chorionic gonadotropin (HCG) is produced by syncytiotrophoblast of placenta. It delays the apoptosis of corpus luteum and functions in implantation. Its possible role in male reproduction has been raised. HCG beta subunit is encoded by CGB, CGB5, CGB7 and CGB8 genes located at 19q13.3 in a common genome cluster with beta subunit non-coding CGB1 and CGB2. We conducted a sensitive quantification and comparison of CGB gene expression in human trophoblastic (blastocysts, n = 6; normal/failed pregnancy, n = 51) and non-malignant non-trophoblastic tissues (15 different tissue types, samples n = 241), by real-time RT-PCR. We showed a wide transcriptional window of CGB genes in normal pregnancy, a significant reduction in recurrent miscarriages, and a high expression (especially CGB1/CGB2) in ectopic and molar pregnancies. Expression was several orders of magnitude lower in the non-placental tissues, with the highest CGB levels being seen in testis, prostate, thymus, skeletal muscle and lung samples. The contribution of CGB1/CGB2 to the summarized expression of six CGB genes was not proportional to their gene dosage: 1/1000 to 1/10,000. An interesting exception was the testis exhibiting a much higher CGB1/CGB2 to total CGB mRNA ratio of approximately one-third, corresponding to gene dosage. In conclusion, the expressional profile of CGB genes, activated already in blastocyst stage, is associated with the status of pregnancy. The presence of CGB transcripts in testes, and in particular CGB1/CGB2 transcripts, may indicate a role in male reproductive tract.
Collapse
Affiliation(s)
- K Rull
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
- Department of Obstetrics and Gynecology, University of Tartu, Lossi 36, 51003 Tartu, Estonia
| | - P Hallast
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
| | - L Uusküla
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
| | - J Jackson
- Department of Molecular Biology, EMD Serono Research Institute, One Technology Place, Rockland, MA USA 02093
| | - M Punab
- Andrology Unit, Tartu University Hospital, Puusepa 1A, 50406, Tartu, Estonia
| | - A Salumets
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
- Department of Obstetrics and Gynecology, University of Tartu, Lossi 36, 51003 Tartu, Estonia
- Nova Vita Clinic, Centre for Infertility Treatment and Medical Genetics, Kaluri tee 5A, Viimsi vald, 74001 Harjumaa Estonia
| | - RK Campbell
- Department of Molecular Biology, EMD Serono Research Institute, One Technology Place, Rockland, MA USA 02093
| | - M Laan
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
- Correspondence: Prof. Maris Laan, Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu; Riia St. 23, 51010 Tartu, Estonia; telephone: +372-7375008; fax: +372-7-420286,
| |
Collapse
|
26
|
Abstract
The origin of completely novel proteins is a significant question in evolution. The luteinizing hormone (LHB)/chorionic gonadotropin (CGB) gene cluster in humans contains a candidate example of this process. Two genes in this cluster (CGB1 and CGB2) exhibit nucleotide sequence similarity with the other LHB/CGB genes, but as a result of frameshifting are predicted to encode a completely novel protein. Our analysis of these genes from humans and related primates indicates a recent origin in the lineage specific to humans and African great apes. While the function of these genes is not yet known, they are strongly conserved between human and chimpanzee and exhibit three-fold lower diversity than LHB across human populations with no mutations that would disrupt the coding sequence. The 5'-upstream region of CGB1/2 contains most of the promoter sequence of hCGbeta plus a novel region proximal to the putative transcription start site. In silico prediction of putative transcription factor binding sites supports the hypothesis that CGB1 and CGB2 gene products are expressed in, and may contribute to, implantation and placental development.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
| | - Kristiina Rull
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
- Department of Obstetrics and Gynecology, University of Tartu, Estonia
| | - Maris Laan
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
- Corresponding author. Tel.: +372 7375008; fax: +372 7420286.
| |
Collapse
|
27
|
Hallast P, Nagirnaja L, Margus T, Laan M. Segmental duplications and gene conversion: Human luteinizing hormone/chorionic gonadotropin beta gene cluster. Genome Res 2006; 15:1535-46. [PMID: 16251463 PMCID: PMC1310641 DOI: 10.1101/gr.4270505] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Segmental duplicons (>1 kb) of high sequence similarity (>90%) covering >5% of the human genome are characterized by complex sequence variation. Apart from a few well-characterized regions (MHC, beta-globin), the diversity and linkage disequilibrium (LD) patterns of duplicons and the role of gene conversion in shaping them have been poorly studied. To shed light on these issues, we have re-sequenced the human Luteinizing Hormone/Chorionic Gonadotropin beta (LHB/CGB) cluster (19q13.32) of three population samples (Estonians, Mandenka, and Han). The LHB/CGB cluster consists of seven duplicated genes critical in human reproduction. In the LHB/CGB region, high sequence diversity, concentration of gene-conversion acceptor sites, and strong LD colocalize with peripheral genes, whereas central loci are characterized by lower variation, gene-conversion donor activity, and breakdown of LD between close markers. The data highlight an important role of gene conversion in spreading polymorphisms among duplicon copies and generating LD around them. The directionality of gene-conversion events seems to be determined by the localization of a predicted recombination "hotspot" and "warm spot" in the vicinity of the most active acceptor genes at the periphery of the cluster. The data suggest that enriched crossover activity in direct and inverted segmental repeats is in accordance with the formation of palindromic secondary structures promoting double-strand breaks rather than fixed DNA sequence motifs. Also, this first detailed coverage of sequence diversity and structure of the LHB/CGB gene cluster will pave the way for studying the identified polymorphisms as well as potential genomic rearrangements in association with an individual's reproductive success.
Collapse
Affiliation(s)
- Pille Hallast
- Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
| | | | | | | |
Collapse
|
28
|
Kõks S, Nikopensius T, Koido K, Maron E, Altmäe S, Heinaste E, Vabrit K, Tammekivi V, Hallast P, Kurg A, Shlik J, Vasar V, Metspalu A, Vasar E. Analysis of SNP profiles in patients with major depressive disorder. Int J Neuropsychopharmacol 2006; 9:167-74. [PMID: 15927089 DOI: 10.1017/s1461145705005468] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/02/2005] [Revised: 03/09/2005] [Accepted: 03/14/2005] [Indexed: 11/05/2022] Open
Abstract
The present study focused on 91 single-nucleotide polymorphisms (SNPs) in 21 candidate genes to find associations with major depressive disorder (MDD). In total, 160 healthy controls and 177 patients with MDD were studied. We applied arrayed primer extension (APEX) based genotyping technology followed by association and haplotype analysis. SNPs in CCKAR, DRD1, DRD2, and HTR2C genes showed nominally significant associations with MDD. None of these associations remained significant after adjustment for multiple testing. Haplotype analysis revealed CCKAR haplotypes to be associated with MDD (global p=0.004). More precisely, we found the GAGT haplotype to be associated with increased risk for MDD (OR 7.42, 95% CI 2.13-25.85, p=0.002). This haplotype effect remained significant after Bonferroni correction (p=0.04 after Bonferroni's adjustment). Altogether we were able to find some nominal associations, but due to small sample size these results should be taken as exploratory. However, the effect of GAGT haplotype on the CCKAR gene may be considered as increasing the risk for MDD.
Collapse
Affiliation(s)
- Sulev Kõks
- Department of Physiology, University of Tartu, Tartu, EstoniaVisgenyx Ltd, Tartu, Estonia.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Maron E, Nikopensius T, Kõks S, Altmäe S, Heinaste E, Vabrit K, Tammekivi V, Hallast P, Koido K, Kurg A, Metspalu A, Vasar E, Vasar V, Shlik J. Association study of 90 candidate gene polymorphisms in panic disorder. Psychiatr Genet 2005; 15:17-24. [PMID: 15722953 DOI: 10.1097/00041444-200503000-00004] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE In the present investigation we screened a large number of single nucleotide polymorphisms in the genes relevant to the neurobiology of anxiety for their association with panic disorder (PD). METHODS The study sample included 127 patients with PD and 146 healthy control subjects. Using Arrayed Primer Extension technology we genotyped 90 polymorphisms in 21 candidate genes of serotonin, cholecystokinin, dopamine and opioid neurotransmitter systems. The association and haplotype analyses were performed in the whole group (PD-all) and in the subgroups of PD comorbid with major depression (PD-comorbid, n = 60) and without any comorbidity (PD-pure, n = 42). RESULTS From the set of 90 polymorphisms, eight single nucleotide polymorphism markers in eight genes displayed at least a nominal association with any of the studied PD phenotype subgroups. Several polymorphisms of cholecystokinin, serotonin and dopamine systems were associated with PD-all and/or PD-comorbid phenotypes, while pure PD was associated only with HTR2A receptor 102T-C (P = 0.01) and DRD1 receptor -94G-A (P = 0.02) polymorphisms. Haplotype analysis supported an association of the cholecystokinin gene TG haplotype with the PD-all group (P = 0.04), whereas DRD1 receptor CAA and HTR2A receptor AT haplotypes were associated with a lower risk for PD-pure phenotype (P = 0.03 and P = 0.04, respectively). CONCLUSIONS The study results suggest that genetic variants of several candidate genes of neurotransmitter systems, each of a minor individual effect, may contribute to the susceptibility to PD. Our data also indicate that genetic variability may have a distinctive influence on pure and comorbid phenotypes of PD.
Collapse
Affiliation(s)
- Eduard Maron
- Department of Psychiatry, University of Tartu, Tartu 50417, Estonia.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Koido K, Kõks S, Nikopensius T, Maron E, Altmäe S, Heinaste E, Vabrit K, Tammekivi V, Hallast P, Kurg A, Shlik J, Vasar V, Metspalu A, Vasar E. Polymorphisms in wolframin (WFS1) gene are possibly related to increased risk for mood disorders. Int J Neuropsychopharmacol 2005; 8:235-44. [PMID: 15473915 DOI: 10.1017/s1461145704004791] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2004] [Revised: 07/11/2004] [Indexed: 11/06/2022] Open
Abstract
Wolfram syndrome gene (WFS1) has been suggested to have a role in the susceptibility for mood disorders. A 26-fold increased risk for psychiatric disorders in WFS1 mutation carriers has been suggested. In this study we tested the hypothesis that the WFS1 gene is related to the risk for mood disorders. We analysed 28 single-nucleotide polymorphisms (SNPs) of the WFS1 gene in 224 unrelated patients with major depressive disorder and bipolar disorder and in 160 healthy control subjects. Patients were further stratified according to their comorbidity with anxiety disorders. We applied arrayed primer extension (APEX)-based genotyping technology followed by association and haplotype analysis. Five SNPs in the WFS1 gene were associated with major depressive disorder, and three SNPs with bipolar disorder. Haplotype analysis revealed a common GTA haplotype, formed by SNPs 684C/G, 1185C/T and 1832G/A, conferring risk for affective disorders. Specifically, for major depression the GTA haplotype has an OR of 1.59 (p = 0.01) and for bipolar disorder an OR of 1.89 (p = 0.03). These results support the hypothesis that the WFS1 gene is involved in the genetic predisposition for mood disorders.
Collapse
Affiliation(s)
- Kati Koido
- Department of Physiology, University of Tartu, Tartu, Estonia
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|