1
|
Porubsky D, Eichler EE. A 25-year odyssey of genomic technology advances and structural variant discovery. Cell 2024; 187:1024-1037. [PMID: 38290514 DOI: 10.1016/j.cell.2024.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/20/2023] [Accepted: 01/02/2024] [Indexed: 02/01/2024]
Abstract
This perspective focuses on advances in genome technology over the last 25 years and their impact on germline variant discovery within the field of human genetics. The field has witnessed tremendous technological advances from microarrays to short-read sequencing and now long-read sequencing. Each technology has provided genome-wide access to different classes of human genetic variation. We are now on the verge of comprehensive variant detection of all forms of variation for the first time with a single assay. We predict that this transition will further transform our understanding of human health and biology and, more importantly, provide novel insights into the dynamic mutational processes shaping our genomes.
Collapse
Affiliation(s)
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
2
|
Vollger MR, Dishuck PC, Harvey WT, DeWitt WS, Guitart X, Goldberg ME, Rozanski AN, Lucas J, Asri M, Munson KM, Lewis AP, Hoekzema K, Logsdon GA, Porubsky D, Paten B, Harris K, Hsieh P, Eichler EE. Increased mutation and gene conversion within human segmental duplications. Nature 2023; 617:325-334. [PMID: 37165237 PMCID: PMC10172114 DOI: 10.1038/s41586-023-05895-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 02/28/2023] [Indexed: 05/12/2023]
Abstract
Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have 'relocated' on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.
Collapse
Affiliation(s)
- Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William S DeWitt
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
| | - Xavi Guitart
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Michael E Goldberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Julian Lucas
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Kelley Harris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| |
Collapse
|
3
|
Bonito M, D’Atanasio E, Ravasini F, Cariati S, Finocchio A, Novelletto A, Trombetta B, Cruciani F. New insights into the evolution of human Y chromosome palindromes through mutation and gene conversion. Hum Mol Genet 2021; 30:2272-2285. [PMID: 34244762 PMCID: PMC8600007 DOI: 10.1093/hmg/ddab189] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 07/01/2021] [Accepted: 07/05/2021] [Indexed: 12/16/2022] Open
Abstract
About one-quarter of the euchromatic portion of the male-specific region of the human Y chromosome consists of large duplicated sequences that are organized in eight palindromes (termed P1-P8), which undergo arm-to arm gene conversion, a proposed mechanism for maintaining their sequence integrity. Although the relevance of gene conversion in the evolution of palindromic sequences has been profoundly recognized, the dynamic of this mechanism is still nuanced. To shed light into the evolution of these genomic elements, we performed a high-depth (50×) targeted next-generation sequencing of the palindrome P6 in 157 subjects belonging to the most divergent evolutionary lineages of the Y chromosome. We found 118 new paralogous sequence variants, which were placed into the context of a robust Y chromosome phylogeny based on 7240 SNPs of the X-degenerate region. We mapped along the phylogeny 80 gene conversion events that shaped the diversity of P6 arms during recent human history. In contrast to previous studies, we demonstrated that arm-to-arm gene conversion, which occurs at a rate of 6.01 × 10 -6 conversions/base/year, is not biased toward the retention of the ancestral state of sequences. We also found a significantly lower mutation rate of the arms (6.18 × 10-10 mutations/base/year) compared with the spacer (9.16 × 10-10 mutations/base/year), a finding that may explain the observed higher inter-species conservation of arms, without invoking any bias of conversion. Finally, by formally testing the mutation/conversion balance in P6, we found that the arms of this palindrome reached a steady-state equilibrium between mutation and gene conversion.
Collapse
Affiliation(s)
- Maria Bonito
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Eugenia D’Atanasio
- Institute of Molecular Biology and Pathology (IBPM), CNR, Rome 0185, Italy
| | - Francesco Ravasini
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Selene Cariati
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Andrea Finocchio
- Department of Biology, University of Rome Tor Vergata, Rome 0133, Italy
| | - Andrea Novelletto
- Department of Biology, University of Rome Tor Vergata, Rome 0133, Italy
| | - Beniamino Trombetta
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Fulvio Cruciani
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
- Institute of Molecular Biology and Pathology (IBPM), CNR, Rome 0185, Italy
| |
Collapse
|
4
|
Esin A, Bergendahl LT, Savolainen V, Marsh JA, Warnecke T. The genetic basis and evolution of red blood cell sickling in deer. Nat Ecol Evol 2018; 2:367-376. [PMID: 29255300 PMCID: PMC5777626 DOI: 10.1038/s41559-017-0420-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 11/20/2017] [Indexed: 11/09/2022]
Abstract
Crescent-shaped red blood cells, the hallmark of sickle-cell disease, present a striking departure from the biconcave disc shape normally found in mammals. Characterized by increased mechanical fragility, sickled cells promote haemolytic anaemia and vaso-occlusions and contribute directly to disease in humans. Remarkably, a similar sickle-shaped morphology has been observed in erythrocytes from several deer species, without obvious pathological consequences. The genetic basis of erythrocyte sickling in deer, however, remains unknown. Here, we determine the sequences of human β-globin orthologues in 15 deer species and use protein structural modelling to identify a sickling mechanism distinct from the human disease, coordinated by a derived valine (E22V) that is unique to sickling deer. Evidence for long-term maintenance of a trans-species sickling/non-sickling polymorphism suggests that sickling in deer is adaptive. Our results have implications for understanding the ecological regimes and molecular architectures that have promoted convergent evolution of sickling erythrocytes across vertebrates.
Collapse
Affiliation(s)
- Alexander Esin
- Molecular Systems Group, Medical Research Council London Institute of Medical Sciences, Du Cane Road, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Du Cane Road, London, United Kingdom
| | - L Therese Bergendahl
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, United Kingdom
| | - Vincent Savolainen
- Department of Life Sciences, Silwood Park Campus, Imperial College London, Ascot, United Kingdom
- University of Johannesburg, Auckland Park, Johannesburg, South Africa
| | - Joseph A Marsh
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, United Kingdom
| | - Tobias Warnecke
- Molecular Systems Group, Medical Research Council London Institute of Medical Sciences, Du Cane Road, London, United Kingdom.
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Du Cane Road, London, United Kingdom.
| |
Collapse
|
5
|
Shi W, Massaia A, Louzada S, Banerjee R, Hallast P, Chen Y, Bergström A, Gu Y, Leonard S, Quail MA, Ayub Q, Yang F, Tyler-Smith C, Xue Y. Copy number variation arising from gene conversion on the human Y chromosome. Hum Genet 2017; 137:73-83. [PMID: 29209947 DOI: 10.1007/s00439-017-1857-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 11/28/2017] [Indexed: 11/27/2022]
Abstract
We describe the variation in copy number of a ~ 10 kb region overlapping the long intergenic noncoding RNA (lincRNA) gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with 0-3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies, 943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint PCR, fibre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000 Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established by the Project identified at least 20 mutational events, and investigation of flanking paralogous sequence variants showed that the mutations involved flanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene conversion or double crossover between misaligned sister chromatids could formally explain the 0-2 copy events, gene conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far. Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of 420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large size of the conversion events that can occur.
Collapse
Affiliation(s)
- Wentao Shi
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 30070, China
| | - Andrea Massaia
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- National Heart and Lung Institute, Imperial College London, London, SW7 2AZ, UK
| | - Sandra Louzada
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Ruby Banerjee
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Pille Hallast
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, 51010, Estonia
| | - Yuan Chen
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Anders Bergström
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Yong Gu
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Steven Leonard
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Michael A Quail
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Qasim Ayub
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- School of Science, Monash University Malaysia, Jalan Lagoon Selantan, Bandar Sunway, 47500, Subang Jaya, Selangor Darul Ehsan, Malaysia
| | - Fengtang Yang
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Yali Xue
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| |
Collapse
|
6
|
Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates. Proc Natl Acad Sci U S A 2017; 114:12779-12784. [PMID: 29138319 DOI: 10.1073/pnas.1708151114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Gene conversion is the copying of a genetic sequence from a "donor" region to an "acceptor." In nonallelic gene conversion (NAGC), the donor and the acceptor are at distinct genetic loci. Despite the role NAGC plays in various genetic diseases and the concerted evolution of gene families, the parameters that govern NAGC are not well characterized. Here, we survey duplicate gene families and identify converted tracts in 46% of them. These conversions reflect a large GC bias of NAGC. We develop a sequence evolution model that leverages substantially more information in duplicate sequences than used by previous methods and use it to estimate the parameters that govern NAGC in humans: a mean converted tract length of 250 bp and a probability of [Formula: see text] per generation for a nucleotide to be converted (an order of magnitude higher than the point mutation rate). Despite this high baseline rate, we show that NAGC slows down as duplicate sequences diverge-until an eventual "escape" of the sequences from its influence. As a result, NAGC has a small average effect on the sequence divergence of duplicates. This work improves our understanding of the NAGC mechanism and the role that it plays in the evolution of gene duplicates.
Collapse
|
7
|
Trombetta B, D'Atanasio E, Cruciani F. Patterns of Inter-Chromosomal Gene Conversion on the Male-Specific Region of the Human Y Chromosome. Front Genet 2017; 8:54. [PMID: 28515739 PMCID: PMC5413550 DOI: 10.3389/fgene.2017.00054] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 04/18/2017] [Indexed: 12/31/2022] Open
Abstract
The male-specific region of the human Y chromosome (MSY) is characterized by the lack of meiotic recombination and it has long been considered an evolutionary independent region of the human genome. In recent years, however, the idea that human MSY did not have an independent evolutionary history begun to emerge with the discovery that inter-chromosomal gene conversion (ICGC) can modulate the genetic diversity of some portions of this genomic region. Despite the study of the dynamics of this molecular mechanism in humans is still in its infancy, some peculiar features and consequences of it can be summarized. The main effect of ICGC is to increase the allelic diversity of MSY by generating a significant excess of clustered single nucleotide polymorphisms (SNPs) (defined as groups of two or more SNPs occurring in close proximity and on the same branch of the Y phylogeny). On the human MSY, 13 inter-chromosomal gene conversion hotspots (GCHs) have been identified so far, involving donor sequences mainly from the X-chromosome and, to a lesser extent, from autosomes. Most of the GCHs are evolutionary conserved and overlap with regions involved in aberrant X–Y crossing-over. This review mainly focuses on the dynamics and the current knowledge concerning the recombinational landscape of the human MSY in the form of ICGC, on how this molecular mechanism may influence the evolution of the MSY, and on how it could affect the information enclosed within a genomic region which, until recently, appeared to be an evolutionary independent unit.
Collapse
Affiliation(s)
- Beniamino Trombetta
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy
| | - Eugenia D'Atanasio
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy
| | - Fulvio Cruciani
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy.,Istituto di Biologia e Patologia Molecolari, Consiglio Nazionale delle Ricerche (CNR),Rome, Italy
| |
Collapse
|
8
|
Y chromosome palindromes and gene conversion. Hum Genet 2017; 136:605-619. [PMID: 28303348 DOI: 10.1007/s00439-017-1777-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2016] [Accepted: 03/07/2017] [Indexed: 02/02/2023]
Abstract
The presence of large and near-identical inverted repeat sequences (called palindromes) is a common feature of the constitutively haploid sex chromosomes of different species. Despite the fact palindromes originated in a non-recombining context, they have evolved a strong recombinational activity in the form of abundant arm-to-arm gene conversion. Their independent appearance in different species suggests they can have a profound biological significance that has yet to be fully clarified. It has been theorized that natural selection may have favored palindromic organization of male-specific genes and that the establishment of intra-palindrome gene conversion has strong adaptive significance. Arm-to-arm gene conversion allows the efficient removal of deleterious mutations, increases the fixation rate of beneficial mutations and has played an important role in modulating the equilibrium between gene loss and acquisition during Y chromosome evolution. Additionally, a palindromic organization of duplicates could favor the formation of unusual chromatin structures and could optimize the use of gene conversion as a mechanism to maintain the structural integrity of male-specific genes. In this review, we describe the structural features of palindromes on mammalian sex chromosomes and summarize different hypotheses regarding palindrome evolution and the functional benefits of arm-to-arm gene conversion on the unique haploid portion of the nuclear genome.
Collapse
|
9
|
Abstract
The great apes (orangutans, gorillas, chimpanzees, bonobos and humans) descended from a common ancestor around 13 million years ago, and since then their sex chromosomes have followed very different evolutionary paths. While great-ape X chromosomes are highly conserved, their Y chromosomes, reflecting the general lability and degeneration of this male-specific part of the genome since its early mammalian origin, have evolved rapidly both between and within species. Understanding great-ape Y chromosome structure, gene content and diversity would provide a valuable evolutionary context for the human Y, and would also illuminate sex-biased behaviours, and the effects of the evolutionary pressures exerted by different mating strategies on this male-specific part of the genome. High-quality Y-chromosome sequences are available for human and chimpanzee (and low-quality for gorilla). The chromosomes differ in size, sequence organisation and content, and while retaining a relatively stable set of ancestral single-copy genes, show considerable variation in content and copy number of ampliconic multi-copy genes. Studies of Y-chromosome diversity in other great apes are relatively undeveloped compared to those in humans, but have nevertheless provided insights into speciation, dispersal, and mating patterns. Future studies, including data from larger sample sizes of wild-born and geographically well-defined individuals, and full Y-chromosome sequences from bonobos, gorillas and orangutans, promise to further our understanding of population histories, male-biased behaviours, mutation processes, and the functions of Y-chromosomal genes.
Collapse
|
10
|
Trombetta B, Fantini G, D'Atanasio E, Sellitto D, Cruciani F. Evidence of extensive non-allelic gene conversion among LTR elements in the human genome. Sci Rep 2016; 6:28710. [PMID: 27346230 PMCID: PMC4921805 DOI: 10.1038/srep28710] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 06/06/2016] [Indexed: 12/16/2022] Open
Abstract
Long Terminal Repeats (LTRs) are nearly identical DNA sequences found at either end of Human Endogenous Retroviruses (HERVs). The high sequence similarity that exists among different LTRs suggests they could be substrate of ectopic gene conversion events. To understand the extent to which gene conversion occurs and to gain new insights into the evolutionary history of these elements in humans, we performed an intra-species phylogenetic study of 52 LTRs on different unrelated Y chromosomes. From this analysis, we obtained direct evidence that demonstrates the occurrence of ectopic gene conversion in several LTRs, with donor sequences located on both sex chromosomes and autosomes. We also found that some of these elements are characterized by an extremely high density of polymorphisms, showing one of the highest nucleotide diversities in the human genome, as well as a complex patchwork of sequences derived from different LTRs. Finally, we highlighted the limits of current short-read NGS studies in the analysis of genetic diversity of the LTRs in the human genome. In conclusion, our comparative re-sequencing analysis revealed that ectopic gene conversion is a common event in the evolution of LTR elements, suggesting complex genetic links among LTRs from different chromosomes.
Collapse
Affiliation(s)
- Beniamino Trombetta
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, Rome, Italy
| | - Gloria Fantini
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, Rome, Italy
| | - Eugenia D'Atanasio
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, Rome, Italy
| | | | - Fulvio Cruciani
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, Rome, Italy.,Istituto di Biologia e Patologia Molecolari, CNR, Rome, Italy.,Istituto Pasteur-Fondazione Cenci Bolognetti, Sapienza Università di Roma, Rome, Italy
| |
Collapse
|
11
|
Ghenu AH, Bolker BM, Melnick DJ, Evans BJ. Multicopy gene family evolution on primate Y chromosomes. BMC Genomics 2016; 17:157. [PMID: 26925773 PMCID: PMC4772468 DOI: 10.1186/s12864-015-2187-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2015] [Accepted: 11/02/2015] [Indexed: 12/12/2022] Open
Abstract
Background The primate Y chromosome is distinguished by a lack of inter-chromosomal recombination along most of its length, extensive gene loss, and a prevalence of repetitive elements. A group of genes on the male-specific portion of the Y chromosome known as the “ampliconic genes” are present in multiple copies that are sometimes part of palindromes, and that undergo a form of intra-chromosomal recombination called gene conversion, wherein the nucleotides of one copy are homogenized by those of another. With the aim of further understanding gene family evolution of these genes, we collected nucleotide sequence and gene copy number information for several species of papionin monkey. We then tested for evidence of gene conversion, and developed a novel statistical framework to evaluate alternative models of gene family evolution using our data combined with other information from a human, a chimpanzee, and a rhesus macaque. Results Our results (i) recovered evidence for several novel examples of gene conversion in papionin monkeys and indicate that (ii) ampliconic gene families evolve faster than autosomal gene families and than single-copy genes on the Y chromosome and that (iii) Y-linked singleton and autosomal gene families evolved faster in humans and chimps than they do in the other Old World Monkey lineages we studied. Conclusions Rapid evolution of ampliconic genes cannot be attributed solely to residence on the Y chromosome, nor to variation between primate lineages in the rate of gene family evolution. Instead other factors, such as natural selection and gene conversion, appear to play a role in driving temporal and genomic evolutionary heterogeneity in primate gene families. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2187-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana-Hermina Ghenu
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| | - Benjamin M Bolker
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.,Department of Mathematics & Statistics, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada
| | - Don J Melnick
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, 10th Floor Schermerhorn Extension, New York, 10027, USA
| | - Ben J Evans
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| |
Collapse
|
12
|
Dumont BL. Interlocus gene conversion explains at least 2.7% of single nucleotide variants in human segmental duplications. BMC Genomics 2015; 16:456. [PMID: 26077037 PMCID: PMC4467073 DOI: 10.1186/s12864-015-1681-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 06/01/2015] [Indexed: 01/24/2023] Open
Abstract
Background Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. Results Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. Conclusions Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Beth L Dumont
- Initiative in Biological Complexity, North Carolina State University, 112 Derieux Place, 3510 Thomas Hall, Campus Box 7614, Raleigh, NC, 27695-7614, USA.
| |
Collapse
|
13
|
Hallast P, Batini C, Zadik D, Maisano Delser P, Wetton JH, Arroyo-Pardo E, Cavalleri GL, de Knijff P, Destro Bisol G, Dupuy BM, Eriksen HA, Jorde LB, King TE, Larmuseau MH, López de Munain A, López-Parra AM, Loutradis A, Milasin J, Novelletto A, Pamjav H, Sajantila A, Schempp W, Sears M, Tolun A, Tyler-Smith C, Van Geystelen A, Watkins S, Winney B, Jobling MA. The Y-chromosome tree bursts into leaf: 13,000 high-confidence SNPs covering the majority of known clades. Mol Biol Evol 2014; 32:661-73. [PMID: 25468874 PMCID: PMC4327154 DOI: 10.1093/molbev/msu327] [Citation(s) in RCA: 111] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Many studies of human populations have used the male-specific region of the Y chromosome (MSY) as a marker, but MSY sequence variants have traditionally been subject to ascertainment bias. Also, dating of haplogroups has relied on Y-specific short tandem repeats (STRs), involving problems of mutation rate choice, and possible long-term mutation saturation. Next-generation sequencing can ascertain single nucleotide polymorphisms (SNPs) in an unbiased way, leading to phylogenies in which branch-lengths are proportional to time, and allowing the times-to-most-recent-common-ancestor (TMRCAs) of nodes to be estimated directly. Here we describe the sequencing of 3.7 Mb of MSY in each of 448 human males at a mean coverage of 51×, yielding 13,261 high-confidence SNPs, 65.9% of which are previously unreported. The resulting phylogeny covers the majority of the known clades, provides date estimates of nodes, and constitutes a robust evolutionary framework for analyzing the history of other classes of mutation. Different clades within the tree show subtle but significant differences in branch lengths to the root. We also apply a set of 23 Y-STRs to the same samples, allowing SNP- and STR-based diversity and TMRCA estimates to be systematically compared. Ongoing purifying selection is suggested by our analysis of the phylogenetic distribution of nonsynonymous variants in 15 MSY single-copy genes.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Chiara Batini
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Daniel Zadik
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | | | - Jon H Wetton
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Eduardo Arroyo-Pardo
- Laboratory of Forensic and Population Genetics, Department of Toxicology and Health Legislation, Faculty of Medicine, Complutense University, Madrid, Spain
| | - Gianpiero L Cavalleri
- Molecular and Cellular Therapeutics, The Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Peter de Knijff
- Department of Human Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Giovanni Destro Bisol
- Istituto Italiano di Antropologia, Rome, Italy Department of Environmental Biology, Sapienza University of Rome, Rome, Italy
| | - Berit Myhre Dupuy
- Division of Forensic Sciences, Norwegian Institute of Public Health, Oslo, Norway
| | - Heidi A Eriksen
- Centre of Arctic Medicine, Thule Institute, University of Oulu, Oulu, Finland Utsjoki Health Care Centre, Utsjoki, Finland
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT
| | - Turi E King
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Maarten H Larmuseau
- Laboratory of Forensic Genetics and Molecular Archaeology, KU Leuven, Leuven, Belgium Department of Imaging & Pathology, Biomedical Forensic Sciences, KU Leuven, Leuven, Belgium Laboratory of Biodiversity and Evolutionary Genomics, Department of Biology, KU Leuven, Leuven, Belgium
| | | | - Ana M López-Parra
- Laboratory of Forensic and Population Genetics, Department of Toxicology and Health Legislation, Faculty of Medicine, Complutense University, Madrid, Spain
| | | | - Jelena Milasin
- School of Dental Medicine, Institute of Human Genetics, University of Belgrade, Belgrade, Serbia
| | | | - Horolma Pamjav
- Network of Forensic Science Institutes, Institute of Forensic Medicine, Budapest, Hungary
| | - Antti Sajantila
- Department of Forensic Medicine, Hjelt Institute, University of Helsinki, Helsinki, Finland Department of Molecular and Medical Genetics, Institute of Applied Genetics, University of North Texas Health Science Center, Fort Worth, Texas
| | - Werner Schempp
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Matt Sears
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Aslıhan Tolun
- Department of Molecular Biology and Genetics, Boğaziçi University, Istanbul, Turkey
| | | | - Anneleen Van Geystelen
- Laboratory of Socioecology and Social Evolution, Department of Biology, KU Leuven, Leuven, Belgium
| | - Scott Watkins
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT
| | - Bruce Winney
- Department of Oncology, University of Oxford, Oxford, United Kingdom
| | - Mark A Jobling
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
14
|
Interplay of interlocus gene conversion and crossover in segmental duplications under a neutral scenario. G3-GENES GENOMES GENETICS 2014; 4:1479-89. [PMID: 24906640 PMCID: PMC4132178 DOI: 10.1534/g3.114.012435] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Interlocus gene conversion is a major evolutionary force that drives the concerted evolution of duplicated genomic regions. Theoretical models successfully have addressed the effects of interlocus gene conversion and the importance of crossover in the evolutionary fate of gene families and duplications but have not considered complex recombination scenarios, such as the presence of hotspots. To study the interplay between interlocus gene conversion and crossover, we have developed a forward-time simulator that allows the exploration of a wide range of interlocus gene conversion rates under different crossover models. Using it, we have analyzed patterns of nucleotide variation and linkage disequilibrium within and between duplicate regions, focusing on a neutral scenario with constant population size and validating our results with the existing theoretical models. We show that the interaction of gene conversion and crossover is nontrivial and that the location of crossover junctions is a fundamental determinant of levels of variation and linkage disequilibrium in duplicated regions. We also show that if crossover activity between duplications is strong enough, recurrent interlocus gene conversion events can break linkage disequilibrium within duplicates. Given the complex nature of interlocus gene conversion and crossover, we provide a framework to explore their interplay to help increase knowledge on molecular evolution within segmental duplications under more complex scenarios, such as demographic changes or natural selection.
Collapse
|
15
|
Trombetta B, Sellitto D, Scozzari R, Cruciani F. Inter- and intraspecies phylogenetic analyses reveal extensive X-Y gene conversion in the evolution of gametologous sequences of human sex chromosomes. Mol Biol Evol 2014; 31:2108-23. [PMID: 24817545 PMCID: PMC4104316 DOI: 10.1093/molbev/msu155] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
It has long been believed that the male-specific region of the human Y chromosome (MSY) is genetically independent from the X chromosome. This idea has been recently dismissed due to the discovery that X–Y gametologous gene conversion may occur. However, the pervasiveness of this molecular process in the evolution of sex chromosomes has yet to be exhaustively analyzed. In this study, we explored how pervasive X–Y gene conversion has been during the evolution of the youngest stratum of the human sex chromosomes. By comparing about 0.5 Mb of human–chimpanzee gametologous sequences, we identified 19 regions in which extensive gene conversion has occurred. From our analysis, two major features of these emerged: 1) Several of them are evolutionarily conserved between the two species and 2) almost all of the 19 hotspots overlap with regions where X–Y crossing-over has been previously reported to be involved in sex reversal. Furthermore, in order to explore the dynamics of X–Y gametologous conversion in recent human evolution, we resequenced these 19 hotspots in 68 widely divergent Y haplogroups and used publicly available single nucleotide polymorphism data for the X chromosome. We found that at least ten hotspots are still active in humans. Hence, the results of the interspecific analysis are consistent with the hypothesis of widespread reticulate evolution within gametologous sequences in the differentiation of hominini sex chromosomes. In turn, intraspecific analysis demonstrates that X–Y gene conversion may modulate human sex-chromosome-sequence evolution to a greater extent than previously thought.
Collapse
Affiliation(s)
- Beniamino Trombetta
- Dipartimento di Biologia e Biotecnologie "Charles Darwin," Sapienza Università di Roma, Roma, Italy
| | | | - Rosaria Scozzari
- Dipartimento di Biologia e Biotecnologie "Charles Darwin," Sapienza Università di Roma, Roma, Italy
| | - Fulvio Cruciani
- Dipartimento di Biologia e Biotecnologie "Charles Darwin," Sapienza Università di Roma, Roma, ItalyIstituto di Biologia e Patologia Molecolari, CNR, Roma, ItalyIstituto Pasteur-Fondazione Cenci Bolognetti, Sapienza Università di Roma, Roma, Italy
| |
Collapse
|
16
|
Morrison DA. Is the Tree of Life the Best Metaphor, Model, or Heuristic for Phylogenetics? Syst Biol 2014; 63:628-38. [DOI: 10.1093/sysbio/syu026] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- David A. Morrison
- Section for Parasitology, Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden
| |
Collapse
|
17
|
Mussotter T, Bengesser K, Högel J, Cooper DN, Kehrer-Sawatzki H. Population-specific differences in gene conversion patterns between human SUZ12 and SUZ12P are indicative of the dynamic nature of interparalog gene conversion. Hum Genet 2014; 133:383-401. [PMID: 24385046 DOI: 10.1007/s00439-013-1410-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Accepted: 12/08/2013] [Indexed: 11/29/2022]
Abstract
Nonallelic homologous gene conversion (NAHGC) resulting from interparalog recombination without crossover represents an important influence on the evolution of duplicated sequences in the human genome. In 17q11.2, different paralogous sequences mediate large NF1 deletions by nonallelic homologous recombination with crossover (NAHR). Among these paralogs are SUZ12 and its pseudogene SUZ12P which harbour the breakpoints of type-2 (1.2-Mb) NF1 deletions. Such deletions are caused predominantly by mitotic NAHR since somatic mosaicism with normal cells is evident in most patients. Investigating whether SUZ12 and SUZ12P have also been involved in NAHGC, we observed gene conversion tracts between these paralogs in both Africans (AFR) and Europeans (EUR). Since germline type-2 NF1 deletions resulting from meiotic NAHR are very rare, the vast majority of the gene conversion tracts in SUZ12 and SUZ12P are likely to have resulted from mitotic recombination during premeiotic cell divisions of germ cells. A higher number of gene conversion tracts were noted within SUZ12 and SUZ12P in AFR as compared to EUR. Further, the distinctive signature of NAHGC (a high number of SNPs per paralog and a high number of shared SNPs between paralogs), a characteristic of many actively recombining paralogs, was observed in both SUZ12 and SUZ12P but only in AFR and not in EUR. A novel polymorphic 2.3-kb deletion in SUZ12P was identified which exhibited a high allele frequency in EUR. We postulate that this interparalog structural difference, together with low allelic recombination rates, could have caused a reduction in NAHGC between SUZ12 and SUZ12P during human evolution.
Collapse
Affiliation(s)
- Tanja Mussotter
- Institute of Human Genetics, University of Ulm, Albert-Einstein-Allee 11, 89081, Ulm, Germany
| | | | | | | | | |
Collapse
|
18
|
Bengesser K, Vogt J, Mussotter T, Mautner VF, Messiaen L, Cooper DN, Kehrer-Sawatzki H. Analysis of crossover breakpoints yields new insights into the nature of the gene conversion events associated with large NF1 deletions mediated by nonallelic homologous recombination. Hum Mutat 2013; 35:215-26. [PMID: 24186807 DOI: 10.1002/humu.22473] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 10/18/2013] [Indexed: 12/31/2022]
Abstract
Large NF1 deletions are mediated by nonallelic homologous recombination (NAHR). An in-depth analysis of gene conversion operating in the breakpoint-flanking regions of large NF1 deletions was performed to investigate whether the rate of discontinuous gene conversion during NAHR with crossover is increased, as has been previously noted in NAHR-mediated rearrangements. All 20 germline type-1 NF1 deletions analyzed were mediated by NAHR associated with continuous gene conversion within the breakpoint-flanking regions. Continuous gene conversion was also observed in 31/32 type-2 NF1 deletions investigated. In contrast to the meiotic type-1 NF1 deletions, type-2 NF1 deletions are predominantly of post-zygotic origin. Our findings therefore imply that the mitotic as well as the meiotic NAHR intermediates of large NF1 deletions are processed by long-patch mismatch repair (MMR), thereby ensuring gene conversion tract continuity instead of the discontinuous gene conversion that is characteristic of short-patch repair. However, the single type-2 NF1 deletion not exhibiting continuous gene conversion was processed without MMR, yielding two different deletion-bearing chromosomes, which were distinguishable in terms of their breakpoint positions. Our findings indicate that MMR failure during NAHR, followed by post-meiotic/mitotic segregation, has the potential to give rise to somatic mosaicism in human genomic rearrangements by generating breakpoint heterogeneity.
Collapse
|
19
|
Dumont BL, Eichler EE. Signals of historical interlocus gene conversion in human segmental duplications. PLoS One 2013; 8:e75949. [PMID: 24124524 PMCID: PMC3790853 DOI: 10.1371/journal.pone.0075949] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 08/17/2013] [Indexed: 12/04/2022] Open
Abstract
Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i) a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii) the alignment-based method implemented in the GENECONV program. One-quarter (25.4%) of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.
Collapse
Affiliation(s)
- Beth L. Dumont
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- * E-mail:
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
| |
Collapse
|
20
|
Hanikenne M, Kroymann J, Trampczynska A, Bernal M, Motte P, Clemens S, Krämer U. Hard selective sweep and ectopic gene conversion in a gene cluster affording environmental adaptation. PLoS Genet 2013; 9:e1003707. [PMID: 23990800 PMCID: PMC3749932 DOI: 10.1371/journal.pgen.1003707] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Accepted: 06/22/2013] [Indexed: 12/27/2022] Open
Abstract
Among the rare colonizers of heavy-metal rich toxic soils, Arabidopsis halleri is a compelling model extremophile, physiologically distinct from its sister species A. lyrata, and A. thaliana. Naturally selected metal hypertolerance and extraordinarily high leaf metal accumulation in A. halleri both require Heavy Metal ATPase4 (HMA4) encoding a PIB-type ATPase that pumps Zn(2+) and Cd(2+) out of specific cell types. Strongly enhanced HMA4 expression results from a combination of gene copy number expansion and cis-regulatory modifications, when compared to A. thaliana. These findings were based on a single accession of A. halleri. Few studies have addressed nucleotide sequence polymorphism at loci known to govern adaptations. We thus sequenced 13 DNA segments across the HMA4 genomic region of multiple A. halleri individuals from diverse habitats. Compared to control loci flanking the three tandem HMA4 gene copies, a gradual depletion of nucleotide sequence diversity and an excess of low-frequency polymorphisms are hallmarks of positive selection in HMA4 promoter regions, culminating at HMA4-3. The accompanying hard selective sweep is segmentally eclipsed as a consequence of recurrent ectopic gene conversion among HMA4 protein-coding sequences, resulting in their concerted evolution. Thus, HMA4 coding sequences exhibit a network-like genealogy and locally enhanced nucleotide sequence diversity within each copy, accompanied by lowered sequence divergence between paralogs in any given individual. Quantitative PCR corroborated that, across A. halleri, three genomic HMA4 copies generate overall 20- to 130-fold higher transcript levels than in A. thaliana. Together, our observations constitute an unexpectedly complex profile of polymorphism resulting from natural selection for increased gene product dosage. We propose that these findings are paradigmatic of a category of multi-copy genes from a broad range of organisms. Our results emphasize that enhanced gene product dosage, in addition to neo- and sub-functionalization, can account for the genomic maintenance of gene duplicates underlying environmental adaptation.
Collapse
Affiliation(s)
- Marc Hanikenne
- Functional Genomics and Plant Molecular Imaging, Center for Protein Engineering (CIP), Department of Life Sciences, University of Liège, Liège, Belgium
| | - Juergen Kroymann
- Laboratoire d'Ecologie, Systématique et Evolution, Université Paris-Sud/CNRS, Orsay, France
| | | | - María Bernal
- Department of Plant Physiology, Ruhr University Bochum, Bochum, Germany
| | - Patrick Motte
- Functional Genomics and Plant Molecular Imaging, Center for Protein Engineering (CIP), Department of Life Sciences, University of Liège, Liège, Belgium
| | - Stephan Clemens
- Department of Plant Physiology, University of Bayreuth, Bayreuth, Germany
| | - Ute Krämer
- Department of Plant Physiology, Ruhr University Bochum, Bochum, Germany
| |
Collapse
|
21
|
Fawcett JA, Innan H. The role of gene conversion in preserving rearrangement hotspots in the human genome. Trends Genet 2013; 29:561-8. [PMID: 23953668 DOI: 10.1016/j.tig.2013.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Revised: 06/20/2013] [Accepted: 07/08/2013] [Indexed: 11/27/2022]
Abstract
Hotspots of non-allelic homologous recombination (NAHR) have a crucial role in creating genetic diversity and are also associated with dozens of genomic disorders. Recent studies suggest that many human NAHR hotspots have been preserved throughout the evolution of primates. NAHR hotspots are likely to remain active as long as the segmental duplications (SDs) promoting NAHR retain sufficient similarity. Here, we propose an evolutionary model of SDs that incorporates the effect of gene conversion and compare it with a null model that assumes SDs evolve independently without gene conversion. The gene conversion model predicts a much longer lifespan of NAHR hotspots compared with the null model. We show that the literature on copy number variants (CNVs) and genomic disorders, and also the results of additional analysis of CNVs, are all more consistent with the gene conversion model.
Collapse
Affiliation(s)
- Jeffrey A Fawcett
- Graduate University for Advanced Studies, Hayama, Kanagawa 240-0193, Japan
| | | |
Collapse
|
22
|
Hallast P, Balaresque P, Bowden GR, Ballereau S, Jobling MA. Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions. PLoS Genet 2013; 9:e1003666. [PMID: 23935520 PMCID: PMC3723533 DOI: 10.1371/journal.pgen.1003666] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 06/07/2013] [Indexed: 11/19/2022] Open
Abstract
The male-specific region of the human Y chromosome (MSY) includes eight large inverted repeats (palindromes) in which arm-to-arm similarity exceeds 99.9%, due to gene conversion activity. Here, we studied one of these palindromes, P6, in order to illuminate the dynamics of the gene conversion process. We genotyped ten paralogous sequence variants (PSVs) within the arms of P6 in 378 Y chromosomes whose evolutionary relationships within the SNP-defined Y phylogeny are known. This allowed the identification of 146 historical gene conversion events involving individual PSVs, occurring at a rate of 2.9–8.4×10−4 events per generation. A consideration of the nature of nucleotide change and the ancestral state of each PSV showed that the conversion process was significantly biased towards the fixation of G or C nucleotides (GC-biased), and also towards the ancestral state. Determination of haplotypes by long-PCR allowed likely co-conversion of PSVs to be identified, and suggested that conversion tract lengths are large, with a mean of 2068 bp, and a maximum in excess of 9 kb. Despite the frequent formation of recombination intermediates implied by the rapid observed gene conversion activity, resolution via crossover is rare: only three inversions within P6 were detected in the sample. An analysis of chimpanzee and gorilla P6 orthologs showed that the ancestral state bias has existed in all three species, and comparison of human and chimpanzee sequences with the gorilla outgroup confirmed that GC bias of the conversion process has apparently been active in both the human and chimpanzee lineages. The sex-determining role of the human Y chromosome makes it male-specific, and always present in only a single copy. This solo lifestyle has endowed it with some bizarre features, among which are eight large DNA units constituting about a quarter of the chromosome's length, and containing many genes important for sperm production. These units are called palindromes, since, taking into account the polarity of the DNA strands, the sequence is the same read from either end of the unit. We investigated the details of a process (gene conversion) that transfers sequence variants in one half of a palindrome into the other, thereby maintaining >99.9% similarity between the halves. We analysed patterns of sequence variants within one palindrome in a set of Y chromosomes whose evolutionary relationships are known. This allowed us to identify past gene conversion events, and to demonstrate a bias towards events that eliminate new variants, and retain old ones. Gene conversion has therefore acted during human evolution to retard sequence change in these regions. Analysis of the chimpanzee and gorilla versions of the palindrome shows that the dynamic processes we see in human Y chromosomes have a deep evolutionary history.
Collapse
Affiliation(s)
- Pille Hallast
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | | | - Georgina R. Bowden
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Stéphane Ballereau
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Mark A. Jobling
- Department of Genetics, University of Leicester, Leicester, United Kingdom
- * E-mail:
| |
Collapse
|
23
|
Willett CS. Gene conversion yields novel gene combinations in paralogs of GOT1 in the copepod Tigriopus californicus. BMC Evol Biol 2013; 13:148. [PMID: 23845062 PMCID: PMC3728101 DOI: 10.1186/1471-2148-13-148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 07/08/2013] [Indexed: 11/24/2022] Open
Abstract
Background Gene conversion of duplicated genes can slow the divergence of paralogous copies over time but can also result in other interesting evolutionary patterns. Islands of genetic divergence that persist in the face of gene conversion can point to gene regions undergoing selection for new functions. Novel combinations of genetic variation that differ greatly from the original sequence can result from the transfer of genetic variation between paralogous genes by rare gene conversion events. Genetically divergent populations of the copepod Tigriopus californicus provide an excellent model to look at the patterns of divergence among paralogs across multiple independent evolutionary lineages. Results In this study the evolution of a set of paralogous genes encoding putative aspartate transaminase proteins (called GOT1 here) are examined in populations of the copepod T. californicus. One pair of duplicated genes, GOT1p1 and GOT1p2, has regions of high divergence between the copies in the face of apparent on-going gene conversion. The GOT1p2 gene also has unique haplotypes in two populations that appear to have resulted from a transfer of genetic variation via inter-paralog gene conversion. A second pair of duplicated genes GOT1Sr and GOT1Sd also shows evidence of gene conversion, but this gene conversion does not appear to have maintained each as a functional copy in all populations. Conclusions The patterns of conservation and sequence divergence across this set of paralogous genes among populations of T. californicus suggest that some interesting evolutionary patterns are occurring at these loci. The results for the GOT1p1/GOT1p2 paralogs illustrate how gene conversion can factor in the creation of a mosaic pattern of regions of high divergence and low divergence. When coupled with rare gene conversion events of divergent regions, this pattern can result in the formation of novel proteins differing substantially from either original protein. The evolutionary patterns across these paralogs show how gene conversion can both constrain and facilitate diversification of genetic sequences.
Collapse
Affiliation(s)
- Christopher S Willett
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599-3280, USA.
| |
Collapse
|
24
|
Scozzari R, Massaia A, D’Atanasio E, Myres NM, Perego UA, Trombetta B, Cruciani F. Molecular dissection of the basal clades in the human Y chromosome phylogenetic tree. PLoS One 2012; 7:e49170. [PMID: 23145109 PMCID: PMC3492319 DOI: 10.1371/journal.pone.0049170] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2012] [Accepted: 10/04/2012] [Indexed: 11/19/2022] Open
Abstract
One hundred and forty-six previously detected mutations were more precisely positioned in the human Y chromosome phylogeny by the analysis of 51 representative Y chromosome haplogroups and the use of 59 mutations from literature. Twenty-two new mutations were also described and incorporated in the revised phylogeny. This analysis made it possible to identify new haplogroups and to resolve a deep trifurcation within haplogroup B2. Our data provide a highly resolved branching in the African-specific portion of the Y tree and support the hypothesis of an origin in the north-western quadrant of the African continent for the human MSY diversity.
Collapse
Affiliation(s)
- Rosaria Scozzari
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, Rome, Italy
| | - Andrea Massaia
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, Rome, Italy
| | - Eugenia D’Atanasio
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, Rome, Italy
| | - Natalie M. Myres
- Sorenson Molecular Genealogy Foundation, Salt Lake City, Utah, United States of America
- AncestryDNA, Provo, Utah, United States of America
| | - Ugo A. Perego
- Sorenson Molecular Genealogy Foundation, Salt Lake City, Utah, United States of America
- Dipartimento di Biologia e Biotecnologie “Lazzaro Spallanzani”, Università di Pavia, Pavia, Italy
| | - Beniamino Trombetta
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, Rome, Italy
| | - Fulvio Cruciani
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, Rome, Italy
| |
Collapse
|
25
|
Why chromosome palindromes? INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:207958. [PMID: 22844637 PMCID: PMC3403216 DOI: 10.1155/2012/207958] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2012] [Accepted: 05/09/2012] [Indexed: 11/25/2022]
Abstract
We look at sex-limited chromosome (Y or W) evolution with particular emphasis on the importance of palindromes. Y chromosome palindromes consist of inverted duplicates that allow for local recombination in an otherwise nonrecombining chromosome. Since palindromes enable intrachromosomal gene conversion that can help eliminate deleterious mutations, they are often highlighted as mechanisms to protect against Y degeneration. However, the adaptive significance of recombination resides in its ability to decouple the evolutionary fates of linked mutations, leading to both a decrease in degeneration rate and an increase in adaptation rate. Our paper emphasizes the latter, that palindromes may exist to accelerate adaptation by increasing the potential targets and fixation rates of incoming beneficial mutations. This hypothesis helps reconcile two enigmatic features of the “palindromes as protectors” view: (1) genes that are not located in palindromes have been retained under purifying selection for tens of millions of years, and (2) under models that only consider deleterious mutations, gene conversion benefits duplicate gene maintenance but not initial fixation. We conclude by looking at ways to test the hypothesis that palindromes enhance the rate of adaptive evolution of Y-linked genes and whether this effect can be extended to palindromes on other chromosomes.
Collapse
|
26
|
Xu K, Wu X, Tompkins JD, Her C. Assessment of anti-recombination and double-strand break-induced gene conversion in human cells by a chromosomal reporter. J Biol Chem 2012; 287:29543-53. [PMID: 22773873 DOI: 10.1074/jbc.m112.352302] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Gene conversion is one of the frequent end results of homologous recombination, and it often underlies the inactivation of tumor suppressor genes in cancer cells. Here, we have developed an integrated assay system that allows simultaneous examination of double-strand break (DSB)-induced gene conversion events at the site of a DSB (proximal region) and at a surrounding region ~1 kb away from the break (distal region). Utilizing this assay system, we find that gene conversion events at the proximal and distal regions are relatively independent of one another. The results also indicate that synthesis-dependent strand annealing (SDSA) plays a major role in DSB-induced gene conversion. In addition, our current study has demonstrated that hMLH1 plays an essential role in anti-recombination and gene conversion. Specifically, the anti-recombination activity of hMLH1 is partially dependent on its interaction with hMRE11. Our data suggests that the role of hMLH1 and hMRE11 in the process of gene conversion is complex, and these proteins play different roles in DSB-induced proximal and distal gene conversions. In particular, the involvement of hMLH1 and hMRE11 in the distal gene conversion requires both hMSH2 and heteroduplex formation.
Collapse
Affiliation(s)
- Keqian Xu
- From the School of Molecular Biosciences, Washington State University, Pullman, Washington 99164-7520
| | | | | | | |
Collapse
|
27
|
Zickler AM, Hampp S, Messiaen L, Bengesser K, Mussotter T, Roehl AC, Wimmer K, Mautner VF, Kluwe L, Upadhyaya M, Pasmant E, Chuzhanova N, Kestler HA, Högel J, Legius E, Claes K, Cooper DN, Kehrer-Sawatzki H. Characterization of the nonallelic homologous recombination hotspot PRS3 associated with type-3 NF1 deletions. Hum Mutat 2011; 33:372-83. [PMID: 22045503 DOI: 10.1002/humu.21644] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 10/06/2011] [Indexed: 12/21/2022]
Abstract
Nonallelic homologous recombination (NAHR) is the major mechanism underlying recurrent genomic rearrangements, including the large deletions at 17q11.2 that cause neurofibromatosis type 1 (NF1). Here, we identify a novel NAHR hotspot, responsible for type-3 NF1 deletions that span 1.0 Mb. Breakpoint clustering within this 1-kb hotspot, termed PRS3, was noted in 10 of 11 known type-3 NF1 deletions. PRS3 is located within the LRRC37B pseudogene of the NF1-REPb and NF1-REPc low-copy repeats. In contrast to other previously characterized NAHR hotspots, PRS3 has not developed on a preexisting allelic homologous recombination hotspot. Furthermore, the variation pattern of PRS3 and its flanking regions is unusual since only NF1-REPc (and not NF1-REPb) is characterized by a high single nucleotide polymorphism (SNP) frequency, suggestive of unidirectional sequence transfer via nonallelic homologous gene conversion (NAHGC). By contrast, the previously described intense NAHR hotspots within the CMT1A-REPs, and the PRS1 and PRS2 hotspots underlying type-1 NF1 deletions, experience frequent bidirectional sequence transfer. PRS3 within NF1-REPc was also found to be involved in NAHGC with the LRRC37B gene, the progenitor locus of the LRRC37B-P duplicons, as indicated by the presence of shared SNPs between these loci. PRS3 therefore represents a weak (and probably evolutionarily rather young) NAHR hotspot with unique properties.
Collapse
Affiliation(s)
- Antje M Zickler
- Institute of Human Genetics, University of Ulm, Albert-Einstein-Allee 11, Ulm, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Rapid evolution and copy number variation of primate RHOXF2, an X-linked homeobox gene involved in male reproduction and possibly brain function. BMC Evol Biol 2011; 11:298. [PMID: 21988730 PMCID: PMC3214919 DOI: 10.1186/1471-2148-11-298] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Accepted: 10/12/2011] [Indexed: 11/30/2022] Open
Abstract
Background Homeobox genes are the key regulators during development, and they are in general highly conserved with only a few reported cases of rapid evolution. RHOXF2 is an X-linked homeobox gene in primates. It is highly expressed in the testicle and may play an important role in spermatogenesis. As male reproductive system is often the target of natural and/or sexual selection during evolution, in this study, we aim to dissect the pattern of molecular evolution of RHOXF2 in primates and its potential functional consequence. Results We studied sequences and copy number variation of RHOXF2 in humans and 16 nonhuman primate species as well as the expression patterns in human, chimpanzee, white-browed gibbon and rhesus macaque. The gene copy number analysis showed that there had been parallel gene duplications/losses in multiple primate lineages. Our evidence suggests that 11 nonhuman primate species have one RHOXF2 copy, and two copies are present in humans and four Old World monkey species, and at least 6 copies in chimpanzees. Further analysis indicated that the gene duplications in primates had likely been mediated by endogenous retrovirus (ERV) sequences flanking the gene regions. In striking contrast to non-human primates, humans appear to have homogenized their two RHOXF2 copies by the ERV-mediated non-allelic recombination mechanism. Coding sequence and phylogenetic analysis suggested multi-lineage strong positive selection on RHOXF2 during primate evolution, especially during the origins of humans and chimpanzees. All the 8 coding region polymorphic sites in human populations are non-synonymous, implying on-going selection. Gene expression analysis demonstrated that besides the preferential expression in the reproductive system, RHOXF2 is also expressed in the brain. The quantitative data suggests expression pattern divergence among primate species. Conclusions RHOXF2 is a fast-evolving homeobox gene in primates. The rapid evolution and copy number changes of RHOXF2 had been driven by Darwinian positive selection acting on the male reproductive system and possibly also on the central nervous system, which sheds light on understanding the role of homeobox genes in adaptive evolution.
Collapse
|
29
|
Cruciani F, Trombetta B, Massaia A, Destro-Bisol G, Sellitto D, Scozzari R. A revised root for the human Y chromosomal phylogenetic tree: the origin of patrilineal diversity in Africa. Am J Hum Genet 2011; 88:814-818. [PMID: 21601174 DOI: 10.1016/j.ajhg.2011.05.002] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2011] [Revised: 04/21/2011] [Accepted: 05/02/2011] [Indexed: 10/18/2022] Open
Abstract
To shed light on the structure of the basal backbone of the human Y chromosome phylogeny, we sequenced about 200 kb of the male-specific region of the human Y chromosome (MSY) from each of seven Y chromosomes belonging to clades A1, A2, A3, and BT. We detected 146 biallelic variant sites through this analysis. We used these variants to construct a patrilineal tree, without taking into account any previously reported information regarding the phylogenetic relationships among the seven Y chromosomes here analyzed. There are several key changes at the basal nodes as compared with the most recent reference Y chromosome tree. A different position of the root was determined, with important implications for the origin of human Y chromosome diversity. An estimate of 142 KY was obtained for the coalescence time of the revised MSY tree, which is earlier than that obtained in previous studies and easier to reconcile with plausible scenarios of modern human origin. The number of deep branchings leading to African-specific clades has doubled, further strengthening the MSY-based evidence for a modern human origin in the African continent. An analysis of 2204 African DNA samples showed that the deepest clades of the revised MSY phylogeny are currently found in central and northwest Africa, opening new perspectives on early human presence in the continent.
Collapse
|
30
|
The Rate and Tract Length of Gene Conversion between Duplicated Genes. Genes (Basel) 2011; 2:313-31. [PMID: 24710193 PMCID: PMC3924818 DOI: 10.3390/genes2020313] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2011] [Revised: 03/11/2011] [Accepted: 03/17/2011] [Indexed: 11/26/2022] Open
Abstract
Interlocus gene conversion occurs such that a certain length of DNA fragment is non-reciprocally transferred (copied and pasted) between paralogous regions. To understand the rate and tract length of gene conversion, there are two major approaches. One is based on mutation-accumulation experiments, and the other uses natural DNA sequence variation. In this review, we overview the two major approaches and discuss their advantages and disadvantages. In addition, to demonstrate the importance of statistical analysis of empirical and evolutionary data for estimating tract length, we apply a maximum likelihood method to several data sets.
Collapse
|
31
|
Menkis A, Whittle CA, Johannesson H. Gene genealogies indicates abundant gene conversions and independent evolutionary histories of the mating-type chromosomes in the evolutionary history of Neurospora tetrasperma. BMC Evol Biol 2010; 10:234. [PMID: 20673371 PMCID: PMC2923516 DOI: 10.1186/1471-2148-10-234] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Accepted: 07/31/2010] [Indexed: 11/27/2022] Open
Abstract
Background The self-fertile filamentous ascomycete Neurospora tetrasperma contains a large (~7 Mbp) and young (< 6 MYA) region of suppressed recombination within its mating-type (mat) chromosomes. The objective of the present study is to reveal the evolutionary history, including key genomic events, associated with the various regions of the mat chromosomes among ten strains representing all the nine known species (lineages) contained within the N. tetrasperma species complex. Results Comparative analysis of sequence divergence among alleles of 24 mat-linked genes (mat A and mat a) indicates that a large region of suppressed recombination exists within the mat chromosome for each of nine lineages of N. tetrasperma sensu latu. The recombinationally suppressed region varies in size and gene composition among lineages, and is flanked on both ends by normally recombining regions. Genealogical analyses among lineages reveals that eight gene conversion events have occurred between homologous mat A and mat a-linked alleles of genes located within the region of restricted recombination during the evolutionary history of N. tetrasperma. Conclusions We conclude that the region of suppressed recombination in the mat chromosomes has likely been subjected to independent contraction and/or expansion during the evolutionary history of the N. tetrasperma species complex. Furthermore, we infer that gene conversion events are likely a common phenomenon within this recombinationally suppressed genomic region. We argue that gene conversions might provide an efficient mechanism of adaptive editing of functional genes, including the removal of deleterious mutations, within the young recombinationally suppressed region of the mat chromosomes.
Collapse
Affiliation(s)
- Audrius Menkis
- Uppsala BioCenter, Department of Forest Mycology and Pathology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | | | | |
Collapse
|
32
|
Marais GAB, Campos PRA, Gordo I. Can intra-Y gene conversion oppose the degeneration of the human Y chromosome? A simulation study. Genome Biol Evol 2010; 2:347-57. [PMID: 20624739 PMCID: PMC2997549 DOI: 10.1093/gbe/evq026] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The human Y is a genetically degenerate chromosome, which has lost about 97% of the genes originally present. Most of the remaining human Y genes are in large duplicated segments (ampliconic regions) undergoing intense Y–Y gene conversion. It has been suggested that Y–Y gene conversion may help these genes getting rid of deleterious mutations that would inactivate them otherwise. Here, we tested this idea by simulating the evolution of degenerating Y chromosomes with or without gene conversion using the most up-to-date population genetics parameters for humans. We followed the fate of a variant with Y–Y gene conversion in a population of Y chromosomes where Y–Y gene conversion is originally absent. We found that this variant gets fixed more frequently than the neutral expectation, which supports the idea that gene conversion is beneficial for a degenerating Y chromosome. Interestingly, a very high rate of gene conversion is needed for an effect of gene conversion to be observed. This suggests that high levels of Y-Y gene conversion observed in humans may have been selected to oppose the Y degeneration. We also studied with a similar approach the evolution of ampliconic regions on the Y chromosomes and found that the fixation of many copies at once is unlikely, which suggest these regions probably evolved gradually unless selection for increased dosage favored large-scale duplication events. Exploring the parameter space showed that Y–Y gene conversion may be beneficial in most mammalian species, which is consistent with recent data in chimpanzees and mice.
Collapse
Affiliation(s)
- Gabriel A B Marais
- Université Lyon 1, Centre National de la Recherche Scientifique, UMR5558, Laboratoire de Biométrie et Biologie évolutive, Villeurbanne, France.
| | | | | |
Collapse
|
33
|
Aleshin A, Zhi D. Recombination-associated sequence homogenization of neighboring Alu elements: signature of nonallelic gene conversion. Mol Biol Evol 2010; 27:2300-11. [PMID: 20453015 DOI: 10.1093/molbev/msq116] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Recently, researchers have begun to recognize that, in order to establish neutral models for disease association and evolutionary genomics studies, it is crucial to have a clear understanding of the genomic impact of nonallelic gene conversion. Drawing on previous successes in characterizing this phenomenon over protein-coding gene families, we undertook a computational analysis of neighboring Alu sequences in the genome scale. For this purpose, we developed adjusted comutation rate (aCMR), a novel statistical method measuring the excess number of identical point mutations shared by adjacent Alu sequences, vis-à-vis random pairs. Using aCMR, we uncovered a remarkable genome-wide sequence homogenization of neighboring Alus, with the strongest signal observed in the pseudoautosomal regions of the X and Y chromosomes. The magnitude of sequence homogenization between Alu pairs is greater with shorter interlocus distance, higher sequence identity, and parallel orientation. Moreover, shared substitutions show a strong directionality toward GC nucleotides, with multiple substitutions tending to cluster within the Alu sequence. Taken together, these observed recombination-associated sequence homogenization patterns are best explained by frequent ubiquitous gene conversion events between neighboring Alus. We believe that these observations help to illuminate the nature and impact of the enigmatic phenomenon of gene conversion.
Collapse
Affiliation(s)
- Alexey Aleshin
- Department of Medicine, Division of Hematology, Oncology, David Geffen School of Medicine, University of California, Los Angeles, USA
| | | |
Collapse
|
34
|
Navarro-Costa P, Gonçalves J, Plancha CE. The AZFc region of the Y chromosome: at the crossroads between genetic diversity and male infertility. Hum Reprod Update 2010; 16:525-42. [PMID: 20304777 PMCID: PMC2918367 DOI: 10.1093/humupd/dmq005] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The three azoospermia factor (AZF) regions of the Y chromosome represent genomic niches for spermatogenesis genes. Yet, the most distal region, AZFc, is a major generator of large-scale variation in the human genome. Determining to what extent this variability affects spermatogenesis is a highly contentious topic in human reproduction. METHODS In this review, an extensive characterization of the molecular mechanisms responsible for AZFc genotypical variation is undertaken. Such data are complemented with the assessment of the clinical consequences for male fertility imputable to the different AZFc variants. For this, a critical re-evaluation of 23 association studies was performed in order to extract unifying conclusions by curtailing methodological heterogeneities. RESULTS Intrachromosomal homologous recombination mechanisms, either crossover or non-crossover based, are the main drivers for AZFc genetic diversity. In particular, rearrangements affecting gene dosage are the most likely to introduce phenotypical disruptions in the spermatogenic profile. In the specific cases of partial AZFc deletions, both the actual existence and the severity of the spermatogenic defect are dependent on the evolutionary background of the Y chromosome. CONCLUSIONS AZFc is one of the most genetically dynamic regions in the human genome. This property may serve as counter against the genetic degeneracy associated with the lack of a meiotic partner. However, such strategy comes at a price: some rearrangements represent a risk factor or a de-facto causative agent of spermatogenic disruption. Interestingly, this precarious balance is modulated, among other yet unknown factors, by the evolutionary history of the Y chromosome.
Collapse
Affiliation(s)
- Paulo Navarro-Costa
- Instituto de Medicina Molecular, Faculdade de Medicina de Lisboa, Lisboa, Portugal.
| | | | | |
Collapse
|
35
|
Genome destabilization by homologous recombination in the germ line. Nat Rev Mol Cell Biol 2010; 11:182-95. [PMID: 20164840 DOI: 10.1038/nrm2849] [Citation(s) in RCA: 159] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Meiotic recombination, which promotes proper homologous chromosome segregation at the first meiotic division, normally occurs between allelic sequences on homologues. However, recombination can also take place between non-allelic DNA segments that share high sequence identity. Such non-allelic homologous recombination (NAHR) can markedly alter genome architecture during gametogenesis by generating chromosomal rearrangements. Indeed, NAHR-mediated deletions, duplications, inversions and other alterations have been implicated in numerous human genetic disorders. Studies in yeast have provided insights into the molecular mechanisms of meiotic NAHR as well as the cellular strategies that limit it.
Collapse
|
36
|
Rane HS, Smith JM, Bergthorsson U, Katju V. Gene conversion and DNA sequence polymorphism in the sex-determination gene fog-2 and its paralog ftr-1 in Caenorhabditis elegans. Mol Biol Evol 2010; 27:1561-9. [PMID: 20133352 DOI: 10.1093/molbev/msq039] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Gene conversion, a form of concerted evolution, bears enormous potential to shape the trajectory of sequence and functional divergence of gene paralogs subsequent to duplication events. fog-2, a sex-determination gene unique to Caenorhabditis elegans and implicated in the origin of hermaphroditism in this species, resulted from the duplication of ftr-1, an upstream gene of unknown function. Synonymous sequence divergence in regions of fog-2 and ftr-1 (excluding recent gene conversion tracts) suggests that the duplication occurred 46 million generations ago. Gene conversion between fog-2 and ftr-1 was previously discovered in experimental fog-2 knockout lines of C. elegans, whereby hermaphroditism was restored in mutant obligately outcrossing male-female populations. We analyzed DNA-sequence variation in fog-2 and ftr-1 within 40 isolates of C. elegans from diverse geographic locations in order to evaluate the contribution of gene conversion to genetic variation in the two gene paralogs. The analysis shows that gene conversion contributes significantly to DNA-sequence diversity in fog-2 and ftr-1 (22% and 34%, respectively) and may have the potential to alter sexual phenotypes in natural populations. A radical amino acid change in a conserved region of the F-box domain of fog-2 was found in natural isolates of C. elegans with significantly lower fecundity. We hypothesize that the lowered fecundity is due to reduced masculinization and less sperm production and that amino acid replacement substitutions and gene conversion in fog-2 may contribute significantly to variation in the degree of inbreeding and outcrossing in natural populations.
Collapse
Affiliation(s)
- Hallie S Rane
- Department of Biology, University of New Mexico, NM, USA
| | | | | | | |
Collapse
|
37
|
Trombetta B, Cruciani F, Underhill PA, Sellitto D, Scozzari R. Footprints of X-to-Y gene conversion in recent human evolution. Mol Biol Evol 2009; 27:714-25. [PMID: 19812029 DOI: 10.1093/molbev/msp231] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Different X-homologous regions of the male-specific portion of the human Y chromosome (MSY) are characterized by a different content of putative single nucleotide polymorphisms (SNPs), as reported in public databases. The possible role of X-to-Y nonallelic gene conversion in contributing to these differences remains poorly understood. We explored this issue by analyzing sequence variation in three regions of the MSY characterized by a different degree of X-Y similarity and a different density of putative SNPs: the PCDH11Y gene in the X-transposed (X-Y identity 99%, high putative SNP content); the TBL1Y gene in the X-degenerate (X-Y identity 86-88%, low putative SNP content); and VCY genes-containing region in the P8 palindrome (X-Y identity 95%, low putative SNP content). Present findings do not provide any evidence for gene conversion in the PCDH11Y and TBL1Y genes; they also strongly suggest that most putative SNPs of the PCDH11Y gene (and possibly the entire X-transposed region) are most likely X-Y paralogous sequence variants, which have been entered in the databases as SNPs. On the other hand, clear evidence for the VCY genes in the P8 palindrome having acted as an acceptor of X-to-Y gene conversion was obtained. A rate of 1.8 x 10(-7) X-to-Y conversions/bp/year was estimated for these genes. These findings indicate that in the VCY region of the MSY, X-to-Y gene conversion can be highly effective to increase the level of diversity among human Y chromosomes and suggest an additional explanation for the ability of the Y chromosome to retard degradation during evolution. Present data are expected to pave the way for future investigations on the role of nonallelic gene conversion in double-strand break repair and the maintenance of Y chromosome integrity.
Collapse
Affiliation(s)
- Beniamino Trombetta
- Dipartimento di Genetica e Biologia Molecolare, Sapienza Università di Roma, Rome, Italy
| | | | | | | | | |
Collapse
|
38
|
Rosser ZH, Balaresque P, Jobling MA. Gene conversion between the X chromosome and the male-specific region of the Y chromosome at a translocation hotspot. Am J Hum Genet 2009; 85:130-4. [PMID: 19576564 PMCID: PMC2706966 DOI: 10.1016/j.ajhg.2009.06.009] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2009] [Revised: 06/15/2009] [Accepted: 06/16/2009] [Indexed: 11/18/2022] Open
Abstract
Outside the pseudoautosomal regions, the mammalian sex chromosomes are thought to have been genetically isolated for up to 350 million years. However, in humans pathogenic XY translocations occur in XY-homologous (gametologous) regions, causing sex-reversal and infertility. Gene conversion might accompany recombination intermediates that resolve without translocation and persist in the population. We resequenced X and Y copies of a translocation hotspot adjacent to the PRKX and PRKY genes and found evidence of historical exchange between the male-specific region of the human Y and the X in patchy flanking gene-conversion tracts on both chromosomes. The rate of X-to-Y conversion (per base per generation) is four to five orders of magnitude more rapid than the rate of Y-chromosomal base-substitution mutation, and given assumptions about the recombination history of the X locus, tract lengths have an overall average length of approximately 100 bp. Sequence exchange outside the pseudoautosomal regions could play a role in protecting the Y-linked copies of gametologous genes from degeneration.
Collapse
Affiliation(s)
- Zoë H Rosser
- Department of Genetics, University of Leicester, University Road, Leicester, UK
| | | | | |
Collapse
|
39
|
|
40
|
Nakken S, Rødland EA, Rognes T, Hovig E. Large-scale inference of the point mutational spectrum in human segmental duplications. BMC Genomics 2009; 10:43. [PMID: 19161616 PMCID: PMC2640414 DOI: 10.1186/1471-2164-10-43] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 01/22/2009] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Recent segmental duplications are relatively large (> or = 1 kb) genomic regions of high sequence identity (> or = 90%). They cover approximately 4-5% of the human genome and play important roles in gene evolution and genomic disease. The DNA sequence differences between copies of a segmental duplication represent the result of various mutational events over time, since any two duplication copies originated from the same ancestral DNA sequence. Based on this fact, we have developed a computational scheme for inference of point mutational events in human segmental duplications, which we collectively term duplication-inferred mutations (DIMs). We have characterized these nucleotide substitutions by comparing them with high-quality SNPs from dbSNP, both in terms of sequence context and frequency of substitution types. RESULTS Overall, DIMs show a lower ratio of transitions relative to transversions than SNPs, although this ratio approaches that of SNPs when considering DIMs within most recent duplications. Our findings indicate that DIMs and SNPs in general are caused by similar mutational mechanisms, with some deviances at the CpG dinucleotide. Furthermore, we discover a large number of reference SNPs that coincide with computationally inferred DIMs. The latter reflects how sequence variation in duplicated sequences can be misinterpreted as ordinary allelic variation. CONCLUSION In summary, we show how DNA sequence analysis of segmental duplications can provide a genome-wide mutational spectrum that mirrors recent genome evolution. The inferred set of nucleotide substitutions represents a valuable complement to SNPs for the analysis of genetic variation and point mutagenesis.
Collapse
Affiliation(s)
- Sigve Nakken
- Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316 Oslo, Norway.
| | | | | | | |
Collapse
|
41
|
Balaresque P, Bowden GR, Parkin EJ, Omran GA, Heyer E, Quintana-Murci L, Roewer L, Stoneking M, Nasidze I, Carvalho-Silva DR, Tyler-Smith C, de Knijff P, Jobling MA. Dynamic nature of the proximal AZFc region of the human Y chromosome: multiple independent deletion and duplication events revealed by microsatellite analysis. Hum Mutat 2008; 29:1171-80. [PMID: 18470947 PMCID: PMC2689608 DOI: 10.1002/humu.20757] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The human Y chromosome shows frequent structural variants, some of which are selectively neutral, while others cause impaired fertility due to the loss of spermatogenic genes. The large-scale use of multiple Y-chromosomal microsatellites in forensic and population genetic studies can reveal such variants, through the absence or duplication of specific markers in haplotypes. We describe Y chromosomes in apparently normal males carrying null and duplicated alleles at the microsatellite DYS448, which lies in the proximal part of the azoospermia factor c (AZFc) region, important in spermatogenesis, and made up of "ampliconic" repeats that act as substrates for nonallelic homologous recombination (NAHR). Physical mapping in 26 DYS448 deletion chromosomes reveals that only three cases belong to a previously described class, representing independent occurrences of an approximately 1.5-Mb deletion mediated by recombination between the b1 and b3 repeat units. The remainder belong to five novel classes; none appears to be mediated through homologous recombination, and all remove some genes, but are likely to be compatible with normal fertility. A combination of deletion analysis with binary-marker and microsatellite haplotyping shows that the 26 deletions represent nine independent events. Nine DYS448 duplication chromosomes can be explained by four independent events. Some lineages have risen to high frequency in particular populations, in particular a deletion within haplogroup (hg) C(*)(xC3a,C3c) found in 18 Asian males. The nonrandom phylogenetic distribution of duplication and deletion events suggests possible structural predisposition to such mutations in hgs C and G.
Collapse
|
42
|
Münch C, Kirsch S, Fernandes AMG, Schempp W. Evolutionary analysis of the highly dynamic CHEK2 duplicon in anthropoids. BMC Evol Biol 2008; 8:269. [PMID: 18831734 PMCID: PMC2566985 DOI: 10.1186/1471-2148-8-269] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Accepted: 10/02/2008] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Segmental duplications (SDs) are euchromatic portions of genomic DNA (> or = 1 kb) that occur at more than one site within the genome, and typically share a high level of sequence identity (>90%). Approximately 5% of the human genome is composed of such duplicated sequences. Here we report the detailed investigation of CHEK2 duplications. CHEK2 is a multiorgan cancer susceptibility gene encoding a cell cycle checkpoint kinase acting in the DNA-damage response signalling pathway. The continuous presence of the CHEK2 gene in all eukaryotes and its important role in maintaining genome stability prompted us to investigate the duplicative evolution and phylogeny of CHEK2 and its paralogs during anthropoid evolution. RESULTS To study CHEK2 duplicon evolution in anthropoids we applied a combination of comparative FISH and in silico analyses. Our comparative FISH results with a CHEK2 fosmid probe revealed the single-copy status of CHEK2 in New World monkeys, Old World monkeys and gibbons. Whereas a single CHEK2 duplication was detected in orangutan, a multi-site signal pattern indicated a burst of duplication in African great apes and human. Phylogenetic analysis of paralogous and ancestral CHEK2 sequences in human, chimpanzee and rhesus macaque confirmed this burst of duplication, which occurred after the radiation of orangutan and African great apes. In addition, we used inter-species quantitative PCR to determine CHEK2 copy numbers. An amplification of CHEK2 was detected in African great apes and the highest CHEK2 copy number of all analysed species was observed in the human genome. Furthermore, we detected variation in CHEK2 copy numbers within the analysed set of human samples. CONCLUSION Our detailed analysis revealed the highly dynamic nature of CHEK2 duplication during anthropoid evolution. We determined a burst of CHEK2 duplication after the radiation of orangutan and African great apes and identified the highest CHEK2 copy number in human. In conclusion, our analysis of CHEK2 duplicon evolution revealed that SDs contribute to inter-species variation. Furthermore, our qPCR analysis led us to presume CHEK2 copy number variation in human, and molecular diagnostics of the cancer susceptibility gene CHEK2 inside the duplicated region might be hampered by the individual-specific set of duplicons.
Collapse
Affiliation(s)
- Claudia Münch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - Stefan Kirsch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - António MG Fernandes
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| | - Werner Schempp
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
| |
Collapse
|
43
|
Kjeldbjerg AL, Villesen P, Aagaard L, Pedersen FS. Gene conversion and purifying selection of a placenta-specific ERV-V envelope gene during simian evolution. BMC Evol Biol 2008; 8:266. [PMID: 18826608 PMCID: PMC2567338 DOI: 10.1186/1471-2148-8-266] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2008] [Accepted: 09/30/2008] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND Most human endogenous retroviruses (HERVs) invaded our genome at least 25 million years ago. The majority of the viral genes are degenerated, since no selection preserves them within the genome. However, a few intact and very old HERV genes exist, and likely are beneficial for the host. We here address evolutionary aspects of two HERV-V envelope genes, ENVV1 and ENVV2, located in tandem and containing a long open reading frame. RESULTS The ENVV2 gene is preserved with an intact reading frame during simian evolution, but none of the ENVV genes are found in the prosimian species tested. While we observe many transposon insertions in the gag and pol regions of the ERV-V2 provirus, the ENVV2 genes have escaped transposon crossfire in all species tested. Additional analysis of nucleotide substitutions provides further strong evidence of purifying selection on the ENVV2 gene during primate evolution. The other copy, ENVV1, seems to be involved in gene conversion of the major part of the envelope. Furthermore, ENVV1 and ENVV2 show placenta-specific expression in human and a baboon species. CONCLUSION Our analyses show that ERV-V entered our genome after the split between simian and prosimian primates. Subsequent purifying selection and gene conversion have preserved two copies of the ENVV envelope gene in most species. This is the first case of gene conversion involving long open reading frames in HERVs. Together with the placenta-specific expression of the human and baboon ENVV1 and ENVV2 envelope genes, these data provide strong evidence of a beneficial role for the host.
Collapse
Affiliation(s)
- Anders L Kjeldbjerg
- Department of Molecular Biology, University of Aarhus, DK-8000 Aarhus C, Denmark.
| | | | | | | |
Collapse
|
44
|
Complex germline and somatic mutation processes at a haploid human minisatellite shown by single-molecule analysis. Mutat Res 2008; 648:46-53. [PMID: 18929582 PMCID: PMC2599865 DOI: 10.1016/j.mrfmmm.2008.09.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2008] [Revised: 09/03/2008] [Accepted: 09/12/2008] [Indexed: 11/21/2022]
Abstract
Mutation at most human minisatellites is driven by complex interallelic processes that give rise to a high degree of length polymorphism and internal structural variation. MSY1, the only highly variable minisatellite on the non-recombining region of the Y chromosome, is constitutively haploid and therefore precluded from interallelic interactions, yet maintains high diversity in both length and structure. To investigate the basis of its mutation processes, an unbiased structural analysis of >500 single-molecule MSY1 PCR products from matched sperm and blood samples from a single donor was undertaken. The overall mutation frequencies in sperm and blood DNAs were not significantly different, at 2.68% and 1.88%, respectively. Sperm DNA showed significantly more length mutants than blood DNA, with mutants in both tissues involving small-scale (1–3 repeat units in a 77 repeat progenitor allele) increases or decreases in repeat block lengths, with no gain or loss bias. Isometric mutations altering structure but not length were found in both tissues, and involved either the apparent shift of a boundary between repeat unit blocks (a ‘boundary switch’) or the conversion of a repeat within a block to a different repeat type (‘modular structure’ mutant). There was a significant excess of boundary switch mutants and deficit of modular structure mutants in sperm. A comparison of mutant structures with phylogenetically matched alleles in population samples showed that alleles with structures resembling the blood mutants were unlikely to arise in populations. Mutation seems likely to involve gene conversion via synthesis-dependent strand annealing, and the blood-sperm differences may reflect more relaxed constraint on sister chromatid alignment in blood.
Collapse
|
45
|
Balaresque P, Parkin EJ, Roewer L, Carvalho-Silva DR, Mitchell RJ, van Oorschot RAH, Henke J, Stoneking M, Nasidze I, Wetton J, de Knijff P, Tyler-Smith C, Jobling MA. Genomic complexity of the Y-STR DYS19: inversions, deletions and founder lineages carrying duplications. Int J Legal Med 2008; 123:15-23. [PMID: 18553096 DOI: 10.1007/s00414-008-0253-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2007] [Accepted: 05/02/2008] [Indexed: 11/30/2022]
Abstract
The Y-STR DYS19 is firmly established in the repertoire of Y-chromosomal markers used in forensic analysis yet is poorly understood at the molecular level, lying in a complex genomic environment and exhibiting null alleles, as well as duplications and occasional triplications in population samples. Here, we analyse three null alleles and 51 duplications and show that DYS19 can also be involved in inversion events, so that even its location within the short arm of the Y chromosome is uncertain. Deletion mapping in the three chromosomes carrying null alleles shows that their deletions are less than approximately 300 kb in size. Haplotypic analysis with binary markers shows that they belong to three different haplogroups and so represent independent events. In contrast, a collection of 51 DYS19 duplication chromosomes belong to only four haplogroups: two are singletons and may represent somatic mutation in lymphoblastoid cell lines, but two, in haplogroups G and C3c, represent founder lineages that have spread widely in Central Europe/West Asia and East Asia, respectively. Consideration of candidate mechanisms underlying both deletions and duplications provides no evidence for the involvement of non-allelic homologous recombination, and they are likely to represent sporadic events with low mutation rates. Understanding the basis and population distribution of these DYS19 alleles will aid in the utilisation and interpretation of profiles that contain them.
Collapse
Affiliation(s)
- Patricia Balaresque
- Department of Genetics, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Xu S, Clark T, Zheng H, Vang S, Li R, Wong GKS, Wang J, Zheng X. Gene conversion in the rice genome. BMC Genomics 2008; 9:93. [PMID: 18298833 PMCID: PMC2277409 DOI: 10.1186/1471-2164-9-93] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2007] [Accepted: 02/25/2008] [Indexed: 02/01/2023] Open
Abstract
Background Gene conversion causes a non-reciprocal transfer of genetic information between similar sequences. Gene conversion can both homogenize genes and recruit point mutations thereby shaping the evolution of multigene families. In the rice genome, the large number of duplicated genes increases opportunities for gene conversion. Results To characterize gene conversion in rice, we have defined 626 multigene families in which 377 gene conversions were detected using the GENECONV program. Over 60% of the conversions we detected were between chromosomes. We found that the inter-chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P < 0.05). The frequencies of gene conversion on the same chromosome decreased with the physical distance between gene conversion partners. Ka/Ks analysis indicates that gene conversion is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less than ten percent. Pseudogenes in the rice genome with low similarity to Arabidopsis genes showed greater likelihood for gene conversion than those with high similarity to Arabidopsis genes. Functional annotations suggest that at least 14 multigene families related to disease or bacteria resistance were involved in conversion events. Conclusion The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes.
Collapse
Affiliation(s)
- Shuqing Xu
- Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing Proteomics Institute, Beijing 101300, China.
| | | | | | | | | | | | | | | |
Collapse
|
47
|
Chen JM, Cooper DN, Chuzhanova N, Férec C, Patrinos GP. Gene conversion: mechanisms, evolution and human disease. Nat Rev Genet 2007; 8:762-75. [PMID: 17846636 DOI: 10.1038/nrg2193] [Citation(s) in RCA: 449] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Gene conversion, one of the two mechanisms of homologous recombination, involves the unidirectional transfer of genetic material from a 'donor' sequence to a highly homologous 'acceptor'. Considerable progress has been made in understanding the molecular mechanisms that underlie gene conversion, its formative role in human genome evolution and its implications for human inherited disease. Here we assess current thinking about how gene conversion occurs, explore the key part it has played in fashioning extant human genes, and carry out a meta-analysis of gene-conversion events that are known to have caused human genetic disease.
Collapse
|
48
|
Sassi SO, Braun EL, Benner SA. The evolution of seminal ribonuclease: pseudogene reactivation or multiple gene inactivation events? Mol Biol Evol 2007; 24:1012-24. [PMID: 17267422 DOI: 10.1093/molbev/msm020] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Two approaches, one novel, are applied to analyze the divergent evolution of ruminant seminal ribonucleases (RNases), paralogs of the well-known pancreatic RNases of mammals. Here, the goal was to identify periods of divergence of seminal RNase under functional constraints, periods of divergence as a pseudogene, and periods of divergence driven by positive selection pressures. The classical approach involves the analysis of nonsynonymous to synonymous replacements ratios (omega) for the branches of the seminal RNase evolutionary tree. The novel approach coupled these analyses with the mapping of substitutions on the folded structure of the protein. These analyses suggest that seminal RNase diverged during much of its history after divergence from pancreatic RNase as a functioning protein, followed by homoplastic inactivations to create pseudogenes in multiple ruminant lineages. Further, they are consistent with adaptive evolution only in the most recent episode leading to the gene in modern oxen. These conclusions contrast sharply with the view, cited widely in the literature, that seminal RNase decayed after its formation by gene duplication into an inactive pseudogene, whose lesions were repaired in a reactivation event. Further, the 2 approaches, omega estimation and mapping of replacements on the protein structure, were compared by examining their utility for establishing the functional status of the seminal RNase genes in 2 deer species. Hog and roe deer share common lesions, which strongly suggests that the gene was inactive in their last common ancestor. In this specific example, the crystallographic approach made the correct implication more strongly than the omega approach. Studies of this type should contribute to an integrated framework of tools to assign functional and nonfunctional episodes to recently created gene duplicates and to understand more broadly how gene duplication leads to the emergence of proteins with novel functions.
Collapse
Affiliation(s)
- Slim O Sassi
- Foundation for Applied Molecular Evolution, Gainesville, Florida, USA.
| | | | | |
Collapse
|
49
|
Jobling MA, Lo ICC, Turner DJ, Bowden GR, Lee AC, Xue Y, Carvalho-Silva D, Hurles ME, Adams SM, Chang YM, Kraaijenbrink T, Henke J, Guanti G, McKeown B, van Oorschot RAH, Mitchell RJ, de Knijff P, Tyler-Smith C, Parkin EJ. Structural variation on the short arm of the human Y chromosome: recurrent multigene deletions encompassing Amelogenin Y. Hum Mol Genet 2006; 16:307-16. [PMID: 17189292 PMCID: PMC2590852 DOI: 10.1093/hmg/ddl465] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Structural polymorphism is increasingly recognized as a major form of human genome variation, and is particularly prevalent on the Y chromosome. Assay of the Amelogenin Y gene (AMELY) on Yp is widely used in DNA-based sex testing, and sometimes reveals males who have interstitial deletions. In a collection of 45 deletion males from 12 populations, we used a combination of sequence-tagged site mapping, and binary-marker and Y-short tandem repeat haplotyping to understand the structural basis of this variation. Of the 45 deletion males, 41 carry indistinguishable deletions, 3.0-3.8 Mb in size. Breakpoint mapping strongly implicates a mechanism of non-allelic homologous recombination between the proximal major array of TSPY gene-containing repeats, and a single distal copy of TSPY; this is supported by the estimation of TSPY copy number in deleted and non-deleted males. The remaining four males carry three distinct non-recurrent deletions (2.5-4.0 Mb), which may be due to non-homologous mechanisms. Haplotyping shows that TSPY-mediated deletions have arisen seven times independently in the sample. One instance, represented by 30 chromosomes mostly of Indian origin within haplogroup J2e1*/M241, has a time-to-most-recent-common-ancestor of approximately 7700+/-1300 years. In addition to AMELY, deletion males all lack the genes PRKY and TBL1Y, and the rarer deletion classes also lack PCDH11Y. The persistence and expansion of deletion lineages, together with direct phenotypic evidence, suggests that absence of these genes has no major deleterious effects.
Collapse
Affiliation(s)
- Mark A Jobling
- Department of Genetics, University of Leicester, University Road, Leicester LE1 7RH, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Lindsay SJ, Khajavi M, Lupski JR, Hurles ME. A chromosomal rearrangement hotspot can be identified from population genetic variation and is coincident with a hotspot for allelic recombination. Am J Hum Genet 2006; 79:890-902. [PMID: 17033965 PMCID: PMC1698570 DOI: 10.1086/508709] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2006] [Accepted: 08/22/2006] [Indexed: 01/15/2023] Open
Abstract
Insights into the origins of structural variation and the mutational mechanisms underlying genomic disorders would be greatly improved by a genomewide map of hotspots of nonallelic homologous recombination (NAHR). Moreover, our understanding of sequence variation within the duplicated sequences that are substrates for NAHR lags far behind that of sequence variation within the single-copy portion of the genome. Perhaps the best-characterized NAHR hotspot lies within the 24-kb-long Charcot-Marie-Tooth disease type 1A (CMT1A)-repeats (REPs) that sponsor deletions and duplications that cause peripheral neuropathies. We investigated structural and sequence diversity within the CMT1A-REPs, both within and between species. We discovered a high frequency of retroelement insertions, accelerated sequence evolution after duplication, extensive paralogous gene conversion, and a greater than twofold enrichment of SNPs in humans relative to the genome average. We identified an allelic recombination hotspot underlying the known NAHR hotspot, which suggests that the two processes are intimately related. Finally, we used our data to develop a novel method for inferring the location of an NAHR hotspot from sequence variation within segmental duplications and applied it to identify a putative NAHR hotspot within the LCR22 repeats that sponsor velocardiofacial syndrome deletions. We propose that a large-scale project to map sequence variation within segmental duplications would reveal a wealth of novel chromosomal-rearrangement hotspots.
Collapse
Affiliation(s)
- Sarah J Lindsay
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | | | | | | |
Collapse
|