1
|
Schwarzkopf EJ, Brandt N, Heil CS. The recombination landscape of introgression in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.04.574263. [PMID: 39026729 PMCID: PMC11257466 DOI: 10.1101/2024.01.04.574263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Meiotic recombination is an evolutionary force that acts by breaking up genomic linkage, increasing the efficacy of selection. Recombination is initiated with a double-strand break which is resolved via a crossover, which involves the reciprocal exchange of genetic material between homologous chromosomes, or a non-crossover, which results in small tracts of non-reciprocal exchange of genetic material. Crossover and non-crossover rates vary between species, populations, individuals, and across the genome. In recent years, recombination rate has been associated with the distribution of ancestry derived from past interspecific hybridization (introgression) in a variety of species. We explore this interaction of recombination and introgression by sequencing spores and detecting crossovers and non-crossovers from two crosses of the yeast Saccharomyces uvarum. One cross is between strains which each contain introgression from their sister species, S. eubayanus, while the other cross has no introgression present. We find that the recombination landscape is significantly different between S. uvarum crosses, and that some of these differences can be explained by the presence of introgression in one cross. Crossovers are reduced and non-crossovers are increased in heterozygous introgression compared to syntenic regions in the cross without introgression. This translates to reduced allele shuffling within introgressed regions, and an overall reduction of shuffling on most chromosomes with introgression compared to the syntenic regions and chromosomes without introgression. Our results suggest that hybridization can significantly influence the recombination landscape, and that the reduction in allele shuffling contributes to the initial purging of introgression in the generations following a hybridization event.
Collapse
Affiliation(s)
| | - Nathan Brandt
- Department of Biological Sciences, North Carolina State University, Raleigh, NC
| | | |
Collapse
|
2
|
Lampitto M, Barchi M. Recent advances in mechanisms ensuring the pairing, synapsis and segregation of XY chromosomes in mice and humans. Cell Mol Life Sci 2024; 81:194. [PMID: 38653846 PMCID: PMC11039559 DOI: 10.1007/s00018-024-05216-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 03/04/2024] [Accepted: 03/20/2024] [Indexed: 04/25/2024]
Abstract
Sex chromosome aneuploidies are among the most common variations in human whole chromosome copy numbers, with an estimated prevalence in the general population of 1:400 to 1:1400 live births. Unlike whole-chromosome aneuploidies of autosomes, those of sex chromosomes, such as the 47, XXY aneuploidy that causes Klinefelter Syndrome (KS), often originate from the paternal side, caused by a lack of crossover (CO) formation between the X and Y chromosomes. COs must form between all chromosome pairs to pass meiotic checkpoints and are the product of meiotic recombination that occurs between homologous sequences of parental chromosomes. Recombination between male sex chromosomes is more challenging compared to both autosomes and sex chromosomes in females, as it is restricted within a short region of homology between X and Y, called the pseudo-autosomal region (PAR). However, in normal individuals, CO formation occurs in PAR with a higher frequency than in any other region, indicating the presence of mechanisms that promote the initiation and processing of recombination in each meiotic division. In recent years, research has made great strides in identifying genes and mechanisms that facilitate CO formation in the PAR. Here, we outline the most recent and relevant findings in this field. XY chromosome aneuploidy in humans has broad-reaching effects, contributing significantly also to Turner syndrome, spontaneous abortions, oligospermia, and even infertility. Thus, in the years to come, the identification of genes and mechanisms beyond XY aneuploidy is expected to have an impact on the genetic counseling of a wide number of families and adults affected by these disorders.
Collapse
Affiliation(s)
- Matteo Lampitto
- Section of Anatomy, Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, Italy
| | - Marco Barchi
- Section of Anatomy, Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, Italy.
- Section of Anatomy, Department of Medicine, Saint Camillus International University of Health Sciences, Rome, Italy.
| |
Collapse
|
3
|
Browning SR, Browning BL. Biobank-scale inference of multi-individual identity by descent and gene conversion. Am J Hum Genet 2024; 111:691-700. [PMID: 38513668 PMCID: PMC11023918 DOI: 10.1016/j.ajhg.2024.02.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 02/26/2024] [Accepted: 02/27/2024] [Indexed: 03/23/2024] Open
Abstract
We present a method for efficiently identifying clusters of identical-by-descent haplotypes in biobank-scale sequence data. Our multi-individual approach enables much more computationally efficient inference of identity by descent (IBD) than approaches that infer pairwise IBD segments and provides locus-specific IBD clusters rather than IBD segments. Our method's computation time, memory requirements, and output size scale linearly with the number of individuals in the dataset. We also present a method for using multi-individual IBD to detect alleles changed by gene conversion. Application of our methods to the autosomal sequence data for 125,361 White British individuals in the UK Biobank detects more than 9 million converted alleles. This is 2,900 times more alleles changed by gene conversion than were detected in a previous analysis of familial data. We estimate that more than 250,000 sequenced probands and a much larger number of additional genomes from multi-generational family members would be required to find a similar number of alleles changed by gene conversion using a family-based approach. Our IBD clustering method is implemented in the open-source ibd-cluster software package.
Collapse
Affiliation(s)
- Sharon R Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA.
| | - Brian L Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
4
|
Lu M, Zhou L, Gui JF. Evolutionary mechanisms and practical significance of reproductive success and clonal diversity in unisexual vertebrate polyploids. SCIENCE CHINA. LIFE SCIENCES 2024; 67:449-459. [PMID: 38198030 DOI: 10.1007/s11427-023-2486-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 11/01/2023] [Indexed: 01/11/2024]
Abstract
Unisexual reproduction is generally relevant to polyploidy, and unisexual vertebrates are often considered an evolutionary "dead end" due to the accumulation of deleterious mutations and absence of genetic diversity. However, some unisexual polyploids have developed strategies to avoid genomic decay, and thus provide ideal models to unveil unexplored evolutionary mechanisms, from the reproductive success to clonal diversity creation. This article reviews the evolutionary mechanisms for overcoming meiotic barrier and generating genetic diversity in unisexual vertebrates, and summarizes recent research advancements in the polyploid Carassius complex. Gynogenetic gibel carp (Carassius gibelio) is a unique amphitriploid that has undergone a recurrent autotriploidy and has overcome the bottleneck of triploid sterility via gynogenesis. Recently, an efficient strategy in which ploidy changes, including from amphitriploid to amphitetraploid, then from amphitetraploid to novel amphitriploid, drive unisexual-sexual-unisexual reproduction transition and clonal diversity has been revealed. Based on this new discovery, multigenomic reconstruction biotechnology has been used to breed a novel strain with superior growth and stronger disease resistance. Moreover, a unique reproduction mode that combines both abilities of ameiotic oogenesis and sperm-egg fusion, termed as ameio-fusiongensis, has been discovered, and it provides an efficient approach to synthesize sterile allopolyploids. In order to avoid ecological risks upon escape and protect the sustainable property rights of the aquaculture seed industry, a controllable fertility biotechnology approach for precise breeding is being developed by integrating sterile allopolyploid synthesis and gene-editing techniques. This review provides novel insights into the origin and evolution of unisexual vertebrates and into the attempts being made to exploit new breeding biotechnologies in aquaculture.
Collapse
Affiliation(s)
- Meng Lu
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, the Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Li Zhou
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, the Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Jian-Fang Gui
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, the Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
5
|
Versoza CJ, Weiss S, Johal R, La Rosa B, Jensen JD, Pfeifer SP. Novel Insights into the Landscape of Crossover and Noncrossover Events in Rhesus Macaques (Macaca mulatta). Genome Biol Evol 2024; 16:evad223. [PMID: 38051960 PMCID: PMC10773715 DOI: 10.1093/gbe/evad223] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 11/04/2023] [Accepted: 11/28/2023] [Indexed: 12/07/2023] Open
Abstract
Meiotic recombination landscapes differ greatly between distantly and closely related taxa, populations, individuals, sexes, and even within genomes; however, the factors driving this variation are yet to be well elucidated. Here, we directly estimate contemporary crossover rates and, for the first time, noncrossover rates in rhesus macaques (Macaca mulatta) from four three-generation pedigrees comprising 32 individuals. We further compare these results with historical, demography-aware, linkage disequilibrium-based recombination rate estimates. From paternal meioses in the pedigrees, 165 crossover events with a median resolution of 22.3 kb were observed, corresponding to a male autosomal map length of 2,357 cM-approximately 15% longer than an existing linkage map based on human microsatellite loci. In addition, 85 noncrossover events with a mean tract length of 155 bp were identified-similar to the tract lengths observed in the only other two primates in which noncrossovers have been studied to date, humans and baboons. Consistent with observations in other placental mammals with PRDM9-directed recombination, crossover (and to a lesser extent noncrossover) events in rhesus macaques clustered in intergenic regions and toward the chromosomal ends in males-a pattern in broad agreement with the historical, sex-averaged recombination rate estimates-and evidence of GC-biased gene conversion was observed at noncrossover sites.
Collapse
Affiliation(s)
- Cyril J Versoza
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
| | - Sarah Weiss
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Ravneet Johal
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Bruno La Rosa
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
6
|
Hinch R, Donnelly P, Hinch AG. Meiotic DNA breaks drive multifaceted mutagenesis in the human germ line. Science 2023; 382:eadh2531. [PMID: 38033082 PMCID: PMC7615360 DOI: 10.1126/science.adh2531] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 09/29/2023] [Indexed: 12/02/2023]
Abstract
Meiotic recombination commences with hundreds of programmed DNA breaks; however, the degree to which they are accurately repaired remains poorly understood. We report that meiotic break repair is eightfold more mutagenic for single-base substitutions than was previously understood, leading to de novo mutation in one in four sperm and one in 12 eggs. Its impact on indels and structural variants is even higher, with 100- to 1300-fold increases in rates per break. We uncovered new mutational signatures and footprints relative to break sites, which implicate unexpected biochemical processes and error-prone DNA repair mechanisms, including translesion synthesis and end joining in meiotic break repair. We provide evidence that these mechanisms drive mutagenesis in human germ lines and lead to disruption of hundreds of genes genome wide.
Collapse
Affiliation(s)
- Robert Hinch
- Big Data Institute, University of Oxford; Oxford, UK
| | - Peter Donnelly
- Wellcome Centre for Human Genetics, University of Oxford; Oxford, UK
- Genomics plc; Oxford, UK
| | | |
Collapse
|
7
|
Browning SR, Browning BL. Biobank-scale inference of multi-individual identity by descent and gene conversion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.03.565574. [PMID: 37961601 PMCID: PMC10635131 DOI: 10.1101/2023.11.03.565574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
We present a method for efficiently identifying clusters of identical-by-descent haplotypes in biobank-scale sequence data. Our multi-individual approach enables much more efficient collection and storage of identity by descent (IBD) information than approaches that detect and store pairwise IBD segments. Our method's computation time, memory requirements, and output size scale linearly with the number of individuals in the dataset. We also present a method for using multi-individual IBD to detect alleles changed by gene conversion. Application of our methods to the autosomal sequence data for 125,361 White British individuals in the UK Biobank detects more than 9 million converted alleles. This is 2900 times more alleles changed by gene conversion than were detected in a previous analysis of familial data. We estimate that more than 250,000 sequenced probands and a much larger number of additional genomes from multi-generational family members would be required to find a similar number of alleles changed by gene conversion using a family-based approach.
Collapse
Affiliation(s)
| | - Brian L. Browning
- Department of Biostatistics, University of Washington, Seattle, WA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA
| |
Collapse
|
8
|
Premkumar T, Paniker L, Kang R, Biot M, Humphrey E, Destain H, Ferranti I, Okulate I, Nguyen H, Kilaru V, Frasca M, Chakraborty P, Cole F. Genetic dissection of crossover mutants defines discrete intermediates in mouse meiosis. Mol Cell 2023; 83:2941-2958.e7. [PMID: 37595556 PMCID: PMC10469168 DOI: 10.1016/j.molcel.2023.07.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 03/28/2023] [Accepted: 07/21/2023] [Indexed: 08/20/2023]
Abstract
Crossovers (COs), the exchange of homolog arms, are required for accurate chromosome segregation during meiosis. Studies in yeast have described the single-end invasion (SEI) intermediate: a stabilized 3' end annealed with the homolog as the first detectible CO precursor. SEIs are thought to differentiate into double Holliday junctions (dHJs) that are resolved by MutLgamma (MLH1/MLH3) into COs. Currently, we lack knowledge of early steps of mammalian CO recombination or how intermediates are differentiated in any organism. Using comprehensive analysis of recombination in thirteen different genetic conditions with varying levels of compromised CO resolution, we infer CO precursors include asymmetric SEI-like intermediates and dHJs in mouse. In contrast to yeast, MLH3 is structurally required to differentiate CO precursors into dHJs. We verify conservation of aspects of meiotic recombination and show unique features in mouse, providing mechanistic insight into CO formation.
Collapse
Affiliation(s)
- Tolkappiyan Premkumar
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA; The University of Texas MD Anderson Cancer Center UTHealth Houston Graduate School of Biomedical Sciences, Program in Genetics and Epigenetics, Houston, TX, USA
| | - Lakshmi Paniker
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Rhea Kang
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA; The University of Texas MD Anderson Cancer Center UTHealth Houston Graduate School of Biomedical Sciences, Program in Genetics and Epigenetics, Houston, TX, USA
| | - Mathilde Biot
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Ericka Humphrey
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Honorine Destain
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Isabella Ferranti
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Iyinyeoluwa Okulate
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Holly Nguyen
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Vindhya Kilaru
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Melissa Frasca
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA; The University of Texas MD Anderson Cancer Center UTHealth Houston Graduate School of Biomedical Sciences, Program in Genetics and Epigenetics, Houston, TX, USA
| | - Parijat Chakraborty
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Francesca Cole
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA; The University of Texas MD Anderson Cancer Center UTHealth Houston Graduate School of Biomedical Sciences, Program in Genetics and Epigenetics, Houston, TX, USA.
| |
Collapse
|
9
|
Johri P, Pfeifer SP, Jensen JD. Developing an evolutionary baseline model for humans: jointly inferring purifying selection with population history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536488. [PMID: 37090533 PMCID: PMC10120674 DOI: 10.1101/2023.04.11.536488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics - including quantifying the relative contributions of adaptive vs. non-adaptive processes - but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection ( e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome ( i.e. , purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.
Collapse
|
10
|
Sun Y, Yuan F, Wang L, Dai D, Zhang Z, Liang F, Liu N, Long J, Zhao X, Xi Y. Recombination and mutation shape variations in the major histocompatibility complex. J Genet Genomics 2022; 49:1151-1161. [PMID: 35358716 DOI: 10.1016/j.jgg.2022.03.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 01/14/2023]
Abstract
The major histocompatibility complex (MHC) is closely associated with numerous diseases, but its high degree of polymorphism complicates the discovery of disease-associated variants. In principle, recombination and de novo mutations are two critical factors responsible for MHC polymorphisms. However, direct evidence for this hypothesis is lacking. Here, we report the generation of fine-scale MHC recombination and de novo mutation maps of ∼5 Mb by deep sequencing (> 100×) of the MHC genome for 17 MHC recombination and 30 non-recombination Han Chinese families (a total of 190 individuals). Recombination hotspots and Han-specific breakpoints are located in close proximity at haplotype block boundaries. The average MHC de novo mutation rate is higher than the genome-wide de novo mutation rate, particularly in MHC recombinant individuals. Notably, mutation and recombination generated polymorphisms are located within and outside linkage disequilibrium regions of the MHC, respectively, and evolution of the MHC locus was mainly controlled by positive selection. These findings provide insights on the evolutionary causes of the MHC diversity and may facilitate the identification of disease-associated genetic variants.
Collapse
Affiliation(s)
- Yuying Sun
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China.
| | - Fang Yuan
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Ling Wang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Dongfa Dai
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Zhijian Zhang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Fei Liang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Nan Liu
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Juan Long
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Xiao Zhao
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Yongzhi Xi
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China.
| |
Collapse
|
11
|
Setter D, Ebdon S, Jackson B, Lohse K. Estimating the rates of crossover and gene conversion from individual genomes. Genetics 2022; 222:iyac100. [PMID: 35771626 PMCID: PMC9434185 DOI: 10.1093/genetics/iyac100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 06/01/2022] [Indexed: 11/14/2022] Open
Abstract
Recombination can occur either as a result of crossover or gene conversion events. Population genetic methods for inferring the rate of recombination from patterns of linkage disequilibrium generally assume a simple model of recombination that only involves crossover events and ignore gene conversion. However, distinguishing the 2 processes is not only necessary for a complete description of recombination, but also essential for understanding the evolutionary consequences of inversions and other genomic partitions in which crossover (but not gene conversion) is reduced. We present heRho, a simple composite likelihood scheme for coestimating the rate of crossover and gene conversion from individual diploid genomes. The method is based on analytic results for the distance-dependent probability of heterozygous and homozygous states at 2 loci. We apply heRho to simulations and data from the house mouse Mus musculus castaneus, a well-studied model. Our analyses show (1) that the rates of crossover and gene conversion can be accurately coestimated at the level of individual chromosomes and (2) that previous estimates of the population scaled rate of recombination ρ=4Ner under a pure crossover model are likely biased.
Collapse
Affiliation(s)
- Derek Setter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Sam Ebdon
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Ben Jackson
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Konrad Lohse
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
12
|
Lee B, Cyrill SL, Lee W, Melchiotti R, Andiappan AK, Poidinger M, Rötzschke O. Analysis of archaic human haplotypes suggests that 5hmC acts as an epigenetic guide for NCO recombination. BMC Biol 2022; 20:173. [PMID: 35927700 PMCID: PMC9354366 DOI: 10.1186/s12915-022-01353-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 06/17/2022] [Indexed: 11/17/2022] Open
Abstract
Background Non-crossover (NCO) refers to a mechanism of homologous recombination in which short tracks of DNA are copied between homologue chromatids. The allelic changes are typically restricted to one or few SNPs, which potentially allow for the gradual adaptation and maturation of haplotypes. It is assumed to be a stochastic process but the analysis of archaic and modern human haplotypes revealed a striking variability in local NCO recombination rates. Methods NCO recombination rates of 1.9 million archaic SNPs shared with Denisovan hominids were defined by a linkage study and correlated with functional and genomic annotations as well as ChIP-Seq data from modern humans. Results We detected a strong correlation between NCO recombination rates and the function of the respective region: low NCO rates were evident in introns and quiescent intergenic regions but high rates in splice sites, exons, 5′- and 3′-UTRs, as well as CpG islands. Correlations with ChIP-Seq data from ENCODE and other public sources further identified epigenetic modifications that associated directly with these recombination events. A particularly strong association was observed for 5-hydroxymethylcytosine marks (5hmC), which were enriched in virtually all of the functional regions associated with elevated NCO rates, including CpG islands and ‘poised’ bivalent regions. Conclusion Our results suggest that 5hmC marks may guide the NCO machinery specifically towards functionally relevant regions and, as an intermediate of oxidative demethylation, may open a pathway for environmental influence by specifically targeting recently opened gene loci. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01353-9.
Collapse
Affiliation(s)
- Bernett Lee
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Lee Kong Chian School of Medicine, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Samantha Leeanne Cyrill
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Cold Spring Harbor Laboratory, One Bungtown Road, NY, 11724, Cold Spring Harbor, USA
| | - Wendy Lee
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Rossella Melchiotti
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Anand Kumar Andiappan
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Michael Poidinger
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Murdoch Children's Research Institute, Royal Children's Hospital, Flemington Road, Parkville, Victoria, 3052, Australia
| | - Olaf Rötzschke
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.
| |
Collapse
|
13
|
Wall JD, Robinson JA, Cox LA. High-Resolution Estimates of Crossover and Noncrossover Recombination from a Captive Baboon Colony. Genome Biol Evol 2022; 14:evac040. [PMID: 35325119 PMCID: PMC9048888 DOI: 10.1093/gbe/evac040] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/02/2022] [Indexed: 11/17/2022] Open
Abstract
Homologous recombination has been extensively studied in humans and a handful of model organisms. Much less is known about recombination in other species, including nonhuman primates. Here, we present a study of crossovers (COs) and noncrossover (NCO) recombination in olive baboons (Papio anubis) from two pedigrees containing a total of 20 paternal and 17 maternal meioses, and compare these results to linkage disequilibrium (LD) based recombination estimates from 36 unrelated olive baboons. We demonstrate how COs, combined with LD-based recombination estimates, can be used to identify genome assembly errors. We also quantify sex-specific differences in recombination rates, including elevated male CO and reduced female CO rates near telomeres. Finally, we add to the increasing body of evidence suggesting that while most NCO recombination tracts in mammals are short (e.g., <500 bp), there is a non-negligible fraction of longer (e.g., >1 kb) NCO tracts. For NCO tracts shorter than 10 kb, we fit a mixture of two (truncated) geometric distributions model to the NCO tract length distribution and estimate that >99% of all NCO tracts are very short (mean 24 bp), but the remaining tracts can be quite long (mean 4.3 kb). A single geometric distribution model for NCO tract lengths is incompatible with the data, suggesting that LD-based methods for estimating NCO recombination rates that make this assumption may need to be modified.
Collapse
Affiliation(s)
- Jeffrey D. Wall
- Institute for Human Genetics, University of California San Francisco, USA
| | | | - Laura A. Cox
- Center for Precision Medicine, Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, USA
| |
Collapse
|
14
|
Lorenz A, Mpaulo SJ. Gene conversion: a non-Mendelian process integral to meiotic recombination. Heredity (Edinb) 2022; 129:56-63. [PMID: 35393552 PMCID: PMC9273591 DOI: 10.1038/s41437-022-00523-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/01/2022] [Accepted: 03/02/2022] [Indexed: 12/30/2022] Open
Abstract
Meiosis is undoubtedly the mechanism that underpins Mendelian genetics. Meiosis is a specialised, reductional cell division which generates haploid gametes (reproductive cells) carrying a single chromosome complement from diploid progenitor cells harbouring two chromosome sets. Through this process, the hereditary material is shuffled and distributed into haploid gametes such that upon fertilisation, when two haploid gametes fuse, diploidy is restored in the zygote. During meiosis the transient physical connection of two homologous chromosomes (one originally inherited from each parent) each consisting of two sister chromatids and their subsequent segregation into four meiotic products (gametes), is what enables genetic marker assortment forming the core of Mendelian laws. The initiating events of meiotic recombination are DNA double-strand breaks (DSBs) which need to be repaired in a certain way to enable the homologous chromosomes to find each other. This is achieved by DSB ends searching for homologous repair templates and invading them. Ultimately, the repair of meiotic DSBs by homologous recombination physically connects homologous chromosomes through crossovers. These physical connections provided by crossovers enable faithful chromosome segregation. That being said, the DSB repair mechanism integral to meiotic recombination also produces genetic transmission distortions which manifest as postmeiotic segregation events and gene conversions. These processes are non-reciprocal genetic exchanges and thus non-Mendelian.
Collapse
Affiliation(s)
- Alexander Lorenz
- Institute of Medical Sciences (IMS), University of Aberdeen, Aberdeen, UK.
| | - Samantha J Mpaulo
- Institute of Medical Sciences (IMS), University of Aberdeen, Aberdeen, UK
| |
Collapse
|
15
|
Matschiner M, Barth JMI, Tørresen OK, Star B, Baalsrud HT, Brieuc MSO, Pampoulie C, Bradbury I, Jakobsen KS, Jentoft S. Supergene origin and maintenance in Atlantic cod. Nat Ecol Evol 2022; 6:469-481. [PMID: 35177802 PMCID: PMC8986531 DOI: 10.1038/s41559-022-01661-x] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 01/10/2022] [Indexed: 12/19/2022]
Abstract
Supergenes are sets of genes that are inherited as a single marker and encode complex phenotypes through their joint action. They are identified in an increasing number of organisms, yet their origins and evolution remain enigmatic. In Atlantic cod, four megabase-scale supergenes have been identified and linked to migratory lifestyle and environmental adaptations. Here we investigate the origin and maintenance of these four supergenes through analysis of whole-genome-sequencing data, including a new long-read-based genome assembly for a non-migratory Atlantic cod individual. We corroborate the finding that chromosomal inversions underlie all four supergenes, and we show that they originated at different times between 0.40 and 1.66 million years ago. We reveal gene flux between supergene haplotypes where migratory and stationary Atlantic cod co-occur and conclude that this gene flux is driven by gene conversion, on the basis of an increase in GC content in exchanged sites. Additionally, we find evidence for double crossover between supergene haplotypes, leading to the exchange of an ~275 kilobase fragment with genes potentially involved in adaptation to low salinity in the Baltic Sea. Our results suggest that supergenes can be maintained over long timescales in the same way as hybridizing species, through the selective purging of introduced genetic variation. Atlantic cod carries four supergenes linked to migratory lifestyle and environmental adaptations. Using whole-genome sequencing, the authors show that the genome inversions that underlie the supergenes originated at different times and show gene flux between supergene haplotypes.
Collapse
Affiliation(s)
- Michael Matschiner
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway. .,Department of Palaeontology and Museum, University of Zurich, Zurich, Switzerland. .,Natural History Museum, University of Oslo, Oslo, Norway.
| | - Julia Maria Isis Barth
- Zoological Institute, Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Ole Kristian Tørresen
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Bastiaan Star
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Helle Tessand Baalsrud
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Marine Servane Ono Brieuc
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | | | - Ian Bradbury
- Fisheries and Oceans Canada, St John's, Newfoundland and Labrador, Canada
| | - Kjetill Sigurd Jakobsen
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Sissel Jentoft
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway.
| |
Collapse
|
16
|
Yanovsky-Dagan S, Frumkin A, Lupski JR, Harel T. CRISPR/Cas9-induced gene conversion between ATAD3 paralogs. HGG ADVANCES 2022; 3:100092. [PMID: 35199044 PMCID: PMC8844715 DOI: 10.1016/j.xhgg.2022.100092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 01/19/2022] [Indexed: 11/23/2022] Open
Abstract
Paralogs and pseudogenes are abundant within the human genome, and can mediate non-allelic homologous recombination (NAHR) or gene conversion events. The ATAD3 locus contains three paralogs situated in tandem, and is therefore prone to NAHR-mediated deletions and duplications associated with severe neurological phenotypes. To study this locus further, we aimed to generate biallelic loss-of-function variants in ATAD3A by CRISPR/Cas9 genome editing. Unexpectedly, two of the generated clones underwent gene conversion, as evidenced by replacement of the targeted sequence of ATAD3A by a donor sequence from its paralog ATAD3B. We highlight the complexity of CRISPR/Cas9 design, end-product formation, and recombination repair mechanisms for CRISPR/Cas9 delivery as a nucleic acid molecular therapy when targeting genes that have paralogs or pseudogenes, and advocate meticulous evaluation of resultant clones in model organisms. In addition, we suggest that endogenous gene conversion may be used to repair missense variants in genes with paralogs or pseudogenes.
Collapse
Affiliation(s)
| | - Ayala Frumkin
- Department of Genetics, Hadassah Medical Organization, Jerusalem, Israel
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Hospital, Houston, TX, USA
| | - Tamar Harel
- Department of Genetics, Hadassah Medical Organization, Jerusalem, Israel
- Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
- Corresponding author
| |
Collapse
|
17
|
Shaw BC, Estus S. Pseudogene-Mediated Gene Conversion After CRISPR-Cas9 Editing Demonstrated by Partial CD33 Conversion with SIGLEC22P. CRISPR J 2021; 4:699-709. [PMID: 34558988 DOI: 10.1089/crispr.2021.0052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Although gene editing workflows typically consider the possibility of off-target editing, pseudogene-directed homology repair has not, to our knowledge, been reported previously. Here, we employed a CRISPR-Cas9 strategy for targeted excision of exon 2 in CD33 in U937 human monocyte cell line. Candidate clonal cell lines were screened by using a clinically relevant antibody known to label the IgV domain encoded by exon 2 (P67.6, gemtuzumab). In addition to the anticipated deletion of exon 2, we also found unexpected P67.6-negative cell lines, which had apparently retained CD33 exon 2. Sequencing revealed that these lines underwent gene conversion from the nearby SIGLEC22P pseudogene during homology repair that resulted in three missense mutations relative to CD33. Ectopic expression studies confirmed that the P67.6 epitope is dependent upon these amino acids. In summation, we report that pseudogene-directed homology repair can lead to aberrant CRISPR gene editing.
Collapse
Affiliation(s)
- Benjamin C Shaw
- Department of Physiology and Sanders-Brown Center on Aging, University of Kentucky, Lexington, USA
| | - Steven Estus
- Department of Physiology and Sanders-Brown Center on Aging, University of Kentucky, Lexington, USA
| |
Collapse
|
18
|
Wang RJ, Radivojac P, Hahn MW. Distinct error rates for reference and nonreference genotypes estimated by pedigree analysis. Genetics 2021; 217:1-10. [PMID: 33683359 DOI: 10.1093/genetics/iyaa014] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 11/13/2020] [Indexed: 01/06/2023] Open
Abstract
Errors in genotype calling can have perverse effects on genetic analyses, confounding association studies, and obscuring rare variants. Analyses now routinely incorporate error rates to control for spurious findings. However, reliable estimates of the error rate can be difficult to obtain because of their variance between studies. Most studies also report only a single estimate of the error rate even though genotypes can be miscalled in more than one way. Here, we report a method for estimating the rates at which different types of genotyping errors occur at biallelic loci using pedigree information. Our method identifies potential genotyping errors by exploiting instances where the haplotypic phase has not been faithfully transmitted. The expected frequency of inconsistent phase depends on the combination of genotypes in a pedigree and the probability of miscalling each genotype. We develop a model that uses the differences in these frequencies to estimate rates for different types of genotype error. Simulations show that our method accurately estimates these error rates in a variety of scenarios. We apply this method to a dataset from the whole-genome sequencing of owl monkeys (Aotus nancymaae) in three-generation pedigrees. We find significant differences between estimates for different types of genotyping error, with the most common being homozygous reference sites miscalled as heterozygous and vice versa. The approach we describe is applicable to any set of genotypes where haplotypic phase can reliably be called and should prove useful in helping to control for false discoveries.
Collapse
Affiliation(s)
- Richard J Wang
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
19
|
Gergelits V, Parvanov E, Simecek P, Forejt J. Chromosome-wide characterization of meiotic noncrossovers (gene conversions) in mouse hybrids. Genetics 2021; 217:1-14. [PMID: 33683354 DOI: 10.1093/genetics/iyaa013] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Accepted: 11/13/2020] [Indexed: 01/16/2023] Open
Abstract
During meiosis, the recombination-initiating DNA double-strand breaks (DSBs) are repaired by crossovers or noncrossovers (gene conversions). While crossovers are easily detectable, noncrossover identification is hampered by the small size of their converted tracts and the necessity of sequence polymorphism. We report identification and characterization of a mouse chromosome-wide set of noncrossovers by next-generation sequencing of 10 mouse intersubspecific chromosome substitution strains. Based on 94 identified noncrossovers, we determined the mean length of a conversion tract to be 32 bp. The spatial chromosome-wide distribution of noncrossovers and crossovers significantly differed, although both sets overlapped the known hotspots of PRDM9-directed histone methylation and DNA DSBs, thus supporting their origin in the standard DSB repair pathway. A significant deficit of noncrossovers descending from asymmetric DSBs proved their proposed adverse effect on meiotic recombination and pointed to sister chromatids as an alternative template for their repair. The finding has implications for the molecular mechanism of hybrid sterility in mice from crosses between closely related Mus musculus musculus and Mus musculus domesticus subspecies.
Collapse
Affiliation(s)
- Vaclav Gergelits
- Laboratory of Mouse Molecular Genetics, Division BIOCEV, Institute of Molecular Genetics, Czech Academy of Sciences, CZ-25250 Vestec, Czech Republic.,Department of Cell Biology, Faculty of Science, Charles University, CZ-12000 Prague, Czech Republic
| | - Emil Parvanov
- Laboratory of Mouse Molecular Genetics, Division BIOCEV, Institute of Molecular Genetics, Czech Academy of Sciences, CZ-25250 Vestec, Czech Republic
| | - Petr Simecek
- Laboratory of Mouse Molecular Genetics, Division BIOCEV, Institute of Molecular Genetics, Czech Academy of Sciences, CZ-25250 Vestec, Czech Republic
| | - Jiri Forejt
- Laboratory of Mouse Molecular Genetics, Division BIOCEV, Institute of Molecular Genetics, Czech Academy of Sciences, CZ-25250 Vestec, Czech Republic
| |
Collapse
|
20
|
Dutta A, Dutreux F, Schacherer J. Loss of heterozygosity results in rapid but variable genome homogenization across yeast genetic backgrounds. eLife 2021; 10:70339. [PMID: 34159898 PMCID: PMC8245132 DOI: 10.7554/elife.70339] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 06/11/2021] [Indexed: 12/12/2022] Open
Abstract
The dynamics and diversity of the appearance of genetic variants play an essential role in the evolution of the genome and the shaping of biodiversity. Recent population-wide genome sequencing surveys have highlighted the importance of loss of heterozygosity (LOH) events and have shown that they are a neglected part of the genetic diversity landscape. To assess the extent, variability, and spectrum, we explored the accumulation of LOH events in 169 heterozygous diploid Saccharomyces cerevisiae mutation accumulation lines across nine genetic backgrounds. In total, we detected a large set of 22,828 LOH events across distinct genetic backgrounds with a heterozygous level ranging from 0.1% to 1%. LOH events are very frequent with a rate consistently much higher than the mutation rate, showing their importance for genome evolution. We observed that the interstitial LOH (I-LOH) events, resulting in internal short LOH tracts, were much frequent (n = 19,660) than the terminal LOH (T-LOH) events, that is, tracts extending to the end of the chromosome (n = 3168). However, the spectrum, the rate, and the fraction of the genome under LOH vary across genetic backgrounds. Interestingly, we observed that the more the ancestors were heterozygous, the more they accumulated T-LOH events. In addition, frequent short I-LOH tracts are a signature of the lines derived from hybrids with low spore fertility. Finally, we found lines showing almost complete homozygotization during vegetative progression. Overall, our results highlight that the variable dynamics of the LOH accumulation across distinct genetic backgrounds might lead to rapid differential genome evolution during vegetative growth.
Collapse
Affiliation(s)
- Abhishek Dutta
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Fabien Dutreux
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France.,Institut Universitaire de France (IUF), Paris, France
| |
Collapse
|
21
|
Abstract
Accurately inferring the genome-wide landscape of recombination rates in natural populations is a central aim in genomics, as patterns of linkage influence everything from genetic mapping to understanding evolutionary history. Here, we describe recombination landscape estimation using recurrent neural networks (ReLERNN), a deep learning method for estimating a genome-wide recombination map that is accurate even with small numbers of pooled or individually sequenced genomes. Rather than use summaries of linkage disequilibrium as its input, ReLERNN takes columns from a genotype alignment, which are then modeled as a sequence across the genome using a recurrent neural network. We demonstrate that ReLERNN improves accuracy and reduces bias relative to existing methods and maintains high accuracy in the face of demographic model misspecification, missing genotype calls, and genome inaccessibility. We apply ReLERNN to natural populations of African Drosophila melanogaster and show that genome-wide recombination landscapes, although largely correlated among populations, exhibit important population-specific differences. Lastly, we connect the inferred patterns of recombination with the frequencies of major inversions segregating in natural Drosophila populations.
Collapse
Affiliation(s)
- Jeffrey R Adrion
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR
| | - Jared G Galloway
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR
| | - Andrew D Kern
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR
| |
Collapse
|
22
|
Adapting Biased Gene Conversion theory to account for intensive GC-content deterioration in the human genome by novel mutations. PLoS One 2020; 15:e0232167. [PMID: 32353016 PMCID: PMC7192473 DOI: 10.1371/journal.pone.0232167] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 04/09/2020] [Indexed: 12/23/2022] Open
Abstract
We examined seventy million well-characterized human mutations, and their impact on G+C-compositional dynamics, in order to understand the formation and maintenance of major genomic nucleotide sequence patterns. Among novel mutations, those that change a strong (S) base pair G:C/C:G to a weak (W) pair A:T/T:A occur at nearly twice the frequency of the opposite mutations. Such imbalance puts strong downward pressure on overall GC-content. However, along protracted paths to fixation, S→W mutations are much less likely to propagate than W→S mutations. The magnitude of relative propagation disadvantages for S→W mutations is inexplicable by any currently-accepted model. This fact forced us to re-examine the quantitative features of Biased Gene Conversion (BGC) theory. Revised parameters of BGC that, per average individual, convert 7–14 W base pairs into S pairs, would account for the S-content turnover differences between new and old mutations, and make BGC an instrumental force for nucleotide dynamics and evolution. BGC should thus be considered seriously in both theories and biomedical practice. In particular, BGC should be taken into account during allele imputations, where missing SNP alleles are computationally predicted based on the information about several neighboring alleles. Finally, we analyzed the effect of neighboring nucleotide context on the mutation frequencies, dynamics, and GC-composition turnover. For this purpose, we examined genomic regions having extremely biased nucleotide compositions (enriched for S-, W-, purine/pyrimidine strand asymmetry, or AC/GT-strand asymmetry). It was found that point mutations in these regions preferentially degrade the nucleotide inhomogeneities, decreasing the sequence biases. Degradation of sequence bias is highest for novel mutations, and considerably lower for older mutations (those widespread across populations). Besides BGC, there may be additional, still uncharacterized molecular mechanisms that either preserve genomic regions with biased nucleotide compositions from mutational degradation or fail to degrade such inhomogeneities in specific chromosomal regions.
Collapse
|
23
|
Zhou Y, Browning SR, Browning BL. A Fast and Simple Method for Detecting Identity-by-Descent Segments in Large-Scale Data. Am J Hum Genet 2020; 106:426-437. [PMID: 32169169 PMCID: PMC7118582 DOI: 10.1016/j.ajhg.2020.02.010] [Citation(s) in RCA: 74] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 02/12/2020] [Indexed: 12/24/2022] Open
Abstract
Segments of identity by descent (IBD) are used in many genetic analyses. We present a method for detecting identical-by-descent haplotype segments in phased genotype data. Our method, called hap-IBD, combines a compressed representation of haplotype data, the positional Burrows-Wheeler transform, and multi-threaded execution to produce very fast analysis times. An attractive feature of hap-IBD is its simplicity: the input parameters clearly and precisely define the IBD segments that are reported, so that program correctness can be confirmed by users. We evaluate hap-IBD and four state-of-the-art IBD segment detection methods (GERMLINE, iLASH, RaPID, and TRUFFLE) using UK Biobank chromosome 20 data and simulated sequence data. We show that hap-IBD detects IBD segments faster and more accurately than competing methods, and that hap-IBD is the only method that can rapidly and accurately detect short 2-4 centiMorgan (cM) IBD segments in the full UK Biobank data. Analysis of 485,346 UK Biobank samples through the use of hap-IBD with 12 computational threads detects 231.5 billion autosomal IBD segments with length ≥2 cM in 24.4 h.
Collapse
Affiliation(s)
- Ying Zhou
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Sharon R Browning
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Brian L Browning
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA; Department of Medicine, Division of Medical Genetics, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
24
|
Castellano D, Eyre-Walker A, Munch K. Impact of Mutation Rate and Selection at Linked Sites on DNA Variation across the Genomes of Humans and Other Homininae. Genome Biol Evol 2020; 12:3550-3561. [PMID: 31596481 PMCID: PMC6944223 DOI: 10.1093/gbe/evz215] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/03/2019] [Indexed: 12/23/2022] Open
Abstract
DNA diversity varies across the genome of many species. Variation in diversity across a genome might arise from regional variation in the mutation rate, variation in the intensity and mode of natural selection, and regional variation in the recombination rate. We show that both noncoding and nonsynonymous diversity are positively correlated to a measure of the mutation rate and the recombination rate and negatively correlated to the density of conserved sequences in 50 kb windows across the genomes of humans and nonhuman homininae. Interestingly, we find that although noncoding diversity is equally affected by these three genomic variables, nonsynonymous diversity is mostly dominated by the density of conserved sequences. The positive correlation between diversity and our measure of the mutation rate seems to be largely a direct consequence of regions with higher mutation rates having more diversity. However, the positive correlation with recombination rate and the negative correlation with the density of conserved sequences suggest that selection at linked sites also affect levels of diversity. This is supported by the observation that the ratio of the number of nonsynonymous to noncoding polymorphisms is negatively correlated to a measure of the effective population size across the genome. We show these patterns persist even when we restrict our analysis to GC-conservative mutations, demonstrating that the patterns are not driven by GC biased gene conversion. In conclusion, our comparative analyses describe how recombination rate, gene density, and mutation rate interact to produce the patterns of DNA diversity that we observe along the hominine genomes.
Collapse
Affiliation(s)
- David Castellano
- Bioinformatics Research Centre, Aarhus University, Denmark
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona, Spain
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Kasper Munch
- Bioinformatics Research Centre, Aarhus University, Denmark
| |
Collapse
|
25
|
Abstract
Through recombination, genes are freed to evolve more independently of one another, unleashing genetic variance hidden in the linkage disequilibrium that accumulates through selection combined with drift. Yet crossover numbers are evolutionarily constrained, with at least one and not many more than one crossover per bivalent in most taxa. Crossover interference, whereby a crossover reduces the probability of a neighboring crossover, contributes to this homogeneity. The mechanisms by which interference is achieved and crossovers are regulated are a major current subject of inquiry, facilitated by novel methods to visualize crossovers and to pinpoint recombination events. Here, we review patterns of crossover interference and the models built to describe this process. We then discuss the selective forces that have likely shaped interference and the regulation of crossover numbers.
Collapse
Affiliation(s)
- Sarah P Otto
- Department of Zoology & Biodiversity Research Centre, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada;
| | - Bret A Payseur
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
26
|
van Sluis M, Gailín MÓ, McCarter JGW, Mangan H, Grob A, McStay B. Human NORs, comprising rDNA arrays and functionally conserved distal elements, are located within dynamic chromosomal regions. Genes Dev 2019; 33:1688-1701. [PMID: 31727772 PMCID: PMC6942050 DOI: 10.1101/gad.331892.119] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 10/23/2019] [Indexed: 01/09/2023]
Abstract
In this study, van Sluis et al. investigated the role of chromosomal context in nuclear organizer regions (NORs)/ribosomal gene (rDNA) in nucleolar formation and function. The analyses combined sequence capture and long-read sequencing to characterize the regions distal to rDNA arrays (DJ) on isolated acrocentric chromosomes, and their findings provide direct evidence for exchanges between heterologous human acrocentric p-arms, and uncover extensive structural variation between chromosomes and among individuals. Human nucleolar organizer regions (NORs), containing ribosomal gene (rDNA) arrays, are located on the p-arms of acrocentric chromosomes (HSA13–15, 21, and 22). Absence of these p-arms from genome references has hampered research on nucleolar formation. Previously, we assembled a distal junction (DJ) DNA sequence contig that abuts rDNA arrays on their telomeric side, revealing that it is shared among the acrocentrics and impacts nucleolar organization. To facilitate inclusion into genome references, we describe sequencing the DJ from all acrocentrics, including three versions of HSA21, ∼3 Mb of novel sequence. This was achieved by exploiting monochromosomal somatic cell hybrids containing single human acrocentric chromosomes with NORs that retain functional potential. Analyses revealed remarkable DJ sequence and functional conservation among human acrocentrics. Exploring chimpanzee acrocentrics, we show that “DJ-like” sequences and abutting rDNA arrays are inverted as a unit in comparison to humans. Thus, rDNA arrays and linked DJs represent a conserved functional locus. We provide direct evidence for exchanges between heterologous human acrocentric p-arms, and uncover extensive structural variation between chromosomes and among individuals. These findings lead us to revaluate the molecular definition of NORs, identify novel genomic structural variation, and provide a rationale for the distinctive chromosomal organization of NORs.
Collapse
Affiliation(s)
- Marjolein van Sluis
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| | - Michael Ó Gailín
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| | - Joseph G W McCarter
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| | - Hazel Mangan
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| | - Alice Grob
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| | - Brian McStay
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Galway, Ireland
| |
Collapse
|
27
|
Kawakami T, Wallberg A, Olsson A, Wintermantel D, de Miranda JR, Allsopp M, Rundlöf M, Webster MT. Substantial Heritable Variation in Recombination Rate on Multiple Scales in Honeybees and Bumblebees. Genetics 2019; 212:1101-1119. [PMID: 31152071 PMCID: PMC6707477 DOI: 10.1534/genetics.119.302008] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 05/30/2019] [Indexed: 12/30/2022] Open
Abstract
Meiotic recombination shuffles genetic variation and promotes correct segregation of chromosomes. Rates of recombination vary on several scales, both within genomes and between individuals, and this variation is affected by both genetic and environmental factors. Social insects have extremely high rates of recombination, although the evolutionary causes of this are not known. Here, we estimate rates of crossovers and gene conversions in 22 colonies of the honeybee, Apis mellifera, and 9 colonies of the bumblebee, Bombus terrestris, using direct sequencing of 299 haploid drone offspring. We confirm that both species have extremely elevated crossover rates, with higher rates measured in the highly eusocial honeybee than the primitively social bumblebee. There are also significant differences in recombination rate between subspecies of honeybee. There is substantial variation in genome-wide recombination rate between individuals of both A. mellifera and B. terrestris and the distribution of these rates overlap between species. A large proportion of interindividual variation in recombination rate is heritable, which indicates the presence of variation in trans-acting factors that influence recombination genome-wide. We infer that levels of crossover interference are significantly lower in honeybees compared to bumblebees, which may be one mechanism that contributes to higher recombination rates in honeybees. We also find a significant increase in recombination rate with distance from the centromere, mirrored by methylation differences. We detect a strong transmission bias due to GC-biased gene conversion associated with noncrossover gene conversions. Our results shed light on the mechanistic causes of extreme rates of recombination in social insects and the genetic architecture of recombination rate variation.
Collapse
Affiliation(s)
- Takeshi Kawakami
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, 752 36, Sweden
- Department of Animal and Plant Sciences, University of Sheffield, S10 2TN, United Kingdom
| | - Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| | - Anna Olsson
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| | - Dimitry Wintermantel
- INRA, UE 1255 APIS, Le Magneraud, 17700 Surgères, France
- Centre d'Etudes Biologiques de Chizé, UMR 7372, CNRS and Université de La Rochelle, 79360 Villiers-en-Bois, France
| | - Joachim R de Miranda
- Department of Ecology, Swedish University of Agricultural Sciences, Uppsala 750 07, Sweden
| | - Mike Allsopp
- Plant Protection Research Institute, Agricultural Research Council, Stellenbosch, 7608, South Africa
| | - Maj Rundlöf
- Department of Biology, Lund University, 223 62, Sweden
| | - Matthew T Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| |
Collapse
|
28
|
Pilkington SM, Tahir J, Hilario E, Gardiner SE, Chagné D, Catanach A, McCallum J, Jesson L, Fraser LG, McNeilage MA, Deng C, Crowhurst RN, Datson PM, Zhang Q. Genetic and cytological analyses reveal the recombination landscape of a partially differentiated plant sex chromosome in kiwifruit. BMC PLANT BIOLOGY 2019; 19:172. [PMID: 31039740 PMCID: PMC6492441 DOI: 10.1186/s12870-019-1766-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 04/08/2019] [Indexed: 05/10/2023]
Abstract
BACKGROUND Angiosperm sex chromosomes, where present, are generally recently evolved. The key step in initiating the development of sex chromosomes from autosomes is the establishment of a sex-determining locus within a region of non-recombination. To better understand early sex chromosome evolution, it is important to determine the process by which recombination is suppressed around the sex determining genes. We have used the dioecious angiosperm kiwifruit Actinidia chinensis var. chinensis, which has an active-Y sex chromosome system, to study recombination rates around the sex locus, to better understand key events in the development of sex chromosomes. RESULTS We have confirmed the sex-determining region (SDR) in A. chinensis var. chinensis, using a combination of high density genetic mapping and fluorescent in situ hybridisation (FISH) of Bacterial Artificial Chromosomes (BACs) linked to the sex markers onto pachytene chromosomes. The SDR is a subtelomeric non-recombining region adjacent to the nucleolar organiser region (NOR). A region of restricted recombination of around 6 Mbp in size in both male and female maps spans the SDR and covers around a third of chromosome 25. CONCLUSIONS As recombination is suppressed over a similar region between X chromosomes and between and X and Y chromosomes, we propose that recombination is suppressed in this region because of the proximity of the NOR and the centromere, with both the NOR and centromere suppressing recombination, and this predates suppressed recombination due to differences between X and Y chromosomes. Such regions of suppressed recombination in the genome provide an opportunity for the evolution of sex chromosomes, if a sex-determining locus develops there or translocates into this region.
Collapse
Affiliation(s)
- S. M. Pilkington
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - J. Tahir
- PFR, Private Bag 11600, Palmerston North, 4442 New Zealand
| | - E. Hilario
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - S. E. Gardiner
- PFR, Private Bag 11600, Palmerston North, 4442 New Zealand
| | - D. Chagné
- PFR, Private Bag 11600, Palmerston North, 4442 New Zealand
| | - A. Catanach
- PFR, Private Bag 4704, Christchurch, 8140 New Zealand
| | - J. McCallum
- PFR, Private Bag 4704, Christchurch, 8140 New Zealand
| | - L. Jesson
- PFR, Private Bag 1401, Havelock North, 4157 New Zealand
| | - L. G. Fraser
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - M. A. McNeilage
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - C. Deng
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - R. N. Crowhurst
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - P. M. Datson
- The New Zealand Institute for Plant and Food Research Limited (PFR), Private Bag 92169, Auckland, 1142 New Zealand
| | - Q. Zhang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074 China
- The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, 430074 China
| |
Collapse
|
29
|
Heissl A, Betancourt AJ, Hermann P, Povysil G, Arbeithuber B, Futschik A, Ebner T, Tiemann-Boege I. The impact of poly-A microsatellite heterologies in meiotic recombination. Life Sci Alliance 2019; 2:2/2/e201900364. [PMID: 31023833 PMCID: PMC6485458 DOI: 10.26508/lsa.201900364] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 03/27/2019] [Accepted: 03/29/2019] [Indexed: 12/12/2022] Open
Abstract
Meiosis strongly influences the transmission and evolution of heterozygous poly-A repeats as measured experimentally in a large collection of single recombination products in a human hotspot. Meiotic recombination has strong, but poorly understood effects on short tandem repeat (STR) instability. Here, we screened thousands of single recombinant products with sperm typing to characterize the role of polymorphic poly-A repeats at a human recombination hotspot in terms of hotspot activity and STR evolution. We show that the length asymmetry between heterozygous poly-A’s strongly influences the recombination outcome: a heterology of 10 A’s (9A/19A) reduces the number of crossovers and elevates the frequency of non-crossovers, complex recombination products, and long conversion tracts. Moreover, the length of the heterology also influences the STR transmission during meiotic repair with a strong and significant insertion bias for the short heterology (6A/7A) and a deletion bias for the long heterology (9A/19A). In spite of this opposing insertion-/deletion-biased gene conversion, we find that poly-A’s are enriched at human recombination hotspots that could have important consequences in hotspot activation.
Collapse
Affiliation(s)
- Angelika Heissl
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | | | - Philipp Hermann
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Gundula Povysil
- Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
| | | | - Andreas Futschik
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Thomas Ebner
- Department of Gynecology, Obstetrics and Gynecological Endocrinology, Kepler University Clinic, Linz, Austria
| | | |
Collapse
|
30
|
Petersdorf EW, O'hUigin C. The MHC in the era of next-generation sequencing: Implications for bridging structure with function. Hum Immunol 2019; 80:67-78. [PMID: 30321633 PMCID: PMC6542361 DOI: 10.1016/j.humimm.2018.10.002] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 09/24/2018] [Accepted: 10/01/2018] [Indexed: 12/19/2022]
Abstract
The MHC continues to have the most disease-associations compared to other regions of the human genome, even in the genome-wide association study (GWAS) and single nucleotide polymorphism (SNP) era. Analysis of non-coding variation and their impact on the level of expression of HLA allotypes has shed new light on the potential mechanisms underlying HLA disease associations and alloreactivity in transplantation. Next-generation sequencing (NGS) technology has the capability of delineating the phase of variants in the HLA antigen-recognition site (ARS) with non-coding regulatory polymorphisms. These relationships are critical for understanding the qualitative and quantitative implications of HLA gene diversity. This article summarizes current understanding of non-coding region variation of HLA loci, the consequences of regulatory variation on HLA expression, the role for evolution in shaping lineage-specific expression, and the impact of HLA expression on disease susceptibility and transplantation outcomes. A role for phased sequencing methods for the MHC, and perspectives for future directions in basic and applied immunogenetic studies of the MHC are presented.
Collapse
Affiliation(s)
- Effie W Petersdorf
- University of Washington, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, D4-115, Seattle, WA 98109, United States.
| | - Colm O'hUigin
- Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Microbiome and Genetics Core, Building 37, Room 4140B, Bethesda, MD 20852, United States.
| |
Collapse
|
31
|
Korunes KL, Noor MAF. Pervasive gene conversion in chromosomal inversion heterozygotes. Mol Ecol 2018; 28:1302-1315. [PMID: 30387889 DOI: 10.1111/mec.14921] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 09/27/2018] [Accepted: 10/22/2018] [Indexed: 12/30/2022]
Abstract
Chromosomal inversions shape recombination landscapes, and species differing by inversions may exhibit reduced gene flow in these regions of the genome. Though single crossovers within inversions are not usually recovered from inversion heterozygotes, the recombination barrier imposed by inversions is nuanced by noncrossover gene conversion. Here, we provide a genomewide empirical analysis of gene conversion rates both within species and in species hybrids. We estimate that gene conversion occurs at a rate of 1 × 10-5 to 2.5 × 10-5 converted sites per bp per generation in experimental crosses within Drosophila pseudoobscura and between D. pseudoobscura and its naturally hybridizing sister species D. persimilis. This analysis is the first direct empirical assessment of gene conversion rates within inversions of a species hybrid. Our data show that gene conversion rates in interspecies hybrids are at least as high as within-species estimates of gene conversion rates, and gene conversion occurs regularly within and around inverted regions of species hybrids, even near inversion breakpoints. We also found that several gene conversion events appeared to be mitotic rather than meiotic in origin. Finally, we observed that gene conversion rates are higher in regions of lower local sequence divergence, yet our observed gene conversion rates in more divergent inverted regions were at least as high as in less divergent collinear regions. Given our observed high rates of gene conversion despite the sequence differentiation between species, especially in inverted regions, gene conversion has the potential to reduce the efficacy of inversions as barriers to recombination over evolutionary time.
Collapse
|
32
|
Poriswanish N, Neumann R, Wetton JH, Wagstaff J, Larmuseau MHD, Jobling MA, May CA. Recombination hotspots in an extended human pseudoautosomal domain predicted from double-strand break maps and characterized by sperm-based crossover analysis. PLoS Genet 2018; 14:e1007680. [PMID: 30296256 PMCID: PMC6193736 DOI: 10.1371/journal.pgen.1007680] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 10/18/2018] [Accepted: 09/05/2018] [Indexed: 01/11/2023] Open
Abstract
The human X and Y chromosomes are heteromorphic but share a region of homology at the tips of their short arms, pseudoautosomal region 1 (PAR1), that supports obligate crossover in male meiosis. Although the boundary between pseudoautosomal and sex-specific DNA has traditionally been regarded as conserved among primates, it was recently discovered that the boundary position varies among human males, due to a translocation of ~110 kb from the X to the Y chromosome that creates an extended PAR1 (ePAR). This event has occurred at least twice in human evolution. So far, only limited evidence has been presented to suggest this extension is recombinationally active. Here, we sought direct proof by examining thousands of gametes from each of two ePAR-carrying men, for two subregions chosen on the basis of previously published male X-chromosomal meiotic double-strand break (DSB) maps. Crossover activity comparable to that seen at autosomal hotspots was observed between the X and the ePAR borne on the Y chromosome both at a distal and a proximal site within the 110-kb extension. Other hallmarks of classic recombination hotspots included evidence of transmission distortion and GC-biased gene conversion. We observed good correspondence between the male DSB clusters and historical recombination activity of this region in the X chromosomes of females, as ascertained from linkage disequilibrium analysis; this suggests that this region is similarly primed for crossover in both male and female germlines, although sex-specific differences may also exist. Extensive resequencing and inference of ePAR haplotypes, placed in the framework of the Y phylogeny as ascertained by both Y microsatellites and single nucleotide polymorphisms, allowed us to estimate a minimum rate of crossover over the entire ePAR region of 6-fold greater than genome average, comparable with pedigree estimates of PAR1 activity generally. We conclude ePAR very likely contributes to the critical crossover function of PAR1. 95% of our genome is contained in 22 pairs of chromosomes shared by all humans. However, women and men differ in their sex chromosomes: while women have two X chromosomes, men have an X and a smaller, sex-determining Y chromosome. To ensure correct partition of X and Y into sperm, genetic exchange (crossover) must occur between these very different chromosomes in a short, shared region. The location of the boundary of this region was thought to have been conserved since before the divergence from old world monkeys at least 27 million years ago, but recently it has been shown that some human males carry an extended version on their Y chromosomes, thanks to the transposition of a piece of DNA from the X chromosome. Here, we asked if genetic exchange occurs in this newly extended region. To do this, we used previously published information that signposted the positions within the X chromosome segment which exhibit the hallmarks of crossover initiation. We then sought direct evidence of crossover in the sperm of men carrying the extension. This work showed that the signposts were accurate, pointing to frequent crossover in this novel shared sex-chromosomal domain.
Collapse
Affiliation(s)
- Nitikorn Poriswanish
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
- Department of Forensic Medicine, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Rita Neumann
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Jon H. Wetton
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | - John Wagstaff
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Maarten H. D. Larmuseau
- Laboratory of Forensic Genetics and Molecular Archaeology, Department of Imaging and Pathology, KU Leuven, Belgium
| | - Mark A. Jobling
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Celia A. May
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
- * E-mail:
| |
Collapse
|
33
|
Egli D, Zuccaro MV, Kosicki M, Church GM, Bradley A, Jasin M. Inter-homologue repair in fertilized human eggs? Nature 2018; 560:E5-E7. [PMID: 30089924 DOI: 10.1038/s41586-018-0379-5] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 04/05/2018] [Indexed: 11/09/2022]
Affiliation(s)
- Dieter Egli
- Department of Obstetrics and Gynecology and Department of Pediatrics, Columbia University, New York, NY, USA.
| | - Michael V Zuccaro
- Graduate Program, Department of Physiology and Cellular Biophysics, Columbia University, New York, NY, USA
| | | | - George M Church
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | | | - Maria Jasin
- Developmental Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| |
Collapse
|
34
|
Tiemann-Boege I, Schwarz T, Striedner Y, Heissl A. The consequences of sequence erosion in the evolution of recombination hotspots. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0462. [PMID: 29109225 PMCID: PMC5698624 DOI: 10.1098/rstb.2016.0462] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/05/2017] [Indexed: 12/18/2022] Open
Abstract
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Collapse
Affiliation(s)
- Irene Tiemann-Boege
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Theresa Schwarz
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Yasmin Striedner
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Angelika Heissl
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| |
Collapse
|
35
|
Latrille T, Duret L, Lartillot N. The Red Queen model of recombination hot-spot evolution: a theoretical investigation. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0463. [PMID: 29109226 PMCID: PMC5698625 DOI: 10.1098/rstb.2016.0463] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/06/2017] [Indexed: 11/12/2022] Open
Abstract
In humans and many other species, recombination events cluster in narrow and short-lived hot spots distributed across the genome, whose location is determined by the Zn-finger protein PRDM9. To explain these fast evolutionary dynamics, an intra-genomic Red Queen model has been proposed, based on the interplay between two antagonistic forces: biased gene conversion, mediated by double-strand breaks, resulting in hot-spot extinction, followed by positive selection favouring new PRDM9 alleles recognizing new sequence motifs. Thus far, however, this Red Queen model has not been formalized as a quantitative population-genetic model, fully accounting for the intricate interplay between biased gene conversion, mutation, selection, demography and genetic diversity at the PRDM9 locus. Here, we explore the population genetics of the Red Queen model of recombination. A Wright–Fisher simulator was implemented, allowing exploration of the behaviour of the model (mean equilibrium recombination rate, diversity at the PRDM9 locus or turnover rate) as a function of the parameters (effective population size, mutation and erosion rates). In a second step, analytical results based on self-consistent mean-field approximations were derived, reproducing the scaling relations observed in the simulations. Empirical fit of the model to current data from the mouse suggests both a high mutation rate at PRDM9 and strong biased gene conversion on its targets. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Collapse
Affiliation(s)
- Thibault Latrille
- Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
| | - Laurent Duret
- Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
| | - Nicolas Lartillot
- Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
| |
Collapse
|
36
|
Comeron JM. Background selection as null hypothesis in population genomics: insights and challenges from Drosophila studies. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0471. [PMID: 29109230 PMCID: PMC5698629 DOI: 10.1098/rstb.2016.0471] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2017] [Indexed: 12/11/2022] Open
Abstract
The consequences of selection at linked sites are multiple and widespread across the genomes of most species. Here, I first review the main concepts behind models of selection and linkage in recombining genomes, present the difficulty in parametrizing these models simply as a reduction in effective population size (Ne) and discuss the predicted impact of recombination rates on levels of diversity across genomes. Arguments are then put forward in favour of using a model of selection and linkage with neutral and deleterious mutations (i.e. the background selection model, BGS) as a sensible null hypothesis for investigating the presence of other forms of selection, such as balancing or positive. I also describe and compare two studies that have generated high-resolution landscapes of the predicted consequences of selection at linked sites in Drosophila melanogaster. Both studies show that BGS can explain a very large fraction of the observed variation in diversity across the whole genome, thus supporting its use as null model. Finally, I identify and discuss a number of caveats and challenges in studies of genetic hitchhiking that have been often overlooked, with several of them sharing a potential bias towards overestimating the evidence supporting recent selective sweeps to the detriment of a BGS explanation. One potential source of bias is the analysis of non-equilibrium populations: it is precisely because models of selection and linkage predict variation in Ne across chromosomes that demographic dynamics are not expected to be equivalent chromosome- or genome-wide. Other challenges include the use of incomplete genome annotations, the assumption of temporally stable recombination landscapes, the presence of genes under balancing selection and the consequences of ignoring non-crossover (gene conversion) recombination events. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, IA 52242, USA .,Interdisciplinary Program in Genetics, University of Iowa, Iowa City, IA 52242, USA
| |
Collapse
|
37
|
Dutta R, Saha-Mandal A, Cheng X, Qiu S, Serpen J, Fedorova L, Fedorov A. 1000 human genomes carry widespread signatures of GC biased gene conversion. BMC Genomics 2018; 19:256. [PMID: 29661137 PMCID: PMC5902838 DOI: 10.1186/s12864-018-4593-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 03/12/2018] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND GC-Biased Gene Conversion (gBGC) is one of the important theories put forward to explain profound long-range non-randomness in nucleotide compositions along mammalian chromosomes. Nucleotide changes due to gBGC are hard to distinguish from regular mutations. Here, we present an algorithm for analysis of millions of known SNPs that detects a subset of so-called "SNP flip-over" events representing recent gBGC nucleotide changes, which occurred in previous generations via non-crossover meiotic recombination. RESULTS This algorithm has been applied in a large-scale analysis of 1092 sequenced human genomes. Altogether, 56,328 regions on all autosomes have been examined, which revealed 223,955 putative gBGC cases leading to SNP flip-overs. We detected a strong bias (11.7% ± 0.2% excess) in AT- > GC over GC- > AT base pair changes within the entire set of putative gBGC cases. CONCLUSIONS On average, a human gamete acquires 7 SNP flip-over events, in which one allele is replaced by its complementary allele during the process of meiotic non-crossover recombination. In each meiosis event, on average, gBGC results in replacement of 7 AT base pairs by GC base pairs, while only 6 GC pairs are replaced by AT pairs. Therefore, every human gamete is enriched by one GC pair. Happening over millions of years of evolution, this bias may be a noticeable force in changing the nucleotide composition landscape along chromosomes.
Collapse
Affiliation(s)
- Rajib Dutta
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Center for Cardiovascular and Pulmonary Research, Nationwide Children’s Hospital, 700 Children’s Dr, Columbus, OH USA
| | - Arnab Saha-Mandal
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Biochemistry and Molecular Biology Graduate Program, Cumming School of Medicine, University of Calgary, Calgary, AB T2N4N1 Canada
| | - Xi Cheng
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| | - Shuhao Qiu
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| | - Jasmine Serpen
- SURF Program, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- College of Arts and Sciences, Washington University in St. Louis, 1 Brookings Dr, St. Louis, MO 63130 USA
| | | | - Alexei Fedorov
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| |
Collapse
|
38
|
Esin A, Bergendahl LT, Savolainen V, Marsh JA, Warnecke T. The genetic basis and evolution of red blood cell sickling in deer. Nat Ecol Evol 2018; 2:367-376. [PMID: 29255300 PMCID: PMC5777626 DOI: 10.1038/s41559-017-0420-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 11/20/2017] [Indexed: 11/09/2022]
Abstract
Crescent-shaped red blood cells, the hallmark of sickle-cell disease, present a striking departure from the biconcave disc shape normally found in mammals. Characterized by increased mechanical fragility, sickled cells promote haemolytic anaemia and vaso-occlusions and contribute directly to disease in humans. Remarkably, a similar sickle-shaped morphology has been observed in erythrocytes from several deer species, without obvious pathological consequences. The genetic basis of erythrocyte sickling in deer, however, remains unknown. Here, we determine the sequences of human β-globin orthologues in 15 deer species and use protein structural modelling to identify a sickling mechanism distinct from the human disease, coordinated by a derived valine (E22V) that is unique to sickling deer. Evidence for long-term maintenance of a trans-species sickling/non-sickling polymorphism suggests that sickling in deer is adaptive. Our results have implications for understanding the ecological regimes and molecular architectures that have promoted convergent evolution of sickling erythrocytes across vertebrates.
Collapse
Affiliation(s)
- Alexander Esin
- Molecular Systems Group, Medical Research Council London Institute of Medical Sciences, Du Cane Road, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Du Cane Road, London, United Kingdom
| | - L Therese Bergendahl
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, United Kingdom
| | - Vincent Savolainen
- Department of Life Sciences, Silwood Park Campus, Imperial College London, Ascot, United Kingdom
- University of Johannesburg, Auckland Park, Johannesburg, South Africa
| | - Joseph A Marsh
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, United Kingdom
| | - Tobias Warnecke
- Molecular Systems Group, Medical Research Council London Institute of Medical Sciences, Du Cane Road, London, United Kingdom.
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Du Cane Road, London, United Kingdom.
| |
Collapse
|
39
|
Stevison LS, Sefick S, Rushton C, Graze RM. Recombination rate plasticity: revealing mechanisms by design. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160459. [PMID: 29109222 PMCID: PMC5698621 DOI: 10.1098/rstb.2016.0459] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2017] [Indexed: 12/13/2022] Open
Abstract
For over a century, scientists have known that meiotic recombination rates can vary considerably among individuals, and that environmental conditions can modify recombination rates relative to the background. A variety of external and intrinsic factors such as temperature, age, sex and starvation can elicit 'plastic' responses in recombination rate. The influence of recombination rate plasticity on genetic diversity of the next generation has interesting and important implications for how populations evolve. Further, many questions remain regarding the mechanisms and molecular processes that contribute to recombination rate plasticity. Here, we review 100 years of experimental work on recombination rate plasticity conducted in Drosophila melanogaster We categorize this work into four major classes of experimental designs, which we describe via classic studies in D. melanogaster Based on these studies, we highlight molecular mechanisms that are supported by experimental results and relate these findings to studies in other systems. We synthesize lessons learned from this model system into experimental guidelines for using recent advances in genotyping technologies, to study recombination rate plasticity in non-model organisms. Specifically, we recommend (1) using fine-scale genome-wide markers, (2) collecting time-course data, (3) including crossover distribution measurements, and (4) using mixed effects models to analyse results. To illustrate this approach, we present an application adhering to these guidelines from empirical work we conducted in Drosophila pseudoobscuraThis article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Collapse
Affiliation(s)
- Laurie S Stevison
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Stephen Sefick
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Chase Rushton
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Rita M Graze
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| |
Collapse
|
40
|
Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC bias. Nat Ecol Evol 2017; 2:164-173. [PMID: 29158556 PMCID: PMC5733138 DOI: 10.1038/s41559-017-0372-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Accepted: 10/09/2017] [Indexed: 11/29/2022]
Abstract
GC-favoring gene conversion enables fixation of deleterious alleles, disturbs tests of natural selection and potentially explains both the evolution of recombination as well as the commonly reported intra-genomic correlation between G+C content and recombination rate. In addition, gene conversion disturbs linkage disequilibrium, potentially affecting the ability to detect causative variants. However, the importance and generality of these effects is unresolved, not simply because direct analyses are technically challenging but also because prior within- and between-species discrepant results can be hard to appraise owing to methodological differences. Here we report results of methodologically uniform whole-genome sequencing of all tetrad products in Saccharomyces, Neurospora, Chlamydomonas and Arabidopsis. The proportion of polymorphic markers converted varies over three orders of magnitude between species (from 2% of markers converted in yeast to only ~0.005% in the two plants) with at least 87.5% of the variance in per tetrad conversion rates being between-species. This is largely owing to differences in recombination rate and median tract length. Despite three of the species showing a positive GC-recombination correlation, there is no significant net AT->GC conversion bias in any, despite relatively high resolution in the two taxa (Saccharomyces and Neurospora) with relatively common gene conversion. The absence of a GC bias means: 1) that there should be no presumption that gene conversion is GC biased, nor 2) that a GC-recombination correlation necessarily implies biased gene conversion, 3) that Ka/Ks tests should be unaffected in these species and 4) it is unlikely that gene conversion explains the evolution of recombination.
Collapse
|
41
|
Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates. Proc Natl Acad Sci U S A 2017; 114:12779-12784. [PMID: 29138319 DOI: 10.1073/pnas.1708151114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Gene conversion is the copying of a genetic sequence from a "donor" region to an "acceptor." In nonallelic gene conversion (NAGC), the donor and the acceptor are at distinct genetic loci. Despite the role NAGC plays in various genetic diseases and the concerted evolution of gene families, the parameters that govern NAGC are not well characterized. Here, we survey duplicate gene families and identify converted tracts in 46% of them. These conversions reflect a large GC bias of NAGC. We develop a sequence evolution model that leverages substantially more information in duplicate sequences than used by previous methods and use it to estimate the parameters that govern NAGC in humans: a mean converted tract length of 250 bp and a probability of [Formula: see text] per generation for a nucleotide to be converted (an order of magnitude higher than the point mutation rate). Despite this high baseline rate, we show that NAGC slows down as duplicate sequences diverge-until an eventual "escape" of the sequences from its influence. As a result, NAGC has a small average effect on the sequence divergence of duplicates. This work improves our understanding of the NAGC mechanism and the role that it plays in the evolution of gene duplicates.
Collapse
|
42
|
Tatsumoto S, Go Y, Fukuta K, Noguchi H, Hayakawa T, Tomonaga M, Hirai H, Matsuzawa T, Agata K, Fujiyama A. Direct estimation of de novo mutation rates in a chimpanzee parent-offspring trio by ultra-deep whole genome sequencing. Sci Rep 2017; 7:13561. [PMID: 29093469 PMCID: PMC5666008 DOI: 10.1038/s41598-017-13919-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 10/04/2017] [Indexed: 12/30/2022] Open
Abstract
Mutations generate genetic variation and are a major driving force of evolution. Therefore, examining mutation rates and modes are essential for understanding the genetic basis of the physiology and evolution of organisms. Here, we aim to identify germline de novo mutations through the whole-genome surveyance of Mendelian inheritance error sites (MIEs), those not inherited through the Mendelian inheritance manner from either of the parents, using ultra-deep whole genome sequences (>150-fold) from a chimpanzee parent-offspring trio. We identified such 889 MIEs and classified them into four categories based on the pattern of inheritance and the sequence read depth: [i] de novo single nucleotide variants (SNVs), [ii] copy number neutral inherited variants, [iii] hemizygous deletion inherited variants, and [iv] de novo copy number variants (CNVs). From de novo SNV candidates, we estimated a germline de novo SNV mutation rate as 1.48 × 10-8 per site per generation or 0.62 × 10-9 per site per year. In summary, this study demonstrates the significance of ultra-deep whole genome sequencing not only for the direct estimation of mutation rates but also for discerning various mutation modes including de novo allelic conversion and de novo CNVs by identifying MIEs through the transmission of genomes from parents to offspring.
Collapse
Affiliation(s)
- Shoji Tatsumoto
- Department of Brain Sciences, Center for Novel Science Initiatives, National Institutes of Natural Sciences, Okazaki, Aichi, 444-8585, Japan
| | - Yasuhiro Go
- Department of Brain Sciences, Center for Novel Science Initiatives, National Institutes of Natural Sciences, Okazaki, Aichi, 444-8585, Japan. .,Department of System Neuroscience, National Institute for Physiological Sciences, Okazaki, Aichi, 444-8585, Japan. .,Department of Physiological Sciences, School of Life Science, SOKENDAI (The Graduate University for Advanced Studies), Okazaki, Aichi, 484-8585, Japan.
| | - Kentaro Fukuta
- Center for Genome Informatics, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Mishima, Shizuoka, 411-8540, Japan.,Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan
| | - Hideki Noguchi
- Center for Genome Informatics, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Mishima, Shizuoka, 411-8540, Japan.,Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan
| | - Takashi Hayakawa
- Department of Wildlife Science (Nagoya Railroad Co., Ltd.), Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan.,Japan Monkey Centre, Inuyama, Aichi, 484-0081, Japan
| | - Masaki Tomonaga
- Department of Wildlife Science (Nagoya Railroad Co., Ltd.), Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan.,Japan Monkey Centre, Inuyama, Aichi, 484-0081, Japan.,Language and Intelligence Section, Department of Cognitive Sciences, Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan
| | - Hirohisa Hirai
- Molecular Biology Section, Department of Cellular and Molecular Biology, Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan
| | - Tetsuro Matsuzawa
- Department of Wildlife Science (Nagoya Railroad Co., Ltd.), Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan.,Japan Monkey Centre, Inuyama, Aichi, 484-0081, Japan.,Language and Intelligence Section, Department of Cognitive Sciences, Primate Research Institute, Kyoto University, Inuyama, Aichi, 484-8506, Japan.,Institute of Advanced Study, Kyoto University, Kyoto, 606-8501, Japan
| | - Kiyokazu Agata
- Laboratory for Biodiversity, Global COE Program, Graduate School of Science, Kyoto University, Kyoto, 606-8502, Japan.,Laboratory for Molecular Developmental Biology, Graduate School of Science, Kyoto University, Kyoto, 606-8502, Japan.,Graduate Course in Life Science, Gakushuin University, Tokyo, 171-8585, Japan
| | - Asao Fujiyama
- Center for Genome Informatics, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Mishima, Shizuoka, 411-8540, Japan. .,Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan. .,Department of Genetics, School of Life Science, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Shizuoka, 411-8540, Japan.
| |
Collapse
|
43
|
Bhérer C, Campbell CL, Auton A. Refined genetic maps reveal sexual dimorphism in human meiotic recombination at multiple scales. Nat Commun 2017; 8:14994. [PMID: 28440270 PMCID: PMC5414043 DOI: 10.1038/ncomms14994] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 02/20/2017] [Indexed: 11/09/2022] Open
Abstract
In humans, males have lower recombination rates than females over the majority of the genome, but the opposite is usually true near the telomeres. These broad-scale differences have been known for decades, yet little is known about differences at the fine scale. By combining data sets, we have collected recombination events from over 100,000 meioses and have constructed sex-specific genetic maps at a previously unachievable resolution. Here we show that, although a substantial fraction of the genome shows some degree of sexually dimorphic recombination, the vast majority of hotspots are shared between the sexes, with only a small number of putative sex-specific hotspots. Wavelet analysis indicates that most of the differences can be attributed to the fine scale, and that variation in rate between the sexes can mostly be explained by differences in hotspot magnitude, rather than location. Nonetheless, known recombination-associated genomic features, such as THE1B repeat elements, show systematic differences between the sexes. It is known that males have lower recombination rates than females over much of the genome but little is known about differences at a fine scale. Here the authors combine data from over 100,000 meioses and show that the majority of differences can be explained by variation in hotspot magnitude.
Collapse
Affiliation(s)
- Claude Bhérer
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, New York 10461, USA
| | - Christopher L Campbell
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, New York 10461, USA
| | - Adam Auton
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, New York 10461, USA
| |
Collapse
|
44
|
Zhao LP, Bolouri H, Zhao M, Geraghty DE, Lernmark Å. An Object-Oriented Regression for Building Disease Predictive Models with Multiallelic HLA Genes. Genet Epidemiol 2016; 40:315-32. [PMID: 27080919 DOI: 10.1002/gepi.21968] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Revised: 02/11/2016] [Accepted: 02/17/2016] [Indexed: 12/29/2022]
Abstract
Recent genome-wide association studies confirm that human leukocyte antigen (HLA) genes have the strongest associations with several autoimmune diseases, including type 1 diabetes (T1D), providing an impetus to reduce this genetic association to practice through an HLA-based disease predictive model. However, conventional model-building methods tend to be suboptimal when predictors are highly polymorphic with many rare alleles combined with complex patterns of sequence homology within and between genes. To circumvent this challenge, we describe an alternative methodology; treating complex genotypes of HLA genes as "objects" or "exemplars," one focuses on systemic associations of disease phenotype with "objects" via similarity measurements. Conceptually, this approach assigns disease risks base on complex genotype profiles instead of specific disease-associated genotypes or alleles. Effectively, it transforms large, discrete, and sparse HLA genotypes into a matrix of similarity-based covariates. By the Kernel representative theorem and machine learning techniques, it uses a penalized likelihood method to select disease-associated exemplars in building predictive models. To illustrate this methodology, we apply it to a T1D study with eight HLA genes (HLA-DRB1, HLA-DRB3, HLA-DRB4, HLA-DRB5, HLA-DQA1, HLA-DQB1, HLA-DPA1, and HLA-DPB1) to build a predictive model. The resulted predictive model has an area under curve of 0.92 in the training set, and 0.89 in the validating set, indicating that this methodology is useful to build predictive models with complex HLA genotypes.
Collapse
Affiliation(s)
- Lue Ping Zhao
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America.,Department of Biostatistics, University of Washington School of Public Health, Seattle, Washington, United States of America
| | - Hamid Bolouri
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Michael Zhao
- Bellevue High School, Seattle, Washington, United States of America
| | - Daniel E Geraghty
- Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Åke Lernmark
- Department of Clinical Sciences, Lund University/CRC, Skåne University Hospital, Malmö, Sweden
| | | |
Collapse
|
45
|
The rate of meiotic gene conversion varies by sex and age. Nat Genet 2016; 48:1377-1384. [PMID: 27643539 PMCID: PMC5083143 DOI: 10.1038/ng.3669] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 08/16/2016] [Indexed: 12/14/2022]
Abstract
Meiotic recombination involves a combination of gene conversion and crossover events that, along with mutations, produce germline genetic diversity. Here we report the discovery of 3,176 SNP and 61 indel gene conversions. Our estimate of the non-crossover (NCO) gene conversion rate (G) is 7.0 for SNPs and 5.8 for indels per megabase per generation, and the GC bias is 67.6%. For indels, we demonstrate a 65.6% preference for the shorter allele. NCO gene conversions from mothers are longer than those from fathers, and G is 2.17 times greater in mothers. Notably, G increases with the age of mothers, but not the age of fathers. A disproportionate number of NCO gene conversions in older mothers occur outside double-strand break (DSB) regions and in regions with relatively low GC content. This points to age-related changes in the mechanisms of meiotic gene conversion in oocytes.
Collapse
|
46
|
Miles A, Iqbal Z, Vauterin P, Pearson R, Campino S, Theron M, Gould K, Mead D, Drury E, O'Brien J, Ruano Rubio V, MacInnis B, Mwangi J, Samarakoon U, Ranford-Cartwright L, Ferdig M, Hayton K, Su XZ, Wellems T, Rayner J, McVean G, Kwiatkowski D. Indels, structural variation, and recombination drive genomic diversity in Plasmodium falciparum. Genome Res 2016; 26:1288-99. [PMID: 27531718 PMCID: PMC5052046 DOI: 10.1101/gr.203711.115] [Citation(s) in RCA: 136] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2015] [Accepted: 06/28/2016] [Indexed: 12/14/2022]
Abstract
The malaria parasite Plasmodium falciparum has a great capacity for evolutionary adaptation to evade host immunity and develop drug resistance. Current understanding of parasite evolution is impeded by the fact that a large fraction of the genome is either highly repetitive or highly variable and thus difficult to analyze using short-read sequencing technologies. Here, we describe a resource of deep sequencing data on parents and progeny from genetic crosses, which has enabled us to perform the first genome-wide, integrated analysis of SNP, indel and complex polymorphisms, using Mendelian error rates as an indicator of genotypic accuracy. These data reveal that indels are exceptionally abundant, being more common than SNPs and thus the dominant mode of polymorphism within the core genome. We use the high density of SNP and indel markers to analyze patterns of meiotic recombination, confirming a high rate of crossover events and providing the first estimates for the rate of non-crossover events and the length of conversion tracts. We observe several instances of meiotic recombination within copy number variants associated with drug resistance, demonstrating a mechanism whereby fitness costs associated with resistance mutations could be compensated and greater phenotypic plasticity could be acquired.
Collapse
Affiliation(s)
- Alistair Miles
- MRC Centre for Genomics and Global Health, University of Oxford, Oxford OX3 7BN, United Kingdom; Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Zamin Iqbal
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Paul Vauterin
- MRC Centre for Genomics and Global Health, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Richard Pearson
- MRC Centre for Genomics and Global Health, University of Oxford, Oxford OX3 7BN, United Kingdom; Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Susana Campino
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Michel Theron
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Kelda Gould
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Daniel Mead
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Eleanor Drury
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | | | | | - Bronwyn MacInnis
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts 02142, USA
| | - Jonathan Mwangi
- Department of Biochemistry, Medical School, Mount Kenya University, 01000 Thika, Kenya; Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, United Kingdom
| | - Upeka Samarakoon
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana 46556, USA
| | - Lisa Ranford-Cartwright
- Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, United Kingdom
| | - Michael Ferdig
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana 46556, USA
| | - Karen Hayton
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Disease, National Institutes of Health, Bethesda, Maryland 20892-9806, USA
| | - Xin-Zhuan Su
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Disease, National Institutes of Health, Bethesda, Maryland 20892-9806, USA
| | - Thomas Wellems
- Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Disease, National Institutes of Health, Bethesda, Maryland 20892-9806, USA
| | - Julian Rayner
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| | - Gil McVean
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom; Department of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom
| | - Dominic Kwiatkowski
- MRC Centre for Genomics and Global Health, University of Oxford, Oxford OX3 7BN, United Kingdom; Malaria Programme, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom
| |
Collapse
|
47
|
Huang J, Copenhaver GP, Ma H, Wang Y. New insights into the role of DNA synthesis in meiotic recombination. Sci Bull (Beijing) 2016. [DOI: 10.1007/s11434-016-1126-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
48
|
Korunes KL, Noor MAF. Gene conversion and linkage: effects on genome evolution and speciation. Mol Ecol 2016; 26:351-364. [DOI: 10.1111/mec.13736] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Revised: 06/07/2016] [Accepted: 06/22/2016] [Indexed: 12/12/2022]
|
49
|
Elevated Linkage Disequilibrium and Signatures of Soft Sweeps Are Common in Drosophila melanogaster. Genetics 2016; 203:863-80. [PMID: 27098909 DOI: 10.1534/genetics.115.184002] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 03/25/2016] [Indexed: 12/20/2022] Open
Abstract
The extent to which selection and demography impact patterns of genetic diversity in natural populations of Drosophila melanogaster is yet to be fully understood. We previously observed that linkage disequilibrium (LD) at scales of ∼10 kb in the Drosophila Genetic Reference Panel (DGRP), consisting of 145 inbred strains from Raleigh, North Carolina, measured both between pairs of sites and as haplotype homozygosity, is elevated above neutral demographic expectations. We also demonstrated that signatures of strong and recent soft sweeps are abundant. However, the extent to which these patterns are specific to this derived and admixed population is unknown. It is also unclear whether these patterns are a consequence of the extensive inbreeding performed to generate the DGRP data. Here we analyze LD statistics in a sample of >100 fully-sequenced strains from Zambia; an ancestral population to the Raleigh population that has experienced little to no admixture and was generated by sequencing haploid embryos rather than inbred strains. We find an elevation in long-range LD and haplotype homozygosity compared to neutral expectations in the Zambian sample, thus showing the elevation in LD is not specific to the DGRP data set. This elevation in LD and haplotype structure remains even after controlling for possible confounders including genomic inversions, admixture, population substructure, close relatedness of individual strains, and recombination rate variation. Furthermore, signatures of partial soft sweeps similar to those found in the DGRP as well as partial hard sweeps are common in Zambia. These results suggest that while the selective forces and sources of adaptive mutations may differ in Zambia and Raleigh, elevated long-range LD and signatures of soft sweeps are generic in D. melanogaster.
Collapse
|
50
|
Recombination hotspots: Models and tools for detection. DNA Repair (Amst) 2016; 40:47-56. [PMID: 26991854 DOI: 10.1016/j.dnarep.2016.02.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 02/09/2016] [Indexed: 11/22/2022]
Abstract
Recombination hotspots are the regions within the genome where the rate, and the frequency of recombination are optimum with a size varying from 1 to 2kb. The recombination event is mediated by the double-stranded break formation, guided by the combined enzymatic action of DNA topoisomerase and Spo 11 endonuclease. These regions are distributed non-uniformly throughout the human genome and cause distortions in the genetic map. Numerous lines of evidence suggest that the number of hotspots known in humans has increased manifold in recent years. A few facts about the hotspot evolutions were also put forward, indicating the differences in the hotspot position between chimpanzees and humans. In mice, recombination hot spots were found to be clustered within the major histocompatibility complex (MHC) region. Several models, that help explain meiotic recombination has been proposed. Moreover, scientists also developed some computational tools to locate the hotspot position and estimate their recombination rate in humans is of great interest to population and medical geneticists. Here we reviewed the molecular mechanisms, models and in silico prediction techniques of hot spot residues.
Collapse
|