1
|
Porubsky D, Eichler EE. A 25-year odyssey of genomic technology advances and structural variant discovery. Cell 2024; 187:1024-1037. [PMID: 38290514 DOI: 10.1016/j.cell.2024.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/20/2023] [Accepted: 01/02/2024] [Indexed: 02/01/2024]
Abstract
This perspective focuses on advances in genome technology over the last 25 years and their impact on germline variant discovery within the field of human genetics. The field has witnessed tremendous technological advances from microarrays to short-read sequencing and now long-read sequencing. Each technology has provided genome-wide access to different classes of human genetic variation. We are now on the verge of comprehensive variant detection of all forms of variation for the first time with a single assay. We predict that this transition will further transform our understanding of human health and biology and, more importantly, provide novel insights into the dynamic mutational processes shaping our genomes.
Collapse
Affiliation(s)
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
2
|
Uncovering novel MHC alleles from RNA-Seq data: expanding the spectrum of MHC class I alleles in sheep. BMC Genom Data 2023; 24:1. [PMID: 36597020 PMCID: PMC9809118 DOI: 10.1186/s12863-022-01102-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 12/20/2022] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Major histocompatibility complex (MHC) class I glycoproteins present selected peptides or antigens to CD8 + T cells that control the cytotoxic immune response. The MHC class I genes are among the most polymorphic loci in the vertebrate genome, with more than twenty thousand alleles known in humans. In sheep, only a very small number of alleles have been described to date, making the development of genotyping systems or functional studies difficult. A cost-effective way to identify new alleles could be to use already available RNA-Seq data from sheep. Current strategies for aligning RNA-Seq reads against annotated genome sequences or transcriptomes fail to detect the majority of class I alleles. Here, I combine the alignment of RNA-Seq reads against a specific reference database with de novo assembly to identify alleles. The method allows the comprehensive discovery of novel MHC class I alleles from RNA-Seq data (DinoMfRS). RESULTS Using DinoMfRS, virtually all expressed MHC class I alleles could be determined. From 18 animals 75 MHC class I alleles were identified, of which 69 were novel. In addition, it was shown that DinoMfRS can be used to improve the annotation of MHC genes in the sheep genome sequence. CONCLUSIONS DinoMfRS allows for the first time the annotation of unknown, more divergent MHC alleles from RNA-Seq data. Successful application to RNA-Seq data from 16 animals has approximately doubled the number of known alleles in sheep. By using existing data, alleles can now be determined very inexpensively for populations that have not been well studied. In addition, MHC expression studies or evolutionary studies, for example, can be greatly improved in this way, and the method should be applicable to a broader spectrum of other multigene families or highly polymorphic genes.
Collapse
|
3
|
Javidi-Parsijani P, Lyu P, Makani V, Sarhan WM, Yoo KW, El-Korashi L, Atala A, Lu B. CRISPR/Cas9 increases mitotic gene conversion in human cells. Gene Ther 2020; 27:281-296. [PMID: 32020049 DOI: 10.1038/s41434-020-0126-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Revised: 01/09/2020] [Accepted: 01/27/2020] [Indexed: 12/25/2022]
Abstract
Gene conversion is a process of transferring genetic material from one homologous sequence to another. Most reported gene conversions are meiotic although mitotic gene conversion is also described. When using CRISPR/Cas9 to target the human hemoglobin subunit beta (HBB) gene, hemoglobin subunit delta (HBD) gene footprints were observed in HBB gene. However, it is unclear whether these were the results of gene conversion or PCR-mediated sequence shuffling between highly homologous sequences. Here we provide evidence that the HBD footprints in HBB were indeed results of gene conversion. We demonstrated that the CRISPR/Cas9 facilitated unidirectional sequence transfer from the homologous gene without double-strand breaks (DSB) to the one with DSBs, and showed that the rates of HBD footprint in HBB were positively correlated to the HBB insertion and deletion rates. We further showed that when targeting HBD gene, HBB footprints could also be observed in HBD gene. The mitotic gene conversion was observed not only in immortalized HEK293T cells, but also in human primary cells. Our work reveals mitotic gene conversion as an often overlooked effect of CRISPR/Cas9-mediated genome editing.
Collapse
Affiliation(s)
- Parisa Javidi-Parsijani
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Pin Lyu
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Vishruti Makani
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Walaa Mohamed Sarhan
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Kyung Whan Yoo
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Lobna El-Korashi
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Anthony Atala
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA
| | - Baisong Lu
- Wake Forest Institute for Regenerative Medicine, Wake Forest University Health Sciences, Winston-Salem, NC, USA.
| |
Collapse
|
4
|
Resolving MiSeq-Generated Ambiguities in HLA-DPB1 Typing by Using the Oxford Nanopore Technology. J Mol Diagn 2019; 21:852-861. [PMID: 31173929 DOI: 10.1016/j.jmoldx.2019.04.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 04/19/2019] [Accepted: 04/30/2019] [Indexed: 12/30/2022] Open
Abstract
The technical limitations of current next-generation sequencing technologies, combined with an ever-increasing number of human leukocyte antigen (HLA) alleles, form the basis for the additional ambiguities encountered at an increasing rate in clinical practice. HLA-DPB1 characterization, particularly, generates a significant percentage of ambiguities (25.5%), posing a challenge for accurate and unambiguous HLA-DPB1 genotyping. Phasing of exonic heterozygous positions between exon 2 and all other downstream exons has been the major cause of ambiguities. In this study, the Oxford Nanopore MinION, a third-generation sequencing technology, was used to resolve the phasing. The accurate MiSeq sequencing data, combined with the long reads obtained from the MinION platform, allow for the resolution of the tested ambiguities.
Collapse
|
5
|
Sasani TA, Cone KR, Quinlan AR, Elde NC. Long read sequencing reveals poxvirus evolution through rapid homogenization of gene arrays. eLife 2018; 7:35453. [PMID: 30156554 PMCID: PMC6115191 DOI: 10.7554/elife.35453] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 08/12/2018] [Indexed: 12/21/2022] Open
Abstract
Poxvirus adaptation can involve combinations of recombination-driven gene copy number variation and beneficial single nucleotide variants (SNVs) at the same loci. How these distinct mechanisms of genetic diversification might simultaneously facilitate adaptation to host immune defenses is unknown. We performed experimental evolution with vaccinia virus populations harboring a SNV in a gene actively undergoing copy number amplification. Using long sequencing reads from the Oxford Nanopore Technologies platform, we phased SNVs within large gene copy arrays for the first time. Our analysis uncovered a mechanism of adaptive SNV homogenization reminiscent of gene conversion, which is actively driven by selection. This study reveals a new mechanism for the fluid gain of beneficial mutations in genetic regions undergoing active recombination in viruses and illustrates the value of long read sequencing technologies for investigating complex genome dynamics in diverse biological systems.
Collapse
Affiliation(s)
- Thomas A Sasani
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Kelsey R Cone
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Aaron R Quinlan
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Nels C Elde
- Department of Human Genetics, University of Utah, Salt Lake, United States
| |
Collapse
|
6
|
Linnenbrink M, Teschke M, Montero I, Vallier M, Tautz D. Meta-populational demes constitute a reservoir for large MHC allele diversity in wild house mice ( Mus musculus). Front Zool 2018; 15:15. [PMID: 29721030 PMCID: PMC5910556 DOI: 10.1186/s12983-018-0266-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 04/10/2018] [Indexed: 12/12/2022] Open
Abstract
Background The MHC class I and II loci mediate the adaptive immune response and belong to the most polymorphic loci in vertebrate genomes. In fact, the number of different alleles in a given species is often so large that it remains a challenge to provide an evolutionary model that can fully account for this. Results We provide here a general survey of MHC allele numbers in house mouse populations and two sub-species (M. m. domesticus and M. m. musculus) for H2 class I D and K, as well as class II A and E loci. Between 50 and 90% of the detected different sequences constitute new alleles, confirming that the discovery of new alleles is indeed far from complete. House mice live in separate demes with small effective population sizes, factors that were proposed to reduce, rather than enhance the possibility for the maintenance of many different alleles. To specifically investigate the occurrence of alleles within demes, we focused on the class II H2-Aa and H2-Eb exon 2 alleles in nine demes of M. m. domesticus from two different geographic regions. We find on the one hand a group of alleles that occur in different sampling regions and three quarters of these are also found in both sub-species. On the other hand, the larger group of different alleles (56%) occurs only in one of the regions and most of these (89%) only in single demes. We show that most of these region-specific alleles have apparently arisen through recombination and/or partial gene conversion from already existing alleles. Conclusions Demes can act as sources of alleles that outnumber the set of alleles that are shared across the species range. These findings support the reservoir model proposed for human MHC diversity, which states that large pools of rare MHC allele variants are continuously generated by neutral mutational mechanisms. Given that these can become important in the defense against newly emerging pathogens, the reservoir model complements the selection based models for MHC diversity and explains why the exceptional diversity exists. Electronic supplementary material The online version of this article (10.1186/s12983-018-0266-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Miriam Linnenbrink
- 3Max-Planck Institute for Evolutionary Biology, August-Thienemannstrasse 2, 24306 Plön, Germany
| | - Meike Teschke
- 1Present address: Deutsche Forschungsgemeinschaft, 53170 Bonn, Germany.,3Max-Planck Institute for Evolutionary Biology, August-Thienemannstrasse 2, 24306 Plön, Germany
| | - Inka Montero
- 2Present address: Medical Faculty, Eberhard Karls Universität Tübingen, Tübingen, Germany.,3Max-Planck Institute for Evolutionary Biology, August-Thienemannstrasse 2, 24306 Plön, Germany
| | - Marie Vallier
- 3Max-Planck Institute for Evolutionary Biology, August-Thienemannstrasse 2, 24306 Plön, Germany
| | - Diethard Tautz
- 3Max-Planck Institute for Evolutionary Biology, August-Thienemannstrasse 2, 24306 Plön, Germany
| |
Collapse
|
7
|
Fabreti-Oliveira R, Lasmar M, Oliveira C, Vale E, Nascimento E. Genetic Mechanisms Involved in the Generation of HLA Alleles in Brazilians: Description and Comparison of HLA Alleles. Transplant Proc 2018; 50:835-840. [DOI: 10.1016/j.transproceed.2018.02.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
8
|
Ramesh A, Darko S, Hua A, Overman G, Ransier A, Francica JR, Trama A, Tomaras GD, Haynes BF, Douek DC, Kepler TB. Structure and Diversity of the Rhesus Macaque Immunoglobulin Loci through Multiple De Novo Genome Assemblies. Front Immunol 2017; 8:1407. [PMID: 29163486 PMCID: PMC5663730 DOI: 10.3389/fimmu.2017.01407] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 10/11/2017] [Indexed: 11/13/2022] Open
Abstract
The rhesus macaque is a critically important animal model in biomedical research, most recently playing a key role in the development of vaccines against human immunodeficiency virus-1. Nevertheless, the immunoglobulin (Ig) loci of macaques are as yet incompletely determined and our understanding of differences between human and macaque humoral immunity remains deficient. We completed a high-coverage, high-quality whole genome sequencing and assembly project with a single rhesus macaque of Indian origin, and partial genome assemblies using genomic molecular targeting of the Ig loci in nine other rhesus macaques of Indian origin. These data indicate that the macaque Ig loci are substantially more diverse than those in humans, including greater sequence diversity and copy-number variation between individuals. It appears likely that such copy-number variation even occurs between allelic loci within individuals. Different Ig gene families in the macaque show distinct relationships to the corresponding human gene families and appear to evolve under different mechanisms. These results raise intriguing questions about the evolution of antigen receptors in primates but also have important practical implications for the design and interpretation of biomedical studies.
Collapse
Affiliation(s)
- Akshaya Ramesh
- Graduate Program in Genetics and Genomics, Boston University School of Medicine, Boston, MA, United States
| | - Sam Darko
- NIH Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Axin Hua
- Department of Microbiology, Boston University School of Medicine, Boston, MA, United States
| | - Glenn Overman
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, United States
| | - Amy Ransier
- NIH Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Joseph R Francica
- NIH Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Ashley Trama
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, United States
| | - Georgia D Tomaras
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, United States
| | - Barton F Haynes
- Duke Human Vaccine Institute, Duke University Medical Center, Durham, NC, United States
| | - Daniel C Douek
- NIH Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Thomas B Kepler
- Department of Microbiology, Boston University School of Medicine, Boston, MA, United States.,Department of Mathematics and Statistics, Boston University, Boston, MA, United States
| |
Collapse
|
9
|
Characterisation of MHC class I genes in the koala. Immunogenetics 2017; 70:125-133. [PMID: 28669101 DOI: 10.1007/s00251-017-1018-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 06/20/2017] [Indexed: 10/19/2022]
Abstract
Koala (Phascolarctos cinereus) populations are on the decline across the majority of Australia's mainland. Two major diseases threatening the long-term survival of affected koala populations are caused by obligate intracellular pathogens: Chlamydia and koala retrovirus (KoRV). To improve our understanding of the koala immune system, we characterised their major histocompatibility complex (MHC) class I genes, which are centrally involved in presenting foreign peptides derived from intracellular pathogens to cytotoxic T cells. A total of 11 class I genes were identified in the koala genome. Three genes, Phci-UA, UB and UC, showed relatively high genetic variability and were expressed in all 12 examined tissues, whereas the other eight genes had tissue-specific expression and limited polymorphism. Evidence of diversifying selection was detected in Phci-UA and UC, while gene conversion may have played a role in creating new alleles at Phci-UB. We propose that Phci-UA, UB and UC are likely classical MHC genes of koalas, and further research is needed to understand their role in koala chlamydial and KoRV infections.
Collapse
|
10
|
Trombetta B, D'Atanasio E, Cruciani F. Patterns of Inter-Chromosomal Gene Conversion on the Male-Specific Region of the Human Y Chromosome. Front Genet 2017; 8:54. [PMID: 28515739 PMCID: PMC5413550 DOI: 10.3389/fgene.2017.00054] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 04/18/2017] [Indexed: 12/31/2022] Open
Abstract
The male-specific region of the human Y chromosome (MSY) is characterized by the lack of meiotic recombination and it has long been considered an evolutionary independent region of the human genome. In recent years, however, the idea that human MSY did not have an independent evolutionary history begun to emerge with the discovery that inter-chromosomal gene conversion (ICGC) can modulate the genetic diversity of some portions of this genomic region. Despite the study of the dynamics of this molecular mechanism in humans is still in its infancy, some peculiar features and consequences of it can be summarized. The main effect of ICGC is to increase the allelic diversity of MSY by generating a significant excess of clustered single nucleotide polymorphisms (SNPs) (defined as groups of two or more SNPs occurring in close proximity and on the same branch of the Y phylogeny). On the human MSY, 13 inter-chromosomal gene conversion hotspots (GCHs) have been identified so far, involving donor sequences mainly from the X-chromosome and, to a lesser extent, from autosomes. Most of the GCHs are evolutionary conserved and overlap with regions involved in aberrant X–Y crossing-over. This review mainly focuses on the dynamics and the current knowledge concerning the recombinational landscape of the human MSY in the form of ICGC, on how this molecular mechanism may influence the evolution of the MSY, and on how it could affect the information enclosed within a genomic region which, until recently, appeared to be an evolutionary independent unit.
Collapse
Affiliation(s)
- Beniamino Trombetta
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy
| | - Eugenia D'Atanasio
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy
| | - Fulvio Cruciani
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di RomaRome, Italy.,Istituto di Biologia e Patologia Molecolari, Consiglio Nazionale delle Ricerche (CNR),Rome, Italy
| |
Collapse
|
11
|
Y chromosome palindromes and gene conversion. Hum Genet 2017; 136:605-619. [PMID: 28303348 DOI: 10.1007/s00439-017-1777-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2016] [Accepted: 03/07/2017] [Indexed: 02/02/2023]
Abstract
The presence of large and near-identical inverted repeat sequences (called palindromes) is a common feature of the constitutively haploid sex chromosomes of different species. Despite the fact palindromes originated in a non-recombining context, they have evolved a strong recombinational activity in the form of abundant arm-to-arm gene conversion. Their independent appearance in different species suggests they can have a profound biological significance that has yet to be fully clarified. It has been theorized that natural selection may have favored palindromic organization of male-specific genes and that the establishment of intra-palindrome gene conversion has strong adaptive significance. Arm-to-arm gene conversion allows the efficient removal of deleterious mutations, increases the fixation rate of beneficial mutations and has played an important role in modulating the equilibrium between gene loss and acquisition during Y chromosome evolution. Additionally, a palindromic organization of duplicates could favor the formation of unusual chromatin structures and could optimize the use of gene conversion as a mechanism to maintain the structural integrity of male-specific genes. In this review, we describe the structural features of palindromes on mammalian sex chromosomes and summarize different hypotheses regarding palindrome evolution and the functional benefits of arm-to-arm gene conversion on the unique haploid portion of the nuclear genome.
Collapse
|
12
|
Dumont BL. Interlocus gene conversion explains at least 2.7% of single nucleotide variants in human segmental duplications. BMC Genomics 2015; 16:456. [PMID: 26077037 PMCID: PMC4467073 DOI: 10.1186/s12864-015-1681-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 06/01/2015] [Indexed: 01/24/2023] Open
Abstract
Background Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. Results Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. Conclusions Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Beth L Dumont
- Initiative in Biological Complexity, North Carolina State University, 112 Derieux Place, 3510 Thomas Hall, Campus Box 7614, Raleigh, NC, 27695-7614, USA.
| |
Collapse
|
13
|
Ellison CE, Bachtrog D. Non-allelic gene conversion enables rapid evolutionary change at multiple regulatory sites encoded by transposable elements. eLife 2015; 4. [PMID: 25688566 PMCID: PMC4384637 DOI: 10.7554/elife.05899] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Accepted: 02/16/2015] [Indexed: 12/02/2022] Open
Abstract
Transposable elements (TEs) allow rewiring of regulatory networks, and the recent amplification of the ISX element dispersed 77 functional but suboptimal binding sites for the dosage compensation complex to a newly formed X chromosome in Drosophila. Here we identify two linked refining mutations within ISX that interact epistatically to increase binding affinity to the dosage compensation complex. Selection has increased the frequency of this derived haplotype in the population, which is fixed at 30% of ISX insertions and polymorphic among another 41%. Sharing of this haplotype indicates that high levels of gene conversion among ISX elements allow them to ‘crowd-source’ refining mutations, and a refining mutation that occurs at any single ISX element can spread in two dimensions: horizontally across insertion sites by non-allelic gene conversion, and vertically through the population by natural selection. These results describe a novel route by which fully functional regulatory elements can arise rapidly from TEs and implicate non-allelic gene conversion as having an important role in accelerating the evolutionary fine-tuning of regulatory networks. DOI:http://dx.doi.org/10.7554/eLife.05899.001 Mutations change genes and provide the raw material for evolution. Genes are sections of DNA that contain the instructions for making proteins or other molecules, and so determine the physical characteristics of each organism. Genetic mutations that increase an organism's number of offspring and chances of survival are more likely to be passed on to future generations. Changes to when or where a gene is switched on (so-called regulatory mutations) can also provide fitness benefits and can therefore be selected for during evolution. Transposable elements are sequences of DNA that are also called ‘jumping genes’ because they can make copies of themselves and these copies of the transposable element can move to other locations in the genome. Some transposable elements contain sequences that switch on nearby genes. If different copies of a transposable element that contains such a regulatory sequence insert themselves in more than one place, it can result in a network of genes that can all be controlled in the same way. The regulatory sequences contained within transposable elements are not always optimal, but they can be fine-tuned through evolution. A fruit fly called Drosophila miranda has a transposable element called ISX that has, over time, placed up to 77 regulatory sequences around one of this species' sex chromosomes. Just as in humans, female flies are XX and males are XY; but having only one copy of the X chromosome means that male flies need to increase the expression of certain genes to produce a full-dose of the molecules made by the genes. This process is called dosage compensation and in 2013 the 77 ISX regulatory sequences on the fruit fly's X chromosome were shown to help recruit the molecular machinery that carries out dosage compensation to nearby genes, albeit inefficiently. Now Ellison and Bachtrog—who also conducted the 2013 study—report how these transposable elements have been fine-tuned to make them more effective for dosage compensation. Ellison and Bachtrog uncovered two mutations that make the ISX transposable element better at recruiting the dosage compensation molecular machinery. ISX spread around different locations along the fly's X chromosome before these mutations arose; this means that initially none of the 77 insertions carried the two mutations, but now 30% of the 77 elements have the mutations in all flies, and 41% have them in only some flies. The same mutations have spread between the different ISX elements because transposable elements with the mutations have been used to directly convert other ISX elements without them. These mutations have also become more common in the fruit fly population by being passed on to offspring and increasing their survival. These two routes have accelerated the fine-tuning of these transposable elements for use in gene regulation. This implies that regulatory sequences derived from transposable elements evolve in a way that is fundamentally different from those that arise by other means, as the direct conversion between these insertions allows fine-tuning mutations to spread more rapidly. DOI:http://dx.doi.org/10.7554/eLife.05899.002
Collapse
Affiliation(s)
- Christopher E Ellison
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Doris Bachtrog
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
14
|
Dearborn DC, Gager AB, Gilmour ME, McArthur AG, Hinerfeld DA, Mauck RA. Non-neutral evolution and reciprocal monophyly of two expressed Mhc class II B genes in Leach’s storm-petrel. Immunogenetics 2014; 67:111-23. [DOI: 10.1007/s00251-014-0813-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Accepted: 10/30/2014] [Indexed: 12/21/2022]
|
15
|
Trowsdale J, Koch N. Reply to Ding et al.: Non-Neanderthal Origin of the HLA-DPB1*0401. J Biol Chem 2014; 289:10253. [DOI: 10.1074/jbc.l114.552885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
16
|
Padhukasahasram B, Rannala B. Meiotic gene-conversion rate and tract length variation in the human genome. Eur J Hum Genet 2013:ejhg201330. [PMID: 23443031 DOI: 10.1038/ejhg.2013.30] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2012] [Revised: 12/17/2012] [Accepted: 01/10/2013] [Indexed: 01/11/2023] Open
Abstract
Meiotic recombination occurs in the form of two different mechanisms called crossing-over and gene-conversion and both processes have an important role in shaping genetic variation in populations. Although variation in crossing-over rates has been studied extensively using sperm-typing experiments, pedigree studies and population genetic approaches, our knowledge of variation in gene-conversion parameters (ie, rates and mean tract lengths) remains far from complete. To explore variability in population gene-conversion rates and its relationship to crossing-over rate variation patterns, we have developed and validated using coalescent simulations a comprehensive Bayesian full-likelihood method that can jointly infer crossing-over and gene-conversion rates as well as tract lengths from population genomic data under general variable rate models with recombination hotspots. Here, we apply this new method to SNP data from multiple human populations and attempt to characterize for the first time the fine-scale variation in gene-conversion parameters along the human genome. We find that the estimated ratio of gene-conversion to crossing-over rates varies considerably across genomic regions as well as between populations. However, there is a great degree of uncertainty associated with such estimates. We also find substantial evidence for variation in the mean conversion tract length. The estimated tract lengths did not show any negative relationship with the local heterozygosity levels in our analysis.European Journal of Human Genetics advance online publication, 27 February 2013; doi:10.1038/ejhg.2013.30.
Collapse
Affiliation(s)
- Badri Padhukasahasram
- 1] Center for Health Policy and Health Services Research, Henry Ford Health System, Detroit, MI, USA [2] Genome Center and Department of Evolution and Ecology, University of California, Davis, Davis, CA, USA
| | - Bruce Rannala
- Genome Center and Department of Evolution and Ecology, University of California, Davis, Davis, CA, USA
| |
Collapse
|
17
|
[Polymorphic analysis of Mhc-DPB1 gene exon 2 in Tibetan macaques (Macaca thibetana)]. YI CHUAN = HEREDITAS 2012. [PMID: 23208139 DOI: 10.3724/sp.j.1005.2012.01417] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Major histocompatibility complex (MHC) molecules play an important role in the susceptibility and/or resistance to many diseases. To gain an insight into the MHC background of the Tibetan macaques (Macaca thibetana), and thereby facilitate their protection and application in biomedical research, the second exon of the Mhc-DPB1 genes from 70 Tibetan macaques in Sichuan Province were characterized by PCR, cloning, sequencing, and statistical analysis. A total of 18 Mhc-DPB1 alleles were identified from Tibetan macaques, of which one (Math-DPB1*01:06N) was a pseudogene. Math-DPB1*06:01:01 (67.14%) was the most frequent allele in all the 18 alleles detected, followed by Math-DPB1* 01:03:01 (37.14%), Math-DPB1*09:02 (25.71%), and Math-DPB1*22:01 (15.71%). The alignment of putative amino acid sequences of the 18 Math-DPB1 alleles showed that 5 variable sites were species-specific to Tibetan macaques. A phylogenetic tree constructed using DPB1 alleles in difference species demonstrated that the alleles for Math-DPB1, Mamu-DPB1, and Mafa-DPB1 tended to mix together, rather than cluster into a separate branch in a species-specific fashion, and the Trans-species polymorphism was also observed in the phylogenetic tree. Selection analysis revealed that balancing selection may play an important role in maintaining the polymorphism of Math-DPB1 genes.
Collapse
|
18
|
Klitz W, Hedrick P, Louis EJ. New reservoirs of HLA alleles: pools of rare variants enhance immune defense. Trends Genet 2012; 28:480-6. [PMID: 22867968 DOI: 10.1016/j.tig.2012.06.007] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2012] [Revised: 06/15/2012] [Accepted: 06/22/2012] [Indexed: 11/16/2022]
Abstract
Highly polymorphic exons of the major histocompatibility complex (MHC, or HLA in humans) encode critical amino acids that bind foreign peptides. Recognition of the peptide-MHC complexes by T cells initiates the adaptive immune response. The particular structure of these exons facilitates gene conversion(GC) events, leading to the generation of new alleles. Estimates for allele creation and loss indicate that more than 10000 such alleles are circulating at low frequencies in human populations. Empirical sampling has affirmed this expectation. This suggests that the MHC loci have a system for moving valuable and often complex variants into adaptive service. Here, we argue that HLA loci carry many new mutant alleles prepared to assume epidemiologically meaningful roles when called on by selection provoked by exposure to new and evolving pathogens. Because new mutant alleles appear in a population at the lowest possible frequency (i.e., a single copy), they have typically been thought of as having little consequence. However, this large population of rare yet potentially valuable new alleles may contribute to pathogen defense.
Collapse
Affiliation(s)
- William Klitz
- School of Public Health, University of California, Berkeley, CA, USA.
| | | | | |
Collapse
|
19
|
Deng L, Tang X, Hao X, Chen W, Lin J, Yu Y, Zhang D, Zeng C. Genetic flux between h1 and h2 haplotypes of the 17q21.31 inversion in European population. GENOMICS PROTEOMICS & BIOINFORMATICS 2012; 9:113-8. [PMID: 21802048 PMCID: PMC5054451 DOI: 10.1016/s1672-0229(11)60014-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2010] [Accepted: 05/16/2011] [Indexed: 12/03/2022]
Abstract
The chromosome 17q21.31 inversion is a 900-kb common structural polymorphism found primarily in European population. Although the genetic flux within inversion region was assumed to be considerable suppressed, it is still unclear about the details of genetic exchange between the H1 (non-inverted sequence) and H2 (inverted sequence) haplotypes of this inversion. Here we describe a refined map of genetic exchanges between pairs of gene arrangements within the 17q21.31 region. Using HapMap phase II data of 1,546 single nucleotide polymorphisms, we successfully deduced 96 H1 and 24 H2 haplotypes in European samples by neighbor-joining tree reconstruction. Furthermore, we identified 15 and 26 candidate tracts with reciprocal and non-reciprocal genetic exchanges, respectively. In all 15 regions harboring reciprocal exchange, haplotypes reconstructed by clone sequencing did not support these exchange events, suggesting that such signals of exchange between two sister chromosomes in certain heterozygous individual were caused by phasing error regions. On the other hand, the finished clone sequencing across 4 of 26 tracts with non-reciprocal genetic flux confirmed that this kind of genetic exchange was caused by gene conversion. In summary, as crossover between pairs of gene arrangements had been considerably suppressed, gene conversion might be the most important mechanism for genetic exchange at 17q21.31.
Collapse
|
20
|
Abstract
Meiotic recombination is a fundamental cellular mechanism in sexually reproducing organisms and its different forms, crossing over and gene conversion both play an important role in shaping genetic variation in populations. Here, we describe a coalescent-based full-likelihood Markov chain Monte Carlo (MCMC) method for jointly estimating the crossing-over, gene-conversion, and mean tract length parameters from population genomic data under a Bayesian framework. Although computationally more expensive than methods that use approximate likelihoods, the relative efficiency of our method is expected to be optimal in theory. Furthermore, it is also possible to obtain a posterior sample of genealogies for the data using this method. We first check the performance of the new method on simulated data and verify its correctness. We also extend the method for inference under models with variable gene-conversion and crossing-over rates and demonstrate its ability to identify recombination hotspots. Then, we apply the method to two empirical data sets that were sequenced in the telomeric regions of the X chromosome of Drosophila melanogaster. Our results indicate that gene conversion occurs more frequently than crossing over in the su-w and su-s gene sequences while the local rates of crossing over as inferred by our program are not low. The mean tract lengths for gene-conversion events are estimated to be ∼70 bp and 430 bp, respectively, for these data sets. Finally, we discuss ideas and optimizations for reducing the execution time of our algorithm.
Collapse
|
21
|
Abstract
Nearly all colonial marine invertebrates are capable of allorecognition--the ability to distinguish between self and genetically distinct members of the same species. When two or more colonies grow into contact, they either reject each other and compete for the contested space or fuse and form a single, chimeric colony. The specificity of this response is conferred by genetic systems that restrict fusion to self and close kin. Two selective pressures, intraspecific spatial competition between whole colonies and competition between stem cells for access to the germline in fused chimeras, are thought to drive the evolution of extensive polymorphism at invertebrate allorecognition loci. After decades of study, genes controlling allorecognition have been identified in two model systems, the protochordate Botryllus schlosseri and the cnidarian Hydractinia symbiolongicarpus. In both species, allorecognition specificity is determined by highly polymorphic cell-surface molecules, encoded by the fuhc and fester genes in Botryllus, and by the alr1 and alr2 genes in Hydractinia. Here we review allorecognition phenomena in both systems, summarizing recent molecular advances, comparing and contrasting the life history traits that shape the evolution of these distinct allorecognition systems, and highlighting questions that remain open in the field.
Collapse
|
22
|
Qin J, Munyard K, Lee CY, Wetherall JD, Groth DM. Characterization of the sheep Complement Factor B gene (CFB). Vet Immunol Immunopathol 2011; 140:170-4. [PMID: 21163535 DOI: 10.1016/j.vetimm.2010.11.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Revised: 11/04/2010] [Accepted: 11/11/2010] [Indexed: 10/18/2022]
Abstract
The Complement Factor B gene (CFB) of the alternative complement pathway has been identified in the sheep Major Histocompatibility Complex (MHC) and its genomic sequence determined. CFB is located approximately 600 bp upstream of the complement C2 gene, contains 18 exons, and manifests the domain signature characteristic of CFB protein. Thirteen single nucleotide polymorphisms were identified in merino sheep and interbreed variation was identified by comparison with International Sheep Genomics Consortium data. Two predicted non synonymous substitutions were observed and in-silico analysis indicates that these are likely to have a destabilizing effect on the protein structure. Sheep and cattle CFB were compared and shown to contain a common nine nucleotide deletion in exon 18 relative to human CFB. Predicted CFB amino acid sequences for these two species contain 761 aa relative to 764 aa in the human orthologue. Sequencing of the cosmid and BAC clones used in this study permitted the relative positions of three adjacent loci to be determined and showed that the previously described microsatellite locus (BfMs) is located within SKIV2L.
Collapse
Affiliation(s)
- Jinyi Qin
- School of Biomedical Sciences, Curtin Health Innovation Research Institute, Western Australian Biomedical Research Institute, Curtin University, GPO Box U1987, Perth, 6845 Western Australia, Australia
| | | | | | | | | |
Collapse
|
23
|
Czarnecki C, Van Domselaar G, Embreé J, Brunham R, Plummer FA, Luo M. Identification of HLA-G*01:17, a new allele possibly generated by an interloci gene conversion event involving exon 3 of HLA-B. Hum Immunol 2010; 72:238-40. [PMID: 21156191 DOI: 10.1016/j.humimm.2010.12.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Revised: 11/15/2010] [Accepted: 12/06/2010] [Indexed: 11/16/2022]
Abstract
HLA-G*01:17 was discovered in a woman of Kenyan descent who was enrolled in a mother-to-child HIV-1 transmission cohort. The new allele was identical to HLA-G*01:06 at exons 2, 3, and 4 with the exception of a base pair substitution at codon 169 (CAC → CGC) resulting in a coding change from histidine to arginine and codon 171 (TAC → CAC), resulting in turn in a coding change from tyrosine to histidine. The World Health Organization (WHO) Nomenclature Committee has named this allele HLA-G*01:17.
Collapse
Affiliation(s)
- Chris Czarnecki
- National Microbiology Laboratory, Winnipeg, Manitoba, Canada
| | | | | | | | | | | |
Collapse
|
24
|
Characterization of the major histocompatibility complex class II DOB, DPB1, and DQB1 alleles in cynomolgus macaques of Vietnamese origin. Immunogenetics 2010; 63:155-66. [PMID: 21132285 PMCID: PMC7080152 DOI: 10.1007/s00251-010-0498-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2010] [Accepted: 11/16/2010] [Indexed: 12/20/2022]
Abstract
Major histocompatibility complex (MHC) molecules play an important role in the susceptibility and/or resistance to many diseases. To gain an insight into the MHC background and to facilitate the experimental use of cynomolgus macaques, the second exon of the MhcMafa-DOB, -DPB1, and -DQB1 genes from 143 cynomolgus macaques were characterized by cloning to sequencing. A total of 16 Mafa-DOB, 16 Mafa-DPB1, and 34 Mafa-DQB1 alleles were identified, which revealed limited, moderate, and marked allelic polymorphism at DOB, DPB1, and DQB1, respectively, in a cohort of cynomolgus macaques of Vietnamese origin. In addition, 16 Mafa-DOB, 5 Mafa-DPB1, and 8 Mafa-DQB1 alleles represented novel sequences that had not been reported in earlier studies. Almost of the sequences detected at the DOB and DQB1 locus in the present study belonged to DOB*01 (100%) and DQB1*06 (62%) lineages, respectively. Interestingly, four, three, and one high-frequency alleles were detected at Mafa-DOB, -DPB1, and -DQB1, respectively, in this monkeys. The alleles with the highest frequency among these monkeys were Mafa-DOB*010102, Mafa-DPB1*13, and Mafa-DQB1*0616, and these were found in 33 (25.6%) of 129 monkeys, 32 (31.37%) of 102 monkeys, and 30 (31%) of 143 monkeys, respectively. The high-frequency alleles may represent high priority targets for additional characterization of immune function. We also carried out evolutionary and population analyses using these sequences to reveal population-specific alleles. This information will not only promote the understanding of MHC diversity and polymorphism in the cynomolgus macaque but will also increase the value of this species as a model for biomedical research.
Collapse
|
25
|
Aleshin A, Zhi D. Recombination-associated sequence homogenization of neighboring Alu elements: signature of nonallelic gene conversion. Mol Biol Evol 2010; 27:2300-11. [PMID: 20453015 DOI: 10.1093/molbev/msq116] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Recently, researchers have begun to recognize that, in order to establish neutral models for disease association and evolutionary genomics studies, it is crucial to have a clear understanding of the genomic impact of nonallelic gene conversion. Drawing on previous successes in characterizing this phenomenon over protein-coding gene families, we undertook a computational analysis of neighboring Alu sequences in the genome scale. For this purpose, we developed adjusted comutation rate (aCMR), a novel statistical method measuring the excess number of identical point mutations shared by adjacent Alu sequences, vis-à-vis random pairs. Using aCMR, we uncovered a remarkable genome-wide sequence homogenization of neighboring Alus, with the strongest signal observed in the pseudoautosomal regions of the X and Y chromosomes. The magnitude of sequence homogenization between Alu pairs is greater with shorter interlocus distance, higher sequence identity, and parallel orientation. Moreover, shared substitutions show a strong directionality toward GC nucleotides, with multiple substitutions tending to cluster within the Alu sequence. Taken together, these observed recombination-associated sequence homogenization patterns are best explained by frequent ubiquitous gene conversion events between neighboring Alus. We believe that these observations help to illuminate the nature and impact of the enigmatic phenomenon of gene conversion.
Collapse
Affiliation(s)
- Alexey Aleshin
- Department of Medicine, Division of Hematology, Oncology, David Geffen School of Medicine, University of California, Los Angeles, USA
| | | |
Collapse
|
26
|
Temperature and length-dependent modulation of the MH class II beta gene expression in brook charr (Salvelinus fontinalis) by a cis-acting minisatellite. Mol Immunol 2010; 47:1817-29. [PMID: 20381151 DOI: 10.1016/j.molimm.2009.12.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 12/23/2009] [Indexed: 01/19/2023]
Abstract
It is widely recognized that the variation in gene regulation is an important factor from which evolutionary changes in diverse aspects of phenotype can be observed in all organisms. Distinctive elements with functional roles on gene regulation have been identified within the non-coding part of the genome, including repeated elements. Major histocompatibility complex (MHC) genes have been the subject of an abundant literature which made them unique candidates for studies of adaptation in natural populations. Yet, the vast majority of studies on MHC genes have dealt with patterns of polymorphism in sequence variation while very few paid attention to the possible implication of differential expression in adaptive responses. In this paper, we report the identification of a polymorphic minisatellite formed of a 32 nucleotides motif (38% G+C) involved in regulation of the major histocompatibility class II beta gene (MHII beta) of brook charr (Salvelinus fontinalis). Our main objectives were: to analyze the variability of this minisatellite found in the second intron of the MHII beta gene and to document its effect to the variation of expression level of this gene under different environmental conditions. Distinctive number of the minisatellite repeats were associated with each different MHII beta alleles identified from exon 2 sequences. Relative expression levels of specific alleles in heterozygous individuals were determined from fish lymphocytes in different genotypes. We found that alleles carrying the longest minisatellite showed a significant 1.67-2.56-fold reduction in the transcript expression relatively to the shortest one. Results obtained in three different genotypes also indicated that the repressive activity associated to the longest minisatellite was more effective at 18 degrees C compared to 6 degrees C. In contrast, no significant difference was observed in transcript levels between alleles with comparable minisatellite length at both temperatures. We also depicted a significant up-regulation of the total MHII beta transcript at 6 degrees C relative to 18 degrees C. These results reveal for the first time that a temperature-sensitive minisatellite could potentially play an important role in the gene regulation of the adaptive immune response in fishes.
Collapse
|
27
|
Diversity and evolution of MHII β genes in a non-model percid species—The Eurasian perch (Perca fluviatilis L.). Mol Immunol 2009; 46:3399-410. [DOI: 10.1016/j.molimm.2009.07.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2009] [Revised: 07/14/2009] [Accepted: 07/16/2009] [Indexed: 11/20/2022]
|
28
|
Runck AM, Moriyama H, Storz JF. Evolution of duplicated beta-globin genes and the structural basis of hemoglobin isoform differentiation in Mus. Mol Biol Evol 2009; 26:2521-32. [PMID: 19675095 DOI: 10.1093/molbev/msp165] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The functional diversification of multigene families may be strongly influenced by mechanisms of concerted evolution such as interparalog gene conversion. The beta-globin gene family of house mice (genus Mus) represents an especially promising system for evaluating the effects of gene conversion on the functional divergence of duplicated genes. Whereas the majority of mammalian species possess tandemly duplicated copies of the adult beta-globin gene that are identical in sequence, natural populations of house mice are often polymorphic for distinct two-locus haplotypes that differ in levels of functional divergence between duplicated beta-globin genes, HBB-T1 and HBB-T2. Here, we use a phylogenetic approach to unravel the complex evolutionary history of the HBB-T1 and HBB-T2 paralogs in a taxonomically diverse set of species in the genus Mus. The main objectives of this study were 1) to reconstruct the evolutionary history of the different HBB haplotypes of house mice, 2) to assess the role of recombinational exchange between HBB-T1 and HBB-T2 in promoting concerted evolution, 3) to assess the role of recombinational exchange between HBB-T1 and HBB-T2 in creating chimeric genes, and 4) to assess the structural basis of hemoglobin isoform differentiation in species that possess distinct HBB paralogs. Results of our phylogenetic survey revealed that the HBB-T1 and HBB-T2 genes in different species of Mus exhibit the full range of evolutionary outcomes with respect to levels of interparalog divergence. At one end of the spectrum, the two identical HBB paralogs on the Hbb(s) haplotype (shared by Mus domesticus, Mus musculus, and Mus spretus) represent a classic example of concerted evolution. At the other end of the spectrum, the two distinct HBB paralogs on the Hbb(d), Hbb(p), Hbb(w1), and Hbb(w2) haplotypes (shared by multiple species in the subgenus Mus) show no trace of gene conversion and are distinguished by a number of functionally important amino acid substitutions. Because the possession of distinct HBB paralogs expands the repertoire of functionally distinct hemoglobin isoforms that can be synthesized during fetal development and postnatal life, variation in the level of functional divergence between HBB-T1 and HBB-T2 may underlie important physiological variation within and among species.
Collapse
Affiliation(s)
- Amy M Runck
- School of Biological Sciences, University of Nebraska, NE, USA
| | | | | |
Collapse
|
29
|
Niculita-Hirzel H, Labbé J, Kohler A, Le Tacon F, Martin F, Sanders IR, Kües U. Gene organization of the mating type regions in the ectomycorrhizal fungus Laccaria bicolor reveals distinct evolution between the two mating type loci. THE NEW PHYTOLOGIST 2008; 180:329-342. [PMID: 18557817 DOI: 10.1111/j.1469-8137.2008.02525.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
In natural conditions, basidiomycete ectomycorrhizal fungi such as Laccaria bicolor are typically in the dikaryotic state when forming symbioses with trees, meaning that two genetically different individuals have to fuse or 'mate'. Nevertheless, nothing is known about the molecular mechanisms of mating in these ecologically important fungi. Here, advantage was taken of the first sequenced genome of the ectomycorrhizal fungus, Laccaria bicolor, to determine the genes that govern the establishment of cell-type identity and orchestrate mating. The L. bicolor mating type loci were identified through genomic screening. The evolutionary history of the genomic regions that contained them was determined by genome-wide comparison of L. bicolor sequences with those of known tetrapolar and bipolar basidiomycete species, and by phylogenetic reconstruction of gene family history. It is shown that the genes of the two mating type loci, A and B, are conserved across the Agaricales, but they are contained in regions of the genome with different evolutionary histories. The A locus is in a region where the gene order is under strong selection across the Agaricales. By contrast, the B locus is in a region where the gene order is likely under a low selection pressure but where gene duplication, translocation and transposon insertion are frequent.
Collapse
Affiliation(s)
- Hélène Niculita-Hirzel
- Department of Ecology and Evolution, University of Lausanne, Biophore Building, CH-1015 Lausanne, Switzerland
| | - Jessy Labbé
- UMR 1136, Interactions Arbres/Microorganismes, INRA-Nancy, F-54280 Champenoux, France
| | - Annegret Kohler
- UMR 1136, Interactions Arbres/Microorganismes, INRA-Nancy, F-54280 Champenoux, France
| | - François Le Tacon
- UMR 1136, Interactions Arbres/Microorganismes, INRA-Nancy, F-54280 Champenoux, France
| | - Francis Martin
- UMR 1136, Interactions Arbres/Microorganismes, INRA-Nancy, F-54280 Champenoux, France
| | - Ian R Sanders
- Department of Ecology and Evolution, University of Lausanne, Biophore Building, CH-1015 Lausanne, Switzerland
| | - Ursula Kües
- Molecular Wood Biotechnology and Technical Mycology, Büsgen-Institute, Georg-August-University Göttingen, D-37077 Göttingen, Germany
| |
Collapse
|
30
|
Abstract
Our understanding of the details of mammalian meiotic recombination has recently advanced significantly. Sperm typing technologies, linkage studies, and computational inferences from population genetic data have together provided information in unprecedented detail about the location and activity of the sites of crossing-over in mice and humans. The results show that the vast majority of meiotic recombination events are localized to narrow DNA regions (hot spots) that constitute only a small fraction of the genome. The data also suggest that the molecular basis of hot spot activity is unlikely to be strictly determined by specific DNA sequence motifs in cis. Further molecular studies are needed to understand how hot spots originate, function and evolve.
Collapse
Affiliation(s)
- Norman Arnheim
- Molecular and Computational Biology Program, University of Southern California, Los Angeles, CA 90089-2910, USA.
| | | | | |
Collapse
|
31
|
von Salomé J, Kukkonen JP. Sequence features of HLA-DRB1 locus define putative basis for gene conversion and point mutations. BMC Genomics 2008; 9:228. [PMID: 18489735 PMCID: PMC2408603 DOI: 10.1186/1471-2164-9-228] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2007] [Accepted: 05/19/2008] [Indexed: 11/10/2022] Open
Abstract
Background HLA/MHC class II molecules show high degree of polymorphism in the human population. The individual polymorphic motifs have been suggested to be propagated and mixed by transfer of genetic material (recombination, gene conversion) between alleles, but no clear molecular basis for this has been identified as yet. A large number of MHC class II allele sequences is publicly available and could be used to analyze the sequence features behind the recombination, revealing possible basis for such recombination processes both in HLA class II genes and other genes, which recombination acts upon. Results In this study we analyzed the vast dataset of human allelic variants (49 full coding sequences, 374 full exon 2 sequences) of the most polymorphic MHC class II locus, HLA-DRB1, and identified many previously unknown sequence features possibly contributing to the recombination. The CpG-dinucleotide content of exon 2 (containing the antigen-binding sites and subsequently a high degree of polymorphism) was much elevated as compared to the other exons despite similar overall G+C content. Furthermore, the CpG pattern was highly conserved. We also identified more complex, highly conserved sequence motifs in exon 2. Some of these can be identified as putative recombination motifs previously found in other genes, but most are previously unidentified. Conclusion The identified sequence features could putatively act in recombination allowing either less (CpG dinucleotides) or more specific DNA cleavage (complex sequences) or homologous recombination (complex sequences).
Collapse
Affiliation(s)
- Jenny von Salomé
- University of Helsinki, Department of Basic Veterinary Sciences, Helsinki, Finland.
| | | |
Collapse
|
32
|
Shichi D, Matsumori A, Naruse TK, Inoko H, Kimura A. HLA-DPbeta chain may confer the susceptibility to hepatitis C virus-associated hypertrophic cardiomyopathy. Int J Immunogenet 2008; 35:37-43. [PMID: 18186799 DOI: 10.1111/j.1744-313x.2007.00733.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Hypertrophic cardiomyopathy (HCM) is a heart muscle disease characterized by hypertrophy and diastolic dysfunction of cardiac ventricles. It is suggested that one possible aetiology of HCM is the hepatitis C virus (HCV) infection, but molecular mechanisms underlying development of HCV-associated HCM (HCV-HCM) remains unknown. Because the human leucocyte antigen (HLA) molecule is involved in the control of progression/suppression of viral infection, extensive HLA allelic diversity may modulate the post-infectious course of HCV and pathogenesis of HCV-HCM. Here we undertook a case-control study with 38 patients with HCV-HCM and 132 unrelated healthy controls to reveal the potential impact of polymorphisms in seven classical and two non-classical HLA genes on the pathogenesis of HCV-HCM. It was found that DPB1*0401 and DPB1*0901 were significantly associated with increased risk to HCV-HCM in dominant model (P < 0.028, OR = 3.94, 95% confidence interval (CI) = 1.19, 13.02) and in recessive model (P < 0.007, OR = 9.85, 95% CI = 1.83, 53.04), respectively. The disparity in the gene-dose effect by two susceptible DPB1 alleles may be attributable to the difference between the susceptible (36 A and 55 A) and resistant (8L, 9F, 11G, 57E and 76M) residue-combination consisting of DPbeta anchor pocket for antigenic peptide-binding. These results implied that the HLA-DP molecules with specificity pocket appropriate for HCV antigen(s) might confer the progressive process of HCM among the HCV-infected individuals.
Collapse
Affiliation(s)
- D Shichi
- Department of Molecular Pathogenesis, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | | | | | | | | |
Collapse
|
33
|
Chen JM, Cooper DN, Chuzhanova N, Férec C, Patrinos GP. Gene conversion: mechanisms, evolution and human disease. Nat Rev Genet 2007; 8:762-75. [PMID: 17846636 DOI: 10.1038/nrg2193] [Citation(s) in RCA: 449] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Gene conversion, one of the two mechanisms of homologous recombination, involves the unidirectional transfer of genetic material from a 'donor' sequence to a highly homologous 'acceptor'. Considerable progress has been made in understanding the molecular mechanisms that underlie gene conversion, its formative role in human genome evolution and its implications for human inherited disease. Here we assess current thinking about how gene conversion occurs, explore the key part it has played in fashioning extant human genes, and carry out a meta-analysis of gene-conversion events that are known to have caused human genetic disease.
Collapse
|
34
|
Crane CF. Patterned sequence in the transcriptome of vascular plants. BMC Genomics 2007; 8:173. [PMID: 17573970 PMCID: PMC1940011 DOI: 10.1186/1471-2164-8-173] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 06/15/2007] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Microsatellites (repeated subsequences based on motifs of one to six nucleotides) are widely used as codominant genetic markers because of their frequent polymorphism and relative selective neutrality. Minisatellites are repeats of motifs having seven or more nucleotides. The large number of EST sequences now available in public databases offers an opportunity to compare microsatellite and minisatellite properties and evaluate their evolution over a broad range of plant taxa. RESULTS Repeated motifs from one to 250 nucleotides long were identified in 6793306 expressed sequence tags (ESTs) from 88 genera of vascular plants, using a custom data-processing pipeline that allowed limited variation among repeats. The pipeline processed trimmed but otherwise unfiltered sequence and output nonredundant loci of at least 15 nucleotides, with degree of polymorphism and PCR primers wherever possible. Motifs that were an integral multiple of three in length were more abundant and richer in G/C than other motifs. From 80 to 85% of minisatellite motifs represented repeats within proteins, up to the 228-nucleotide repeat of ubiquitin, but not all of these repeats preserved reading frame. The remaining 15 to 20% of minisatellite motifs were associated with transcribed repetitive elements, e.g., retrotransposons. Relative microsatellite motif frequencies did not correlate tightly to phylogenetic relationship. Evolution of increased microsatellite and EST GC content was evident within the grasses. Microsatellites were less frequent in the transcriptome of genera with large genomes, but there was no evidence for greater dilution of the transcriptome with transposable element transcripts in these genera. CONCLUSION The relatively low correlation of microsatellite spectrum to phylogeny suggests that repeat loci evolve more rapidly than the surrounding sequence, although tissue specificity of the different EST libraries is a complicating factor. In-frame motifs are more abundant and higher in GC than frame-shifting motifs, but most EST minisatellite loci appear to represent repeats in translated sequence, regardless of whether reading frame is preserved. Motifs of four to six nucleotides are as polymorphic in EST collections as the commonly used motifs of two and three nucleotides, and they can be exploited as genetic markers with little additional effort.
Collapse
Affiliation(s)
- Charles F Crane
- Agricultural Research Service, United States Department of Agriculture, and Department of Botany and Plant Pathology, Purdue University, 915 W. State St, West Lafayette, Indiana 47907-2054, USA.
| |
Collapse
|
35
|
Mayer F, Brunner A. Non-neutral evolution of the major histocompatibility complex class II gene DRB1 in the sac-winged bat Saccopteryx bilineata. Heredity (Edinb) 2007; 99:257-64. [PMID: 17519971 PMCID: PMC7094720 DOI: 10.1038/sj.hdy.6800989] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The immune genes of the major histocompatibility complex (MHC) are classical examples for high levels of genetic diversity and non-neutral evolution. This is particularly true for the regions containing the antigen-binding sites as, for instance, in the exon 2 of the MHC class II gene DRB. We surveyed, for the first time in the order Chiroptera, the genetic diversity within this exon in the sac-winged bat Saccopteryx bilineata. We detected 11 alleles among 85 bats, of which 79 were sampled in one population. Pairwise comparisons revealed that interallelic sequence differences ranged between 3 and 22%, although nucleotide substitutions were not evenly distributed along the exon sequence. This was most probably the result of intragenic recombination. High levels of sequence divergence and significantly more nonsynonymous than synonymous substitutions (dN/dS>1) suggest long-term balancing selection. Thus, the data are consistent with the hypothesis that recombination gives rise to new alleles at the DRB locus of the sac-winged bat, and these are maintained in the population through balancing selection. In this respect, the sac-winged bat closely resembles other mammalian species.
Collapse
Affiliation(s)
- F Mayer
- Department of Zoology, University of Erlangen, Erlangen, Germany.
| | | |
Collapse
|
36
|
von Salomé J, Gyllensten U, Bergström TF. Full-length sequence analysis of the HLA-DRB1 locus suggests a recent origin of alleles. Immunogenetics 2007; 59:261-71. [PMID: 17345114 DOI: 10.1007/s00251-007-0196-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2006] [Accepted: 01/15/2007] [Indexed: 11/29/2022]
Abstract
The HLA region harbors some of the most polymorphic loci in the human genome. Among them is the class II locus HLA-DRB1, with more than 400 known alleles. The age of the polymorphism and the rate at which new alleles are generated at HLA loci has caused much controversy over the years. Previous studies have mostly been restricted to the 270 base pairs that constitute the second exon and represent the most variable part of the gene. Here, we investigate the evolutionary history of the HLA-DRB1 locus on the basis of an analysis of 15 genomic full-length alleles (10-15 kb). In addition, the variation in 49 complete coding sequences and 322 exon 2 sequences were analyzed. When excluding exon 2 from the analysis, the diversity at the synonymous sites was found to be similar to the intron diversity. The overall diversity in noncoding region was also similar to the genome average. The DRB1*03 lineage has been found in human, chimpanzee, bonobo, gorilla, and orangutan. An ancestral "proto HLA-DRB1*03 lineage" appeared to have diverged in the last 5 million years into the human-specific lineages *08, *11, *13, and *14. With exception to exon 2, both the coding- and the noncoding diversity suggests a recent origin (<1 million years ago) for most of the alleles at the HLA-DRB1 locus. Sites encoding for amino acids involved in antigen binding [antigen recognizing sites (ARS)] appear to have a more ancient origin. Taken together, the recent origin of most alleles, the high diversity between allelic lineages, and the ancient origin of sequence motifs in exon 2, is consistent with a relatively rapid generation of novel alleles by gene conversion like events.
Collapse
Affiliation(s)
- Jenny von Salomé
- Department of Genetics and Pathology, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden
| | | | | |
Collapse
|
37
|
Hasselmann M, Beye M. Pronounced differences of recombination activity at the sex determination locus of the honeybee, a locus under strong balancing selection. Genetics 2006; 174:1469-80. [PMID: 16951061 PMCID: PMC1667079 DOI: 10.1534/genetics.106.062018] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recombination decreases the association of linked nucleotide sites and can influence levels of polymorphism in natural populations. When coupled with selection, recombination may relax potential conflict among linked genes, a concept that has played a central role in research on the evolution of recombination. The sex determination locus (SDL) of the honeybee is an informative example for exploring the combined forces of recombination, selection, and linkage on sequence evolution. Balancing selection at SDL is very strong and homozygous individuals at SDL are eliminated by worker bees. The recombination rate is increased up to four times that of the genomewide average in the region surrounding SDL. Analysis of nucleotide diversity (pi) reveals a sevenfold increase of polymorphism within the sex determination gene complementary sex determiner (csd) that rapidly declines within 45 kb to levels of genomewide estimates. Although no recombination was observed within SDL, which contains csd, analyses of heterogeneity, shared polymorphic sites, and linkage disequilibrium (LD) show that recombination has contributed to the evolution of the 5' part of some csd sequences. Gene conversion, however, has not obviously contributed to the evolution of csd sequences. The local control of recombination appears to be related to SDL function and mode of selection. The homogenizing force of recombination is reduced within SDL, which preserves allelic differences and specificity, while the increase of recombination activity around SDL relaxes conflict between SDL and linked genes.
Collapse
Affiliation(s)
- Martin Hasselmann
- Heinrich Heine Universität Düsseldorf, Institut für Genetik, 40225 Düsseldorf, Germany.
| | | |
Collapse
|
38
|
Hallast P, Nagirnaja L, Margus T, Laan M. Segmental duplications and gene conversion: Human luteinizing hormone/chorionic gonadotropin beta gene cluster. Genome Res 2006; 15:1535-46. [PMID: 16251463 PMCID: PMC1310641 DOI: 10.1101/gr.4270505] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Segmental duplicons (>1 kb) of high sequence similarity (>90%) covering >5% of the human genome are characterized by complex sequence variation. Apart from a few well-characterized regions (MHC, beta-globin), the diversity and linkage disequilibrium (LD) patterns of duplicons and the role of gene conversion in shaping them have been poorly studied. To shed light on these issues, we have re-sequenced the human Luteinizing Hormone/Chorionic Gonadotropin beta (LHB/CGB) cluster (19q13.32) of three population samples (Estonians, Mandenka, and Han). The LHB/CGB cluster consists of seven duplicated genes critical in human reproduction. In the LHB/CGB region, high sequence diversity, concentration of gene-conversion acceptor sites, and strong LD colocalize with peripheral genes, whereas central loci are characterized by lower variation, gene-conversion donor activity, and breakdown of LD between close markers. The data highlight an important role of gene conversion in spreading polymorphisms among duplicon copies and generating LD around them. The directionality of gene-conversion events seems to be determined by the localization of a predicted recombination "hotspot" and "warm spot" in the vicinity of the most active acceptor genes at the periphery of the cluster. The data suggest that enriched crossover activity in direct and inverted segmental repeats is in accordance with the formation of palindromic secondary structures promoting double-strand breaks rather than fixed DNA sequence motifs. Also, this first detailed coverage of sequence diversity and structure of the LHB/CGB gene cluster will pave the way for studying the identified polymorphisms as well as potential genomic rearrangements in association with an individual's reproductive success.
Collapse
Affiliation(s)
- Pille Hallast
- Institute of Molecular and Cell Biology, University of Tartu, Riia 23, 51010 Tartu, Estonia
| | | | | | | |
Collapse
|
39
|
Doxiadis GGM, Rouweler AJM, de Groot NG, Louwerse A, Otting N, Verschoor EJ, Bontrop RE. Extensive sharing of MHC class II alleles between rhesus and cynomolgus macaques. Immunogenetics 2006; 58:259-68. [PMID: 16470376 DOI: 10.1007/s00251-006-0083-8] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2005] [Accepted: 12/27/2005] [Indexed: 10/25/2022]
Abstract
In contrast to rhesus monkeys, substantial knowledge on cynomolgus monkey major histocompatibility complex (MHC) class II haplotypes is lacking. Therefore, 17 animals, including one pedigreed family, were thoroughly characterized for polymorphic Mhc class II region genes as well as their mitochondrial DNA (mtDNA) sequences. Different cynomolgus macaque populations appear to exhibit unique mtDNA profiles reflecting their geographic origin. Within the present panel, 10 Mafa-DPB1, 14 Mafa-DQA1, 12 Mafa-DQB1, and 35 Mafa-DRB exon 2 sequences were identified. All of these alleles cluster into lineages that were previously described for rhesus macaques. Moreover, about half of the Mafa-DPB1, Mafa-DQA1, and Mafa-DQB1 alleles and one third of the Mafa-DRB exon 2 sequences are identical to rhesus macaque orthologues. Such a high level of Mhc class II allele sharing has not been reported for primate species. Pedigree analysis allowed the characterization of nine distinct Mafa class II haplotypes, and seven additional ones could be deduced. Two of these haplotypes harbor a duplication of the Mafa-DQB1 locus. Despite extensive allele sharing, rhesus and cynomolgus monkeys do not appear to possess identical Mhc class II haplotypes, thus illustrating that new haplotypes were generated after speciation by recombination-like processes.
Collapse
Affiliation(s)
- Gaby G M Doxiadis
- Department of Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, The Netherlands.
| | | | | | | | | | | | | |
Collapse
|
40
|
Abstract
Until around 1990, most multigene families were thought to be subject to concerted evolution, in which all member genes of a family evolve as a unit in concert. However, phylogenetic analysis of MHC and other immune system genes showed a quite different evolutionary pattern, and a new model called birth-and-death evolution was proposed. In this model, new genes are created by gene duplication and some duplicate genes stay in the genome for a long time, whereas others are inactivated or deleted from the genome. Later investigations have shown that most non-rRNA genes including highly conserved histone or ubiquitin genes are subject to this type of evolution. However, the controversy over the two models is still continuing because the distinction between the two models becomes difficult when sequence differences are small. Unlike concerted evolution, the model of birth-and-death evolution can give some insights into the origins of new genetic systems or new phenotypic characters.
Collapse
Affiliation(s)
- Masatoshi Nei
- Institute of Molecular Evolutionary Genetics and1 Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802;
| | - Alejandro P. Rooney
- U.S. Department of Agriculture, Agricultural Research Service, National Center for Agricultural Utilization Research, Peoria, Illinois 61604;
| |
Collapse
|
41
|
Reusch TBH, Langefors A. Inter- and Intralocus Recombination Drive MHC Class IIB Gene Diversification in a Teleost, the Three-Spined Stickleback Gasterosteus aculeatus. J Mol Evol 2005; 61:531-41. [PMID: 16132469 DOI: 10.1007/s00239-004-0340-0] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2004] [Accepted: 05/16/2005] [Indexed: 10/25/2022]
Abstract
The mutational mechanism underlying the striking diversity in MHC (major histocompatibility complex) genes in vertebrates is still controversial. In order to evaluate the role of inter- and intragenic recombination in MHC gene diversification, we examined patterns of nucleotide polymorphism across an exon/intron boundary in a sample of 31 MHC class IIB sequences of three-spined stickleback (Gasterosteus aculeatus). MHC class IIB genes of G. aculeatus were previously shown to be under diversifying (positive) selection in mate choice and pathogen selection experiments. Based on recoding of alignment gaps, complete intron 2 sequences were grouped into three clusters using maximum-parsimony analysis. Two of these groups had >90% bootstrap support and were tentatively assigned single locus status. Intron nucleotide diversity within and among loci was low (p-distance within and among groups = 0.016 and 0.019, respectively) and fourfold lower than the rate of silent mutations in exon 2, suggesting that noncoding regions are homogenized by frequent interlocus recombination. A substitution analysis using GENECONV: revealed as many intergenic conversion events as intragenic ones. Recombination between loci may explain the occurrence of sequence variants that are particularly divergent, as is the case in three-spined stickleback, with nucleotide diversity attaining dN = 0.39 (peptide-binding residues only). For both MHC class II loci we also estimated the amount of intragenic recombination as population rate (4N(e)r) under the coalescent and found it to be approximately three times higher compared to point mutations (Watterson estimate per gene, 4N(e)mu). Nonindependence of molecular evolution across loci and frequent recombination suggest that MHC class II genes of bony fish may follow different evolutionary dynamics than those of mammals. Our finding of widespread recombination suggests that phylogenies of MHC genes should not be based on coding segments but rather on noncoding introns.
Collapse
Affiliation(s)
- Thorsten B H Reusch
- Department of Evolutionary Ecology, Max-Planck-Institut für Limnologie, August-Thienemann-Str. 2, 24306 Plön, Germany.
| | | |
Collapse
|
42
|
Jiang Z, Zhang X, Deka R, Jin L. Genome amplification of single sperm using multiple displacement amplification. Nucleic Acids Res 2005; 33:e91. [PMID: 15942023 PMCID: PMC1143700 DOI: 10.1093/nar/gni089] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Sperm typing is an effective way to study recombination rate on a fine scale in regions of interest. There are two strategies for the amplification of single meiotic recombinants: repulsion-phase allele-specific PCR and whole genome amplification (WGA). The former can selectively amplify single recombinant molecules from a batch of sperm but is not scalable for high-throughput operation. Currently, primer extension pre-amplification is the only method used in WGA of single sperm, whereas it has limited capacity to produce high-coverage products enough for the analysis of local recombination rate in multiple large regions. Here, we applied for the first time a recently developed WGA method, multiple displacement amplification (MDA), to amplify single sperm DNA, and demonstrated its great potential for producing high-yield and high-coverage products. In a 50 mul reaction, 76 or 93% of loci can be amplified at least 2500- or 250-fold, respectively, from single sperm DNA, and second-round MDA can further offer >200-fold amplification. The MDA products are usable for a variety of genetic applications, including sequencing and microsatellite marker and single nucleotide polymorphism (SNP) analysis. The use of MDA in single sperm amplification may open a new era for studies on local recombination rates.
Collapse
Affiliation(s)
- Zhengwen Jiang
- Department of Environmental Health, Center for Genome Information, University of Cincinnati College of Medicine3223 Eden Ave, Cincinnati, OH 45267, USA
| | - Xingqi Zhang
- Department of Obstetrics and Gynecology, Northwestern University Medical SchoolChicago, IL, USA
| | - Ranjan Deka
- Department of Environmental Health, Center for Genome Information, University of Cincinnati College of Medicine3223 Eden Ave, Cincinnati, OH 45267, USA
| | - Li Jin
- Department of Environmental Health, Center for Genome Information, University of Cincinnati College of Medicine3223 Eden Ave, Cincinnati, OH 45267, USA
- State Key Laboratory of Genetic Engineering and Center for Anthropological Studies, School of Life Sciences and Morgan-Tan International Center for Life Sciences, Fudan UniversityShanghai, China
- To whom correspondence should be addressed at Liren Biology Building, Room 220, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China. Tel: +86 21 65642800; Fax: +86 21 55664388;
| |
Collapse
|
43
|
Abstract
We introduce a new method for jointly estimating crossing-over and gene conversion rates using sequence polymorphism data. The method calculates probabilities for subsets of the data consisting of three segregating sites and then forms a composite likelihood by multiplying together the probabilities of many subsets. Simulations show that this new method performs better than previously proposed methods for estimating gene conversion rates, but that all methods require large amounts of data to provide reliable estimates. While existing methods can easily estimate an "average" gene conversion rate over many loci, they cannot reliably estimate gene conversion rates for a single region of the genome.
Collapse
Affiliation(s)
- Jeffrey D Wall
- Program in Molecular and Computational Biology, University of Southern California, Los Angeles, California 90089, USA.
| |
Collapse
|
44
|
|
45
|
Padhukasahasram B, Marjoram P, Nordborg M. Estimating the rate of gene conversion on human chromosome 21. Am J Hum Genet 2004; 75:386-97. [PMID: 15250027 PMCID: PMC1182018 DOI: 10.1086/423451] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2004] [Accepted: 06/15/2004] [Indexed: 11/03/2022] Open
Abstract
There is a growing recognition that gene conversion can be an important factor in shaping fine-scale patterns of linkage disequilibrium in the human genome. We devised simple multilocus summary statistics for estimating gene-conversion rates from genomewide polymorphism data sets. In addition to being computationally feasible for very large data sets, these summaries were designed to yield robust estimates of gene-conversion rates in the presence of variation in crossing-over rates. Using our summaries, we analyzed 21,840 biallelic single-nucleotide polymorphisms (SNPs) on human chromosome 21. Our results indicate that models including both crossing over and gene conversion fit the overall short-range data (0-5 kb) of chromosome 21 much better than do models including crossing over alone. The estimated ratio of gene-conversion rate to crossing-over rate has a range of 1.6-9.4, depending on the assumed conversion tract length (in the range of 500-50 bp). Removal of the 5,696 SNPs that occur in known mutational hotspots (CpG sites) did not significantly change our conclusions, suggesting that recurrent mutations alone cannot explain our data.
Collapse
Affiliation(s)
- Badri Padhukasahasram
- Program in Molecular and Computational Biology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089-1340, USA.
| | | | | |
Collapse
|
46
|
Han M, Tan YQ, Zhang Y, Tsai J, Vorhaben R, Moraes JR, Moraes ME, Stastny P. Multiplex single nucleotide extension: a robust and high throughput method for HLA-A locus typing. Hum Immunol 2004; 64:1111-22. [PMID: 14630394 DOI: 10.1016/j.humimm.2003.08.358] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This report describes a typing method that can identify all known human leukocyte antigen A (HLA-A) alleles by determining nucleotides present at polymorphic sites using single nucleotide extension. Allele specific primers are bound to capture oligonucleotides which allows for a multiplex approach during single nucleotide extension (SNE) reactions. Eleven group-specific polymerase chain reaction amplifications were performed to obtain the templates to be analyzed with sets of primers designed to investigate the polymorphisms. Extension of biotin-labeled ddNTPs onto allele-specific primers was catalyzed by a DNA polymerase and each primer was hybridized to a specific capture oligonucleotide covalently bound to a bead. After staining with streptavidin-PE, incorporated fluorescence was determined with a flow cytometer. Fluorescence intensities were interpreted by computer and the nucleotide sequence was translated into HLA-A genotypes. Group-specific amplification reactions and primer sets for SNE were validated with 42 reference samples of known HLA-A alleles. In addition, 296 samples from three populations (N. A. Caucasian, African-American, Terena S. A. Indian) were analyzed and results compared to previous typing by SSOP. Reproducibility between repeated typings was 100% and ambiguities were quite rare. The method has been found to be accurate, relatively simple to perform and fast. It is our method of choice for high resolution clinical HLA-A typing.
Collapse
Affiliation(s)
- Mei Han
- Department of Internal Medicine, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390-8886, USA
| | | | | | | | | | | | | | | |
Collapse
|
47
|
Bosch E, Hurles ME, Navarro A, Jobling MA. Dynamics of a human interparalog gene conversion hotspot. Genome Res 2004; 14:835-44. [PMID: 15123583 PMCID: PMC479110 DOI: 10.1101/gr.2177404] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2003] [Accepted: 02/16/2004] [Indexed: 11/25/2022]
Abstract
Gene conversion between paralogs can alter their patterns of sequence identity, thus obscuring their evolutionary relationships and affecting their propensity to sponsor genomic rearrangements. The details of this important process are poorly understood in the human genome because allelic diversity complicates the interpretation of interparalog sequence differences. Here we exploit the haploid nature of the Y chromosome, which obviates complicating interallelic processes, together with its known phylogeny, to understand the dynamics of conversion between two directly repeated HERVs flanking the 780-kb AZFa region on Yq. Sequence analysis of a 787-bp segment of each of the HERVs in 36 Y chromosomes revealed one of the highest nucleotide diversities in the human genome, as well as evidence of a complex patchwork of highly directional gene conversion events. The rate of proximal-to-distal conversion events was estimated as 2.4 x 10(-4) to 1.2 x 10(-3) per generation (3.9 x 10(-7) to 1.9 x 10(-6) per base per generation), and the distal-to-proximal rate as about one-twentieth of this. Minimum observed conversion tract lengths ranged from 1 to 158 bp and maximum lengths from 19 to 1365 bp, with an estimated mean of 31 bp. Analysis of great ape homologs shows that conversion in this hotspot has a deep evolutionary history.
Collapse
Affiliation(s)
- Elena Bosch
- Department of Genetics, University of Leicester, Leicester LE1 7RH, UK
| | | | | | | |
Collapse
|
48
|
Carrington M, Cullen M. Justified chauvinism: advances in defining meiotic recombination through sperm typing. Trends Genet 2004; 20:196-205. [PMID: 15041174 DOI: 10.1016/j.tig.2004.02.006] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Sperm typing offers an efficient means of studying the quantitative and qualitative aspects of meiotic recombination that are virtually unapproachable by pedigree analysis. Since the initial development of the technique >10 years ago, several salient findings based on empirically derived recombination data have been described. The precise rates and distributions of recombination have been reported for specific regions of the genome, serving as the prototype for high-resolution genome-wide recombination patterns. Identification and characterization of molecular genetic events, such as unequal crossing over, gene conversion and crossover asymmetry, are under close inspection for the first time as a result of this technology. The influence of these phenomena on the evolution of the genome is of primary interest from a scientific and medical perspective. In this article, we review the novel discoveries in mammalian meiotic recombination that have been revealed through sperm typing.
Collapse
Affiliation(s)
- Mary Carrington
- Laboratory of Genomic Diversity, National Cancer Institute-Frederick, National Institutes of Health, SAIC-Frederick, MD 21702, USA.
| | | |
Collapse
|
49
|
|
50
|
Jeffreys AJ, May CA. Intense and highly localized gene conversion activity in human meiotic crossover hot spots. Nat Genet 2004; 36:151-6. [PMID: 14704667 DOI: 10.1038/ng1287] [Citation(s) in RCA: 228] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2003] [Accepted: 12/04/2003] [Indexed: 11/09/2022]
Abstract
Meiotic gene conversion has an important role in allele diversification and in the homogenization of gene and other repeat DNA sequence families, sometimes with pathological consequences. But little is known about the dynamics of gene conversion in humans and its relationship to meiotic crossover. We therefore developed screening and selection methods to characterize sperm conversions in two meiotic crossover hot spots in the major histocompatibility complex (MHC) and one in the sex chromosomal pseudoautosomal pairing region PAR1 (ref. 9). All three hot spots are active in gene conversion and crossover. Conversion tracts are short and define a steep bidirectional gradient centered at the peak of crossover activity, consistent with crossovers and conversions being produced by the same recombination-initiating events. These initiations seem to be spread over a narrow zone, rather than occurring at a single site, and seem preferentially to yield conversions rather than crossovers. Crossover breakpoints are more broadly diffused than conversion breakpoints, suggesting either differences between conversion and crossover processing after initiation or the existence of a quality control checkpoint at which short interactions between homologous chromosomes are preferentially aborted as conversions.
Collapse
Affiliation(s)
- Alec J Jeffreys
- Department of Genetics, University of Leicester, Leicester LE1 7RH, UK.
| | | |
Collapse
|