1
|
Assis R, Conant G, Holland B, Liberles DA, O'Reilly MM, Wilson AE. Models for the retention of duplicate genes and their biological underpinnings. F1000Res 2024; 12:1400. [PMID: 38173826 PMCID: PMC10762295 DOI: 10.12688/f1000research.141786.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/08/2024] [Indexed: 01/05/2024] Open
Abstract
Gene content in genomes changes through several different processes, with gene duplication being an important contributor to such changes. Gene duplication occurs over a range of scales from individual genes to whole genomes, and the dynamics of this process can be context dependent. Still, there are rules by which genes are retained or lost from genomes after duplication, and probabilistic modeling has enabled characterization of these rules, including their context-dependence. Here, we describe the biology and corresponding mathematical models that are used to understand duplicate gene retention and its contribution to the set of biochemical functions encoded in a genome.
Collapse
Affiliation(s)
- Raquel Assis
- Florida Atlantic University, Boca Raton, Florida, USA
| | - Gavin Conant
- North Carolina State University, Raleigh, North Carolina, USA
| | | | | | | | | |
Collapse
|
2
|
Ma F, Lau CY, Zheng C. Young duplicate genes show developmental stage- and cell type-specific expression and function in Caenorhabditis elegans. CELL GENOMICS 2024; 4:100467. [PMID: 38190105 PMCID: PMC10794840 DOI: 10.1016/j.xgen.2023.100467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 10/06/2023] [Accepted: 11/23/2023] [Indexed: 01/09/2024]
Abstract
Gene duplication produces the material that fuels evolutionary innovation. The "out-of-testis" hypothesis suggests that sperm competition creates selective pressure encouraging the emergence of new genes in male germline, but the somatic expression and function of the newly evolved genes are not well understood. We systematically mapped the expression of young duplicate genes throughout development in Caenorhabditis elegans using both whole-organism and single-cell transcriptomic data. Based on the expression dynamics across developmental stages, young duplicate genes fall into three clusters that are preferentially expressed in early embryos, mid-stage embryos, and late-stage larvae. Early embryonic genes are involved in protein degradation and develop essentiality comparable to the genomic average. In mid-to-late embryos and L4-stage larvae, young genes are enriched in intestine, epidermal cells, coelomocytes, and amphid chemosensory neurons. Their molecular functions and inducible expression indicate potential roles in innate immune response and chemosensory perceptions, which may contribute to adaptation outside of the sperm.
Collapse
Affiliation(s)
- Fuqiang Ma
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Chun Yin Lau
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Chaogu Zheng
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China.
| |
Collapse
|
3
|
Liang X, Taylor M, Napier-Jameson R, Calovich-Benne C, Norris A. A Conserved Role for Stomatin Domain Genes in Olfactory Behavior. eNeuro 2023; 10:ENEURO.0457-22.2023. [PMID: 36858824 PMCID: PMC10035767 DOI: 10.1523/eneuro.0457-22.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 02/15/2023] [Accepted: 02/23/2023] [Indexed: 03/03/2023] Open
Abstract
The highly-conserved stomatin domain has been identified in genes throughout all classes of life. In animals, different stomatin domain-encoding genes have been implicated in the function of the kidney, red blood cells, and specific neuron types, although the underlying mechanisms remain unresolved. In one well-studied example of stomatin domain gene function, the Caenorhabditis elegans gene mec-2 and its mouse homolog Stoml3 are required for the function of mechanosensory neurons, where they modulate the activity of mechanosensory ion channels on the plasma membrane. Here, we identify an additional shared function for mec-2 and Stoml3 in a very different sensory context, that of olfaction. In worms, we find that a subset of stomatin domain genes are expressed in olfactory neurons, but only mec-2 is strongly required for olfactory behavior. mec-2 acts cell-autonomously and multiple alternatively-spliced isoforms of mec-2 can be substituted for each other. We generate a Stoml3 knock-out (KO) mouse and demonstrate that, like its worm homolog mec-2, it is required for olfactory behavior. In mice, Stoml3 is not required for odor detection, but is required for odor discrimination. Therefore, in addition to their shared roles in mechanosensory behavior, mec-2 and Stoml3 also have a shared role in olfactory behavior.
Collapse
Affiliation(s)
- Xiaoyu Liang
- Department of Biological Sciences, Southern Methodist University, Dallas, Texas 75275
| | - Morgan Taylor
- Department of Biological Sciences, Southern Methodist University, Dallas, Texas 75275
| | | | - Canyon Calovich-Benne
- Department of Biological Sciences, Southern Methodist University, Dallas, Texas 75275
| | - Adam Norris
- Department of Biological Sciences, Southern Methodist University, Dallas, Texas 75275
| |
Collapse
|
4
|
Bonthala VS, Stich B. Genetic Divergence of Lineage-Specific Tandemly Duplicated Gene Clusters in Four Diploid Potato Genotypes. FRONTIERS IN PLANT SCIENCE 2022; 13:875202. [PMID: 35645998 PMCID: PMC9131075 DOI: 10.3389/fpls.2022.875202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Accepted: 04/20/2022] [Indexed: 05/04/2023]
Abstract
Potato (Solanum tuberosum L.) is the most important non-grain food crop. Tandem duplication significantly contributes to genome evolution. The objectives of this study were to (i) identify tandemly duplicated genes and compare their genomic distributions across potato genotypes, (ii) investigate the bias in functional specificities, (iii) explore the relationships among coding sequence, promoter and expression divergences associated with tandemly duplicated genes, (iv) examine the role of tandem duplication in generating and expanding lineage-specific gene families, (v) investigate the evolutionary forces affecting tandemly duplicated genes, and (vi) assess the similarities and differences with respect to above mentioned aspects between cultivated genotypes and their wild-relative. In this study, we used well-annotated and chromosome-scale de novo genome assemblies of multiple potato genotypes. Our results showed that tandemly duplicated genes are abundant and dispersed through the genome. We found that several functional specificities, such as disease resistance, stress-tolerance, and biosynthetic pathways of tandemly duplicated genes were differentially enriched across multiple potato genomes. Our results indicated the existence of a significant correlation among expression, promoter, and protein divergences in tandemly duplicated genes. We found about one fourth of tandemly duplicated gene clusters as lineage-specific among multiple potato genomes, and these tended to localize toward centromeres and revealed distinct selection signatures and expression patterns. Furthermore, our results showed that a majority of duplicated genes were retained through sub-functionalization followed by genetic redundancy, while only a small fraction of duplicated genes was retained though neo-functionalization. The lineage-specific expansion of gene families by tandem duplication coupled with functional bias might have significantly contributed to potato's genotypic diversity, and, thus, to adaption to environmental stimuli.
Collapse
Affiliation(s)
- Venkata Suresh Bonthala
- Institute of Quantitative Genetics and Genomics of Plants, Heinrich Heine University of Düsseldorf, Düsseldorf, Germany
- *Correspondence: Venkata Suresh Bonthala,
| | - Benjamin Stich
- Max Planck Institute for Plant Breeding Research, Köln, Germany
- Cluster of Excellence on Plant Sciences, From Complex Traits Towards Synthetic Modules, Düsseldorf, Germany
| |
Collapse
|
5
|
Zhang X, Hu Y, Smith DR. HSDFinder: A BLAST-Based Strategy for Identifying Highly Similar Duplicated Genes in Eukaryotic Genomes. FRONTIERS IN BIOINFORMATICS 2021; 1:803176. [PMID: 36303740 PMCID: PMC9580922 DOI: 10.3389/fbinf.2021.803176] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 11/25/2021] [Indexed: 01/01/2023] Open
Abstract
Gene duplication is an important evolutionary mechanism capable of providing new genetic material for adaptive and nonadaptive evolution. However, bioinformatics tools for identifying duplicate genes are often limited to the detection of paralogs in multiple species or to specific types of gene duplicates, such as retrocopies. Here, we present a user-friendly, BLAST-based web tool, called HSDFinder, which can identify, annotate, categorize, and visualize highly similar duplicate genes (HSDs) in eukaryotic nuclear genomes. HSDFinder includes an online heatmap plotting option, allowing users to compare HSDs among different species and visualize the results in different Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway functional categories. The external software requirements are BLAST, InterProScan, and KEGG. The utility of HSDFinder was tested on various model eukaryotic species, including Chlamydomonas reinhardtii, Arabidopsis thaliana, Oryza sativa, and Zea mays as well as the psychrophilic green alga Chlamydomonas sp. UWO241, and was proven to be a practical and accurate tool for gene duplication analyses. The web tool is free to use at http://hsdfinder.com. Documentation and tutorials can be found via the GitHub: https://github.com/zx0223winner/HSDFinder.
Collapse
Affiliation(s)
- Xi Zhang
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada
- Institute for Comparative Genomics, Dalhousie University, Halifax, NS, Canada
- *Correspondence: Xi Zhang, ; David Roy Smith,
| | - Yining Hu
- Department of Computer Science, Western University, London, ON, Canada
| | - David Roy Smith
- Department of Biology, Western University, London, ON, Canada
- *Correspondence: Xi Zhang, ; David Roy Smith,
| |
Collapse
|
6
|
Morgenthaler AB, Fritts RK, Copley SD. Amplicon remodeling and genomic mutations drive population dynamics after segmental amplification. Mol Biol Evol 2021; 39:6377342. [PMID: 34581806 PMCID: PMC8763031 DOI: 10.1093/molbev/msab289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
New enzymes often evolve by duplication and divergence of genes encoding enzymes with promiscuous activities that have become important in the face of environmental opportunities or challenges. Amplifications that increase the copy number of the gene under selection commonly amplify many surrounding genes. Extra copies of these coamplified genes must be removed, either during or after evolution of a new enzyme. Here we report that amplicon remodeling can begin even before mutations occur in the gene under selection. Amplicon remodeling and mutations elsewhere in the genome that indirectly increase fitness result in complex population dynamics, leading to emergence of clones that have improved fitness by different mechanisms. In this work, one of the two most successful clones had undergone two episodes of amplicon remodeling, leaving only four coamplified genes surrounding the gene under selection. Amplicon remodeling in the other clone resulted in removal of 111 genes from the genome, an acceptable solution under these selection conditions, but one that would certainly impair fitness under other environmental conditions.
Collapse
Affiliation(s)
- Andrew B Morgenthaler
- Department of Molecular, Cellular and Developmental Biology and Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO, 80309
| | - Ryan K Fritts
- Department of Molecular, Cellular and Developmental Biology and Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO, 80309
| | - Shelley D Copley
- Department of Molecular, Cellular and Developmental Biology and Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO, 80309
| |
Collapse
|
7
|
Fanelli E, Troccoli A, Tarasco E, De Luca F. Molecular Characterization and Functional Analysis of the Hb-hsp90-1 Gene in Relation to Temperature Changes in Heterorhabditis bacteriophora. Front Physiol 2021; 12:615653. [PMID: 33732162 PMCID: PMC7959791 DOI: 10.3389/fphys.2021.615653] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 01/26/2021] [Indexed: 12/17/2022] Open
Abstract
Understanding how entomopathogenic nematodes respond to temperature changes and have adapted to the local environment is crucial to improve their potential as biocontrol agents. In order to improve understanding of Heterorhabditis bacteriophora's potential adaptability to future climate changes, full-length cDNA and the corresponding gene of heat shock protein 90 (Hsp90) were isolated and fully characterized. The reproductive potential of the Apulian strain of H. bacteriophora increased when the temperature rose from 23 to 30°C, but no reproduction was found at 12°C. Expression analyses revealed that Hb-hsp90-1 was differentially expressed in Infective Juveniles (IJs) and adults (hermaphrodites, females and males). Up-regulation of Hb-hsp90-1 was higher during the recovery process in Galleria mellonella larvae than adults, thus confirming the protective role of Hb-hsp90-1 in coping with the host environment. Silencing of Hb-hsp90-1 resulted in a significant reduction (76%) in the expression level. Silenced IJs took longer than untreated nematodes to infect G. mellonella, showing that Hb-hsp90-1 could be also involved in chemosensation. Furthermore, the number of adults and IJs recovered from G. mellonella infected with silenced nematodes and incubated at 30°C was higher than that obtained from G. mellonella infected with untreated nematodes. These data confirm the crucial role of Hb-hsp90-1 allowing acclimation to increased temperatures and modulation of the recovery process.
Collapse
Affiliation(s)
- Elena Fanelli
- Institute for Sustainable Plant Protection-CNR, Bari, Italy
| | | | - Eustachio Tarasco
- Institute for Sustainable Plant Protection-CNR, Bari, Italy
- Section of Entomology and Zoology, Department of Soil, Plant and Food Sciences, University of Bari “A. Moro”, Bari, Italy
| | | |
Collapse
|
8
|
Copley SD. Evolution of new enzymes by gene duplication and divergence. FEBS J 2021; 287:1262-1283. [PMID: 32250558 DOI: 10.1111/febs.15299] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Revised: 03/13/2020] [Accepted: 03/17/2020] [Indexed: 12/22/2022]
Abstract
Thousands of new metabolic and regulatory enzymes have evolved by gene duplication and divergence since the dawn of life. New enzyme activities often originate from promiscuous secondary activities that have become important for fitness due to a change in the environment or a mutation. Mutations that make a promiscuous activity physiologically relevant can occur in the gene encoding the promiscuous enzyme itself, but can also occur elsewhere, resulting in increased expression of the enzyme or decreased competition between the native and novel substrates for the active site. If a newly useful activity is inefficient, gene duplication/amplification will set the stage for divergence of a new enzyme. Even a few mutations can increase the efficiency of a new activity by orders of magnitude. As efficiency increases, amplified gene arrays will shrink to provide two alleles, one encoding the original enzyme and one encoding the new enzyme. Ultimately, genomic rearrangements eliminate co-amplified genes and move newly evolved paralogs to a distant region of the genome.
Collapse
Affiliation(s)
- Shelley D Copley
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, CO, USA
| |
Collapse
|
9
|
Faber-Hammond JJ, Bezault E, Lunt DH, Joyce DA, Renn SCP. The Genomic Substrate for Adaptive Radiation: Copy Number Variation across 12 Tribes of African Cichlid Species. Genome Biol Evol 2020; 11:2856-2874. [PMID: 31504491 DOI: 10.1093/gbe/evz185] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/27/2019] [Indexed: 02/06/2023] Open
Abstract
The initial sequencing of five cichlid genomes revealed an accumulation of genetic variation, including extensive copy number variation in cichlid lineages particularly those that have undergone dramatic evolutionary radiation. Gene duplication has the potential to generate substantial molecular substrate for the origin of evolutionary novelty. We use array-based comparative heterologous genomic hybridization to identify copy number variation events (CNVEs) for 168 samples representing 53 cichlid species including the 5 species for which full genome sequence is available. We identify an average of 50-100 CNVEs per individual. For those species represented by multiple samples, we identify 150-200 total CNVEs suggesting a substantial amount of intraspecific variation. For these species, only ∼10% of the detected CNVEs are fixed. Hierarchical clustering of species according to CNVE data recapitulates phylogenetic relationships fairly well at both the tribe and radiation level. Although CNVEs are detected on all linkage groups, they tend to cluster in "hotspots" and are likely to contain and be flanked by transposable elements. Furthermore, we show that CNVEs impact functional categories of genes with potential roles in adaptive phenotypes that could reasonably promote divergence and speciation in the cichlid clade. These data contribute to a more complete understanding of the molecular basis for adaptive natural selection, speciation, and evolutionary radiation.
Collapse
Affiliation(s)
| | - Etienne Bezault
- BOREA Research Unit, MNHN, CNRS 7208, Sorbonne Université, IRD 207, UCN, UA, Paris, France
| | - David H Lunt
- Department of Biological and Marine Sciences, University of Hull, Hull Kingston-Upon-Hull, United Kingdom
| | - Domino A Joyce
- Department of Biological and Marine Sciences, University of Hull, Hull Kingston-Upon-Hull, United Kingdom
| | - Suzy C P Renn
- Department of Biology, Reed College, Portland OR 97202
| |
Collapse
|
10
|
Nilsson P, Solbakken MH, Schmid BV, Orr RJS, Lv R, Cui Y, Song Y, Zhang Y, Baalsrud HT, Tørresen OK, Stenseth NC, Yang R, Jakobsen KS, Easterday WR, Jentoft S. The Genome of the Great Gerbil Reveals Species-Specific Duplication of an MHCII Gene. Genome Biol Evol 2020; 12:3832-3849. [PMID: 31971556 PMCID: PMC7046166 DOI: 10.1093/gbe/evaa008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/13/2020] [Indexed: 12/13/2022] Open
Abstract
The great gerbil (Rhombomys opimus) is a social rodent living in permanent, complex burrow systems distributed throughout Central Asia, where it serves as the main host of several important vector-borne infectious pathogens including the well-known plague bacterium (Yersinia pestis). Here, we present a continuous annotated genome assembly of the great gerbil, covering over 96% of the estimated 2.47-Gb genome. Taking advantage of the recent genome assemblies of the sand rat (Psammomys obesus) and the Mongolian gerbil (Meriones unguiculatus), comparative immunogenomic analyses reveal shared gene losses within TLR gene families (i.e., TLR8, TLR10, and the entire TLR11-subfamily) for Gerbillinae, accompanied with signs of diversifying selection of TLR7 and TLR9. Most notably, we find a great gerbil-specific duplication of the MHCII DRB locus. In silico analyses suggest that the duplicated gene provides high peptide binding affinity for Yersiniae epitopes as well as Leishmania and Leptospira epitopes, putatively leading to increased capability to withstand infections by these pathogens. Our study demonstrates the power of whole-genome sequencing combined with comparative genomic analyses to gain deeper insight into the immunogenomic landscape of the great gerbil and its close relatives.
Collapse
Affiliation(s)
- Pernille Nilsson
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - Monica H Solbakken
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - Boris V Schmid
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | | | - Ruichen Lv
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, China
| | - Yujun Cui
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, China
| | - Yajun Song
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, China
| | - Yujiang Zhang
- Xinjiang Center for Disease Control and Prevention, Urumqi, China
| | - Helle T Baalsrud
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - Ole K Tørresen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - Nils Chr Stenseth
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
- Ministry of Education Key Laboratory for Earth System Modeling, Department of Earth System Science, Tsinghua University, Beijing, China
| | - Ruifu Yang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, China
| | - Kjetill S Jakobsen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - William Ryan Easterday
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| | - Sissel Jentoft
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Norway
| |
Collapse
|
11
|
Origination and evolution of orphan genes and de novo genes in the genome of Caenorhabditis elegans. SCIENCE CHINA-LIFE SCIENCES 2019; 62:579-593. [DOI: 10.1007/s11427-019-9482-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 01/22/2019] [Indexed: 12/31/2022]
|
12
|
The Evolution and Biocatalysis of FAD2 Indicate Its Correlation to the Content of Seed Oil in Plants. Int J Mol Sci 2019; 20:ijms20040849. [PMID: 30781405 PMCID: PMC6412433 DOI: 10.3390/ijms20040849] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2019] [Revised: 02/07/2019] [Accepted: 02/12/2019] [Indexed: 01/16/2023] Open
Abstract
Unsaturated fatty acids are the main components of vegetable oils. Fatty acid desaturase 2 (FAD2) catalyzes oleic acid (OA) into linoleic acid (LA) transformations, which are essential to the profile of FAs in seeds. To further understand the roles of FAD2s in the synthesis of oil, the evolution and biocatalysis of FAD2s were comprehensively analyzed. The evolution history of the FAD2 gene family showed that most of the FAD2 genes formed monophyletic clades except in eudicots. The FAD2 genes in some eudicots diverged into constitutive and seed-specific expression clades. Notably, the biocatalysis of seed-specific or -abundant expression FAD2s in soybean, perilla, rice, and spruce revealed that their catalytic activity was strongly correlated with the total oil content of their seeds in nature. Additionally, it was found that I and Y in site 143 of GmaFAD2-1 were strictly conserved in the seed-specific and constitutive expression clades of Fabaceae, respectively. Furthermore, the site-directed mutation demonstrated that I and Y are vital to improving and reducing the activity of GmaFAD2s. Therefore, the results indicate that the activity of FAD2s in seeds might be a reference to the total oil content of seeds, and site 143 might have been specifically evolved to be required for the activity of FAD2s in some expression-diverged eudicots, especially in legumes.
Collapse
|
13
|
Prabh N, Roeseler W, Witte H, Eberhardt G, Sommer RJ, Rödelsperger C. Deep taxon sampling reveals the evolutionary dynamics of novel gene families in Pristionchus nematodes. Genome Res 2018; 28:1664-1674. [PMID: 30232197 PMCID: PMC6211646 DOI: 10.1101/gr.234971.118] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 09/05/2018] [Indexed: 01/20/2023]
Abstract
The widespread identification of genes without detectable homology in related taxa is a hallmark of genome sequencing projects in animals, together with the abundance of gene duplications. Such genes have been called novel, young, taxon-restricted, or orphans, but little is known about the mechanisms accounting for their origin, age, and mode of evolution. Phylogenomic studies relying on deep and systematic taxon sampling and using the comparative method can provide insight into the evolutionary dynamics acting on novel genes. We used a phylogenomic approach for the nematode model organism Pristionchus pacificus and sequenced six additional Pristionchus and two outgroup species. This resulted in 10 genomes with a ladder-like phylogeny, sequenced in one laboratory using the same platform and analyzed by the same bioinformatic procedures. Our analysis revealed that 68%-81% of genes are assignable to orthologous gene families, the majority of which defined nine age classes with presence/absence patterns that can be explained by single evolutionary events. Contrasting different age classes, we find that older age classes are concentrated at chromosome centers, whereas novel gene families preferentially arise at the periphery, are weakly expressed, evolve rapidly, and have a high propensity of being lost. Over time, they increase in expression and become more constrained. Thus, the detailed phylogenetic resolution allowed a comprehensive characterization of the evolutionary dynamics of Pristionchus genomes indicating that distribution of age classes and their associated differences shape chromosomal divergence. This study establishes the Pristionchus system for future research on the mechanisms that drive the formation of novel genes.
Collapse
Affiliation(s)
- Neel Prabh
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| | - Waltraud Roeseler
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| | - Hanh Witte
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| | - Gabi Eberhardt
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| | - Ralf J Sommer
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| | - Christian Rödelsperger
- Department of Integrative Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Max-Planck-Ring 9, 72076 Tübingen, Germany
| |
Collapse
|
14
|
Assogba BS, Alout H, Koffi A, Penetier C, Djogbénou LS, Makoundou P, Weill M, Labbé P. Adaptive deletion in resistance gene duplications in the malaria vector Anopheles gambiae. Evol Appl 2018; 11:1245-1256. [PMID: 30151037 PMCID: PMC6099818 DOI: 10.1111/eva.12619] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Accepted: 02/15/2018] [Indexed: 12/11/2022] Open
Abstract
While gene copy-number variations play major roles in long-term evolution, their early dynamics remains largely unknown. However, examples of their role in short-term adaptation are accumulating: identical repetitions of a locus (homogeneous duplications) can provide a quantitative advantage, while the association of differing alleles (heterogeneous duplications) allows carrying two functions simultaneously. Such duplications often result from rearrangements of sometimes relatively large chromosome fragments, and even when adaptive, they can be associated with deleterious side effects that should, however, be reduced by subsequent evolution. Here, we took advantage of the unique model provided by the malaria mosquito Anopheles gambiae s.l. to investigate the early evolution of several duplications, heterogeneous and homogeneous, segregating in natural populations from West Africa. These duplications encompass ~200 kb and 11 genes, including the adaptive insecticide resistance ace-1 locus. Through the survey of several populations from three countries over 3-4 years, we showed that an internal deletion of all coamplified genes except ace-1 is currently spreading in West Africa and introgressing from An. gambiae s.s. to An. coluzzii. Both observations provide evidences of its selection, most likely due to reducing the gene-dosage disturbances caused by the excessive copies of the nonadaptive genes. Our study thus provides a unique example of the early adaptive trajectory of duplications and underlines the role of the environmental conditions (insecticide treatment practices and species ecology). It also emphasizes the striking diversity of adaptive responses in these mosquitoes and reveals a worrisome process of resistance/cost trade-off evolution that could impact the control of malaria vectors in Africa.
Collapse
Affiliation(s)
- Benoît S. Assogba
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS‐UM‐IRD‐EPHE)Université de MontpellierMontpellierFrance
- Disease Control and Elimination DepartmentMedical Research Council, Unit The GambiaBanjulThe Gambia
- Institut Régional de Santé PubliqueUniversité d'Abomey CalaviCotonouBenin
- Faculté des Sciences et TechniquesLaboratoire de Biologie et de Typage Moléculaire en MicrobiologieUniversité d'Abomey CalaviCotonouBénin
| | - Haoues Alout
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS‐UM‐IRD‐EPHE)Université de MontpellierMontpellierFrance
| | - Alphonsine Koffi
- Institut Pierre Richet (IPR)/Institut National de Santé Publique (INSP)BouakéCôte d'Ivoire
| | - Cédric Penetier
- Institut de Recherche pour le Développement (IRD)UMR MIVEGECMontpellierFrance
| | - Luc S. Djogbénou
- Institut Régional de Santé PubliqueUniversité d'Abomey CalaviCotonouBenin
- Faculté des Sciences et TechniquesLaboratoire de Biologie et de Typage Moléculaire en MicrobiologieUniversité d'Abomey CalaviCotonouBénin
| | - Patrick Makoundou
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS‐UM‐IRD‐EPHE)Université de MontpellierMontpellierFrance
| | - Mylène Weill
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS‐UM‐IRD‐EPHE)Université de MontpellierMontpellierFrance
| | - Pierrick Labbé
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS‐UM‐IRD‐EPHE)Université de MontpellierMontpellierFrance
| |
Collapse
|
15
|
Rajaraman J, Douchkov D, Lück S, Hensel G, Nowara D, Pogoda M, Rutten T, Meitzel T, Brassac J, Höfle C, Hückelhoven R, Klinkenberg J, Trujillo M, Bauer E, Schmutzer T, Himmelbach A, Mascher M, Lazzari B, Stein N, Kumlehn J, Schweizer P. Evolutionarily conserved partial gene duplication in the Triticeae tribe of grasses confers pathogen resistance. Genome Biol 2018; 19:116. [PMID: 30111359 PMCID: PMC6092874 DOI: 10.1186/s13059-018-1472-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 07/04/2018] [Indexed: 11/11/2022] Open
Abstract
Background The large and highly repetitive genomes of the cultivated species Hordeum vulgare (barley), Triticum aestivum (wheat), and Secale cereale (rye) belonging to the Triticeae tribe of grasses appear to be particularly rich in gene-like sequences including partial duplicates. Most of them have been classified as putative pseudogenes. In this study we employ transient and stable gene silencing- and over-expression systems in barley to study the function of HvARM1 (for H. vulgare Armadillo 1), a partial gene duplicate of the U-box/armadillo-repeat E3 ligase HvPUB15 (for H. vulgare Plant U-Box 15). Results The partial ARM1 gene is derived from a gene-duplication event in a common ancestor of the Triticeae and contributes to quantitative host as well as nonhost resistance to the biotrophic powdery mildew fungus Blumeria graminis. In barley, allelic variants of HvARM1 but not of HvPUB15 are significantly associated with levels of powdery mildew infection. Both HvPUB15 and HvARM1 proteins interact in yeast and plant cells with the susceptibility-related, plastid-localized barley homologs of THF1 (for Thylakoid formation 1) and of ClpS1 (for Clp-protease adaptor S1) of Arabidopsis thaliana. A genome-wide scan for partial gene duplicates reveals further events in barley resulting in stress-regulated, potentially neo-functionalized, genes. Conclusion The results suggest neo-functionalization of the partial gene copy HvARM1 increases resistance against powdery mildew infection. It further links plastid function with susceptibility to biotrophic pathogen attack. These findings shed new light on a novel mechanism to employ partial duplication of protein-protein interaction domains to facilitate the expansion of immune signaling networks. Electronic supplementary material The online version of this article (10.1186/s13059-018-1472-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jeyaraman Rajaraman
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany.
| | - Dimitar Douchkov
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany.
| | - Stefanie Lück
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Götz Hensel
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Daniela Nowara
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Maria Pogoda
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Twan Rutten
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Tobias Meitzel
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Jonathan Brassac
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Caroline Höfle
- Technische Universität München, Emil-Ramann-Straße 2, D-85354, Freising, Germany
| | - Ralph Hückelhoven
- Technische Universität München, Emil-Ramann-Straße 2, D-85354, Freising, Germany
| | - Jörn Klinkenberg
- Leibniz Institut für Pflanzenbiochemie, Weinberg 3, D-06120, Halle (Saale), Germany
| | - Marco Trujillo
- Leibniz Institut für Pflanzenbiochemie, Weinberg 3, D-06120, Halle (Saale), Germany.,Albert-Ludwigs-Universität Freiburg, Institut für Biologie II, Zellbiologie, D-79104, Freiburg, Germany
| | - Eva Bauer
- Technische Universität München, Liesel-Beckmann-Straße 2, D-85354, Freising, Germany
| | - Thomas Schmutzer
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Axel Himmelbach
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Martin Mascher
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Barbara Lazzari
- Parco Technologico Padano, Via Einstein, Loc. Cascina Codazza, 26900, Lodi, Italy
| | - Nils Stein
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Jochen Kumlehn
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| | - Patrick Schweizer
- Leibniz Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK Gatersleben), Corrensstrasse 3, D-06466, Stadt Seeland, Germany
| |
Collapse
|
16
|
Mutational and transcriptional landscape of spontaneous gene duplications and deletions in Caenorhabditis elegans. Proc Natl Acad Sci U S A 2018; 115:7386-7391. [PMID: 29941601 DOI: 10.1073/pnas.1801930115] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Gene duplication and deletion are pivotal processes shaping the structural and functional repertoire of genomes, with implications for disease, adaptation, and evolution. We employed a mutation accumulation (MA) framework partnered with high-throughput genomics to assess the molecular and transcriptional characteristics of newly arisen gene copy-number variants (CNVs) in Caenorhabditis elegans populations subjected to varying intensity of selection. Here, we report a direct spontaneous genome-wide rate of gene duplication of 2.9 × 10-5/gene per generation in C. elegans, the highest for any species to date. The rate of gene deletion is sixfold lower (5 × 10-6/gene per generation). Deletions of highly expressed genes are particularly deleterious, given their paucity in even the N = 1 lines with minimal efficacy of selection. The increase in average transcript abundance of new duplicates arising under minimal selection is significantly greater than twofold compared with single copies of the same gene, suggesting that genes in segmental duplications are frequently overactive at inception. The average increase in transcriptional activity of gene duplicates is greater in the N = 1 MA lines than in MA lines with larger population bottlenecks. There is an inverse relationship between the ancestral transcription levels of new gene duplicates and population size, with duplicate copies of highly expressed genes less likely to accumulate in larger populations. Our results demonstrate a fitness cost of increased transcription following duplication, which results in purifying selection against new gene duplicates. However, on average, duplications also provide a significant increase in gene expression that can facilitate adaptation to novel environmental challenges.
Collapse
|
17
|
|
18
|
Abstract
Nematodes, such as Caenorhabditis elegans, form one of the most species-rich animal phyla. By now more than 30 nematode genomes have been published allowing for comparative genomic analyses at various different time-scales. The majority of a nematode's gene repertoire is represented by either duplicated or so-called orphan genes of unknown origin. This indicates the importance of mechanisms that generate new genes during the course of evolution. While it is certain that nematodes have acquired genes by horizontal gene transfer from various donors, this process only explains a small portion of the nematode gene content. As evolutionary genomic analyses strongly support that most orphan genes are indeed protein-coding, future studies will have to decide, whether they are result from extreme divergence or evolved de novo from previously noncoding sequences. In this contribution, I summarize several studies investigating gene loss and gain in nematodes and discuss the strengths and weaknesses of individual approaches and datasets. These approaches can be used to ask nematode-specific questions such as associated with the evolution of parasitism or with switches in mating systems, but also can complement studies in other animal phyla like vertebrates and insects to broaden our general view on genome evolution.
Collapse
Affiliation(s)
- Christian Rödelsperger
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, 72076, Tübingen, Germany.
| |
Collapse
|
19
|
Janoušek V, Laukaitis CM, Yanchukov A, Karn RC. The Role of Retrotransposons in Gene Family Expansions in the Human and Mouse Genomes. Genome Biol Evol 2016; 8:2632-50. [PMID: 27503295 PMCID: PMC5631067 DOI: 10.1093/gbe/evw192] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Retrotransposons comprise a large portion of mammalian genomes. They contribute to structural changes and more importantly to gene regulation. The expansion and diversification of gene families have been implicated as sources of evolutionary novelties. Given the roles retrotransposons play in genomes, their contribution to the evolution of gene families warrants further exploration. In this study, we found a significant association between two major retrotransposon classes, LINEs and LTRs, and lineage-specific gene family expansions in both the human and mouse genomes. The distribution and diversity differ between LINEs and LTRs, suggesting that each has a distinct involvement in gene family expansion. LTRs are associated with open chromatin sites surrounding the gene families, supporting their involvement in gene regulation, whereas LINEs may play a structural role promoting gene duplication. Our findings also suggest that gene family expansions, especially in the mouse genome, undergo two phases. The first phase is characterized by elevated deposition of LTRs and their utilization in reshaping gene regulatory networks. The second phase is characterized by rapid gene family expansion due to continuous accumulation of LINEs and it appears that, in some instances at least, this could become a runaway process. We provide an example in which this has happened and we present a simulation supporting the possibility of the runaway process. Altogether we provide evidence of the contribution of retrotransposons to the expansion and evolution of gene families. Our findings emphasize the putative importance of these elements in diversification and adaptation in the human and mouse lineages.
Collapse
Affiliation(s)
- Václav Janoušek
- Department of Zoology, Faculty of Science, Charles University in Prague, Prague, Czech Republic Institute of Vertebrate Biology, ASCR, Brno, Czech Republic
| | | | - Alexey Yanchukov
- Institute of Vertebrate Biology, ASCR, Brno, Czech Republic Department of Biology, Faculty of Arts and Sciences, Bülent Ecevit University, Zonguldak, Turkey
| | - Robert C Karn
- Department of Medicine, College of Medicine, University of Arizona
| |
Collapse
|
20
|
Steenwyk JL, Soghigian JS, Perfect JR, Gibbons JG. Copy number variation contributes to cryptic genetic variation in outbreak lineages of Cryptococcus gattii from the North American Pacific Northwest. BMC Genomics 2016; 17:700. [PMID: 27590805 PMCID: PMC5009542 DOI: 10.1186/s12864-016-3044-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 08/24/2016] [Indexed: 12/13/2022] Open
Abstract
Background Copy number variants (CNVs) are a class of structural variants (SVs) and are defined as fragments of DNA that are present at variable copy number in comparison with a reference genome. Recent advances in bioinformatics methodologies and sequencing technologies have enabled the high-resolution quantification of genome-wide CNVs. In pathogenic fungi SVs have been shown to alter gene expression, influence host specificity, and drive fungicide resistance, but little attention has focused specifically on CNVs. Using publicly available sequencing data, we identified 90 isolates across 212 Cryptococcus gattii genomes that belong to the VGII subgroups responsible for the recent deadly outbreaks in the North American Pacific Northwest. We generated CNV profiles for each sample to investigate the prevalence and function of CNV in C. gattii. Results We identified eight genetic clusters among publicly available Illumina whole genome sequence data from 212 C. gattii isolates through population structure analysis. Three clusters represent the VGIIa, VGIIb, and VGIIc subgroups from the North American Pacific Northwest. CNV was bioinformatically predicted and affected ~300–400 Kilobases (Kb) of the C. gattii VGII subgroup genomes. Sixty-seven loci, encompassing 58 genes, showed highly divergent patterns of copy number variation between VGII subgroups. Analysis of PFam domains within divergent CN variable genes revealed enrichment of protein domains associated with transport, cell wall organization and external encapsulating structure. Conclusions CNVs may contribute to pathological and phenotypic differences observed between the C. gattii VGIIa, VGIIb, and VGIIc subpopulations. Genes overlapping with population differentiated CNVs were enriched for several virulence related functional terms. These results uncover novel candidate genes to examine the genetic and functional underpinnings of C. gattii pathogenicity. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3044-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jacob L Steenwyk
- Biology Department, Clark University, 950 Main Street, Worcester, MA, USA.,Current address: Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - John S Soghigian
- Biology Department, Clark University, 950 Main Street, Worcester, MA, USA.,Current address: Department of Environmental Sciences, The Connecticut Agricultural Experiment Station, New Haven, CT, USA
| | - John R Perfect
- Division of Infectious Diseases, Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - John G Gibbons
- Biology Department, Clark University, 950 Main Street, Worcester, MA, USA.
| |
Collapse
|
21
|
Extracting functional trends from whole genome duplication events using comparative genomics. Biol Proced Online 2016; 18:11. [PMID: 27168732 PMCID: PMC4862183 DOI: 10.1186/s12575-016-0041-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 04/24/2016] [Indexed: 01/06/2023] Open
Abstract
Background The number of species with completed genomes, including those with evidence for recent whole genome duplication events has exploded. The recently sequenced Atlantic salmon genome has been through two rounds of whole genome duplication since the divergence of teleost fish from the lineage that led to amniotes. This quadrupoling of the number of potential genes has led to complex patterns of retention and loss among gene families. Results Methods have been developed to characterize the interplay of duplicate gene retention processes across both whole genome duplication events and additional smaller scale duplication events. Further, gene expression divergence data has become available as well for Atlantic salmon and the closely related, pre-whole genome duplication pike and methods to describe expression divergence are also presented. These methods for the characterization of duplicate gene retention and gene expression divergence that have been applied to salmon are described. Conclusions With the growth in available genomic and functional data, the opportunities to extract functional inference from large scale duplicates using comparative methods have expanded dramatically. Recently developed methods that further this inference for duplicated genes have been described. Electronic supplementary material The online version of this article (doi:10.1186/s12575-016-0041-2) contains supplementary material, which is available to authorized users.
Collapse
|
22
|
Farslow JC, Lipinski KJ, Packard LB, Edgley ML, Taylor J, Flibotte S, Moerman DG, Katju V, Bergthorsson U. Rapid Increase in frequency of gene copy-number variants during experimental evolution in Caenorhabditis elegans. BMC Genomics 2015; 16:1044. [PMID: 26645535 PMCID: PMC4673709 DOI: 10.1186/s12864-015-2253-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 11/27/2015] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND Gene copy-number variation (CNVs), which provides the raw material for the evolution of novel genes, is widespread in natural populations. We investigated whether CNVs constitute a common mechanism of genetic change during adaptation in experimental Caenorhabditis elegans populations. Outcrossing C. elegans populations with low fitness were evolved for >200 generations. The frequencies of CNVs in these populations were analyzed by oligonucleotide array comparative genome hybridization, quantitative PCR, PCR, DNA sequencing across breakpoints, and single-worm PCR. RESULTS Multiple duplications and deletions rose to intermediate or high frequencies in independent populations. Several lines of evidence suggest that these changes were adaptive: (i) copy-number changes reached high frequency or were fixed in a short time, (ii) many independent populations harbored CNVs spanning the same genes, and (iii) larger average size of CNVs in adapting populations relative to spontaneous CNVs. The latter is expected if larger CNVs are more likely to encompass genes under selection for a change in gene dosage. Several convergent CNVs originated in populations descended from different low fitness ancestors as well as high fitness controls. CONCLUSIONS We show that gene copy-number changes are a common class of adaptive genetic change. Due to the high rates of origin of spontaneous duplications and deletions, copy-number changes containing the same genes arose readily in independent populations. Duplications that reached high frequencies in these adapting populations were significantly larger in span. Many convergent CNVs may be general adaptations to laboratory conditions. These results demonstrate the great potential borne by CNVs for evolutionary adaptation.
Collapse
Affiliation(s)
- James C Farslow
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA
| | - Kendra J Lipinski
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA
| | - Lucille B Packard
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA
| | - Mark L Edgley
- Department of Zoology, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Jon Taylor
- Department of Zoology, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Stephane Flibotte
- Department of Zoology, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Donald G Moerman
- Department of Zoology, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA.,Present address: Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77843-4458, USA
| | - Ulfar Bergthorsson
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA. .,Present address: Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77843-4458, USA.
| |
Collapse
|
23
|
Bu L, Katju V. Early evolutionary history and genomic features of gene duplicates in the human genome. BMC Genomics 2015; 16:621. [PMID: 26290067 PMCID: PMC4546093 DOI: 10.1186/s12864-015-1827-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2015] [Accepted: 08/07/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Human gene duplicates have been the focus of intense research since the development of array-based and targeted next-generation sequencing approaches in the last decade. These studies have primarily concentrated on determining the extant copy-number variation from a population-genomic perspective but lack a robust evolutionary framework to elucidate the early structural and genomic characteristics of gene duplicates at emergence and their subsequent evolution with increasing age. RESULTS We analyzed 184 gene duplicate pairs comprising small gene families in the draft human genome with 10% or less synonymous sequence divergence. Human gene duplicates primarily originate from DNA-mediated events, taking up genomic residence as intrachromosomal copies in direct or inverse orientation. The distribution of paralogs on autosomes follows random expectations in contrast to their significant enrichment on the sex chromosomes. Furthermore, human gene duplicates exhibit a skewed gradient of distribution along the chromosomal length with significant clustering in pericentromeric regions. Surprisingly, despite the large average length of human genes, the majority of extant duplicates (83%) are complete duplicates, wherein the entire ORF of the ancestral copy was duplicated. The preponderance of complete duplicates is in accord with an extremely large median duplication span of 36 kb, which enhances the probability of capturing ancestral ORFs in their entirety. With increasing evolutionary age, human paralogs exhibit declines in (i) the frequency of intrachromosomal paralogs, and (ii) the proportion of complete duplicates. These changes may reflect lower survival rates of certain classes of duplicates and/or the role of purifying selection. Duplications arising from RNA-mediated events comprise a small fraction (11.4%) of all human paralogs and are more numerous in older evolutionary cohorts of duplicates. CONCLUSIONS The degree of structural resemblance, genomic location and duplication span appear to influence the long-term maintenance of paralogs in the human genome. The median duplication span in the human genome far exceeds that in C. elegans and yeast and likely contributes to the high prevalence of complete duplicates relative to structurally heterogeneous duplicates (partial and chimeric). The relative roles of regulatory sequence versus exon-intron structure changes in the acquisition of novel function by human paralogs remains to be determined.
Collapse
Affiliation(s)
- Lijing Bu
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA.
| | - Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM, 87131, USA. .,Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas, TX, 77843-4458, USA.
| |
Collapse
|
24
|
Microevolution of Duplications and Deletions and Their Impact on Gene Expression in the Nematode Pristionchus pacificus. PLoS One 2015; 10:e0131136. [PMID: 26125626 PMCID: PMC4488370 DOI: 10.1371/journal.pone.0131136] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 05/27/2015] [Indexed: 11/19/2022] Open
Abstract
The evolution of diversity across the animal kingdom has been accompanied by tremendous gene loss and gain. While comparative genomics has been fruitful to characterize differences in gene content across highly diverged species, little is known about the microevolution of structural variations that cause these differences in the first place. In order to investigate the genomic impact of structural variations, we made use of genomic and transcriptomic data from the nematode Pristionchus pacificus, which has been established as a satellite model to Caenorhabditis elegans for comparative biology. We exploit the fact that P. pacificus is a highly diverse species for which various genomic data including the draft genome of a sister species P. exspectatus is available. Based on resequencing coverage data for two natural isolates we identified large (> 2kb) deletions and duplications relative to the reference strain. By restriction to completely syntenic regions between P. pacificus and P. exspectatus, we were able to polarize the comparison and to assess the impact of structural variations on expression levels. We found that while loss of genes correlates with lack of expression, duplication of genes has virtually no effect on gene expression. Further investigating expression of individual copies at sites that segregate between the duplicates, we found in the majority of cases only one of the copies to be expressed. Nevertheless, we still find that certain gene classes are strongly depleted in deletions as well as duplications, suggesting evolutionary constraint acting on synteny. In summary, our results are consistent with a model, where most structural variations are either deleterious or neutral and provide first insights into the microevolution of structural variations in the P. pacificus genome.
Collapse
|
25
|
Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes. Biochem Soc Trans 2015; 42:851-60. [PMID: 25109968 DOI: 10.1042/bst20140042] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.
Collapse
|
26
|
Karn RC, Chung AG, Laukaitis CM. Did androgen-binding protein paralogs undergo neo- and/or Subfunctionalization as the Abp gene region expanded in the mouse genome? PLoS One 2014; 9:e115454. [PMID: 25531410 PMCID: PMC4274081 DOI: 10.1371/journal.pone.0115454] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 11/24/2014] [Indexed: 11/19/2022] Open
Abstract
The Androgen-binding protein (Abp) region of the mouse genome contains 30 Abpa genes encoding alpha subunits and 34 Abpbg genes encoding betagamma subunits, their products forming dimers composed of an alpha and a betagamma subunit. We endeavored to determine how many Abp genes are expressed as proteins in tears and saliva, and as transcripts in the exocrine glands producing them. Using standard PCR, we amplified Abp transcripts from cDNA libraries of C57BL/6 mice and found fifteen Abp gene transcripts in the lacrimal gland and five in the submandibular gland. Proteomic analyses identified proteins corresponding to eleven of the lacrimal gland transcripts, all of them different from the three salivary ABPs reported previously. Our qPCR results showed that five of the six transcripts that lacked corresponding proteins are expressed at very low levels compared to those transcripts with proteins. We found 1) no overlap in the repertoires of expressed Abp paralogs in lacrimal gland/tears and salivary glands/saliva; 2) substantial sex-limited expression of lacrimal gland/tear expressed-paralogs in males but no sex-limited expression in females; and 3) that the lacrimal gland/tear expressed-paralogs are found exclusively in ancestral clades 1, 2 and 3 of the five clades described previously while the salivary glands/saliva expressed-paralogs are found only in clade 5. The number of instances of extremely low levels of transcription without corresponding protein production in paralogs specific to tears and saliva suggested the role of subfunctionalization, a derived condition wherein genes that may have been expressed highly in both glands ancestrally were down-regulated subsequent to duplication. Thus, evidence for subfunctionalization can be seen in our data and we argue that the partitioning of paralog expression between lacrimal and salivary glands that we report here occurred as the result of adaptive evolution.
Collapse
Affiliation(s)
- Robert C. Karn
- College of Medicine, University of Arizona, Tucson, Arizona, 85724, United States of America
- * E-mail:
| | - Amanda G. Chung
- College of Medicine, University of Arizona, Tucson, Arizona, 85724, United States of America
| | - Christina M. Laukaitis
- College of Medicine, University of Arizona, Tucson, Arizona, 85724, United States of America
| |
Collapse
|
27
|
Nuclear receptors in nematode development: Natural experiments made by a phylum. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2014; 1849:224-37. [PMID: 24984201 DOI: 10.1016/j.bbagrm.2014.06.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Revised: 06/21/2014] [Accepted: 06/23/2014] [Indexed: 11/21/2022]
Abstract
The development of complex multicellular organisms is dependent on regulatory decisions that are necessary for the establishment of specific differentiation and metabolic cellular states. Nuclear receptors (NRs) form a large family of transcription factors that play critical roles in the regulation of development and metabolism of Metazoa. Based on their DNA binding and ligand binding domains, NRs are divided into eight NR subfamilies from which representatives of six subfamilies are present in both deuterostomes and protostomes indicating their early evolutionary origin. In some nematode species, especially in Caenorhabditis, the family of NRs expanded to a large number of genes strikingly exceeding the number of NR genes in vertebrates or insects. Nematode NRs, including the multiplied Caenorhabditis genes, show clear relation to vertebrate and insect homologues belonging to six of the eight main NR subfamilies. This review summarizes advances in research of nematode NRs and their developmental functions. Nematode NRs can reveal evolutionarily conserved mechanisms that regulate specific developmental and metabolic processes as well as new regulatory adaptations. They represent the results of a large number of natural experiments with structural and functional potential of NRs for the evolution of the phylum. The conserved and divergent character of nematode NRs adds a new dimension to our understanding of the general biology of regulation by NRs. This article is part of a Special Issue entitled: Nuclear receptors in animal development.
Collapse
|
28
|
Wang Y, Levy DE. Comparative evolutionary genomics of the STAT family of transcription factors. JAKSTAT 2014; 1:23-33. [PMID: 24058748 PMCID: PMC3670131 DOI: 10.4161/jkst.19418] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2011] [Revised: 01/16/2012] [Accepted: 01/19/2012] [Indexed: 01/23/2023] Open
Abstract
The STAT signaling pathway is one of the seven common pathways that govern cell fate decisions during animal development. Comparative genomics revealed multiple incidences of stat gene duplications throughout metazoan evolutionary history. While pseudogenization is a frequent fate of duplicated genes, many of these STAT duplications evolved into novel genes through rapid sequence diversification and neofunctionalization. Additionally, the core of STAT gene regulatory networks, comprising stat1 through 4, stat5 and stat6, arose early in vertebrate evolution, probably through the two whole genome duplication events that occurred after the split of Cephalochordates but before the rise of Chondrichthyes. While another complete genome duplication event took place during the evolution of bony fish after their separation from the tetrapods about 450 million years ago (Mya), modern fish have only one set of these core stats, suggesting the rapid loss of most duplicated stat genes. The two stat5 genes in mammals likely arose from a duplication event in early Eutherian evolution, a period from about 310 Mya at the avian-mammal divergence to the separation of marsupials from other mammals about 130 Mya. These analyses indicate that whole genome duplications and gene duplications by unequal chromosomal crossing over were likely the major mechanisms underlying the evolution of STATs.
Collapse
Affiliation(s)
- Yaming Wang
- Departments of Pathology and Microbiology and NYU Cancer Institute; New York University School of Medicine; New York, NY USA
| | | |
Collapse
|
29
|
Abstract
Gene duplication is widely believed to facilitate adaptation, but unambiguous evidence for this hypothesis has been found in only a small number of cases. Although gene duplication may increase the fitness of the involved organisms by doubling gene dosage or neofunctionalization, it may also result in a simple division of ancestral functions into daughter genes, which need not promote adaptation. Hence, the general validity of the adaptation by gene duplication hypothesis remains uncertain. Indeed, a genome-scale experiment found similar fitness effects of deleting pairs of duplicate genes and deleting individual singleton genes from the yeast genome, leading to the conclusion that duplication rarely results in adaptation. Here we contend that the above comparison is unfair because of a known duplication bias among genes with different fitness contributions. To rectify this problem, we compare homologous genes from the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe. We discover that simultaneously deleting a duplicate gene pair in S. cerevisiae reduces fitness significantly more than deleting their singleton counterpart in S. pombe, revealing post-duplication adaptation. The duplicates-singleton difference in fitness effect is not attributable to a potential increase in gene dose after duplication, suggesting that the adaptation is owing to neofunctionalization, which we find to be explicable by acquisitions of binary protein-protein interactions rather than gene expression changes. These results provide genomic evidence for the role of gene duplication in organismal adaptation and are important for understanding the genetic mechanisms of evolutionary innovation.
Collapse
Affiliation(s)
- Wenfeng Qian
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA; Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
30
|
Gaudry MJ, Storz JF, Butts GT, Campbell KL, Hoffmann FG. Repeated evolution of chimeric fusion genes in the β-globin gene family of laurasiatherian mammals. Genome Biol Evol 2014; 6:1219-34. [PMID: 24814285 PMCID: PMC4041002 DOI: 10.1093/gbe/evu097] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/05/2014] [Indexed: 12/13/2022] Open
Abstract
The evolutionary fate of chimeric fusion genes may be strongly influenced by their recombinational mode of origin and the nature of functional divergence between the parental genes. In the β-globin gene family of placental mammals, the two postnatally expressed δ- and β-globin genes (HBD and HBB, respectively) have a propensity for recombinational exchange via gene conversion and unequal crossing-over. In the latter case, there are good reasons to expect differences in retention rates for the reciprocal HBB/HBD and HBD/HBB fusion genes due to thalassemia pathologies associated with the HBD/HBB "Lepore" deletion mutant in humans. Here, we report a comparative genomic analysis of the mammalian β-globin gene cluster, which revealed that chimeric HBB/HBD fusion genes originated independently in four separate lineages of laurasiatherian mammals: Eulipotyphlans (shrews, moles, and hedgehogs), carnivores, microchiropteran bats, and cetaceans. In cases where an independently derived "anti-Lepore" duplication mutant has become fixed, the parental HBD and/or HBB genes have typically been inactivated or deleted, so that the newly created HBB/HBD fusion gene is primarily responsible for synthesizing the β-type subunits of adult and fetal hemoglobin (Hb). Contrary to conventional wisdom that the HBD gene is a vestigial relict that is typically inactivated or expressed at negligible levels, we show that HBD-like genes often encode a substantial fraction (20-100%) of β-chain Hbs in laurasiatherian taxa. Our results indicate that the ascendancy or resuscitation of genes with HBD-like coding sequence requires the secondary acquisition of HBB-like promoter sequence via unequal crossing-over or interparalog gene conversion.
Collapse
Affiliation(s)
- Michael J Gaudry
- Department of Biological Sciences, University of Manitoba, Winnipeg, MB, Canada
| | - Jay F Storz
- School of Biological Sciences, University of Nebraska, Lincoln
| | - Gary Tyler Butts
- Department of Biochemistry, Molecular Biology, Entomology, and Plant Pathology, Mississippi State University
| | - Kevin L Campbell
- Department of Biological Sciences, University of Manitoba, Winnipeg, MB, Canada
| | - Federico G Hoffmann
- Department of Biochemistry, Molecular Biology, Entomology, and Plant Pathology, Mississippi State UniversityInstitute for Genomics, Biocomputing and Biotechnology, Mississippi State University
| |
Collapse
|
31
|
Labbé P, Milesi P, Yébakima A, Pasteur N, Weill M, Lenormand T. GENE-dosage effects on fitness in recent adaptive duplications: ace-1 in the mosquito Culex pipiens. Evolution 2014; 68:2092-101. [PMID: 24494966 DOI: 10.1111/evo.12372] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 01/22/2014] [Indexed: 01/13/2023]
Abstract
Gene duplications have long been advocated to contribute to the evolution of new functions. The role of selection in their early spread is more controversial. Unless duplications are favored for a direct benefit of increased expression, they are likely detrimental. In this article, we investigated the case of duplications favored because they combine already functionally divergent alleles. Their gene-dosage/fitness relations are poorly known because selection may operate on both overall expression and duplicates relative dosage. Using the well-documented case of Culex pipiens resistance to insecticides, we compared strains with various ace-1 allele combinations, including two duplicated alleles carrying both susceptible and resistant copies. The overall protein activity was nearly additive, but, surprisingly, fitness correlated better with the relative proportion of susceptible and resistant copies rather than any absolute measure of activity. Gene dosage is thus crucial, duplications stabilizing a "heterozygote" phenotype. It corroborates the view that these were favored because they fix a permanent heterosis, thereby solving the irreducible trade-off between resistance and synaptic transmission. Moreover, we showed that the contrasted successes of the two duplicated alleles in natural populations depend on genetic changes unrelated to ace-1, confirming the probable implication of recessive sublethal mutations linked to structural rearrangements in some duplications.
Collapse
Affiliation(s)
- Pierrick Labbé
- CNRS, Université Montpellier 2, ISEM - UMR 5554, 34065, Montpellier, France.
| | | | | | | | | | | |
Collapse
|
32
|
Diss G, Ascencio D, DeLuna A, Landry CR. Molecular mechanisms of paralogous compensation and the robustness of cellular networks. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2013; 322:488-99. [PMID: 24376223 DOI: 10.1002/jez.b.22555] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Accepted: 11/26/2013] [Indexed: 11/10/2022]
Abstract
Robustness is the ability of a system to maintain its function despite environmental or genetic perturbation. Genetic robustness is a key emerging property of living systems and is achieved notably by the presence of partially redundant parts that result from gene duplication. Functional overlap between paralogs allows them to compensate for each other's loss, as commonly revealed by aggravating genetic interactions. However, the molecular mechanisms linking the genotype (loss of function of a gene) to the phenotype (genetic buffering by a paralog) are still poorly understood and the molecular aspects of this compensation are rarely addressed in studies of gene duplicates. Here, we review molecular mechanisms of functional compensation between paralogous genes, many of which from studies that were not meant to study this phenomenon. We propose a standardized terminology and, depending on whether or not the molecular behavior of the intact gene is modified in response to the deletion of its paralog, we classify mechanisms of compensation into passive and active events. We further describe three non-exclusive mechanisms of active paralogous compensation for which there is evidence in the literature: changes in abundance, in localization, and in protein interactions. This review will serve as a framework for the genetic and molecular analysis of paralogous compensation, one of the universal features of genetic systems.
Collapse
Affiliation(s)
- Guillaume Diss
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes, PROTEO, Pavillon Charles-Eugène-Marchand, Québec, QC, Canada
| | | | | | | |
Collapse
|
33
|
Abstract
Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and how they evolve to be critical components of the genetic systems that determine the biological diversity of life. Two decades of effort have shed light on the process of new gene origination and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular, and phenotypic functions.
Collapse
Affiliation(s)
- Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637;
| | | | | | | |
Collapse
|
34
|
Primate genome architecture influences structural variation mechanisms and functional consequences. Proc Natl Acad Sci U S A 2013; 110:15764-9. [PMID: 24014587 DOI: 10.1073/pnas.1305904110] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Although nucleotide resolution maps of genomic structural variants (SVs) have provided insights into the origin and impact of phenotypic diversity in humans, comparable maps in nonhuman primates have thus far been lacking. Using massively parallel DNA sequencing, we constructed fine-resolution genomic structural variation maps in five chimpanzees, five orang-utans, and five rhesus macaques. The SV maps, which are comprised of thousands of deletions, duplications, and mobile element insertions, revealed a high activity of retrotransposition in macaques compared with great apes. By comparison, nonallelic homologous recombination is specifically active in the great apes, which is correlated with architectural differences between the genomes of great apes and macaque. Transcriptome analyses across nonhuman primates and humans revealed effects of species-specific whole-gene duplication on gene expression. We identified 13 gene duplications coinciding with the species-specific gain of tissue-specific gene expression in keeping with a role of gene duplication in the promotion of diversification and the acquisition of unique functions. Differences in the present day activity of SV formation mechanisms that our study revealed may contribute to ongoing diversification and adaptation of great ape and Old World monkey lineages.
Collapse
|
35
|
Janoušek V, Karn RC, Laukaitis CM. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family. BMC Evol Biol 2013; 13:107. [PMID: 23718880 PMCID: PMC3669608 DOI: 10.1186/1471-2148-13-107] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2013] [Accepted: 05/14/2013] [Indexed: 12/01/2022] Open
Abstract
Background Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Results Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. Conclusions We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in the most recent duplication are the main contributions of our study.
Collapse
Affiliation(s)
- Václav Janoušek
- Department of Zoology, Faculty of Science, Charles University in Prague, Prague 128 43, Czech Republic
| | | | | |
Collapse
|
36
|
Katju V. To the beat of a different drum: determinants implicated in the asymmetric sequence divergence of Caenorhabditis elegans paralogs. BMC Evol Biol 2013; 13:73. [PMID: 23530733 PMCID: PMC3637608 DOI: 10.1186/1471-2148-13-73] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Accepted: 03/20/2013] [Indexed: 12/18/2022] Open
Abstract
Background Gene duplicates often exhibit asymmetric rates of molecular evolution in their early evolutionary existence. This asymmetry in rates is thought to signify the maintenance of the ancestral function by one copy and the removal of functional constraint on the other copy, enabling it to embark on a novel evolutionary trajectory. Here I focused on a large population of evolutionarily young gene duplicates (KS ≤ 0.14) in the Caenorhabditis elegans genome in order to conduct the first combined analysis of four predictors (evolutionary age, chromosomal location, structural resemblance between duplicates, and duplication span) which may be implicated in the asymmetric sequence divergence of paralogs at the nucleotide and amino acid level. In addition, I investigate if either paralog is equally likely to embark on a trajectory of accelerated sequence evolution or whether the derived paralog is more likely to exhibit faster sequence evolution. Results Three predictors (evolutionary age of duplicates, chromosomal location and duplication span) serve as major determinants of sequence asymmetry between C. elegans paralogs. Paralogs diverge asymmetrically in sequence with increasing evolutionary age, the relocation of one copy to a different chromosome and attenuated duplication spans that likely fail to capture the entire ancestral repertoire of coding sequence and regulatory elements. Furthermore, for paralogs residing on the same chromosome, opposite transcriptional orientation and increased genomic distance do not increase sequence asymmetry between paralogs. For a subset of duplicate pairs wherein the ancestral versus derived paralog could be distinguished, the derived paralogs are more likely to evolve at accelerated rates. Conclusions This genome-wide study of evolutionarily young duplicates stemming primarily from DNA-mediated small-scale duplication events demonstrates that genomic relocation to a new chromosome has important consequences for asymmetric divergence of paralogs, akin to paralogs arising from RNA-mediated duplication events. Additionally, the duplication span is negatively correlated with sequence rate asymmetry among paralogs, suggesting that attenuated duplication spans stemming from incomplete duplication of the ORF and/or ancestral regulatory elements further accelerate sequence divergence between paralogs. Cumulatively, derived copies exhibit accelerated rates of sequence evolution suggesting that they are primed for a divergent evolutionary trajectory by changes in structure and genomic context at inception.
Collapse
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA.
| |
Collapse
|
37
|
Owens SM, Harberson NA, Moore RC. Asymmetric functional divergence of young, dispersed gene duplicates in Arabidopsis thaliana. J Mol Evol 2013; 76:13-27. [PMID: 23344714 DOI: 10.1007/s00239-012-9530-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2012] [Accepted: 10/29/2012] [Indexed: 11/28/2022]
Abstract
One prediction of the classic Ohno model of gene duplication predicts that new genes form from the asymmetric functional divergence of a newly arisen, redundant duplicate locus. In order to understand the mechanisms which give rise to functional divergence of newly formed dispersed duplicates, we assessed the expression and molecular evolutionary divergence of a suite of 19 highly similar dispersed duplicates in Arabidopsis thaliana. These duplicates have a K sil equal to or less than 5 % and are specific to the A. thaliana lineage; thus, they predictably represent some of the youngest duplicates in the A. thaliana genome. We found that the majority of young duplicate loci exhibit asymmetric expression patterns, with the daughter locus exhibiting reduced expression across all tissues analyzed relative to the progenitor locus or simply not expressed. Furthermore, daughter loci, on the whole, have significantly more nonsynonymous substitutions than the progenitor loci. We also identified four pairs of loci which exhibit significant (P < 0.05) evolutionary rate asymmetry, three of which exhibit elevated dN/dS in the duplicate copy. We suggest, based on these data, that functional diversification initially takes the form of asymmetric regulatory divergence that can be a direct consequence of the mode of duplication. The reduced and/or absence of expression in the daughter copy relaxes functional constraint on its protein coding sequence leading to the asymmetric accumulation of nonsynonymous mutations. Thus, our data both affirm Ohno's prediction while explaining the mechanism by which functional divergence initially occurs following duplication for dispersed gene duplicates.
Collapse
Affiliation(s)
- Sarah M Owens
- Botany Department, Miami University, Oxford, OH 45056, USA
| | | | | |
Collapse
|
38
|
Serbielle C, Dupas S, Perdereau E, Héricourt F, Dupuy C, Huguet E, Drezen JM. Evolutionary mechanisms driving the evolution of a large polydnavirus gene family coding for protein tyrosine phosphatases. BMC Evol Biol 2012; 12:253. [PMID: 23270369 PMCID: PMC3573978 DOI: 10.1186/1471-2148-12-253] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 12/11/2012] [Indexed: 11/20/2022] Open
Abstract
Background Gene duplications have been proposed to be the main mechanism involved in genome evolution and in acquisition of new functions. Polydnaviruses (PDVs), symbiotic viruses associated with parasitoid wasps, are ideal model systems to study mechanisms of gene duplications given that PDV genomes consist of virulence genes organized into multigene families. In these systems the viral genome is integrated in a wasp chromosome as a provirus and virus particles containing circular double-stranded DNA are injected into the parasitoids’ hosts and are essential for parasitism success. The viral virulence factors, organized in gene families, are required collectively to induce host immune suppression and developmental arrest. The gene family which encodes protein tyrosine phosphatases (PTPs) has undergone spectacular expansion in several PDV genomes with up to 42 genes. Results Here, we present strong indications that PTP gene family expansion occurred via classical mechanisms: by duplication of large segments of the chromosomally integrated form of the virus sequences (segmental duplication), by tandem duplications within this form and by dispersed duplications. We also propose a novel duplication mechanism specific to PDVs that involves viral circle reintegration into the wasp genome. The PTP copies produced were shown to undergo conservative evolution along with episodes of adaptive evolution. In particular recently produced copies have undergone positive selection in sites most likely involved in defining substrate selectivity. Conclusion The results provide evidence about the dynamic nature of polydnavirus proviral genomes. Classical and PDV-specific duplication mechanisms have been involved in the production of new gene copies. Selection pressures associated with antagonistic interactions with parasitized hosts have shaped these genes used to manipulate lepidopteran physiology with evidence for positive selection involved in adaptation to host targets.
Collapse
Affiliation(s)
- Céline Serbielle
- Institut de Recherche sur la Biologie de l'Insecte, UMR CNRS 7261, Faculté des Sciences et Techniques, Université F. Rabelais, Parc de Grandmont, 37200, Tours, France
| | | | | | | | | | | | | |
Collapse
|
39
|
Pavy N, Pelgas B, Laroche J, Rigault P, Isabel N, Bousquet J. A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers. BMC Biol 2012; 10:84. [PMID: 23102090 PMCID: PMC3519789 DOI: 10.1186/1741-7007-10-84] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/26/2012] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. RESULTS To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. CONCLUSIONS Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants.
Collapse
Affiliation(s)
- Nathalie Pavy
- Canada Research Chair in Forest and Environmental Genomics, Centre for Forest Research and Institute for Systems and Integrative Biology, Université Laval, Québec, Québec G1V 0A6, Canada.
| | | | | | | | | | | |
Collapse
|
40
|
Gagnon-Arsenault I, Marois Blanchet FC, Rochette S, Diss G, Dubé AK, Landry CR. Transcriptional divergence plays a role in the rewiring of protein interaction networks after gene duplication. J Proteomics 2012; 81:112-25. [PMID: 23063722 DOI: 10.1016/j.jprot.2012.09.038] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2012] [Revised: 09/26/2012] [Accepted: 09/30/2012] [Indexed: 01/27/2023]
Abstract
Gene duplication plays a key role in the evolution of protein-protein interaction (PPI) networks. After a gene duplication event, paralogous proteins may diverge through the gain and loss of PPIs. This divergence can be explained by two non-exclusive mechanisms. First, mutations may accumulate in the coding sequences of the paralogs and affect their protein sequences, which can modify, for instance, their binding interfaces and thus their interaction specificity. Second, mutations may accumulate in the non-coding region of the genes and affect their regulatory sequences. The resulting changes in expression profiles can lead to paralogous proteins being differentially expressed and occurring in the cell with different sets of potential interaction partners. These changes could also alter splicing regulation and lead to the inclusion or exclusion of alternative exons. The evolutionary role of these regulatory mechanisms remains largely unexplored. We use bioinformatics analyses of existing PPI data and proteome-wide PPI screening to show that the divergence of transcriptional regulation between paralogs plays a significant role in determining their PPI specificity. Because many gene duplication events are followed by rapid changes in transcriptional regulation, our results suggest that PPI networks may be rewired by gene duplication, without the need for protein to diverge in their binding specificities. This article is part of a Special Issue entitled: From protein structures to clinical applications.
Collapse
Affiliation(s)
- Isabelle Gagnon-Arsenault
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes, PROTEO, 1030 Avenue de la Médecine, Université Laval, Québec, Québec, Canada G1V 0A6
| | | | | | | | | | | |
Collapse
|
41
|
Garzón-Ospina D, López C, Forero-Rodríguez J, Patarroyo MA. Genetic diversity and selection in three Plasmodium vivax merozoite surface protein 7 (Pvmsp-7) genes in a Colombian population. PLoS One 2012; 7:e45962. [PMID: 23049905 PMCID: PMC3458108 DOI: 10.1371/journal.pone.0045962] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2012] [Accepted: 08/28/2012] [Indexed: 11/18/2022] Open
Abstract
A completely effective vaccine for malaria (one of the major infectious diseases worldwide) is not yet available; different membrane proteins involved in parasite-host interactions have been proposed as candidates for designing it. It has been found that proteins encoded by the merozoite surface protein (msp)-7 multigene family are antibody targets in natural infection; the nucleotide diversity of three Pvmsp-7 genes was thus analyzed in a Colombian parasite population. By contrast with P. falciparum msp-7 loci and ancestral P. vivax msp-7 genes, specie-specific duplicates of the latter specie display high genetic variability, generated by single nucleotide polymorphisms, repeat regions, and recombination. At least three major allele types are present in Pvmsp-7C, Pvmsp-7H and Pvmsp-7I and positive selection seems to be operating on the central region of these msp-7 genes. Although this region has high genetic polymorphism, the C-terminus (Pfam domain ID: PF12948) is conserved and could be an important candidate when designing a subunit-based antimalarial vaccine.
Collapse
Affiliation(s)
- Diego Garzón-Ospina
- Fundación Instituto de Inmunología de Colombia – FIDIC, Bogotá DC, Colombia
- Microbiology postgraduate program, Universidad Nacional de Colombia, Bogotá DC, Colombia
- School of Medicine and Health Sciences, Universidad del Rosario, Bogotá DC, Colombia
| | - Carolina López
- Fundación Instituto de Inmunología de Colombia – FIDIC, Bogotá DC, Colombia
- Microbiology postgraduate program, Universidad Nacional de Colombia, Bogotá DC, Colombia
- School of Medicine and Health Sciences, Universidad del Rosario, Bogotá DC, Colombia
| | | | - Manuel A. Patarroyo
- Fundación Instituto de Inmunología de Colombia – FIDIC, Bogotá DC, Colombia
- School of Medicine and Health Sciences, Universidad del Rosario, Bogotá DC, Colombia
- * E-mail:
| |
Collapse
|
42
|
Katju V. In with the old, in with the new: the promiscuity of the duplication process engenders diverse pathways for novel gene creation. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:341932. [PMID: 23008799 PMCID: PMC3449122 DOI: 10.1155/2012/341932] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 06/03/2012] [Indexed: 01/26/2023]
Abstract
The gene duplication process has exhibited far greater promiscuity in the creation of paralogs with novel exon-intron structures than anticipated even by Ohno. In this paper I explore the history of the field, from the neo-Darwinian synthesis through Ohno's formulation of the canonical model for the evolution of gene duplicates and culminating in the present genomic era. I delineate the major tenets of Ohno's model and discuss its failure to encapsulate the full complexity of the duplication process as revealed in the era of genomics. I discuss the diverse classes of paralogs originating from both DNA- and RNA-mediated duplication events and their evolutionary potential for assuming radically altered functions, as well as the degree to which they can function unconstrained from the pressure of gene conversion. Lastly, I explore theoretical population-genetic considerations of how the effective population size (N(e)) of a species may influence the probability of emergence of genes with radically altered functions.
Collapse
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| |
Collapse
|
43
|
Mechanisms of Gene Duplication and Translocation and Progress towards Understanding Their Relative Contributions to Animal Genome Evolution. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:846421. [PMID: 22919542 PMCID: PMC3420103 DOI: 10.1155/2012/846421] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2012] [Revised: 05/30/2012] [Accepted: 06/27/2012] [Indexed: 01/10/2023]
Abstract
Duplication of genetic material is clearly a major route to genetic change, with consequences for both evolution and disease. A variety of forms and mechanisms of duplication are recognised, operating across the scales of a few base pairs upto entire genomes. With the ever-increasing amounts of gene and genome sequence data that are becoming available, our understanding of the extent of duplication is greatly improving, both in terms of the scales of duplication events as well as their rates of occurrence. An accurate understanding of these processes is vital if we are to properly understand important events in evolution as well as mechanisms operating at the level of genome organisation. Here we will focus on duplication in animal genomes and how the duplicated sequences are distributed, with the aim of maintaining a focus on principles of evolution and organisation that are most directly applicable to the shaping of our own genome.
Collapse
|
44
|
Abstract
New genes are a major source of genetic innovation in genomes. However, until recently, understanding how new genes originate and how they evolve was hampered by the lack of appropriate genetic datasets. The advent of the genomic era brought about a revolution in the amount of data available to study new genes. For the first time, decades-old theoretical principles could be tested empirically and novel and unexpected avenues of research opened up. This chapter explores how genomic data can and is being used to study both the origin and evolution of new genes and the surprising discoveries made thus far.
Collapse
|
45
|
Marotta M, Piontkivska H, Tanaka H. Molecular trajectories leading to the alternative fates of duplicate genes. PLoS One 2012; 7:e38958. [PMID: 22720000 PMCID: PMC3375281 DOI: 10.1371/journal.pone.0038958] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 05/14/2012] [Indexed: 11/21/2022] Open
Abstract
Gene duplication generates extra gene copies in which mutations can accumulate without risking the function of pre-existing genes. Such mutations modify duplicates and contribute to evolutionary novelties. However, the vast majority of duplicates appear to be short-lived and experience duplicate silencing within a few million years. Little is known about the molecular mechanisms leading to these alternative fates. Here we delineate differing molecular trajectories of a relatively recent duplication event between humans and chimpanzees by investigating molecular properties of a single duplicate: DNA sequences, gene expression and promoter activities. The inverted duplication of the Glutathione S-transferase Theta 2 (GSTT2) gene had occurred at least 7 million years ago in the common ancestor of African great apes and is preserved in chimpanzees (Pan troglodytes), whereas a deletion polymorphism is prevalent in humans. The alternative fates are associated with expression divergence between these species, and reduced expression in humans is regulated by silencing mutations that have been propagated between duplicates by gene conversion. In contrast, selective constraint preserved duplicate divergence in chimpanzees. The difference in evolutionary processes left a unique DNA footprint in which dying duplicates are significantly more similar to each other (99.4%) than preserved ones. Such molecular trajectories could provide insights for the mechanisms underlying duplicate life and death in extant genomes.
Collapse
Affiliation(s)
- Michael Marotta
- Department of Molecular Genetics, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America
| | - Helen Piontkivska
- Department of Biological Sciences, Kent State University, Kent, Ohio, United States of America
| | - Hisashi Tanaka
- Department of Molecular Genetics, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America
| |
Collapse
|
46
|
Audemard E, Schiex T, Faraut T. Detecting long tandem duplications in genomic sequences. BMC Bioinformatics 2012; 13:83. [PMID: 22568762 PMCID: PMC3464658 DOI: 10.1186/1471-2105-13-83] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2011] [Accepted: 05/08/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. RESULTS In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS < 1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. CONCLUSIONS ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.
Collapse
Affiliation(s)
- Eric Audemard
- Unité de Biométrie et Intelligence Artificielle, UR 875, INRA, Toulouse, France.
| | | | | |
Collapse
|
47
|
Assis R, Kondrashov AS. A strong deletion bias in nonallelic gene conversion. PLoS Genet 2012; 8:e1002508. [PMID: 22359514 PMCID: PMC3280953 DOI: 10.1371/journal.pgen.1002508] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2011] [Accepted: 12/12/2011] [Indexed: 11/19/2022] Open
Abstract
Gene conversion is the unidirectional transfer of genetic information between orthologous (allelic) or paralogous (nonallelic) genomic segments. Though a number of studies have examined nucleotide replacements, little is known about length difference mutations produced by gene conversion. Here, we investigate insertions and deletions produced by nonallelic gene conversion in 338 Drosophila and 10,149 primate paralogs. Using a direct phylogenetic approach, we identify 179 insertions and 614 deletions in Drosophila paralogs, and 132 insertions and 455 deletions in primate paralogs. Thus, nonallelic gene conversion is strongly deletion-biased in both lineages, with almost 3.5 times as many conversion-induced deletions as insertions. In primates, the deletion bias is considerably stronger for long indels and, in both lineages, the per-site rate of gene conversion is orders of magnitudes higher than that of ordinary mutation. Due to this high rate, deletion-biased nonallelic gene conversion plays a key role in genome size evolution, leading to the cooperative shrinkage and eventual disappearance of selectively neutral paralogs. Gene conversion is a process whereby a DNA sequence is copied from one segment of the genome (donor) to another (recipient), resulting in the replacement, insertion, or deletion of a DNA sequence in the recipient. This exchange is facilitated by the high sequence similarity of the two segments, which is due to their evolutionary relationship. Here, we study insertions and deletions produced by gene conversion between paralogs, segments related by DNA duplication events. By comparing paralog sequences in multiple species of fruit flies and primates, we find that deletions occur more than three times as frequently as insertions. We also discover that the rate of gene conversion between paralogs is quite high. The deletion bias and high rate of this process causes paralogs to shrink cooperatively and eventually be eliminated from the genome. Because of the abundance of paralogs in animal genomes, this phenomenon can lead to a significant reduction in genome size. Therefore, our finding enhances our understanding of the forces that lead to changes in genome size during evolution.
Collapse
Affiliation(s)
- Raquel Assis
- Department of Integrative Biology, Center for Theoretical Evolutionary Genomics, University of California Berkeley, Berkeley, California, USA.
| | | |
Collapse
|
48
|
Abstract
Gene duplication is arguably the most significant source of new functional genetic material. A better understanding of the processes that lead to the stable incorporation of gene duplications into the genome is important both because it relates to interspecific differences in genome composition and because it can shed light on why some classes of gene are more prone to duplication than others. Typically, models of gene duplication consider the periods before duplication, during the spread and fixation of a new duplicate, and following duplication as distinct phases without a common underlying selective environment. I consider a scenario where a gene that is initially expressed in multiple contexts can undergo mutations that alter its expression profile or its functional coding sequence. The selective regime that acts on the functional output of the allele copies carried by an individual is constant. If there is a potential selective benefit to having different coding sequences expressed in each context, then, regardless of the constraints on functional variation at the single-locus gene, the waiting time until a gene duplication is incorporated goes down as population size increases.
Collapse
|
49
|
Fischer SEJ, Montgomery TA, Zhang C, Fahlgren N, Breen PC, Hwang A, Sullivan CM, Carrington JC, Ruvkun G. The ERI-6/7 helicase acts at the first stage of an siRNA amplification pathway that targets recent gene duplications. PLoS Genet 2011; 7:e1002369. [PMID: 22102828 PMCID: PMC3213143 DOI: 10.1371/journal.pgen.1002369] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2011] [Accepted: 09/19/2011] [Indexed: 01/19/2023] Open
Abstract
Endogenous small interfering RNAs (siRNAs) are a class of naturally occuring regulatory RNAs found in fungi, plants, and animals. Some endogenous siRNAs are required to silence transposons or function in chromosome segregation; however, the specific roles of most endogenous siRNAs are unclear. The helicase gene eri-6/7 was identified in the nematode Caenorhabditis elegans by the enhanced response to exogenous double-stranded RNAs (dsRNAs) of the null mutant. eri-6/7 encodes a helicase homologous to small RNA factors Armitage in Drosophila, SDE3 in Arabidopsis, and Mov10 in humans. Here we show that eri-6/7 mutations cause the loss of 26-nucleotide (nt) endogenous siRNAs derived from genes and pseudogenes in oocytes and embryos, as well as deficiencies in somatic 22-nucleotide secondary siRNAs corresponding to the same loci. About 80 genes are eri-6/7 targets that generate the embryonic endogenous siRNAs that silence the corresponding mRNAs. These 80 genes share extensive nucleotide sequence homology and are poorly conserved, suggesting a role for these endogenous siRNAs in silencing of and thereby directing the fate of recently acquired, duplicated genes. Unlike most endogenous siRNAs in C. elegans, eri-6/7–dependent siRNAs require Dicer. We identify that the eri-6/7–dependent siRNAs have a passenger strand that is ∼19 nt and is inset by ∼3–4 nts from both ends of the 26 nt guide siRNA, suggesting non-canonical Dicer processing. Mutations in the Argonaute ERGO-1, which associates with eri-6/7–dependent 26 nt siRNAs, cause passenger strand stabilization, indicating that ERGO-1 is required to separate the siRNA duplex, presumably through endonucleolytic cleavage of the passenger strand. Thus, like several other siRNA–associated Argonautes with a conserved RNaseH motif, ERGO-1 appears to be required for siRNA maturation. Endogenous small interfering RNAs (siRNAs) are a class of small RNAs present in fungi, plants, and animals. Small RNAs, including microRNAs, are known to regulate the expression levels of genes, silence invading elements such as transposons, and act in cell division. However, the function of many endogenous siRNAs is unknown. We have found that the ERI-6/7 helicase is required for a subset of endogenous siRNAs present in the nematode Caenorhabditis elegans. The ERI-6/7 helicase acts in a pathway together with the Argonaute protein ERGO-1 to produce two types of siRNAs: a primary class of 26 nucleotides in length present in oocytes and embryos, and a class of 22 nucleotide siRNAs present in later stages of development. These siRNAs correspond to only about one hundred genes. Interestingly, we found that these genes fall into groups of genes that contain nearly identical DNA sequences. The sequences of these genes are not conserved in other organisms, not even in related nematodes. These results point to a potential function of these endogenous siRNAs: silencing of recently acquired, duplicated genes. Our work demonstrates a new role of small RNAs, different from known functions in transposon silencing and regulation of gene expression.
Collapse
Affiliation(s)
- Sylvia E. J. Fischer
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Taiowa A. Montgomery
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Chi Zhang
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Noah Fahlgren
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, United States of America
| | - Peter C. Breen
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alexia Hwang
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Christopher M. Sullivan
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, United States of America
| | - James C. Carrington
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, United States of America
| | - Gary Ruvkun
- Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
50
|
Bu L, Bergthorsson U, Katju V. Local synteny and codon usage contribute to asymmetric sequence divergence of Saccharomyces cerevisiae gene duplicates. BMC Evol Biol 2011; 11:279. [PMID: 21955875 PMCID: PMC3190396 DOI: 10.1186/1471-2148-11-279] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Accepted: 09/28/2011] [Indexed: 11/10/2022] Open
Abstract
Background Duplicated genes frequently experience asymmetric rates of sequence evolution. Relaxed selective constraints and positive selection have both been invoked to explain the observation that one paralog within a gene-duplicate pair exhibits an accelerated rate of sequence evolution. In the majority of studies where asymmetric divergence has been established, there is no indication as to which gene copy, ancestral or derived, is evolving more rapidly. In this study we investigated the effect of local synteny (gene-neighborhood conservation) and codon usage on the sequence evolution of gene duplicates in the S. cerevisiae genome. We further distinguish the gene duplicates into those that originated from a whole-genome duplication (WGD) event (ohnologs) versus small-scale duplications (SSD) to determine if there exist any differences in their patterns of sequence evolution. Results For SSD pairs, the derived copy evolves faster than the ancestral copy. However, there is no relationship between rate asymmetry and synteny conservation (ancestral-like versus derived-like) in ohnologs. mRNA abundance and optimal codon usage as measured by the CAI is lower in the derived SSD copies relative to ancestral paralogs. Moreover, in the case of ohnologs, the faster-evolving copy has lower CAI and lowered expression. Conclusions Together, these results suggest that relaxation of selection for codon usage and gene expression contribute to rate asymmetry in the evolution of duplicated genes and that in SSD pairs, the relaxation of selection stems from the loss of ancestral regulatory information in the derived copy.
Collapse
Affiliation(s)
- Lijing Bu
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| | | | | |
Collapse
|