1. Cellier MFM. Nramp: Deprive and conquer? Front Cell Dev Biol 2022; 10:988866. [PMID: 36313567 PMCID: PMC9606685 DOI: 10.3389/fcell.2022.988866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2022] [Accepted: 09/20/2022] [Indexed: 11/13/2022]
Abstract
Solute carriers 11 (Slc11) evolved from bacterial permease (MntH) to eukaryotic antibacterial defense (Nramp) while continuously mediating proton (H+)-dependent manganese (Mn2+) import. Also, Nramp horizontal gene transfer (HGT) toward bacteria led to mntH polyphyly. Prior demonstration that evolutionary rate-shifts distinguishing Slc11 from outgroup carriers dictate catalytic specificity suggested that resolving the Slc11 family tree may provide a function-aware phylogenetic framework. Hence, MntH C (MC) subgroups resulted from HGTs of prototype Nramp (pNs) paralogs while archetype Nramp (aNs) correlated with phagocytosis. PHI-Blast based taxonomic profiling confirmed that the MntH B phylogroup is confined to anaerobic bacteria vs. MntH A (MA)'s broad distribution; suggested niche-related spread of MC subgroups; and established that MA-variant MH, which carries 'eukaryotic signature' marks, predominates in archaea. Slc11 phylogeny shows MH is sister to Nramp. Site-specific analysis of the Slc11 charge network known to interact with the protonmotive force demonstrates sequential rate-shifts that recapitulate Slc11 evolution. 3D mapping of similarly coevolved sites across the Slc11 hydrophobic core revealed successive targeting of discrete areas. The data imply that pN HGT could advantage recipient bacteria for H+-dependent Mn2+ acquisition, and AlphaFold 3D models suggest conformational divergence among MC subgroups. It is proposed that Slc11 originated as a bacterial stress resistance function allowing Mn2+-dependent persistence in conditions adverse for growth, and that archaeal MH could contribute to eukaryogenesis as a Mn2+ sequestering defense perhaps favoring intracellular growth-competent bacteria.
2. Williams AM, Carter OG, Forsythe ES, Mendoza HK, Sloan DB. Gene duplication and rate variation in the evolution of plastid ACCase and Clp genes in angiosperms. Mol Phylogenet Evol 2022; 168:107395. [PMID: 35033670 PMCID: PMC9673162 DOI: 10.1016/j.ympev.2022.107395] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/10/2021] [Revised: 11/16/2021] [Accepted: 12/13/2021] [Indexed: 11/19/2022]
Abstract
While the chloroplast (plastid) is known for its role in photosynthesis, it is also involved in many other metabolic pathways essential for plant survival. As such, plastids contain an extensive suite of enzymes required for non-photosynthetic processes. The evolution of the associated genes has been especially dynamic in flowering plants (angiosperms), including examples of gene duplication and extensive rate variation. We examined the role of ongoing gene duplication in two key plastid enzymes, the acetyl-CoA carboxylase (ACCase) and the caseinolytic protease (Clp), responsible for fatty acid biosynthesis and protein turnover, respectively. In plants, there are two ACCase complexes: a homomeric version present in the cytosol and a heteromeric version present in the plastid. Duplications of the nuclear-encoded homomeric ACCase gene and retargeting of one resultant protein to the plastid have been previously reported in multiple species. We find that these retargeted homomeric ACCase proteins exhibit elevated rates of sequence evolution, consistent with neofunctionalization and/or relaxation of selection. The plastid Clp complex catalytic core is composed of nine paralogous proteins that arose via ancient gene duplication in the cyanobacterial/plastid lineage. We show that further gene duplication occurred more recently in the nuclear-encoded core subunits of this complex, yielding additional paralogs in many species of angiosperms. Moreover, in six of eight cases, subunits that have undergone recent duplication display increased rates of sequence evolution relative to those that have remained single copy. We also compared substitution patterns between pairs of Clp core paralogs to gain insight into post-duplication evolutionary routes. These results show that gene duplication and rate variation continue to shape the plastid proteome.
Affiliation(s)
- Alissa M Williams
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States; Program in Cell and Molecular Biology, Colorado State University, Fort Collins, CO 80523, United States.
- Olivia G Carter
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States
- Evan S Forsythe
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States
- Hannah K Mendoza
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States
3. Begum T, Serrano‐Serrano ML, Robinson‐Rechavi M. Performance of a phylogenetic independent contrast method and an improved pairwise comparison under different scenarios of trait evolution after speciation and duplication. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13680] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/28/2022]
Affiliation(s)
- Tina Begum
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Martha Liliana Serrano‐Serrano
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Marc Robinson‐Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
4. Begum T, Robinson-Rechavi M. Special Care Is Needed in Applying Phylogenetic Comparative Methods to Gene Trees with Speciation and Duplication Nodes. Mol Biol Evol 2021; 38:1614-1626. [PMID: 33169790 PMCID: PMC8042747 DOI: 10.1093/molbev/msaa288] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 12/03/2022]
Abstract
How gene function evolves is a central question of evolutionary biology. It can be investigated by comparing functional genomics results between species and between genes. Most comparative studies of functional genomics have used pairwise comparisons. Yet it has been shown that this can provide biased results, as genes, like species, are phylogenetically related. Phylogenetic comparative methods should be used to correct for this, but they depend on strong assumptions, including unbiased tree estimates relative to the hypothesis being tested. Such methods have recently been used to test the “ortholog conjecture,” the hypothesis that functional evolution is faster in paralogs than in orthologs. Although pairwise comparisons of tissue specificity (τ) provided support for the ortholog conjecture, phylogenetic independent contrasts did not. Our reanalysis on the same gene trees identified problems with the time calibration of duplication nodes. We find that the gene trees used suffer from important biases, due to the inclusion of trees with no duplication nodes, to the relative age of speciations and duplications, to systematic differences in branch lengths, and to non-Brownian motion of tissue specificity on many trees. We find that incorrect implementation of phylogenetic methods on empirical gene trees with duplications can be problematic. Controlling for biases allows successful use of phylogenetic methods to study the evolution of gene function and provides some support for the ortholog conjecture using three different phylogenetic approaches.
Affiliation(s)
- Tina Begum
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland; SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland; SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
5. Recurrent sequence evolution after independent gene duplication. BMC Evol Biol 2020; 20:98. [PMID: 32770961 PMCID: PMC7414715 DOI: 10.1186/s12862-020-01660-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 01/17/2020] [Accepted: 07/17/2020] [Indexed: 11/10/2022]
Abstract
Background: Convergent and parallel evolution provide unique insights into the mechanisms of natural selection. Some of the most striking convergent and parallel (collectively recurrent) amino acid substitutions in proteins are adaptive, but there are also many that are selectively neutral. Accordingly, genome-wide assessment has shown that recurrent sequence evolution in orthologs is chiefly explained by nearly neutral evolution. For paralogs, more frequent functional change is expected because additional copies are generally not retained if they do not acquire their own niche. Yet, it is unknown to what extent recurrent sequence differentiation is discernible after independent gene duplications in different eukaryotic taxa. Results: We develop a framework that detects patterns of recurrent sequence evolution in duplicated genes. This is used to analyze the genomes of 90 diverse eukaryotes. We find a remarkable number of families with a potentially predictable functional differentiation following gene duplication. In some protein families, more than ten independent duplications show a similar sequence-level differentiation between paralogs. Based on further analysis, the sequence divergence is found to be generally asymmetric. Moreover, about 6% of the recurrent sequence evolution between paralog pairs can be attributed to recurrent differentiation of subcellular localization. Finally, we reveal the specific recurrent patterns for the gene families Hint1/Hint2, Sco1/Sco2 and vma11/vma3. Conclusions: The presented methodology provides a means to study the biochemical underpinning of functional differentiation between paralogs. For instance, two abundantly repeated substitutions are identified between independently derived Sco1 and Sco2 paralogs. Such identified substitutions allow direct experimental testing of the biological role of these residues for the repeated functional differentiation. We also uncover a diverse set of families with recurrent sequence evolution and reveal trends in the functional and evolutionary trajectories of this hitherto understudied phenomenon.
6. Alvarez-Ponce D, Feyertag F, Chakraborty S. Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network. Genome Biol Evol 2018; 9:1742-1756. [PMID: 28854629 PMCID: PMC5570066 DOI: 10.1093/gbe/evx117] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Accepted: 07/01/2017] [Indexed: 02/06/2023]
Abstract
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution has been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network, a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and of the limitations (biases and errors) of interactomic data sets.
7. Positive diversifying selection is a pervasive adaptive force throughout the Drosophila radiation. Mol Phylogenet Evol 2017; 112:230-243. [DOI: 10.1016/j.ympev.2017.04.023] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 11/25/2016] [Revised: 04/26/2017] [Accepted: 04/26/2017] [Indexed: 01/02/2023]
8. Noda-Garcia L, Romero Romero ML, Longo LM, Kolodkin-Gal I, Tawfik DS. Bacilli glutamate dehydrogenases diverged via coevolution of transcription and enzyme regulation. EMBO Rep 2017; 18:1139-1149. [PMID: 28468957 DOI: 10.15252/embr.201743990] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Received: 01/25/2017] [Revised: 03/23/2017] [Accepted: 03/27/2017] [Indexed: 12/29/2022]
Abstract
The linkage between regulatory elements of transcription, such as promoters, and their protein products is central to gene function. Promoter-protein coevolution is therefore expected, but rarely observed, and the manner by which these two regulatory levels are linked remains largely unknown. We study glutamate dehydrogenase, a hub of carbon and nitrogen metabolism. In Bacillus subtilis, two paralogues exist: GudB is constitutively transcribed whereas RocG is tightly regulated. In their active, oligomeric states, both enzymes show similar enzymatic rates. However, swaps of enzymes and promoters cause severe fitness losses, thus indicating promoter-enzyme coevolution. Characterization of the proteins shows that, compared to RocG, GudB's enzymatic activity is highly dependent on glutamate and pH. Promoter-enzyme swaps therefore result in excessive glutamate degradation when expressing a constitutive enzyme under a constitutive promoter, or insufficient activity when both the enzyme and its promoter are tightly regulated. Coevolution of transcriptional and enzymatic regulation therefore underlies paralogue-specific spatio-temporal control, especially under diverse growth conditions.
Affiliation(s)
- Lianet Noda-Garcia
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
- Liam M Longo
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
- Ilana Kolodkin-Gal
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
9. Comparing the Statistical Fate of Paralogous and Orthologous Sequences. Genetics 2016; 204:475-482. [PMID: 27474728 DOI: 10.1534/genetics.116.193912] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 06/01/2016] [Accepted: 07/26/2016] [Indexed: 02/01/2023]
Abstract
For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance, finding homologous sequences with a known function in large databases is used to get insight into the function of nonannotated genomic regions. Very efficient tools like BLAST have been developed to identify and rank possible homologous sequences. To estimate the significance of the homology, the ranking of alignment scores takes a background model for random sequences into account. Using this model we can estimate the probability of finding two exactly matching subsequences by chance in two unrelated sequences. For two homologous sequences, the corresponding probability is much higher, which allows us to identify them. Here we focus on the distribution of lengths of exact sequence matches between protein-coding regions of pairs of evolutionarily distant genomes. We show that this distribution exhibits a power-law tail with an exponent [Formula: see text]. Developing a simple model of sequence evolution by substitutions and segmental duplications, we show analytically and computationally that paralogous and orthologous gene pairs contribute differently to this distribution. Our model explains the differences observed in the comparison of coding and noncoding parts of genomes, thus providing a better understanding of statistical properties of genomic sequences and their evolution.
10. Mei Q, Sadovy Y, Dvornyk V. Molecular evolution of cryptochromes in fishes. Gene 2015; 574:112-20. [PMID: 26238701 DOI: 10.1016/j.gene.2015.07.086] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 02/10/2015] [Revised: 07/03/2015] [Accepted: 07/30/2015] [Indexed: 11/18/2022]
Abstract
Circadian rhythmicity is an endogenous biological cycle of about 24 h, which exists in cyanobacteria and fungi, plants and animals. Circadian rhythms improve the adaptability of organisms in both constant and changing environments. The cryptochrome (CRY) is a key element of the circadian system in various animal groups including fishes. We studied the evolution of cryptochromes in phylogenetically and ecologically diverse fish taxa. The phylogenetic tree of fish Cry features two major clades: Cry1 and Cry2. Teleosts possess extra copies of Cry1 due to the genome duplication, which resulted in 3 main paralogous subfamilies (1A, 1B and 1C). Cry1 experienced further diversification through additional duplications in some taxa. 1A of Cry1 is more conserved than the other paralogs (dN=0.010 ± 0.003, π=0.119 ± 0.058). The analysis of selection indicated that, while the Cry homologs in fish evolved under different levels of selection pressure, strong purifying selection (average ω=0.017) dominated in their evolution.
Affiliation(s)
- Qiming Mei
- Key Laboratory of Vegetation Restoration and Management of Degraded Ecosystems, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, People's Republic of China
- Yvonne Sadovy
- School of Biological Sciences, University of Hong Kong, Pokfulam Rd., Hong Kong, SAR, People's Republic of China
- Volodymyr Dvornyk
- School of Biological Sciences, University of Hong Kong, Pokfulam Rd., Hong Kong, SAR, People's Republic of China; Department of Life Sciences, College of Science and General Studies, Alfaisal University, Riyadh, Saudi Arabia.
11. Gasparini F, Skobo T, Benato F, Gioacchini G, Voskoboynik A, Carnevali O, Manni L, Dalla Valle L. Characterization of Ambra1 in asexual cycle of a non-vertebrate chordate, the colonial tunicate Botryllus schlosseri, and phylogenetic analysis of the protein group in Bilateria. Mol Phylogenet Evol 2015; 95:46-57. [PMID: 26611831 DOI: 10.1016/j.ympev.2015.11.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 08/18/2015] [Revised: 11/02/2015] [Accepted: 11/03/2015] [Indexed: 12/19/2022]
Abstract
Ambra1 is a positive regulator of autophagy, a lysosome-mediated degradative process involved in both physiological and pathological conditions. To date, Ambra1 has been characterized only in mammals and zebrafish. Through bioinformatics searches and targeted cloning, we report the identification of the complete Ambra1 transcript in a non-vertebrate chordate, the tunicate Botryllus schlosseri. Tunicata is the sister group of Vertebrata and the only chordate group possessing species that also reproduce by blastogenesis (asexual reproduction). The deduced amino acid sequence of B. schlosseri Ambra1 is shorter than those of vertebrate homologues but still contains the typical WD40 domain. qPCR analyses revealed that the level of B. schlosseri Ambra1 transcription is temporally regulated along the colonial blastogenetic cycle. By means of similarity searches we identified Wdr5 and Katnb1 as proteins evolutionarily associated with Ambra1. Phylogenetic analyses on Bilateria indicate that: (i) Wdr5 is the most related to Ambra1, so that they may derive from an ancestral gene, (ii) Ambra1 forms a group of ancient genes evolved before the radiation of the taxon, (iii) these orthologous Ambra1 share the two conserved WD40/YVTN repeat-like-containing domains, and (iv) they are characterized by ancient duplications of WD40 repeats within the N-terminal domain.
Affiliation(s)
- Fabio Gasparini
- Department of Biology, University of Padova, Via Ugo Bassi 35131 Padova, Italy.
- Tatjana Skobo
- Department of Biology, University of Padova, Via Ugo Bassi 35131 Padova, Italy.
- Francesca Benato
- Department of Biology, University of Padova, Via Ugo Bassi 35131 Padova, Italy.
- Giorgia Gioacchini
- Department of Life Science and Environment, Marche Polytechnic University, Via Brecce Bianche, 60131 Ancona, Italy.
- Ayelet Voskoboynik
- Department of Pathology, Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, 265 Campus Drive, 3rd Floor, CA 94305, Stanford, United States.
- Oliana Carnevali
- Department of Life Science and Environment, Marche Polytechnic University, Via Brecce Bianche, 60131 Ancona, Italy.
- Lucia Manni
- Department of Biology, University of Padova, Via Ugo Bassi 35131 Padova, Italy.
- Luisa Dalla Valle
- Department of Biology, University of Padova, Via Ugo Bassi 35131 Padova, Italy.
12. Liu SL, Pan AQ, Adams KL. Protein subcellular relocalization of duplicated genes in Arabidopsis. Genome Biol Evol 2014; 6:2501-15. [PMID: 25193306 PMCID: PMC4202327 DOI: 10.1093/gbe/evu191] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Indexed: 02/06/2023]
Abstract
Gene duplications during eukaryotic evolution, by successive rounds of polyploidy and by smaller scale duplications, have provided an enormous reservoir of new genes for the evolution of new functions. Preservation of many duplicated genes can be ascribed to changes in sequences, expression patterns, and functions. Protein subcellular relocalization (protein targeting to a new location within the cell) is another way that duplicated genes can diverge. We studied subcellular relocalization of gene pairs duplicated during the evolution of the Brassicaceae including gene pairs from the alpha whole genome duplication that occurred at the base of the family. We analyzed experimental localization data from green fluorescent protein experiments for 128 duplicate pairs in Arabidopsis thaliana, revealing 19 pairs with subcellular relocalization. Many more of the duplicate pairs with relocalization than with the same localization showed an accelerated rate of amino acid sequence evolution in one duplicate, and one gene showed evidence for positive selection. We studied six duplicate gene pairs in more detail. We used gene family analysis with several pairs to infer which gene shows relocalization. We identified potential sequence mutations through comparative analysis that likely result in relocalization of two duplicated gene products. We show that four cases of relocalization have new expression patterns, compared with orthologs in outgroup species, including two with novel expression in pollen. This study provides insights into subcellular relocalization of evolutionarily recent gene duplicates and features of genes whose products have been relocalized.
Affiliation(s)
- Shao-Lun Liu
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada. Present address: Department of Life Science, Tunghai University, Taichung, Taiwan
- An Qi Pan
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada. Present address: Mintec Inc., Vancouver, BC, Canada
- Keith L Adams
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
13. Pich I Roselló O, Kondrashov FA. Long-term asymmetrical acceleration of protein evolution after gene duplication. Genome Biol Evol 2014; 6:1949-55. [PMID: 25070510 PMCID: PMC4159008 DOI: 10.1093/gbe/evu159] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Indexed: 11/13/2022]
Abstract
Rapid divergence of gene copies after duplication is thought to determine the fate of the copies and evolution of novel protein functions. However, data on how long the gene copies continue to experience an elevated rate of evolution remain scarce. Standard theory of gene duplications based on some level of genetic redundancy of gene copies predicts that the period of accelerated evolution must end relatively quickly. Using a maximum-likelihood approach we estimate preduplication, initial postduplication, and recent postduplication rates of evolution that occurred in the mammalian lineage. We find that both gene copies experience an acceleration of similar magnitude in their rate of evolution. The copy located in the original genomic position typically returns to the preduplication rates of evolution in a short period of time. The burst of faster evolution of the copy that is located in a new genomic position typically lasts longer. Furthermore, the fast-evolving copies on average continue to evolve faster than the preduplication rates far longer than predicted by standard theory of gene duplications. We hypothesize that the prolonged elevated rates of evolution are determined by functional properties that were acquired during, or soon after, the gene duplication event.
Affiliation(s)
- Oriol Pich I Roselló
- Facultat de Medicina, Universitat de Barcelona (UB), Spain; Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Fyodor A Kondrashov
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
14. Pegueroles C, Laurie S, Albà MM. Accelerated evolution after gene duplication: a time-dependent process affecting just one copy. Mol Biol Evol 2013; 30:1830-42. [PMID: 23625888 DOI: 10.1093/molbev/mst083] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Indexed: 12/19/2022]
Abstract
Gene duplication is widely regarded as a major mechanism modeling genome evolution and function. However, the mechanisms that drive the evolution of the two, initially redundant, gene copies are still ill defined. Many gene duplicates experience evolutionary rate acceleration, but the relative contribution of positive selection and random drift to the retention and subsequent evolution of gene duplicates, and for how long the molecular clock may be distorted by these processes, remains unclear. Focusing on rodent genes that duplicated before and after the mouse and rat split, we find significantly increased sequence divergence after duplication in only one of the copies, which in nearly all cases corresponds to the novel daughter copy, independent of the mechanism of duplication. We observe that the evolutionary rate of the accelerated copy, measured as the ratio of nonsynonymous to synonymous substitutions, is on average 5-fold higher in the period spanning 4-12 My after the duplication than it was before the duplication. This increase can be explained, at least in part, by the action of positive selection according to the results of the maximum likelihood-based branch-site test. Subsequently, the rate decelerates until purifying selection completely returns to preduplication levels. Reversion to the original rates has already been accomplished 40.5 My after the duplication event, corresponding to a genetic distance of about 0.28 synonymous substitutions per site. Differences in tissue gene expression patterns parallel those of substitution rates, reinforcing the role of neofunctionalization in explaining the evolution of young gene duplicates.
Affiliation(s)
- Cinta Pegueroles
- Evolutionary Genomics Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Research Institute (IMIM), Universitat Pompeu Fabra (UPF), Barcelona, Spain
15. Katju V. To the beat of a different drum: determinants implicated in the asymmetric sequence divergence of Caenorhabditis elegans paralogs. BMC Evol Biol 2013; 13:73. [PMID: 23530733 PMCID: PMC3637608 DOI: 10.1186/1471-2148-13-73] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 11/22/2012] [Accepted: 03/20/2013] [Indexed: 12/18/2022]
Abstract
Background: Gene duplicates often exhibit asymmetric rates of molecular evolution in their early evolutionary existence. This asymmetry in rates is thought to signify the maintenance of the ancestral function by one copy and the removal of functional constraint on the other copy, enabling it to embark on a novel evolutionary trajectory. Here I focused on a large population of evolutionarily young gene duplicates (KS ≤ 0.14) in the Caenorhabditis elegans genome in order to conduct the first combined analysis of four predictors (evolutionary age, chromosomal location, structural resemblance between duplicates, and duplication span) which may be implicated in the asymmetric sequence divergence of paralogs at the nucleotide and amino acid level. In addition, I investigate if either paralog is equally likely to embark on a trajectory of accelerated sequence evolution or whether the derived paralog is more likely to exhibit faster sequence evolution. Results: Three predictors (evolutionary age of duplicates, chromosomal location and duplication span) serve as major determinants of sequence asymmetry between C. elegans paralogs. Paralogs diverge asymmetrically in sequence with increasing evolutionary age, the relocation of one copy to a different chromosome and attenuated duplication spans that likely fail to capture the entire ancestral repertoire of coding sequence and regulatory elements. Furthermore, for paralogs residing on the same chromosome, opposite transcriptional orientation and increased genomic distance do not increase sequence asymmetry between paralogs. For a subset of duplicate pairs wherein the ancestral versus derived paralog could be distinguished, the derived paralogs are more likely to evolve at accelerated rates. Conclusions: This genome-wide study of evolutionarily young duplicates stemming primarily from DNA-mediated small-scale duplication events demonstrates that genomic relocation to a new chromosome has important consequences for asymmetric divergence of paralogs, akin to paralogs arising from RNA-mediated duplication events. Additionally, the duplication span is negatively correlated with sequence rate asymmetry among paralogs, suggesting that attenuated duplication spans stemming from incomplete duplication of the ORF and/or ancestral regulatory elements further accelerate sequence divergence between paralogs. Cumulatively, derived copies exhibit accelerated rates of sequence evolution suggesting that they are primed for a divergent evolutionary trajectory by changes in structure and genomic context at inception.
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA.
|
16
|
Vizcaíno JA, Côté RG, Csordas A, Dianes JA, Fabregat A, Foster JM, Griss J, Alpi E, Birim M, Contell J, O'Kelly G, Schoenegger A, Ovelleiro D, Pérez-Riverol Y, Reisinger F, Ríos D, Wang R, Hermjakob H. The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res 2012. [PMID: 23203882 PMCID: PMC3531176 DOI: 10.1093/nar/gks1262] [Citation(s) in RCA: 1594] [Impact Index Per Article: 132.8] [Indexed: 01/09/2023] Open
Abstract
The PRoteomics IDEntifications (PRIDE, http://www.ebi.ac.uk/pride) database at the European Bioinformatics Institute is one of the most prominent data repositories of mass spectrometry (MS)-based proteomics data. Here, we summarize recent developments in the PRIDE database and related tools. First, we provide up-to-date statistics in data content, splitting the figures by groups of organisms and species, including peptide and protein identifications, and post-translational modifications. We then describe the tools that are part of the PRIDE submission pipeline, especially the recently developed PRIDE Converter 2 (new submission tool) and PRIDE Inspector (visualization and analysis tool). We also give an update about the integration of PRIDE with other MS proteomics resources in the context of the ProteomeXchange consortium. Finally, we briefly review the quality control efforts that are ongoing at present and outline our future plans.
Affiliation(s)
- Juan Antonio Vizcaíno
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
|
17
|
Tong Y, Zheng K, Zhao S, Xiao G, Luo C. Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution 2012; 318:531-44. [PMID: 22674856 DOI: 10.1002/jez.b.22457] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 07/03/2011] [Revised: 02/01/2012] [Accepted: 04/03/2012] [Indexed: 12/20/2022]
Abstract
Recent studies demonstrated that sequence divergence in both the transcriptional regulatory region and the coding region contributes to the subfunctionalization of duplicate genes. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may have arisen from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 is expressed during the blastula and gastrula stages and in the adult retina but is silenced from the segmentation stage to the hatching stage, whereas vsx1A1 expression starts at the segmentation stage and continues onward. Given that zebrafish vsx1 is expressed at all developmental stages and in the adult retina, goldfish vsx1A1 and vsx1A2 appear to be partitioning the functions of the ancestral vsx1 between them. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing the expression patterns of a green fluorescent protein gene linked to the vsx1A1 or vsx1A2 3'-UTR, we demonstrated that the 3'-UTR of vsx1A1 retains, whereas the 3'-UTR of vsx1A2 has lost, the capability of mediating bipolar cell-specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes.
Affiliation(s)
- Ying Tong
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, People's Republic of China
|
18
|
Bu L, Bergthorsson U, Katju V. Local synteny and codon usage contribute to asymmetric sequence divergence of Saccharomyces cerevisiae gene duplicates. BMC Evol Biol 2011; 11:279. [PMID: 21955875 PMCID: PMC3190396 DOI: 10.1186/1471-2148-11-279] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 06/06/2011] [Accepted: 09/28/2011] [Indexed: 11/10/2022] Open
Abstract
Background Duplicated genes frequently experience asymmetric rates of sequence evolution. Relaxed selective constraints and positive selection have both been invoked to explain the observation that one paralog within a gene-duplicate pair exhibits an accelerated rate of sequence evolution. In the majority of studies where asymmetric divergence has been established, there is no indication as to which gene copy, ancestral or derived, is evolving more rapidly. In this study we investigated the effect of local synteny (gene-neighborhood conservation) and codon usage on the sequence evolution of gene duplicates in the S. cerevisiae genome. We further distinguish the gene duplicates into those that originated from a whole-genome duplication (WGD) event (ohnologs) versus small-scale duplications (SSD) to determine if there exist any differences in their patterns of sequence evolution.
Results For SSD pairs, the derived copy evolves faster than the ancestral copy. However, there is no relationship between rate asymmetry and synteny conservation (ancestral-like versus derived-like) in ohnologs. mRNA abundance and optimal codon usage, as measured by the CAI, are lower in the derived SSD copies relative to ancestral paralogs. Moreover, in the case of ohnologs, the faster-evolving copy has lower CAI and lowered expression.
Conclusions Together, these results suggest that relaxation of selection for codon usage and gene expression contributes to rate asymmetry in the evolution of duplicated genes and that in SSD pairs, the relaxation of selection stems from the loss of ancestral regulatory information in the derived copy.
Affiliation(s)
- Lijing Bu
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
|
19
|
Courtiade J, Pauchet Y, Vogel H, Heckel DG. A comprehensive characterization of the caspase gene family in insects from the order Lepidoptera. BMC Genomics 2011; 12:357. [PMID: 21740565 PMCID: PMC3141678 DOI: 10.1186/1471-2164-12-357] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Received: 02/23/2011] [Accepted: 07/08/2011] [Indexed: 11/22/2022] Open
Abstract
Background The cell suicide pathway of apoptosis is a necessary event in the life of multicellular organisms. It is involved in many biological processes ranging from development to the immune response. Evolutionarily conserved proteases, called caspases, play a central role in regulating apoptosis. Reception of death stimuli triggers the activation of initiator caspases, which in turn activate the effector caspases. In Lepidoptera, apoptosis is crucial in processes such as metamorphosis or defending against baculovirus infection. The discovery of p35, a baculovirus protein inhibiting caspase activity, has led to the characterization of the first lepidopteran caspase, Sf-Caspase-1. Studies on Sf-Caspase-1 mode of activation suggested that apoptosis in Lepidoptera requires a cascade of caspase activation, as demonstrated in many other species.
Results In order to get insights into this gene family in Lepidoptera, we performed an extensive survey of lepidopteran-derived EST datasets. We identified 66 sequences distributed among 27 species encoding putative caspases. Phylogenetic analyses showed that Lepidoptera possess at least 5 caspases, for which we propose a unified nomenclature. According to homology to their Drosophila counterparts and their primary structure, we determined that Lep-Caspase-1, -2 and -3 are putative effector caspases, whereas Lep-Caspase-5 and -6 are putative initiators. The likely function of Lep-Caspase-4 remains unclear. Lep-Caspase-2 is absent from the silkworm genome and appears to be noctuid-specific, and to have arisen from a tandem duplication of the Caspase-1 gene. In the tobacco hawkmoth, 3 distinct transcripts encoding putative Caspase-4 were identified, suggesting at least 2 duplication events in this species.
Conclusions The basic repertoire of five major types of caspases shared among Lepidoptera seems to be smaller than for most other groups studied to date, but gene duplication still plays a role in lineage-specific increases in diversity, just as in Diptera and mammals.
Affiliation(s)
- Juliette Courtiade
- Department of Entomology, Max Planck Institute for Chemical Ecology, Jena, Germany
|