1
|
Bourret J, Borvető F, Bravo IG. Subfunctionalisation of paralogous genes and evolution of differential codon usage preferences: The showcase of polypyrimidine tract binding proteins. J Evol Biol 2023; 36:1375-1392. [PMID: 37667674 DOI: 10.1111/jeb.14212] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 07/11/2023] [Accepted: 07/12/2023] [Indexed: 09/06/2023]
Abstract
Gene paralogs are copies of an ancestral gene that appear after gene or full genome duplication. When two sister gene copies are maintained in the genome, redundancy may release certain evolutionary pressures, allowing one of them to access novel functions. Here, we focused our study on gene paralogs on the evolutionary history of the three polypyrimidine tract binding protein genes (PTBP) and their concurrent evolution of differential codon usage preferences (CUPrefs) in vertebrate species. PTBP1-3 show high identity at the amino acid level (up to 80%) but display strongly different nucleotide composition, divergent CUPrefs and, in humans and in many other vertebrates, distinct tissue-specific expression levels. Our phylogenetic inference results show that the duplication events leading to the three extant PTBP1-3 lineages predate the basal diversification within vertebrates, and genomic context analysis illustrates that local synteny has been well preserved over time for the three paralogs. We identify a distinct evolutionary pattern towards GC3-enriching substitutions in PTBP1, concurrent with enrichment in frequently used codons and with a tissue-wide expression. In contrast, PTBP2s are enriched in AT-ending, rare codons, and display tissue-restricted expression. As a result of this substitution trend, CUPrefs sharply differ between mammalian PTBP1s and the rest of PTBPs. Genomic context analysis suggests that GC3-rich nucleotide composition in PTBP1s is driven by local substitution processes, while the evidence in this direction is thinner for PTBP2-3. An actual lack of co-variation between the observed GC composition of PTBP2-3 and that of the surrounding non-coding genomic environment would raise an interrogation on the origin of CUPrefs, warranting further research on a putative tissue-specific translational selection. Finally, we communicate an intriguing trend for the use of the UUG-Leu codon, which matches the trends of AT-ending codons. Our results are compatible with a scenario in which a combination of directional mutation-selection processes would have differentially shaped CUPrefs of PTBPs in vertebrates: the observed GC-enrichment of PTBP1 in placental mammals may be linked to genomic location and to the strong and broad tissue-expression, while AT-enrichment of PTBP2 and PTBP3 would be associated with rare CUPrefs and thus, possibly to specialized spatio-temporal expression. Our interpretation is coherent with a gene subfunctionalisation process by differential expression regulation associated with the evolution of specific CUPrefs.
Collapse
Affiliation(s)
- Jérôme Bourret
- Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
| | - Fanni Borvető
- Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
| | - Ignacio G Bravo
- Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
| |
Collapse
|
2
|
Herrboldt MA, Steffen MA, McGouran CN, Bonett RM. Pheromone Gene Diversification and the Evolution of Courtship Glands in Plethodontid Salamanders. J Mol Evol 2021; 89:576-587. [PMID: 34392385 DOI: 10.1007/s00239-021-10026-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 07/28/2021] [Indexed: 11/30/2022]
Abstract
Proteinaceous pheromones that diversify through gene duplication can result in shifts in courtship cocktails that may serve as a mechanism for reproductive isolation. The molecular evolution of pheromones has been extensively studied in salamanders, but how these genes and associated novel courtship glands have codiversified has not been evaluated. In this study we used transcriptional analyses to examine the relationship between pheromone diversification and gland type in three divergent lineages of plethodontid salamanders. Our results revealed that plethodontid salamanders express up to eight divergent Sodefrin Precursor-like Factor genes (spf, representing both alpha and beta subfamilies) along with Plethodontid Modulating Factor (pmf) and Plethodontid Receptivity Factor (prf). Expression of pheromone genes is tissue specific with pmf, prf, and some spf genes restricted to the mental gland. In contrast, the caudal gland shows strong expression of the other spf genes. We found evidence for punctuated changes in pheromone cocktail composition related to the loss of metamorphosis, and subsequent extreme reduction of the mental gland, in a paedomorphic lineage. Our study provides insight into how pheromone diversification can be partitioned into unique glands, which may lead to cocktail specificity in behavioral modules during courtship.
Collapse
Affiliation(s)
- Madison A Herrboldt
- Department of Biological Science, University of Tulsa, Tulsa, OK, 74104, USA.
| | - Michael A Steffen
- Department of Biological Science, University of Tulsa, Tulsa, OK, 74104, USA
| | - Carissa N McGouran
- Department of Biological Science, University of Tulsa, Tulsa, OK, 74104, USA
| | - Ronald M Bonett
- Department of Biological Science, University of Tulsa, Tulsa, OK, 74104, USA
| |
Collapse
|
3
|
Brohard-Julien S, Frouin V, Meyer V, Chalabi S, Deleuze JF, Le Floch E, Battail C. Region-specific expression of young small-scale duplications in the human central nervous system. BMC Ecol Evol 2021; 21:59. [PMID: 33882820 PMCID: PMC8059171 DOI: 10.1186/s12862-021-01794-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 04/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The duplication of genes is one of the main genetic mechanisms that led to the gain in complexity of biological tissue. Although the implication of duplicated gene expression in brain evolution was extensively studied through comparisons between organs, their role in the regional specialization of the adult human central nervous system has not yet been well described. RESULTS Our work explored intra-organ expression properties of paralogs through multiple territories of the human central nervous system (CNS) using transcriptome data generated by the Genotype-Tissue Expression (GTEx) consortium. Interestingly, we found that paralogs were associated with region-specific expression in CNS, suggesting their involvement in the differentiation of these territories. Beside the influence of gene expression level on region-specificity, we observed the contribution of both duplication age and duplication type to the CNS region-specificity of paralogs. Indeed, we found that small scale duplicated genes (SSDs) and in particular ySSDs (SSDs younger than the 2 rounds of whole genome duplications) were more CNS region-specific than other paralogs. Next, by studying the two paralogs of ySSD pairs, we observed that when they were region-specific, they tend to be specific to the same region more often than for other paralogs, showing the high co-expression of ySSD pairs. The extension of this analysis to families of paralogs showed that the families with co-expressed gene members (i.e. homogeneous families) were enriched in ySSDs. Furthermore, these homogeneous families tended to be region-specific families, where the majority of their gene members were specifically expressed in the same region. CONCLUSIONS Overall, our study suggests the involvement of ySSDs in the differentiation of human central nervous system territories. Therefore, we show the relevance of exploring region-specific expression of paralogs at the intra-organ level.
Collapse
Affiliation(s)
- Solène Brohard-Julien
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France.
- UNATI, Neurospin, Institut Joliot, CEA, Université Paris-Saclay, 91191, Gif-sur-Yvette, France.
- Université Paris-Sud, Université Paris-Saclay, Orsay, France.
| | - Vincent Frouin
- UNATI, Neurospin, Institut Joliot, CEA, Université Paris-Saclay, 91191, Gif-sur-Yvette, France
| | - Vincent Meyer
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | - Smahane Chalabi
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
| | - Jean-François Deleuze
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France
- Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset, Paris, France
- Centre de Référence, d'Innovation, d'expertise et de transfert (CREFIX), Evry, France
| | - Edith Le Floch
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France.
| | - Christophe Battail
- Centre National de Recherche en Génomique Humaine (CNRGH), Institut François Jacob, CEA, Université Paris-Saclay, Evry, France.
- CEA, Univ. Grenoble Alpes, INSERM, IRIG, Biology of Cancer and Infection UMR1292, 38000, Grenoble, France.
| |
Collapse
|
4
|
Chau LM, Goodisman MAD. Gene duplication and the evolution of phenotypic diversity in insect societies. Evolution 2017; 71:2871-2884. [PMID: 28875541 DOI: 10.1111/evo.13356] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2017] [Revised: 08/29/2017] [Accepted: 08/31/2017] [Indexed: 12/16/2022]
Abstract
Gene duplication is an important evolutionary process thought to facilitate the evolution of phenotypic diversity. We investigated if gene duplication was associated with the evolution of phenotypic differences in a highly social insect, the honeybee Apis mellifera. We hypothesized that the genetic redundancy provided by gene duplication could promote the evolution of social and sexual phenotypes associated with advanced societies. We found a positive correlation between sociality and rate of gene duplications across the Apoidea, indicating that gene duplication may be associated with sociality. We also discovered that genes showing biased expression between A. mellifera alternative phenotypes tended to be found more frequently than expected among duplicated genes than singletons. Moreover, duplicated genes had higher levels of caste-, sex-, behavior-, and tissue-biased expression compared to singletons, as expected if gene duplication facilitated phenotypic differentiation. We also found that duplicated genes were maintained in the A. mellifera genome through the processes of conservation, neofunctionalization, and specialization, but not subfunctionalization. Overall, we conclude that gene duplication may have facilitated the evolution of social and sexual phenotypes, as well as tissue differentiation. Thus this study further supports the idea that gene duplication allows species to evolve an increased range of phenotypic diversity.
Collapse
Affiliation(s)
- Linh M Chau
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332
| | - Michael A D Goodisman
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332
| |
Collapse
|
5
|
Schmitz JF, Zimmer F, Bornberg-Bauer E. Mechanisms of transcription factor evolution in Metazoa. Nucleic Acids Res 2016; 44:6287-97. [PMID: 27288445 PMCID: PMC5291267 DOI: 10.1093/nar/gkw492] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Revised: 05/18/2016] [Accepted: 05/22/2016] [Indexed: 11/12/2022] Open
Abstract
Transcriptions factors (TFs) are pivotal for the regulation of virtually all cellular processes, including growth and development. Expansions of TF families are causally linked to increases in organismal complexity. Here we study the evolutionary dynamics, genetic causes and functional implications of the five largest metazoan TF families. We find that family expansions dominate across the whole metazoan tree; however, some branches experience exceptional family-specific accelerated expansions. Additionally, we find that such expansions are often predated by modular domain rearrangements, which spur the expansion of a new sub-family by separating it from the rest of the TF family in terms of protein-protein interactions. This separation allows for radical shifts in the functional spectrum of a duplicated TF. We also find functional differentiation inside TF sub-families as changes in expression specificity. Furthermore, accelerated family expansions are facilitated by repeats of sequence motifs such as C2H2 zinc fingers. We quantify whole genome duplications and single gene duplications as sources of TF family expansions, implying that some, but not all, TF duplicates are preferentially retained. We conclude that trans-regulatory changes (domain rearrangements) are instrumental for fundamental functional innovations, that cis-regulatory changes (affecting expression) accomplish wide-spread fine tuning and both jointly contribute to the functional diversification of TFs.
Collapse
Affiliation(s)
- Jonathan F Schmitz
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, Hüfferstrasse 1, D-48149 Münster, Germany
| | - Fabian Zimmer
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, Hüfferstrasse 1, D-48149 Münster, Germany Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Erich Bornberg-Bauer
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, Hüfferstrasse 1, D-48149 Münster, Germany
| |
Collapse
|
6
|
Xu A, Li G, Yang D, Wu S, Ouyang H, Xu P, He F. Evolutionary Characteristics of Missing Proteins: Insights into the Evolution of Human Chromosomes Related to Missing-Protein-Encoding Genes. J Proteome Res 2015; 14:4985-94. [DOI: 10.1021/acs.jproteome.5b00450] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Aishi Xu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
- Animal Sciences College of Jilin University, Changchun 130062, P. R. China
| | - Guang Li
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
| | - Dong Yang
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
| | - Songfeng Wu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
| | - Hongsheng Ouyang
- Animal Sciences College of Jilin University, Changchun 130062, P. R. China
| | - Ping Xu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
| | - Fuchu He
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China
| |
Collapse
|
7
|
Tanaka K, Diekmann Y, Hazbun A, Hijazi A, Vreede B, Roch F, Sucena É. Multispecies Analysis of Expression Pattern Diversification in the Recently Expanded Insect Ly6 Gene Family. Mol Biol Evol 2015; 32:1730-47. [PMID: 25743545 PMCID: PMC4476152 DOI: 10.1093/molbev/msv052] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Gene families often consist of members with diverse expression domains reflecting their functions in a wide variety of tissues. However, how the expression of individual members, and thus their tissue-specific functions, diversified during the course of gene family expansion is not well understood. In this study, we approached this question through the analysis of the duplication history and transcriptional evolution of a rapidly expanding subfamily of insect Ly6 genes. We analyzed different insect genomes and identified seven Ly6 genes that have originated from a single ancestor through sequential duplication within the higher Diptera. We then determined how the original embryonic expression pattern of the founding gene diversified by characterizing its tissue-specific expression in the beetle Tribolium castaneum, the butterfly Bicyclus anynana, and the mosquito Anopheles stephensi and those of its duplicates in three higher dipteran species, representing various stages of the duplication history (Megaselia abdita, Ceratitis capitata, and Drosophila melanogaster). Our results revealed that frequent neofunctionalization episodes contributed to the increased expression breadth of this subfamily and that these events occurred after duplication and speciation events at comparable frequencies. In addition, at each duplication node, we consistently found asymmetric expression divergence. One paralog inherited most of the tissue-specificities of the founder gene, whereas the other paralog evolved drastically reduced expression domains. Our approach attests to the power of combining a well-established duplication history with a comprehensive coverage of representative species in acquiring unequivocal information about the dynamics of gene expression evolution in gene families.
Collapse
Affiliation(s)
| | | | | | - Assia Hijazi
- Centre de Biologie du Développement, CNRS UMR 5547, Université de Toulouse UPS, Toulouse, France
| | | | - Fernando Roch
- Centre de Biologie du Développement, CNRS UMR 5547, Université de Toulouse UPS, Toulouse, France
| | - Élio Sucena
- Instituto Gulbenkian de Ciência, Oeiras, Portugal Departamento de Biologia Animal, Faculdade de Ciências, Edifício C2, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
8
|
Complexity of gene expression evolution after duplication: protein dosage rebalancing. GENETICS RESEARCH INTERNATIONAL 2014; 2014:516508. [PMID: 25197576 PMCID: PMC4150538 DOI: 10.1155/2014/516508] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Accepted: 08/03/2014] [Indexed: 11/17/2022]
Abstract
Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC). Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO) annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes.
Collapse
|
9
|
Diekmann Y, Seixas E, Gouw M, Tavares-Cadete F, Seabra MC, Pereira-Leal JB. Thousands of rab GTPases for the cell biologist. PLoS Comput Biol 2011; 7:e1002217. [PMID: 22022256 PMCID: PMC3192815 DOI: 10.1371/journal.pcbi.1002217] [Citation(s) in RCA: 146] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2011] [Accepted: 08/17/2011] [Indexed: 01/27/2023] Open
Abstract
Rab proteins are small GTPases that act as essential regulators of vesicular trafficking. 44 subfamilies are known in humans, performing specific sets of functions at distinct subcellular localisations and tissues. Rab function is conserved even amongst distant orthologs. Hence, the annotation of Rabs yields functional predictions about the cell biology of trafficking. So far, annotating Rabs has been a laborious manual task not feasible for current and future genomic output of deep sequencing technologies. We developed, validated and benchmarked the Rabifier, an automated bioinformatic pipeline for the identification and classification of Rabs, which achieves up to 90% classification accuracy. We cataloged roughly 8.000 Rabs from 247 genomes covering the entire eukaryotic tree. The full Rab database and a web tool implementing the pipeline are publicly available at www.RabDB.org. For the first time, we describe and analyse the evolution of Rabs in a dataset covering the whole eukaryotic phylogeny. We found a highly dynamic family undergoing frequent taxon-specific expansions and losses. We dated the origin of human subfamilies using phylogenetic profiling, which enlarged the Rab repertoire of the Last Eukaryotic Common Ancestor with Rab14, 32 and RabL4. Furthermore, a detailed analysis of the Choanoflagellate Monosiga brevicollis Rab family pinpointed the changes that accompanied the emergence of Metazoan multicellularity, mainly an important expansion and specialisation of the secretory pathway. Lastly, we experimentally establish tissue specificity in expression of mouse Rabs and show that neo-functionalisation best explains the emergence of new human Rab subfamilies. With the Rabifier and RabDB, we provide tools that easily allows non-bioinformaticians to integrate thousands of Rabs in their analyses. RabDB is designed to enable the cell biology community to keep pace with the increasing number of fully-sequenced genomes and change the scale at which we perform comparative analysis in cell biology. Intracellular compartmentalisation via membrane-delimited organelles is a fundamental feature of the eukaryotic cell. Understanding its origins and specialisation into functionally distinct compartments is a major challenge in evolutionary cell biology. We focus on the Rab enzymes, critical organisers of the trafficking pathways that link the endomembrane system. Rabs form a large family of evolutionarily related proteins, regulating distinct steps in vesicle transport. They mark pathways and organelles due to their specific subcellular and tissue localisation. We propose a solution to the problem of identifying and annotating Rabs in hundreds of sequenced genomes. We developed an accurate bioinformatics pipeline that is able to take into account pre-existing and often inconsistent, manual annotations. We made it available to the community in form of a web tool, as well as a database containing thousands of Rabs assigned to sub-families, which yields clear functional predictions. Thousands of Rabs allow for a new level of analysis. We illustrate this by characterising for the first time the global evolutionary dynamics of the Rab family. We dated the emergence of subfamilies and suggest that the Rab family expands by duplicates acquiring new functions.
Collapse
Affiliation(s)
- Yoan Diekmann
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- * E-mail: (YD); (JBPL)
| | - Elsa Seixas
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- Centro de Estudos de Doenças Crónicas (CEDOC), Faculdade de Ciências Médicas da Universidade Nova de Lisboa, Lisboa, Portugal
| | - Marc Gouw
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | | | - Miguel C. Seabra
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- Centro de Estudos de Doenças Crónicas (CEDOC), Faculdade de Ciências Médicas da Universidade Nova de Lisboa, Lisboa, Portugal
| | | |
Collapse
|
10
|
Huerta-Cepas J, Dopazo J, Huynen MA, Gabaldón T. Evidence for short-time divergence and long-time conservation of tissue-specific expression after gene duplication. Brief Bioinform 2011; 12:442-8. [PMID: 21515902 DOI: 10.1093/bib/bbr022] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Gene duplication is one of the main mechanisms by which genomes can acquire novel functions. It has been proposed that the retention of gene duplicates can be associated to processes of tissue expression divergence. These models predict that acquisition of divergent expression patterns should be acquired shortly after the duplication, and that larger divergence in tissue expression would be expected for paralogs, as compared to orthologs of a similar age. Many studies have shown that gene duplicates tend to have divergent expression patterns and that gene family expansions are associated with high levels of tissue specificity. However, the timeframe in which these processes occur have rarely been investigated in detail, particularly in vertebrates, and most analyses do not include direct comparisons of orthologs as a baseline for the expected levels of tissue specificity in absence of duplications. To assess the specific contribution of duplications to expression divergence, we combine here phylogenetic analyses and expression data from human and mouse. In particular, we study differences in spatial expression among human-mouse paralogs, specifically duplicated after the radiation of mammals, and compare them to pairs of orthologs in the same species. Our results show that gene duplication leads to increased levels of tissue specificity and that this tends to occur promptly after the duplication event.
Collapse
|
11
|
Hase T, Niimura Y, Tanaka H. Difference in gene duplicability may explain the difference in overall structure of protein-protein interaction networks among eukaryotes. BMC Evol Biol 2010; 10:358. [PMID: 21087510 PMCID: PMC2994879 DOI: 10.1186/1471-2148-10-358] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2010] [Accepted: 11/18/2010] [Indexed: 11/19/2022] Open
Abstract
Background A protein-protein interaction network (PIN) was suggested to be a disassortative network, in which interactions between high- and low-degree nodes are favored while hub-hub interactions are suppressed. It was postulated that a disassortative structure minimizes unfavorable cross-talks between different hub-centric functional modules and was positively selected in evolution. However, by re-examining yeast PIN data, several researchers reported that the disassortative structure observed in a PIN might be an experimental artifact. Therefore, the existence of a disassortative structure and its possible evolutionary mechanism remains unclear. Results In this study, we investigated PINs from the yeast, worm, fly, human, and malaria parasite including four different yeast PIN datasets. The analyses showed that the yeast, worm, fly, and human PINs are disassortative while the malaria parasite PIN is not. By conducting simulation studies on the basis of a duplication-divergence model, we demonstrated that a preferential duplication of low- and high-degree nodes can generate disassortative and non-disassortative networks, respectively. From this observation, we hypothesized that the difference in degree dependence on gene duplications accounts for the difference in assortativity of PINs among species. Comparison of 55 proteomes in eukaryotes revealed that genes with lower degrees showed higher gene duplicabilities in the yeast, worm, and fly, while high-degree genes tend to have high duplicabilities in the malaria parasite, supporting the above hypothesis. Conclusions These results suggest that disassortative structures observed in PINs are merely a byproduct of preferential duplications of low-degree genes, which might be caused by an organism's living environment.
Collapse
Affiliation(s)
- Takeshi Hase
- Department of Bioinformatics, Medical Research Institute, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Japan
| | | | | |
Collapse
|
12
|
Tian X, Pascal G, Monget P. Evolution and functional divergence of NLRP genes in mammalian reproductive systems. BMC Evol Biol 2009; 9:202. [PMID: 19682372 PMCID: PMC2735741 DOI: 10.1186/1471-2148-9-202] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2009] [Accepted: 08/14/2009] [Indexed: 12/31/2022] Open
Abstract
Background NLRPs (Nucleotide-binding oligomerization domain, Leucine rich Repeat and Pyrin domain containing Proteins) are members of NLR (Nod-like receptors) protein family. Recent researches have shown that NLRP genes play important roles in both mammalian innate immune system and reproductive system. Several of NLRP genes were shown to be specifically expressed in the oocyte in mammals. The aim of the present work was to study how these genes evolved and diverged after their duplication, as well as whether natural selection played a role during their evolution. Results By using in silico methods, we have evaluated the evolution and functional divergence of NLRP genes, in particular of mouse reproduction-related Nlrp genes. We found that (1) major NLRP genes have been duplicated before the divergence of mammals, with certain lineage-specific duplications in primates (NLRP7 and 11) and in rodents (Nlrp1, 4 and 9 duplicates); (2) tandem duplication events gave rise to a mammalian reproduction-related NLRP cluster including NLRP2, 4, 5, 7, 8, 9, 11, 13 and 14 genes; (3) the function of mammalian oocyte-specific NLRP genes (NLRP4, 5, 9 and 14) might have diverged during gene evolution; (4) recent segmental duplications concerning Nlrp4 copies and vomeronasal 1 receptor encoding genes (V1r) have been undertaken in the mouse; and (5) duplicates of Nlrp4 and 9 in the mouse might have been subjected to adaptive evolution. Conclusion In conclusion, this study brings us novel information on the evolution of mammalian reproduction-related NLRPs. On the one hand, NLRP genes duplicated and functionally diversified in mammalian reproductive systems (such as NLRP4, 5, 9 and 14). On the other hand, during evolution, different lineages adapted to develop their own NLRP genes, particularly in reproductive function (such as the specific expansion of Nlrp4 and Nlrp9 in the mouse).
Collapse
Affiliation(s)
- Xin Tian
- Physiologie de la Reproduction et des Comportements, UMR 6175 INRA-CNRS-Université François Rabelais de Tours-Haras Nationaux, 37380 Nouzilly, France.
| | | | | |
Collapse
|
13
|
Hoogewijs D, De Henau S, Dewilde S, Moens L, Couvreur M, Borgonie G, Vinogradov SN, Roy SW, Vanfleteren JR. The Caenorhabditis globin gene family reveals extensive nematode-specific radiation and diversification. BMC Evol Biol 2008; 8:279. [PMID: 18844991 PMCID: PMC2576238 DOI: 10.1186/1471-2148-8-279] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2008] [Accepted: 10/09/2008] [Indexed: 01/27/2023] Open
Abstract
Background Globin isoforms with variant properties and functions have been found in the pseudocoel, body wall and cuticle of various nematode species and even in the eyespots of the insect-parasite Mermis nigrescens. In fact, much higher levels of complexity exist, as shown by recent whole genome analysis studies. In silico analysis of the genome of Caenorhabditis elegans revealed an unexpectedly high number of globin genes featuring a remarkable diversity in gene structure, amino acid sequence and expression profiles. Results In the present study we have analyzed whole genomic data from C. briggsae, C. remanei, Pristionchus pacificus and Brugia malayi and EST data from several other nematode species to study the evolutionary history of the nematode globin gene family. We find a high level of conservation of the C. elegans globin complement, with even distantly related nematodes harboring orthologs to many Caenorhabditis globins. Bayesian phylogenetic analysis resolves all nematode globins into two distinct globin classes. Analysis of the globin intron-exon structures suggests extensive loss of ancestral introns and gain of new positions in deep nematode ancestors, and mainly loss in the Caenorhabditis lineage. We also show that the Caenorhabditis globin genes are expressed in distinct, mostly non-overlapping, sets of cells and that they are all under strong purifying selection. Conclusion Our results enable reconstruction of the evolutionary history of the globin gene family in the nematode phylum. A duplication of an ancestral globin gene occurred before the divergence of the Platyhelminthes and the Nematoda and one of the duplicated genes radiated further in the nematode phylum before the split of the Spirurina and Rhabditina and was followed by further radiation in the lineage leading to Caenorhabditis. The resulting globin genes were subject to processes of subfunctionalization and diversification leading to cell-specific expression patterns. Strong purifying selection subsequently dampened further evolution and facilitated fixation of the duplicated genes in the genome.
Collapse
Affiliation(s)
- David Hoogewijs
- Department of Biology and Center for Molecular Phylogeny and Evolution, Ghent University, B-9000 Ghent, Belgium.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Woody OZ, Doxey AC, McConkey BJ. Assessing the evolution of gene expression using microarray data. Evol Bioinform Online 2008; 4:139-52. [PMID: 19204814 PMCID: PMC2614203 DOI: 10.4137/ebo.s628] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Classical studies of the evolution of gene function have predominantly focused on mutations within protein coding regions. With the advent of microarrays, however, it has become possible to evaluate the transcriptional activity of a gene as an additional characteristic of function. Recent studies have revealed an equally important role for gene regulation in the retention and evolution of duplicate genes. Here we review approaches to assessing the evolution of gene expression using microarray data, and discuss potential influences on expression divergence. Currently, there are no established standards on how best to identify and quantify instances of expression divergence. There have also been few efforts to date that incorporate suspected influences into mathematical models of expression divergence. Such developments will be crucial to a comprehensive understanding of the role gene duplications and expression evolution play in the emergence of complex traits and functional diversity. An integrative approach to gene family evolution, including both orthologous and paralogous genes, has the potential to bring strong predictive power both to the functional annotation of extant proteins and to the inference of functional characteristics of ancestral gene family members.
Collapse
Affiliation(s)
- Owen Z Woody
- Department of Biology, University of Waterloo, Waterloo, Ontario Canada
| | | | | |
Collapse
|
15
|
Construction, visualisation, and clustering of transcription networks from microarray expression data. PLoS Comput Biol 2007; 3:2032-42. [PMID: 17967053 PMCID: PMC2041979 DOI: 10.1371/journal.pcbi.0030206] [Citation(s) in RCA: 198] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2007] [Accepted: 09/05/2007] [Indexed: 12/20/2022] Open
Abstract
Network analysis transcends conventional pairwise approaches to data analysis as the context of components in a network graph can be taken into account. Such approaches are increasingly being applied to genomics data, where functional linkages are used to connect genes or proteins. However, while microarray gene expression datasets are now abundant and of high quality, few approaches have been developed for analysis of such data in a network context. We present a novel approach for 3-D visualisation and analysis of transcriptional networks generated from microarray data. These networks consist of nodes representing transcripts connected by virtue of their expression profile similarity across multiple conditions. Analysing genome-wide gene transcription across 61 mouse tissues, we describe the unusual topography of the large and highly structured networks produced, and demonstrate how they can be used to visualise, cluster, and mine large datasets. This approach is fast, intuitive, and versatile, and allows the identification of biological relationships that may be missed by conventional analysis techniques. This work has been implemented in a freely available open-source application named BioLayout Express3D. This paper describes a novel approach for analysis of gene expression data. In this approach, normalized gene expression data is transformed into a graph where nodes in the graph represent transcripts connected to each other by virtue of their coexpression across multiple tissues or samples. The graph paradigm has many advantages for such analyses. Graph clustering of the derived network performs extremely well in comparison to traditional pairwise schemes. We show that this approach is robust and able to accommodate large datasets such as the Genomics Institute of the Novartis Research Foundation mouse tissue atlas. The entire approach and algorithms are combined into a single open-source JAVA application that allows users to perform this analysis and further mining on their own data and to visualize the results interactively in 3-D. The approach is not limited to gene expression data but would also be useful for other complex biological datasets. We use the method to investigate the relationship between the phylogenetic age of transcripts and their tissue specificity.
Collapse
|
16
|
Liu Z, Adams KL. Expression partitioning between genes duplicated by polyploidy under abiotic stress and during organ development. Curr Biol 2007; 17:1669-74. [PMID: 17825563 DOI: 10.1016/j.cub.2007.08.030] [Citation(s) in RCA: 125] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2007] [Revised: 07/24/2007] [Accepted: 08/12/2007] [Indexed: 02/01/2023]
Abstract
Allopolyploidy has been a prominent mode of speciation and a recurrent process during plant evolution and has contributed greatly to the large number of duplicated genes in plant genomes [1-4]. Polyploidy often leads to changes in genome organization and gene expression [5-9]. The expression of genes that are duplicated by polyploidy (termed homeologs) can be partitioned between the duplicates so that one copy is expressed and functions only in some organs and the other copy is expressed only in other organs, indicative of subfunctionalization [10]. To determine how homeologous-gene expression patterns change during organ development and in response to abiotic stress conditions, we have examined expression of the alcohol dehydrogenase gene AdhA in allopolyploid cotton (Gossypium hirsutum). Expression ratios of the two homeologs vary considerably during the development of organs from seedlings and fruits. Abiotic stress treatments, including cold, dark, and water submersion, altered homeologous-gene expression. Most notably, only one copy is expressed in hypocotyls during a water-submersion treatment, and only the other copy is expressed during cold stress. These results imply that subfunctionalization of genes duplicated by polyploidy has occurred in response to abiotic stress conditions. Partitioning of duplicate gene expression in response to environmental stress may lead to duplicate gene retention during subsequent evolution.
Collapse
Affiliation(s)
- Zhenlan Liu
- UBC Botanical Garden and Centre for Plant Research and Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia V6T 1Z4, Canada
| | | |
Collapse
|
17
|
Liang H, Li WH. Gene essentiality, gene duplicability and protein connectivity in human and mouse. Trends Genet 2007; 23:375-8. [PMID: 17512629 DOI: 10.1016/j.tig.2007.04.005] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2006] [Revised: 02/16/2007] [Accepted: 04/25/2007] [Indexed: 11/28/2022]
Abstract
It has previously been found that, in yeast, gene essentiality is positively correlated with protein connectivity (number of interaction partners) but negatively correlated with the existence of gene duplicates and that highly connected proteins tend to have a low gene duplicability. Using data from human and mouse, we show here that, in mammals, the first of these relationships holds true, but unlike the second relationship in yeast, highly connected mammalian proteins tend to have a high gene duplicability, and there is no correlation between gene essentiality and gene duplication in mammals.
Collapse
Affiliation(s)
- Han Liang
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637, USA
| | | |
Collapse
|