1
|
Yuan X, Kadowaki T. Protein subcellular relocalization and function of duplicated flagellar calcium binding protein genes in honey bee trypanosomatid parasite. PLoS Genet 2024; 20:e1011195. [PMID: 38437202 PMCID: PMC10939215 DOI: 10.1371/journal.pgen.1011195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 03/14/2024] [Accepted: 02/23/2024] [Indexed: 03/06/2024] Open
Abstract
The honey bee trypanosomatid parasite, Lotmaria passim, contains two genes that encode the flagellar calcium binding protein (FCaBP) through tandem duplication in its genome. FCaBPs localize in the flagellum and entire body membrane of L. passim through specific N-terminal sorting sequences. This finding suggests that this is an example of protein subcellular relocalization resulting from gene duplication, altering the intracellular localization of FCaBP. However, this phenomenon may not have occurred in Leishmania, as one or both of the duplicated genes have become pseudogenes. Multiple copies of the FCaBP gene are present in several Trypanosoma species and Leptomonas pyrrhocoris, indicating rapid evolution of this gene in trypanosomatid parasites. The N-terminal flagellar sorting sequence of L. passim FCaBP1 is in close proximity to the BBSome complex, while that of Trypanosoma brucei FCaBP does not direct GFP to the flagellum in L. passim. Deletion of the two FCaBP genes in L. passim affected growth and impaired flagellar morphogenesis and motility, but it did not impact host infection. Therefore, FCaBP represents a duplicated gene with a rapid evolutionary history that is essential for flagellar structure and function in a trypanosomatid parasite.
Collapse
Affiliation(s)
- Xuye Yuan
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou Dushu Lake Higher Education Town, Jiangsu Province, China
| | - Tatsuhiko Kadowaki
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou Dushu Lake Higher Education Town, Jiangsu Province, China
| |
Collapse
|
2
|
Li P, Zhang Y, Zhao C, Jiang M. Evolution of the Tóxicos en Levadura 63 (TL63) gene family in plants and functional characterization of Arabidopsis thaliana TL63 under oxidative stress. PLANTA 2023; 258:87. [PMID: 37750983 DOI: 10.1007/s00425-023-04243-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 09/13/2023] [Indexed: 09/27/2023]
Abstract
MAIN CONCLUSION TL63 orthologs were angiosperm specific and had undergone motifs loss and gain, and increased purifying selection. AtTL63 was involved in the response of yeast and Arabidopsis plants to oxidative stress. The Tóxicos en Levadura (TL) family, a class of E3 ubiquitin ligases with typical RING-H2 type zinc finger structure, plays a pivotal role in mediating physiological processes and responding to stress in plants. However, the evolution and function of TL63 remain unclear. In this study, TL63 homologs were dated roughly back to the origin of land plants and confirmed to have subjected to the gain and loss of motifs and increased purifying selection. Phylogenetic analysis displayed that 279 TL63s could be divided into four main clades (Clade A-D). Notably, the ancestral tandem TL40/41 cluster contributed to the expansion of modern Brassicaceae TL40/41. The substitution rate tests revealed that the TL63 lineage was evidently different from other lineages. The codon usage index exhibited that monocotyledons preferred to use not A3s and T3s, but C3s, G3s, CAI, CBI and Fop. Sequence analysis showed that the TL63 homologs had conserved TM and GLD motifs and RING-H2 domain whose key amino acid residues accounted for the high average abundance. Particularly, Arabidopsis thaliana TL63 (AtTL63) was located in the nuclei, cell membranes and peroxisomes and expressed universally and significantly throughout A. thaliana development. Under H2O2 treatment, low or moderate expression of the AtTL63 held beneficial effects on the growth and viability of yeast cells and the mutation or overexpression of the AtTL63 positively affected the growth of A. thaliana plants. In brief, this study could supply useful insight into the evolution of the plant TL63s and the AtTL63 functions under oxidative stress.
Collapse
Affiliation(s)
- Peng Li
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Yuxin Zhang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Changling Zhao
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming, 650201, China.
| | - Min Jiang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China.
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China.
| |
Collapse
|
3
|
Evolutionary conservation of sequence motifs at sites of protein modification. J Biol Chem 2023; 299:104617. [PMID: 36933807 PMCID: PMC10139944 DOI: 10.1016/j.jbc.2023.104617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 02/20/2023] [Accepted: 03/11/2023] [Indexed: 03/18/2023] Open
Abstract
Gene duplications are common in biology and are likely to be an important source of functional diversification and specialization. The yeast Saccharomyces cerevisiae underwent a whole genome duplication event early in evolution, and a substantial number of duplicated genes have been retained. We identified more than 3,500 instances where only one of two paralogous proteins undergoes post-translational modification despite having retained the same amino acid residue in both. We also developed a web-based search algorithm (CoSMoS.c.) that scores conservation of amino acid sequences based on 1011 wild and domesticated yeast isolates and used it to compare differentially-modified pairs of paralogous proteins. We found that the most common modifications - phosphorylation, ubiquitylation and acylation but not N-glycosylation - occur in regions of high sequence conservation. Such conservation is evident even for ubiquitylation and succinylation, where there is no established 'consensus site' for modification. Differences in phosphorylation were not associated with predicted secondary structure or solvent accessibility, but did mirror known differences in kinase-substrate interactions. Thus, differences in post-translational modification likely result from differences in adjoining amino acids and their interactions with modifying enzymes. By integrating data from large scale proteomics and genomics analysis, in a system with such substantial genetic diversity, we obtained a more comprehensive understanding of the functional basis for genetic redundancies that have persisted for 100 million years.
Collapse
|
4
|
Genome-Wide Comprehensive Analysis of PtLACs: Prediction and Verification of the Functional Divergence of Tandem-Duplicated Genes. FORESTS 2022. [DOI: 10.3390/f13020157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Laccases (EC 1.10.3.2) have been widely considered to participate in the metabolic processes of lignin synthesis, osmotic stress response, and flavonoid oxidation in higher plants. The research into Populus trichocarpa laccase focused on the synthesis of lignin in the past few years. In this study, for the first time, a comprehensive analysis of 53 laccase copies in the P. trichocarpa genome was conducted. Positive selection analysis using the branch-site model indicated that LAC genes in terrestrial plants have undergone selective pressure for adaptive evolution. On the basis of the phylogenetic relationship, we reconstructed the evolutionary process of terrestrial plant laccase and found that this gene family began to expand during the evolution of angiosperms. Tandem duplication is the main form of expansion of the PtLAC gene family. The analysis of the sequence characteristics, gene structure, expression pattern, and gene synonymous mutation rate of PtLACs provided a theoretical basis for the functional divergence of tandem duplicated genes. The synonymous mutation rate was used to quantify the divergence time of 11 tandem duplicated gene clusters. Cluster 2, with the earliest divergence time and lower share of sequence similarity, and cluster 5, with the latest divergence time and higher share of similarity, were selected in this study to explore the functional divergence of tandem-duplicated gene clusters. Tobacco subcellular localization and Arabidopsis transgenes verified the functional differentiation of PtLAC genes in cluster 2 and the functional non-differentiation of PtLAC genes in cluster 5. The results of this study provide a reference for the functional differentiation of tandem-duplicated PtLAC.
Collapse
|
5
|
Kuzmin E, Taylor JS, Boone C. Retention of duplicated genes in evolution. Trends Genet 2022; 38:59-72. [PMID: 34294428 PMCID: PMC8678172 DOI: 10.1016/j.tig.2021.06.016] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 06/22/2021] [Accepted: 06/24/2021] [Indexed: 01/03/2023]
Abstract
Gene duplication is a prevalent phenomenon across the tree of life. The processes that lead to the retention of duplicated genes are not well understood. Functional genomics approaches in model organisms, such as yeast, provide useful tools to test the mechanisms underlying retention with functional redundancy and divergence of duplicated genes, including fates associated with neofunctionalization, subfunctionalization, back-up compensation, and dosage amplification. Duplicated genes may also be retained as a consequence of structural and functional entanglement. Advances in human gene editing have enabled the interrogation of duplicated genes in the human genome, providing new tools to evaluate the relative contributions of each of these factors to duplicate gene retention and the evolution of genome structure.
Collapse
Affiliation(s)
- Elena Kuzmin
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Centre, McGill University, 1160 Ave des Pins Ouest, Montreal, Quebec, H3A 1A3, Canada.,Correspondence to: (E.K.)
| | - John S. Taylor
- Department of Biology, University of Victoria, PO Box 1700, Station CSC, Victoria, BC, V8W 2Y2, Canada
| | - Charles Boone
- Department of Molecular Genetics, Donnelly Centre, University of Toronto, 160 College Street, Toronto ON, M5S 3E1, Canada.,RIKEN Centre for Sustainable Resource Science, Waiko, Saitama, Japan
| |
Collapse
|
6
|
Ayaz A, Saqib S, Huang H, Zaman W, Lü S, Zhao H. Genome-wide comparative analysis of long-chain acyl-CoA synthetases (LACSs) gene family: A focus on identification, evolution and expression profiling related to lipid synthesis. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2021; 161:1-11. [PMID: 33556720 DOI: 10.1016/j.plaphy.2021.01.042] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 01/25/2021] [Indexed: 05/27/2023]
Abstract
In plants, Long-chain acyl-CoA synthetases (LACSs) play key roles in activating fatty acids to fatty acyl-CoA thioesters, which are then further involved in lipid synthesis and fatty acid catabolism. LACSs have been intensively studied in Arabidopsis, but its evolutionary relationship in green plants is unexplored. In this study, we performed a comprehensive genome-wide analysis of the LACS gene family across green plants followed by phylogenetic clustering analysis, gene structure determination, detection of conserved motifs, gene expression in tissues and subcellular localization. Our results identified LACS genes in 122 plant species including algae, low land plants (i.e., mosses and lycophytes), monocots, and eudicots. In total, 697 sequences were identified, and 629 sequences were selected because of alignment and some duplication errors. The retrieved amino acid sequences ranged from 271 to 1056 residues and diversified in intron/exon patterns in different LACSs. Phylogenetic clustering grouped LACS gene family into six major clades with distinct potential functions. This classification is well supported by examining gene structure and conserved motifs. Also, gene expression analysis and subcellular localization substantiate with clade division in the phylogeny, indicating that the evolutionary pattern is visible in their functionality. Additionally, experimental analysis of lacs2 mutant validated that LACS2 plays key roles in suberin synthesis. Thus, our study not only provides an evolutionary mechanism underlying functional diversification but also lays the foundation for further elucidation of the LACS gene family.
Collapse
Affiliation(s)
- Asma Ayaz
- State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University, Wuhan, 430062, China.
| | - Saddam Saqib
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Haodong Huang
- State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University, Wuhan, 430062, China.
| | - Wajid Zaman
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Shiyou Lü
- State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University, Wuhan, 430062, China.
| | - Huayan Zhao
- State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University, Wuhan, 430062, China.
| |
Collapse
|
7
|
Costello R, Emms DM, Kelly S. Gene Duplication Accelerates the Pace of Protein Gain and Loss from Plant Organelles. Mol Biol Evol 2021; 37:969-981. [PMID: 31750917 PMCID: PMC7086175 DOI: 10.1093/molbev/msz275] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Organelle biogenesis and function is dependent on the concerted action of both organellar-encoded (if present) and nuclear-encoded proteins. Differences between homologous organelles across the Plant Kingdom arise, in part, as a result of differences in the cohort of nuclear-encoded proteins that are targeted to them. However, neither the rate at which differences in protein targeting accumulate nor the evolutionary consequences of these changes are known. Using phylogenomic approaches coupled to ancestral state estimation, we show that the plant organellar proteome has diversified in proportion with molecular sequence evolution such that the proteomes of plant chloroplasts and mitochondria lose or gain on average 3.6 proteins per million years. We further demonstrate that changes in organellar protein targeting are associated with an increase in the rate of molecular sequence evolution and that such changes predominantly occur in genes with regulatory rather than metabolic functions. Finally, we show that gain and loss of protein target signals occurs at a higher rate following gene duplication, revealing that gene and genome duplication are a key facilitator of plant organelle evolution.
Collapse
Affiliation(s)
- Rona Costello
- Department of Plant Sciences, University of Oxford, Oxford, United Kingdom
| | - David M Emms
- Department of Plant Sciences, University of Oxford, Oxford, United Kingdom
| | - Steven Kelly
- Department of Plant Sciences, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
8
|
Kuzmin E, VanderSluis B, Nguyen Ba AN, Wang W, Koch EN, Usaj M, Khmelinskii A, Usaj MM, van Leeuwen J, Kraus O, Tresenrider A, Pryszlak M, Hu MC, Varriano B, Costanzo M, Knop M, Moses A, Myers CL, Andrews BJ, Boone C. Exploring whole-genome duplicate gene retention with complex genetic interaction analysis. Science 2020; 368:eaaz5667. [PMID: 32586993 PMCID: PMC7539174 DOI: 10.1126/science.aaz5667] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 05/06/2020] [Indexed: 12/25/2022]
Abstract
Whole-genome duplication has played a central role in the genome evolution of many organisms, including the human genome. Most duplicated genes are eliminated, and factors that influence the retention of persisting duplicates remain poorly understood. We describe a systematic complex genetic interaction analysis with yeast paralogs derived from the whole-genome duplication event. Mapping of digenic interactions for a deletion mutant of each paralog, and of trigenic interactions for the double mutant, provides insight into their roles and a quantitative measure of their functional redundancy. Trigenic interaction analysis distinguishes two classes of paralogs: a more functionally divergent subset and another that retained more functional overlap. Gene feature analysis and modeling suggest that evolutionary trajectories of duplicated genes are dictated by combined functional and structural entanglement factors.
Collapse
Affiliation(s)
- Elena Kuzmin
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Benjamin VanderSluis
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Alex N Nguyen Ba
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Center for Analysis of Evolution and Function, University of Toronto, Toronto, Ontario, Canada
| | - Wen Wang
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Elizabeth N Koch
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Matej Usaj
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Anton Khmelinskii
- Zentrum für Molekulare Biologie der Universität Heidelberg (ZMBH), DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany
| | | | | | - Oren Kraus
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Amy Tresenrider
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Michael Pryszlak
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Ming-Che Hu
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Brenda Varriano
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Michael Costanzo
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Michael Knop
- Zentrum für Molekulare Biologie der Universität Heidelberg (ZMBH), DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany
- Cell Morphogenesis and Signal Transduction, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Alan Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Center for Analysis of Evolution and Function, University of Toronto, Toronto, Ontario, Canada
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Chad L Myers
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Brenda J Andrews
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Charles Boone
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| |
Collapse
|
9
|
Zhou Y, Li X, Katsuma S, Xu Y, Shi L, Shimada T, Wang H. Duplication and diversification of trehalase confers evolutionary advantages on lepidopteran insects. Mol Ecol 2019; 28:5282-5298. [PMID: 31674075 DOI: 10.1111/mec.15291] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 10/23/2019] [Indexed: 01/06/2023]
Abstract
Gene duplication provides a major source of new genes for evolutionary novelty and ecological adaptation. However, the maintenance of duplicated genes and their relevance to adaptive evolution has long been debated. Insect trehalase (Treh) plays key roles in energy metabolism, growth, and stress recovery. Here, we show that the duplication of Treh in Lepidoptera (butterflies and moths) is linked with their adaptation to various environmental stresses. Generally, two Treh genes are present in insects: Treh1 and Treh2. We report three distinct forms of Treh in lepidopteran insects, where Treh1 was duplicated into two gene clusters (Treh1a and Treh1b). These gene clusters differ in gene expression patterns, enzymatic properties, and subcellular localizations, suggesting that the enzymes probably underwent sub- and/or neofunctionalization in the lepidopteran insects. Interestingly, selective pressure analysis provided significant evidence of positive selection on duplicate Treh1b gene in lepidopteran insect lineages. Most positively selected sites were located in the alpha-helical region, and several sites were close to the trehalose binding and catalytic sites. Subcellular adaptation of duplicate Treh1b driven by positive selection appears to have occurred as a result of selected changes in specific sequences, allowing for rapid reprogramming of duplicated Treh during evolution. Our results suggest that gene duplication of Treh and subsequent functional diversification could increase the survival rate of lepidopteran insects through various regulations of intracellular trehalose levels, facilitating their adaptation to diverse habitats. This study provides evidence regarding the mechanism by which gene family expansion can contribute to species adaptation through gene duplication and subsequent functional diversification.
Collapse
Affiliation(s)
- Yanyan Zhou
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Xiaotong Li
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Susumu Katsuma
- Laboratory of Insect Genetics and Bioscience, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, Japan
| | - Yusong Xu
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Liangen Shi
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Toru Shimada
- Laboratory of Insect Genetics and Bioscience, Graduate School of Agricultural and Life Sciences, University of Tokyo, Tokyo, Japan
| | - Huabing Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
10
|
Decoding the Divergent Subcellular Location of Two Highly Similar Paralogous LEA Proteins. Int J Mol Sci 2018; 19:ijms19061620. [PMID: 29857468 PMCID: PMC6032150 DOI: 10.3390/ijms19061620] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2018] [Revised: 05/25/2018] [Accepted: 05/28/2018] [Indexed: 11/17/2022] Open
Abstract
Many mitochondrial proteins are synthesized as precursors in the cytosol with an N-terminal mitochondrial targeting sequence (MTS) which is cleaved off upon import. Although much is known about import mechanisms and MTS structural features, the variability of MTS still hampers robust sub-cellular software predictions. Here, we took advantage of two paralogous late embryogenesis abundant proteins (LEA) from Arabidopsis with different subcellular locations to investigate structural determinants of mitochondrial import and gain insight into the evolution of the LEA genes. LEA38 and LEA2 are short proteins of the LEA_3 family, which are very similar along their whole sequence, but LEA38 is targeted to mitochondria while LEA2 is cytosolic. Differences in the N-terminal protein sequences were used to generate a series of mutated LEA2 which were expressed as GFP-fusion proteins in leaf protoplasts. By combining three types of mutation (substitution, charge inversion, and segment replacement), we were able to redirect the mutated LEA2 to mitochondria. Analysis of the effect of the mutations and determination of the LEA38 MTS cleavage site highlighted important structural features within and beyond the MTS. Overall, these results provide an explanation for the likely loss of mitochondrial location after duplication of the ancestral gene.
Collapse
|
11
|
Dias O, Gomes D, Vilaca P, Cardoso J, Rocha M, Ferreira EC, Rocha I. Genome-Wide Semi-Automated Annotation of Transporter Systems. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:443-456. [PMID: 26887005 DOI: 10.1109/tcbb.2016.2527647] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Usually, transport reactions are added to genome-scale metabolic models (GSMMs) based on experimental data and literature. This approach does not allow associating specific genes with transport reactions, which impairs the ability of the model to predict effects of gene deletions. Novel methods for systematic genome-wide transporter functional annotation and their integration into GSMMs are therefore necessary. In this work, an automatic system to detect and classify all potential membrane transport proteins for a given genome and integrate the related reactions into GSMMs is proposed, based on the identification and classification of genes that encode transmembrane proteins. The Transport Reactions Annotation and Generation (TRIAGE) tool identifies the metabolites transported by each transmembrane protein and its transporter family. The localization of the carriers is also predicted and, consequently, their action is confined to a given membrane. The integration of the data provided by TRIAGE with highly curated models allowed the identification of new transport reactions. TRIAGE is included in the new release of merlin, a software tool previously developed by the authors, which expedites the GSMM reconstruction processes.
Collapse
|
12
|
Li S, Wang N, Ji D, Xue Z, Yu Y, Jiang Y, Liu J, Liu Z, Xiang F. Evolutionary and Functional Analysis of Membrane-Bound NAC Transcription Factor Genes in Soybean. PLANT PHYSIOLOGY 2016; 172:1804-1820. [PMID: 27670816 PMCID: PMC5100753 DOI: 10.1104/pp.16.01132] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Accepted: 09/18/2016] [Indexed: 05/07/2023]
Abstract
Functional divergence is thought to be an important evolutionary driving force for the retention of duplicate genes. We reconstructed the evolutionary history of soybean (Glycine max) membrane-bound NAC transcription factor (NTL) genes. NTLs are thought to be components of stress signaling and unique in their requirement for proteolytic cleavage to free them from the membrane. Most of the 15 GmNTL genes appear to have evolved under strong purifying selection. By analyzing the phylogenetic tree and gene synteny, we identified seven duplicate gene pairs generated by the latest whole-genome duplication. The members of each pair were shown to have variously diverged at the transcriptional (organ specificity and responsiveness to stress), posttranscriptional (alternative splicing), and protein (proteolysis-mediated membrane release and transactivation activity) levels. The dormant (full-length protein) and active (protein without a transmembrane motif) forms of one pair of duplicated gene products (GmNTL1/GmNLT11) were each separately constitutively expressed in Arabidopsis (Arabidopsis thaliana). The heteroexpression of active but not dormant forms of these proteins caused improved tolerance to abiotic stresses, suggesting that membrane release was required for their functionality. Arabidopsis carrying the dormant form of GmNTL1 was more tolerant to hydrogen peroxide, which induces its membrane release. Tolerance was not increased in the line carrying dormant GmNTL11, which was not released by hydrogen peroxide treatment. Thus, NTL-release pattern changes may cause phenotypic divergence. It was concluded that a variety of functional divergences contributed to the retention of these GmNTL duplicates.
Collapse
Affiliation(s)
- Shuo Li
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Nan Wang
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Dandan Ji
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Zheyong Xue
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Yanchong Yu
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Yupei Jiang
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Jinglin Liu
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Zhenhua Liu
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.)
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| | - Fengning Xiang
- Key Laboratory of the Plant Cell Engineering and Germplasm Innovation, School of Life Sciences, Shandong University, Jinan 250100, Shandong, China (S.L., N.W., D.J., Y.Y., Y.J., J.L., Z.L., F.X.);
- Qilu University of Technology, Jinan 250353, Shandong, China (D.J.); and
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Fragrant Hill, Beijing 100093, China (Z.X.)
| |
Collapse
|
13
|
The Glycolytic Enzyme Triosephosphate Isomerase of Trichomonas vaginalis Is a Surface-Associated Protein Induced by Glucose That Functions as a Laminin- and Fibronectin-Binding Protein. Infect Immun 2016; 84:2878-94. [PMID: 27481251 DOI: 10.1128/iai.00538-16] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2016] [Accepted: 07/13/2016] [Indexed: 12/20/2022] Open
Abstract
Triosephosphate isomerase of Trichomonas vaginalis (TvTIM) is a 27-kDa cytoplasmic protein encoded by two genes, tvtim1 and tvtim2, that participates in glucose metabolism. TvTIM is also localized to the parasite surface. Thus, the goal of this study was to identify the novel functions of the surface-associated TvTIM in T. vaginalis and to assess the effect of glucose as an environmental factor that regulates its expression and localization. Reverse transcription-PCR (RT-PCR) showed that the tvtim genes were differentially expressed in response to glucose concentration. tvtim1 was overexpressed under glucose-restricted (GR) conditions, whereas tvtim2 was overexpressed under glucose-rich, or high-glucose (HG), conditions. Western blot and indirect immunofluorescence assays also showed that glucose positively affected the amount and surface localization of TvTIM in T. vaginalis Affinity ligand assays demonstrated that the recombinant TvTIM1 and TvTIM2 proteins bound to laminin (Lm) and fibronectin (Fn) but not to plasminogen. Moreover, higher levels of adherence to Lm and Fn were detected in parasites grown under HG conditions than in those grown under GR conditions. Furthermore, pretreatment of trichomonads with an anti-TvTIMr polyclonal antibody or pretreatment of Lm- or Fn-coated wells with both recombinant proteins (TvTIM1r and TvTIM2r) specifically reduced the binding of live parasites to Lm and Fn in a concentration-dependent manner. Moreover, T. vaginalis was exposed to different glucose concentrations during vaginal infection of women with trichomoniasis. Our data indicate that TvTIM is a surface-associated protein under HG conditions that mediates specific binding to Lm and Fn as a novel virulence factor of T. vaginalis.
Collapse
|
14
|
He G, Guan CN, Chen QX, Gou XJ, Liu W, Zeng QY, Lan T. Genome-Wide Analysis of the Glutathione S-Transferase Gene Family in Capsella rubella: Identification, Expression, and Biochemical Functions. FRONTIERS IN PLANT SCIENCE 2016; 7:1325. [PMID: 27630652 PMCID: PMC5005422 DOI: 10.3389/fpls.2016.01325] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Accepted: 08/18/2016] [Indexed: 05/06/2023]
Abstract
Extensive subfunctionalization might explain why so many genes have been maintained after gene duplication, which provides the engine for gene family expansion. However, it is still a particular challenge to trace the evolutionary dynamics and features of functional divergences in a supergene family over the course of evolution. In this study, we identified 49 Glutathione S-transferase (GST) genes from the Capsella rubella, a close relative of Arabidopsis thaliana and a member of the mustard family. Capsella GSTs can be categorized into eight classes, with tau and phi GSTs being the most numerous. The expansion of the two classes mainly occurs through tandem gene duplication, which results in tandem-arrayed gene clusters on chromosomes. By integrating phylogenetic analysis, expression patterns, and biochemical functions of Capsella and Arabidopsis GSTs, functional divergence, both in gene expression and enzymatic properties, were clearly observed in paralogous gene pairs in Capsella (even the most recent duplicates), and orthologous GSTs in Arabidopsis/Capsella. This study provides functional evidence for the expansion and organization of a large gene family in closely related species.
Collapse
Affiliation(s)
- Gang He
- Functional Genomics and Protein Evolution Group, State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of SciencesBeijing, China
- The Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Commission, Chengdu UniversityChengdu, China
| | - Chao-Nan Guan
- College of Biological Sciences and Biotechnology, Beijing Forestry UniversityBeijing, China
| | - Qiang-Xin Chen
- Functional Genomics and Protein Evolution Group, State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of SciencesBeijing, China
| | - Xiao-Jun Gou
- The Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Commission, Chengdu UniversityChengdu, China
| | - Wei Liu
- The Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Commission, Chengdu UniversityChengdu, China
| | - Qing-Yin Zeng
- Functional Genomics and Protein Evolution Group, State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of SciencesBeijing, China
| | - Ting Lan
- Functional Genomics and Protein Evolution Group, State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of SciencesBeijing, China
| |
Collapse
|
15
|
Acharya D, Ghosh TC. Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution. BMC Genomics 2016; 17:71. [PMID: 26801093 PMCID: PMC4724117 DOI: 10.1186/s12864-016-2392-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Accepted: 01/13/2016] [Indexed: 12/13/2022] Open
Abstract
Background Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. Results We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Conclusions Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2392-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Debarun Acharya
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700054, West Bengal, India
| | - Tapash C Ghosh
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700054, West Bengal, India.
| |
Collapse
|
16
|
Byun SA, Singh S. Protein subcellular relocalization increases the retention of eukaryotic duplicate genes. Genome Biol Evol 2014; 5:2402-9. [PMID: 24265504 PMCID: PMC3879971 DOI: 10.1093/gbe/evt183] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Gene duplication is widely accepted as a key evolutionary process, leading to new genes and novel protein functions. By providing the raw genetic material necessary for functional expansion, the mechanisms that involve the retention and functional diversification of duplicate genes are one of the central topics in evolutionary and comparative genomics. One proposed source of retention and functional diversification is protein subcellular relocalization (PSR). PSR postulates that changes in the subcellular location of eukaryotic duplicate proteins can positively modify function and therefore be beneficial to the organism. As such, PSR would promote retention of those relocalized duplicates and result in significantly lower death rates compared with death rates of nonrelocalized duplicate pairs. We surveyed both relocalized and nonrelocalized duplicate proteins from the available genomes and proteomes of 59 eukaryotic species and compared their relative death rates over a Ks range between 0 and 1. Using the Cox proportional hazard model, we observed that the death rates of relocalized duplicate pairs were significantly lower than the death rates of the duplicates without relocalization in most eukaryotic species examined in this study. These observations suggest that PSR significantly increases retention of duplicate genes and that it plays an important, but currently underappreciated, role in the evolution of eukaryotic genomes.
Collapse
|
17
|
Ren LL, Liu YJ, Liu HJ, Qian TT, Qi LW, Wang XR, Zeng QY. Subcellular Relocalization and Positive Selection Play Key Roles in the Retention of Duplicate Genes of Populus Class III Peroxidase Family. THE PLANT CELL 2014; 26:2404-2419. [PMID: 24934172 PMCID: PMC4114941 DOI: 10.1105/tpc.114.124750] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Revised: 04/18/2014] [Accepted: 05/24/2014] [Indexed: 05/20/2023]
Abstract
Gene duplication is the primary source of new genes and novel functions. Over the course of evolution, many duplicate genes lose their function and are eventually removed by deletion. However, some duplicates have persisted and evolved diverse functions. A particular challenge is to understand how this diversity arises and whether positive selection plays a role. In this study, we reconstructed the evolutionary history of the class III peroxidase (PRX) genes from the Populus trichocarpa genome. PRXs are plant-specific enzymes that play important roles in cell wall metabolism and in response to biotic and abiotic stresses. We found that two large tandem-arrayed clusters of PRXs evolved from an ancestral cell wall type PRX to vacuole type, followed by tandem duplications and subsequent functional specification. Substitution models identified seven positively selected sites in the vacuole PRXs. These positively selected sites showed significant effects on the biochemical functions of the enzymes. We also found that positive selection acts more frequently on residues adjacent to, rather than directly at, a critical active site of the enzyme, and on flexible regions rather than on rigid structural elements of the protein. Our study provides new insights into the adaptive molecular evolution of plant enzyme families.
Collapse
Affiliation(s)
- Lin-Ling Ren
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yan-Jing Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Hai-Jing Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Ting-Ting Qian
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Li-Wang Qi
- Laboratory of Cell Biology, Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, China
| | - Xiao-Ru Wang
- Department of Ecology and Environmental Science, UPSC, Umeå University, SE-90187 Umeå, Sweden
| | - Qing-Yin Zeng
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| |
Collapse
|
18
|
Zarin T, Moses AM. Insights into molecular evolution from yeast genomics. Yeast 2014; 31:233-41. [PMID: 24760744 DOI: 10.1002/yea.3018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2014] [Revised: 04/09/2014] [Accepted: 04/10/2014] [Indexed: 12/13/2022] Open
Abstract
Enabled by comparative genomics, yeasts have increasingly developed into a powerful model system for molecular evolution. Here we survey several areas in which yeast studies have made important contributions, including regulatory evolution, gene duplication and divergence, evolution of gene order and evolution of complexity. In each area we highlight key studies and findings based on techniques ranging from statistical analysis of large datasets to direct laboratory measurements of fitness. Future work will combine traditional evolutionary genetics analysis and experimental evolution with tools from systems biology to yield mechanistic insight into complex phenotypes.
Collapse
Affiliation(s)
- Taraneh Zarin
- Department of Cell and Systems Biology, University of Toronto, ON, Canada
| | | |
Collapse
|
19
|
Liu YJ, Han XM, Ren LL, Yang HL, Zeng QY. Functional divergence of the glutathione S-transferase supergene family in Physcomitrella patens reveals complex patterns of large gene family evolution in land plants. PLANT PHYSIOLOGY 2013; 161. [PMID: 23188805 PMCID: PMC3561018 DOI: 10.1104/pp.112.205815] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Plant glutathione S-transferases (GSTs) are multifunctional proteins encoded by a large gene family that play major roles in the detoxification of xenobiotics and oxidative stress metabolism. To date, studies on the GST gene family have focused mainly on vascular plants (particularly agricultural plants). In contrast, little information is available on the molecular characteristics of this large gene family in nonvascular plants. In addition, the evolutionary patterns of this family in land plants remain unclear. In this study, we identified 37 GST genes from the whole genome of the moss Physcomitrella patens, a nonvascular representative of early land plants. The 37 P. patens GSTs were divided into 10 classes, including two new classes (hemerythrin and iota). However, no tau GSTs were identified, which represent the largest class among vascular plants. P. patens GST gene family members showed extensive functional divergence in their gene structures, gene expression responses to abiotic stressors, enzymatic characteristics, and the subcellular locations of the encoded proteins. A joint phylogenetic analysis of GSTs from P. patens and other higher vascular plants showed that different class GSTs had distinct duplication patterns during the evolution of land plants. By examining multiple characteristics, this study revealed complex patterns of evolutionary divergence among the GST gene family in land plants.
Collapse
|
20
|
The ortholog conjecture is untestable by the current gene ontology but is supported by RNA sequencing data. PLoS Comput Biol 2012; 8:e1002784. [PMID: 23209392 PMCID: PMC3510086 DOI: 10.1371/journal.pcbi.1002784] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 10/02/2012] [Indexed: 11/19/2022] Open
Abstract
The ortholog conjecture posits that orthologous genes are functionally more similar than paralogous genes. This conjecture is a cornerstone of phylogenomics and is used daily by both computational and experimental biologists in predicting, interpreting, and understanding gene functions. A recent study, however, challenged the ortholog conjecture on the basis of experimentally derived Gene Ontology (GO) annotations and microarray gene expression data in human and mouse. It instead proposed that the functional similarity of homologous genes is primarily determined by the cellular context in which the genes act, explaining why a greater functional similarity of (within-species) paralogs than (between-species) orthologs was observed. Here we show that GO-based functional similarity between human and mouse orthologs, relative to that between paralogs, has been increasing in the last five years. Further, compared with paralogs, orthologs are less likely to be included in the same study, causing an underestimation in their functional similarity. A close examination of functional studies of homologs with identical protein sequences reveals experimental biases, annotation errors, and homology-based functional inferences that are labeled in GO as experimental. These problems and the temporary nature of the GO-based finding make the current GO inappropriate for testing the ortholog conjecture. RNA sequencing (RNA-Seq) is known to be superior to microarray for comparing the expressions of different genes or in different species. Our analysis of a large RNA-Seq dataset of multiple tissues from eight mammals and the chicken shows that the expression similarity between orthologs is significantly higher than that between within-species paralogs, supporting the ortholog conjecture and refuting the cellular context hypothesis for gene expression. We conclude that the ortholog conjecture remains largely valid to the extent that it has been tested, but further scrutiny using more and better functional data is needed.
Collapse
|
21
|
Altenhoff AM, Studer RA, Robinson-Rechavi M, Dessimoz C. Resolving the ortholog conjecture: orthologs tend to be weakly, but significantly, more similar in function than paralogs. PLoS Comput Biol 2012; 8:e1002514. [PMID: 22615551 PMCID: PMC3355068 DOI: 10.1371/journal.pcbi.1002514] [Citation(s) in RCA: 133] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 03/26/2012] [Indexed: 02/07/2023] Open
Abstract
The function of most proteins is not determined experimentally, but is extrapolated from homologs. According to the “ortholog conjecture”, or standard model of phylogenomics, protein function changes rapidly after duplication, leading to paralogs with different functions, while orthologs retain the ancestral function. We report here that a comparison of experimentally supported functional annotations among homologs from 13 genomes mostly supports this model. We show that to analyze GO annotation effectively, several confounding factors need to be controlled: authorship bias, variation of GO term frequency among species, variation of background similarity among species pairs, and propagated annotation bias. After controlling for these biases, we observe that orthologs have generally more similar functional annotations than paralogs. This is especially strong for sub-cellular localization. We observe only a weak decrease in functional similarity with increasing sequence divergence. These findings hold over a large diversity of species; notably orthologs from model organisms such as E. coli, yeast or mouse have conserved function with human proteins. To infer the function of an unknown gene, possibly the most effective way is to identify a well-characterized evolutionarily related gene, and assume that they have both kept their ancestral function. If several such homologs are available, all else being equal, it has long been assumed that those that diverged by speciation (“ortholog”) are functionally closer than those that diverged by duplication (“paralogs”); thus function is more reliably inferred from the former. But despite its prevalence, this model mostly rests on first principles, as for the longest time we have not had sufficient data to test it empirically. Recently, some studies began investigating this question and have cast doubt on the validity of this model. Here, we show that by considering a wide range of organisms and data, and, crucially, by correcting for several easily overlooked biases affecting functional annotations, the standard model is corroborated by the presently available experimental data.
Collapse
Affiliation(s)
- Adrian M. Altenhoff
- ETH Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Romain A. Studer
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, United Kingdom
| | - Marc Robinson-Rechavi
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Christophe Dessimoz
- ETH Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- EMBL-European Bioinformatics Institute, Hinxton, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
22
|
Abstract
Despite our extensive knowledge about the rate of protein sequence evolution for thousands of genes in hundreds of species, the corresponding rate of protein function evolution is virtually unknown, especially at the genomic scale. This lack of knowledge is primarily because of the huge diversity in protein function and the consequent difficulty in gauging and comparing rates of protein function evolution. Nevertheless, most proteins function through interacting with other proteins, and protein-protein interaction (PPI) can be tested by standard assays. Thus, the rate of protein function evolution may be measured by the rate of PPI evolution. Here, we experimentally examine 87 potential interactions between Kluyveromyces waltii proteins, whose one to one orthologs in the related budding yeast Saccharomyces cerevisiae have been reported to interact. Combining our results with available data from other eukaryotes, we estimate that the evolutionary rate of protein interaction is (2.6 ± 1.6) × 10(-10) per PPI per year, which is three orders of magnitude lower than the rate of protein sequence evolution measured by the number of amino acid substitutions per protein per year. The extremely slow evolution of protein molecular function may account for the remarkable conservation of life at molecular and cellular levels and allow for studying the mechanistic basis of human disease in much simpler organisms.
Collapse
|
23
|
Characterization of the deleted in autism 1 protein family: implications for studying cognitive disorders. PLoS One 2011; 6:e14547. [PMID: 21283809 PMCID: PMC3023760 DOI: 10.1371/journal.pone.0014547] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2010] [Accepted: 12/21/2010] [Indexed: 12/21/2022] Open
Abstract
Autism spectrum disorders (ASDs) are a group of commonly occurring, highly-heritable developmental disabilities. Human genes c3orf58 or Deleted In Autism-1 (DIA1) and cXorf36 or Deleted in Autism-1 Related (DIA1R) are implicated in ASD and mental retardation. Both gene products encode signal peptides for targeting to the secretory pathway. As evolutionary medicine has emerged as a key tool for understanding increasing numbers of human diseases, we have used an evolutionary approach to study DIA1 and DIA1R. We found DIA1 conserved from cnidarians to humans, indicating DIA1 evolution coincided with the development of the first primitive synapses. Nematodes lack a DIA1 homologue, indicating Caenorhabditis elegans is not suitable for studying all aspects of ASD etiology, while zebrafish encode two DIA1 paralogues. By contrast to DIA1, DIA1R was found exclusively in vertebrates, with an origin coinciding with the whole-genome duplication events occurring early in the vertebrate lineage, and the evolution of the more complex vertebrate nervous system. Strikingly, DIA1R was present in schooling fish but absent in fish that have adopted a more solitary lifestyle. An additional DIA1-related gene we named DIA1-Like (DIA1L), lacks a signal peptide and is restricted to the genomes of the echinoderm Strongylocentrotus purpuratus and cephalochordate Branchiostoma floridae. Evidence for remarkable DIA1L gene expansion was found in B. floridae. Amino acid alignments of DIA1 family gene products revealed a potential Golgi-retention motif and a number of conserved motifs with unknown function. Furthermore, a glycine and three cysteine residues were absolutely conserved in all DIA1-family proteins, indicating a critical role in protein structure and/or function. We have therefore identified a new metazoan protein family, the DIA1-family, and understanding the biological roles of DIA1-family members will have implications for our understanding of autism and mental retardation.
Collapse
|
24
|
Liao BY, Weng MP, Zhang J. Impact of extracellularity on the evolutionary rate of mammalian proteins. Genome Biol Evol 2010; 2:39-43. [PMID: 20333223 PMCID: PMC2839354 DOI: 10.1093/gbe/evp058] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/23/2009] [Indexed: 01/04/2023] Open
Abstract
It is of fundamental importance to understand the determinants of the rate of protein evolution. Eukaryotic extracellular proteins are known to evolve faster than intracellular proteins. Although this rate difference appears to be due to the lower essentiality of extracellular proteins than intracellular proteins in yeast, we here show that, in mammals, the impact of extracellularity is independent from the impact of gene essentiality. Our partial correlation analysis indicated that the impact of extracellularity on mammalian protein evolutionary rate is also independent from those of tissue-specificity, expression level, gene compactness, and the number of protein–protein interactions and, surprisingly, is the strongest among all the factors we examined. Similar results were also found from principal component regression analysis. Our findings suggest that different rules govern the pace of protein sequence evolution in mammals and yeasts.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Miaoli County 350, Taiwan, ROC
- Corresponding author: E-mail:
| | - Meng-Pin Weng
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Miaoli County 350, Taiwan, ROC
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan
| |
Collapse
|