1
|
Hajieghrari B, Niazi A. Phylogenetic and Evolutionary Analysis of Plant Small RNA 2'-O-Methyltransferase (HEN1) Protein Family. J Mol Evol 2023:10.1007/s00239-023-10109-0. [PMID: 37191719 DOI: 10.1007/s00239-023-10109-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 04/05/2023] [Indexed: 05/17/2023]
Abstract
HUA ENHANCER 1 (HEN1) is a pivotal mediator in protecting sRNAs from 3'-end uridylation and 3' to 5' exonuclease-mediated degradation in plants. Here, we investigated the pattern of the HEN1 protein family evolutionary history and possible relationships in the plant lineages using protein sequence analyses and conserved motifs composition, functional domain identification, architecture, and phylogenetic tree reconstruction and evolutionary history inference. According to our results, HEN1 protein sequences bear several highly conserved motifs in plant species retained during the evolution from their ancestor. However, several motifs are present only in Gymnosperms and Angiosperms. A similar trend showed for their domain architecture. At the same time, phylogenetic analysis revealed the grouping of the HEN1 proteins in the three main super clads. In addition, the Neighbor-net network analysis result provides some nodes have multiple parents indicating a few conflicting signals in the data, which is not the consequence of sampling error, the effect of the selected model, or the estimation method. By reconciling the protein and species tree, we considered the gene duplications in several given species and found 170 duplication events in the evolution of HEN1 in the plant lineages. According to our analysis, the main HEN1 superclass mostly showed orthologous sequences that illustrate the vertically transmitting of HEN1 to the main lines. However, in both orthologous and paralogs, we predicted insignificant structural deviations. Our analysis implies that small local structural changes that occur continuously during the folds can moderate the changes created in the sequence. According to our results, we proposed a hypothetical model and evolutionary trajectory for the HEN1 protein family in the plant kingdom.
Collapse
Affiliation(s)
- Behzad Hajieghrari
- Department of Agricultural Biotechnology, College of Agriculture, Jahrom University, P.O. Box 74135-111, Jahrom, Islamic Republic of Iran.
| | - Ali Niazi
- Institute of Biotechnology, School of Agriculture, Shiraz University, Shiraz, Islamic Republic of Iran
| |
Collapse
|
2
|
Bajczyk M, Jarmolowski A, Jozwiak M, Pacak A, Pietrykowska H, Sierocka I, Swida-Barteczka A, Szewc L, Szweykowska-Kulinska Z. Recent Insights into Plant miRNA Biogenesis: Multiple Layers of miRNA Level Regulation. PLANTS (BASEL, SWITZERLAND) 2023; 12:plants12020342. [PMID: 36679055 PMCID: PMC9864873 DOI: 10.3390/plants12020342] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 01/06/2023] [Accepted: 01/08/2023] [Indexed: 05/27/2023]
Abstract
MicroRNAs are small RNAs, 20-22 nt long, the main role of which is to downregulate gene expression at the level of mRNAs. MiRNAs are fundamental regulators of plant growth and development in response to internal signals as well as in response to abiotic and biotic factors. Therefore, the deficiency or excess of individual miRNAs is detrimental to particular aspects of a plant's life. In consequence, the miRNA levels must be appropriately adjusted. To obtain proper expression of each miRNA, their biogenesis is controlled at multiple regulatory layers. Here, we addressed processes discovered to influence miRNA steady-state levels, such as MIR transcription, co-transcriptional pri-miRNA processing (including splicing, polyadenylation, microprocessor assembly and activity) and miRNA-encoded peptides synthesis. MiRNA stability, RISC formation and miRNA export out of the nucleus and out of the plant cell also define the levels of miRNAs in various plant tissues. Moreover, we show the evolutionary conservation of miRNA biogenesis core proteins across the plant kingdom.
Collapse
|
3
|
Comparative phylogeny and evolutionary analysis of Dicer-like protein family in two plant monophyletic lineages. J Genet Eng Biotechnol 2022; 20:103. [PMID: 35821291 PMCID: PMC9276914 DOI: 10.1186/s43141-022-00380-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 06/14/2022] [Indexed: 11/10/2022]
Abstract
BACKGROUND Small RNAs (sRNAs) that do not get untranslated into proteins exhibit a pivotal role in the expression regulation of their cognate gene(s) in almost all eukaryotic lineages, including plants. Hitherto, numerous protein families such as Dicer, a unique class of Ribonuclease III, have been reported to be involved in sRNAs processing pathways and silencing. In this study, we aimed to investigate the phylogenetic relationship and evolutionary history of the DCL protein family. RESULTS Our results illustrated the DCL family of proteins grouped into four main subfamilies (DCLs 1-4) presented in either Eudicotyledons or Liliopsids. The accurate observation of the phylogenetic trees supports the independent expansion of DCL proteins among the Eudicotyledons and Liliopsids species. They share the common origin, and the main duplication events for the formation of the DCL subfamilies occurred before the Eudicotyledons/Liliopsids split from their ancestral DCL. In addition, shreds of evidence revealed that the divergence happened when multicellularization started and since the need for complex gene regulation considered being a necessity by organisms. At that time, they have evolved independently among the monophyletic lineages. The other finding was that the combination of DCL protein subfamilies bears several highly conserved functional domains in plant species that originated from their ancestor architecture. The conservation of these domains happens to be both lineage-specific and inter lineage-specific. CONCLUSIONS DCL subfamilies (i.e., DCL1-DCL4) distribute in their single clades after diverging from their common ancestor and before emerging into higher plants. Therefore, it seems that the main duplication events for the formation of the DCL subfamilies occurred before the Eudicotyledons/Liliopsida split and before the appearance of moss, and after the single-cell green algae. We also observed the same trends among the main DCL subfamilies from functional unit composition and architecture. Despite the long evolutionary course from the divergence of Liliopsida lineage from the Eudicotyledons, a significant diversifying force to domain composition and orientation was absent. The results of this study provide a deeper insight into DCL protein evolutionary history and possible sequence and structural relationships between DCL protein subfamilies in the main higher plant monophyletic lineages; i.e., Eudicotyledons and Liliopsida.
Collapse
|
4
|
Tassia MG, David KT, Townsend JP, Halanych KM. TIAMMAt: Leveraging biodiversity to revise protein domain models, evidence from innate immunity. Mol Biol Evol 2021; 38:5806-5818. [PMID: 34459919 PMCID: PMC8662601 DOI: 10.1093/molbev/msab258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Sequence annotation is fundamental for studying the evolution of protein families, particularly when working with nonmodel species. Given the rapid, ever-increasing number of species receiving high-quality genome sequencing, accurate domain modeling that is representative of species diversity is crucial for understanding protein family sequence evolution and their inferred function(s). Here, we describe a bioinformatic tool called Taxon-Informed Adjustment of Markov Model Attributes (TIAMMAt) which revises domain profile hidden Markov models (HMMs) by incorporating homologous domain sequences from underrepresented and nonmodel species. Using innate immunity pathways as a case study, we show that revising profile HMM parameters to directly account for variation in homologs among underrepresented species provides valuable insight into the evolution of protein families. Following adjustment by TIAMMAt, domain profile HMMs exhibit changes in their per-site amino acid state emission probabilities and insertion/deletion probabilities while maintaining the overall structure of the consensus sequence. Our results show that domain revision can heavily impact evolutionary interpretations for some families (i.e., NLR’s NACHT domain), whereas impact on other domains (e.g., rel homology domain and interferon regulatory factor domains) is minimal due to high levels of sequence conservation across the sampled phylogenetic depth (i.e., Metazoa). Importantly, TIAMMAt revises target domain models to reflect homologous sequence variation using the taxonomic distribution under consideration by the user. TIAMMAt’s flexibility to revise any subset of the Pfam database using a user-defined taxonomic pool will make it a valuable tool for future protein evolution studies, particularly when incorporating (or focusing) on nonmodel species.
Collapse
Affiliation(s)
- Michael G Tassia
- Department of Biological Sciences, Auburn University, Auburn, Alabama
| | - Kyle T David
- Department of Biological Sciences, Auburn University, Auburn, Alabama
| | - James P Townsend
- Whitman Center, Marine Biological Laboratory, Woods Hole, Massachusetts.,Department of Biology, Providence College, Providence, Rhode Island
| | | |
Collapse
|
5
|
Nayar S, Thangavel G. CsubMADS1, a lag phase transcription factor, controls development of polar eukaryotic microalga Coccomyxa subellipsoidea C-169. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 107:1228-1242. [PMID: 34160095 DOI: 10.1111/tpj.15380] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/30/2021] [Accepted: 06/19/2021] [Indexed: 06/13/2023]
Abstract
MADS-box transcription factors (TFs) have not been functionally delineated in microalgae. In this study, the role of CsubMADS1 from microalga Coccomyxa subellipsoidea C-169 has been explored. Unlike Type II MADS-box proteins of seed plants with MADS, Intervening, K-box, and C domains, CsubMADS1 only has MADS and Intervening domains. It forms a group with MADS TFs from algae in the phylogenetic tree within the Type II MIKCC clade. CsubMADS1 is expressed strongly in the lag phase of growth. The CsubMADS1 monomer does not have a specific localization in the nucleus, and it forms homodimers to localize exclusively in the nucleus. The monomer has two nuclear localization signals (NLSs): an N-terminal NLS and an internal NLS. The internal NLS is functional, and the homodimer requires two NLSs for specific nuclear localization. Overexpression (OX) of CsubMADS1 slows down the growth of the culture and leads to the creation of giant polyploid multinucleate cells, resembling autospore mother cells. This implies that the release of autospores from autospore mother cells may be delayed. Thus, in wild-type (WT) cells, CsubMADS1 may play a crucial role in slowing down growth during the lag phase. Due to starvation in 2-month-old colonies on solid media, the WT colonies produce mucilage, whereas OX colonies produce significantly less mucilage. Thus, CsubMADS1 also negatively regulates stress-induced mucilage production and probably plays a role in stress tolerance during the lag phase. Taken together, our results reveal that CsubMADS1 is a key TF involved in the development and stress tolerance of this polar microalga.
Collapse
Affiliation(s)
- Saraswati Nayar
- Division of Plant Molecular Biology, Rajiv Gandhi Centre for Biotechnology, Thiruvananthapuram, 695014, India
| | - Gokilavani Thangavel
- Division of Plant Molecular Biology, Rajiv Gandhi Centre for Biotechnology, Thiruvananthapuram, 695014, India
| |
Collapse
|
6
|
Mix and match: Patchwork domain evolution of the land plant-specific Ca2+-permeable mechanosensitive channel MCA. PLoS One 2021; 16:e0249735. [PMID: 33857196 PMCID: PMC8049495 DOI: 10.1371/journal.pone.0249735] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 03/23/2021] [Indexed: 11/19/2022] Open
Abstract
Multidomain proteins can have a complex evolutionary history that may involve de novo domain evolution, recruitment and / or recombination of existing domains and domain losses. Here, the domain evolution of the plant-specific Ca2+-permeable mechanosensitive channel protein, MID1-COMPLEMENTING ACTIVITY (MCA), was investigated. MCA, a multidomain protein, possesses a Ca2+-influx-MCAfunc domain and a PLAC8 domain. Profile Hidden Markov Models (HMMs) of domains were assessed in 25 viridiplantae proteomes. While PLAC8 was detected in plants, animals, and fungi, MCAfunc was found in streptophytes but not in chlorophytes. Full MCA proteins were only found in embryophytes. We identified the MCAfunc domain in all streptophytes including charophytes where it appeared in E3 ubiquitin ligase-like proteins. Our Maximum Likelihood (ML) analyses suggested that the MCAfunc domain evolved early in the history of streptophytes. The PLAC8 domain showed similarity to Plant Cadmium Resistance (PCR) genes, and the coupling of MCAfunc and PLAC8 seemed to represent a single evolutionary event. This combination is unique in MCA, and does not exist in other plant mechanosensitive channels. Within angiosperms, gene duplications increased the number of MCAs. Considering their role in mechanosensing in roots, MCA might be instrumental for the rise of land plants. This study provides a textbook example of de novo domain emergence, recombination, duplication, and losses, leading to the convergence of function of proteins in plants.
Collapse
|
7
|
Xiao X, Xue GF, Stamatovic B, Qiu WR. Using Cellular Automata to Simulate Domain Evolution in Proteins. Front Genet 2020; 11:515. [PMID: 32582278 PMCID: PMC7296063 DOI: 10.3389/fgene.2020.00515] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 04/28/2020] [Indexed: 11/26/2022] Open
Abstract
Proteins play primary roles in important biological processes such as catalysis, physiological functions, and immune system functions. Thus, the research on how proteins evolved has been a nuclear question in the field of evolutionary biology. General models of protein evolution help to determine the baseline expectations for evolution of sequences, and these models have been extensively useful in sequence analysis as well as for the computer simulation of artificial sequence data sets. We have developed a new method of simulating multi-domain protein evolution, including fusions of domains, insertion, and deletion. It has been observed via the simulation test that the success rates achieved by the proposed predictor are remarkably high. For the convenience of the most experimental scientists, a user-friendly web server has been established at http://jci-bioinfo.cn/domainevo, by which users can easily get their desired results without having to go through the detailed mathematics. Through the simulation results of this website, users can predict the evolution trend of the protein domain architecture.
Collapse
Affiliation(s)
- Xuan Xiao
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Guang-Fu Xue
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Biljana Stamatovic
- Faculty of Information Systems and Technologies, University of Donja Gorica, Podgorica, Montenegro
| | - Wang-Ren Qiu
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| |
Collapse
|
8
|
Garrido-Bigotes A, Valenzuela-Riffo F, Figueroa CR. Evolutionary Analysis of JAZ Proteins in Plants: An Approach in Search of the Ancestral Sequence. Int J Mol Sci 2019; 20:ijms20205060. [PMID: 31614709 PMCID: PMC6829463 DOI: 10.3390/ijms20205060] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 10/09/2019] [Accepted: 10/10/2019] [Indexed: 12/20/2022] Open
Abstract
Jasmonates are phytohormones that regulate development, metabolism and immunity. Signal transduction is critical to activate jasmonate responses, but the evolution of some key regulators such as jasmonate-ZIM domain (JAZ) repressors is not clear. Here, we identified 1065 JAZ sequence proteins in 66 lower and higher plants and analyzed their evolution by bioinformatics methods. We found that the TIFY and Jas domains are highly conserved along the evolutionary scale. Furthermore, the canonical degron sequence LPIAR(R/K) of the Jas domain is conserved in lower and higher plants. It is noteworthy that degron sequences showed a large number of alternatives from gymnosperms to dicots. In addition, ethylene-responsive element binding factor-associated amphiphilic repression (EAR) motifs are displayed in all plant lineages from liverworts to angiosperms. However, the cryptic MYC2-interacting domain (CMID) domain appeared in angiosperms for the first time. The phylogenetic analysis performed using the Maximum Likelihood method indicated that JAZ ortholog proteins are grouped according to their similarity and plant lineage. Moreover, ancestral JAZ sequences were constructed by PhyloBot software and showed specific changes in the TIFY and Jas domains during evolution from liverworts to dicots. Finally, we propose a model for the evolution of the ancestral sequences of the main eight JAZ protein subgroups. These findings contribute to the understanding of the JAZ family origin and expansion in land plants.
Collapse
Affiliation(s)
- Adrián Garrido-Bigotes
- Laboratorio de Epigenética Vegetal, Facultad de Ciencias Forestales, Universidad de Concepción; Concepción 4070386, Chile.
| | - Felipe Valenzuela-Riffo
- Institute of Biological Sciences, Campus Talca, Universidad de Talca, Talca 34655488, Chile.
| | - Carlos R Figueroa
- Institute of Biological Sciences, Campus Talca, Universidad de Talca, Talca 34655488, Chile.
| |
Collapse
|
9
|
Wang L, Yang J, Xu Y, Piao X, Lv J. Domain-based Comparative Analysis of Bacterial Proteomes: Uniqueness, Interactions, and the Dark Matter. Curr Genomics 2019; 20:115-123. [PMID: 31555062 PMCID: PMC6728903 DOI: 10.2174/1389202920666190320134438] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2018] [Revised: 11/15/2018] [Accepted: 01/10/2019] [Indexed: 01/05/2023] Open
Abstract
Background Proteins may have none, single, double, or multiple domains, while a single domain may appear in multiple proteins. Their distribution patterns may have impacts on bacterial physi-ology and lifestyle.Objective: This study aims to understand how domains are distributed and duplicated in bacterial prote-omes, in order to better understand bacterial physiology and lifestyles. Methods In this study, we used 16712 Hidden Markov Models to screen 944 bacterial reference prote-omes versus a threshold E-value<0.001. The number of non-redundant domains and duplication rates of redundant domains for each species were calculated. The unique domains, if any, were also identified for each species. In addition, the properties of no-domain proteins were investigated in terms of physico-chemical properties. Results The increasing number of non-redundant domains for a bacterial proteome follows the trend of an asymptotic function. The domain duplication rate is positively correlated with proteome size and in-creases more rapidly. The high percentage of single-domain proteins is more associated with small pro-teome size. For each proteome, unique domains were also obtained. Moreover, no-domain proteins show differences with the other three groups for several physicochemical properties analysed in this study. Conclusion The study confirmed that a low domain duplication rate and a high percentage of single-domain proteins are more likely to be associated with bacterial host-dependent or restricted niche-adapted lifestyle. In addition, the unique lifestyle and physiology were revealed based on the analysis of species-specific domains and core domain interactions or co-occurrences.
Collapse
Affiliation(s)
- Liang Wang
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Jianye Yang
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Yaping Xu
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Xue Piao
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Jichang Lv
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| |
Collapse
|
10
|
Abstract
This chapter reviews current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this will directly impact which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multi-domain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly). We end by a discussion of some available tools for computational analysis or exploitation of protein domain architectures and their evolution.
Collapse
|
11
|
Bitard‐Feildel T, Lamiable A, Mornon J, Callebaut I. Order in Disorder as Observed by the "Hydrophobic Cluster Analysis" of Protein Sequences. Proteomics 2018; 18:e1800054. [PMID: 30299594 PMCID: PMC7168002 DOI: 10.1002/pmic.201800054] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 08/29/2018] [Indexed: 12/17/2022]
Abstract
Hydrophobic cluster analysis (HCA) is an original approach for protein sequence analysis, which provides access to the foldable repertoire of the protein universe, including yet unannotated protein segments ("dark proteome"). Foldable segments correspond to ordered regions, as well as to intrinsically disordered regions (IDRs) undergoing disorder to order transitions. In this review, how HCA can be used to give insight into this last category of foldable segments is illustrated, with examples matching known 3D structures. After reviewing the HCA principles, examples of short foldable segments are given, which often contain short linear motifs, typically matching hydrophobic clusters. These segments become ordered upon contact with partners, with secondary structure preferences generally corresponding to those observed in the 3D structures within the complexes. Such small foldable segments are sometimes larger than the segments of known 3D structures, including flanking hydrophobic clusters that may be critical for interaction specificity or regulation, as well as intervening sequences allowing fuzziness. Cases of larger conditionally disordered domains are also presented, with lower density in hydrophobic clusters than well-folded globular domains or with exposed hydrophobic patches, which are stabilized by interaction with partners.
Collapse
Affiliation(s)
- Tristan Bitard‐Feildel
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
- Laboratoire de Biologie Computationnelle et Quantitative (LCQB)Institute of Biology Paris‐Seine (IBPS)Centre national de la recherche scientifique (CNRS)Sorbonne Université75005ParisFrance
| | - Alexis Lamiable
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| | - Jean‐Paul Mornon
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| | - Isabelle Callebaut
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| |
Collapse
|
12
|
Kapase VU, Nesamma AA, Jutur PP. Identification and characterization of candidates involved in production of OMEGAs in microalgae: a gene mining and phylogenomic approach. Prep Biochem Biotechnol 2018; 48:619-628. [PMID: 29932840 DOI: 10.1080/10826068.2018.1476886] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Optimizing the production of the high-value renewables such as OMEGAs through pathway engineering requires an in-depth understanding of the structure-function relationship of genes involved in the OMEGA biosynthetic pathways. In this preliminary study, our rationale is to identify and characterize the ∼221 putative genes involved in production of OMEGAs using bioinformatic analysis from the Streptophyte (plants), Chlorophyte (green algae), Rhodophyta (red algae), and Bacillariophyta (diatoms) lineages based on their phylogenomic profiling, conserved motif/domain organization and physico-chemical properties. The MEME suite predicted 12 distinct protein domains, which are conserved among these putative genes. The phylogenomic analysis of the putative candidate genes [such as FAD2 (delta-12 desaturase); ECR (enoyl-CoA reductase); FAD2 (delta-12 desaturase); ACOT (acyl CoA thioesterase); ECH (enoyl-CoA hydratase); and ACAT (acetyl-CoA acyltransferase)] with similar domains and motif patterns were remarkably well conserved. Furthermore, the subcellular network prediction of OMEGA biosynthetic pathway genes revealed a unique interaction between the light-dependent chlorophyll biosynthesis and glycerol-3-phosphate dehydrogenase, which predicts a major cross-talk between the key essential pathways. Such bioinformatic analysis will provide insights in finding the key regulatory genes to optimize the productivity of OMEGAs in microalgal cell factories.
Collapse
Affiliation(s)
- Vikas U Kapase
- a Omics of Algae Group, Integrative Biology , International Centre for Genetic Engineering and Biotechnology , New Delhi , India
| | - Asha A Nesamma
- a Omics of Algae Group, Integrative Biology , International Centre for Genetic Engineering and Biotechnology , New Delhi , India
| | - Pannaga P Jutur
- a Omics of Algae Group, Integrative Biology , International Centre for Genetic Engineering and Biotechnology , New Delhi , India
| |
Collapse
|
13
|
Tevatia R, Oyler GA. Evolution of DDB1-binding WD40 (DWD) in the viridiplantae. PLoS One 2018; 13:e0190282. [PMID: 29293590 PMCID: PMC5749748 DOI: 10.1371/journal.pone.0190282] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 12/11/2017] [Indexed: 12/16/2022] Open
Abstract
Damaged DNA Binding 1 (DDB1)—binding WD40 (DWD) proteins are highly conserved and involved in a plethora of developmental and physiological processes such as flowering time control, photomorphogenesis, and abiotic stress responses. The phylogeny of this family of proteins in plants and algae of viridiplante is a critical area to understand the emergence of this family in such important and diverse functions. We aimed to investigate the putative homologs of DWD in the viridiplante and establish a deeper DWD evolutionary grasp. The advancement in publicly available genomic data allowed us to perform an extensive genome-wide DWD retrieval. Using annotated Arabidopsis thaliana DWDs as the reference, we generated and characterized a comprehensive DWD database for the studied photoautotrophs. Further, a generic DWD classification system (Type A to K), based on (i) position of DWD motifs, (ii) number of DWD motifs, and (iii) presence/absence of other domains, was adopted. About 72–80% DWDs have one DWD motif, whereas 17–24% DWDs have two and 0.5–4.7% DWDs have three DWD motifs. Neighbor-joining phylogenetic construction of A. thaliana DWDs facilitated us to tune these substrate receptors into 15 groups. Though the DWD count increases from microalgae to higher land plants, the ratio of DWD to WD40 remained constant throughout the viridiplante. The DWD expansion appeared to be the consequence of consistent DWD genetic flow accompanied by several gene duplication events. The network, phylogenetic, and statistical analysis delineated DWD evolutionary relevance in the viridiplante.
Collapse
Affiliation(s)
- Rahul Tevatia
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- * E-mail: (RT); (GAO)
| | - George A. Oyler
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- Synaptic Research LLC, Baltimore, Maryland, United States of America
- * E-mail: (RT); (GAO)
| |
Collapse
|
14
|
Shapiro JA. Living Organisms Author Their Read-Write Genomes in Evolution. BIOLOGY 2017; 6:E42. [PMID: 29211049 PMCID: PMC5745447 DOI: 10.3390/biology6040042] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 11/17/2017] [Accepted: 11/28/2017] [Indexed: 12/18/2022]
Abstract
Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
15
|
Van Holle S, De Schutter K, Eggermont L, Tsaneva M, Dang L, Van Damme EJM. Comparative Study of Lectin Domains in Model Species: New Insights into Evolutionary Dynamics. Int J Mol Sci 2017; 18:ijms18061136. [PMID: 28587095 PMCID: PMC5485960 DOI: 10.3390/ijms18061136] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2017] [Revised: 05/20/2017] [Accepted: 05/22/2017] [Indexed: 01/07/2023] Open
Abstract
Lectins are present throughout the plant kingdom and are reported to be involved in diverse biological processes. In this study, we provide a comparative analysis of the lectin families from model species in a phylogenetic framework. The analysis focuses on the different plant lectin domains identified in five representative core angiosperm genomes (Arabidopsisthaliana, Glycine max, Cucumis sativus, Oryza sativa ssp. japonica and Oryza sativa ssp. indica). The genomes were screened for genes encoding lectin domains using a combination of Basic Local Alignment Search Tool (BLAST), hidden Markov models, and InterProScan analysis. Additionally, phylogenetic relationships were investigated by constructing maximum likelihood phylogenetic trees. The results demonstrate that the majority of the lectin families are present in each of the species under study. Domain organization analysis showed that most identified proteins are multi-domain proteins, owing to the modular rearrangement of protein domains during evolution. Most of these multi-domain proteins are widespread, while others display a lineage-specific distribution. Furthermore, the phylogenetic analyses reveal that some lectin families evolved to be similar to the phylogeny of the plant species, while others share a closer evolutionary history based on the corresponding protein domain architecture. Our results yield insights into the evolutionary relationships and functional divergence of plant lectins.
Collapse
Affiliation(s)
- Sofie Van Holle
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| | - Kristof De Schutter
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
- Department of Crop Protection, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| | - Lore Eggermont
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| | - Mariya Tsaneva
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| | - Liuyi Dang
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| | - Els J M Van Damme
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Ghent, Belgium.
| |
Collapse
|
16
|
Singh A, Jethva M, Singla-Pareek SL, Pareek A, Kushwaha HR. Analyses of Old "Prokaryotic" Proteins Indicate Functional Diversification in Arabidopsis and Oryza sativa. FRONTIERS IN PLANT SCIENCE 2016; 7:304. [PMID: 27014324 PMCID: PMC4792156 DOI: 10.3389/fpls.2016.00304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Accepted: 02/26/2016] [Indexed: 06/05/2023]
Abstract
During evolution, various processes such as duplication, divergence, recombination, and many other events leads to the evolution of new genes with novel functions. These evolutionary events, thus significantly impact the evolution of cellular, physiological, morphological, and other phenotypic trait of organisms. While evolving, eukaryotes have acquired large number of genes from the earlier prokaryotes. This work is focused upon identification of old "prokaryotic" proteins in Arabidopsis and Oryza sativa genome, further highlighting their possible role(s) in the two genomes. Our results suggest that with respect to their genome size, the fraction of old "prokaryotic" proteins is higher in Arabidopsis than in Oryza sativa. The large fractions of such proteins encoding genes were found to be localized in various endo-symbiotic organelles. The domain architecture of the old "prokaryotic" proteins revealed similar distribution in both Arabidopsis and Oryza sativa genomes showing their conserved evolution. In Oryza sativa, the old "prokaryotic" proteins were more involved in developmental processes, might be due to constant man-made selection pressure for better agronomic traits/productivity. While in Arabidopsis, these proteins were involved in metabolic functions. Overall, the analysis indicates the distinct pattern of evolution of old "prokaryotic" proteins in Arabidopsis and Oryza sativa.
Collapse
Affiliation(s)
- Anupama Singh
- School of Computational and Integrative Sciences, Jawaharlal Nehru UniversityNew Delhi, India
| | - Minesh Jethva
- International Center for Genetic Engineering and BiotechnologyNew Delhi, India
| | - Sneh L. Singla-Pareek
- Plant Stress Biology, International Center for Genetic Engineering and BiotechnologyNew Delhi, India
| | - Ashwani Pareek
- Stress Physiology and Molecular Biology Laboratory, School of Life Sciences, Jawaharlal Nehru UniversityNew Delhi, India
| | - Hemant R. Kushwaha
- International Center for Genetic Engineering and BiotechnologyNew Delhi, India
| |
Collapse
|
17
|
Hsu CH, Chiang AWT, Hwang MJ, Liao BY. Proteins with Highly Evolvable Domain Architectures Are Nonessential but Highly Retained. Mol Biol Evol 2016; 33:1219-30. [DOI: 10.1093/molbev/msw006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
|
18
|
Ghosh A, Kushwaha HR, Hasan MR, Pareek A, Sopory SK, Singla-Pareek SL. Presence of unique glyoxalase III proteins in plants indicates the existence of shorter route for methylglyoxal detoxification. Sci Rep 2016; 6:18358. [PMID: 26732528 PMCID: PMC4702089 DOI: 10.1038/srep18358] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 11/03/2015] [Indexed: 11/16/2022] Open
Abstract
Glyoxalase pathway, comprising glyoxalase I (GLY I) and glyoxalase II (GLY II) enzymes, is the major pathway for detoxification of methylglyoxal (MG) into D-lactate involving reduced glutathione (GSH). However, in bacteria, glyoxalase III (GLY III) with DJ-1/PfpI domain(s) can do the same conversion in a single step without GSH. Our investigations for the presence of DJ-1/PfpI domain containing proteins in plants have indicated the existence of GLY III-like proteins in monocots, dicots, lycopods, gymnosperm and bryophytes. A deeper in silico analysis of rice genome identified twelve DJ-1 proteins encoded by six genes. Detailed analysis has been carried out including their chromosomal distribution, genomic architecture and localization. Transcript profiling under multiple stress conditions indicated strong induction of OsDJ-1 in response to exogenous MG. A member of OsDJ-1 family, OsDJ-1C, showed high constitutive expression at all developmental stages and tissues of rice. MG depletion study complemented by simultaneous formation of D-lactate proved OsDJ-1C to be a GLY III enzyme that converts MG directly into D-lactate in a GSH-independent manner. Site directed mutagenesis of Cys-119 to Alanine significantly reduces its GLY III activity indicating towards the existence of functional GLY III enzyme in rice—a shorter route for MG detoxification.
Collapse
Affiliation(s)
- Ajit Ghosh
- Plant Molecular Biology Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India.,Present address: Department of Biochemistry and Molecular Biology, Shahjalal University of Science and Technology, Sylhet-3114, Bangladesh
| | - Hemant R Kushwaha
- Plant Molecular Biology Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India
| | - Mohammad R Hasan
- Plant Molecular Biology Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India
| | - Ashwani Pareek
- Stress Physiology and Molecular Biology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi-110067, India
| | - Sudhir K Sopory
- Plant Molecular Biology Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India
| | - Sneh L Singla-Pareek
- Plant Molecular Biology Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India
| |
Collapse
|
19
|
Van Holle S, Van Damme EJM. Distribution and evolution of the lectin family in soybean (Glycine max). Molecules 2015; 20:2868-91. [PMID: 25679048 PMCID: PMC6272470 DOI: 10.3390/molecules20022868] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Accepted: 02/06/2015] [Indexed: 01/02/2023] Open
Abstract
Lectins are a diverse group of proteins that bind specific carbohydrates and are found throughout all kingdoms. In plants, lectins are involved in a range of important processes such as plant defense and stress signaling. Although the genome sequence of Glycine max (soybean) has been published, little is known about the abundance and expansion patterns of lectin genes in soybean. Using BLAST and hidden Markov models, a total of 359 putative lectin genes have been identified. Furthermore, these sequences could be classified in nine of the twelve plant lectin families identified today. Analysis of the domain organization demonstrated that most of the identified lectin genes encode chimerolectins, consisting of one or multiple lectin domains combined with other known protein domains. Both tandem and segmental duplication events have contributed to the expansion of the lectin gene family. These data provide a detailed understanding of the domain architecture and molecular evolution of the lectin gene family in soybean.
Collapse
Affiliation(s)
- Sofie Van Holle
- Laboratory of Biochemistry and Glycobiology, Department of Molecular Biotechnology, Ghent University, Coupure links 653, 9000 Ghent, Belgium.
| | - Els J M Van Damme
- Laboratory of Biochemistry and Glycobiology, Department of Molecular Biotechnology, Ghent University, Coupure links 653, 9000 Ghent, Belgium.
| |
Collapse
|
20
|
Hayashida M, Ruan P, Akutsu T. Proteome compression via protein domain compositions. Methods 2014; 67:380-5. [DOI: 10.1016/j.ymeth.2014.01.012] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Revised: 12/25/2013] [Accepted: 01/15/2014] [Indexed: 01/10/2023] Open
|
21
|
Yao Q, Ge H, Wu S, Zhang N, Chen W, Xu C, Gao J, Thelen JJ, Xu D. P³DB 3.0: From plant phosphorylation sites to protein networks. Nucleic Acids Res 2013; 42:D1206-13. [PMID: 24243849 PMCID: PMC3965113 DOI: 10.1093/nar/gkt1135] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
In the past few years, the Plant Protein Phosphorylation Database (P3DB, http://p3db.org) has become one of the most significant in vivo data resources for studying plant phosphoproteomics. We have substantially updated P3DB with respect to format, new datasets and analytic tools. In the P3DB 3.0, there are altogether 47 923 phosphosites in 16 477 phosphoproteins curated across nine plant organisms from 32 studies, which have met our multiple quality standards for acquisition of in vivo phosphorylation site data. Centralized by these phosphorylation data, multiple related data and annotations are provided, including protein–protein interaction (PPI), gene ontology, protein tertiary structures, orthologous sequences, kinase/phosphatase classification and Kinase Client Assay (KiC Assay) data—all of which provides context for the phosphorylation event. In addition, P3DB 3.0 incorporates multiple network viewers for the above features, such as PPI network, kinase-substrate network, phosphatase-substrate network, and domain co-occurrence network to help study phosphorylation from a systems point of view. Furthermore, the new P3DB reflects a community-based design through which users can share datasets and automate data depository processes for publication purposes. Each of these new features supports the goal of making P3DB a comprehensive, systematic and interactive platform for phosphoproteomics research.
Collapse
Affiliation(s)
- Qiuming Yao
- Department of Computer Science, University of Missouri, Columbia, MO 65211, USA, Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA, School of Communication and Information Engineering, Shanghai University, Shanghai 200444, People's Republic of China, Department of Biology, Brandeis University, MA 02453, USA, Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and Department of Biochemistry, University of Missouri, Columbia, MO 65211, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Cabreira C, Cagliari A, Bücker-Neto L, Wiebke-Strohm B, de Freitas LB, Marcelino-Guimarães FC, Nepomuceno AL, Margis-Pinheiro MMAN, Bodanese-Zanettini MH. The Lesion Simulating Disease (LSD) gene family as a variable in soybean response to Phakopsora pachyrhizi infection and dehydration. Funct Integr Genomics 2013; 13:323-38. [DOI: 10.1007/s10142-013-0326-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Revised: 05/20/2013] [Accepted: 05/27/2013] [Indexed: 12/13/2022]
|
23
|
Giri J, Dansana PK, Kothari KS, Sharma G, Vij S, Tyagi AK. SAPs as novel regulators of abiotic stress response in plants. Bioessays 2013; 35:639-48. [PMID: 23640876 DOI: 10.1002/bies.201200181] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Stress associated proteins (SAPs), novel A20/AN1 zinc-finger domain-containing proteins, are fast emerging as potential candidates for biotechnological approaches in order to improve abiotic stress tolerance in plants - the ultimate aim of which is crop-yield protection. Until relatively recently, such proteins had only been identified in humans, where they had been shown to be key regulators of innate immunity. Their phylogenetic relationship and recruitment of diverse protein domains reflect an architectural and mechanistic diversity. Emerging evidence suggests that SAPs may act as ubiquitin ligase, redox sensor, and regulator of gene expression during stress. Here, we evaluate the new knowledge on SAPs with a view to understand their mechanism of action. Furthermore, we set an agenda for investigating hitherto unexplored roles of these proteins.
Collapse
Affiliation(s)
- Jitender Giri
- National Institute of Plant Genome Research, New Delhi, India
| | | | | | | | | | | |
Collapse
|
24
|
Guo YL. Gene family evolution in green plants with emphasis on the origination and evolution of Arabidopsis thaliana genes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 73:941-51. [PMID: 23216999 DOI: 10.1111/tpj.12089] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Revised: 11/01/2012] [Accepted: 11/23/2012] [Indexed: 05/04/2023]
Abstract
Gene family size variation is an important mechanism that shapes the natural variation for adaptation in various species. Despite its importance, the pattern of gene family size variation in green plants is still not well understood. In particular, the evolutionary pattern of genes and gene families remains unknown in the model plant Arabidopsis thaliana in the context of green plants. In this study, eight representative genomes of green plants are sampled to study gene family evolution and characterize the origination of A. thaliana genes, respectively. Four important insights gained are that: (i) the rate of gene gains and losses is about 0.001359 per gene every million years, similar to the rate in yeast, Drosophila, and mammals; (ii) some gene families evolved rapidly with extreme expansions or contractions, and 2745 gene families present in all the eight species represent the 'core' proteome of green plants; (iii) 70% of A. thaliana genes could be traced back to 450 million years ago; and (iv) intriguingly, A. thaliana genes with early origination are under stronger purifying selection and more conserved. In summary, the present study provides genome-wide insights into evolutionary history and mechanisms of genes and gene families in green plants and especially in A. thaliana.
Collapse
Affiliation(s)
- Ya-Long Guo
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
| |
Collapse
|
25
|
Tan S, Zhong Y, Hou H, Yang S, Tian D. Variation of presence/absence genes among Arabidopsis populations. BMC Evol Biol 2012; 12:86. [PMID: 22697058 PMCID: PMC3433342 DOI: 10.1186/1471-2148-12-86] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 06/14/2012] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Gene presence/absence (P/A) polymorphisms are commonly observed in plants and are important in individual adaptation and species differentiation. Detecting their abundance, distribution and variation among individuals would help to understand the role played by these polymorphisms in a given species. The recently sequenced 80 Arabidopsis genomes provide an opportunity to address these questions. RESULTS By systematically investigating these accessions, we identified 2,407 P/A genes (or 8.9%) absent in one or more genomes, averaging 444 absent genes per accession. 50.6% of P/A genes belonged to multi-copy gene families, or 31.0% to clustered genes. However, the highest proportion of P/A genes, outnumbered in singleton genes, was observed in the regions near centromeres. In addition, a significant correlation was observed between the P/A gene frequency among the 80 accessions and the diversity level at P/A loci. Furthermore, the proportion of P/A genes was different among functional gene categories. Finally, a P/A gene tree showed a diversified population structure in the worldwide Arabidopsis accessions. CONCLUSIONS An estimate of P/A genes and their frequency distribution in the worldwide Arabidopsis accessions was obtained. Our results suggest that there are diverse mechanisms to generate or maintain P/A genes, by which individuals and functionally different genes can selectively maintain P/A polymorphisms for a specific adaptation.
Collapse
Affiliation(s)
- Shengjun Tan
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Yan Zhong
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Huan Hou
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, Department of Biology, Nanjing University, Nanjing, 210093, China
| |
Collapse
|