1
|
Kang JE, Jun JH, Kwon JH, Lee JH, Hwang K, Kim S, Jeong N. Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-Liquid/Liquid Phase Separation, Oligomerization, GO Analysis: A Toolkit for Interaction Data-Based Domain Analysis. Genes (Basel) 2023; 14:1476. [PMID: 37510380 PMCID: PMC10379056 DOI: 10.3390/genes14071476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 07/04/2023] [Accepted: 07/14/2023] [Indexed: 07/30/2023] Open
Abstract
Although a large number of databases are available for regulatory elements, a bottleneck has been created by the lack of bioinformatics tools to predict the interaction modes of regulatory elements. To reduce this gap, we developed the Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool-liquid/liquid phase separation (LLPS), oligomerization, GO analysis (ART FOUNDATION-LOG), a useful toolkit for protein-nucleic acid interaction (PNI) and protein-protein interaction (PPI) analysis based on domain-domain interactions (DDIs). LLPS, protein oligomerization, the structural properties of protein domains, and protein modifications are major components in the orchestration of the spatiotemporal dynamics of PPIs and PNIs. Our goal is to integrate PPI/PNI information into the development of a prediction model for identifying important genetic variants in peaches. Our program unified interdatabase relational keys based on protein domains to facilitate inference from the model species. A key advantage of this program lies in the integrated information of related features, such as protein oligomerization, LOG analysis, structural characterizations of domains (e.g., domain linkers, intrinsically disordered regions, DDIs, domain-motif (peptide) interactions, beta sheets, and transmembrane helices), and post-translational modification. We provided simple tests to demonstrate how to use this program, which can be applied to other eukaryotic organisms.
Collapse
Affiliation(s)
- Jee Eun Kang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ji Hae Jun
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Jung Hyun Kwon
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Ju-Hyun Lee
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Kidong Hwang
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Sungjong Kim
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| | - Namhee Jeong
- Fruit Research Division, National Institute of Horticultural and Herbal Science, Wanju 55365, Republic of Korea
| |
Collapse
|
2
|
Abstract
Proteins are intimately involved in executing and controlling virtually all cellular processes. To understand the molecular mechanisms that underlie plant phenotypes, it is essential to investigate protein expression, interactions, and modifications, to name a few. The proteome is highly dynamic in time and space, and a plethora of protein modifications, protein interactions, and network constellations are at play under specific conditions and developmental stages. Analysis of proteomes aims to characterize the entire protein complement of a particular cell type, tissue, or organism-a challenging task, given the dynamic nature of the proteome. Modern mass spectrometry-based proteomics technology can be used to address this complexity at a system-wide scale by the global identification and quantification of thousands of proteins. In this review, we present current methods and technologies employed in mass spectrometry-based proteomics and provide examples of dynamic changes in the plant proteome elucidated by proteomic approaches.
Collapse
Affiliation(s)
- Julia Mergner
- Bavarian Center for Biomolecular Mass Spectrometry at Klinikum rechts der Isar (BayBioMS@MRI), Technical University of Munich, Munich, Germany;
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany;
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany;
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), Technical University of Munich, Freising, Germany
| |
Collapse
|
3
|
Jiang S, Du Q, Feng C, Ma L, Zhang Z. CompoDynamics: a comprehensive database for characterizing sequence composition dynamics. Nucleic Acids Res 2022; 50:D962-D969. [PMID: 34718745 PMCID: PMC8728180 DOI: 10.1093/nar/gkab979] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 10/02/2021] [Accepted: 10/06/2021] [Indexed: 11/15/2022] Open
Abstract
Sequence compositions of nucleic acids and proteins have significant impact on gene expression, RNA stability, translation efficiency, RNA/protein structure and molecular function, and are associated with genome evolution and adaptation across all kingdoms of life. Therefore, a devoted resource of sequence compositions and associated features is fundamentally crucial for a wide range of biological research. Here, we present CompoDynamics (https://ngdc.cncb.ac.cn/compodynamics/), a comprehensive database of sequence compositions of coding sequences (CDSs) and genomes for all kinds of species. Taking advantage of the exponential growth of RefSeq data, CompoDynamics presents a wealth of sequence compositions (nucleotide content, codon usage, amino acid usage) and derived features (coding potential, physicochemical property and phase separation) for 118 689 747 high-quality CDSs and 34 562 genomes across 24 995 species. Additionally, interactive analytical tools are provided to enable comparative analyses of sequence compositions and molecular features across different species and gene groups. Collectively, CompoDynamics bears the great potential to better understand the underlying roles of sequence composition dynamics across genes and genomes, providing a fundamental resource in support of a broad spectrum of biological studies.
Collapse
Affiliation(s)
- Shuai Jiang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- China National Center for Bioinformation, Beijing 100101, China
| | - Qiang Du
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Changrui Feng
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lina Ma
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhang Zhang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
4
|
Iqbal Z, Iqbal MS, Khan MIR, Ansari MI. Toward Integrated Multi-Omics Intervention: Rice Trait Improvement and Stress Management. FRONTIERS IN PLANT SCIENCE 2021; 12:741419. [PMID: 34721467 PMCID: PMC8554098 DOI: 10.3389/fpls.2021.741419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 09/20/2021] [Indexed: 05/04/2023]
Abstract
Rice (Oryza sativa) is an imperative staple crop for nearly half of the world's population. Challenging environmental conditions encompassing abiotic and biotic stresses negatively impact the quality and yield of rice. To assure food supply for the unprecedented ever-growing world population, the improvement of rice as a crop is of utmost importance. In this era, "omics" techniques have been comprehensively utilized to decipher the regulatory mechanisms and cellular intricacies in rice. Advancements in omics technologies have provided a strong platform for the reliable exploration of genetic resources involved in rice trait development. Omics disciplines like genomics, transcriptomics, proteomics, and metabolomics have significantly contributed toward the achievement of desired improvements in rice under optimal and stressful environments. The present review recapitulates the basic and applied multi-omics technologies in providing new orchestration toward the improvement of rice desirable traits. The article also provides a catalog of current scenario of omics applications in comprehending this imperative crop in relation to yield enhancement and various environmental stresses. Further, the appropriate databases in the field of data science to analyze big data, and retrieve relevant information vis-à-vis rice trait improvement and stress management are described.
Collapse
Affiliation(s)
- Zahra Iqbal
- Molecular Crop Research Unit, Department of Biochemistry, Chulalongkorn University, Bangkok, Thailand
| | | | | | | |
Collapse
|
5
|
Bagnaresi P, Cattivelli L. Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes. PLoS One 2020; 15:e0234782. [PMID: 32559249 PMCID: PMC7304597 DOI: 10.1371/journal.pone.0234782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 06/02/2020] [Indexed: 11/20/2022] Open
Abstract
A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds–related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e-6), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role.
Collapse
Affiliation(s)
- Paolo Bagnaresi
- CREA Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
- * E-mail:
| | - Luigi Cattivelli
- CREA Research Centre for Genomics and Bioinformatics, Fiorenzuola d’Arda, Italy
| |
Collapse
|
6
|
Hong WJ, Kim YJ, Chandran AKN, Jung KH. Infrastructures of systems biology that facilitate functional genomic study in rice. RICE (NEW YORK, N.Y.) 2019; 12:15. [PMID: 30874968 PMCID: PMC6419666 DOI: 10.1186/s12284-019-0276-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Accepted: 03/06/2019] [Indexed: 05/08/2023]
Abstract
Rice (Oryza sativa L.) is both a major staple food for the worldwide population and a model crop plant for studying the mode of action of agronomically valuable traits, providing information that can be applied to other crop plants. Due to the development of high-throughput technologies such as next generation sequencing and mass spectrometry, a huge mass of multi-omics data in rice has been accumulated. Through the integration of those data, systems biology in rice is becoming more advanced.To facilitate such systemic approaches, we have summarized current resources, such as databases and tools, for systems biology in rice. In this review, we categorize the resources using six omics levels: genomics, transcriptomics, proteomics, metabolomics, integrated omics, and functional genomics. We provide the names, websites, references, working states, and number of citations for each individual database or tool and discuss future prospects for the integrated understanding of rice gene functions.
Collapse
Affiliation(s)
- Woo-Jong Hong
- Graduate School of Biotechnology & Crop Biotech Institute, Kyung Hee University, Yongin, 17104, Korea
| | - Yu-Jin Kim
- Graduate School of Biotechnology & Crop Biotech Institute, Kyung Hee University, Yongin, 17104, Korea
| | | | - Ki-Hong Jung
- Graduate School of Biotechnology & Crop Biotech Institute, Kyung Hee University, Yongin, 17104, Korea.
| |
Collapse
|
7
|
Liao P, Li S, Cui X, Zheng Y. A comprehensive review of web-based resources of non-coding RNAs for plant science research. Int J Biol Sci 2018; 14:819-832. [PMID: 29989090 PMCID: PMC6036741 DOI: 10.7150/ijbs.24593] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2017] [Accepted: 03/14/2018] [Indexed: 01/06/2023] Open
Abstract
Non-coding RNAs (ncRNAs) are transcribed from genome but not translated into proteins. Many ncRNAs are key regulators of plants growth and development, metabolism and stress tolerance. In order to make the web-based ncRNA resources for plant science research be more easily accessible and understandable, we made a comprehensive review for 83 web-based resources of three types, including genome databases containing ncRNA data, microRNA (miRNA) databases and long non-coding RNA (lncRNA) databases. To facilitate effective usage of these resources, we also suggested some preferred resources of miRNAs and lncRNAs for performing meaningful analysis.
Collapse
Affiliation(s)
- Peiran Liao
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
| | - Shipeng Li
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
| | - Xiuming Cui
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
- Yunnan key laboratory of Panax notoginseng, Kunming, Yunnan, 650500, China
| | - Yun Zheng
- Yunnan Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
| |
Collapse
|
8
|
Podia V, Milioni D, Martzikou M, Haralampidis K. The role of Arabidopsis thaliana RASD1 gene in ABA-dependent abiotic stress response. PLANT BIOLOGY (STUTTGART, GERMANY) 2018; 20:307-317. [PMID: 29125669 DOI: 10.1111/plb.12662] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 11/06/2017] [Indexed: 06/07/2023]
Abstract
Abiotic stress is one of the key parameters affecting plant productivity. Drought and soil salinity, in particular, challenge plants to activate various response mechanisms to withstand these adverse growth conditions. While the molecular events that take place are complex and to a large extent unclear, the plant hormone abscisic acid (ABA) is considered a major player in mediating the adaptation of plants to stress. Here we report the identification of an ABA-insensitive mutant from Arabidopsis thaliana. A combination of molecular, genetic and physiology approaches were implemented, to characterise the AtRASD1 locus (RESPONSIVENESS TO ABA SALT AND DROUGHT 1) and to investigate its role in plant development. RASD1 is expressed predominantly in the vascular system of A. thaliana and encodes a peptide of unknown function with no similarity to any known sequence to date. The protein is localised in the nucleus and the cytoplasm, and RASD1-impaired plants are drought-intolerant and insensitive to exogenous ABA and NaCl during germination and root growth. Our data indicate that RASD1 is involved in ABA-dependent signal transduction pathways and therefore in enabling plants to activate response mechanisms related to seed germination and abiotic stress.
Collapse
Affiliation(s)
- V Podia
- Faculty of Biology, Department of Botany, National and Kapodistrian University of Athens, Athens, Greece
| | - D Milioni
- Department of Agricultural Biotechnology, Agricultural University of Athens, Athens, Greece
| | - M Martzikou
- Faculty of Biology, Department of Botany, National and Kapodistrian University of Athens, Athens, Greece
| | - K Haralampidis
- Faculty of Biology, Department of Botany, National and Kapodistrian University of Athens, Athens, Greece
| |
Collapse
|
9
|
Kurotani A, Yamada Y, Sakurai T. Alga-PrAS (Algal Protein Annotation Suite): A Database of Comprehensive Annotation in Algal Proteomes. PLANT & CELL PHYSIOLOGY 2017; 58:e6. [PMID: 28069893 PMCID: PMC5444574 DOI: 10.1093/pcp/pcw212] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 11/24/2016] [Indexed: 06/06/2023]
Abstract
Algae are smaller organisms than land plants and offer clear advantages in research over terrestrial species in terms of rapid production, short generation time and varied commercial applications. Thus, studies investigating the practical development of effective algal production are important and will improve our understanding of both aquatic and terrestrial plants. In this study we estimated multiple physicochemical and secondary structural properties of protein sequences, the predicted presence of post-translational modification (PTM) sites, and subcellular localization using a total of 510,123 protein sequences from the proteomes of 31 algal and three plant species. Algal species were broadly selected from green and red algae, glaucophytes, oomycetes, diatoms and other microalgal groups. The results were deposited in the Algal Protein Annotation Suite database (Alga-PrAS; http://alga-pras.riken.jp/), which can be freely accessed online.
Collapse
Affiliation(s)
- Atsushi Kurotani
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
| | - Yutaka Yamada
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
| | - Tetsuya Sakurai
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
- Interdisciplinary Science Unit, Multidisciplinary Science Cluster, Research and Education Faculty, Kochi University, 200 Otsu, Monobe, Nankoku, Kochi, 783-8502, Japan
| |
Collapse
|
10
|
Mukhi N, Dhindwal S, Uppal S, Kapoor A, Arya R, Kumar P, Kaur J, Kundu S. Structural and Functional Significance of the N- and C-Terminal Appendages in Arabidopsis Truncated Hemoglobin. Biochemistry 2016; 55:1724-40. [PMID: 26913482 DOI: 10.1021/acs.biochem.5b01013] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Plant hemoglobins constitute three distinct groups: symbiotic, nonsymbiotic, and truncated hemoglobins. Structural investigation of symbiotic and nonsymbiotic (class I) hemoglobins revealed the presence of a vertebrate-like 3/3 globin fold in these proteins. In contrast, plant truncated hemoglobins are similar to bacterial truncated hemoglobins with a putative 2/2 α-helical globin fold. While multiple structures have been reported for plant hemoglobins of the first two categories, for plant truncated globins only one structure has been reported of late. Here, we report yet another crystal structure of the truncated hemoglobin from Arabidopsis thaliana (AHb3) with two water molecules in the heme pocket, of which one is distinctly coordinated to the heme iron, unlike the only available crystal structure of AHb3 with a hydroxyl ligand. AHb3 was monomeric in its crystallographic asymmetric unit; however, dimer was evident in the crystallographic symmetry, and the globin indeed existed as a stable dimer in solution. The tertiary structure of the protein exhibited a bacterial-like 2/2 α-helical globin fold with an additional N-terminal α-helical extension and disordered C-termini. To address the role of these extended termini in AHb3, which is yet unknown, N- and C-terminal deletion mutants were created and characterized and molecular dynamics simulations performed. The C-terminal deletion had an insignificant effect on most properties but perturbed the dimeric equilibrium of AHb3 and significantly influenced azide binding kinetics in the ferric state. These results along with the disordered nature of the C-terminus indicated its putative role in intramolecular or intermolecular interactions probably regulating protein-ligand and protein-protein interactions. While the N-terminal deletion did not change the overall globin fold, stability, or ligand binding kinetics, it seemed to have influenced coordination at the heme iron, the hydration status of the active site, and the quaternary structure of AHb3. Evidence indicated that the N-terminus is the predominant factor regulating the quaternary interaction appropriate to physiological requirements, dynamics of the side chains in the heme pocket, and tunnel organization in the protein matrix.
Collapse
Affiliation(s)
- Nitika Mukhi
- Department of Genetics, University of Delhi South Campus , New Delhi 110021, India
| | - Sonali Dhindwal
- Department of Biotechnology, Indian Institute of Technology , Roorkee, Uttarakhand 247667, India
| | - Sheetal Uppal
- Department of Biochemistry, University of Delhi South Campus , New Delhi 110021, India
| | - Abhijeet Kapoor
- Department of Biochemistry, University of Delhi South Campus , New Delhi 110021, India
| | - Richa Arya
- Department of Biochemistry, University of Delhi South Campus , New Delhi 110021, India
| | - Pravindra Kumar
- Department of Biotechnology, Indian Institute of Technology , Roorkee, Uttarakhand 247667, India
| | - Jagreet Kaur
- Department of Genetics, University of Delhi South Campus , New Delhi 110021, India
| | - Suman Kundu
- Department of Biochemistry, University of Delhi South Campus , New Delhi 110021, India
| |
Collapse
|
11
|
Tokmakov AA, Kurotani A, Ikeda M, Terazawa Y, Shirouzu M, Stefanov V, Sakurai T, Yokoyama S. Content of intrinsic disorder influences the outcome of cell-free protein synthesis. Sci Rep 2015; 5:14079. [PMID: 26359642 PMCID: PMC4566126 DOI: 10.1038/srep14079] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Accepted: 08/14/2015] [Indexed: 01/04/2023] Open
Abstract
Cell-free protein synthesis is used to produce proteins with various structural traits. Recent bioinformatics analyses indicate that more than half of eukaryotic proteins possess long intrinsically disordered regions. However, no systematic study concerning the connection between intrinsic disorder and expression success of cell-free protein synthesis has been presented until now. To address this issue, we examined correlations of the experimentally observed cell-free protein expression yields with the contents of intrinsic disorder bioinformatically predicted in the expressed sequences. This analysis revealed strong relationships between intrinsic disorder and protein amenability to heterologous cell-free expression. On the one hand, elevated disorder content was associated with the increased ratio of soluble expression. On the other hand, overall propensity for detectable protein expression decreased with disorder content. We further demonstrated that these tendencies are rooted in some distinct features of intrinsically disordered regions, such as low hydrophobicity, elevated surface accessibility and high abundance of sequence motifs for proteolytic degradation, including sites of ubiquitination and PEST sequences. Our findings suggest that identification of intrinsically disordered regions in the expressed amino acid sequences can be of practical use for predicting expression success and optimizing cell-free protein synthesis.
Collapse
Affiliation(s)
- Alexander A Tokmakov
- Research Center for Environmental Genomics, Kobe University, Nada 657-8501, Japan.,RIKEN Systems and Structural Biology Center, Yokohama 230-0045, Japan
| | - Atsushi Kurotani
- RIKEN Center for Sustainable Resource Science, Yokohama 230-0045, Japan
| | - Mariko Ikeda
- RIKEN Systems and Structural Biology Center, Yokohama 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama 230-0045, Japan
| | - Yumiko Terazawa
- RIKEN Systems and Structural Biology Center, Yokohama 230-0045, Japan
| | - Mikako Shirouzu
- RIKEN Systems and Structural Biology Center, Yokohama 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama 230-0045, Japan
| | - Vasily Stefanov
- Department of Biochemistry, Saint-Petersburg State University, St. Petersburg 199034, Russia
| | - Tetsuya Sakurai
- RIKEN Center for Sustainable Resource Science, Yokohama 230-0045, Japan
| | - Shigeyuki Yokoyama
- RIKEN Systems and Structural Biology Center, Yokohama 230-0045, Japan.,RIKEN Structural Biology Laboratory, Yokohama 230-0045, Japan
| |
Collapse
|
12
|
Kurotani A, Sakurai T. In Silico Analysis of Correlations between Protein Disorder and Post-Translational Modifications in Algae. Int J Mol Sci 2015; 16:19812-35. [PMID: 26307970 PMCID: PMC4581327 DOI: 10.3390/ijms160819812] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Revised: 08/12/2015] [Accepted: 08/13/2015] [Indexed: 12/23/2022] Open
Abstract
Recent proteome analyses have reported that intrinsically disordered regions (IDRs) of proteins play important roles in biological processes. In higher plants whose genomes have been sequenced, the correlation between IDRs and post-translational modifications (PTMs) has been reported. The genomes of various eukaryotic algae as common ancestors of plants have also been sequenced. However, no analysis of the relationship to protein properties such as structure and PTMs in algae has been reported. Here, we describe correlations between IDR content and the number of PTM sites for phosphorylation, glycosylation, and ubiquitination, and between IDR content and regions rich in proline, glutamic acid, serine, and threonine (PEST) and transmembrane helices in the sequences of 20 algae proteomes. Phosphorylation, O-glycosylation, ubiquitination, and PEST preferentially occurred in disordered regions. In contrast, transmembrane helices were favored in ordered regions. N-glycosylation tended to occur in ordered regions in most of the studied algae; however, it correlated positively with disordered protein content in diatoms. Additionally, we observed that disordered protein content and the number of PTM sites were significantly increased in the species-specific protein clusters compared to common protein clusters among the algae. Moreover, there were specific relationships between IDRs and PTMs among the algae from different groups.
Collapse
Affiliation(s)
- Atsushi Kurotani
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
| | - Tetsuya Sakurai
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
| |
Collapse
|
13
|
Ohyanagi H, Obayashi T, Yano K. Editorial: Plant and Cell Physiology's 2015 database issue. PLANT & CELL PHYSIOLOGY 2015; 56:4-6. [PMID: 25756138 DOI: 10.1093/pcp/pcu206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
|