1
|
Sanaboyana VR, Elcock AH. Improving Signal and Transit Peptide Predictions Using AlphaFold2-predicted Protein Structures. J Mol Biol 2024; 436:168393. [PMID: 38065275 PMCID: PMC10843742 DOI: 10.1016/j.jmb.2023.168393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 12/01/2023] [Accepted: 12/04/2023] [Indexed: 12/21/2023]
Abstract
Many proteins contain cleavable signal or transit peptides that direct them to their final subcellular locations. Such peptides are usually predicted from sequence alone using methods such as TargetP 2.0 and SignalP 6.0. While these methods are usually very accurate, we show here that an analysis of a protein's AlphaFold2-predicted structure can often be used to identify false positive predictions. We start by showing that when given a protein's full-length sequence, AlphaFold2 builds experimentally annotated signal and transit peptides in orientations that point away from the main body of the protein. This indicates that AlphaFold2 correctly identifies that a signal is not destined to be part of the mature protein's structure and suggests, as a corollary, that predicted signals that AlphaFold2 folds with high confidence into the main body of the protein are likely to be false positives. To explore this idea, we analyzed predicted signal peptides in 48 proteomes made available in DeepMind's AlphaFold2 database (https://alphafold.ebi.ac.uk). Applying TargetP 2.0 and SignalP 6.0 to the 561,562 proteins in the database results in 95,236 being predicted to contain a cleavable signal or transit peptide. In 95.1% of these cases, the AlphaFold2 structure of the full-length protein is fully consistent with the prediction of TargetP 2.0 or SignalP 6.0. In the remaining 4.9% of cases where the AlphaFold2 structure does not appear consistent with the prediction, the signal is often only predicted with low confidence. The potential false positives identified here may be useful for training even more accurate signal prediction methods.
Collapse
Affiliation(s)
| | - Adrian H Elcock
- Department of Biochemistry & Molecular Biology, University of Iowa, USA.
| |
Collapse
|
2
|
Sun Y, Xin Y, Zhang L, Wang Y, Liu R, Li X, Zhou C, Zhang L, Han J. Enhancement of violaxanthin accumulation in Nannochloropsis oceanica by overexpressing a carotenoid isomerase gene from Phaeodactylum tricornutum. Front Microbiol 2022; 13:942883. [PMID: 36118188 PMCID: PMC9471142 DOI: 10.3389/fmicb.2022.942883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 07/26/2022] [Indexed: 11/30/2022] Open
Abstract
Nannochloropsis has been considered as a promising feedstock for the industrial production of violaxanthin. However, a rational breeding strategy for the enhancement of violaxanthin content in this microalga is still vacant, thereby limiting its industrial application. All-trans-lycopene locates in the first branch point of carotenogenesis. The carotenoid isomerase (CRTISO), catalyzing the lycopene formation, is thus regarded as a key enzyme for carotenogenesis. Phaeodactylum tricornutum can accumulate high-level carotenoids under optimal conditions. Therefore, it is feasible to improve violaxanthin level in Nannochloropsis by overexpression of PtCRTISO. Protein targeting analysis of seven PtCRTISO candidates (PtCRTISO1–6 and PtCRTISO-like) demonstrated that PtCRTISO4 was most likely the carotenoid isomerase of P. tricornutum. Moreover, the transcriptional pattern of PtCRTISO4 at different cultivation periods was quite similar to other known carotenogenesis genes. Thus, PtCRTISO4 was transformed into N. oceanica. Compared to the wild type (WT), all three transgenic lines (T1–T3) of N. oceanica exhibited higher levels of total carotenoid and violaxanthin. Notably, T3 exhibited the peak violaxanthin content of 4.48 mg g–1 dry cell weight (DCW), which was 1.68-folds higher than WT. Interestingly, qRT-polymerase chain reaction (PCR) results demonstrated that phytoene synthase (NoPSY) rather than ζ-carotene desaturase (NoZDS) and lycopene β-cyclase (NoLCYB) exhibited the highest upregulation, suggesting that PtCRTISO4 played an additional regulatory role in terms of carotenoid accumulation. Moreover, PtCRTISO4 overexpression increased C18:1n-9 but decreased C16:1n-7, implying that C18:1 may serve as a main feedstock for xanthophyll esterification in Nannochloropsis. Our results will provide valuable information for the violaxanthin production from Nannochloropsis.
Collapse
Affiliation(s)
- Yan Sun
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Yi Xin
- State Key Laboratory of Marine Resource Utilization in the South China Sea, College of Oceanology, Hainan University, Haikou, China
| | - Luyao Zhang
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Ying Wang
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Ruolan Liu
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Xiaohui Li
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Chengxu Zhou
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
| | - Lin Zhang
- Key Laboratory of Applied Marine Biotechnology of Ministry of Education of China, School of Marine Science, Ningbo University, Ningbo, China
- *Correspondence: Lin Zhang,
| | - Jichang Han
- College of Food and Pharmaceutical Sciences, Ningbo University, Ningbo, China
- College of Marine Life Science, Ocean University of China, Qingdao, China
- Jichang Han,
| |
Collapse
|
3
|
Dohnálek J, Dušková J, Tishchenko G, Kolenko P, Skálová T, Novák P, Fejfarová K, Šimůnek J. Chitinase Chit62J4 Essential for Chitin Processing by Human Microbiome Bacterium Clostridium paraputrificum J4. Molecules 2021; 26:molecules26195978. [PMID: 34641521 PMCID: PMC8512545 DOI: 10.3390/molecules26195978] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 09/24/2021] [Accepted: 09/27/2021] [Indexed: 12/26/2022] Open
Abstract
Commensal bacterium Clostridium paraputrificum J4 produces several extracellular chitinolytic enzymes including a 62 kDa chitinase Chit62J4 active toward 4-nitrophenyl N,N'-diacetyl-β-d-chitobioside (pNGG). We characterized the crude enzyme from bacterial culture fluid, recombinant enzyme rChit62J4, and its catalytic domain rChit62J4cat. This major chitinase, securing nutrition of the bacterium in the human intestinal tract when supplied with chitin, has a pH optimum of 5.5 and processes pNGG with Km = 0.24 mM and kcat = 30.0 s-1. Sequence comparison of the amino acid sequence of Chit62J4, determined during bacterial genome sequencing, characterizes the enzyme as a family 18 glycosyl hydrolase with a four-domain structure. The catalytic domain has the typical TIM barrel structure and the accessory domains-2x Fn3/Big3 and a carbohydrate binding module-that likely supports enzyme activity on chitin fibers. The catalytic domain is highly homologous to a single-domain chitinase of Bacillus cereus NCTU2. However, the catalytic profiles significantly differ between the two enzymes despite almost identical catalytic sites. The shift of pI and pH optimum of the commensal enzyme toward acidic values compared to the soil bacterium is the likely environmental adaptation that provides C. paraputrificum J4 a competitive advantage over other commensal bacteria.
Collapse
Affiliation(s)
- Jan Dohnálek
- Laboratory of Structure and Function of Biomolecules, Institute of Biotechnology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic; (J.D.); (P.K.); (T.S.); (K.F.)
- Correspondence: ; Tel.: +420-325-873-758; Fax: +420-325-873-710
| | - Jarmila Dušková
- Laboratory of Structure and Function of Biomolecules, Institute of Biotechnology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic; (J.D.); (P.K.); (T.S.); (K.F.)
| | - Galina Tishchenko
- Department of Structural Analysis of Biomacromolecules, Institute of Macromolecular Chemistry of the Czech Academy of Sciences, v. v. i., Heyrovsky Sq. 2, 162 06 Prague, Czech Republic;
| | - Petr Kolenko
- Laboratory of Structure and Function of Biomolecules, Institute of Biotechnology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic; (J.D.); (P.K.); (T.S.); (K.F.)
| | - Tereza Skálová
- Laboratory of Structure and Function of Biomolecules, Institute of Biotechnology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic; (J.D.); (P.K.); (T.S.); (K.F.)
| | - Petr Novák
- Laboratory of Structural Biology and Cell Signaling, Institute of Microbiology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic;
| | - Karla Fejfarová
- Laboratory of Structure and Function of Biomolecules, Institute of Biotechnology of the Czech Academy of Sciences, v. v. i., Biocev, Průmyslová 595, 252 50 Vestec, Czech Republic; (J.D.); (P.K.); (T.S.); (K.F.)
| | - Jiří Šimůnek
- Laboratory of Anaerobic Microbiology, Institute of Animal Physiology and Genetics of the Czech Academy of Sciences, v. v. i., Vídeňská 1083, 142 00 Prague, Czech Republic;
| |
Collapse
|
4
|
Tien NQD, Ma X, Man LQ, Chi DTK, Huy NX, Nhut DT, Rombauts S, Ut T, Loc NH. De novo whole-genome assembly and discovery of genes involved in triterpenoid saponin biosynthesis of Vietnamese ginseng ( Panax vietnamensis Ha et Grushv.). PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2021; 27:2215-2229. [PMID: 34744362 PMCID: PMC8526660 DOI: 10.1007/s12298-021-01076-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 09/13/2021] [Accepted: 09/17/2021] [Indexed: 05/04/2023]
Abstract
UNLABELLED Vietnamese ginseng (Panax vietnamensis Ha et Grushv.), also known as Ngoc Linh ginseng, is a high-value herb in Vietnam. Vietnamese ginseng has been proven to be effective in enhancing the immune system, human memory, anti-stress, anti-inflammatory, anti-cancer, and prevent aging. The present study reports the first draft whole-genome of Vietnamese ginseng and the identification of potential genes involved in the triterpenoid metabolic pathway. De novo whole-genome assembly was performed successfully from a data of approximately 139 Gbps of 394,802,120 high quality reads to generate 9815 scaffolds with an N50 value of 572,722 bp from the leaf of Vietnamese ginseng. The assembled genome of Vietnamese ginseng is 3,001,967,204 bp long containing 79,374 gene models. Among them, there are 55,012 genes (69.30%) were annotated by various public molecular biology databases. The potential genes involved in triterpenoid saponin biosynthesis in Vietnamese ginseng and their metabolic pathway were also predicted." Three genes encoding squalene monooxygenase isozymes in Vietnamese ginseng were cloned, sequenced and characterized. Moreover, expression levels of several key genes involved in terpenoid biosynthesis in different parts of Vietnamese ginseng were also analyzed. The SSR markers were detected by various programs from both of assembly full dataset of Vietnamese ginseng genome and predicted genes. The present work provided important data of the draft whole-genome of Vietnamese ginseng for further studies to understand the role of genes involved in ginsenoside biosynthesis and their metabolic pathway at the molecular level of this rare medicinal species. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s12298-021-01076-1.
Collapse
Affiliation(s)
- Nguyen Quang Duc Tien
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
- Department of Biology, Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | - Xiao Ma
- VIB-UGent Center for Plant Systems Biology, Ghent University, 9000 Ghent, Belgium
| | - Le Quang Man
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | - Duong Thi Kim Chi
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | | | - Duong-Tan Nhut
- Tay Nguyen Institute of Scientific Research, Vietnam Academy of Science and Technology, Dalat, 670000 Vietnam
| | - Stephane Rombauts
- VIB-UGent Center for Plant Systems Biology, Ghent University, 9000 Ghent, Belgium
| | - Tran Ut
- Ngoc Linh Ginseng and Medicinal Materials Development Center, Quang Nam Quang Ngai, 51000 Vietnam
| | - Nguyen Hoang Loc
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
- Department of Biology, Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| |
Collapse
|
5
|
Cao X, Wang Y, Rogers J, Hartson S, Kanost MR, Jiang H. Changes in composition and levels of hemolymph proteins during metamorphosis of Manduca sexta. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2020; 127:103489. [PMID: 33096211 PMCID: PMC7704632 DOI: 10.1016/j.ibmb.2020.103489] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/08/2020] [Accepted: 10/10/2020] [Indexed: 06/11/2023]
Abstract
The tobacco hornworm, Manduca sexta, is a lepidopteran model species widely used to study insect biochemical processes. Some of its larval hemolymph proteins are well studied, and a detailed proteomic analysis of larval plasma proteins became available in 2016, revealing features such as correlation with transcriptome data, formation of immune complexes, and constitution of an immune signaling system in hemolymph. It is unclear how the composition of these proteins may change in other developmental stages. In this paper, we report the proteomes of cell-free hemolymph from prepupae, pupae on day 4 and day 13, and young adults. Of the 1824 proteins identified, 907 have a signal peptide and 410 are related to immunity. Drastic changes in abundance of the storage proteins, lipophorins and vitellogenin, for instance, reflect physiological differences among prepupae, pupae, and adults. Considerably more proteins lacking signal peptide are present in the late pupae, suggesting that plasma contains relatively low concentrations of intracellular components released from remodeling tissues during metamorphosis. The defense proteins detected include 43 serine proteases and 11 serine protease homologs. Some of these proteins are members of the extracellular immune signaling network found in feeding larvae, and others may play additional roles and hence confer new features in the later life stages. In summary, the proteins and their levels revealed in this study, together with their transcriptome data, are expected to stimulate focused explorations of humoral immunity and other physiological systems in wandering larvae, pupae, and adults of M. sexta and shed light upon functional and comparative genomic research in other holometabolous insects.
Collapse
Affiliation(s)
- Xiaolong Cao
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK, 74078, USA
| | - Yang Wang
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK, 74078, USA
| | - Janet Rogers
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, 74078, USA
| | - Steve Hartson
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, 74078, USA
| | - Michael R Kanost
- Department of Biochemistry and Molecular Biophysics, Kansas State University, Manhattan, KS, 66506, USA
| | - Haobo Jiang
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK, 74078, USA.
| |
Collapse
|
6
|
Wu C, Li Q, Xing R, Fan GL. Using the Chou’s Pseudo Component to Predict the ncRNA Locations Based on the Improved K-Nearest Neighbor (iKNN) Classifier. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191003142406] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The non-coding RNA identification at the organelle genome level is a
challenging task. In our previous work, an ncRNA dataset with less than 80% sequence identity
was built, and a method incorporating an increment of diversity combining with support vector
machine method was proposed.
Objective:
Based on the ncRNA_361 dataset, a novel decision-making method-an improved
KNN (iKNN) classifier was proposed.
Methods:
In this paper, based on the iKNN algorithm, the physicochemical features of nucleotides,
the degeneracy of genetic codons, and topological secondary structure were selected to represent
the effective ncRNA characters. Then, the incremental feature selection method was utilized to optimize
the feature set.
Results:
The results of iKNN indicated that the decision-making method of mean value is distinctly
superior to the traditional decision-making method of majority vote the Increment of Diversity
Combining Support Vector Machine (ID-SVM). The iKNN algorithm achieved an overall accuracy
of 97.368% in the jackknife test, when k=3.
Conclusion:
It should be noted that the triplets of the structure-sequence mode under reading
frames not only contains the entire sequence information but also reflects whether the base was
paired or not, and the secondary structural topological parameters further describe the ncRNA secondary
structure on the spatial level. The ncRNA dataset and the iKNN classifier are freely available
at http://202.207.14.87:8032/fuwu/iKNN/index.asp.
Collapse
Affiliation(s)
- Chengyan Wu
- Baotou Teacher’s College, Inner Mongolia University of Science and Technology, Baotou, China
| | - Qianzhong Li
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| | - Ru Xing
- Baotou Teacher’s College, Inner Mongolia University of Science and Technology, Baotou, China
| | - Guo-Liang Fan
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| |
Collapse
|
7
|
Hussain W, Rasool N, Khan YD. Insights into Machine Learning-based Approaches for Virtual Screening in Drug Discovery: Existing Strategies and Streamlining Through FP-CADD. Curr Drug Discov Technol 2020; 18:463-472. [PMID: 32767944 DOI: 10.2174/1570163817666200806165934] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 07/01/2020] [Accepted: 07/03/2020] [Indexed: 11/22/2022]
Abstract
BACKGROUND Machine learning is an active area of research in computer science by the availability of big data collection of all sorts prompting interest in the development of novel tools for data mining. Machine learning methods have wide applications in computer-aided drug discovery methods. Most incredible approaches to machine learning are used in drug designing, which further aid the process of biological modelling in drug discovery. Mainly, two main categories are present which are Ligand-Based Virtual Screening (LBVS) and Structure-Based Virtual Screening (SBVS), however, the machine learning approaches fall mostly in the category of LBVS. OBJECTIVES This study exposits the major machine learning approaches being used in LBVS. Moreover, we have introduced a protocol named FP-CADD which depicts a 4-steps rule of thumb for drug discovery, the four protocols of computer-aided drug discovery (FP-CADD). Various important aspects along with SWOT analysis of FP-CADD are also discussed in this article. CONCLUSION By this thorough study, we have observed that in LBVS algorithms, Support Vector Machines (SVM) and Random Forest (RF) are those which are widely used due to high accuracy and efficiency. These virtual screening approaches have the potential to revolutionize the drug designing field. Also, we believe that the process flow presented in this study, named FP-CADD, can streamline the whole process of computer-aided drug discovery. By adopting this rule, the studies related to drug discovery can be made homogeneous and this protocol can also be considered as an evaluation criterion in the peer-review process of research articles.
Collapse
Affiliation(s)
| | | | - Yaser Daanial Khan
- Department of Computer Science, University of Management and Technology, Lahore, Pakistan
| |
Collapse
|
8
|
Zhang WX, Pan X, Shen HB. Signal-3L 3.0: Improving Signal Peptide Prediction through Combining Attention Deep Learning with Window-Based Scoring. J Chem Inf Model 2020; 60:3679-3686. [PMID: 32501689 DOI: 10.1021/acs.jcim.0c00401] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Signal peptides play an important role in guiding and transferring transmembrane proteins and secreted proteins. In recent years, with the explosive growth of protein sequences, computationally predicting signal peptides and their cleavage sites from protein sequences is highly desired. In this work, we present an improved approach, Signal-3L 3.0, for signal peptide recognition and cleavage-site prediction using a 3-layer hybrid method of integrating deep learning algorithms and window-based scoring. There are three main components in the Signal-3L 3.0 prediction engine: (1) a deep bidirectional long short-term memory (Bi-LSTM) network with a soft self-attention learns abstract features from sequences to determine whether a query protein contains a signal peptide; (2) the statistics propensity window-based cleavage site screening method is applied to generate the set of candidate cleavage sites; (3) the prediction of a conditional random field with a hybrid convolutional neural network (CNN) and Bi-LSTM is fused with the window-based score for identifying the final unique cleavage site. Experimental results on the benchmark datasets show that the new deep learning-driven Signal-3L 3.0 yields promising performance. The online server of Signal-3L 3.0 is available at http://www.csbio.sjtu.edu.cn/bioinf/Signal-3L/.
Collapse
Affiliation(s)
- Wei-Xun Zhang
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai Jiao Tong University, and Institute of Image Processing and Pattern Recognition, Shanghai 200240, China
| | - Xiaoyong Pan
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai Jiao Tong University, and Institute of Image Processing and Pattern Recognition, Shanghai 200240, China
| | - Hong-Bin Shen
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai Jiao Tong University, and Institute of Image Processing and Pattern Recognition, Shanghai 200240, China
| |
Collapse
|
9
|
Mbadinga Mbadinga DL, Li Q, Ranocha P, Martinez Y, Dunand C. Global analysis of non-animal peroxidases provides insights into the evolution of this gene family in the green lineage. JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:3350-3360. [PMID: 32185389 DOI: 10.1093/jxb/eraa141] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 03/13/2020] [Indexed: 05/13/2023]
Abstract
The non-animal peroxidases belong to a superfamily of oxidoreductases that reduce hydrogen peroxide and oxidize numerous substrates. Since their initial characterization in 1992, a number of studies have provided an understanding of the origin and evolution of this protein family. Here, we report a comprehensive evolutionary analysis of non-animal peroxidases using integrated in silico and biochemical approaches. Thanks to the availability of numerous genomic sequences from more than 2500 species belonging to 14 kingdoms together with expert and comprehensive annotation of peroxidase sequences that have been centralized in a dedicated database, we have been able to use phylogenetic reconstructions to increase our understanding of the evolutionary processes underlying the diversification of non-animal peroxidases. We analysed the distribution of all non-animal peroxidases in more than 200 eukaryotic organisms in silico. First, we show that the presence or absence of non-animal peroxidases correlates with the presence or absence of certain organelles or with specific biological processes. Examination of almost 2000 organisms determined that ascorbate peroxidases (APxs) and cytochrome c peroxidases (CcPs) are present in those containing chloroplasts and mitochondria, respectively. Plants, which contain both organelles, are an exception and contain only APxs without CcP. Class II peroxidases (CII Prxs) are only found in fungi with wood-decay and plant-degradation abilities. Class III peroxidases (CIII Prxs) are only found in streptophyte algae and land plants, and have been subjected to large family expansion. Biochemical activities of APx, CcP, and CIII Prx assessed using protein extracts from 30 different eukaryotic organisms support the distribution of the sequences resulting from our in silico analysis. The biochemical results confirmed both the presence and classification of the non-animal peroxidase encoding sequences.
Collapse
Affiliation(s)
| | - Qiang Li
- Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, Toulouse, France
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, Beibei, Chongqing, PR China
| | - Philippe Ranocha
- Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, Toulouse, France
| | - Yves Martinez
- Fédération de Recherche 3450, Plateforme Imagerie, Pôle de Biotechnologie Végétale, Castanet-Tolosan, France
| | - Christophe Dunand
- Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, Toulouse, France
| |
Collapse
|
10
|
Dautermann O, Lyska D, Andersen-Ranberg J, Becker M, Fröhlich-Nowoisky J, Gartmann H, Krämer LC, Mayr K, Pieper D, Rij LM, Wipf HML, Niyogi KK, Lohr M. An algal enzyme required for biosynthesis of the most abundant marine carotenoids. SCIENCE ADVANCES 2020; 6:eaaw9183. [PMID: 32181334 PMCID: PMC7056318 DOI: 10.1126/sciadv.aaw9183] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Accepted: 12/11/2019] [Indexed: 05/04/2023]
Abstract
Fucoxanthin and its derivatives are the main light-harvesting pigments in the photosynthetic apparatus of many chromalveolate algae and represent the most abundant carotenoids in the world's oceans, thus being major facilitators of marine primary production. A central step in fucoxanthin biosynthesis that has been elusive so far is the conversion of violaxanthin to neoxanthin. Here, we show that in chromalveolates, this reaction is catalyzed by violaxanthin de-epoxidase-like (VDL) proteins and that VDL is also involved in the formation of other light-harvesting carotenoids such as peridinin or vaucheriaxanthin. VDL is closely related to the photoprotective enzyme violaxanthin de-epoxidase that operates in plants and most algae, revealing that in major phyla of marine algae, an ancient gene duplication triggered the evolution of carotenoid functions beyond photoprotection toward light harvesting.
Collapse
Affiliation(s)
- O. Dautermann
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - D. Lyska
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - J. Andersen-Ranberg
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - M. Becker
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - J. Fröhlich-Nowoisky
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - H. Gartmann
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - L. C. Krämer
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - K. Mayr
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - D. Pieper
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - L. M. Rij
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| | - H. M.-L. Wipf
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - K. K. Niyogi
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - M. Lohr
- Institut für Molekulare Physiologie, Pflanzenbiochemie, Johannes Gutenberg-Universität, 55128 Mainz, Germany
| |
Collapse
|
11
|
Yang H, Ma Y, Zhao Y, Shen W, Chen X. Systematic engineering of transport and transcription to boost alkaline α-amylase production in Bacillus subtilis. Appl Microbiol Biotechnol 2020; 104:2973-2985. [DOI: 10.1007/s00253-020-10435-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 01/23/2020] [Accepted: 02/04/2020] [Indexed: 01/25/2023]
|
12
|
Thermotogales origin scenario of eukaryogenesis. J Theor Biol 2020; 492:110192. [PMID: 32044287 DOI: 10.1016/j.jtbi.2020.110192] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2019] [Accepted: 02/06/2020] [Indexed: 12/11/2022]
Abstract
How eukaryotes were generated is an enigma of evolutionary biology. Widely accepted archaeal-origin eukaryogenesis scenarios, based on similarities of genes and related characteristics between archaea and eukaryotes, cannot explain several eukaryote-specific features of the last eukaryotic common ancestor, such as glycerol-3-phosphate-type membrane lipids, large cells and genomes, and endomembrane formation. Thermotogales spheroids, having multicopy-integrated large nucleoids and producing progeny in periplasm, may explain all of these features as well as endoplasmic reticulum-type signal cleavage sites, although they cannot divide. We hypothesize that the progeny chromosome is formed by random joining small DNAs in immature progeny, followed by reorganization by mechanisms including homologous recombination enabled with multicopy-integrated large genome (MILG). We propose that Thermotogales ancestor spheroids came to divide owing to the archaeal cell division genes horizontally transferred via virus-related particles, forming the first eukaryotic common ancestor (FECA). Referring to the hypothesis, the archaeal information-processing system would have been established in FECA by random joining DNAs excised from the MILG, which contained horizontally transferred archaeal and bacterial DNAs, followed by reorganization by the MILG-enabled homologous recombination. Thus, the large genome may have been a prerequisite, but not a consequence, of eukaryogenesis. The random joining of DNAs likely provided the basic mechanisms for eukaryotic evolution: producing the diversity by the formations of supergroups, novel genes, and introns that are involved in exon shuffling.
Collapse
|
13
|
Malebary SJ, Rehman MSU, Khan YD. iCrotoK-PseAAC: Identify lysine crotonylation sites by blending position relative statistical features according to the Chou's 5-step rule. PLoS One 2019; 14:e0223993. [PMID: 31751380 PMCID: PMC6874067 DOI: 10.1371/journal.pone.0223993] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 10/02/2019] [Indexed: 01/22/2023] Open
Abstract
Among different post-translational modifications (PTMs), one of the most important one is the lysine crotonylation in proteins. Its importance cannot be undermined related to different diseases and essential biological practice. The key step for finding the hidden mechanisms of crotonylation along with their occurrence sites is to completely apprehend the mechanism behind this biological process. In previously reported studies, researchers have used different techniques, like position weighted matrix (PWM), support vector machine (SVM), k nearest neighbors (KNN), and many others. However, the maximum prediction accuracy achieved was not such high. To address this, herein, we propose an improved predictor for lysine crotonylation sites named iCrotoK-PseAAC, in which we have incorporated various position and composition relative features along with statistical moments into PseAAC. The results of self-consistency testing were 100% accurate, while the 10-fold cross validation gave 99.0% accuracy. Based on the validation and comparison of model, it is concluded that the iCrotoK-PseAAC is more accurate than the previously proposed models.
Collapse
Affiliation(s)
- Sharaf Jameel Malebary
- Department of Information Technology, King Abdul Aziz University, Rabigh, Kingdom of Saudi Arabia
| | - Muhammad Safi ur Rehman
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Yaser Daanial Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| |
Collapse
|
14
|
Chou KC. Advances in Predicting Subcellular Localization of Multi-label Proteins and its Implication for Developing Multi-target Drugs. Curr Med Chem 2019; 26:4918-4943. [PMID: 31060481 DOI: 10.2174/0929867326666190507082559] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 01/29/2019] [Accepted: 01/31/2019] [Indexed: 12/16/2022]
Abstract
The smallest unit of life is a cell, which contains numerous protein molecules. Most
of the functions critical to the cell’s survival are performed by these proteins located in its different
organelles, usually called ‘‘subcellular locations”. Information of subcellular localization
for a protein can provide useful clues about its function. To reveal the intricate pathways at the
cellular level, knowledge of the subcellular localization of proteins in a cell is prerequisite.
Therefore, one of the fundamental goals in molecular cell biology and proteomics is to determine
the subcellular locations of proteins in an entire cell. It is also indispensable for prioritizing
and selecting the right targets for drug development. Unfortunately, it is both timeconsuming
and costly to determine the subcellular locations of proteins purely based on experiments.
With the avalanche of protein sequences generated in the post-genomic age, it is highly
desired to develop computational methods for rapidly and effectively identifying the subcellular
locations of uncharacterized proteins based on their sequences information alone. Actually,
considerable progresses have been achieved in this regard. This review is focused on those
methods, which have the capacity to deal with multi-label proteins that may simultaneously
exist in two or more subcellular location sites. Protein molecules with this kind of characteristic
are vitally important for finding multi-target drugs, a current hot trend in drug development.
Focused in this review are also those methods that have use-friendly web-servers established so
that the majority of experimental scientists can use them to get the desired results without the
need to go through the detailed mathematics involved.
Collapse
Affiliation(s)
- Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, United States
| |
Collapse
|
15
|
Abstract
The smallest unit of life is a cell, which contains numerous protein molecules. Most
of the functions critical to the cell’s survival are performed by these proteins located in its different
organelles, usually called ‘‘subcellular locations”. Information of subcellular localization
for a protein can provide useful clues about its function. To reveal the intricate pathways at the
cellular level, knowledge of the subcellular localization of proteins in a cell is prerequisite.
Therefore, one of the fundamental goals in molecular cell biology and proteomics is to determine
the subcellular locations of proteins in an entire cell. It is also indispensable for prioritizing
and selecting the right targets for drug development. Unfortunately, it is both timeconsuming
and costly to determine the subcellular locations of proteins purely based on experiments.
With the avalanche of protein sequences generated in the post-genomic age, it is highly
desired to develop computational methods for rapidly and effectively identifying the subcellular
locations of uncharacterized proteins based on their sequences information alone. Actually,
considerable progresses have been achieved in this regard. This review is focused on those
methods, which have the capacity to deal with multi-label proteins that may simultaneously
exist in two or more subcellular location sites. Protein molecules with this kind of characteristic
are vitally important for finding multi-target drugs, a current hot trend in drug development.
Focused in this review are also those methods that have use-friendly web-servers established so
that the majority of experimental scientists can use them to get the desired results without the
need to go through the detailed mathematics involved.
Collapse
Affiliation(s)
- Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, United States
| |
Collapse
|
16
|
Carbo M, Brandi V, Pascarella G, Staid DS, Colotti G, Polticelli F, Ilari A, Morea V. Bioinformatics analysis of Ras homologue enriched in the striatum, a potential target for Huntington's disease therapy. Int J Mol Med 2019; 44:2223-2233. [PMID: 31638189 PMCID: PMC6844632 DOI: 10.3892/ijmm.2019.4373] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 08/19/2019] [Indexed: 11/15/2022] Open
Abstract
Huntington's disease (HD) is a lethal neurodegenerative disorder for which no cure is available yet. It is caused by abnormal expansion of a CAG triplet in the gene encoding the huntingtin protein (Htt), with consequent expansion of a polyglutamine repeat in mutated Htt (mHtt). This makes mHtt highly unstable and aggregation prone. Soluble mHtt is linked to cytotoxicity and neurotoxicity, whereas mHtt aggregates are thought to be neuroprotective. While Htt and mHtt are ubiquitously expressed throughout the brain and peripheral tissues, HD is characterized by selective degradation of the corpus striatum, without notable alterations in peripheral tissues. Screening for mRNAs preferentially expressed in rodent striatum led to the discovery of a GTP binding protein homologous to Ras family members. Due to these features, the newly discovered protein was termed Ras Homolog Enriched in Striatum (RHES). The aetiological role of RHES in HD has been ascribed to its small ubiquitin-like modifier (SUMO)-E3 ligase function. RHES sumoylates mHtt with higher efficiency than wild-type Htt, thereby protecting mHtt from degradation and increasing the amounts of the soluble form. Although RHES is an attractive target for HD treatment, essential information about protein structure and function are still missing. With the aim of investigating RHES 3D structure and function, bioinformatic analyses and molecular modelling have been performed in the present study, based on which, RHES regions predicted to be involved in the interaction with mHtt or the SUMO-E2 ligase Ubc9 have been identified. These regions have been used to design peptides aimed at inhibiting RHES interactions and, therefore, mHtt sumoylation; in turn, these peptides will be used to develop small molecule inhibitors by both rational design and virtual screening of large compound libraries. Once identified, RHES sumoylation inhibitors may open the road to the development of therapeutic agents against the severe, and currently untreatable, HD.
Collapse
Affiliation(s)
- Miriam Carbo
- Department of Biochemical Sciences 'A. Rossi Fanelli', Sapienza University, I‑00185 Rome, Italy
| | - Valentina Brandi
- Department of Sciences, Roma Tre University, I‑00159 Rome, Italy
| | - Gianmarco Pascarella
- Department of Biochemical Sciences 'A. Rossi Fanelli', Sapienza University, I‑00185 Rome, Italy
| | - David Sasah Staid
- Department of Biochemical Sciences 'A. Rossi Fanelli', Sapienza University, I‑00185 Rome, Italy
| | - Gianni Colotti
- Institute of Molecular Biology and Pathology of The National Research Council of Italy, I‑00185 Rome, Italy
| | - Fabio Polticelli
- Department of Sciences, Roma Tre University, I‑00159 Rome, Italy
| | - Andrea Ilari
- Institute of Molecular Biology and Pathology of The National Research Council of Italy, I‑00185 Rome, Italy
| | - Veronica Morea
- Institute of Molecular Biology and Pathology of The National Research Council of Italy, I‑00185 Rome, Italy
| |
Collapse
|
17
|
Karim KMR, Husaini A, Sing NN, Tasnim T, Mohd Sinang F, Hussain H, Hossain MA, Roslan H. Characterization and expression in Pichia pastoris of a raw starch degrading glucoamylase (GA2) derived from Aspergillus flavus NSH9. Protein Expr Purif 2019; 164:105462. [PMID: 31351992 DOI: 10.1016/j.pep.2019.105462] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 06/19/2019] [Accepted: 07/24/2019] [Indexed: 11/25/2022]
Abstract
The Aspergillus flavus NSH9 gene, encoding a pH and thermostable glucoamylase with a starch binding domain (SBD), was expressed in Pichia pastoris to produce recombinant glucoamylase (rGA2). The full-length glucoamylase gene (2039 bp), and cDNA (1839 bp) encode a 612 amino acid protein most similar to glucoamylase from Aspergillus oryzae RIB40; the first 19 amino acids are presumed to be a signal peptide for secretion, and the SBD is at the C-terminal. The cDNA was successfully secreted by Pichia at 8.23 U mL-1, and the rGA2 was found to be: a 80 kDa monomer, stable from pH 3.0-9.0, with optimum catalytic activity at pH 5.0, active at temperatures up to 80°C (rGA2 retained 58% of its activity after 60 min of incubation at 70°C), and metal ions such as Na+, K+, Ca++ and Mg++ enhanced rGA2 enzyme activity. The starch degrading ability of rGA2 was also observed on raw sago starch and where prolonged incubation generated larger, deeper, holes on the starch granules, indicating rGA2 is an excellent candidate for industrial starch processing applications.
Collapse
Affiliation(s)
| | - Ahmad Husaini
- Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota, Samarahan, Sarawak, Malaysia.
| | - Ngieng Ngui Sing
- Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota, Samarahan, Sarawak, Malaysia
| | - Tasmia Tasnim
- Department of Nutrition and Food Engineering, Daffodil International University, Dhaka, 1207, Bangladesh
| | - Fazia Mohd Sinang
- Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota, Samarahan, Sarawak, Malaysia
| | - Hasnain Hussain
- Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota, Samarahan, Sarawak, Malaysia
| | - Md Anowar Hossain
- Department of Biochemistry and Molecular Biology, University of Rajshahi, Rajshahi, 6205, Bangladesh
| | - Hairul Roslan
- Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota, Samarahan, Sarawak, Malaysia
| |
Collapse
|
18
|
Liao HL, Bonito G, Rojas JA, Hameed K, Wu S, Schadt CW, Labbé J, Tuskan GA, Martin F, Grigoriev IV, Vilgalys R. Fungal Endophytes of Populus trichocarpa Alter Host Phenotype, Gene Expression, and Rhizobiome Composition. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2019; 32:853-864. [PMID: 30699306 DOI: 10.1094/mpmi-05-18-0133-r] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Mortierella and Ilyonectria genera include common species of soil fungi that are frequently detected as root endophytes in many plants, including Populus spp. However, the ecological roles of these and other endophytic fungi with respect to plant growth and function are still not well understood. The functional ecology of two key taxa from the P. trichocarpa rhizobiome, M. elongata PMI93 and I. europaea PMI82, was studied by coupling forest soil bioassays with environmental metatranscriptomics. Using soil bioassay experiments amended with fungal inoculants, M. elongata was observed to promote the growth of P. trichocarpa. This response was cultivar independent. In contrast, I. europaea had no visible effect on P. trichocarpa growth. Metatranscriptomic studies revealed that these fungi impacted rhizophytic and endophytic activities in P. trichocarpa and induced shifts in soil and root microbial communities. Differential expression of core genes in P. trichocarpa roots was observed in response to both fungal species. Expression of P. trichocarpa genes for lipid signaling and nutrient uptake were upregulated, and expression of genes associated with gibberellin signaling were altered in plants inoculated with M. elongata, but not I. europaea. Upregulation of genes for growth promotion, downregulation of genes for several leucine-rich repeat receptor kinases, and alteration of expression of genes associated with plant defense responses (e.g., jasmonic acid, salicylic acid, and ethylene signal pathways) also suggest that M. elongata manipulates plant defenses while promoting plant growth.
Collapse
Affiliation(s)
- Hui-Ling Liao
- 1 North Florida Research and Education Center, University of Florida, 155 Research Road, Quincy, FL 32351, U.S.A
| | - Gregory Bonito
- 2 Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, U.S.A
| | - J Alejandro Rojas
- 3 Department of Biology, Duke University, Durham, NC, U.S.A
- 4 Plant Pathology Department, University of Arkansas, 211 PTSC-Fayetteville, AR 72701, U.S.A
| | - Khalid Hameed
- 3 Department of Biology, Duke University, Durham, NC, U.S.A
| | - Steven Wu
- 5 Independent Researcher, Davis, CA, U.S.A
| | - Christopher W Schadt
- 6 Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37830, U.S.A
| | - Jessy Labbé
- 6 Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37830, U.S.A
| | - Gerald A Tuskan
- 6 Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37830, U.S.A
| | - Francis Martin
- 7 INRA, UMR 1136 INRA-University of Lorraine, Interactions Arbres/Microorganismes, Laboratory of Excellence ARBRE, INRA-Nancy, 54280, Champenoux, France
| | - Igor V Grigoriev
- 8 U.S. Department of Energy Joint Genome Institute and Department of Plant and Microbial Biology, University of California Berkeley, 2800 Mitchell Drive, Walnut Creek, CA 94598, U.S.A
| | - Rytas Vilgalys
- 3 Department of Biology, Duke University, Durham, NC, U.S.A
| |
Collapse
|
19
|
Abstract
Ever since the signal hypothesis was proposed in 1971, the exact nature of signal peptides has been a focus point of research. The prediction of signal peptides and protein subcellular location from amino acid sequences has been an important problem in bioinformatics since the dawn of this research field, involving many statistical and machine learning technologies. In this review, we provide a historical account of how position-weight matrices, artificial neural networks, hidden Markov models, support vector machines and, lately, deep learning techniques have been used in the attempts to predict where proteins go. Because the secretory pathway was the first one to be studied both experimentally and through bioinformatics, our main focus is on the historical development of prediction methods for signal peptides that target proteins for secretion; prediction methods to identify targeting signals for other cellular compartments are treated in less detail.
Collapse
Affiliation(s)
- Henrik Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark.
| | - Konstantinos D Tsirigos
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Søren Brunak
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
- Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Gunnar von Heijne
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
- Science for Life Laboratory, Stockholm University, Solna, Sweden
| |
Collapse
|
20
|
Muhammed MT, Son ÇD, İzgü F. Three dimensional structure prediction of panomycocin, a novel Exo-β-1,3-glucanase isolated from Wickerhamomyces anomalus NCYC 434 and the computational site-directed mutagenesis studies to enhance its thermal stability for therapeutic applications. Comput Biol Chem 2019; 80:270-277. [PMID: 31054539 DOI: 10.1016/j.compbiolchem.2019.04.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 04/04/2019] [Accepted: 04/09/2019] [Indexed: 10/26/2022]
Abstract
Panomycocin is a naturally produced potent antimycotic/antifungal protein secreted by the yeast Wickerhamomyces anomalus NCYC 434 with an exo-β-1,3-glucanase activity. In this study the three dimensional structure of panomycocin was predicted and the computational site-directed mutagenesis was performed to enhance its thermal stability in liquid formulations over the body temperature for topical therapeutic applications. Homology modeling was performed with MODELLER and I-TASSER. Among the generated models, the model with the lowest energy and DOPE score was selected for further loop modeling. The loop model was optimized and the reliability of the model was confirmed with ERRAT, Verify 3D and Ramachandran plot values. Enhancement of the thermal stability of the model was done using contemporary servers and programs such as SPDBViewer, CNA, I-Mutant2.0, Eris, AUTO-MUTE and MUpro. In the region outside the binding site of the model Leu52 Arg, Phe223Arg and Gly254Arg were found to be the best thermostabilizing mutations with 6.26 K, 6.26 K and 8.27 K increases, respectively. In the binding site Glu186Arg was found to be the best thermostabilizer mutation with a 9.58 K temperature increase. The results obtained in this study led us to design a mutant panomycocin that can be used as a novel antimycotic/antifungal drug in a liquid formulation for topical applications over the normal body temperature.
Collapse
Affiliation(s)
- Muhammed Tilahun Muhammed
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Suleyman Demirel University, Isparta, Turkey
| | - Çağdaş Devrim Son
- Department of Molecular Biology and Genetics, Middle East Technical University, Ankara, Turkey
| | - Fatih İzgü
- Department of Molecular Biology and Genetics, Middle East Technical University, Ankara, Turkey.
| |
Collapse
|
21
|
SPrenylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins. J Theor Biol 2019; 468:1-11. [DOI: 10.1016/j.jtbi.2019.02.007] [Citation(s) in RCA: 98] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 02/07/2019] [Accepted: 02/11/2019] [Indexed: 11/22/2022]
|
22
|
The classical NLRP3 inflammasome controls FADD unconventional secretion through microvesicle shedding. Cell Death Dis 2019; 10:190. [PMID: 30804327 PMCID: PMC6389912 DOI: 10.1038/s41419-019-1412-9] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 01/11/2019] [Accepted: 01/22/2019] [Indexed: 12/20/2022]
Abstract
Fas-associated death domain (FADD) is a key adaptor molecule involved in numerous physiological processes including cell death, proliferation, innate immunity and inflammation. Therefore, changes in FADD expression have dramatic cellular consequences. In mice and humans, FADD regulation can occur through protein secretion. However, the molecular mechanisms accounting for human FADD secretion were still unknown. Here we report that canonical, non-canonical, but not alternative, NLRP3 inflammasome activation in human monocytes/macrophages induced FADD secretion. NLRP3 inflammasome activation by the bacterial toxin nigericin led to the proinflammatory interleukin-1β (IL-1β) release and to the induction of cell death by pyroptosis. However, we showed that FADD secretion could occur in absence of increased IL-1β release and pyroptosis and, reciprocally, that IL-1β release and pyroptosis could occur in absence of FADD secretion. Especially, FADD, but not IL-1β, secretion following NLRP3 inflammasome activation required extracellular glucose. Thus, FADD secretion was an active process distinct from unspecific release of proteins during pyroptosis. This FADD secretion process required K+ efflux, NLRP3 sensor, ASC adaptor and CASPASE-1 molecule. Moreover, we identified FADD as a leaderless protein unconventionally secreted through microvesicle shedding, but not exosome release. Finally, we established human soluble FADD as a new marker of joint inflammation in gout and rheumatoid arthritis, two rheumatic diseases involving the NLRP3 inflammasome. Whether soluble FADD could be an actor in these diseases remains to be determined. Nevertheless, our results advance our understanding of the mechanisms contributing to the regulation of the FADD protein expression in human cells.
Collapse
|
23
|
Khan SA, Khan YD, Ahmad S, Allehaibi KH. N-MyristoylG-PseAAC: Sequence-based Prediction of N-Myristoyl Glycine Sites in Proteins by Integration of PseAAC and Statistical Moments. LETT ORG CHEM 2019. [DOI: 10.2174/1570178616666181217153958] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
N-Myristoylation, an irreversible protein modification, occurs by the covalent attachment of myristate with the N-terminal glycine of the eukaryotic and viral proteins, and is associated with a variety of pathogens and disease-related proteins. Identification of myristoylation sites through experimental mechanisms can be costly, labour associated and time-consuming. Due to the association of N-myristoylation with various diseases, its timely prediction can help in diagnosing and controlling the associated fatal diseases. Herein, we present a method named N-MyristoylG-PseAAC in which we have incorporated PseAAC with statistical moments for the prediction of N-Myristoyl Glycine (NMG) sites. A benchmark dataset of 893 positive and 1093 negative samples was collected and used in this study. For feature vector, various position and composition relative features along with the statistical moments were calculated. Later on, a back propagation neural network was trained using feature vectors and scaled conjugate gradient descent with adaptive learning was used as an optimizer. Selfconsistency testing and 10-fold cross-validation were performed to evaluate the performance of N-MyristoylG-PseAAC, by using accuracy metrics. For self-consistency testing, 99.80% Acc, 99.78% Sp, 99.81% Sn and 0.99 MCC were observed, whereas, for 10-fold cross validation, 97.18% Acc, 98.54% Sp, 96.07% Sn and 0.94 MCC were observed. Thus, it was found that the proposed predictor can help in predicting the myristoylation sites in an efficient and accurate way.
Collapse
Affiliation(s)
- Sher Afzal Khan
- Department of Information Technology, Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Yaser Daanial Khan
- Department of Computer Science, University of Management Technology, Lahore, Pakistan
| | - Shakeel Ahmad
- Department of Computer Sciences, FCITR, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Khalid H. Allehaibi
- Department of Computer Sciences, FCIT, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
24
|
Cao X, Jiang H. Building a platform for predicting functions of serine protease-related proteins in Drosophila melanogaster and other insects. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2018; 103:53-69. [PMID: 30367934 PMCID: PMC6358214 DOI: 10.1016/j.ibmb.2018.10.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Revised: 10/19/2018] [Accepted: 10/21/2018] [Indexed: 05/15/2023]
Abstract
Serine proteases (SPs) and serine protease homologs (SPHs) play essential roles in insect physiological processes including digestion, defense and development. Studies of insect genomes, transcriptomes and proteomes have generated a vast amount of information on these proteins, dwarfing the biological data acquired from a few model species. The large number and high diversity of homologous sequences makes it a challenge to use the limited functional information for making predictions across a broad taxonomic group of insects. In this work, we have extensively updated the framework of knowledge on the SP-related proteins in Drosophila melanogaster by identifying 52 new SPs/SPHs, classifying the 257 proteins into four groups (CLIP, gut, single- and multi-domain SPs/SPHs), and detecting inherent connections among phylogenetic relationships, genomic locations and expression profiles for 99 of the genes. Information on the existence of specific proteins in eggs, larvae, pupae and adults is presented to facilitate future research. More importantly, we have developed an approach to reveal close homologous or orthologous relationships among SPs/SPHs from D. melanogaster, Anopheles gambiae, Apis mellifera, Manduca sexta, and Tribolium castaneum thus inspiring functional studies in these and other holometabolous insects. This approach is useful for tackling similar problems on large and diverse protein families in other groups of organisms.
Collapse
Affiliation(s)
- Xiaolong Cao
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK, 74078, USA
| | - Haobo Jiang
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK, 74078, USA.
| |
Collapse
|
25
|
Burdukiewicz M, Sobczyk P, Chilimoniuk J, Gagat P, Mackiewicz P. Prediction of Signal Peptides in Proteins from Malaria Parasites. Int J Mol Sci 2018; 19:E3709. [PMID: 30469512 PMCID: PMC6321056 DOI: 10.3390/ijms19123709] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 11/15/2018] [Accepted: 11/17/2018] [Indexed: 01/08/2023] Open
Abstract
Signal peptides are N-terminal presequences responsible for targeting proteins to the endomembrane system, and subsequent subcellular or extracellular compartments, and consequently condition their proper function. The significance of signal peptides stimulates development of new computational methods for their detection. These methods employ learning systems trained on datasets comprising signal peptides from different types of proteins and taxonomic groups. As a result, the accuracy of predictions are high in the case of signal peptides that are well-represented in databases, but might be low in other, atypical cases. Such atypical signal peptides are present in proteins found in apicomplexan parasites, causative agents of malaria and toxoplasmosis. Apicomplexan proteins have a unique amino acid composition due to their AT-biased genomes. Therefore, we designed a new, more flexible and universal probabilistic model for recognition of atypical eukaryotic signal peptides. Our approach called signalHsmm includes knowledge about the structure of signal peptides and physicochemical properties of amino acids. It is able to recognize signal peptides from the malaria parasites and related species more accurately than popular programs. Moreover, it is still universal enough to provide prediction of other signal peptides on par with the best preforming predictors.
Collapse
Affiliation(s)
- Michał Burdukiewicz
- Faculty of Mathematics and Information Science, Warsaw University of Technology, 00-661 Warszawa, Poland.
| | - Piotr Sobczyk
- Department of Mathematics, Wrocław University of Technology, 50-370 Wrocław, Poland.
| | | | - Przemysław Gagat
- Department of Genomics, University of Wrocław, 50-383 Wrocław, Poland.
| | - Paweł Mackiewicz
- Department of Genomics, University of Wrocław, 50-383 Wrocław, Poland.
| |
Collapse
|
26
|
Kolahi M, Yazdi M, Goldson-Barnaby A, Tabandeh MR. In silico prediction, phylogenetic and bioinformatic analysis of SoPCS gene, survey of its protein characterization and gene expression in response to cadmium in Saccharum officinarum. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2018; 163:7-18. [PMID: 30031266 DOI: 10.1016/j.ecoenv.2018.07.032] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Revised: 07/02/2018] [Accepted: 07/08/2018] [Indexed: 06/08/2023]
Abstract
Phytochelatin synthase isolated from microorganisms, yeasts, algae and plant, serve a fundamental role in reducing heavy metals. In this research the in silico PCS gene structure (SoPCS) of sugarcane, its secondary and 3D protein structure, physicochemical properties, cell localization and phylogenetic tree were predicted utilizing bioinformatics tools. SoPCS expression in the leaves and roots of sugarcane in tissue culture treated with cadmium was also studied utilizing real time PCR. The predicted SoPCS gene contains 1524 nucleotides, a protein encoded with 508 amino acids of which the molecular weight is 55953.3 Da, 6 exons and 5 introns. The subcellular position of the enzyme is mitochondrion or cytoplasmic. Two domains belonging to the phytochelatin synthase family with similar features was found in Pfam having more than 97% similarity with the predicted SoPCS protein. Phylogeny analyses of plant species were well isolated from other organisms. Ten disulfide-bonded cysteines were excluded from the structure of SoPCS. The predicted 3D structure of SoPCS showed that it is able to bind to L-gamma-glutamylcysteine as substrate. The binding site sequence of PCS included amino acids 52(Q),55(P),56(A),57(F), 58(C),103(G),104(I),151(S),163(G),165(F),206(D), 213(R). The common amino acid with conserved sequence in the binding site of the plant was 103Gly. Gene expression indicated that SoPCS has an important role in the response of sugarcane to cadmium with potential use in genetic engineering to remove metal contaminants in the environment. This is the first characterization of a PCS from sugarcane.
Collapse
Affiliation(s)
- Maryam Kolahi
- Department of Biology, Faculty of Science, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
| | - Milad Yazdi
- Department of Genetic, Faculty of Science, Shahid Chamran University of Ahvaz, Ahvaz, Iran
| | | | - Mohammad Reza Tabandeh
- Department of Biochemistry and Molecular Biology, Faculty of Veterinary Medicine, Shahid Chamran University of Ahvaz, Ahvaz, Iran
| |
Collapse
|
27
|
Khan YD, Rasool N, Hussain W, Khan SA, Chou KC. iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC. Mol Biol Rep 2018; 45:2501-2509. [DOI: 10.1007/s11033-018-4417-z] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 10/01/2018] [Indexed: 10/28/2022]
|
28
|
Hoernstein SNW, Fode B, Wiedemann G, Lang D, Niederkrüger H, Berg B, Schaaf A, Frischmuth T, Schlosser A, Decker EL, Reski R. Host Cell Proteome of Physcomitrella patens Harbors Proteases and Protease Inhibitors under Bioproduction Conditions. J Proteome Res 2018; 17:3749-3760. [PMID: 30226384 DOI: 10.1021/acs.jproteome.8b00423] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Host cell proteins are inevitable contaminants of biopharmaceuticals. Here, we performed detailed analyses of the host cell proteome of moss ( Physcomitrella patens) bioreactor supernatants using mass spectrometry and subsequent bioinformatics analysis. Distinguishing between the apparent secretome and intracellular contaminants, a complex extracellular proteolytic network including subtilisin-like proteases, metallo-proteases, and aspartic proteases was identified. Knockout of a subtilisin-like protease affected the overall extracellular proteolytic activity. Besides proteases, also secreted protease-inhibiting proteins such as serpins were identified. Further, we confirmed predicted cleavage sites of 40 endogenous signal peptides employing an N-terminomics approach. The present data provide novel aspects to optimize both product stability of recombinant biopharmaceuticals as well as their maturation along the secretory pathway. Data are available via ProteomeXchange with identifier PXD009517.
Collapse
Affiliation(s)
- Sebastian N W Hoernstein
- Plant Biotechnology, Faculty of Biology , University of Freiburg , Schaenzlestrasse 1 , D-79104 Freiburg , Germany
| | - Benjamin Fode
- Greenovation Biotech GmbH , Hans-Bunte-Strasse 19 , D-79108 Freiburg , Germany
| | - Gertrud Wiedemann
- Plant Biotechnology, Faculty of Biology , University of Freiburg , Schaenzlestrasse 1 , D-79104 Freiburg , Germany
| | - Daniel Lang
- Plant Biotechnology, Faculty of Biology , University of Freiburg , Schaenzlestrasse 1 , D-79104 Freiburg , Germany.,Plant Genome and System Biology , Helmholtz Center Munich , D-85764 Neuherberg , Germany
| | - Holger Niederkrüger
- Greenovation Biotech GmbH , Hans-Bunte-Strasse 19 , D-79108 Freiburg , Germany
| | - Birgit Berg
- Greenovation Biotech GmbH , Hans-Bunte-Strasse 19 , D-79108 Freiburg , Germany
| | - Andreas Schaaf
- Greenovation Biotech GmbH , Hans-Bunte-Strasse 19 , D-79108 Freiburg , Germany
| | - Thomas Frischmuth
- Greenovation Biotech GmbH , Hans-Bunte-Strasse 19 , D-79108 Freiburg , Germany
| | - Andreas Schlosser
- Rudolf-Virchow-Center for Experimental Biomedicine , University of Wuerzburg , D-97080 Wuerzburg , Germany
| | - Eva L Decker
- Plant Biotechnology, Faculty of Biology , University of Freiburg , Schaenzlestrasse 1 , D-79104 Freiburg , Germany
| | - Ralf Reski
- Plant Biotechnology, Faculty of Biology , University of Freiburg , Schaenzlestrasse 1 , D-79104 Freiburg , Germany.,BIOSS - Centre for Biological Signalling Studies , University of Freiburg , D-79104 Freiburg , Germany
| |
Collapse
|
29
|
Qiu WR, Sun BQ, Xiao X, Xu ZC, Jia JH, Chou KC. iKcr-PseEns: Identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier. Genomics 2018; 110:239-246. [DOI: 10.1016/j.ygeno.2017.10.008] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 10/23/2017] [Accepted: 10/25/2017] [Indexed: 01/23/2023]
|
30
|
Khan YD, Rasool N, Hussain W, Khan SA, Chou KC. iPhosT-PseAAC: Identify phosphothreonine sites by incorporating sequence statistical moments into PseAAC. Anal Biochem 2018; 550:109-116. [DOI: 10.1016/j.ab.2018.04.021] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 04/19/2018] [Accepted: 04/21/2018] [Indexed: 01/29/2023]
|
31
|
Qiu WR, Sun BQ, Xiao X, Xu ZC, Chou KC. iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC. Oncotarget 2018; 7:44310-44321. [PMID: 27322424 PMCID: PMC5190098 DOI: 10.18632/oncotarget.10027] [Citation(s) in RCA: 141] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Accepted: 05/29/2016] [Indexed: 12/30/2022] Open
Abstract
Protein hydroxylation is a posttranslational modification (PTM), in which a CH group in Pro (P) or Lys (K) residue has been converted into a COH group, or a hydroxyl group (−OH) is converted into an organic compound. Closely associated with cellular signaling activities, this type of PTM is also involved in some major diseases, such as stomach cancer and lung cancer. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of P or K, which ones can be hydroxylated, and which ones cannot? With the explosive growth of protein sequences in the post-genomic age, the problem has become even more urgent. To address such a problem, we have developed a predictor called iHyd-PseCp by incorporating the sequence-coupled information into the general pseudo amino acid composition (PseAAC) and introducing the “Random Forest” algorithm to operate the calculation. Rigorous jackknife tests indicated that the new predictor remarkably outperformed the existing state-of-the-art prediction method for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for iHyd-PseCp has been established at http://www.jci-bioinfo.cn/iHyd-PseCp, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved.
Collapse
Affiliation(s)
- Wang-Ren Qiu
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.,Department of Computer Science and Bond Life Science Center, University of Missouri, Columbia, MO, USA
| | - Bi-Qian Sun
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Xuan Xiao
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.,Gordon Life Science Institute, Boston, MA, USA
| | - Zhao-Chun Xu
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA, USA.,Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia.,Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| |
Collapse
|
32
|
Chen W, Feng P, Yang H, Ding H, Lin H, Chou KC. iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences. Oncotarget 2018; 8:4208-4217. [PMID: 27926534 PMCID: PMC5354824 DOI: 10.18632/oncotarget.13758] [Citation(s) in RCA: 199] [Impact Index Per Article: 33.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 11/23/2016] [Indexed: 01/14/2023] Open
Abstract
Catalyzed by adenosine deaminase (ADAR), the adenosine to inosine (A-to-I) editing in RNA is not only involved in various important biological processes, but also closely associated with a series of major diseases. Therefore, knowledge about the A-to-I editing sites in RNA is crucially important for both basic research and drug development. Given an uncharacterized RNA sequence that contains many adenosine (A) residues, can we identify which one of them can be of A-to-I editing, and which one cannot? Unfortunately, so far no computational method whatsoever has been developed to address such an important problem based on the RNA sequence information alone. To fill this empty area, we have proposed a predictor called iRNA-AI by incorporating the chemical properties of nucleotides and their sliding occurrence density distribution along a RNA sequence into the general form of pseudo nucleotide composition (PseKNC). It has been shown by the rigorous jackknife test and independent dataset test that the performance of the proposed predictor is quite promising. For the convenience of most experimental scientists, a user-friendly web-server for iRNA-AI has been established at http://lin.uestc.edu.cn/server/iRNA-AI/, by which users can easily get their desired results without the need to go through the mathematical details.
Collapse
Affiliation(s)
- Wei Chen
- Department of Physics, School of Sciences, and Center for Genomics and Computational Biology, North China University of Science and Technology, Tangshan, Tangshan, China.,Gordon Life Science Institute, Belmont, Massachusetts, United States of America
| | - Pengmian Feng
- Hebei Province Key Laboratory of Occupational Health and Safety for Coal Industry, School of Public Health, North China University of Science and Technology, Tangshan, China
| | - Hui Yang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hui Ding
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Gordon Life Science Institute, Belmont, Massachusetts, United States of America
| | - Kuo-Chen Chou
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Gordon Life Science Institute, Belmont, Massachusetts, United States of America
| |
Collapse
|
33
|
Qiu WR, Xiao X, Xu ZC, Chou KC. iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier. Oncotarget 2018; 7:51270-51283. [PMID: 27323404 PMCID: PMC5239474 DOI: 10.18632/oncotarget.9987] [Citation(s) in RCA: 132] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 05/23/2016] [Indexed: 11/26/2022] Open
Abstract
Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids’ disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved.
Collapse
Affiliation(s)
- Wang-Ren Qiu
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.,Department of Computer Science and Bond Life Science Center, University of Missouri, Columbia, MO, USA
| | - Xuan Xiao
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.,Gordon Life Science Institute, Boston, MA, USA
| | - Zhao-Chun Xu
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA, USA.,Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia.,Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| |
Collapse
|
34
|
A Novel Modeling in Mathematical Biology for Classification of Signal Peptides. Sci Rep 2018; 8:1039. [PMID: 29348418 PMCID: PMC5773712 DOI: 10.1038/s41598-018-19491-y] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 01/02/2018] [Indexed: 11/17/2022] Open
Abstract
The molecular structure of macromolecules in living cells is ambiguous unless we classify them in a scientific manner. Signal peptides are of vital importance in determining the behavior of newly formed proteins towards their destined path in cellular and extracellular location in both eukaryotes and prokaryotes. In the present research work, a novel method is offered to foreknow the behavior of signal peptides and determine their cleavage site. The proposed model employs neural networks using isolated sets of prokaryote and eukaryote primary sequences. Protein sequences are classified as secretory or non-secretory in order to investigate secretory proteins and their signal peptides. In comparison with the previous prediction tools, the proposed algorithm is more rigorous, well-organized, significantly appropriate and highly accurate for the examination of signal peptides even in extensive collection of protein sequences.
Collapse
|
35
|
Liu B, Yang F, Huang DS, Chou KC. iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC. Bioinformatics 2017; 34:33-40. [DOI: 10.1093/bioinformatics/btx579] [Citation(s) in RCA: 235] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 09/13/2017] [Indexed: 12/30/2022] Open
Affiliation(s)
- Bin Liu
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China
- The Gordon Life Science Institute, Boston, MA, USA
| | - Fan Yang
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China
| | - De-Shuang Huang
- Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Tongji University, Shanghai, China
| | - Kuo-Chen Chou
- The Gordon Life Science Institute, Boston, MA, USA
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
- Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
36
|
Cao X, Gulati M, Jiang H. Serine protease-related proteins in the malaria mosquito, Anopheles gambiae. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2017; 88:48-62. [PMID: 28780069 PMCID: PMC5586530 DOI: 10.1016/j.ibmb.2017.07.008] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Revised: 07/25/2017] [Accepted: 07/29/2017] [Indexed: 05/27/2023]
Abstract
Insect serine proteases (SPs) and serine protease homologs (SPHs) participate in digestion, defense, development, and other physiological processes. In mosquitoes, some clip-domain SPs and SPHs (i.e. CLIPs) have been investigated for possible roles in antiparasitic responses. In a recent test aimed at improving quality of gene models in the Anopheles gambiae genome using RNA-seq data, we observed various discrepancies between gene models in AgamP4.5 and corresponding sequences selected from those modeled by Cufflinks, Trinity and Bridger. Here we report a comparative analysis of the 337 SP-related proteins in A. gambiae by examining their domain structures, sequence diversity, chromosomal locations, and expression patterns. One hundred and ten CLIPs contain 1 to 5 clip domains in addition to their protease domains (PDs) or non-catalytic, protease-like domains (PLDs). They are divided into five subgroups: CLIPAs (22) are clip1-5-PLD; CLIPBs (29), CLIPCs (12) and CLIPDs (14) are mainly clip-PD; most CLIPEs (33) have a domain structure of PD/PLD-PLD-clip-PLD0-1. While expression of the CLIP genes in group-1 is generally low and detected in various tissue- and stage-specific RNA-seq libraries, some putative GPs/GPHs (i.e. single domain gut SPs/SPHs) in group-2 are highly expressed in midgut, whole larva or whole adult libraries. In comparison, 46 SPs, 26 SPHs, and 37 multi-domain SPs/SPHs (i.e. PD/PLD-PLD≥1) in group-3 do not seem to be specifically expressed in digestive tract. There are 16 SPs and 2 SPH containing other types of putative regulatory domains (e.g. LDLa, CUB, Gd). Of the 337 SP and SPH genes, 159 were sorted into 46 groups (2-8 members/group) based on similar phylogenetic tree position, chromosomal location, and expression profile. This information and analysis, including improved gene models and protein sequences, constitute a solid foundation for functional analysis of the SP-related proteins in A. gambiae.
Collapse
Affiliation(s)
- Xiaolong Cao
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Mansi Gulati
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Haobo Jiang
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA.
| |
Collapse
|
37
|
He X, Cao X, He Y, Bhattarai K, Rogers J, Hartson S, Jiang H. Hemolymph proteins of Anopheles gambiae larvae infected by Escherichia coli. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2017; 74:110-124. [PMID: 28431895 PMCID: PMC5531190 DOI: 10.1016/j.dci.2017.04.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Revised: 04/12/2017] [Accepted: 04/13/2017] [Indexed: 06/07/2023]
Abstract
Anopheles gambiae is a major vector of human malaria and its immune system in part determines the fate of ingested parasites. Proteins, hemocytes and fat body in hemolymph are critical components of this system, mediating both humoral and cellular defenses. Here we assessed differences in the hemolymph proteomes of water- and E. coli-pricked mosquito larvae by a gel-LC-MS approach. Among the 1756 proteins identified, 603 contained a signal peptide but accounted for two-third of the total protein amount on the quantitative basis. The sequence homology search indicated that 233 of the 1756 may be related to defense. In general, we did not detect substantial differences between the control and induced plasma samples in terms of protein numbers or levels. Protein distributions in the gel slices suggested post-translational modifications (e.g. proteolysis) and formation of serpin-protease complexes and high Mr immune complexes. Based on the twenty-five most abundant proteins, we further suggest that major functions of the larval hemolymph are storage, transport, and immunity. In summary, this study provided first data on constitution, levels, and possible functions of hemolymph proteins in the mosquito larvae, reflecting complex changes occurring in the fight against E. coli infection.
Collapse
Affiliation(s)
- Xuesong He
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Xiaolong Cao
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA; Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Yan He
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Krishna Bhattarai
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA; Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Janet Rogers
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Steve Hartson
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Haobo Jiang
- Department of Entomology and Plant Pathology, Oklahoma State University, Stillwater, OK 74078, USA; Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA.
| |
Collapse
|
38
|
Wang MY, Liu XF, Xia J, Li Y, Geng JL, Hu CJ. Vibrio vulnificus VvhA induces Th1 and Tfh cells to proliferate against Vibrio vulnificus in a mouse model of infection. Future Microbiol 2017; 12:953-965. [PMID: 28816516 DOI: 10.2217/fmb-2017-0040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
AIM To characterize the roles of VvhA in host's acquired immune response to Vibrio vulnificus infection. MATERIALS & METHODS The recombinant VvhA fusion protein was used to immunize mice and the anti-VvhA polyclonal antibody was produced in vivo for prophylactic and therapeutic efficacy assay. The roles of VvhA in T helper (Th) cells differentiation were analyzed by vvhA-deleted mutant during the early phase of infection, while the ratio of Th2 and T follicular helper (Tfh) cells were examined in VvhA immunization. RESULTS Anti-VvhA antibody exhibited neutralization activity against V. vulnificus. Wild-type strain induced higher level of Th1 cells than the mutant, and the concentrations of IgG2a and IFN-γ were increased during the early phase of infection. The spontaneous development of Tfh was observed in immunized model, and the serum IL-21 was increased. CONCLUSION V. vulnificus VvhA elicited cellular and humoral immune responses by Th1 and Tfh cells to provide protection against VvhA.
Collapse
Affiliation(s)
- Ming-Yi Wang
- Department of Laboratory Medicine, General Hospital of Jinan Military Region of PLA, Jinan, Shandong, 250031, PR China.,Department of Central Lab, Weihai Municipal Hospital Affiliated to Dalian Medical University, Weihai, Shandong, 264200, PR China
| | - Xiao-Fei Liu
- Department of Laboratory Medicine, General Hospital of Jinan Military Region of PLA, Jinan, Shandong, 250031, PR China
| | - Jun Xia
- Department of Medical Administration, Weihai Health & Family Planning Commission, Weihai, Shandong, 264200, PR China
| | - Yu Li
- Department of Central Lab, Weihai Municipal Hospital Affiliated to Dalian Medical University, Weihai, Shandong, 264200, PR China
| | - Jian-Li Geng
- Department of Central Lab, Weihai Municipal Hospital Affiliated to Dalian Medical University, Weihai, Shandong, 264200, PR China
| | - Cheng-Jin Hu
- Department of Laboratory Medicine, General Hospital of Jinan Military Region of PLA, Jinan, Shandong, 250031, PR China
| |
Collapse
|
39
|
Li G, Niu H, Zhang Y, Li Y, Xie F, Langford PR, Liu S, Wang C. Haemophilus parasuis cytolethal distending toxin induces cell cycle arrest and p53-dependent apoptosis. PLoS One 2017; 12:e0177199. [PMID: 28545143 PMCID: PMC5436662 DOI: 10.1371/journal.pone.0177199] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Accepted: 04/24/2017] [Indexed: 01/20/2023] Open
Abstract
Haemophilus parasuis is the causative agent of Glasser’s disease in pigs. Cytolethal distending toxin (CDT) is an important virulence factor of H. parasuis. It is composed of three subunits: CdtA, CdtB and CdtC and all were successfully expressed in soluble form in Escherichia coli when the signal peptides were removed. Purified CdtB had DNase activity, i.e. caused DNA double strand damage, in vitro and in vivo prior to cell arrest and apoptosis. Flow cytometry analysis showed CdtB alone could induce cell cycle arrest and apoptosis in PK-15 porcine kidney and pulmonary alveolar macrophage (PAM) cells, which could be enhanced by CdtA or/and CdtC. CDT holotoxin could lead to significant cell distension, G2 arrest and apoptotic death in PK-15 and PAM cells. The apoptosis induced by CDT holotoxin was significantly inhibited by pifithrin-α, which indicates that it is p53-dependent. The results suggest that H. parasuis CDT holotoxin is a major virulence factor.
Collapse
Affiliation(s)
- Gang Li
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Hui Niu
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Yanhe Zhang
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Yanling Li
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Fang Xie
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Paul R. Langford
- Section of Paediatrics, Department of Medicine, Imperial College London, St. Mary’s Campus, London, United Kingdom
| | - Siguo Liu
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
| | - Chunlai Wang
- State Key Laboratory of Veterinary Biotechnology, Division of Bacterial Diseases, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China
- * E-mail:
| |
Collapse
|
40
|
Transcriptome Analysis of Mango (Mangifera indica L.) Fruit Epidermal Peel to Identify Putative Cuticle-Associated Genes. Sci Rep 2017; 7:46163. [PMID: 28425468 PMCID: PMC5397871 DOI: 10.1038/srep46163] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2016] [Accepted: 03/09/2017] [Indexed: 11/10/2022] Open
Abstract
Mango fruit (Mangifera indica L.) are highly perishable and have a limited shelf life, due to postharvest desiccation and senescence, which limits their global distribution. Recent studies of tomato fruit suggest that these traits are influenced by the expression of genes that are associated with cuticle metabolism. However, studies of these phenomena in mango fruit are limited by the lack of genome-scale data. In order to gain insight into the mango cuticle biogenesis and identify putative cuticle-associated genes, we analyzed the transcriptomes of peels from ripe and overripe mango fruit using RNA-Seq. Approximately 400 million reads were generated and de novo assembled into 107,744 unigenes, with a mean length of 1,717 bp and with this information an online Mango RNA-Seq Database (http://bioinfo.bti.cornell.edu/cgi-bin/mango/index.cgi) which is a valuable genomic resource for molecular research into the biology of mango fruit was created. RNA-Seq analysis suggested that the pathway leading to biosynthesis of the cuticle component, cutin, is up-regulated during overripening. This data was supported by analysis of the expression of several putative cuticle-associated genes and by gravimetric and microscopic studies of cuticle deposition, revealing a complex continuous pattern of cuticle deposition during fruit development and involving substantial accumulation during ripening/overripening.
Collapse
|
41
|
Liu B, Yang F, Chou KC. 2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function. MOLECULAR THERAPY-NUCLEIC ACIDS 2017. [PMID: 28624202 PMCID: PMC5415553 DOI: 10.1016/j.omtn.2017.04.008] [Citation(s) in RCA: 194] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Involved with important cellular or gene functions and implicated with many kinds of cancers, piRNAs, or piwi-interacting RNAs, are of small non-coding RNA with around 19–33 nt in length. Given a small non-coding RNA molecule, can we predict whether it is of piRNA according to its sequence information alone? Furthermore, there are two types of piRNA: one has the function of instructing target mRNA deadenylation, and the other does not. Can we discriminate one from the other? With the avalanche of RNA sequences emerging in the postgenomic age, it is urgent to address the two problems for both basic research and drug development. Unfortunately, to the best of our knowledge, so far no computational methods whatsoever could be used to deal with the second problem, let alone deal with the two problems together. Here, by incorporating the physicochemical properties of nucleotides into the pseudo K-tuple nucleotide composition (PseKNC), we proposed a powerful predictor called 2L-piRNA. It is a two-layer ensemble classifier, in which the first layer is for identifying whether a query RNA molecule is piRNA or non-piRNA, and the second layer for identifying whether a piRNA is with or without the function of instructing target mRNA deadenylation. Rigorous cross-validations have indicated that the success rates achieved by the proposed predictor are quite high. For the convenience of most biologists and drug development scientists, the web server for 2L-piRNA has been established at http://bioinformatics.hitsz.edu.cn/2L-piRNA/, by which users can easily get their desired results without the need to go through the mathematical details.
Collapse
Affiliation(s)
- Bin Liu
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China; Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China; Gordon Life Science Institute, Belmont, MA 02478, USA.
| | - Fan Yang
- School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
| | - Kuo-Chen Chou
- Gordon Life Science Institute, Belmont, MA 02478, USA; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
42
|
Zhang YZ, Shen HB. Signal-3L 2.0: A Hierarchical Mixture Model for Enhancing Protein Signal Peptide Prediction by Incorporating Residue-Domain Cross-Level Features. J Chem Inf Model 2017; 57:988-999. [PMID: 28298081 DOI: 10.1021/acs.jcim.6b00484] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Signal peptides play key roles in targeting and translocation of integral membrane proteins and secretory proteins. However, signal peptides present several challenges for automatic prediction methods. One challenge is that it is difficult to discriminate signal peptides from transmembrane helices, as both the H-region of the peptides and the transmembrane helices are hydrophobic. Another is that it is difficult to identify the cleavage site between signal peptides and mature proteins, as cleavage motifs or patterns are still unclear for most proteins. To solve these problems and further enhance automatic signal peptide recognition, we report a new Signal-3L 2.0 predictor. Our new model is constructed with a hierarchical protocol, where it first determines the existence of a signal peptide. For this, we propose a new residue-domain cross-level feature-driven approach, and we demonstrate that protein functional domain information is particularly useful for discriminating between the transmembrane helices and signal peptides as they perform different functions. Next, in order to accurately identify the unique signal peptide cleavage sites along the sequence, we designed a top-down approach where a subset of potential cleavage sites are screened using statistical learning rules, and then a final unique site is selected according to its evolution conservation score. Because this mixed approach utilizes both statistical learning and evolution analysis, it shows a strong capacity for recognizing cleavage sites. Signal-3L 2.0 has been benchmarked on multiple data sets, and the experimental results have demonstrated its accuracy. The online server is available at www.csbio.sjtu.edu.cn/bioinf/Signal-3L/ .
Collapse
Affiliation(s)
- Yi-Ze Zhang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University , Shanghai, 200240, China.,Key Laboratory of System Control and Information Processing, Ministry of Education of China , Shanghai, 200240, China
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University , Shanghai, 200240, China.,Key Laboratory of System Control and Information Processing, Ministry of Education of China , Shanghai, 200240, China
| |
Collapse
|
43
|
Bezerra MA, Pereira LM, Bononi A, Biella CA, Baroni L, Pollo-Oliveira L, Yatsuda AP. Constitutive expression and characterization of a surface SRS (NcSRS67) protein of Neospora caninum with no orthologue in Toxoplasma gondii. Parasitol Int 2017; 66:173-180. [PMID: 28108401 DOI: 10.1016/j.parint.2017.01.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Revised: 12/22/2016] [Accepted: 01/17/2017] [Indexed: 02/06/2023]
Abstract
Neospora caninum is a parasite of the Apicomplexa phylum responsible for abortion and losses of fertility in cattle. As part of its intracellular cycle, the first interaction of the parasite with the target cell is performed with the surface proteins known as the SRS superfamily (Surface Antigen Glycoprotein - Related Sequences). SAG related or SRS proteins have been a target of intense research due to its immunodominant pattern, exhibiting potential as diagnostic and/or vaccine candidates. The aim of this study was the cloning, expression and characterization of the gene NcSRS67 of N. caninum using a novel designed plasmid. The coding sequence of NcSRS67 (without the signal peptide and the GPI anchor) was cloned and expressed constitutively instead of the ccdB system of pCR-Blunt II-TOPO. The protein was purified in a nickel sepharose column and identified by mass spectrometry (MS/MS). The constitutive expression did not affect the final bacterial growth, with a similar OD 600nm compared to the non-transformed strains. The recombinant NcSRS67 was over expressed and the native form was detected by the anti-rNcSRS67 serum on 1D western blot as a single band of approximately 38kDa as predicted. On an in vitro assay, the inhibitory effect of the polyclonal antiserum anti-rNcSRS67 was nearly 20% on adhesion/invasion of host cells. The NcSRS67 native protein was localised on part of the surface of N. caninum tachyzoite when compared to the nucleus by confocal immunofluorescence.
Collapse
Affiliation(s)
- Marcos Alexandre Bezerra
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil
| | - Luiz Miguel Pereira
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil
| | - Aline Bononi
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil
| | - Carla Agostino Biella
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil
| | - Luciana Baroni
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil
| | - Leticia Pollo-Oliveira
- Núcleo de Apoio à Pesquisa em Produtos Naturais e Sintéticos, Universidade de São Paulo, Ribeirão Preto, SP, Brazil
| | - Ana Patrícia Yatsuda
- Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Av do café, s/n, 14040-903 Ribeirão Preto, SP, Brazil; Núcleo de Apoio à Pesquisa em Produtos Naturais e Sintéticos, Universidade de São Paulo, Ribeirão Preto, SP, Brazil.
| |
Collapse
|
44
|
Zambanini T, Hosseinpour Tehrani H, Geiser E, Sonntag CK, Buescher JM, Meurer G, Wierckx N, Blank LM. Metabolic engineering of Ustilago trichophora TZ1 for improved malic acid production. Metab Eng Commun 2017; 4:12-21. [PMID: 29142828 PMCID: PMC5678829 DOI: 10.1016/j.meteno.2017.01.002] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Revised: 12/22/2016] [Accepted: 01/16/2017] [Indexed: 12/30/2022] Open
Abstract
Ustilago trichophora RK089 has been found recently as a good natural malic acid producer from glycerol. This strain has previously undergone adaptive laboratory evolution for enhanced substrate uptake rate resulting in the strain U. trichophora TZ1. Medium optimization and investigation of process parameters enabled titers and rates that are able to compete with those of organisms overexpressing major parts of the underlying metabolic pathways. Metabolic engineering can likely further increase the efficiency of malate production by this organism, provided that basic genetic tools and methods can be established for this rarely used and relatively obscure species. Here we investigate and adapt existing molecular tools from U. maydis for use in U. trichophora. Selection markers from U. maydis that confer carboxin, hygromycin, nourseothricin, and phleomycin resistance are applicable in U. trichophora. A plasmid was constructed containing the ip-locus of U. trichophora RK089, resulting in site-specific integration into the genome. Using this plasmid, overexpression of pyruvate carboxylase, two malate dehydrogenases (mdh1, mdh2), and two malate transporters (ssu1, ssu2) was possible in U. trichophora TZ1 under control of the strong Petef promoter. Overexpression of mdh1, mdh2, ssu1, and ssu2 increased the product (malate) to substrate (glycerol) yield by up to 54% in shake flasks reaching a titer of up to 120 g L−1. In bioreactor cultivations of U. trichophora TZ1 Petefssu2 and U. trichophora TZ1 Petefmdh2 a drastically lowered biomass formation and glycerol uptake rate resulted in 29% (Ssu1) and 38% (Mdh2) higher specific production rates and 38% (Ssu1) and 46% (Mdh2) increased yields compared to the reference strain U. trichophora TZ1. Investigation of the product spectrum resulted in an 87% closed carbon balance with 134 g L−1 malate and biomass (73 g L−1), succinate (20 g L−1), CO2 (17 g L−1), and α-ketoglutarate (8 g L−1) as main by-products. These results open up a wide range of possibilities for further optimization, especially combinatorial metabolic engineering to increase the flux from pyruvate to malic acid and to reduce by-product formation. Metabolic engineering tools established for U. trichophora Potentially carbon neutral process through CO2 co-metabolism Optimized malic acid production from glycerol by overexpression of rTCA cycle Specific production rate and yield increased up to 38% and 46%, respectively
Collapse
Affiliation(s)
- Thiemo Zambanini
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| | - Hamed Hosseinpour Tehrani
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| | - Elena Geiser
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| | - Christiane K Sonntag
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| | - Joerg M Buescher
- BRAIN AG, Darmstädter Straße 34-36, D-64673 Zwingenberg, Germany.,MPI Immunobiology and Epigenetics, Stübeweg 51, D-79108 Freiburg, Germany
| | - Guido Meurer
- BRAIN AG, Darmstädter Straße 34-36, D-64673 Zwingenberg, Germany
| | - Nick Wierckx
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| | - Lars M Blank
- iAMB - Institute of Applied Microbiology, ABBt - Aachen Biology and Biotechnology, RWTH Aachen University, Worringerweg 1, D-52074 Aachen, Germany
| |
Collapse
|
45
|
Köbis JM, Rebl H, Goldammer T, Rebl A. Multiple gene and transcript variants encoding trout C-polysaccharide binding proteins are differentially but strongly induced after infection with Aeromonas salmonicida. FISH & SHELLFISH IMMUNOLOGY 2017; 60:509-519. [PMID: 27836722 DOI: 10.1016/j.fsi.2016.11.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 11/03/2016] [Accepted: 11/07/2016] [Indexed: 06/06/2023]
Abstract
Two 'trout C-polysaccharide-binding proteins,' TCBP1 and -2, with relevance to early inflammatory events have been discovered in the last century. The present study characterises the respective cDNA sequences from rainbow trout (Oncorhynchus mykiss), including multiple TCBP1 transcript variants. These variants are generated either by the use of alternative splice sites or the exclusion of exons. The longest mRNA isoform, TCBP1-1, encodes a 245-aa protein with a large signal peptide and a complement component C1q domain. The shortest mRNA isoform, TCBP1-5, contains a premature termination codon and hence fails to encode a functional factor. The 224-aa-long TCBP2 protein consists of a comparably shorter signal peptide and a pentraxin domain. Evolutionary analyses clearly separated TCBP1 and -2 because of distinctive protein motifs. Expression profiling in the liver, spleen, and head kidney tissues of healthy trout revealed that TCBP2 mRNA concentrations were higher than the concentrations of all five TCBP1 mRNA isoforms together. The hepatic levels of these TCBP1 variants increased significantly upon infection with Aeromonas salmonicida, whereas TCBP2 transcript levels rose moderately. As the biological function of TCBP1 is barely understood, we tagged this factor with the green fluorescent protein and visualised its expression in HEK-293 cells. Overexpression of TCBP1 increased the level of active NF-κB factors and induced cell death, indicating its involvement in proapoptotic NF-κB-dependent signalling routes.
Collapse
Affiliation(s)
- Judith M Köbis
- Leibniz Institute for Farm Animal Biology (FBN), Institute for Genome Biology, Wilhelm-Stahl-Allee 2, 18196 Dummerstorf, Germany
| | - Henrike Rebl
- Rostock University Medical Center, Department of Cell Biology, Schillingallee 69, 18057 Rostock, Germany
| | - Tom Goldammer
- Leibniz Institute for Farm Animal Biology (FBN), Institute for Genome Biology, Wilhelm-Stahl-Allee 2, 18196 Dummerstorf, Germany
| | - Alexander Rebl
- Leibniz Institute for Farm Animal Biology (FBN), Institute for Genome Biology, Wilhelm-Stahl-Allee 2, 18196 Dummerstorf, Germany.
| |
Collapse
|
46
|
Prasad S, Xu J, Zhang Y, Wang N. SEC-Translocon Dependent Extracytoplasmic Proteins of Candidatus Liberibacter asiaticus. Front Microbiol 2016; 7:1989. [PMID: 28066334 PMCID: PMC5167687 DOI: 10.3389/fmicb.2016.01989] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 11/28/2016] [Indexed: 11/30/2022] Open
Abstract
Citrus Huanglongbing (HLB) is the most destructive citrus disease worldwide. HLB is associated with three species of the phloem-limited, gram-negative, fastidious α-proteobacteria: Candidatus Liberibacter asiaticus (Las), Ca. L. americanus (Lam), and Ca. L. africanus (Laf) with Las being the most widespread species. Las has not been cultured in artificial media, which has greatly hampered our efforts to understand its virulence mechanisms. Las contains a complete Sec-translocon, which has been suggested to transport Las proteins including virulence factors into the extracytoplasmic milieu. In this study, we characterized the Sec-translocon dependent, signal peptide containing extracytoplasmic proteins of Las. A total of 166 proteins of Las-psy62 strain were predicted to contain signal peptides targeting them out of the cell cytoplasm via the Sec-translocon using LipoP, SigalP 3.0, SignalP 4.1, and Phobius. We also predicated SP containing extracytoplasmic proteins for Las-gxpsy and Las-Ishi-1, Lam, Laf, Ca. L. solanacearum (Lso), and L. crescens (Lcr). For experimental validation of the predicted extracytoplasmic proteins, Escherichia coli based alkaline phosphatase (PhoA) gene fusion assays were conducted. A total of 86 out of the 166 predicted Las proteins were experimentally validated to contain signal peptides. Additionally, Las-psy62 lepB (CLIBASIA_04190), the gene encodes signal peptidase I, was able to partially complement the amber mutant of lepB of E. coli. This work will contribute to the identification of Sec-translocon dependent effector proteins of Las, which might be involved in virulence of Las.
Collapse
Affiliation(s)
| | | | | | - Nian Wang
- Citrus Research and Education Center, Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Lake AlfredFL, USA
| |
Collapse
|
47
|
Pierce SK, Mahadevan P, Massey SE, Middlebrooks ML. A Preliminary Molecular and Phylogenetic Analysis of the Genome of a Novel Endogenous Retrovirus in the Sea Slug Elysia chlorotica. THE BIOLOGICAL BULLETIN 2016; 231:236-244. [PMID: 28048954 DOI: 10.1086/691071] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
An endogenous retrovirus that is present in the sea slug Elysia chlorotica is expressed in all individuals at the end of the annual life cycle. But the precise role of the virus, if any, in slug senescence or death is unknown. We have determined the genomic sequence of the virus and performed a phylogenetic analysis of the data. The 6060-base pair genome of the virus possesses a reverse transcriptase-domain-containing protein that shows similarity to retrotransposon sequences found in Aplysia californica and Strongylocentrotus purpuratus. However, nucleotide BLAST analysis of the whole genome resulted in hits to only a few portions of the genome, indicating that the Elysia chlorotica retrovirus is novel, has not been previously sequenced, and does not have great genetic similarity to other known viral species. When more invertebrate retroviral genomes are examined, a more precise phylogenetic placement of the Elysia chlorotica retrovirus can be determined.
Collapse
|
48
|
Endogenous signal peptides in recombinant protein production by Pichia pastoris: From in-silico analysis to fermentation. J Theor Biol 2016; 408:22-33. [DOI: 10.1016/j.jtbi.2016.07.039] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2016] [Revised: 07/15/2016] [Accepted: 07/24/2016] [Indexed: 11/20/2022]
|
49
|
Kumar S, Gu X, Kim Y. A viral histone H4 suppresses insect insulin signal and delays host development. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2016; 63:66-77. [PMID: 27216029 DOI: 10.1016/j.dci.2016.05.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2016] [Revised: 05/18/2016] [Accepted: 05/18/2016] [Indexed: 06/05/2023]
Abstract
Parasitization by an endoparasitoid wasp, Cotesia plutellae, alters host development of Plutella xylostella by extending larval period and preventing metamorphosis. Insulin signal plays a crucial role in mediating insect development and controlling blood sugar level in insects. In this study, three insulin-like peptide genes (PxILP1-3) were predicted from the genome of P. xylostella. However, only PxILP1 was confirmed to be expressed in P. xylostella. Starvation suppressed the expression level of PxILP1 and up-regulated plasma trehalose level. RNA interference against PxILP1 mimicked starvation effect and extended the larval period of P. xylostella. Parasitized larvae exhibited significantly lower levels of PxILP1 expression compared to nonparasitized larvae. Injection of wasp-symbiotic polydnavirus C. plutellae bracovirus (CpBV) also suppressed PxILP1 expression and extended the larval period. Injection of a viral segment (CpBV-S30) containing a viral histone H4 (CpBV-H4) also suppressed PxILP1 expression. Co-injection of CpBV-S30 and double-stranded RNA (dsCpBV-H4) specific to CpBV-H4 rescued the suppression of PxILP1 expression. Injection of CpBV-S30 significantly extended larval development. Co-injection of CpBV-S30 with dsCpBV-H4 rescued the delay of larval development. Injection of a bovine insulin to parasitized larvae prevented parasitoid development. These results indicate that parasitism of C. plutellae can down-regulate host insulin signaling with the help of parasitic factor CpBV-H4.
Collapse
Affiliation(s)
- Sunil Kumar
- Department of Bioresource Sciences, Andong National University, Andong 36729, South Korea
| | - Xiaojun Gu
- College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Yonggyun Kim
- Department of Bioresource Sciences, Andong National University, Andong 36729, South Korea.
| |
Collapse
|
50
|
Qiu WR, Zheng QS, Sun BQ, Xiao X. Multi-iPPseEvo: A Multi-label Classifier for Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into Chou′s General PseAAC via Grey System Theory. Mol Inform 2016; 36. [DOI: 10.1002/minf.201600085] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Accepted: 09/07/2016] [Indexed: 01/19/2023]
Affiliation(s)
- Wang-Ren Qiu
- Computer Department; Jingdezhen Ceramic Institute; Jingdezhen 333403 China
- Department of Computer Science; University of Missouri; Columbia, MO USA
- Bond Life Science Center; University of Missouri; Columbia, MO USA
| | - Quan-Shu Zheng
- Computer Department; Jingdezhen Ceramic Institute; Jingdezhen 333403 China
| | - Bi-Qian Sun
- Computer Department; Jingdezhen Ceramic Institute; Jingdezhen 333403 China
| | - Xuan Xiao
- Computer Department; Jingdezhen Ceramic Institute; Jingdezhen 333403 China
- Gordon Life Science Institute; Boston, Massachusetts 02478 United States of America
| |
Collapse
|