1
|
Gao S, Huan F, Wu MX, Ni LN, Gu Y, Liu YX, Han TJ, Liu M, Lai D, Liu GM. Mutation of Disulfide Bond Sites Reduces the Immunoreactivity of Cra a 4 by Changing the Structural Characteristics. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024. [PMID: 38598840 DOI: 10.1021/acs.jafc.4c01120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Sarcoplasmic calcium-binding protein (Cra a 4) from Crassostrea angulata belongs to the EF-hand superfamily, and understanding of its structure-allergenicity relationship is still insufficient. In this study, chemical denaturants were used to destroy the structure of Cra a 4, showing that disruption of the structure reduced its IgG-/IgE-binding activity. To explore which critical amino acid site affects the allergenicity of Cra a 4, the mutants were obtained by site-directed mutations in the disulfide bonds site (C97), conformational epitopes (I105, D114), or Ca2+-binding region (D106, D110) and their IgG-/IgE-binding activity was reduced significantly using serological tests. Notably, C97A had the lowest immunoreactivity. In addition, two conformational epitopes of Cra 4 were verified. Meanwhile, the increase of the α-helical content, surface hydrophobicity, and surface electrostatic potential of C97A affected its allergenicity. Overall, the understanding of the structure-allergenicity relationship of Cra a 4 allowed the development of a hypoallergenic mutant.
Collapse
Affiliation(s)
- Shuai Gao
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Fei Huan
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Ming-Xuan Wu
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Ling-Na Ni
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Yi Gu
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Ya-Xin Liu
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Tian-Jiao Han
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Meng Liu
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| | - Dong Lai
- The Second Affiliated Hospital of Xiamen Medical College, Xiamen 361021, Fujian, China
| | - Guang-Ming Liu
- College of Ocean Food and Biological Engineering, Xiamen Key Laboratory of Marine Functional Food, Fujian Provincial Engineering Technology Research Center of Marine Functional Food, Jimei University, Xiamen 361021, Fujian, China
| |
Collapse
|
2
|
Mufassirin MMM, Newton MAH, Sattar A. Artificial intelligence for template-free protein structure prediction: a comprehensive review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10350-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
3
|
Mishra A, Kabir MWU, Hoque MT. diSBPred: A machine learning based approach for disulfide bond prediction. Comput Biol Chem 2021; 91:107436. [PMID: 33550156 DOI: 10.1016/j.compbiolchem.2021.107436] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 12/28/2020] [Accepted: 01/10/2021] [Indexed: 12/25/2022]
Abstract
The protein disulfide bond is a covalent bond that forms during post-translational modification by the oxidation of a pair of cysteines. In protein, the disulfide bond is the most frequent covalent link between amino acids after the peptide bond. It plays a significant role in three-dimensional (3D) ab initio protein structure prediction (aiPSP), stabilizing protein conformation, post-translational modification, and protein folding. In aiPSP, the location of disulfide bonds can strongly reduce the conformational space searching by imposing geometrical constraints. Existing experimental techniques for the determination of disulfide bonds are time-consuming and expensive. Thus, developing sequence-based computational methods for disulfide bond prediction becomes indispensable. This study proposed a stacking-based machine learning approach for disulfide bond prediction (diSBPred). Various useful sequence and structure-based features are extracted for effective training, including conservation profile, residue solvent accessibility, torsion angle flexibility, disorder probability, a sequential distance between cysteines, and more. The prediction of disulfide bonds is carried out in two stages: first, individual cysteines are predicted as either bonding or non-bonding; second, the cysteine-pairs are predicted as either bonding or non-bonding by including the results from cysteine bonding prediction as a feature. The examination of the relevance of the features employed in this study and the features utilized in the existing nearest neighbor algorithm (NNA) method shows that the features used in this study improve about 7.39 % in jackknife validation balanced accuracy. Moreover, for individual cysteine bonding prediction and cysteine-pair bonding prediction, diSBPred provides a 10-fold cross-validation balanced accuracy of 82.29 % and 94.20 %, respectively. Altogether, our predictor achieves an improvement of 43.25 % based on balanced accuracy compared to the existing NNA based approach. Thus, diSBPred can be utilized to annotate the cysteine bonding residues of protein sequences whose structures are unknown as well as improve the accuracy of the aiPSP method, which can further aid in experimental studies of the disulfide bond and structure determination.
Collapse
Affiliation(s)
- Avdesh Mishra
- Department of Electrical Engineering and Computer Science, Texas A&M University-Kingsville, Kingsville, TX, USA
| | - Md Wasi Ul Kabir
- Department of Computer Science, University of New Orleans, New Orleans, LA, USA
| | - Md Tamjidul Hoque
- Department of Computer Science, University of New Orleans, New Orleans, LA, USA.
| |
Collapse
|
4
|
Franzoi M, Sturlese M, Bellanda M, Mammi S. A molecular dynamics strategy for CSαβ peptides disulfide-assisted model refinement. J Biomol Struct Dyn 2017; 35:2736-2744. [PMID: 27581488 DOI: 10.1080/07391102.2016.1231081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Many cysteine-stabilized antimicrobial peptides from a variety of living organisms could be good candidates for the development of anti-infective agents. In the absence of experimentally obtained structural data, peptide modeling is an essential tool for understanding structure-activity relationships and for optimizing the bioactive moieties. Focusing on cysteine-rich peptide structures, we reproduced the case of structure predictions in the so-called midnight zone. We developed our protocol on a training set derived by clustering the available cysteine-stabilized αβ (CSαβ) structures in nine different representative families and tested it on peptides randomly selected from each family. Starting from draft models, we tested a structure-based disulfide predictor and we used cysteine distances as constraints during molecular dynamics. Finally, we proposed an analysis for final structure selection. Accordingly, we obtained a mean root mean square deviation improvement of 21% for the test set. Our findings demonstrate that it is possible to predict the network of disulfide bridges in cysteine-stabilized peptides and to use this result to improve the accuracy of structural predictions. Finally, we applied the methods to predict the structure of royalisin, a cysteine-rich peptide with unknown structure.
Collapse
Affiliation(s)
- Marco Franzoi
- a Department of Biology , University of Padova , Via Ugo Bassi 58/B, Padova 35131 , Italy
| | - Mattia Sturlese
- b Molecular Modeling Section (MMS), Department of Pharmaceutical and Pharmacological Sciences , University of Padova , Via Marzolo 5, Padova 35131 , Italy
| | - Massimo Bellanda
- c Department of Chemical Sciences , University of Padova , Via Marzolo 1, Padova 35131 , Italy
| | - Stefano Mammi
- c Department of Chemical Sciences , University of Padova , Via Marzolo 1, Padova 35131 , Italy
| |
Collapse
|
5
|
Fedoseev SV, Belikov MY, Ershov OV, Tafeenko VA. Reductive alkylation of disulfides. Synthesis of 2-(alkylsulfanyl)-1H-pyrrole-3-carbonitriles. RUSSIAN JOURNAL OF ORGANIC CHEMISTRY 2017. [DOI: 10.1134/s1070428016120125] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Senthilkumar B, Rajasekaran R. In Silico Template Selection of Short Antimicrobial Peptide Viscotoxin for Improving Its Antimicrobial Efficiency in Development of Potential Therapeutic Drugs. Appl Biochem Biotechnol 2016; 181:898-913. [PMID: 27696138 DOI: 10.1007/s12010-016-2257-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 09/19/2016] [Indexed: 12/22/2022]
Abstract
Rapid increase in antibiotic resistance has posed a worldwide threat, due to increased mortality, morbidity, and expenditure caused by antibiotic-resistant microbes. Recent development of the antimicrobial peptides like viscotoxin (Vt) has been successfully comprehended as a substitute for classical antibiotics. A structurally stable peptide, Vt can enhance antimicrobial property and can be used for various developmental purposes. Thus, structural stability among the antimicrobial peptides, Vt A1 (3C8P), A2 (1JMN), A3 (1ED0), B (1JMP), and C (1ORL) of Viscus album was computationally analyzed. In specific, the static confirmation of VtA3 showed high number of intramolecular interactions, along with an increase in hydrophobicity than others comparatively. Further, conformational sampling was used to analyze various geometrical parameters such as root mean square deviation, root mean square fluctuation, radius of gyration, and ovality which also revealed the structural stability of VtA3. Moreover, the statistically validated contours of surface area, lipophilicity, and distance constraints of disulfide bonds also supported the priority of VtA3 with respect to stability. Finally, the functional activity of peptides was accessed by computing their free energy of membrane association and membrane interactions, which defined VtA3 as functionally stable. Currently, peptide-based antibiotics and nanoparticles have attracted the pharmaceutical industries for their potential therapeutic applications. Thereby, it is proposed that viscotoxin A3 (1ED0) could be used as a preeminent template for scaffolding potentially efficient antimicrobial peptide-based drugs and nanomaterials in future.
Collapse
Affiliation(s)
- B Senthilkumar
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, Tamil Nadu, 632014, India
| | - R Rajasekaran
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, Tamil Nadu, 632014, India.
| |
Collapse
|
7
|
Bhatnagar A, Apostol MI, Bandyopadhyay D. Amino acid function relates to its embedded protein microenvironment: A study on disulfide-bridged cystine. Proteins 2016; 84:1576-1589. [DOI: 10.1002/prot.25101] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Revised: 06/30/2016] [Accepted: 07/03/2016] [Indexed: 11/08/2022]
Affiliation(s)
- Akshay Bhatnagar
- Department of Biological Sciences; Birla Institute of Technology and Science; Hyderabad 500078 India
| | - Marcin I. Apostol
- ADRx. Inc. 515 Marin St., Suite 314, Thousand Oaks; California 91360
| | - Debashree Bandyopadhyay
- Department of Biological Sciences; Birla Institute of Technology and Science; Hyderabad 500078 India
| |
Collapse
|
8
|
Shabelnikov S, Kiselev A. Cysteine-Rich Atrial Secretory Protein from the Snail Achatina achatina: Purification and Structural Characterization. PLoS One 2015; 10:e0138787. [PMID: 26444993 PMCID: PMC4596865 DOI: 10.1371/journal.pone.0138787] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 09/03/2015] [Indexed: 11/28/2022] Open
Abstract
Despite extensive studies of cardiac bioactive peptides and their functions in molluscs, soluble proteins expressed in the heart and secreted into the circulation have not yet been reported. In this study, we describe an 18.1-kDa, cysteine-rich atrial secretory protein (CRASP) isolated from the terrestrial snail Achatina achatina that has no detectable sequence similarity to any known protein or nucleotide sequence. CRASP is an acidic, 158-residue, N-glycosylated protein composed of eight alpha-helical segments stabilized with five disulphide bonds. A combination of fold recognition algorithms and ab initio folding predicted that CRASP adopts an all-alpha, right-handed superhelical fold. CRASP is most strongly expressed in the atrium in secretory atrial granular cells, and substantial amounts of CRASP are released from the heart upon nerve stimulation. CRASP is detected in the haemolymph of intact animals at nanomolar concentrations. CRASP is the first secretory protein expressed in molluscan atrium to be reported. We propose that CRASP is an example of a taxonomically restricted gene that might be responsible for adaptations specific for terrestrial pulmonates.
Collapse
Affiliation(s)
- Sergey Shabelnikov
- Department of Cytology and Histology, Saint-Petersburg State University, St. Petersburg, Russia
- Laboratory of Cell Morphology, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
| | - Artem Kiselev
- Laboratory of Cell Morphology, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
- Institute of Molecular Biology and Genetics, Almazov Federal Medical Research Centre, St. Petersburg, Russia
| |
Collapse
|
9
|
Yang J, He BJ, Jang R, Zhang Y, Shen HB. Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins. Bioinformatics 2015; 31:3773-81. [PMID: 26254435 DOI: 10.1093/bioinformatics/btv459] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 08/02/2015] [Indexed: 01/19/2023] Open
Abstract
MOTIVATION Cysteine-rich proteins cover many important families in nature but there are currently no methods specifically designed for modeling the structure of these proteins. The accuracy of disulfide connectivity pattern prediction, particularly for the proteins of higher-order connections, e.g., >3 bonds, is too low to effectively assist structure assembly simulations. RESULTS We propose a new hierarchical order reduction protocol called Cyscon for disulfide-bonding prediction. The most confident disulfide bonds are first identified and bonding prediction is then focused on the remaining cysteine residues based on SVR training. Compared with purely machine learning-based approaches, Cyscon improved the average accuracy of connectivity pattern prediction by 21.9%. For proteins with more than 5 disulfide bonds, Cyscon improved the accuracy by 585% on the benchmark set of PDBCYS. When applied to 158 non-redundant cysteine-rich proteins, Cyscon predictions helped increase (or decrease) the TM-score (or RMSD) of the ab initio QUARK modeling by 12.1% (or 14.4%). This result demonstrates a new avenue to improve the ab initio structure modeling for cysteine-rich proteins. AVAILABILITY AND IMPLEMENTATION http://www.csbio.sjtu.edu.cn/bioinf/Cyscon/ CONTACT zhng@umich.edu or hbshen@sjtu.edu.cn. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jing Yang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Bao-Ji He
- State Key Laboratory of Theoretical Physics, Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100190, China, Department of Computational Medicine and Bioinformatics and
| | - Richard Jang
- Department of Computational Medicine and Bioinformatics and
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics and Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China, Department of Computational Medicine and Bioinformatics and
| |
Collapse
|
10
|
Raimondi D, Orlando G, Vranken WF. An Evolutionary View on Disulfide Bond Connectivities Prediction Using Phylogenetic Trees and a Simple Cysteine Mutation Model. PLoS One 2015; 10:e0131792. [PMID: 26161671 PMCID: PMC4498770 DOI: 10.1371/journal.pone.0131792] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2015] [Accepted: 06/07/2015] [Indexed: 01/09/2023] Open
Abstract
Disulfide bonds are crucial for many structural and functional aspects of proteins. They have a stabilizing role during folding, can regulate enzymatic activity and can trigger allosteric changes in the protein structure. Moreover, knowledge of the topology of the disulfide connectivity can be relevant in genomic annotation tasks and can provide long range constraints for ab-initio protein structure predictors. In this paper we describe PhyloCys, a novel unsupervised predictor of disulfide bond connectivity from known cysteine oxidation states. For each query protein, PhyloCys retrieves and aligns homologs with HHblits and builds a phylogenetic tree using ClustalW. A simplified model of cysteine co-evolution is then applied to the tree in order to hypothesize the presence of oxidized cysteines in the inner nodes of the tree, which represent ancestral protein sequences. The tree is then traversed from the leaves to the root and the putative disulfide connectivity is inferred by observing repeated patterns of tandem mutations between a sequence and its ancestors. A final correction is applied using the Edmonds-Gabow maximum weight perfect matching algorithm. The evolutionary approach applied in PhyloCys results in disulfide bond predictions equivalent to Sephiroth, another approach that takes whole sequence information into account, and is 26-29% better than state of the art methods based on cysteine covariance patterns in multiple sequence alignments, while requiring one order of magnitude fewer homologous sequences (10(3) instead of 10(4)), thus extending its range of applicability. The software described in this article and the datasets used are available at http://ibsquare.be/phylocys.
Collapse
Affiliation(s)
- Daniele Raimondi
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Department of Structural Biology, VIB, Brussels, Belgium
- Machine Learning Group, ULB, Brussels, Belgium
| | - Gabriele Orlando
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Department of Structural Biology, VIB, Brussels, Belgium
- Machine Learning Group, ULB, Brussels, Belgium
| | - Wim F. Vranken
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Department of Structural Biology, VIB, Brussels, Belgium
| |
Collapse
|
11
|
Raimondi D, Orlando G, Vranken WF. Clustering-based model of cysteine co-evolution improves disulfide bond connectivity prediction and reduces homologous sequence requirements. Bioinformatics 2014; 31:1219-25. [DOI: 10.1093/bioinformatics/btu794] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 11/18/2014] [Indexed: 12/23/2022] Open
|
12
|
A word of caution about biological inference - Revisiting cysteine covalent state predictions. FEBS Open Bio 2014; 4:310-4. [PMID: 24918043 PMCID: PMC4048844 DOI: 10.1016/j.fob.2014.03.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Revised: 03/05/2014] [Accepted: 03/07/2014] [Indexed: 11/20/2022] Open
Abstract
High prediction accuracy is often believed to validate implicit biological assumptions. Cys redox state predictions assume a local sequence environmental effect. Removing local sequence signals did not reduce prediction accuracy. Cys redox state predictions apparently correlate with subcellular localization. Subcellular localization depends on global sequence composition.
The success of methods for predicting the redox state of cysteine residues from the sequence environment seemed to validate the basic assumption that this state is mainly determined locally. However, the accuracy of predictions on randomized sequences or of non-cysteine residues remained high, suggesting that these predictions rather capture global features of proteins such as subcellular localization, which depends on composition. This illustrates that even high prediction accuracy is insufficient to validate implicit assumptions about a biological phenomenon. Correctly identifying the relevant underlying biochemical reasons for the success of a method is essential to gain proper biological insights and develop more accurate and novel bioinformatics tools.
Collapse
|
13
|
Murad W, Singh R. The MS2DB ${++}$ Webserver: Disulfide Bond Determination Through Evidence Combination. IEEE Trans Nanobioscience 2013; 12:340-2. [DOI: 10.1109/tnb.2013.2289391] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
14
|
Góngora-Benítez M, Tulla-Puche J, Albericio F. Multifaceted Roles of Disulfide Bonds. Peptides as Therapeutics. Chem Rev 2013; 114:901-26. [DOI: 10.1021/cr400031z] [Citation(s) in RCA: 388] [Impact Index Per Article: 35.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Miriam Góngora-Benítez
- Institute
for Research in Biomedicine (IRB Barcelona), Barcelona, 08028 Spain
- CIBER-BBN, Barcelona Science
Park, Barcelona, 08028 Spain
| | - Judit Tulla-Puche
- Institute
for Research in Biomedicine (IRB Barcelona), Barcelona, 08028 Spain
- CIBER-BBN, Barcelona Science
Park, Barcelona, 08028 Spain
| | - Fernando Albericio
- Institute
for Research in Biomedicine (IRB Barcelona), Barcelona, 08028 Spain
- CIBER-BBN, Barcelona Science
Park, Barcelona, 08028 Spain
- Department
of Organic Chemistry, University of Barcelona, Barcelona, 08028 Spain
- School of Chemistry & Physics, University of KwaZulu-Natal, 4001 Durban, South Africa
| |
Collapse
|
15
|
Yaseen A, Li Y. Dinosolve: a protein disulfide bonding prediction server using context-based features to enhance prediction accuracy. BMC Bioinformatics 2013; 14 Suppl 13:S9. [PMID: 24267383 PMCID: PMC3849605 DOI: 10.1186/1471-2105-14-s13-s9] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Disulfide bonds play an important role in protein folding and structure stability. Accurately predicting disulfide bonds from protein sequences is important for modeling the structural and functional characteristics of many proteins. Methods In this work, we introduce an approach of enhancing disulfide bonding prediction accuracy by taking advantage of context-based features. We firstly derive the first-order and second-order mean-force potentials according to the amino acid environment around the cysteine residues from large number of cysteine samples. The mean-force potentials are integrated as context-based scores to estimate the favorability of a cysteine residue in disulfide bonding state as well as a cysteine pair in disulfide bond connectivity. These context-based scores are then incorporated as features together with other sequence and evolutionary information to train neural networks for disulfide bonding state prediction and connectivity prediction. Results The 10-fold cross validated accuracy is 90.8% at residue-level and 85.6% at protein-level in classifying an individual cysteine residue as bonded or free, which is around 2% accuracy improvement. The average accuracy for disulfide bonding connectivity prediction is also improved, which yields overall sensitivity of 73.42% and specificity of 91.61%. Conclusions Our computational results have shown that the context-based scores are effective features to enhance the prediction accuracies of both disulfide bonding state prediction and connectivity prediction. Our disulfide prediction algorithm is implemented on a web server named "Dinosolve" available at: http://hpcr.cs.odu.edu/dinosolve.
Collapse
|
16
|
S-linked protein homocysteinylation: identifying targets based on structural, physicochemical and protein-protein interactions of homocysteinylated proteins. Amino Acids 2013; 44:1307-16. [PMID: 23400378 DOI: 10.1007/s00726-013-1465-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2012] [Accepted: 01/23/2013] [Indexed: 10/27/2022]
Abstract
An elevated level of homocysteine, a thiol-containing amino acid is associated with a wide spectrum of disease conditions. A majority (>80 %) of the circulating homocysteine exist in protein-bound form. Homocysteine can bind to free cysteine residues in the protein or could cleave accessible cysteine disulfide bonds via thiol disulfide exchange reaction. Binding of homocysteine to proteins could potentially alter the structure and/or function of the protein. To date only 21 proteins have been experimentally shown to bind homocysteine. In this study we attempted to identify other proteins that could potentially bind to homocysteine based on the criteria that such proteins will have significant 3D structural homology with the proteins that have been experimentally validated and have solvent accessible cysteine residues either with high dihedral strain energy (for cysteine-cysteine disulfide bonds) or low pKa (for free cysteine residues). This analysis led us to the identification of 78 such proteins of which 68 proteins had 154 solvent accessible disulfide cysteine pairs with high dihedral strain energy and 10 proteins had free cysteine residues with low pKa that could potentially bind to homocysteine. Further, protein-protein interaction network was built to identify the interacting partners of these putative homocysteine binding proteins. We found that the 21 experimentally validated proteins had 174 interacting partners while the 78 proteins identified in our analysis had 445 first interacting partners. These proteins are mainly involved in biological activities such as complement and coagulation pathway, focal adhesion, ECM-receptor, ErbB signalling and cancer pathways, etc. paralleling the disease-specific attributes associated with hyperhomocysteinemia.
Collapse
|
17
|
Lin HH, Tseng LY. Prediction of disulfide bonding pattern based on a support vector machine and multiple trajectory search. Inf Sci (N Y) 2012. [DOI: 10.1016/j.ins.2012.02.035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
18
|
Marques JRF, da Fonseca RR, Drury B, Melo A. Amino acid patterns around disulfide bonds. Int J Mol Sci 2010; 11:4673-86. [PMID: 21151463 PMCID: PMC3000107 DOI: 10.3390/ijms11114673] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2010] [Revised: 11/04/2010] [Accepted: 11/11/2010] [Indexed: 11/16/2022] Open
Abstract
Disulfide bonds provide an inexhaustible source of information on molecular evolution and biological specificity. In this work, we described the amino acid composition around disulfide bonds in a set of disulfide-rich proteins using appropriate descriptors, based on ANOVA (for all twenty natural amino acids or classes of amino acids clustered according to their chemical similarities) and Scheffé (for the disulfide-rich proteins superfamilies) statistics. We found that weakly hydrophilic and aromatic amino acids are quite abundant in the regions around disulfide bonds, contrary to aliphatic and hydrophobic amino acids. The density distributions (as a function of the distance to the center of the disulfide bonds) for all defined entities presented an overall unimodal behavior: the densities are null at short distances, have maxima at intermediate distances and decrease for long distances. In the end, the amino acid environment around the disulfide bonds was found to be different for different superfamilies, allowing the clustering of proteins in a biologically relevant way, suggesting that this type of chemical information might be used as a tool to assess the relationship between very divergent sets of disulfide-rich proteins.
Collapse
Affiliation(s)
- José R. F. Marques
- REQUIMTE/Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, 687, 4169-007 Porto, Portugal; E-Mail:
| | - Rute R. da Fonseca
- CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal; E-Mail:
| | - Brett Drury
- LIAAD-INESC, Rua de Ceuta, 118, 6°, 4050-190 Porto, Portugal; E-Mail:
| | - André Melo
- REQUIMTE/Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, 687, 4169-007 Porto, Portugal; E-Mail:
| |
Collapse
|
19
|
Marques JRF, da Fonseca RR, Drury B, Melo A. Conformational characterization of disulfide bonds: a tool for protein classification. J Theor Biol 2010; 267:388-95. [PMID: 20851707 DOI: 10.1016/j.jtbi.2010.09.012] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2010] [Revised: 08/29/2010] [Accepted: 09/08/2010] [Indexed: 11/25/2022]
Abstract
BACKGROUND Throughout evolution, mutations in particular regions of some protein structures have resulted in extra covalent bonds that increase the overall robustness of the fold: disulfide bonds. The two strategically placed cysteines can also have a more direct role in protein function, either by assisting thiol or disulfide exchange, or through allosteric effects. In this work, we verified how the structural similarities between disulfides can reflect functional and evolutionary relationships between different proteins. We analyzed the conformational patterns of the disulfide bonds in a set of disulfide-rich proteins that included twelve SCOP superfamilies: thioredoxin-like and eleven superfamilies containing small disulfide-rich proteins (SDP). RESULTS The twenty conformations considered in the present study were characterized by both structural and energetic parameters. The corresponding frequencies present diverse patterns for the different superfamilies. The least-strained conformations are more abundant for the SDP superfamilies, while the "catalytic" +/-RHook is dominant for the thioredoxin-like superfamily. The "allosteric" -RHSaple is moderately abundant for BBI, Crisp and Thioredoxin-like superfamilies and less frequent for the remaining superfamilies. Using a hierarchical clustering analysis we found that the twelve superfamilies were grouped in biologically significant clusters. CONCLUSIONS In this work, we carried out an extensive statistical analysis of the conformational motifs for the disulfide bonds present in a set of disulfide-rich proteins. We show that the conformational patterns observed in disulfide bonds are sufficient to group proteins that share both functional and structural patterns and can therefore be used as a criterion for protein classification.
Collapse
Affiliation(s)
- José Rui Ferreira Marques
- REQUIMTE/Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugal.
| | | | | | | |
Collapse
|
20
|
Elumalai P, Wu JW, Liu HL. Current advances in disulfide connectivity predictions. J Taiwan Inst Chem Eng 2010. [DOI: 10.1016/j.jtice.2010.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
21
|
Lin HH, Tseng LY. DBCP: a web server for disulfide bonding connectivity pattern prediction without the prior knowledge of the bonding state of cysteines. Nucleic Acids Res 2010; 38:W503-7. [PMID: 20530534 PMCID: PMC2896133 DOI: 10.1093/nar/gkq514] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The proper prediction of the location of disulfide bridges is efficient in helping to solve the protein folding problem. Most of the previous works on the prediction of disulfide connectivity pattern use the prior knowledge of the bonding state of cysteines. The DBCP web server provides prediction of disulfide bonding connectivity pattern without the prior knowledge of the bonding state of cysteines. The method used in this server improves the accuracy of disulfide connectivity pattern prediction (Qp) over the previous studies reported in the literature. This DBCP server can be accessed at http://120.107.8.16/dbcp or http://140.120.14.136/dbcp.
Collapse
Affiliation(s)
- Hsuan-Hung Lin
- Department of Applied Mathematics, National Chung Hsing University, Taiwan, ROC
| | | |
Collapse
|
22
|
Rajgaria R, McAllister SR, Floudas CA. Towards accurate residue-residue hydrophobic contact prediction for alpha helical proteins via integer linear optimization. Proteins 2009; 74:929-47. [PMID: 18767158 DOI: 10.1002/prot.22202] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
A new optimization-based method is presented to predict the hydrophobic residue contacts in alpha-helical proteins. The proposed approach uses a high resolution distance dependent force field to calculate the interaction energy between different residues of a protein. The formulation predicts the hydrophobic contacts by minimizing the sum of these contact energies. These residue contacts are highly useful in narrowing down the conformational space searched by protein structure prediction algorithms. The proposed algorithm also offers the algorithmic advantage of producing a rank ordered list of the best contact sets. This model was tested on four independent alpha-helical protein test sets and was found to perform very well. The average accuracy of the predictions (separated by at least six residues) obtained using the presented method was approximately 66% for single domain proteins. The average true positive and false positive distances were also calculated for each protein test set and they are 8.87 and 14.67 A, respectively.
Collapse
Affiliation(s)
- R Rajgaria
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, USA
| | | | | |
Collapse
|
23
|
Thangudu RR, Manoharan M, Srinivasan N, Cadet F, Sowdhamini R, Offmann B. Analysis on conservation of disulphide bonds and their structural features in homologous protein domain families. BMC STRUCTURAL BIOLOGY 2008; 8:55. [PMID: 19111067 PMCID: PMC2628669 DOI: 10.1186/1472-6807-8-55] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2008] [Accepted: 12/26/2008] [Indexed: 11/22/2022]
Abstract
Background Disulphide bridges are well known to play key roles in stability, folding and functions of proteins. Introduction or deletion of disulphides by site-directed mutagenesis have produced varying effects on stability and folding depending upon the protein and location of disulphide in the 3-D structure. Given the lack of complete understanding it is worthwhile to learn from an analysis of extent of conservation of disulphides in homologous proteins. We have also addressed the question of what structural interactions replaces a disulphide in a homologue in another homologue. Results Using a dataset involving 34,752 pairwise comparisons of homologous protein domains corresponding to 300 protein domain families of known 3-D structures, we provide a comprehensive analysis of extent of conservation of disulphide bridges and their structural features. We report that only 54% of all the disulphide bonds compared between the homologous pairs are conserved, even if, a small fraction of the non-conserved disulphides do include cytoplasmic proteins. Also, only about one fourth of the distinct disulphides are conserved in all the members in protein families. We note that while conservation of disulphide is common in many families, disulphide bond mutations are quite prevalent. Interestingly, we note that there is no clear relationship between sequence identity between two homologous proteins and disulphide bond conservation. Our analysis on structural features at the sites where cysteines forming disulphide in one homologue are replaced by non-Cys residues show that the elimination of a disulphide in a homologue need not always result in stabilizing interactions between equivalent residues. Conclusion We observe that in the homologous proteins, disulphide bonds are conserved only to a modest extent. Very interestingly, we note that extent of conservation of disulphide in homologous proteins is unrelated to the overall sequence identity between homologues. The non-conserved disulphides are often associated with variable structural features that were recruited to be associated with differentiation or specialisation of protein function.
Collapse
Affiliation(s)
- Ratna R Thangudu
- Laboratoire de Biochimie et Génétique Moléculaire, Université de La Réunion, BP 7151, 15 avenue René Cassin, 97715 Saint Denis Messag Cedex 09, La Réunion, France.
| | | | | | | | | | | |
Collapse
|
24
|
Ullah AHJ, Sethumadhavan K, Mullaney EJ. Unfolding and refolding of Aspergillus niger PhyB phytase: role of disulfide bridges. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2008; 56:8179-8183. [PMID: 18683944 DOI: 10.1021/jf8013712] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
The role of disulfide bridges in the folding of Aspergillus niger phytase pH 2.5-optimum (PhyB) was investigated using dynamic light scattering (DLS). Guanidinium chloride (GuCl) at 1.0 M unfolded phytase; however, its removal by dialysis refolded the protein. The thiol reagent tris(2-carboxyethyl)phosphine (TCEP) reduces the refolding activity by 68%. The hydrodynamic radius (R(H)) of PhyB phytase decreased from 5.5 to 4.14 nm when the protein was subjected to 1.0 M GuCl concentration. The active homodimer, 183 kDa, was reduced to a 92 kDa monomer. The DLS data taken together with activity measurements could indicate whether refolding took place or not in PhyB phytase. The correlation between molecular mass and the state of unfolding and refolding is a very strong one in fungal phytase belonging to histidine acid phosphatase (HAP). Unlike PhyA phytase, for which sodium chloride treatment boosted the activity at 0.5 M salt concentration, PhyB phytase activity was severely inhibited under identical condition. Thus, PhyA and PhyB phytases are structurally very different, and their chemical environment in the active site and substrate-binding domain may be different to elicit such an opposite reaction to monovalent cations.
Collapse
Affiliation(s)
- Abul H J Ullah
- Commodity Utilization Research Unit, Southern Regional Research Center, Agricultural Research Service, U.S. Department of Agriculture, 1100 Robert E. Lee Boulevard, New Orleans, Louisiana 70124, USA.
| | | | | |
Collapse
|
25
|
Rubinstein R, Fiser A. Predicting disulfide bond connectivity in proteins by correlated mutations analysis. Bioinformatics 2008; 24:498-504. [PMID: 18203772 DOI: 10.1093/bioinformatics/btm637] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Prediction of disulfide bond connectivity facilitates structural and functional annotation of proteins. Previous studies suggest that cysteines of a disulfide bond mutate in a correlated manner. RESULTS We developed a method that analyzes correlated mutation patterns in multiple sequence alignments in order to predict disulfide bond connectivity. Proteins with known experimental structures and varying numbers of disulfide bonds, and that spanned various evolutionary distances, were aligned. We observed frequent variation of disulfide bond connectivity within members of the same protein families, and it was also observed that in 99% of the cases, cysteine pairs forming non-conserved disulfide bonds mutated in concert. Our data support the notion that substitution of a cysteine in a disulfide bond prompts the substitution of its cysteine partner and that oxidized cysteines appear in pairs. The method we developed predicts disulfide bond connectivity patterns with accuracies of 73, 69 and 61% for proteins with two, three and four disulfide bonds, respectively.
Collapse
Affiliation(s)
- Rotem Rubinstein
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA.
| | | |
Collapse
|
26
|
Song J, Yuan Z, Tan H, Huber T, Burrage K. Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure. ACTA ACUST UNITED AC 2007; 23:3147-54. [PMID: 17942444 DOI: 10.1093/bioinformatics/btm505] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications. RESULTS We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. AVAILABILITY The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide
Collapse
Affiliation(s)
- Jiangning Song
- Advanced Computational Modelling Centre, The University of Queensland, Brisbane, QLD 4072, Australia
| | | | | | | | | |
Collapse
|
27
|
Abstract
Disulfide bonds play an important role in stabilizing protein structure and regulating protein function. Therefore, the ability to infer disulfide connectivity from protein sequences will be valuable in structural modeling and functional analysis. However, to predict disulfide connectivity directly from sequences presents a challenge to computational biologists due to the nonlocal nature of disulfide bonds, i.e., the close spatial proximity of the cysteine pair that forms the disulfide bond does not necessarily imply the short sequence separation of the cysteine residues. Recently, Chen and Hwang (Proteins 2005;61:507-512) treated this problem as a multiple class classification by defining each distinct disulfide pattern as a class. They used multiple support vector machines based on a variety of sequence features to predict the disulfide patterns. Their results compare favorably with those in the literature for a benchmark dataset sharing less than 30% sequence identity. However, since the number of disulfide patterns grows rapidly when the number of disulfide bonds increases, their method performs unsatisfactorily for the cases of large number of disulfide bonds. In this work, we propose a novel method to represent disulfide connectivity in terms of cysteine pairs, instead of disulfide patterns. Since the number of bonding states of the cysteine pairs is independent of that of disulfide bonds, the problem of class explosion is avoided. The bonding states of the cysteine pairs are predicted using the support vector machines together with the genetic algorithm optimization for feature selection. The complete disulfide patterns are then determined from the connectivity matrices that are constructed from the predicted bonding states of the cysteine pairs. Our approach outperforms the current approaches in the literature.
Collapse
Affiliation(s)
- Chih-Hao Lu
- Institute of Bioinformatics, National Chiao Tung University, Hsinchu 30050, Taiwan
| | | | | | | |
Collapse
|
28
|
Thangudu RR, Sharma P, Srinivasan N, Offmann B. Analycys: A database for conservation and conformation of disulphide bonds in homologous protein domains. Proteins 2007; 67:255-61. [PMID: 17285632 DOI: 10.1002/prot.21318] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Disulphide bonds in proteins are known to play diverse roles ranging from folding to structure to function. Thorough knowledge of the conservation status and structural state of the disulphide bonds will help in understanding of the differences in homologous proteins. Here we present a database for the analysis of conservation and conformation of disulphide bonds in SCOP structural families. This database has a wide range of applications including mapping of disulphide bond mutation patterns, identification of disulphide bonds important for folding and stabilization, modeling of protein tertiary structures and in protein engineering. The database can be accessed at: http://bioinformatics.univ-reunion.fr/analycys/.
Collapse
Affiliation(s)
- Ratna R Thangudu
- Laboratoire de Biochimie et Génétique Moléculaire, Université de La Réunion, BP 7151, 15 Avenue René Cassin, 97715 Saint Denis Messag Cedex 09, La Réunion, France
| | | | | | | |
Collapse
|
29
|
Liu HL, Chen SC. Prediction of disulfide connectivity in proteins with support vector machine. ACTA ACUST UNITED AC 2007. [DOI: 10.1016/j.jcice.2006.09.002] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
30
|
Song J, Wang M, Burrage K. Exploring synonymous codon usage preferences of disulfide-bonded and non-disulfide bonded cysteines in the E. coli genome. J Theor Biol 2006; 241:390-401. [PMID: 16427089 DOI: 10.1016/j.jtbi.2005.12.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2005] [Revised: 10/31/2005] [Accepted: 12/05/2005] [Indexed: 11/27/2022]
Abstract
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Collapse
Affiliation(s)
- Jiangning Song
- Advanced Computational Modelling Centre, The University of Queensland, Brisbane, Qld 4072, Australia.
| | | | | |
Collapse
|
31
|
Thangudu RR, Vinayagam A, Pugalenthi G, Manonmani A, Offmann B, Sowdhamini R. Native and modeled disulfide bonds in proteins: knowledge-based approaches toward structure prediction of disulfide-rich polypeptides. Proteins 2006; 58:866-79. [PMID: 15645448 DOI: 10.1002/prot.20369] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Structure prediction and three-dimensional modeling of disulfide-rich systems are challenging due to the limited number of such folds in the structural databank. We exploit the stereochemical compatibility of substructures in known protein structures to accommodate disulfide bonds in predicting the structures of disulfide-rich polypeptides directly from disulfide connectivity pattern and amino acid sequence in the absence of structural homologs and any other structural information. This knowledge-based approach is illustrated using structure prediction of 40 nonredundant bioactive disulfide-rich polypeptides such as toxins, growth factors, and endothelins available in the structural databank. The polypeptide conformation could be predicted in 35 out of 40 nonredundant entries (87%). Nonhomologous templates could be identified and models could be obtained within 2 A deviation from the query in 29 peptides (72%). This procedure can be accessed from the World Wide Web (http://www.ncbs.res.in/ approximately faculty/mini/dsdbase/dsdbase.html).
Collapse
Affiliation(s)
- Ratna Rajesh Thangudu
- Laboratoire de Biochimie et Génétique Moléculaire, Université de La Réunion, La Réunion, France
| | | | | | | | | | | |
Collapse
|
32
|
Milbradt AG, Boulegue C, Moroder L, Renner C. The two cysteine-rich head domains of minicollagen from Hydra nematocysts differ in their cystine framework and overall fold despite an identical cysteine sequence pattern. J Mol Biol 2005; 354:591-600. [PMID: 16257007 DOI: 10.1016/j.jmb.2005.09.080] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2005] [Revised: 09/16/2005] [Accepted: 09/27/2005] [Indexed: 12/01/2022]
Abstract
Synthetic replicates of naturally occurring cysteine-rich peptides such as hormones, neurotransmitters, growth factors, enzyme inhibitors, defensins and toxins often can be oxidatively folded in high yields to their native structure in simple redox buffers. Thereby, identical cysteine patterns in the sequence were found to generate identical disulfide connectivities and homologous spatial structures despite significant variability in the non-cysteine positions. Minicollagen-1 from the nematocysts of Hydra is a trimeric protein that contains cysteine-rich domains at the N and C termini, which are involved in the assembly of an intermolecular disulfide network. Determination of the three-dimensional structures of peptides corresponding to the N-terminal and C-terminal domains by NMR spectroscopy revealed a remarkable exception from the general rule. Despite an identical cysteine pattern, the two domains of minicollagen-1 form different disulfide bridges and exhibit distinctly different folds, both of which are not found in the current structural databases. To our knowledge, this is the first case where two relatively short peptides with the abundant cysteine residues in identical sequence positions fold uniquely and with high yields into defined, but differing, structures. Therefore, the cysteine-rich domains of minicollagen constitute ideal model systems for studies of the interplay between folding and oxidation in proteins.
Collapse
|
33
|
Abstract
Many eukaryotic proteins share a sequence designated as the zona pellucida (ZP) domain. This structural element, present in extracellular proteins from a wide variety of organisms, from nematodes to mammals, consists of approximately 260 amino acids with eight conserved cysteine (Cys) residues and is located close to the C terminus of the polypeptide. ZP domain proteins are often glycosylated, modular structures consisting of multiple types of domains. Predictions can be made about some of the structural features of the ZP domain and ZP domain proteins. The functions of ZP domain proteins vary tremendously, from serving as structural components of egg coats, appendicularian mucous houses, and nematode dauer larvae, to serving as mechanotransducers in flies and receptors in mammals and nonmammals. Generally, ZP domain proteins are present in filaments and/or matrices, which is consistent with the role of the domain in protein polymerization. A general mechanism for assembly of ZP domain proteins has been presented. It is likely that the ZP domain plays a common role despite its presence in proteins of widely diverse functions.
Collapse
Affiliation(s)
- Luca Jovine
- Brookdale Department of Molecular, Cell, and Developmental Biology, Mount Sinai School of Medicine, New York, New York 10029-6574, USA.
| | | | | | | |
Collapse
|
34
|
Abstract
The difficulties in predicting disulfide connectivity from protein sequences lie in the nonlocal properties of the disulfide bridges that involve cysteine pairs at large sequence separation. Though some progress has been recently made in the prediction of disulfide connectivity, the current methods predict less than half of the disulfide patterns for the data set sharing less than 30% sequence identity. In this report, we use the support vector machines based on sequence features such as the coupling between the local sequence environments of cysteine pair, the cysteines sequence separations, and the global sequence descriptor, such as amino acid content. Our approach is able to predict 55% of the disulfide patterns of proteins with two to five disulfide bridges, which is 11-26% higher than other methods in the literature.
Collapse
Affiliation(s)
- Yu-Ching Chen
- Institute of Bioinformatics, National Chiao Tung University, Taiwan, Republic of China
| | | |
Collapse
|
35
|
Abstract
Correctly predicting the disulfide bond topology in a protein is of crucial importance for the understanding of protein function and can be of great help for tertiary prediction methods. The web server http://clavius.bc.edu/~clotelab/DiANNA/ outputs the disulfide connectivity prediction given input of a protein sequence. The following procedure is performed. First, PSIPRED is run to predict the protein's secondary structure, then PSIBLAST is run against the non-redundant SwissProt to obtain a multiple alignment of the input sequence. The predicted secondary structure and the profile arising from this alignment are used in the training phase of our neural network. Next, cysteine oxidation state is predicted, then each pair of cysteines in the protein sequence is assigned a likelihood of forming a disulfide bond--this is performed by means of a novel architecture (diresidue neural network). Finally, Rothberg's implementation of Gabow's maximum weighted matching algorithm is applied to diresidue neural network scores in order to produce the final connectivity prediction. Our novel neural network-based approach achieves results that are comparable and in some cases better than the current state-of-the-art methods.
Collapse
Affiliation(s)
- F. Ferrè
- Department of Biology, Boston CollegeChestnut Hill, MA 02467, USA
| | - P. Clote
- Department of Biology, Boston CollegeChestnut Hill, MA 02467, USA
- Department of Computer Science (courtesy appointment), Boston CollegeChestnut Hill, MA 02467, USA
- To whom correspondence should be addressed. Tel: +1 617 552 1332; Fax: +1 617 552 2011;
| |
Collapse
|
36
|
Gupta A, Van Vlijmen HWT, Singh J. A classification of disulfide patterns and its relationship to protein structure and function. Protein Sci 2005; 13:2045-58. [PMID: 15273305 PMCID: PMC2279833 DOI: 10.1110/ps.04613004] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
We report a detailed classification of disulfide patterns to further understand the role of disulfides in protein structure and function. The classification is applied to a unique searchable database of disulfide patterns derived from the SwissProt and Pfam databases. The disulfide database contains seven times the number of publicly available disulfide annotations. Each disulfide pattern in the database captures the topology and cysteine spacing of a protein domain. We have clustered the domains by their disulfide patterns and visualized the results using a novel representation termed the "classification wheel." The classification is applied to 40,620 protein domains with 2-10 disulfides. The effectiveness of the classification is evaluated by determining the extent to which proteins of similar structure and function are grouped together through comparison with the SCOP and Pfam databases, respectively. In general, proteins with similar disulfide patterns have similar structure and function, even in cases of low sequence similarity, and we illustrate this with specific examples. Using a measure of disulfide topology complexity, we find that there is a predominance of less complex topologies. We also explored the importance of loss or addition of disulfides to protein structure and function by linking classification wheels through disulfide subpattern comparisons. This classification, when coupled with our disulfide database, will serve as a useful resource for searching and comparing disulfide patterns, and understanding their role in protein structure, folding, and stability. Proteins in the disulfide clusters that do not contain structural information are prime candidates for structural genomics initiatives, because they may correspond to novel structures.
Collapse
Affiliation(s)
- Abhas Gupta
- Computational Drug Design Group, Biogen Idec, Inc., Cambridge, Masschusetts 02142, USA
| | | | | |
Collapse
|
37
|
Jiang-Ning S, Wei-Jiang L, Wen-Bo X. Cooperativity of the oxidization of cysteines in globular proteins. J Theor Biol 2004; 231:85-95. [PMID: 15363931 DOI: 10.1016/j.jtbi.2004.06.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2003] [Revised: 06/01/2004] [Accepted: 06/07/2004] [Indexed: 11/17/2022]
Abstract
Based on the 639 non-homologous proteins with 2910 cysteine-containing segments of well-resolved three-dimensional structures, a novel approach has been proposed to predict the disulfide-bonding state of cysteines in proteins by constructing a two-stage classifier combining a first global linear discriminator based on their amino acid composition and a second local support vector machine classifier. The overall prediction accuracy of this hybrid classifier for the disulfide-bonding state of cysteines in proteins has scored 84.1% and 80.1%, when measured on cysteine and protein basis using the rigorous jack-knife procedure, respectively. It shows that whether cysteines should form disulfide bonds depends not only on the global structural features of proteins but also on the local sequence environment of proteins. The result demonstrates the applicability of this novel method and provides comparable prediction performance compared with existing methods for the prediction of the oxidation states of cysteines in proteins.
Collapse
Affiliation(s)
- Song Jiang-Ning
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Southern Yangtze University, 170 Huihe Road, Wuxi 214036, China.
| | | | | |
Collapse
|
38
|
Barrientos LG, Martin AM, Rollin PE, Sanchez A. Disulfide bond assignment of the Ebola virus secreted glycoprotein SGP. Biochem Biophys Res Commun 2004; 323:696-702. [PMID: 15369806 DOI: 10.1016/j.bbrc.2004.08.148] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2004] [Indexed: 11/17/2022]
Abstract
The non-structural glycoprotein (SGP) of Ebola virus (EboV) is secreted in large amounts from infected cells as a disulfide-linked homodimer. In this communication, highly purified SGP, derived from Vero E6 cultures infected with the Zaire species of EboV, was used to determine the correct localization of inter- and intrachain disulfide bonds. Matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry analysis of proteolytic cleavage fragments indicates that all cysteines (six per monomeric unit) form unique disulfide bonds. Monomers of the SGP homodimer are joined in a parallel manner by two intersubunit disulfide bonds formed between paired N-terminal and C-terminal cysteines (C53-C53' and C306-C306'). The remaining cysteines are involved in intrachain disulfide bonding (paired as C108-C135 and C121-C147), which resembles the disulfide bond topology of fibronectin type II domains. The findings presented here provide the foundation for future studies aimed at defining the structural and functional properties of SGP.
Collapse
Affiliation(s)
- Laura G Barrientos
- Special Pathogens Branch, Division of Viral and Rickettsial Diseases, Scientific Resources Program, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | | | | | | |
Collapse
|
39
|
Lenffer J, Lai P, El Mejaber W, Khan AM, Koh JLY, Tan PTJ, Seah SH, Brusic V. CysView: protein classification based on cysteine pairing patterns. Nucleic Acids Res 2004; 32:W350-5. [PMID: 15215409 PMCID: PMC441613 DOI: 10.1093/nar/gkh475] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
CysView is a web-based application tool that identifies and classifies proteins according to their disulfide connectivity patterns. It accepts a dataset of annotated protein sequences in various formats and returns a graphical representation of cysteine pairing patterns. CysView displays cysteine patterns for those records in the data with disulfide annotations. It allows the viewing of records grouped by connectivity patterns. CysView's utility as an analysis tool was demonstrated by the rapid and correct classification of scorpion toxin entries from GenPept on the basis of their disulfide pairing patterns. It has proved useful for rapid detection of irrelevant and partial records, or those with incomplete annotations. CysView can be used to support distant homology between proteins. CysView is publicly available at http://research.i2r.a-star.edu.sg/CysView/.
Collapse
Affiliation(s)
- Johann Lenffer
- Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613 Singapore
| | | | | | | | | | | | | | | |
Collapse
|
40
|
Wu W, Huang W, Qi J, Chou YT, Torng E, Watson JT. ‘Signature Sets', Minimal Fragment Sets for Identifying Protein Disulfide Structures with Cyanylation-Based Mass Mapping Methodology. J Proteome Res 2004; 3:770-7. [PMID: 15359730 DOI: 10.1021/pr049961t] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Our cyanylation (CN)-based methodology for determining disulfide structure of cystinyl proteins overcomes the limitations of conventional proteolytic methods. However, the CN-based method has the potential drawback that occasionally some CN-induced cleavage fragments may not be detected. We show that CN-based methods can overcome the failure to detect fragments by demonstrating the existence of small 'signature sets' of fragments. The link between signature sets and the robustness of CN-based methodology is validated by two case studies.
Collapse
Affiliation(s)
- Wei Wu
- Departments of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | | | | | | | | | | |
Collapse
|
41
|
Song JN, Wang ML, Li WJ, Xu WB. Prediction of the disulfide-bonding state of cysteines in proteins based on dipeptide composition. Biochem Biophys Res Commun 2004; 318:142-7. [PMID: 15110765 DOI: 10.1016/j.bbrc.2004.03.189] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2004] [Indexed: 11/24/2022]
Abstract
In this paper, a novel approach has been introduced to predict the disulfide-bonding state of cysteines in proteins by means of a linear discriminator based on their dipeptide composition. The prediction is performed with a newly enlarged dataset with 8114 cysteine-containing segments extracted from 1856 non-homologous proteins of well-resolved three-dimensional structures. The oxidation of cysteines exhibits obvious cooperativity: almost all cysteines in disulfide-bond-containing proteins are in the oxidized form. This cooperativity can be well described by protein's dipeptide composition, based on which the prediction accuracy of the oxidation form of cysteines scores as high as 89.1% and 85.2%, when measured on cysteine and protein basis using the rigorous jack-knife procedure, respectively. The result demonstrates the applicability of this new relatively simple method and provides superior prediction performance compared with existing methods for the prediction of the oxidation states of cysteines in proteins.
Collapse
Affiliation(s)
- Jiang-Ning Song
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Southern Yangtze University, Wuxi 214036, China.
| | | | | | | |
Collapse
|
42
|
Chen YC, Lin YS, Lin CJ, Hwang JK. Prediction of the bonding states of cysteines Using the support vector machines based on multiple feature vectors and cysteine state sequences. Proteins 2004; 55:1036-42. [PMID: 15146500 DOI: 10.1002/prot.20079] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The support vector machine (SVM) method is used to predict the bonding states of cysteines. Besides using local descriptors such as the local sequences, we include global information, such as amino acid compositions and the patterns of the states of cysteines (bonded or nonbonded), or cysteine state sequences, of the proteins. We found that SVM based on local sequences or global amino acid compositions yielded similar prediction accuracies for the data set comprising 4136 cysteine-containing segments extracted from 969 nonhomologous proteins. However, the SVM method based on multiple feature vectors (combining local sequences and global amino acid compositions) significantly improves the prediction accuracy, from 80% to 86%. If coupled with cysteine state sequences, SVM based on multiple feature vectors yields 90% in overall prediction accuracy and a 0.77 Matthews correlation coefficient, around 10% and 22% higher than the corresponding values obtained by SVM based on local sequence information.
Collapse
Affiliation(s)
- Yu-Ching Chen
- Institute of Bioinformatics, National Chiao Tung University, HsinChu, Taiwan, ROC
| | | | | | | |
Collapse
|