1
|
Khrustalev VV, Khrustaleva OV, Stojarov AN, Akunevich AA, Baranov OE, Popinako AV, Samoilovich EO, Yermolovich MA, Semeiko GV, Cheprasova VI, Sapon EG, Shalygo NV, Poboinev VV, Khrustaleva TA, Ranishenka BV, Kharytonova UV, Bush D. Conjugation with the Carrier Helped to Reveal acidification-Induced Structural Shift in the Peptide from Phospholipase Domain of Parvovirus B19. Protein J 2024:10.1007/s10930-024-10209-w. [PMID: 38980534 DOI: 10.1007/s10930-024-10209-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2024] [Indexed: 07/10/2024]
Abstract
Spectroscopic studies on domains and peptides of large proteins are complicated because of the tendency of short peptides to form oligomers in aquatic buffers, but conjugation of a peptide with a carrier protein may be helpful. In this study we approved that a fragment of SK30 peptide from phospholipase A2 domain of VP1 Parvovirus B19 capsid protein (residues: 144-159; 164; 171-183; sequence: SAVDSAARIHDFRYSQLAKLGINPYTHWTVADEELLKNIK) turns from random coil to alpha helix in the acidic medium only in case if it had been conjugated with BSA (through additional N-terminal Cys residue, turning it into CSK31 peptide, and SMCC linker) according to CD-spectroscopy results. In contrast, unconjugated SK30 peptide does not undergo such shift because it forms stable oligomers connected by intermolecular antiparallel beta sheet, according to IR-spectroscopy, CD-spectroscopy, blue native gel electrophoresis and centrifugal ultrafiltration, as, probably, the whole isolated phospholipase domain of VP1 protein does. However, being a part of the long VP1 capsid protein, phospholipase domain may change its fold during the acidification of the medium in the endolysosome by the way of the formation of contacts between protonated His153 and Asp175, promoting the shift from random coil to alpha helix in its N-terminal part. This study opens up a perspective of vaccine development, since rabbit polyclonal antibodies against the conjugate of CSK31 peptide with BSA, in which the structure of the second alpha helix from the phospholipase A2 domain should be reproduced, can bind epitopes of the complete recombinant unique part of VP1 Parvovirus B19 capsid (residues: 1-227).
Collapse
Affiliation(s)
| | - Olga Victorovna Khrustaleva
- Department of General Chemistry, Belarusian State Medical University, Dzerzhinskogo 83, Minsk, 220045, 220083, Belarus
| | | | | | - Oleg Evgenyevich Baranov
- Bach Institute of Biochemistry, Shared-Access Equipment Centre "Industrial Biotechnology" of Russian Academy of Science, Leninskiy prospect, 33/2, Moscow, 119071, Russian Federation
| | - Anna Vladimirovna Popinako
- Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, Leninskiy prospect, 33/2, Moscow, 119071, Russian Federation
| | - Elena Olegovna Samoilovich
- Laboratory of Vaccine-controlled Infections, Republican Research and Practical Center for Epidemiology and Microbiology, Filimonova 23, Minsk, 220114, Belarus
| | - Marina Anatolyevna Yermolovich
- Laboratory of Vaccine-controlled Infections, Republican Research and Practical Center for Epidemiology and Microbiology, Filimonova 23, Minsk, 220114, Belarus
| | - Galina Valeryevna Semeiko
- Laboratory of Vaccine-controlled Infections, Republican Research and Practical Center for Epidemiology and Microbiology, Filimonova 23, Minsk, 220114, Belarus
| | - Victoria Igorevna Cheprasova
- Laboratory of infra-red spectroscopy and infra-red microscopy, Belarusian State Technological University, Sverdlova 13a, Minsk, 220006, Belarus
| | - Egor Gennadyevich Sapon
- Laboratory of infra-red spectroscopy and infra-red microscopy, Belarusian State Technological University, Sverdlova 13a, Minsk, 220006, Belarus
| | - Nikolai Vladimirovich Shalygo
- Department of General Chemistry, Belarusian State Medical University, Dzerzhinskogo 83, Minsk, 220045, 220083, Belarus
| | - Victor Vitoldovich Poboinev
- Department of General Chemistry, Belarusian State Medical University, Dzerzhinskogo 83, Minsk, 220045, 220083, Belarus
| | - Tatyana Aleksandrovna Khrustaleva
- Laboratory of Biomedical Technologies and Medical Rehabilitation, Institute of Physiology of the National Academy of Sciences of Belarus, Academicheskaya 28, Minsk, 220072, Belarus
| | - Bahdan Vyacheslavovich Ranishenka
- Laboratory of Chemistry of Bioconjugates, Institute of Physical-organic Chemistry of the National Academy of Sciences of Belarus, Surganova 13, Minsk, 220072, Belarus
| | - Ulyana Vitalyevna Kharytonova
- Department of General Chemistry, Belarusian State Medical University, Dzerzhinskogo 83, Minsk, 220045, 220083, Belarus
| | - Daniel Bush
- Department of General Chemistry, Belarusian State Medical University, Dzerzhinskogo 83, Minsk, 220045, 220083, Belarus
| |
Collapse
|
2
|
Khrustalev VV. The PentaFOLD 3.0 Algorithm for the Selection of Stable Elements of Secondary Structure to be Included in Vaccine Peptides. Protein Pept Lett 2021; 28:573-588. [PMID: 33172366 DOI: 10.2174/0929866527666201110123851] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 10/18/2020] [Accepted: 10/19/2020] [Indexed: 12/18/2022]
Abstract
AIMS The aim of this study was to create a new version of the PentaFOLD algorithm and to test its performance experimentally in several proteins and peptides. BACKGROUND Synthetic vaccines can cause production of neutralizing antibodies only in case if short peptides form the same secondary structure as fragments of full-length proteins. The Penta- FOLD 3.0 algorithm was designed to check stability of alpha helices, beta strands, and random coils using several propensity scales obtained during analysis of 1730 3D structures of proteins. OBJECTIVE The algorithm has been tested in the three peptides known to keep the secondary structure of the corresponding fragments of full-length proteins: the NY25 peptide from the Influenza H1N1 hemagglutinin, the SF23 peptide from the diphtheria toxin, the NQ21 peptide from the HIV1 gp120; as well as in the CC36 peptide from the human major prion protein. METHODS Affine chromatography for antibodies against peptides accompanied by circular dichroism and fluorescence spectroscopy were used to check the predictions of the algorithm. RESULTS Immunological experiments showed that all abovementioned peptides are more or less immunogenic in rabbits. The fact that antibodies against the NY25, the SF23, and the NQ21 form stable complexes with corresponding full-length proteins has been confirmed by affine chromatography. The surface of SARS CoV-2 spike receptor-binding domain interacting with hACE2 has been shown to be unstable according to the results of the PentaFOLD 3.0. CONCLUSION The PentaFOLD 3.0 algorithm (http://chemres.bsmu.by/PentaFOLD30.htm) can be used with the aim to design vaccine peptides with stable secondary structure elements.
Collapse
|
3
|
Kleppe AS, Bornberg-Bauer E. Robustness by intrinsically disordered C-termini and translational readthrough. Nucleic Acids Res 2019; 46:10184-10194. [PMID: 30247639 PMCID: PMC6365619 DOI: 10.1093/nar/gky778] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 09/20/2018] [Indexed: 12/20/2022] Open
Abstract
During protein synthesis genetic instructions are passed from DNA via mRNA to the ribosome to assemble a protein chain. Occasionally, stop codons in the mRNA are bypassed and translation continues into the untranslated region (3′-UTR). This process, called translational readthrough (TR), yields a protein chain that becomes longer than would be predicted from the DNA sequence alone. Protein sequences vary in propensity for translational errors, which may yield evolutionary constraints by limiting evolutionary paths. Here we investigated TR in Saccharomyces cerevisiae by analysing ribosome profiling data. We clustered proteins as either prone or non-prone to TR, and conducted comparative analyses. We find that a relatively high frequency (5%) of genes undergo TR, including ribosomal subunit proteins. Our main finding is that proteins undergoing TR are highly expressed and have a higher proportion of intrinsically disordered C-termini. We suggest that highly expressed proteins may compensate for the deleterious effects of TR by having intrinsically disordered C-termini, which may provide conformational flexibility but without distorting native function. Moreover, we discuss whether minimizing deleterious effects of TR is also enabling exploration of the phenotypic landscape of protein isoforms.
Collapse
Affiliation(s)
- April Snofrid Kleppe
- Institute of Biodiversity and Evolution, University of Münster, Hüfferstr. 1, 48151 Münster, Germany
| | - Erich Bornberg-Bauer
- Institute of Biodiversity and Evolution, University of Münster, Hüfferstr. 1, 48151 Münster, Germany
| |
Collapse
|
4
|
Khrustalev VV, Khrustaleva TA, Poboinev VV. Amino acid content of beta strands and alpha helices depends on their flanking secondary structure elements. Biosystems 2018; 168:45-54. [DOI: 10.1016/j.biosystems.2018.04.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2017] [Revised: 04/21/2018] [Accepted: 04/23/2018] [Indexed: 12/17/2022]
|
5
|
Manjegowda DS, Karunakar P, Ramachandra NB. Effect of Structural Changes in Proteins Derived from GATA4 Nonsynonymous Single Nucleotide Polymorphisms in Congenital Heart Disease. Indian J Pharm Sci 2016; 77:735-41. [PMID: 26997702 PMCID: PMC4778234 DOI: 10.4103/0250-474x.174988] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Congenital heart disease is the most common type of birth defect. The single nucleotide polymorphism in GATA4 is associated with various congenital heart disease phenotypes. In the present study, we analysed the nonsynonymous single nucleotide polymorphism of GATA4, which are involved in congenital heart disease by predicting the changes in protein structures. Total of 49 nonsynonymous single nucleotide polymorphisms of GATA4 was screened from congenital heart disease patients of Mysore and also globally reported nonsynonymous single nucleotide polymorphisms. To understand the role of nonsynonymous single nucleotide polymorphisms, we mutated the sequence and translated into amino acids. Further the mutated protein secondary structure is predicted and tertiary structure is predicted using homology modeling. The quantitative evaluation of protein structure quality was verified with Volume Area Dihedral Angle Reporter server. Results revealed the secondary, tertiary structural changes along with changes in free energy of folding, volume and accessible surface area. Thus, the structural changes in the mutated proteins impaired the normal function of GATA4.
Collapse
Affiliation(s)
- D S Manjegowda
- NUCSER, K S Hegde Medical Academy, NITTE University, Deralakatte, Mangalore-575 018, India
| | - P Karunakar
- Department of Biotechnology, PES Institute of Technology, BSK III Stage, Bengaluru-560 085, India, India
| | - N B Ramachandra
- Department of Studies in Zoology, Genomics Laboratory, University of Mysore, Manasagangotri, Mysore-570 006, India
| |
Collapse
|
6
|
de Oliveira SHP, Shi J, Deane CM. Building a better fragment library for de novo protein structure prediction. PLoS One 2015; 10:e0123998. [PMID: 25901595 PMCID: PMC4406757 DOI: 10.1371/journal.pone.0123998] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2014] [Accepted: 02/25/2015] [Indexed: 01/11/2023] Open
Abstract
Fragment-based approaches are the current standard for de novo protein structure prediction. These approaches rely on accurate and reliable fragment libraries to generate good structural models. In this work, we describe a novel method for structure fragment library generation and its application in fragment-based de novo protein structure prediction. The importance of correct testing procedures in assessing the quality of fragment libraries is demonstrated. In particular, the exclusion of homologs to the target from the libraries to correctly simulate a de novo protein structure prediction scenario, something which surprisingly is not always done. We demonstrate that fragments presenting different predominant predicted secondary structures should be treated differently during the fragment library generation step and that exhaustive and random search strategies should both be used. This information was used to develop a novel method, Flib. On a validation set of 41 structurally diverse proteins, Flib libraries presents both a higher precision and coverage than two of the state-of-the-art methods, NNMake and HHFrag. Flib also achieves better precision and coverage on the set of 275 protein domains used in the two previous experiments of the the Critical Assessment of Structure Prediction (CASP9 and CASP10). We compared Flib libraries against NNMake libraries in a structure prediction context. Of the 13 cases in which a correct answer was generated, Flib models were more accurate than NNMake models for 10. “Flib is available for download at: http://www.stats.ox.ac.uk/research/proteins/resources”.
Collapse
Affiliation(s)
| | - Jiye Shi
- Department of Informatics, UCB Pharma, Slough, United Kingdom
- Shanghai Institute of Applied Physics, Chinese Academy of Sciences, Shanghai, China
| | - Charlotte M. Deane
- Department of Statistics, Oxford University, Oxford, Oxfordshire, United Kingdom
| |
Collapse
|
7
|
Decoding mechanisms by which silent codon changes influence protein biogenesis and function. Int J Biochem Cell Biol 2015; 64:58-74. [PMID: 25817479 DOI: 10.1016/j.biocel.2015.03.011] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Revised: 03/02/2015] [Accepted: 03/14/2015] [Indexed: 02/07/2023]
Abstract
SCOPE Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. PURPOSE This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. PHYSIOLOGICAL AND MEDICAL RELEVANCE Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies.
Collapse
|
8
|
Singh RK, Tiwari MK, Singh R, Lee JK. From protein engineering to immobilization: promising strategies for the upgrade of industrial enzymes. Int J Mol Sci 2013; 14:1232-77. [PMID: 23306150 PMCID: PMC3565319 DOI: 10.3390/ijms14011232] [Citation(s) in RCA: 265] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2012] [Revised: 11/14/2012] [Accepted: 12/24/2012] [Indexed: 11/16/2022] Open
Abstract
Enzymes found in nature have been exploited in industry due to their inherent catalytic properties in complex chemical processes under mild experimental and environmental conditions. The desired industrial goal is often difficult to achieve using the native form of the enzyme. Recent developments in protein engineering have revolutionized the development of commercially available enzymes into better industrial catalysts. Protein engineering aims at modifying the sequence of a protein, and hence its structure, to create enzymes with improved functional properties such as stability, specific activity, inhibition by reaction products, and selectivity towards non-natural substrates. Soluble enzymes are often immobilized onto solid insoluble supports to be reused in continuous processes and to facilitate the economical recovery of the enzyme after the reaction without any significant loss to its biochemical properties. Immobilization confers considerable stability towards temperature variations and organic solvents. Multipoint and multisubunit covalent attachments of enzymes on appropriately functionalized supports via linkers provide rigidity to the immobilized enzyme structure, ultimately resulting in improved enzyme stability. Protein engineering and immobilization techniques are sequential and compatible approaches for the improvement of enzyme properties. The present review highlights and summarizes various studies that have aimed to improve the biochemical properties of industrially significant enzymes.
Collapse
Affiliation(s)
- Raushan Kumar Singh
- Department of Chemical Engineering, Konkuk University, 1 Hwayang-Dong, Gwangjin-Gu, Seoul 143-701, Korea.
| | | | | | | |
Collapse
|
9
|
Mao Y, Wang W, Cheng N, Li Q, Tao S. Universally increased mRNA stability downstream of the translation initiation site in eukaryotes and prokaryotes. Gene 2013; 517:230-5. [PMID: 23313297 DOI: 10.1016/j.gene.2012.12.062] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2012] [Accepted: 12/03/2012] [Indexed: 11/26/2022]
Abstract
Local secondary structures in coding sequences have important functions across various translational processes. To date, however, the local structures and their functions in the early stage of translation elongation remain poorly understood. Here, we surveyed the structural stability in the first 180 nucleotides of the coding sequence of 27 species using computational method. We found that the structural stability in the 30-80 nucleotide interval was significantly higher than that in other regions in eukaryotes and most prokaryotes. No significant correlation between local translation efficiency and structural stability was observed, suggesting that this structural region has undergone selection pressure directly to maintain high stability. Furthermore, ribosome was blocked by this region, providing an opportunity for co-translational regulation. Remarkably, in eukaryotes, we found that mRNAs with higher structural stability in the 30-80 nucleotide interval tended to encode the secreted proteins. Overall, our results revealed a previously unappreciated correlation between structural stability and protein localization.
Collapse
Affiliation(s)
- Yuanhui Mao
- State Key Laboratory of Crop Stress Biology in Arid Areas and College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
| | | | | | | | | |
Collapse
|
10
|
Västermark Å, Krishnan A, Houle ME, Fredriksson R, Cerdá-Reverter JM, Schiöth HB. Identification of distant Agouti-like sequences and re-evaluation of the evolutionary history of the Agouti-related peptide (AgRP). PLoS One 2012; 7:e40982. [PMID: 22815886 PMCID: PMC3397983 DOI: 10.1371/journal.pone.0040982] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Accepted: 06/15/2012] [Indexed: 01/27/2023] Open
Abstract
The Agouti-like peptides including AgRP, ASIP and the teleost-specific A2 (ASIP2 and AgRP2) peptides have potent and diverse functional roles in feeding, pigmentation and background adaptation mechanisms. There are contradictory theories about the evolution of the Agouti-like peptide family as well the nomenclature. Here we performed comprehensive mining and annotation of vertebrate Agouti-like sequences. We identified A2 sequences from salmon, trout, seabass, cod, cichlid, tilapia, gilt-headed sea bream, Antarctic toothfish, rainbow smelt, common carp, channel catfish and interestingly also in lobe-finned fish. Moreover, we surprisingly found eight novel homologues from the kingdom of arthropods and three from fungi, some sharing the characteristic C-x(6)-C-C motif which are present in the Agouti-like sequences, as well as approximate sequence length (130 amino acids), positioning of the motif sequence and sharing of exon-intron structures that are similar to the other Agouti-like peptides providing further support for the common origin of these sequences. Phylogenetic analysis shows that the AgRP sequences cluster basally in the tree, suggesting that these sequences split from a cluster containing both the ASIP and the A2 sequences. We also used a novel approach to determine the statistical evidence for synteny, a sinusoidal Hough transform pattern recognition technique. Our analysis shows that the teleost AgRP2 resides in a chromosomal region that has synteny with Hsa 8, but we found no convincing synteny between the regions that A2, AgRP and ASIP reside in, which would support that the Agouti-like peptides were formed by whole genome tetraplodization events. Here we suggest that the Agouti-like peptide genes were formed through classical subsequent gene duplications where the AgRP is the most distantly related to the three other members of that group, first splitting from a common ancestor to ASIP and A2, and then later the A2 split from ASIP followed by a split resulting in ASIP2 and AgRP2.
Collapse
Affiliation(s)
- Åke Västermark
- Department of Neuroscience, Uppsala University, Uppsala, Sweden.
| | | | | | | | | | | |
Collapse
|
11
|
The imminent role of protein engineering in synthetic biology. Biotechnol Adv 2011; 30:541-9. [PMID: 21963685 DOI: 10.1016/j.biotechadv.2011.09.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Revised: 09/13/2011] [Accepted: 09/13/2011] [Indexed: 01/08/2023]
Abstract
Protein engineering has for decades been a powerful tool in biotechnology for generating vast numbers of useful enzymes for industrial applications. Today, protein engineering has a crucial role in advancing the emerging field of synthetic biology, where metabolic engineering efforts alone are insufficient to maximize the full potential of synthetic biology. This article reviews the advancements in protein engineering techniques for improving biocatalytic properties to optimize engineered pathways in host systems, which are instrumental to achieve high titer production of target molecules. We also discuss the specific means by which protein engineering has improved metabolic engineering efforts and provide our assessment on its potential to continue to advance biology engineering as a whole.
Collapse
|
12
|
Deane CM, Saunders R. The imprint of codons on protein structure. Biotechnol J 2011; 6:641-9. [DOI: 10.1002/biot.201000329] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Revised: 03/10/2011] [Accepted: 03/23/2011] [Indexed: 12/23/2022]
|
13
|
Saunders R, Mann M, Deane CM. Signatures of co-translational folding. Biotechnol J 2011; 6:742-51. [DOI: 10.1002/biot.201000330] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2010] [Revised: 03/01/2011] [Accepted: 03/03/2011] [Indexed: 12/11/2022]
|
14
|
Zhang H, Zhang T, Chen K, Kedarisetti KD, Mizianty MJ, Bao Q, Stach W, Kurgan L. Critical assessment of high-throughput standalone methods for secondary structure prediction. Brief Bioinform 2011; 12:672-88. [PMID: 21252072 DOI: 10.1093/bib/bbq088] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Sequence-based prediction of protein secondary structure (SS) enjoys wide-spread and increasing use for the analysis and prediction of numerous structural and functional characteristics of proteins. The lack of a recent comprehensive and large-scale comparison of the numerous prediction methods results in an often arbitrary selection of a SS predictor. To address this void, we compare and analyze 12 popular, standalone and high-throughput predictors on a large set of 1975 proteins to provide in-depth, novel and practical insights. We show that there is no universally best predictor and thus detailed comparative studies are needed to support informed selection of SS predictors for a given application. Our study shows that the three-state accuracy (Q3) and segment overlap (SOV3) of the SS prediction currently reach 82% and 81%, respectively. We demonstrate that carefully designed consensus-based predictors improve the Q3 by additional 2% and that homology modeling-based methods are significantly better by 1.5% Q3 than ab initio approaches. Our empirical analysis reveals that solvent exposed and flexible coils are predicted with a higher quality than the buried and rigid coils, while inverse is true for the strands and helices. We also show that longer helices are easier to predict, which is in contrast to longer strands that are harder to find. The current methods confuse 1-6% of strand residues with helical residues and vice versa and they perform poorly for residues in the β- bridge and 3(10)-helix conformations. Finally, we compare predictions of the standalone implementations of four well-performing methods with their corresponding web servers.
Collapse
Affiliation(s)
- Hua Zhang
- Zhejiang Gongshang University, Hangzhou, Zhejiang, P.R. China
| | | | | | | | | | | | | | | |
Collapse
|