Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kuznetsov IB, McDuffie M. FlexPred: a web-server for predicting residue positions involved in conformational switches in proteins. Bioinformation 2008;3:134-6. [PMID: 19238251 PMCID: PMC2639688 DOI: 10.6026/97320630003134] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2008] [Accepted: 11/01/2008] [Indexed: 11/23/2022] Open

For:	Kuznetsov IB, McDuffie M. FlexPred: a web-server for predicting residue positions involved in conformational switches in proteins. Bioinformation 2008;3:134-6. [PMID: 19238251 PMCID: PMC2639688 DOI: 10.6026/97320630003134] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2008] [Accepted: 11/01/2008] [Indexed: 11/23/2022] Open

Number

Cited by Other Article(s)

Sivadas A, Rathore S, Sahana S, Jolly B, Bhoyar RC, Jain A, Sharma D, Imran M, Senthilvel V, Divakar MK, Mishra A, Sivasubbu S, Scaria V. The genomic landscape of CYP2D6 variation in the Indian population. Pharmacogenomics 2024;25:147-160. [PMID: 38426301 DOI: 10.2217/pgs-2023-0233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024] Open

Affiliation(s)

Ambily Sivadas Division of Nutrition, St. John's Research Institute, St. John's National Academy of Health Sciences, Bangalore, Karnataka, 560034, India
Surabhi Rathore CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
S Sahana CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Bani Jolly CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Rahul C Bhoyar CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India
Abhinav Jain CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Disha Sharma CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India
Mohamed Imran CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Vigneshwar Senthilvel CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Mohit Kumar Divakar CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Anushree Mishra CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
Sridhar Sivasubbu CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India Vishwanath Cancer Care Foundation, B 702, 7th Floor, Neelkanth Business Park Kirol Village, Vidya Vihar, West Mumbai, 400086, India
Vinod Scaria CSIR Institute of Genomics & Integrative Biology, Mathura Road, New Delhi, 110025, India Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India Vishwanath Cancer Care Foundation, B 702, 7th Floor, Neelkanth Business Park Kirol Village, Vidya Vihar, West Mumbai, 400086, India

Collapse

Saih A, Bouqdayr M, Baba H, Hamdi S, Moussamih S, Bennani H, Saile R, Wakrim L, Kettani A. Computational Analysis of Missense Variants in the Human Transmembrane Protease Serine 2 (TMPRSS2) and SARS-CoV-2. BIOMED RESEARCH INTERNATIONAL 2021;2021:9982729. [PMID: 34692848 PMCID: PMC8531787 DOI: 10.1155/2021/9982729] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 07/06/2021] [Accepted: 09/11/2021] [Indexed: 01/08/2023]

Pushpavanam K, Hellner B, Baneyx F. Interrogating biomineralization one amino acid at a time: amplification of mutational effects in protein-aided titania morphogenesis through reaction-diffusion control. Chem Commun (Camb) 2021;57:4803-4806. [PMID: 33982711 DOI: 10.1039/d1cc01521d] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Characterization of SdGA, a cold-adapted glucoamylase from Saccharophagus degradans. ACTA ACUST UNITED AC 2021;30:e00625. [PMID: 34041001 PMCID: PMC8141877 DOI: 10.1016/j.btre.2021.e00625] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 04/24/2021] [Accepted: 04/28/2021] [Indexed: 11/24/2022]

Veevers R, Cawley G, Hayward S. Investigation of sequence features of hinge-bending regions in proteins with domain movements using kernel logistic regression. BMC Bioinformatics 2020;21:137. [PMID: 32272894 PMCID: PMC7147021 DOI: 10.1186/s12859-020-3464-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 03/20/2020] [Indexed: 11/12/2022] Open

Abstract

Background

Hinge-bending movements in proteins comprising two or more domains form a large class of functional movements. Hinge-bending regions demarcate protein domains and collectively control the domain movement. Consequently, the ability to recognise sequence features of hinge-bending regions and to be able to predict them from sequence alone would benefit various areas of protein research. For example, an understanding of how the sequence features of these regions relate to dynamic properties in multi-domain proteins would aid in the rational design of linkers in therapeutic fusion proteins.

Results

The DynDom database of protein domain movements comprises sequences annotated to indicate whether the amino acid residue is located within a hinge-bending region or within an intradomain region. Using statistical methods and Kernel Logistic Regression (KLR) models, this data was used to determine sequence features that favour or disfavour hinge-bending regions. This is a difficult classification problem as the number of negative cases (intradomain residues) is much larger than the number of positive cases (hinge residues). The statistical methods and the KLR models both show that cysteine has the lowest propensity for hinge-bending regions and proline has the highest, even though it is the most rigid amino acid. As hinge-bending regions have been previously shown to occur frequently at the terminal regions of the secondary structures, the propensity for proline at these regions is likely due to its tendency to break secondary structures. The KLR models also indicate that isoleucine may act as a domain-capping residue. We have found that a quadratic KLR model outperforms a linear KLR model and that improvement in performance occurs up to very long window lengths (eighty residues) indicating long-range correlations.

Conclusion

In contrast to the only other approach that focused solely on interdomain hinge-bending regions, the method provides a modest and statistically significant improvement over a random classifier. An explanation of the KLR results is that in the prediction of hinge-bending regions a long-range correlation is at play between a small number amino acids that either favour or disfavour hinge-bending regions. The resulting sequence-based prediction tool, HingeSeek, is available to run through a webserver at hingeseek.cmp.uea.ac.uk.

Collapse

Gyulkhandanyan A, Rezaie AR, Roumenina L, Lagarde N, Fremeaux-Bacchi V, Miteva MA, Villoutreix BO. Analysis of protein missense alterations by combining sequence- and structure-based methods. Mol Genet Genomic Med 2020;8:e1166. [PMID: 32096919 PMCID: PMC7196459 DOI: 10.1002/mgg3.1166] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 12/11/2022] Open

Narwani TJ, Etchebest C, Craveur P, Léonard S, Rebehmed J, Srinivasan N, Bornot A, Gelly JC, de Brevern AG. In silico prediction of protein flexibility with local structure approach. Biochimie 2019;165:150-155. [PMID: 31377194 DOI: 10.1016/j.biochi.2019.07.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 07/26/2019] [Indexed: 12/30/2022]

Affiliation(s)

Tarun J Narwani INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France
Catherine Etchebest INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France
Pierrick Craveur INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France; Molecular Graphics Laboratory, Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
Sylvain Léonard INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France
Joseph Rebehmed INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France; Department of Computer Science and Mathematics, Lebanese American University, Byblos 1h401 2010, Lebanon
Narayanaswamy Srinivasan MBU, IISc, Bangalore, India
Aurélie Bornot INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France
Jean-Christophe Gelly INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France
Alexandre G de Brevern INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France; Molecular Graphics Laboratory, Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA.

Collapse

Wu X, Fraser K, Zha J, Dordick JS. Flexible Peptide Linkers Enhance the Antimicrobial Activity of Surface-Immobilized Bacteriolytic Enzymes. ACS APPLIED MATERIALS & INTERFACES 2018;10:36746-36756. [PMID: 30281274 DOI: 10.1021/acsami.8b14411] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Abstract

Chemical linkers are frequently used in enzyme immobilization to improve enzyme flexibility and activity, whereas peptide linkers, although ubiquitous in protein engineering, are much less explored in enzyme immobilization. Here, we report peptide-linker-assisted noncovalent immobilization of the bacteriolytic enzyme lysostaphin (Lst) to generate anti- Staphylococcus aureus surfaces. Lst was immobilized through affinity tags onto a silica surface (glass slides) and nickel nitrilotriacetic acid (NiNTA) agarose beads via silica-binding peptides (SiBPs) or a hexahistidine tag (His-tag) fused at the C-terminus of Lst, respectively. By inserting specific peptide linkers upstream of the SiBP or His-tag, the immobilized enzymes killed >99.5% of S. aureus ATCC 6538 cells (10⁸ CFU/mL) within 3 h in buffer and could be reused multiple times without significant loss of activity. In contrast, immobilized Lst without a peptide linker was less active/stable. Molecular modeling of Lst-linker-affinity tag constructs illustrated that the presence of the peptide linkers enhanced the molecular flexibility of the proximal Lst binding domain, which interacts with the bacterial substrate, and such increased flexibility correlated with increased antimicrobial activity. We further show that Lst immobilized onto NiNTA beads retained the ability to kill ∼99% of a 10⁸ CFU/mL microbial challenge even in the presence of 1% of a commercial anionic surfactant, C12-14 alcohol EO 3:1 sodium sulfate, when the Lst construct contained a decapeptide linker containing glycine, serine, and alanine residues. This linker-assisted immobilization strategy could be extended to an unrelated lytic enzyme, the endolysin PlyPH, to target Bacillus anthracis Sterne cells either in buffer or in the presence of anionic surfactants. Our approach, therefore, provides a facile route to the use of antimicrobial enzymes on surfaces.

Collapse

Khalid Z, Sezerman OU. Prediction of HIV Drug Resistance by Combining Sequence and Structural Properties. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:966-973. [PMID: 27992346 DOI: 10.1109/tcbb.2016.2638821] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Meng F, Kurgan L. DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences. Bioinformatics 2017;32:i341-i350. [PMID: 27307636 PMCID: PMC4908364 DOI: 10.1093/bioinformatics/btw280] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Abstract

Motivation: Disordered flexible linkers (DFLs) are disordered regions that serve as flexible linkers/spacers in multi-domain proteins or between structured constituents in domains. They are different from flexible linkers/residues because they are disordered and longer. Availability of experimentally annotated DFLs provides an opportunity to build high-throughput computational predictors of these regions from protein sequences. To date, there are no computational methods that directly predict DFLs and they can be found only indirectly by filtering predicted flexible residues with predictions of disorder.

Results: We conceptualized, developed and empirically assessed a first-of-its-kind sequence-based predictor of DFLs, DFLpred. This method outputs propensity to form DFLs for each residue in the input sequence. DFLpred uses a small set of empirically selected features that quantify propensities to form certain secondary structures, disordered regions and structured regions, which are processed by a fast linear model. Our high-throughput predictor can be used on the whole-proteome scale; it needs <1 h to predict entire proteome on a single CPU. When assessed on an independent test dataset with low sequence-identity proteins, it secures area under the receiver operating characteristic curve equal 0.715 and outperforms existing alternatives that include methods for the prediction of flexible linkers, flexible residues, intrinsically disordered residues and various combinations of these methods. Prediction on the complete human proteome reveals that about 10% of proteins have a large content of over 30% DFL residues. We also estimate that about 6000 DFL regions are long with ≥30 consecutive residues.

Availability and implementation:http://biomine.ece.ualberta.ca/DFLpred/.

Contact:lkurgan@vcu.edu

Supplementary information:Supplementary data are available at Bioinformatics online.

Collapse

Yi G, Ybe JA, Saha SS, Caviness G, Raymond E, Ganesan R, Mbow ML, Kao CC. Structural and Functional Attributes of the Interleukin-36 Receptor. J Biol Chem 2016;291:16597-609. [PMID: 27307043 DOI: 10.1074/jbc.m116.723064] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Indexed: 12/22/2022] Open

Yenenler A, Sezerman OU. Design and characterizations of two novel cellulases through single-gene shuffling of Cel12A (EG3) gene fromTrichoderma reseei. Protein Eng Des Sel 2016;29:219-229. [DOI: 10.1093/protein/gzw011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Accepted: 03/24/2016] [Indexed: 11/14/2022] Open

Palmitoylation controls DLK localization, interactions and activity to ensure effective axonal injury signaling. Proc Natl Acad Sci U S A 2015;113:763-8. [PMID: 26719418 DOI: 10.1073/pnas.1514123113] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Prediction of neddylation sites from protein sequences and sequence-derived properties. BMC Bioinformatics 2015;16 Suppl 18:S9. [PMID: 26679222 PMCID: PMC4682398 DOI: 10.1186/1471-2105-16-s18-s9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Craveur P, Joseph AP, Esque J, Narwani TJ, Noël F, Shinada N, Goguet M, Leonard S, Poulain P, Bertrand O, Faure G, Rebehmed J, Ghozlane A, Swapna LS, Bhaskara RM, Barnoud J, Téletchéa S, Jallu V, Cerny J, Schneider B, Etchebest C, Srinivasan N, Gelly JC, de Brevern AG. Protein flexibility in the light of structural alphabets. Front Mol Biosci 2015;2:20. [PMID: 26075209 PMCID: PMC4445325 DOI: 10.3389/fmolb.2015.00020] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2015] [Accepted: 04/30/2015] [Indexed: 01/01/2023] Open

Affiliation(s)

Pierrick Craveur Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Agnel P Joseph Rutherford Appleton Laboratory, Science and Technology Facilities Council Didcot, UK
Jeremy Esque Institut National de la Santé et de la Recherche Médicale U964,7 UMR Centre National de la Recherche Scientifique 7104, IGBMC, Université de Strasbourg Illkirch, France
Tarun J Narwani Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Floriane Noël Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Nicolas Shinada Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Matthieu Goguet Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Sylvain Leonard Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Pierre Poulain Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Ets Poulain Pointe-Noire, Congo
Olivier Bertrand Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Guilhem Faure National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health Bethesda, MD, USA
Joseph Rebehmed Centre National de la Recherche Scientifique UMR7590, Sorbonne Universités, Université Pierre et Marie Curie - MNHN - IRD - IUC Paris, France
Amine Ghozlane Metagenopolis, INRA Jouy-en-Josas, France
Lakshmipuram S Swapna Molecular Biophysics Unit, Indian Institute of Science, Bangalore Bangalore, India ; Hospital for Sick Children, and Departments of Biochemistry and Molecular Genetics, University of Toronto Toronto, ON, Canada
Ramachandra M Bhaskara Molecular Biophysics Unit, Indian Institute of Science, Bangalore Bangalore, India ; Department of Theoretical Biophysics, Max Planck Institute of Biophysics Frankfurt, Germany
Jonathan Barnoud Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Laboratoire de Physique, École Normale Supérieure de Lyon, Université de Lyon, Centre National de la Recherche Scientifique UMR 5672 Lyon, France
Stéphane Téletchéa Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Faculté des Sciences et Techniques, Université de Nantes, Unité Fonctionnalité et Ingénierie des Protéines, Centre National de la Recherche Scientifique UMR 6286, Université Nantes Nantes, France
Vincent Jallu Platelet Unit, Institut National de la Transfusion Sanguine Paris, France
Jiri Cerny Institute of Biotechnology, The Czech Academy of Sciences Prague, Czech Republic
Bohdan Schneider Institute of Biotechnology, The Czech Academy of Sciences Prague, Czech Republic
Catherine Etchebest Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Narayanaswamy Srinivasan Molecular Biophysics Unit, Indian Institute of Science, Bangalore Bangalore, India
Jean-Christophe Gelly Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
Alexandre G de Brevern Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France

Collapse

Yavuz AS, Sezerman OU. Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder. BMC Genomics 2014;15 Suppl 9:S18. [PMID: 25521314 PMCID: PMC4290605 DOI: 10.1186/1471-2164-15-s9-s18] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Sumoylation, which is a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Abberations in protein sumoylation has been linked with a variety of disorders and developmental anomalies. Experimental approaches to identification of sumoylation sites may not be effective due to the dynamic nature of sumoylation, laborsome experiments and their cost. Therefore, computational approaches may guide experimental identification of sumoylation sites and provide insights for further understanding sumoylation mechanism.

RESULTS

In this paper, the effectiveness of using various sequence properties in predicting sumoylation sites was investigated with statistical analyses and machine learning approach employing support vector machines. These sequence properties were derived from windows of size 7 including position-specific amino acid composition, hydrophobicity, estimated sub-window volumes, predicted disorder, and conformational flexibility. 5-fold cross-validation results on experimentally identified sumoylation sites revealed that our method successfully predicts sumoylation sites with a Matthew's correlation coefficient, sensitivity, specificity, and accuracy equal to 0.66, 73%, 98%, and 97%, respectively. Additionally, we have showed that our method compares favorably to the existing prediction methods and basic regular expressions scanner.

CONCLUSIONS

By using support vector machines, a new, robust method for sumoylation site prediction was introduced. Besides, the possible effects of predicted conformational flexibility and disorder on sumoylation site recognition were explored computationally for the first time to our knowledge as an additional parameter that could aid in sumoylation site prediction.

Collapse

Preeprem T, Gibson G. SDS, a structural disruption score for assessment of missense variant deleteriousness. Front Genet 2014;5:82. [PMID: 24795746 PMCID: PMC4001065 DOI: 10.3389/fgene.2014.00082] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2014] [Accepted: 03/26/2014] [Indexed: 11/17/2022] Open

Abstract

We have developed a novel structure-based evaluation for missense variants that explicitly models protein structure and amino acid properties to predict the likelihood that a variant disrupts protein function. A structural disruption score (SDS) is introduced as a measure to depict the likelihood that a case variant is functional. The score is constructed using characteristics that distinguish between causal and neutral variants within a group of proteins. The SDS score is correlated with standard sequence-based deleteriousness, but shows promise for improving discrimination between neutral and causal variants at less conserved sites. The prediction was performed on 3-dimentional structures of 57 gene products whose homozygous SNPs were identified as case-exclusive variants in an exome sequencing study of epilepsy disorders. We contrasted the candidate epilepsy variants with scores for likely benign variants found in the EVS database, and for positive control variants in the same genes that are suspected to promote a range of diseases. To derive a characteristic profile of damaging SNPs, we transformed continuous scores into categorical variables based on the score distribution of each measurement, collected from all possible SNPs in this protein set, where extreme measures were assumed to be deleterious. A second epilepsy dataset was used to replicate the findings. Causal variants tend to receive higher sequence-based deleterious scores, induce larger physico-chemical changes between amino acid pairs, locate in protein domains, buried sites or on conserved protein surface clusters, and cause protein destabilization, relative to negative controls. These measures were agglomerated for each variant. A list of nine high-priority putative functional variants for epilepsy was generated. Our newly developed SDS protocol facilitates SNP prioritization for experimental validation.

Collapse

Preeprem T, Gibson G. An association-adjusted consensus deleterious scheme to classify homozygous Mis-sense mutations for personal genome interpretation. BioData Min 2013;6:24. [PMID: 24365473 PMCID: PMC3892026 DOI: 10.1186/1756-0381-6-24] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2013] [Accepted: 12/17/2013] [Indexed: 11/22/2022] Open

Abstract

BACKGROUND

Personal genome analysis is now being considered for evaluation of disease risk in healthy individuals, utilizing both rare and common variants. Multiple scores have been developed to predict the deleteriousness of amino acid substitutions, using information on the allele frequencies, level of evolutionary conservation, and averaged structural evidence. However, agreement among these scores is limited and they likely over-estimate the fraction of the genome that is deleterious.

METHOD

This study proposes an integrative approach to identify a subset of homozygous non-synonymous single nucleotide polymorphisms (nsSNPs). An 8-level classification scheme is constructed from the presence/absence of deleterious predictions combined with evidence of association with disease or complex traits. Detailed literature searches and structural validations are then performed for a subset of homozygous 826 mis-sense mutations in 575 proteins found in the genomes of 12 healthy adults.

RESULTS

Implementation of the Association-Adjusted Consensus Deleterious Scheme (AACDS) classifies 11% of all predicted highly deleterious homozygous variants as most likely to influence disease risk. The number of such variants per genome ranges from 0 to 8 with no significant difference between African and Caucasian Americans. Detailed analysis of mutations affecting the APOE, MTMR2, THSB1, CHIA, αMyHC, and AMY2A proteins shows how the protein structure is likely to be disrupted, even though the associated phenotypes have not been documented in the corresponding individuals.

CONCLUSIONS

The classification system for homozygous nsSNPs provides an opportunity to systematically rank nsSNPs based on suggestive evidence from annotations and sequence-based predictions. The ranking scheme, in-depth literature searches, and structural validations of highly prioritized mis-sense mutations compliment traditional sequence-based approaches and should have particular utility for the development of individualized health profiles. An online tool reporting the AACDS score for any variant is provided at the authors' website.

Collapse

Sai Ramesh A, Sethumadhavan R, Thiagarajan P. Structure–Function Studies on Non-synonymous SNPs of Chemokine Receptor Gene Implicated in Cardiovascular Disease: A Computational Approach. Protein J 2013;32:657-65. [DOI: 10.1007/s10930-013-9529-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Yu H, Huang H. Engineering proteins for thermostability through rigidifying flexible sites. Biotechnol Adv 2013;32:308-15. [PMID: 24211474 DOI: 10.1016/j.biotechadv.2013.10.012] [Citation(s) in RCA: 163] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Revised: 09/04/2013] [Accepted: 10/29/2013] [Indexed: 01/06/2023]

Verma R, Schwaneberg U, Roccatano D. Computer-Aided Protein Directed Evolution: a Review of Web Servers, Databases and other Computational Tools for Protein Engineering. Comput Struct Biotechnol J 2012;2:e201209008. [PMID: 24688649 PMCID: PMC3962222 DOI: 10.5936/csbj.201209008] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Revised: 10/07/2012] [Accepted: 10/12/2012] [Indexed: 12/01/2022] Open

de Brevern AG, Bornot A, Craveur P, Etchebest C, Gelly JC. PredyFlexy: flexibility and local structure prediction from sequence. Nucleic Acids Res 2012;40:W317-22. [PMID: 22689641 PMCID: PMC3394303 DOI: 10.1093/nar/gks482] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

Hirose S, Yokota K, Kuroda Y, Wako H, Endo S, Kanai S, Noguchi T. Prediction of protein motions from amino acid sequence and its application to protein-protein interaction. BMC STRUCTURAL BIOLOGY 2010;10:20. [PMID: 20626880 PMCID: PMC3245509 DOI: 10.1186/1472-6807-10-20] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2009] [Accepted: 07/13/2010] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Structural flexibility is an important characteristic of proteins because it is often associated with their function. The movement of a polypeptide segment in a protein can be broken down into two types of motions: internal and external ones. The former is deformation of the segment itself, but the latter involves only rotational and translational motions as a rigid body. Normal Model Analysis (NMA) can derive these two motions, but its application remains limited because it necessitates the gathering of complete structural information.

RESULTS

In this work, we present a novel method for predicting two kinds of protein motions in ordered structures. The prediction uses only information from the amino acid sequence. We prepared a dataset of the internal and external motions of segments in many proteins by application of NMA. Subsequently, we analyzed the relation between thermal motion assessed from X-ray crystallographic B-factor and internal/external motions calculated by NMA. Results show that attributes of amino acids related to the internal motion have different features from those related to the B-factors, although those related to the external motion are correlated strongly with the B-factors. Next, we developed a method to predict internal and external motions from amino acid sequences based on the Random Forest algorithm. The proposed method uses information associated with adjacent amino acid residues and secondary structures predicted from the amino acid sequence. The proposed method exhibited moderate correlation between predicted internal and external motions with those calculated by NMA. It has the highest prediction accuracy compared to a naïve model and three published predictors.

CONCLUSIONS

Finally, we applied the proposed method predicting the internal motion to a set of 20 proteins that undergo large conformational change upon protein-protein interaction. Results show significant overlaps between the predicted high internal motion regions and the observed conformational change regions.

Collapse

Liu YC, Yang MH, Lin WL, Huang CK, Oyang YJ. A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins. BMC Genomics 2009;10 Suppl 3:S22. [PMID: 19958486 PMCID: PMC2788375 DOI: 10.1186/1471-2164-10-s3-s22] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Abstract

Background

Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. One category of predictors attempts to identify conformationally flexible regions through analysis of protein tertiary structures. Another category of predictors works completely based on analysis of the polypeptide sequences. As the availability of protein tertiary structures is generally limited, the design of predictors that work completely based on sequence information is crucial for advances of molecular biology research.

Results

In this article, we propose a novel approach to design a sequence-based predictor for identifying conformationally ambivalent regions in proteins. The novelty in the design stems from incorporating two classifiers based on two distinctive supervised learning algorithms that provide complementary prediction powers. Experimental results show that the overall performance delivered by the hybrid predictor proposed in this article is superior to the performance delivered by the existing predictors. Furthermore, the case study presented in this article demonstrates that the proposed hybrid predictor is capable of providing the biologists with valuable clues about the functional sites in a protein chain. The proposed hybrid predictor provides the users with two optional modes, namely, the high-sensitivity mode and the high-specificity mode. The experimental results with an independent testing data set show that the proposed hybrid predictor is capable of delivering sensitivity of 0.710 and specificity of 0.608 under the high-sensitivity mode, while delivering sensitivity of 0.451 and specificity of 0.787 under the high-specificity mode.

Conclusion

Though experimental results show that the hybrid approach designed to exploit the complementary prediction powers of distinctive supervised learning algorithms works more effectively than conventional approaches, there exists a large room for further improvement with respect to the achieved performance. In this respect, it is of interest to investigate the effects of exploiting additional physiochemical properties that are related to conformational ambivalence. Furthermore, it is of interest to investigate the effects of incorporating lately-developed machine learning approaches, e.g. the random forest design and the multi-stage design. As conformational transition plays a key role in carrying out several essential types of biological functions, the design of more advanced predictors for identifying conformationally ambivalent regions in proteins deserves our continuous attention.

Collapse