1
|
Liang F, Sun M, Xie L, Zhao X, Liu D, Zhao K, Zhang G. Recent advances and challenges in protein complex model accuracy estimation. Comput Struct Biotechnol J 2024; 23:1824-1832. [PMID: 38707538 PMCID: PMC11066466 DOI: 10.1016/j.csbj.2024.04.049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/18/2024] [Accepted: 04/18/2024] [Indexed: 05/07/2024] Open
Abstract
Estimation of model accuracy plays a crucial role in protein structure prediction, aiming to evaluate the quality of predicted protein structure models accurately and objectively. This process is not only key to screening candidate models that are close to the real structure, but also provides guidance for further optimization of protein structures. With the significant advancements made by AlphaFold2 in monomer structure, the problem of single-domain protein structure prediction has been widely solved. Correspondingly, the importance of assessing the quality of single-domain protein models decreased, and the research focus has shifted to estimation of model accuracy of protein complexes. In this review, our goal is to provide a comprehensive overview of the reference and statistical metrics, as well as representative methods, and the current challenges within four distinct facets (Topology Global Score, Interface Total Score, Interface Residue-Wise Score, and Tertiary Residue-Wise Score) in the field of complex EMA.
Collapse
Affiliation(s)
| | | | - Lei Xie
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Xuanfeng Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Dong Liu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Kailong Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
2
|
Waterhouse AM, Studer G, Robin X, Bienert S, Tauriello G, Schwede T. The structure assessment web server: for proteins, complexes and more. Nucleic Acids Res 2024; 52:W318-W323. [PMID: 38634802 PMCID: PMC11223858 DOI: 10.1093/nar/gkae270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/21/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024] Open
Abstract
The 'structure assessment' web server is a one-stop shop for interactive evaluation and benchmarking of structural models of macromolecular complexes including proteins and nucleic acids. A user-friendly web dashboard links sequence with structure information and results from a variety of state-of-the-art tools, which facilitates the visual exploration and evaluation of structure models. The dashboard integrates stereochemistry information, secondary structure information, global and local model quality assessment of the tertiary structure of comparative protein models, as well as prediction of membrane location. In addition, a benchmarking mode is available where a model can be compared to a reference structure, providing easy access to scores that have been used in recent CASP experiments and CAMEO. The structure assessment web server is available at https://swissmodel.expasy.org/assess.
Collapse
Affiliation(s)
- Andrew M Waterhouse
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Xavier Robin
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| |
Collapse
|
3
|
Harihar B, Saravanan KM, Gromiha MM, Selvaraj S. Importance of Inter-residue Contacts for Understanding Protein Folding and Unfolding Rates, Remote Homology, and Drug Design. Mol Biotechnol 2024:10.1007/s12033-024-01119-4. [PMID: 38498284 DOI: 10.1007/s12033-024-01119-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 02/10/2024] [Indexed: 03/20/2024]
Abstract
Inter-residue interactions in protein structures provide valuable insights into protein folding and stability. Understanding these interactions can be helpful in many crucial applications, including rational design of therapeutic small molecules and biologics, locating functional protein sites, and predicting protein-protein and protein-ligand interactions. The process of developing machine learning models incorporating inter-residue interactions has been improved recently. This review highlights the theoretical models incorporating inter-residue interactions in predicting folding and unfolding rates of proteins. Utilizing contact maps to depict inter-residue interactions aids researchers in developing computer models for detecting remote homologs and interface residues within protein-protein complexes which, in turn, enhances our knowledge of the relationship between sequence and structure of proteins. Further, the application of contact maps derived from inter-residue interactions is highlighted in the field of drug discovery. Overall, this review presents an extensive assessment of the significant models that use inter-residue interactions to investigate folding rates, unfolding rates, remote homology, and drug development, providing potential future advancements in constructing efficient computational models in structural biology.
Collapse
Affiliation(s)
- Balasubramanian Harihar
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Konda Mani Saravanan
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, 600073, India
| | - Michael M Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Samuel Selvaraj
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India.
| |
Collapse
|
4
|
Nguyen KQT, Nguyen HH, Phung HTT, Chung KL, Vu TY. A close-up shot of protein-protein docking, from experiment to theory and reverse with the PROTAC performers. J Biomol Struct Dyn 2024:1-8. [PMID: 38284361 DOI: 10.1080/07391102.2024.2308778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 01/14/2024] [Indexed: 01/30/2024]
Abstract
PROTACs (Proteolysis Targeting Chimeras), heterobifunctional molecules, exhibit selectivity in degrading target proteins through E3 ubiquitin ligases. Designing effective PROTACs requires a deep understanding of the intricate binding interactions in the ternary complex (POI/PROTAC/E3 ligase), crucial for efficient target protein degradation. To address this challenge, we introduce a novel computational virtual screening method that considers essential amino acid interactions between the protein of interest and the chosen E3 ligase. This approach enhances accuracy and reliability, facilitating the strategic development of potent PROTACs. Utilizing a crystallized model of the VHL:PROTAC:SMARCA2BD ternary complex (PDB: 7Z6L), we assessed the effectiveness of our method. Our study reveals that increasing the number of essential restraints between the two proteins reduces the generated docking poses, leading to closer alignment with the experimental ternary complex. Specifically, utilizing three restraints showed the closest resemblance to the published complex, highlighting crucial interactions such as an H-bond between A:Gln 89 and B:Asn 67, along with two hydrophobic interactions: A:Gly 22 with B:Arg 69 and A:Glu 37 with B:Pro 99. This resulted in a significant decrease in the mean RMSD value from 31.8 and 31.0 Å to 24.4 Å, respectively. This underscores the importance of incorporating multiple essential restraints to enhance docking accuracy. Building on this progress, we introduce a systematic approach to design potential PROTACs between the Estrogen receptor and the E3 ligase, utilizing bridging intermediates with 4, 6, or 7 carbon atoms. By providing a more accurate and efficient means of identifying optimal PROTAC candidates, this approach has the potential to accelerate the development of targeted therapies and reduce the time and costs associated with drug discovery.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
| | - Hieu Hien Nguyen
- Faculty of Pharmacy, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Huong Thi Thu Phung
- NTT Hi-Tech Institute, Nguyen Tat Thanh University, Ho Chi Minh City, Vietnam
| | - Khanh Linh Chung
- Faculty of Pharmacy, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Thien Y Vu
- Faculty of Pharmacy, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| |
Collapse
|
5
|
Kuder KJ. Docking Foundations: From Rigid to Flexible Docking. Methods Mol Biol 2024; 2780:3-14. [PMID: 38987460 DOI: 10.1007/978-1-0716-3985-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Despite the development of methods for the experimental determination of protein structures, the dissonance between the number of known sequences and their solved structures is still enormous. This is particularly evident in protein-protein complexes. To fill this gap, diverse technologies have been developed to study protein-protein interactions (PPIs) in a cellular context including a range of biological and computational methods. The latter derive from techniques originally published and applied almost half a century ago and are based on interdisciplinary knowledge from the nexus of the fields of biology, chemistry, and physics about protein sequences, structures, and their folding. Protein-protein docking, the main protagonist of this chapter, is routinely treated as an integral part of protein research. Herein, we describe the basic foundations of the whole process in general terms, but step by step from protein representations through docking methods and evaluation of complexes to their final validation.
Collapse
Affiliation(s)
- Kamil J Kuder
- Department of Technology and Biotechnology of Drugs, Faculty of Pharmacy, Jagiellonian University Medical College, Kraków, Poland.
| |
Collapse
|
6
|
Singh A, Copeland MM, Kundrotas PJ, Vakser IA. GRAMM Web Server for Protein Docking. Methods Mol Biol 2024; 2714:101-112. [PMID: 37676594 DOI: 10.1007/978-1-0716-3441-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Prediction of the structure of protein complexes by docking methods is a well-established research field. The intermolecular energy landscapes in protein-protein interactions can be used to refine docking predictions and to detect macro-characteristics, such as the binding funnel. A new GRAMM web server for protein docking predicts a spectrum of docking poses that characterize the intermolecular energy landscape in protein interaction. A user-friendly interface provides options to choose free or template-based docking, as well as other advanced features, such as clustering of the docking poses, and interactive visualization of the docked models.
Collapse
Affiliation(s)
- Amar Singh
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA
| | - Matthew M Copeland
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA
| | - Petras J Kundrotas
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA.
| | - Ilya A Vakser
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA.
| |
Collapse
|
7
|
Kiani YS, Jabeen I. Challenges of Protein-Protein Docking of the Membrane Proteins. Methods Mol Biol 2024; 2780:203-255. [PMID: 38987471 DOI: 10.1007/978-1-0716-3985-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Despite the recent advances in the determination of high-resolution membrane protein (MP) structures, the structural and functional characterization of MPs remains extremely challenging, mainly due to the hydrophobic nature, low abundance, poor expression, purification, and crystallization difficulties associated with MPs. Whereby the major challenges/hurdles for MP structure determination are associated with the expression, purification, and crystallization procedures. Although there have been significant advances in the experimental determination of MP structures, only a limited number of MP structures (approximately less than 1% of all) are available in the Protein Data Bank (PDB). Therefore, the structures of a large number of MPs still remain unresolved, which leads to the availability of widely unplumbed structural and functional information related to MPs. As a result, recent developments in the drug discovery realm and the significant biological contemplation have led to the development of several novel, low-cost, and time-efficient computational methods that overcome the limitations of experimental approaches, supplement experiments, and provide alternatives for the characterization of MPs. Whereby the fine tuning and optimizations of these computational approaches remains an ongoing endeavor.Computational methods offer a potential way for the elucidation of structural features and the augmentation of currently available MP information. However, the use of computational modeling can be extremely challenging for MPs mainly due to insufficient knowledge of (or gaps in) atomic structures of MPs. Despite the availability of numerous in silico methods for 3D structure determination the applicability of these methods to MPs remains relatively low since all methods are not well-suited or adequate for MPs. However, sophisticated methods for MP structure predictions are constantly being developed and updated to integrate the modifications required for MPs. Currently, different computational methods for (1) MP structure prediction, (2) stability analysis of MPs through molecular dynamics simulations, (3) modeling of MP complexes through docking, (4) prediction of interactions between MPs, and (5) MP interactions with its soluble partner are extensively used. Towards this end, MP docking is widely used. It is notable that the MP docking methods yet few in number might show greater potential in terms of filling the knowledge gap. In this chapter, MP docking methods and associated challenges have been reviewed to improve the applicability, accuracy, and the ability to model macromolecular complexes.
Collapse
Affiliation(s)
- Yusra Sajid Kiani
- School of Interdisciplinary Engineering and Sciences (SINES), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Ishrat Jabeen
- School of Interdisciplinary Engineering and Sciences (SINES), National University of Sciences and Technology (NUST), Islamabad, Pakistan.
| |
Collapse
|
8
|
Baek M, McHugh R, Anishchenko I, Jiang H, Baker D, DiMaio F. Accurate prediction of protein-nucleic acid complexes using RoseTTAFoldNA. Nat Methods 2024; 21:117-121. [PMID: 37996753 PMCID: PMC10776382 DOI: 10.1038/s41592-023-02086-5] [Citation(s) in RCA: 40] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 10/16/2023] [Indexed: 11/25/2023]
Abstract
Protein-RNA and protein-DNA complexes play critical roles in biology. Despite considerable recent advances in protein structure prediction, the prediction of the structures of protein-nucleic acid complexes without homology to known complexes is a largely unsolved problem. Here we extend the RoseTTAFold machine learning protein-structure-prediction approach to additionally predict nucleic acid and protein-nucleic acid complexes. We develop a single trained network, RoseTTAFoldNA, that rapidly produces three-dimensional structure models with confidence estimates for protein-DNA and protein-RNA complexes. Here we show that confident predictions have considerably higher accuracy than current state-of-the-art methods. RoseTTAFoldNA should be broadly useful for modeling the structure of naturally occurring protein-nucleic acid complexes, and for designing sequence-specific RNA and DNA-binding proteins.
Collapse
Affiliation(s)
- Minkyung Baek
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - Ryan McHugh
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hanlun Jiang
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
| |
Collapse
|
9
|
Pereira GP, Jiménez-García B, Pellarin R, Launay G, Wu S, Martin J, Souza PCT. Rational Prediction of PROTAC-Compatible Protein-Protein Interfaces by Molecular Docking. J Chem Inf Model 2023; 63:6823-6833. [PMID: 37877240 DOI: 10.1021/acs.jcim.3c01154] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023]
Abstract
Proteolysis targeting chimeras (PROTACs) are heterobifunctional ligands that mediate the interaction between a protein target and an E3 ligase, resulting in a ternary complex, whose interaction with the ubiquitination machinery leads to target degradation. This technology is emerging as an exciting new avenue for therapeutic development, with several PROTACs currently undergoing clinical trials targeting cancer. Here, we describe a general and computationally efficient methodology combining restraint-based docking, energy-based rescoring, and a filter based on the minimal solvent-accessible surface distance to produce PROTAC-compatible PPIs suitable for when there is no a priori known PROTAC ligand. In a benchmark employing a manually curated data set of 13 ternary complex crystals, we achieved an accuracy of 92% when starting from bound structures and 77% when starting from unbound structures, respectively. Our method only requires that the ligand-bound structures of the monomeric forms of the E3 ligase and target proteins be given to run, making it general, accurate, and highly efficient, with the ability to impact early-stage PROTAC-based drug design campaigns where no structural information about the ternary complex structure is available.
Collapse
Affiliation(s)
- Gilberto P Pereira
- Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, 69007 Lyon, France
- Laboratory of Biology and Modeling of the Cell, École Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5239 and Inserm U1293, 46 Allée d'Italie, 69007 Lyon, France
| | | | - Riccardo Pellarin
- Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, 69007 Lyon, France
- Laboratory of Biology and Modeling of the Cell, École Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5239 and Inserm U1293, 46 Allée d'Italie, 69007 Lyon, France
| | - Guillaume Launay
- Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, 69007 Lyon, France
- Laboratory of Biology and Modeling of the Cell, École Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5239 and Inserm U1293, 46 Allée d'Italie, 69007 Lyon, France
| | - Sangwook Wu
- PharmCADD, Busan 48792, Republic of Korea
- Department of Physics, Pukyong National University, Busan 48513, Republic of Korea
| | - Juliette Martin
- Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, 69007 Lyon, France
- Laboratory of Biology and Modeling of the Cell, École Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5239 and Inserm U1293, 46 Allée d'Italie, 69007 Lyon, France
| | - Paulo C T Souza
- Molecular Microbiology and Structural Biochemistry, CNRS UMR 5086 and Université Claude Bernard Lyon 1, 7 Passage du Vercors, 69007 Lyon, France
- Laboratory of Biology and Modeling of the Cell, École Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5239 and Inserm U1293, 46 Allée d'Italie, 69007 Lyon, France
| |
Collapse
|
10
|
Schweke H, Xu Q, Tauriello G, Pantolini L, Schwede T, Cazals F, Lhéritier A, Fernandez-Recio J, Rodríguez-Lumbreras LÁ, Schueler-Furman O, Varga JK, Jiménez-García B, Réau MF, Bonvin A, Savojardo C, Martelli PL, Casadio R, Tubiana J, Wolfson H, Oliva R, Barradas-Bautista D, Ricciardelli T, Cavallo L, Venclovas Č, Olechnovič K, Guerois R, Andreani J, Martin J, Wang X, Kihara D, Marchand A, Correia B, Zou X, Dey S, Dunbrack R, Levy E, Wodak S. Discriminating physiological from non-physiological interfaces in structures of protein complexes: A community-wide study. Proteomics 2023; 23:e2200323. [PMID: 37365936 PMCID: PMC10937251 DOI: 10.1002/pmic.202200323] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 05/11/2023] [Accepted: 05/11/2023] [Indexed: 06/28/2023]
Abstract
Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Julia K. Varga
- Hebrew University of Jerusalem Institute for Medical Research Israel-Canada
| | | | | | | | | | | | | | - Jérôme Tubiana
- Tel Aviv University Blavatnik School of Computer Science
| | - Haim Wolfson
- Tel Aviv University Blavatnik School of Computer Science
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, Institute for Data Science and Informatics, University of Missouri
| | | | | | | | | |
Collapse
|
11
|
Stainthorp AK, Lin CC, Wang D, Medhi R, Ahmed Z, Suen KM, Miska EA, Whitehouse A, Ladbury JE. Regulation of microRNA expression by the adaptor protein GRB2. Sci Rep 2023; 13:9784. [PMID: 37328606 PMCID: PMC10276003 DOI: 10.1038/s41598-023-36996-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/14/2023] [Indexed: 06/18/2023] Open
Abstract
Protein interactions with the microRNA (miRNA)-mediated gene silencing protein Argonaute 2 (AGO2) control miRNA expression. miRNA biogenesis starts with the production of precursor transcripts and culminates with the loading of mature miRNA onto AGO2 by DICER1. Here we reveal an additional component to the regulatory mechanism for miRNA biogenesis involving the adaptor protein, growth factor receptor-bound protein 2 (GRB2). The N-terminal SH3 domain of GRB2 is recruited to the PAZ domain of AGO2 forming a ternary complex containing GRB2, AGO2 and DICER1. Using small-RNA sequencing we identified two groups of miRNAs which are regulated by the binding of GRB2. First, mature and precursor transcripts of mir-17~92 and mir-221 miRNAs are enhanced. Second, mature, but not precursor, let-7 family miRNAs are diminished suggesting that GRB2 directly affects loading of these miRNAs. Notably, the resulting loss of let-7 augments expression of oncogenic targets such as RAS. Thus, a new role for GRB2 is established with implications for cancer pathogenesis through regulation of miRNA biogenesis and oncogene expression.
Collapse
Affiliation(s)
- Amy K Stainthorp
- School of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
| | - Chi-Chuan Lin
- School of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
| | - Dapeng Wang
- LeedsOmics, University of Leeds, Leeds, LS2 9JT, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Heart and Lung Institute, Imperial College London, London, SW3 6LY, UK
| | - Ragini Medhi
- Wellcome Trust Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK
| | - Zamal Ahmed
- Department of Molecular and Cellular Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Kin Man Suen
- School of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
| | - Eric A Miska
- Wellcome Trust Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK
| | - Adrian Whitehouse
- School of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
| | - John E Ladbury
- School of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK.
| |
Collapse
|
12
|
Shuvo MH, Karim M, Roche R, Bhattacharya D. PIQLE: protein-protein interface quality estimation by deep graph learning of multimeric interaction geometries. BIOINFORMATICS ADVANCES 2023; 3:vbad070. [PMID: 37351310 PMCID: PMC10281963 DOI: 10.1093/bioadv/vbad070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 05/17/2023] [Accepted: 06/01/2023] [Indexed: 06/24/2023]
Abstract
Motivation Accurate modeling of protein-protein interaction interface is essential for high-quality protein complex structure prediction. Existing approaches for estimating the quality of a predicted protein complex structural model utilize only the physicochemical properties or energetic contributions of the interacting atoms, ignoring evolutionarily information or inter-atomic multimeric geometries, including interaction distance and orientations. Results Here, we present PIQLE, a deep graph learning method for protein-protein interface quality estimation. PIQLE leverages multimeric interaction geometries and evolutionarily information along with sequence- and structure-derived features to estimate the quality of individual interactions between the interfacial residues using a multi-head graph attention network and then probabilistically combines the estimated quality for scoring the overall interface. Experimental results show that PIQLE consistently outperforms existing state-of-the-art methods including DProQA, TRScore, GNN-DOVE and DOVE on multiple independent test datasets across a wide range of evaluation metrics. Our ablation study and comparison with the self-assessment module of AlphaFold-Multimer repurposed for protein complex scoring reveal that the performance gains are connected to the effectiveness of the multi-head graph attention network in leveraging multimeric interaction geometries and evolutionary information along with other sequence- and structure-derived features adopted in PIQLE. Availability and implementation An open-source software implementation of PIQLE is freely available at https://github.com/Bhattacharya-Lab/PIQLE. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Md Hossain Shuvo
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | - Mohimenul Karim
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | - Rahmatullah Roche
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | | |
Collapse
|
13
|
Wodak SJ, Vajda S, Lensink MF, Kozakov D, Bates PA. Critical Assessment of Methods for Predicting the 3D Structure of Proteins and Protein Complexes. Annu Rev Biophys 2023; 52:183-206. [PMID: 36626764 PMCID: PMC10885158 DOI: 10.1146/annurev-biophys-102622-084607] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Advances in a scientific discipline are often measured by small, incremental steps. In this review, we report on two intertwined disciplines in the protein structure prediction field, modeling of single chains and modeling of complexes, that have over decades emulated this pattern, as monitored by the community-wide blind prediction experiments CASP and CAPRI. However, over the past few years, dramatic advances were observed for the accurate prediction of single protein chains, driven by a surge of deep learning methodologies entering the prediction field. We review the mainscientific developments that enabled these recent breakthroughs and feature the important role of blind prediction experiments in building up and nurturing the structure prediction field. We discuss how the new wave of artificial intelligence-based methods is impacting the fields of computational and experimental structural biology and highlight areas in which deep learning methods are likely to lead to future developments, provided that major challenges are overcome.
Collapse
Affiliation(s)
- Shoshana J Wodak
- VIB-VUB Center for Structural Biology, Vrije Universiteit Brussel, Brussels, Belgium;
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA;
- Department of Chemistry, Boston University, Boston, Massachusetts, USA
| | - Marc F Lensink
- Univ. Lille, CNRS, UMR 8576-UGSF-Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France;
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA;
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
| | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, United Kingdom;
| |
Collapse
|
14
|
Yin R, Ribeiro-Filho HV, Lin V, Gowthaman R, Cheung M, Pierce BG. TCRmodel2: high-resolution modeling of T cell receptor recognition using deep learning. Nucleic Acids Res 2023:7151345. [PMID: 37140040 DOI: 10.1093/nar/gkad356] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 04/08/2023] [Accepted: 04/25/2023] [Indexed: 05/05/2023] Open
Abstract
The cellular immune system, which is a critical component of human immunity, uses T cell receptors (TCRs) to recognize antigenic proteins in the form of peptides presented by major histocompatibility complex (MHC) proteins. Accurate definition of the structural basis of TCRs and their engagement of peptide-MHCs can provide major insights into normal and aberrant immunity, and can help guide the design of vaccines and immunotherapeutics. Given the limited amount of experimentally determined TCR-peptide-MHC structures and the vast amount of TCRs within each individual as well as antigenic targets, accurate computational modeling approaches are needed. Here, we report a major update to our web server, TCRmodel, which was originally developed to model unbound TCRs from sequence, to now model TCR-peptide-MHC complexes from sequence, utilizing several adaptations of AlphaFold. This method, named TCRmodel2, allows users to submit sequences through an easy-to-use interface and shows similar or greater accuracy than AlphaFold and other methods to model TCR-peptide-MHC complexes based on benchmarking. It can generate models of complexes in 15 minutes, and output models are provided with confidence scores and an integrated molecular viewer. TCRmodel2 is available at https://tcrmodel.ibbr.umd.edu.
Collapse
Affiliation(s)
- Rui Yin
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Helder V Ribeiro-Filho
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Brazilian Biosciences National Laboratory, Brazilian Center for Research in Energy and Materials, Campinas 13083-100, Brazil
| | - Valerie Lin
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Thomas S. Wootton High School, Rockville, MD 20850, USA
| | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Melyssa Cheung
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center, Baltimore, MD 21201, USA
| |
Collapse
|
15
|
Jung Y, Geng C, Bonvin AMJJ, Xue LC, Honavar VG. MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations. Biomolecules 2023; 13:121. [PMID: 36671507 PMCID: PMC9855734 DOI: 10.3390/biom13010121] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 12/22/2022] [Accepted: 12/26/2022] [Indexed: 01/11/2023] Open
Abstract
Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking-the so-called scoring problem-still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions.
Collapse
Affiliation(s)
- Yong Jung
- Bioinformatics & Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
- Artificial Intelligence Research Laboratory, Pennsylvania State University, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Cunliang Geng
- Bijvoet Centre for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | - Alexandre M. J. J. Bonvin
- Bijvoet Centre for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | - Li C. Xue
- Bijvoet Centre for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
- Center for Molecular and Biomolecular Informatics, Radboudumc, Greet Grooteplein 26-28, 6525 GA Nijmegen, The Netherlands
| | - Vasant G. Honavar
- Bioinformatics & Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
- Artificial Intelligence Research Laboratory, Pennsylvania State University, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Clinical and Translational Sciences Institute, Pennsylvania State University, University Park, PA 16802, USA
- College of Information Sciences & Technology, Pennsylvania State University, University Park, PA 16802, USA
- Institute for Computational and Data Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Center for Big Data Analytics and Discovery Informatics, Pennsylvania State University, University Park, PA 16823, USA
| |
Collapse
|
16
|
Kumar V, Kumar P, Chauhan L, Dwivedi A, Ramamurthy HR. Novel combination of FLNC (c.5707G>A; p. Glu1903Lys) and BAG3 (c.610G>A; p.Gly204Arg) genetic variant expressing restrictive cardiomyopathy phenotype in an adolescent girl. J Genet 2022. [DOI: 10.1007/s12041-022-01402-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
17
|
Park HM, Park Y, Berani U, Bang E, Vankerschaver J, Van Messem A, De Neve W, Shim H. In silico optimization of RNA-protein interactions for CRISPR-Cas13-based antimicrobials. Biol Direct 2022; 17:27. [PMID: 36207756 PMCID: PMC9547417 DOI: 10.1186/s13062-022-00339-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/19/2022] [Indexed: 12/04/2022] Open
Abstract
RNA–protein interactions are crucial for diverse biological processes. In prokaryotes, RNA–protein interactions enable adaptive immunity through CRISPR-Cas systems. These defence systems utilize CRISPR RNA (crRNA) templates acquired from past infections to destroy foreign genetic elements through crRNA-mediated nuclease activities of Cas proteins. Thanks to the programmability and specificity of CRISPR-Cas systems, CRISPR-based antimicrobials have the potential to be repurposed as new types of antibiotics. Unlike traditional antibiotics, these CRISPR-based antimicrobials can be designed to target specific bacteria and minimize detrimental effects on the human microbiome during antibacterial therapy. In this study, we explore the potential of CRISPR-based antimicrobials by optimizing the RNA–protein interactions of crRNAs and Cas13 proteins. CRISPR-Cas13 systems are unique as they degrade specific foreign RNAs using the crRNA template, which leads to non-specific RNase activities and cell cycle arrest. We show that a high proportion of the Cas13 systems have no colocalized CRISPR arrays, and the lack of direct association between crRNAs and Cas proteins may result in suboptimal RNA–protein interactions in the current tools. Here, we investigate the RNA–protein interactions of the Cas13-based systems by curating the validation dataset of Cas13 protein and CRISPR repeat pairs that are experimentally validated to interact, and the candidate dataset of CRISPR repeats that reside on the same genome as the currently known Cas13 proteins. To find optimal CRISPR-Cas13 interactions, we first validate the 3-D structure prediction of crRNAs based on their experimental structures. Next, we test a number of RNA–protein interaction programs to optimize the in silico docking of crRNAs with the Cas13 proteins. From this optimized pipeline, we find a number of candidate crRNAs that have comparable or better in silico docking with the Cas13 proteins of the current tools. This study fully automatizes the in silico optimization of RNA–protein interactions as an efficient preliminary step for designing effective CRISPR-Cas13-based antimicrobials.
Collapse
Affiliation(s)
- Ho-Min Park
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea.,Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
| | - Yunseol Park
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
| | - Urta Berani
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
| | - Eunkyu Bang
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
| | - Joris Vankerschaver
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea.,Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | | | - Wesley De Neve
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea.,Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
| | - Hyunjin Shim
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea.
| |
Collapse
|
18
|
Xu Z, Davila A, Wiamowski J, Teraguchi S, Standley DM. Improved antibody-specific epitope prediction using AlphaFold and AbAdapt. Chembiochem 2022; 23:e202200303. [PMID: 35893479 PMCID: PMC9543094 DOI: 10.1002/cbic.202200303] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/27/2022] [Indexed: 11/17/2022]
Abstract
Antibodies recognize their cognate antigens with high affinity and specificity, but the prediction of binding sites on the antigen (epitope) corresponding to a specific antibody remains a challenging problem. To address this problem, we developed AbAdapt, a pipeline that integrates antibody and antigen structural modeling with rigid docking in order to derive antibody‐antigen specific features for epitope prediction. In this study, we systematically assessed the impact of integrating the state‐of‐the‐art protein modeling method AlphaFold with the AbAdapt pipeline. By incorporating more accurate antibody models, we observed improvement in docking, paratope prediction, and prediction of antibody‐specific epitopes. We further applied AbAdapt‐AF in an anti‐receptor binding domain (RBD) antibody complex benchmark and found AbAdapt‐AF outperformed three alternative docking methods. Also, AbAdapt‐AF demonstrated higher epitope prediction accuracy than other tested epitope prediction tools in the anti‐RBD antibody complex benchmark. We anticipate that AbAdapt‐AF will facilitate prediction of antigen‐antibody interactions in a wide range of applications.
Collapse
Affiliation(s)
- Zichang Xu
- Osaka University Research Institute for Microbial Diseases: Osaka Daigaku Biseibutsubyo Kenkyujo, Genome Informatics, JAPAN
| | - Ana Davila
- Osaka University Research Institute for Microbial Diseases: Osaka Daigaku Biseibutsubyo Kenkyujo, Department of Genome Informatics, JAPAN
| | - Jan Wiamowski
- Osaka University Research Institute for Microbial Diseases: Osaka Daigaku Biseibutsubyo Kenkyujo, Department of Genome Informatics, JAPAN
| | - Shunsuke Teraguchi
- Osaka University Research Institute for Microbial Diseases: Osaka Daigaku Biseibutsubyo Kenkyujo, Department of Genome Informatics, JAPAN
| | | |
Collapse
|
19
|
Jones G, Jindal A, Ghani U, Kotelnikov S, Egbert M, Hashemi N, Vajda S, Padhorny D, Kozakov D. Elucidation of protein function using computational docking and hotspot analysis by ClusPro and FTMap. Acta Crystallogr D Struct Biol 2022; 78:690-697. [PMID: 35647916 PMCID: PMC9159284 DOI: 10.1107/s2059798322002741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 03/10/2022] [Indexed: 08/30/2023] Open
Abstract
Starting with a crystal structure of a macromolecule, computational structural modeling can help to understand the associated biological processes, structure and function, as well as to reduce the number of further experiments required to characterize a given molecular entity. In the past decade, two classes of powerful automated tools for investigating the binding properties of proteins have been developed: the protein–protein docking program ClusPro and the FTMap and FTSite programs for protein hotspot identification. These methods have been widely used by the research community by means of publicly available online servers, and models built using these automated tools have been reported in a large number of publications. Importantly, additional experimental information can be leveraged to further improve the predictive power of these approaches. Here, an overview of the methods and their biological applications is provided together with a brief interpretation of the results.
Collapse
|
20
|
Davila A, Xu Z, Li S, Rozewicki J, Wilamowski J, Kotelnikov S, Kozakov D, Teraguchi S, Standley DM. AbAdapt: an adaptive approach to predicting antibody-antigen complex structures from sequence. BIOINFORMATICS ADVANCES 2022; 2:vbac015. [PMID: 36699363 PMCID: PMC9710585 DOI: 10.1093/bioadv/vbac015] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 02/15/2022] [Accepted: 03/03/2022] [Indexed: 01/28/2023]
Abstract
Motivation The scoring of antibody-antigen docked poses starting from unbound homology models has not been systematically optimized for a large and diverse set of input sequences. Results To address this need, we have developed AbAdapt, a webserver that accepts antibody and antigen sequences, models their 3D structures, predicts epitope and paratope, and then docks the modeled structures using two established docking engines (Piper and Hex). Each of the key steps has been optimized by developing and training new machine-learning models. The sequences from a diverse set of 622 antibody-antigen pairs with known structure were used as inputs for leave-one-out cross-validation. The final set of cluster representatives included at least one 'Adequate' pose for 550/622 (88.4%) of the queries. The median (interquartile range) ranks of these 'Adequate' poses were 22 (5-77). Similar results were obtained on a holdout set of 100 unrelated antibody-antigen pairs. When epitopes were repredicted using docking-derived features for specific antibodies, the median ROC AUC increased from 0.679 to 0.720 in cross-validation and from 0.694 to 0.730 in the holdout set. Availability and implementation AbAdapt and related data are available at https://sysimm.org/abadapt/. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Ana Davila
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan
| | - Zichang Xu
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan
| | - Songling Li
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan
| | - John Rozewicki
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan
| | - Jan Wilamowski
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan
| | - Sergei Kotelnikov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-5252, USA,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794-5252, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-5252, USA,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794-5252, USA
| | - Shunsuke Teraguchi
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan,Faculty of Data Science, Shiga University, Hikone 522-8522, Japan
| | - Daron M Standley
- Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan,Immunology Frontier Research Center, Department of Systems Immunology, Osaka University, Suita 565-0871, Japan,To whom correspondence should be addressed.
| |
Collapse
|
21
|
Lensink MF, Brysbaert G, Mauri T, Nadzirin N, Velankar S, Chaleil RAG, Clarence T, Bates PA, Kong R, Liu B, Yang G, Liu M, Shi H, Lu X, Chang S, Roy RS, Quadir F, Liu J, Cheng J, Antoniak A, Czaplewski C, Giełdoń A, Kogut M, Lipska AG, Liwo A, Lubecka EA, Maszota-Zieleniak M, Sieradzan AK, Ślusarz R, Wesołowski PA, Zięba K, Del Carpio Muñoz CA, Ichiishi E, Harmalkar A, Gray JJ, Bonvin AMJJ, Ambrosetti F, Vargas Honorato R, Jandova Z, Jiménez-García B, Koukos PI, Van Keulen S, Van Noort CW, Réau M, Roel-Touris J, Kotelnikov S, Padhorny D, Porter KA, Alekseenko A, Ignatov M, Desta I, Ashizawa R, Sun Z, Ghani U, Hashemi N, Vajda S, Kozakov D, Rosell M, Rodríguez-Lumbreras LA, Fernandez-Recio J, Karczynska A, Grudinin S, Yan Y, Li H, Lin P, Huang SY, Christoffer C, Terashi G, Verburgt J, Sarkar D, Aderinwale T, Wang X, Kihara D, Nakamura T, Hanazono Y, Gowthaman R, Guest JD, Yin R, Taherzadeh G, Pierce BG, Barradas-Bautista D, Cao Z, Cavallo L, Oliva R, Sun Y, Zhu S, Shen Y, Park T, Woo H, Yang J, Kwon S, Won J, Seok C, Kiyota Y, Kobayashi S, Harada Y, Takeda-Shitaka M, Kundrotas PJ, Singh A, Vakser IA, Dapkūnas J, Olechnovič K, Venclovas Č, Duan R, Qiu L, Xu X, Zhang S, Zou X, Wodak SJ. Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment. Proteins 2021; 89:1800-1823. [PMID: 34453465 PMCID: PMC8616814 DOI: 10.1002/prot.26222] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 07/24/2021] [Accepted: 08/05/2021] [Indexed: 12/19/2022]
Abstract
We present the results for CAPRI Round 50, the fourth joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of twelve targets, including six dimers, three trimers, and three higher-order oligomers. Four of these were easy targets, for which good structural templates were available either for the full assembly, or for the main interfaces (of the higher-order oligomers). Eight were difficult targets for which only distantly related templates were found for the individual subunits. Twenty-five CAPRI groups including eight automatic servers submitted ~1250 models per target. Twenty groups including six servers participated in the CAPRI scoring challenge submitted ~190 models per target. The accuracy of the predicted models was evaluated using the classical CAPRI criteria. The prediction performance was measured by a weighted scoring scheme that takes into account the number of models of acceptable quality or higher submitted by each group as part of their five top-ranking models. Compared to the previous CASP-CAPRI challenge, top performing groups submitted such models for a larger fraction (70-75%) of the targets in this Round, but fewer of these models were of high accuracy. Scorer groups achieved stronger performance with more groups submitting correct models for 70-80% of the targets or achieving high accuracy predictions. Servers performed less well in general, except for the MDOCKPP and LZERD servers, who performed on par with human groups. In addition to these results, major advances in methodology are discussed, providing an informative overview of where the prediction of protein assemblies currently stands.
Collapse
Affiliation(s)
- Marc F Lensink
- CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
| | - Guillaume Brysbaert
- CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
| | - Théo Mauri
- CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
| | - Nurul Nadzirin
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | | | - Tereza Clarence
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| | - Ren Kong
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Bin Liu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Guangbo Yang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Ming Liu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Hang Shi
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Xufeng Lu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Shan Chang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Raj S Roy
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
| | - Farhan Quadir
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
| | - Jian Liu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA
| | - Anna Antoniak
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | | | - Artur Giełdoń
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | - Mateusz Kogut
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | | | - Adam Liwo
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | - Emilia A Lubecka
- Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Gdansk, Poland
| | | | | | - Rafał Ślusarz
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | - Patryk A Wesołowski
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
- Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | - Karolina Zięba
- Faculty of Chemistry, University of Gdansk, Gdansk, Poland
| | | | - Eiichiro Ichiishi
- International University of Health and Welfare Hospital (IUHW Hospital), Nasushiobara City, Japan
| | - Ameya Harmalkar
- Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Jeffrey J Gray
- Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Alexandre M J J Bonvin
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Francesco Ambrosetti
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Rodrigo Vargas Honorato
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Zuzana Jandova
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Brian Jiménez-García
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Panagiotis I Koukos
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Siri Van Keulen
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Charlotte W Van Noort
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Manon Réau
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Jorge Roel-Touris
- Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
| | - Sergei Kotelnikov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
- Innopolis University, Russia
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
| | - Kathryn A Porter
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Andrey Alekseenko
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
- Institute of Computer-Aided Design of the Russian Academy of Sciences, Moscow, Russia
| | - Mikhail Ignatov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
| | - Israel Desta
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Ryota Ashizawa
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
| | - Zhuyezi Sun
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Usman Ghani
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Nasser Hashemi
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
- Department of Chemistry, Boston University, Boston, Massachusetts, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
| | - Mireia Rosell
- Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Luis A Rodríguez-Lumbreras
- Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Juan Fernandez-Recio
- Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | | | - Sergei Grudinin
- Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Charles Christoffer
- Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Tunde Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Tsukasa Nakamura
- Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan
| | - Yuya Hanazono
- Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, Tokai, Ibaraki, Japan
| | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
| | - Johnathan D Guest
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
| | - Rui Yin
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
| | - Ghazaleh Taherzadeh
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
| | | | - Zhen Cao
- King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Luigi Cavallo
- King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Romina Oliva
- University of Naples "Parthenope", Napoli, Italy
| | - Yuanfei Sun
- Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
| | - Shaowen Zhu
- Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
| | - Yang Shen
- Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
| | - Taeyong Park
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Hyeonuk Woo
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Jinsol Yang
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Sohee Kwon
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Jonghun Won
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Chaok Seok
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Yasuomi Kiyota
- School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
| | | | - Yoshiki Harada
- School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
| | | | - Petras J Kundrotas
- Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
| | - Amar Singh
- Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
| | - Ilya A Vakser
- Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
| | - Justas Dapkūnas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Kliment Olechnovič
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Česlovas Venclovas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Rui Duan
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Liming Qiu
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Xianjin Xu
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Shuang Zhang
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Xiaoqin Zou
- Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
- Department of Biochemistry, University of Missouri, Columbia, Missouri, USA
| | | |
Collapse
|
22
|
Abstract
The biological significance of proteins attracted the scientific community in exploring their characteristics. The studies shed light on the interaction patterns and functions of proteins in a living body. Due to their practical difficulties, reliable experimental techniques pave the way for introducing computational methods in the interaction prediction. Automated methods reduced the difficulties but could not yet replace experimental studies as the field is still evolving. Interaction prediction problem being critical needs highly accurate results, but none of the existing methods could offer reliable performance that can parallel with experimental results yet. This article aims to assess the existing computational docking algorithms, their challenges, and future scope. Blind docking techniques are quite helpful when no information other than the individual structures are available. As more and more complex structures are being added to different databases, information-driven approaches can be a good alternative. Artificial intelligence, ruling over the major fields, is expected to take over this domain very shortly.
Collapse
|
23
|
Sinha P, Arunachalam KD, Nagarajan SK, Madhavan T, R Jayakumar A, Saiyad Musthafa M. Radio-protective efficacy of Gymnema sylvestre on Pangasius sutchi against gamma ( 60Co) irradiation. Int J Radiat Biol 2021; 98:212-229. [PMID: 34705596 DOI: 10.1080/09553002.2022.1998701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
PURPOSE Freshwater fish Pangasius sutchi was used in this study as a vertebrate model. We evaluated the induction of certain antioxidant enzymes in various vital organs. The radioprotective efficacy of Gymnema sylvestre leaves extract (GS) [25 mg/kg Body Weight (B.W)] and its bioactive compound Gymnemagenin (GG) [0.3 mg/kg B.W] was compared with Amifostine (Ami), the only radioprotector clinically approved by the US-FDA [Ami- 83.3 mg/kg B.W] against different doses of gamma radiation - 60Co (Lethal Dose: LD30-9.2 Gy, LD50-10.2 Gy and LD70-11.4 Gy). MATERIALS AND METHODS This study was done via stress marker enzymes, cell cycle analysis (CCA) and DNA damage assay prediction with molecular docking, which are reported here for the first time. The results indicate an elevated LPO level and decreased level of CAT, SOD and GSH due to oxidative stress initiation by 60Co Ionizing Radiation (IR) on 4th day and slightly reduced on 32nd day while the reverse observed when the fishes were pretreated with Ami, GS and GG. Similarly, CCA and dead/live cells counts were conducted with pretreatment of Ami, GS and GG against 60Co IR dose (LD50-10.2 Gy). RESULTS In CCA, G0/G1 phase was observed to be the highest in Ami and lowest in GG, against 60Co IR doses 10.2 Gy which was 51.76 ± 7.55. The dead cells range observed in pretreated group of Ami, GS and GG was lowest in Ami and highest in GG and live cells (highest in Ami and lowest in GG) as compared to 60Co IR group (86.43 ± 3.42 and 8.77 ± 5.95). Thus, antioxidant profile improvement by oxidative stress reduction and gradual progression of different phases of cell cycle except the apoptotic phase along with the live cells counts indicates that the radio-protective efficacy of GS is similar to Ami. CONCLUSION Predictive assessment was carried out by docking of Ami, various components of GS with p53, NF-κβ cells and Rad51 proteins structures responsible for CCA, apoptosis and repair mechanism. These structural proteins were docked with other structural proteins like USP7, TNF-α and partner and localizer of BRCA2 associated (PALB2/BRCA2) complex which made us perform these systemic efforts to find the functional activity of these known radio-protectants.
Collapse
Affiliation(s)
- Pamela Sinha
- Center for Environmental and Nuclear Research (CENR), SRM Institute of Science and Technology, Kattankulathur, India
| | - Kantha Devi Arunachalam
- Center for Environmental and Nuclear Research (CENR), SRM Institute of Science and Technology, Kattankulathur, India
| | | | - Thirumurthy Madhavan
- Department of Genetic Engineering, School of Bioengineering, SRMIST, Kattankulathur, India
| | - Arumugam R Jayakumar
- Department of Obstetrics, Gynecology and Reproductive Sciences, University of Miami Miller School of Medicine, Miami, FL, USA.,Neuropathology/Oncology Unit, Miami VA Foundation for Research and Education Inc, Miami, FL, USA
| | - Mohamed Saiyad Musthafa
- Unit of Research in Radiation Biology & Environmental Radioactivity (URRBER), P.G. & Research Department of Zoology, The New College (Autonomous), Affiliated to University of Madras, Chennai, India
| |
Collapse
|
24
|
Pozzati G, Zhu W, Bassot C, Lamb J, Kundrotas P, Elofsson A. Limits and potential of combined folding and docking. Bioinformatics 2021; 38:954-961. [PMID: 34788800 PMCID: PMC8796369 DOI: 10.1093/bioinformatics/btab760] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 09/23/2021] [Accepted: 11/02/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein-protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta. RESULTS The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein-protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065. AVAILABILITY AND IMPLEMENTATION All scripts for predictions and analysis are available from https://github.com/ElofssonLab/bioinfo-toolbox/ and https://gitlab.com/ElofssonLab/benchmark5/. All models joined alignments, and evaluation results are available from the following figshare repository https://doi.org/10.6084/m9.figshare.14654886.v2. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | | | - John Lamb
- Science for Life Laboratory and Department of Biochemistry and Biophysics, Stockholm University, 171 21 Solna, Sweden
| | - Petras Kundrotas
- Science for Life Laboratory and Department of Biochemistry and Biophysics, Stockholm University, 171 21 Solna, Sweden,Center for Computational Biology, The University of Kansas, Lawrence, KS 66047, USA
| | | |
Collapse
|
25
|
Jandova Z, Vargiu AV, Bonvin AMJJ. Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue. J Chem Theory Comput 2021; 17:5944-5954. [PMID: 34342983 PMCID: PMC8444332 DOI: 10.1021/acs.jctc.1c00336] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Indexed: 11/29/2022]
Abstract
Molecular docking excels at creating a plethora of potential models of protein-protein complexes. To correctly distinguish the favorable, native-like models from the remaining ones remains, however, a challenge. We assessed here if a protocol based on molecular dynamics (MD) simulations would allow distinguishing native from non-native models to complement scoring functions used in docking. To this end, the first models for 25 protein-protein complexes were generated using HADDOCK. Next, MD simulations complemented with machine learning were used to discriminate between native and non-native complexes based on a combination of metrics reporting on the stability of the initial models. Native models showed higher stability in almost all measured properties, including the key ones used for scoring in the Critical Assessment of PRedicted Interaction (CAPRI) competition, namely the positional root mean square deviations and fraction of native contacts from the initial docked model. A random forest classifier was trained, reaching a 0.85 accuracy in correctly distinguishing native from non-native complexes. Reasonably modest simulation lengths of the order of 50-100 ns are sufficient to reach this accuracy, which makes this approach applicable in practice.
Collapse
Affiliation(s)
- Zuzana Jandova
- Computational
Structural Biology Group, Bijvoet Centre for Biomolecular Research,
Faculty of Science—Chemistry, Utrecht
University, Padualaan 8, 3584 CH Utrecht, the Netherlands
| | - Attilio Vittorio Vargiu
- Physics
Department, University of Cagliari, Cittadella
Universitaria, S.P. 8 km 0.700, 09042 Monserrato, Italy
| | - Alexandre M. J. J. Bonvin
- Computational
Structural Biology Group, Bijvoet Centre for Biomolecular Research,
Faculty of Science—Chemistry, Utrecht
University, Padualaan 8, 3584 CH Utrecht, the Netherlands
| |
Collapse
|
26
|
Liu X, Luo Y, Li P, Song S, Peng J. Deep geometric representations for modeling effects of mutations on protein-protein binding affinity. PLoS Comput Biol 2021; 17:e1009284. [PMID: 34347784 PMCID: PMC8366979 DOI: 10.1371/journal.pcbi.1009284] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 08/16/2021] [Accepted: 07/17/2021] [Indexed: 11/19/2022] Open
Abstract
Modeling the impact of amino acid mutations on protein-protein interaction plays a crucial role in protein engineering and drug design. In this study, we develop GeoPPI, a novel structure-based deep-learning framework to predict the change of binding affinity upon mutations. Based on the three-dimensional structure of a protein, GeoPPI first learns a geometric representation that encodes topology features of the protein structure via a self-supervised learning scheme. These representations are then used as features for training gradient-boosting trees to predict the changes of protein-protein binding affinity upon mutations. We find that GeoPPI is able to learn meaningful features that characterize interactions between atoms in protein structures. In addition, through extensive experiments, we show that GeoPPI achieves new state-of-the-art performance in predicting the binding affinity changes upon both single- and multi-point mutations on six benchmark datasets. Moreover, we show that GeoPPI can accurately estimate the difference of binding affinities between a few recently identified SARS-CoV-2 antibodies and the receptor-binding domain (RBD) of the S protein. These results demonstrate the potential of GeoPPI as a powerful and useful computational tool in protein design and engineering. Our code and datasets are available at: https://github.com/Liuxg16/GeoPPI. Estimating the binding affinities of protein-protein interactions (PPIs) is crucial to understand protein function and design new functional proteins. Since the experimental measurement in wet-labs is labor-intensive and time-consuming, fast and accurate in silico approaches have received much attention. Although considerable efforts have been made in this direction, predicting the effects of mutations on the protein-protein binding affinity is still a challenging research problem. In this work, we introduce GeoPPI, a novel computational approach that uses deep geometric representations of protein complexes to predict the effects of mutations on the binding affinity. The geometric representations are first learned via a self-supervised learning scheme and then integrated with gradient-boosting trees to accomplish the prediction. We find that the learned representations encode meaningful patterns underlying the interactions between atoms in protein structures. Also, extensive tests on major benchmark datasets show that GeoPPI has made an important improvement over the existing methods in predicting the effects of mutations on the binding affinity.
Collapse
Affiliation(s)
- Xianggen Liu
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
| | - Yunan Luo
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Pengyong Li
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
| | - Sen Song
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
- * E-mail: (JP); (SS)
| | - Jian Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail: (JP); (SS)
| |
Collapse
|
27
|
Krupa P, Karczyńska AS, Mozolewska MA, Liwo A, Czaplewski C. UNRES-Dock-protein-protein and peptide-protein docking by coarse-grained replica-exchange MD simulations. Bioinformatics 2021; 37:1613-1615. [PMID: 33079977 DOI: 10.1093/bioinformatics/btaa897] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Revised: 09/04/2020] [Accepted: 10/06/2020] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The majority of the proteins in living organisms occur as homo- or hetero-multimeric structures. Although there are many tools to predict the structures of single-chain proteins or protein complexes with small ligands, peptide-protein and protein-protein docking is more challenging. In this work, we utilized multiplexed replica-exchange molecular dynamics (MREMD) simulations with the physics-based heavily coarse-grained UNRES model, which provides more than a 1000-fold simulation speed-up compared with all-atom approaches to predict structures of protein complexes. RESULTS We present a new protein-protein and peptide-protein docking functionality of the UNRES package, which includes a variable degree of conformational flexibility. UNRES-Dock protocol was tested on a set of 55 complexes with size from 43 to 587 amino-acid residues, showing that structures of the complexes can be predicted with good quality, if the sampling of the conformational space is sufficient, especially for flexible peptide-protein systems. The developed automatized protocol has been implemented in the standalone UNRES package and in the UNRES server. AVAILABILITY AND IMPLEMENTATION UNRES server: http://unres-server.chem.ug.edu.pl; UNRES package and data used in testing of UNRES-Dock: http://unres.pl. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Paweł Krupa
- Faculty of Chemistry, University of Gdańsk, Gdańsk 80-308, Poland.,Institute of Physics, Polish Academy of Sciences, Warsaw 02-668, Poland
| | - Agnieszka S Karczyńska
- Faculty of Chemistry, University of Gdańsk, Gdańsk 80-308, Poland.,University of Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble 38000, France
| | | | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Gdańsk 80-308, Poland
| | | |
Collapse
|
28
|
Iannuzzi R, Rossetti G, Spitaleri A, Bonnal RJP, Pagani M, Mollica L. A Simplified Amino Acidic Alphabet to Unveil the T-Cells Receptors Antigens: A Computational Perspective. Front Chem 2021; 9:598802. [PMID: 33718327 PMCID: PMC7947793 DOI: 10.3389/fchem.2021.598802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 01/19/2021] [Indexed: 11/15/2022] Open
Abstract
The exposure to pathogens triggers the activation of adaptive immune responses through antigens bound to surface receptors of antigen presenting cells (APCs). T cell receptors (TCR) are responsible for initiating the immune response through their physical direct interaction with antigen-bound receptors on the APCs surface. The study of T cell interactions with antigens is considered of crucial importance for the comprehension of the role of immune responses in cancer growth and for the subsequent design of immunomodulating anticancer drugs. RNA sequencing experiments performed on T cells represented a major breakthrough for this branch of experimental molecular biology. Apart from the gene expression levels, the hypervariable CDR3α/β sequences of the TCR loops can now be easily determined and modelled in the three dimensions, being the portions of TCR mainly responsible for the interaction with APC receptors. The most direct experimental method for the investigation of antigens would be based on peptide libraries, but their huge combinatorial nature, size, cost, and the difficulty of experimental fine tuning makes this approach complicated time consuming, and costly. We have implemented in silico methodology with the aim of moving from CDR3α/β sequences to a library of potentially antigenic peptides that can be used in immunologically oriented experiments to study T cells’ reactivity. To reduce the size of the library, we have verified the reproducibility of experimental benchmarks using the permutation of only six residues that can be considered representative of all ensembles of 20 natural amino acids. Such a simplified alphabet is able to correctly find the poses and chemical nature of original antigens within a small subset of ligands of potential interest. The newly generated library would have the advantage of leading to potentially antigenic ligands that would contribute to a better understanding of the chemical nature of TCR-antigen interactions. This step is crucial in the design of immunomodulators targeted towards T-cells response as well as in understanding the first principles of an immune response in several diseases, from cancer to autoimmune disorders.
Collapse
Affiliation(s)
- Raffaele Iannuzzi
- Istituto Nazionale Genetica Molecolare INGM 'Romeo ed Enrica Invernizzi', Milan, Italy
| | - Grazisa Rossetti
- Molecular Oncology and Immunology, FIRC Institute of Molecular Oncology (IFOM), Milan, Italy
| | - Andrea Spitaleri
- Emerging Bacterial Pathogens Unit, Division of Immunology, Transplantation and Infectious Diseases, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Raoul J P Bonnal
- Molecular Oncology and Immunology, FIRC Institute of Molecular Oncology (IFOM), Milan, Italy
| | - Massimiliano Pagani
- Molecular Oncology and Immunology, FIRC Institute of Molecular Oncology (IFOM), Milan, Italy.,Department of Medical Biotechnology and Translational Medicine, University of Milan, Milan, Italy
| | - Luca Mollica
- Department of Medical Biotechnology and Translational Medicine, University of Milan, Milan, Italy
| |
Collapse
|
29
|
Johansson-Åkhe I, Mirabello C, Wallner B. InterPep2: global peptide-protein docking using interaction surface templates. Bioinformatics 2020; 36:2458-2465. [PMID: 31917413 PMCID: PMC7178396 DOI: 10.1093/bioinformatics/btaa005] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 12/16/2019] [Accepted: 01/03/2020] [Indexed: 12/23/2022] Open
Abstract
Motivation Interactions between proteins and peptides or peptide-like intrinsically disordered regions are involved in many important biological processes, such as gene expression and cell life-cycle regulation. Experimentally determining the structure of such interactions is time-consuming and difficult because of the inherent flexibility of the peptide ligand. Although several prediction-methods exist, most are limited in performance or availability. Results InterPep2 is a freely available method for predicting the structure of peptide–protein interactions. Improved performance is obtained by using templates from both peptide–protein and regular protein–protein interactions, and by a random forest trained to predict the DockQ-score for a given template using sequence and structural features. When tested on 252 bound peptide–protein complexes from structures deposited after the complexes used in the construction of the training and templates sets of InterPep2, InterPep2-Refined correctly positioned 67 peptides within 4.0 Å LRMSD among top10, similar to another state-of-the-art template-based method which positioned 54 peptides correctly. However, InterPep2 displays a superior ability to evaluate the quality of its own predictions. On a previously established set of 27 non-redundant unbound-to-bound peptide–protein complexes, InterPep2 performs on-par with leading methods. The extended InterPep2-Refined protocol managed to correctly model 15 of these complexes within 4.0 Å LRMSD among top10, without using templates from homologs. In addition, combining the template-based predictions from InterPep2 with ab initio predictions from PIPER-FlexPepDock resulted in 22% more near-native predictions compared to the best single method (22 versus 18). Availability and implementation The program is available from: http://wallnerlab.org/InterPep2. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Isak Johansson-Åkhe
- Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden
| | - Claudio Mirabello
- Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden
| | - Björn Wallner
- Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden
| |
Collapse
|
30
|
Dos Santos-Silva CA, Zupin L, Oliveira-Lima M, Vilela LMB, Bezerra-Neto JP, Ferreira-Neto JR, Ferreira JDC, de Oliveira-Silva RL, Pires CDJ, Aburjaile FF, de Oliveira MF, Kido EA, Crovella S, Benko-Iseppon AM. Plant Antimicrobial Peptides: State of the Art, In Silico Prediction and Perspectives in the Omics Era. Bioinform Biol Insights 2020; 14:1177932220952739. [PMID: 32952397 PMCID: PMC7476358 DOI: 10.1177/1177932220952739] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 07/30/2020] [Indexed: 12/14/2022] Open
Abstract
Even before the perception or interaction with pathogens, plants rely on constitutively guardian molecules, often specific to tissue or stage, with further expression after contact with the pathogen. These guardians include small molecules as antimicrobial peptides (AMPs), generally cysteine-rich, functioning to prevent pathogen establishment. Some of these AMPs are shared among eukaryotes (eg, defensins and cyclotides), others are plant specific (eg, snakins), while some are specific to certain plant families (such as heveins). When compared with other organisms, plants tend to present a higher amount of AMP isoforms due to gene duplications or polyploidy, an occurrence possibly also associated with the sessile habit of plants, which prevents them from evading biotic and environmental stresses. Therefore, plants arise as a rich resource for new AMPs. As these molecules are difficult to retrieve from databases using simple sequence alignments, a description of their characteristics and in silico (bioinformatics) approaches used to retrieve them is provided, considering resources and databases available. The possibilities and applications based on tools versus database approaches are considerable and have been so far underestimated.
Collapse
Affiliation(s)
| | - Luisa Zupin
- Genetic Immunology laboratory, Institute for Maternal and Child Health-IRCCS, Burlo Garofolo, Trieste, Italy
| | - Marx Oliveira-Lima
- Departamento de Genética, Universidade Federal de Pernambuco, Recife, Brazil
| | | | | | | | - José Diogo Cavalcanti Ferreira
- Departamento de Genética, Universidade Federal de Pernambuco, Recife, Brazil.,Departamento de Genética, Instituto Federal de Pernambuco, Pesqueira, Brazil
| | | | | | | | | | - Ederson Akio Kido
- Departamento de Genética, Universidade Federal de Pernambuco, Recife, Brazil
| | - Sergio Crovella
- Genetic Immunology laboratory, Institute for Maternal and Child Health-IRCCS, Burlo Garofolo, Trieste, Italy.,Department of Medicine, Surgery and Health Sciences, University of Trieste, Trieste, Italy
| | | |
Collapse
|
31
|
Nagarajan S, Babu S, Sohn H, Madhavan T. Molecular-Level Understanding of the Somatostatin Receptor 1 (SSTR1)-Ligand Binding: A Structural Biology Study Based on Computational Methods. ACS OMEGA 2020; 5:21145-21161. [PMID: 32875251 PMCID: PMC7450625 DOI: 10.1021/acsomega.0c02847] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 07/31/2020] [Indexed: 06/11/2023]
Abstract
Somatostatin receptor 1 (SSTR1), a subtype of somatostatin receptors, is involved in various signaling mechanisms in different parts of the human body. Like most of the G-protein-coupled receptors (GPCRs), the available information on the structural features of SSTR1 responsible for the biological activity is scarce. In this study, we report a molecular-level understanding of SSTR1-ligand binding, which could be helpful in solving the structural complexities involved in SSTR1 functioning. Based on a three-dimensional quantitative structure-activity relationship (3D-QSAR) study using comparative molecular field analysis (CoMFA) and comparative molecular similarity index analysis (CoMSIA), we have identified that an electronegative, less-bulkier, and hydrophobic atom substitution can substantially increase the biological activity of SSTR1 ligands. A density functional theory (DFT) study has been followed to study the electron-related properties of the SSTR1 ligands and to validate the results obtained via the 3D-QSAR study. 3D models of SSTR1-ligand systems have been embedded in lipid-lipid bilayer membranes to perform molecular dynamics (MD) simulations. Analysis of the MD trajectories reveals important information about the crucial residues involved in SSTR1-ligand binding and various conformational changes in the protein that occur after ligand binding. Additionally, we have identified the probable ligand-binding site of SSTR1 and validated it using MD. We have also studied the favorable conditions that are essential for forming the most stable and lowest-energy bioactive conformation of the ligands inside the binding site. The results of the study could be useful in constructing more potent and novel SSTR1 antagonists and agonists.
Collapse
Affiliation(s)
- Santhosh
Kumar Nagarajan
- Computational
Biology Lab, Department of Genetic Engineering, School of Bioengineering, SRM Institute of Science and Technology, SRM Nagar, Kattankulathur, Chennai 603203, India
| | - Sathya Babu
- Computational
Biology Lab, Department of Genetic Engineering, School of Bioengineering, SRM Institute of Science and Technology, SRM Nagar, Kattankulathur, Chennai 603203, India
| | - Honglae Sohn
- Department
of Chemistry and Department of Carbon Materials, Chosun University, Gwangju, South Korea
| | - Thirumurthy Madhavan
- Computational
Biology Lab, Department of Genetic Engineering, School of Bioengineering, SRM Institute of Science and Technology, SRM Nagar, Kattankulathur, Chennai 603203, India
| |
Collapse
|
32
|
Brooks BD, Closmore A, Yang J, Holland M, Cairns T, Cohen GH, Bailey-Kellogg C. Characterizing Epitope Binding Regions of Entire Antibody Panels by Combining Experimental and Computational Analysis of Antibody: Antigen Binding Competition. Molecules 2020; 25:molecules25163659. [PMID: 32796656 PMCID: PMC7464469 DOI: 10.3390/molecules25163659] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 07/27/2020] [Accepted: 07/28/2020] [Indexed: 11/16/2022] Open
Abstract
Vaccines and immunotherapies depend on the ability of antibodies to sensitively and specifically recognize particular antigens and specific epitopes on those antigens. As such, detailed characterization of antibody-antigen binding provides important information to guide development. Due to the time and expense required, high-resolution structural characterization techniques are typically used sparingly and late in a development process. Here, we show that antibody-antigen binding can be characterized early in a process for whole panels of antibodies by combining experimental and computational analyses of competition between monoclonal antibodies for binding to an antigen. Experimental "epitope binning" of monoclonal antibodies uses high-throughput surface plasmon resonance to reveal which antibodies compete, while a new complementary computational analysis that we call "dock binning" evaluates antibody-antigen docking models to identify why and where they might compete, in terms of possible binding sites on the antigen. Experimental and computational characterization of the identified antigenic hotspots then enables the refinement of the competitors and their associated epitope binding regions on the antigen. While not performed at atomic resolution, this approach allows for the group-level identification of functionally related monoclonal antibodies (i.e., communities) and identification of their general binding regions on the antigen. By leveraging extensive epitope characterization data that can be readily generated both experimentally and computationally, researchers can gain broad insights into the basis for antibody-antigen recognition in wide-ranging vaccine and immunotherapy discovery and development programs.
Collapse
Affiliation(s)
- Benjamin D. Brooks
- Department of Biomedical Sciences, Rocky Vista University, Ivins, UT 84738, USA
- Inovan Inc., Fargo, ND 58102, USA
- Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; (T.C.); (G.H.C.)
- Correspondence: ; Tel.: +1-435-222-1403
| | - Adam Closmore
- Department of Pharmacy, North Dakota State University, Fargo, ND 58102, USA;
| | - Juechen Yang
- Department of Biomedical Engineering, North Dakota State University, Fargo, ND 58102, USA; (J.Y.); (M.H.)
| | - Michael Holland
- Department of Biomedical Engineering, North Dakota State University, Fargo, ND 58102, USA; (J.Y.); (M.H.)
| | - Tina Cairns
- Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; (T.C.); (G.H.C.)
| | - Gary H. Cohen
- Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; (T.C.); (G.H.C.)
| | | |
Collapse
|
33
|
Khramushin A, Marcu O, Alam N, Shimony O, Padhorny D, Brini E, Dill KA, Vajda S, Kozakov D, Schueler-Furman O. Modeling beta-sheet peptide-protein interactions: Rosetta FlexPepDock in CAPRI rounds 38-45. Proteins 2020; 88:1037-1049. [PMID: 31891416 PMCID: PMC7539656 DOI: 10.1002/prot.25871] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 12/17/2019] [Accepted: 12/26/2019] [Indexed: 01/09/2023]
Abstract
Peptide-protein docking is challenging due to the considerable conformational freedom of the peptide. CAPRI rounds 38-45 included two peptide-protein interactions, both characterized by a peptide forming an additional beta strand of a beta sheet in the receptor. Using the Rosetta FlexPepDock peptide docking protocol we generated top-performing, high-accuracy models for targets 134 and 135, involving an interaction between a peptide derived from L-MAG with DLC8. In addition, we were able to generate the only medium-accuracy models for a particularly challenging target, T121. In contrast to the classical peptide-mediated interaction, in which receptor side chains contact both peptide backbone and side chains, beta-sheet complementation involves a major contribution to binding by hydrogen bonds between main chain atoms. To establish how binding affinity and specificity are established in this special class of peptide-protein interactions, we extracted PeptiDBeta, a benchmark of solved structures of different protein domains that are bound by peptides via beta-sheet complementation, and tested our protocol for global peptide-docking PIPER-FlexPepDock on this dataset. We find that the beta-strand part of the peptide is sufficient to generate approximate and even high resolution models of many interactions, but inclusion of adjacent motif residues often provides additional information necessary to achieve high resolution model quality.
Collapse
Affiliation(s)
- Alisa Khramushin
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Orly Marcu
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Nawsad Alam
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Orly Shimony
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony
Brook University, New York, New York
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Emiliano Brini
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
- Department of Physics and Astronomy, Stony Brook
University, New York, New York
- Department of Chemistry, Stony Brook University, New York,
New York
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University,
Boston, Massachusetts
- Department of Chemistry, Boston University, Boston,
Massachusetts
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony
Brook University, New York, New York
- Laufer Center for Physical and Quantitative Biology, Stony
Brook University, New York, New York
| | - Ora Schueler-Furman
- Department of Microbiologyand Molecular Genetics, Institute
for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University,
Jerusalem, Israel
| |
Collapse
|
34
|
Desta IT, Porter KA, Xia B, Kozakov D, Vajda S. Performance and Its Limits in Rigid Body Protein-Protein Docking. Structure 2020; 28:1071-1081.e3. [PMID: 32649857 DOI: 10.1016/j.str.2020.06.006] [Citation(s) in RCA: 274] [Impact Index Per Article: 68.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 04/19/2020] [Accepted: 06/19/2020] [Indexed: 12/13/2022]
Abstract
The development of fast Fourier transform (FFT) algorithms enabled the sampling of billions of complex conformations and thus revolutionized protein-protein docking. FFT-based methods are now widely available and have been used in hundreds of thousands of docking calculations. Although the methods perform "soft" docking, which allows for some overlap of component proteins, the rigid body assumption clearly introduces limitations on accuracy and reliability. In addition, the method can work only with energy expressions represented by sums of correlation functions. In this paper we use a well-established protein-protein docking benchmark set to evaluate the results of these limitations by focusing on the performance of the docking server ClusPro, which implements one of the best rigid body methods. Furthermore, we explore the theoretical limits of accuracy when using established energy terms for scoring, provide comparison with flexible docking algorithms, and review the historical performance of servers in the CAPRI docking experiment.
Collapse
Affiliation(s)
- Israel T Desta
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Kathryn A Porter
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Bing Xia
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA.
| |
Collapse
|
35
|
Hofmann SM, Frost CV, Podewin T, Gailer M, Weber E, Zacharias M, Zinth W, Hoffmann-Röder A. Folding and Unfolding of the Short Light-Triggered β-Hairpin Peptide AzoChignolin Occurs within 100 ns. J Phys Chem B 2020; 124:5113-5121. [PMID: 32479079 DOI: 10.1021/acs.jpcb.0c02021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
To map the underlying molecular mechanisms of folding dynamics in proteins, light-operated peptides have emerged as promising tools. In this study, we reveal the complete sequence of light-induced structural changes of AzoChignolin, a short β-hairpin peptide containing an azobenzene photoswitch in its loop region. Light-triggered structural changes were monitored by time-resolved IR spectroscopy. Formation and destruction of the hairpin structure is very fast and occurs within 100 ns for AzoChignolin in methanol. Atomistic molecular dynamics simulations using two explicit solvents, methanol and water, revealed the underlying molecular processes and allowed us to gain further insight into the reaction mechanism. Despite its rapid reaction time, hairpin formation in these solvents is not force-driven by the molecular switch but proceeded via formation of interstrand hydrogen bonds and contacts between aromatic residues. Moreover, the combined experimental and theoretical study demonstrates that the solvent (methanol vs water) does not dictate the velocity of β-hairpin formation in the AzoChignolin peptide comprising only a few hydrophobic residues in the strands.
Collapse
Affiliation(s)
- Stefan M Hofmann
- BioMolecular Optics and Center for Integrated Protein Science, Faculty of Physics, Ludwig-Maximilians-Universität München, Oettingenstr. 67, München 80538, Germany
| | - Christina V Frost
- TUM Department of Physics T38, Technical University of Munich, James-Franck-Str. 1, Garching 85748, Germany
| | - Tom Podewin
- Department of Organic Chemistry and Center for Integrated Protein Science, Ludwig-Maximilians-Universität München, Butenandtstr. 5-13, München 81377, Germany
| | - Manuel Gailer
- Department of Organic Chemistry and Center for Integrated Protein Science, Ludwig-Maximilians-Universität München, Butenandtstr. 5-13, München 81377, Germany
| | - Elisa Weber
- BioMolecular Optics and Center for Integrated Protein Science, Faculty of Physics, Ludwig-Maximilians-Universität München, Oettingenstr. 67, München 80538, Germany
| | - Martin Zacharias
- TUM Department of Physics T38, Technical University of Munich, James-Franck-Str. 1, Garching 85748, Germany
| | - Wolfgang Zinth
- BioMolecular Optics and Center for Integrated Protein Science, Faculty of Physics, Ludwig-Maximilians-Universität München, Oettingenstr. 67, München 80538, Germany
| | - Anja Hoffmann-Röder
- Department of Organic Chemistry and Center for Integrated Protein Science, Ludwig-Maximilians-Universität München, Butenandtstr. 5-13, München 81377, Germany
| |
Collapse
|
36
|
Duan R, Qiu L, Xu X, Ma Z, Merideth BR, Shyu CR, Zou X. Performance of human and server prediction in CAPRI rounds 38-45. Proteins 2020; 88:1110-1120. [PMID: 32483825 DOI: 10.1002/prot.25956] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 03/26/2020] [Accepted: 05/27/2020] [Indexed: 11/11/2022]
Abstract
CAPRI challenges offer a variety of blind tests for protein-protein interaction prediction. In CAPRI Rounds 38-45, we generated a set of putative binding modes for each target with an FFT-based docking algorithm, and then scored and ranked these binding modes with a proprietary scoring function, ITScorePP. We have also developed a novel web server, Rebipp. The algorithm utilizes information retrieval to identify relevant biological information to significantly reduce the search space for a particular protein. In parallel, we have also constructed a GPU-based docking server, MDockPP, for protein-protein complex structure prediction. Here, the performance of our protocol in CAPRI rounds 38-45 is reported, which include 16 docking and scoring targets. Among them, three targets contain multiple interfaces: Targets 124, 125, and 136 have 2, 4, and 3 interfaces, respectively. In the predictor experiments, we predicted correct binding modes for nine targets, including one high-accuracy interface, six medium-accuracy binding modes, and six acceptable-accuracy binding modes. For the docking server prediction experiments, we predicted correct binding modes for eight targets, including one high-accuracy, three medium-accuracy, and five acceptable-accuracy binding modes.
Collapse
Affiliation(s)
- Rui Duan
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Liming Qiu
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Xianjin Xu
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
| | - Zhiwei Ma
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA.,Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
| | - Benjamin Ryan Merideth
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA.,Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA
| | - Chi-Ren Shyu
- Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA.,Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
| | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA.,Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA.,Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA.,Department of Biochemistry, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
37
|
Orengo C, Velankar S, Wodak S, Zoete V, Bonvin AMJJ, Elofsson A, Feenstra KA, Gerloff DL, Hamelryck T, Hancock JM, Helmer-Citterich M, Hospital A, Orozco M, Perrakis A, Rarey M, Soares C, Sussman JL, Thornton JM, Tuffery P, Tusnady G, Wierenga R, Salminen T, Schneider B. A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community). F1000Res 2020; 9. [PMID: 32566135 PMCID: PMC7284151 DOI: 10.12688/f1000research.20559.1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/05/2020] [Indexed: 12/11/2022] Open
Abstract
Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests.
Collapse
Affiliation(s)
- Christine Orengo
- Structural and Molecular Biology Department, University College, London, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, CB10 1SD, UK
| | - Shoshana Wodak
- VIB-VUB Center for Structural Biology, Brussels, Belgium
| | - Vincent Zoete
- Department of Oncology, Lausanne University, Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Alexandre M J J Bonvin
- Bijvoet Center, Faculty of Science - Chemistry, Utrecht University, Utrecht, 3584CH, The Netherlands
| | - Arne Elofsson
- Science for Life Laboratory, Stockholm University, Solna, S-17121, Sweden
| | - K Anton Feenstra
- Dept. Computer Science, Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit, Amsterdam, 1081 HV, The Netherlands
| | - Dietland L Gerloff
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Thomas Hamelryck
- Bioinformatics center, Department of Biology, University of Copenhagen, Copenhagen, DK-2200, Denmark
| | | | | | - Adam Hospital
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, 08028, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, 08028, Spain
| | | | - Matthias Rarey
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, D-20146, Germany
| | - Claudio Soares
- Instituto de Tecnologia Química e Biológica Antonio Xavier, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Joel L Sussman
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, CB10 1SD, UK
| | - Pierre Tuffery
- Ressource Parisienne en Bioinformatique Structurale, Université de Paris, Paris, F-75205, France
| | - Gabor Tusnady
- Membrane Bioinformatics Research Group, Institute of Enzymology, Budapest, H-1117, Hungary
| | | | - Tiina Salminen
- Structural Bioinformatics Laboratory, Åbo Akademi University, Turku, FI-20500, Finland
| | - Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, Vestec, CZ-25250, Czech Republic
| |
Collapse
|
38
|
Koukos P, Bonvin A. Integrative Modelling of Biomolecular Complexes. J Mol Biol 2020; 432:2861-2881. [DOI: 10.1016/j.jmb.2019.11.009] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2019] [Revised: 11/12/2019] [Accepted: 11/13/2019] [Indexed: 12/31/2022]
|
39
|
Kong R, Liu R, Xu X, Zhang D, Xu X, Shi H, Chang S. Template‐based modeling and ab‐initio docking using
CoDock
in
CAPRI. Proteins 2020; 88:1100-1109. [DOI: 10.1002/prot.25892] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 12/21/2019] [Accepted: 03/07/2020] [Indexed: 01/11/2023]
Affiliation(s)
- Ren Kong
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
| | - Ran‐Ran Liu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
| | - Xi‐Ming Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
- Innovation Center for Marine Drug Screening and Evaluation, Qingdao National Laboratory for Marine Science and Technology Qingdao China
| | - Da‐Wei Zhang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
| | - Xiao‐Shuang Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
| | - Hang Shi
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
| | - Shan Chang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou China
- Innovation Center for Marine Drug Screening and Evaluation, Qingdao National Laboratory for Marine Science and Technology Qingdao China
| |
Collapse
|
40
|
Singh A, Dauzhenka T, Kundrotas PJ, Sternberg MJE, Vakser IA. Application of docking methodologies to modeled proteins. Proteins 2020; 88:1180-1188. [PMID: 32170770 DOI: 10.1002/prot.25889] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 02/15/2020] [Accepted: 03/07/2020] [Indexed: 12/12/2022]
Abstract
Protein docking is essential for structural characterization of protein interactions. Besides providing the structure of protein complexes, modeling of proteins and their complexes is important for understanding the fundamental principles and specific aspects of protein interactions. The accuracy of protein modeling, in general, is still less than that of the experimental approaches. Thus, it is important to investigate the applicability of docking techniques to modeled proteins. We present new comprehensive benchmark sets of protein models for the development and validation of protein docking, as well as a systematic assessment of free and template-based docking techniques on these sets. As opposed to previous studies, the benchmark sets reflect the real case modeling/docking scenario where the accuracy of the models is assessed by the modeling procedure, without reference to the native structure (which would be unknown in practical applications). We also expanded the analysis to include docking of protein pairs where proteins have different structural accuracy. The results show that, in general, the template-based docking is less sensitive to the structural inaccuracies of the models than the free docking. The near-native docking poses generated by the template-based approach, typically, also have higher ranks than those produces by the free docking (although the free docking is indispensable in modeling the multiplicity of protein interactions in a crowded cellular environment). The results show that docking techniques are applicable to protein models in a broad range of modeling accuracy. The study provides clear guidelines for practical applications of docking to protein models.
Collapse
Affiliation(s)
- Amar Singh
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Taras Dauzhenka
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Petras J Kundrotas
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Michael J E Sternberg
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, South Kensington, London, UK
| | - Ilya A Vakser
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA.,Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, USA
| |
Collapse
|
41
|
A topology-based network tree for the prediction of protein-protein binding affinity changes following mutation. NAT MACH INTELL 2020; 2:116-123. [PMID: 34170981 PMCID: PMC7223817 DOI: 10.1038/s42256-020-0149-6] [Citation(s) in RCA: 83] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 01/10/2020] [Indexed: 12/14/2022]
Abstract
The ability to predict protein-protein interactions is crucial to our understanding of a wide range of biological activities and functions in the human body, and for guiding drug discovery. Despite considerable efforts to develop suitable computational methods, predicting protein-protein interaction binding affinity changes following mutation (ΔΔG) remains a severe challenge. Algebraic topology, a champion in recent worldwide competitions for protein-ligand binding affinity predictions, is a promising approach to simplifying the complexity of biological structures. Here we introduce element- and site-specific persistent homology (a new branch of algebraic topology) to simplify the structural complexity of protein-protein complexes and embed crucial biological information into topological invariants. We also propose a new deep learning algorithm called NetTree to take advantage of convolutional neural networks and gradient-boosting trees. A topology-based network tree is constructed by integrating the topological representation and NetTree for predicting protein-protein interaction ΔΔG. Tests on major benchmark datasets indicate that the proposed topology-based network tree is an important improvement over the current state of the art in predicting ΔΔG.
Collapse
|
42
|
Yan Y, He J, Feng Y, Lin P, Tao H, Huang SY. Challenges and opportunities of automated protein-protein docking: HDOCK server vs human predictions in CAPRI Rounds 38-46. Proteins 2020; 88:1055-1069. [PMID: 31994779 DOI: 10.1002/prot.25874] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 01/02/2020] [Accepted: 01/22/2020] [Indexed: 12/12/2022]
Abstract
Protein-protein docking plays an important role in the computational prediction of the complex structure between two proteins. For years, a variety of docking algorithms have been developed, as witnessed by the critical assessment of prediction interactions (CAPRI) experiments. However, despite their successes, many docking algorithms often require a series of manual operations like modeling structures from sequences, incorporating biological information, and selecting final models. The difficulties in these manual steps have significantly limited the applications of protein-protein docking, as most of the users in the community are nonexperts in docking. Therefore, automated docking like a web server, which can give a comparable performance to human docking protocol, is pressingly needed. As such, we have participated in the blind CAPRI experiments for Rounds 38-45 and CASP13-CAPRI challenge for Round 46 with both our HDOCK automated docking web server and human docking protocol. It was shown that our HDOCK server achieved an "acceptable" or higher CAPRI-rated model in the top 10 submitted predictions for 65.5% and 59.1% of the targets in the docking experiments of CAPRI and CASP13-CAPRI, respectively, which are comparable to 66.7% and 54.5% for human docking protocol. Similar trends can also be observed in the scoring experiments. These results validated our HDOCK server as an efficient automated docking protocol for nonexpert users. Challenges and opportunities of automated docking are also discussed.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| |
Collapse
|
43
|
Chakravarty D, McElfresh GW, Kundrotas PJ, Vakser IA. How to choose templates for modeling of protein complexes: Insights from benchmarking template-based docking. Proteins 2020; 88:1070-1081. [PMID: 31994759 DOI: 10.1002/prot.25875] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 01/07/2020] [Accepted: 01/22/2020] [Indexed: 01/01/2023]
Abstract
Comparative docking is based on experimentally determined structures of protein-protein complexes (templates), following the paradigm that proteins with similar sequences and/or structures form similar complexes. Modeling utilizing structure similarity of target monomers to template complexes significantly expands structural coverage of the interactome. Template-based docking by structure alignment can be performed for the entire structures or by aligning targets to the bound interfaces of the experimentally determined complexes. Systematic benchmarking of docking protocols based on full and interface structure alignment showed that both protocols perform similarly, with top 1 docking success rate 26%. However, in terms of the models' quality, the interface-based docking performed marginally better. The interface-based docking is preferable when one would suspect a significant conformational change in the full protein structure upon binding, for example, a rearrangement of the domains in multidomain proteins. Importantly, if the same structure is selected as the top template by both full and interface alignment, the docking success rate increases 2-fold for both top 1 and top 10 predictions. Matching structural annotations of the target and template proteins for template detection, as a computationally less expensive alternative to structural alignment, did not improve the docking performance. Sophisticated remote sequence homology detection added templates to the pool of those identified by structure-based alignment, suggesting that for practical docking, the combination of the structure alignment protocols and the remote sequence homology detection may be useful in order to avoid potential flaws in generation of the structural templates library.
Collapse
Affiliation(s)
| | - G W McElfresh
- Computational Biology Program, The University of Kansas, Lawrence, Kansas
| | - Petras J Kundrotas
- Computational Biology Program, The University of Kansas, Lawrence, Kansas
| | - Ilya A Vakser
- Computational Biology Program, The University of Kansas, Lawrence, Kansas.,Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas
| |
Collapse
|
44
|
Lensink MF, Nadzirin N, Velankar S, Wodak SJ. Modeling protein‐protein, protein‐peptide, and protein‐oligosaccharide complexes: CAPRI 7th edition. Proteins 2020; 88:916-938. [DOI: 10.1002/prot.25870] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 12/19/2019] [Accepted: 12/26/2019] [Indexed: 12/19/2022]
Affiliation(s)
- Marc F. Lensink
- University of Lille, CNRS UMR8576 UGSF, Unité de Glycobiologie Structurale et Fonctionnelle F‐59000 Lille France
| | - Nurul Nadzirin
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI), Wellcome Trust Genome Campus Cambridge UK
| | - Sameer Velankar
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI), Wellcome Trust Genome Campus Cambridge UK
| | | |
Collapse
|
45
|
Geng C, Jung Y, Renaud N, Honavar V, Bonvin AMJJ, Xue LC. iScore: a novel graph kernel-based function for scoring protein-protein docking models. Bioinformatics 2020; 36:112-121. [PMID: 31199455 PMCID: PMC6956772 DOI: 10.1093/bioinformatics/btz496] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 05/08/2019] [Accepted: 06/11/2019] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Protein complexes play critical roles in many aspects of biological functions. Three-dimensional (3D) structures of protein complexes are critical for gaining insights into structural bases of interactions and their roles in the biomolecular pathways that orchestrate key cellular processes. Because of the expense and effort associated with experimental determinations of 3D protein complex structures, computational docking has evolved as a valuable tool to predict 3D structures of biomolecular complexes. Despite recent progress, reliably distinguishing near-native docking conformations from a large number of candidate conformations, the so-called scoring problem, remains a major challenge. RESULTS Here we present iScore, a novel approach to scoring docked conformations that combines HADDOCK energy terms with a score obtained using a graph representation of the protein-protein interfaces and a measure of evolutionary conservation. It achieves a scoring performance competitive with, or superior to, that of state-of-the-art scoring functions on two independent datasets: (i) Docking software-specific models and (ii) the CAPRI score set generated by a wide variety of docking approaches (i.e. docking software-non-specific). iScore ranks among the top scoring approaches on the CAPRI score set (13 targets) when compared with the 37 scoring groups in CAPRI. The results demonstrate the utility of combining evolutionary, topological and energetic information for scoring docked conformations. This work represents the first successful demonstration of graph kernels to protein interfaces for effective discrimination of near-native and non-native conformations of protein complexes. AVAILABILITY AND IMPLEMENTATION The iScore code is freely available from Github: https://github.com/DeepRank/iScore (DOI: 10.5281/zenodo.2630567). And the docking models used are available from SBGrid: https://data.sbgrid.org/dataset/684). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Cunliang Geng
- Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht 3584 CH, The Netherlands
| | - Yong Jung
- Bioinformatics & Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
- Artificial Intelligence Research Laboratory, Pennsylvania State University, University Park, PA 16823, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Nicolas Renaud
- Netherlands eScience Center, Amsterdam 1098 XG, The Netherlands
| | - Vasant Honavar
- Bioinformatics & Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
- Artificial Intelligence Research Laboratory, Pennsylvania State University, University Park, PA 16823, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Center for Big Data Analytics and Discovery Informatics, Pennsylvania State University, University Park, PA 16823, USA
- Institute for Cyberscience, University Park, PA 16802, USA
- Clinical and Translational Sciences Institute, University Park, PA 16802, USA
- College of Information Sciences & Technology, Pennsylvania State University, University Park, PA 16802, USA
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht 3584 CH, The Netherlands
| | - Li C Xue
- Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht 3584 CH, The Netherlands
| |
Collapse
|
46
|
Alekseenko A, Ignatov M, Jones G, Sabitova M, Kozakov D. Protein-Protein and Protein-Peptide Docking with ClusPro Server. Methods Mol Biol 2020; 2165:157-174. [PMID: 32621224 DOI: 10.1007/978-1-0716-0708-4_9] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The process of creating a model of the structure formed by a pair of interacting molecules is commonly referred to as docking. Protein docking is one of the most studied topics in computational and structural biology with applications to drug design and beyond. In this chapter, we describe ClusPro, a web server for protein-protein and protein-peptide docking. As an input, the server requires two Protein Data Bank (PDB) files (protein-protein mode) or a PDB file for the protein and a sequence for the ligand (protein-peptide mode). Its output consists of ten models of the resulting structure formed by the two objects upon interaction. The server typically produces results in less than 4 h. The server also provides tools (via "Advanced Options" list) for a user to fine-tune the results using any additional knowledge about the interaction process, e.g., small-angle X-ray scattering (SAXS) profile or distance restraints.
Collapse
Affiliation(s)
- Andrey Alekseenko
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA.,Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA.,Institute of Computer Aided Design of the Russian Academy of Sciences, Moscow, Russia
| | - Mikhail Ignatov
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA.,Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA.,Institute of Computer Aided Design of the Russian Academy of Sciences, Moscow, Russia.,Institute for Advanced Computational Sciences, Stony Brook University, Stony Brook, NY, USA
| | - George Jones
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA.,Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA
| | - Maria Sabitova
- Department of Mathematics, Queens College and CUNY Graduate Center, Flushing, NY, USA
| | - Dima Kozakov
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA. .,Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA. .,Institute for Advanced Computational Sciences, Stony Brook University, Stony Brook, NY, USA.
| |
Collapse
|
47
|
Koukos PI, Roel-Touris J, Ambrosetti F, Geng C, Schaarschmidt J, Trellet ME, Melquiond ASJ, Xue LC, Honorato RV, Moreira I, Kurkcuoglu Z, Vangone A, Bonvin AMJJ. An overview of data-driven HADDOCK strategies in CAPRI rounds 38-45. Proteins 2019; 88:1029-1036. [PMID: 31886559 DOI: 10.1002/prot.25869] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 12/17/2019] [Accepted: 12/26/2019] [Indexed: 01/18/2023]
Abstract
Our information-driven docking approach HADDOCK has demonstrated a sustained performance since the start of its participation to CAPRI. This is due, in part, to its ability to integrate data into the modeling process, and to the robustness of its scoring function. We participated in CAPRI both as server and manual predictors. In CAPRI rounds 38-45, we have used various strategies depending on the available information. These ranged from imposing restraints to a few residues identified from literature as being important for the interaction, to binding pockets identified from homologous complexes or template-based refinement/CA-CA restraint-guided docking from identified templates. When relevant, symmetry restraints were used to limit the conformational sampling. We also tested for a large decamer target a new implementation of the MARTINI coarse-grained force field in HADDOCK. Overall, we obtained acceptable or better predictions for 13 and 11 server and manual submissions, respectively, out of the 22 interfaces. Our server performance (acceptable or higher-quality models when considering the top 10) was better (59%) than the manual (50%) one, in which we typically experiment with various combinations of protocols and data sources. Again, our simple scoring function based on a linear combination of intermolecular van der Waals and electrostatic energies and an empirical desolvation term demonstrated a good performance in the scoring experiment with a 63% success rate across all 22 interfaces. An analysis of model quality indicates that, while we are consistently performing well in generating acceptable models, there is room for improvement for generating/identifying higher quality models.
Collapse
Affiliation(s)
- Panagiotis I Koukos
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Jorge Roel-Touris
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Francesco Ambrosetti
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands.,Department of Physics, Sapienza University, Rome, Italy
| | - Cunliang Geng
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Jörg Schaarschmidt
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands.,Multiscale Materials Modelling and Virtual Design, Institute of Nanotechnology, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
| | - Mikael E Trellet
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Adrien S J Melquiond
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Li C Xue
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Rodrigo V Honorato
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Irina Moreira
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands.,CNC-Center for Neuroscience and Cell Biology, Rua Larga, FMUC, Polo I, 1° andar, Universidade de Coimbra, Coimbra, Portugal
| | - Zeynep Kurkcuoglu
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Anna Vangone
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| | - Alexandre M J J Bonvin
- Faculty of Science, Department of Chemistry, Bijvoet Center for Biomolecular Research, Computational Structural Biology Group, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
48
|
Hadi-Alijanvand H. Complex Stability is Encoded in Binding Patch Softness: a Monomer-Based Approach to Predict Inter-Subunit Affinity of Protein Dimers. J Proteome Res 2019; 19:409-423. [PMID: 31795635 DOI: 10.1021/acs.jproteome.9b00594] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Knowledge about the structure and stability of protein-protein interactions is vital to decipher the behavior of protein systems. Prediction of protein complexes' stability is an interesting topic in the field of structural biology. There are some promising published computational approaches that predict the affinity between subunits of protein dimers using 3D structures of both subunits. In the current study, we classify protein complexes with experimentally measured affinities into distinct classes with different mean affinities. By predicting the mechanical stiffness of the protein binding patch (PBP) region on a single subunit, we successfully predict the assigned affinity class of the PBP in the classification step. Now to predict the experimentally measured affinity between protein monomers in solution, we just need the 3D structure of the suggested PBP on one subunit of the proposed dimer. We designed the SEPAS software and have made the software freely available for academic non-commercial research purposes at " http://biophysics.ir/affinity ". SEPAS predicts the stability of the intended dimer in a classwise manner by utilizing the computed mechanical stiffness of the introduced binding site on one subunit with the minimum accuracy of 0.72.
Collapse
Affiliation(s)
- Hamid Hadi-Alijanvand
- Department of Biological Sciences , Institute for Advanced Studies in Basic Sciences (IASBS) , Zanjan 45137-66731 , Iran
| |
Collapse
|
49
|
Terayama K, Shinobu A, Tsuda K, Takemura K, Kitao A. evERdock BAI: Machine-learning-guided selection of protein-protein complex structure. J Chem Phys 2019; 151:215104. [DOI: 10.1063/1.5129551] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Kei Terayama
- RIKEN Center for Advanced Intelligence Project, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan
- Medical Sciences Innovation Hub Program, RIKEN Cluster for Science, Technology and Innovation Hub, Tsurumi-ku, Kanagawa 230-0045, Japan
- Graduate School of Medicine, Kyoto University, Shogoin-kawaharacho, Sakyo-ku, Kyoto 606-8507, Japan
| | - Ai Shinobu
- School of Life Sciences and Technology, Tokyo Institute of Technology, 2-12-1, Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Koji Tsuda
- RIKEN Center for Advanced Intelligence Project, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan
- Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwa-no-ha, Kashiwa, Chiba 277-8561, Japan
- Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, Ibaraki 305-0047, Japan
| | - Kazuhiro Takemura
- School of Life Sciences and Technology, Tokyo Institute of Technology, 2-12-1, Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Akio Kitao
- School of Life Sciences and Technology, Tokyo Institute of Technology, 2-12-1, Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| |
Collapse
|
50
|
Chiu ML, Goulet DR, Teplyakov A, Gilliland GL. Antibody Structure and Function: The Basis for Engineering Therapeutics. Antibodies (Basel) 2019; 8:antib8040055. [PMID: 31816964 PMCID: PMC6963682 DOI: 10.3390/antib8040055] [Citation(s) in RCA: 222] [Impact Index Per Article: 44.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 11/25/2019] [Accepted: 11/28/2019] [Indexed: 12/11/2022] Open
Abstract
Antibodies and antibody-derived macromolecules have established themselves as the mainstay in protein-based therapeutic molecules (biologics). Our knowledge of the structure–function relationships of antibodies provides a platform for protein engineering that has been exploited to generate a wide range of biologics for a host of therapeutic indications. In this review, our basic understanding of the antibody structure is described along with how that knowledge has leveraged the engineering of antibody and antibody-related therapeutics having the appropriate antigen affinity, effector function, and biophysical properties. The platforms examined include the development of antibodies, antibody fragments, bispecific antibody, and antibody fusion products, whose efficacy and manufacturability can be improved via humanization, affinity modulation, and stability enhancement. We also review the design and selection of binding arms, and avidity modulation. Different strategies of preparing bispecific and multispecific molecules for an array of therapeutic applications are included.
Collapse
Affiliation(s)
- Mark L. Chiu
- Drug Product Development Science, Janssen Research & Development, LLC, Malvern, PA 19355, USA
- Correspondence:
| | - Dennis R. Goulet
- Department of Medicinal Chemistry, University of Washington, P.O. Box 357610, Seattle, WA 98195-7610, USA;
| | - Alexey Teplyakov
- Biologics Research, Janssen Research & Development, LLC, Spring House, PA 19477, USA; (A.T.); (G.L.G.)
| | - Gary L. Gilliland
- Biologics Research, Janssen Research & Development, LLC, Spring House, PA 19477, USA; (A.T.); (G.L.G.)
| |
Collapse
|