1
Baindara P, Roy D, Mandal SM. CycP: A Novel Self-Assembled Vesicle-Forming Cyclic Antimicrobial Peptide to Control Drug-Resistant S. aureus. Bioengineering (Basel) 2024; 11:855. [PMID: 39199812 PMCID: PMC11351190 DOI: 10.3390/bioengineering11080855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2024] [Revised: 08/14/2024] [Accepted: 08/19/2024] [Indexed: 09/01/2024] Open
Abstract
Antimicrobial peptides (AMPs) are considered a promising alternative to conventional antibiotics in the fight against the rapid evolution of antibiotic resistance. Beyond their potent antimicrobial properties, AMP-based vesicles can be used as efficient drug-delivery vehicles. In the present study, we synthesized and characterized a new cyclic AMP (CycP), consisting of all-hydrophobic cores, with antimicrobial activity against S. aureus. Interestingly, CycP undergoes supramolecular self-assembly, and the self-assembled CycP (sCycP) vesicles were characterized by electron microscopy; however, these vesicles did not display antimicrobial activity. Next, sCycP vesicles were used in combination with SXT (sulfamethoxazole-trimethoprim) vesicles to check the drug loading and delivery capacity of sCycP vesicles to bacterial cell membranes. Interestingly, sCycP vesicles showed synergistic action with SXT vesicles, resulting in a significant reduction in MIC against S. aureus. Further, electron microscopy confirmed the membrane-specific killing mechanism of SXT-loaded sCycP vesicles. Additionally, CycP showed high binding affinity for the β-lactamase of S. aureus, which may be one of its antimicrobial mechanisms of action. Overall, the results suggest that CycP is a novel self-assembled, dual-action cyclic AMP with non-cytotoxic properties that can be used alone as an AMP or as a self-assembled drug delivery vehicle for antibiotics to combat S. aureus infections.
Affiliation(s)
- Piyush Baindara
- Animal Sciences Research Center, Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
- Dinata Roy
- Department of Zoology, Mizoram University, Aizawl 796004, India
- Santi M. Mandal
- Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
- Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, USA
2
Liang S, Zhang C, Zhu M. Ab Initio Prediction of 3-D Conformations for Protein Long Loops with High Accuracy and Applications to Antibody CDRH3 Modeling. J Chem Inf Model 2023; 63:7568-7577. [PMID: 38018130 DOI: 10.1021/acs.jcim.3c01051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2023]
Abstract
Residue-level potentials of mean force have been widely used for protein backbone refinement to avoid simultaneous sampling of side-chain conformations. However, the interaction energy between the reduced side chains and backbone atoms was not considered explicitly. In this study, we developed novel methods to calculate the residue-atom interaction energy in combination with atomic and residue-level terms. The parameters were optimized step by step to remove the overcounting or overlap problem between different energy terms. The mixed energy functions were then used to evaluate the generated backbone conformations at the initial sampling stage of protein loop modeling (OSCAR-loop), including the interaction energy between the reduced loop residues and full atoms of the protein framework. The accuracies of top-ranked decoys were 1.18 and 2.81 Å for 8-residue and 12-residue loops, respectively. We then selected diverse decoys for side-chain modeling, backbone refinement, and energy minimization. The procedure was repeated multiple times to select the one prediction with the lowest energy. Consequently, we obtained an accuracy of 0.74 Å for a prevailing test set of 12-residue loops, compared with >1.4 Å reported by other researchers. OSCAR-loop was also effective for modeling the H3 loops of antibody complementarity-determining regions (CDRs) in the crystal environment. The prediction accuracy of OSCAR-loop (1.74 Å) was better than that of the Rosetta NGK method (3.11 Å) or those achieved by deep learning methods (>2.2 Å) for the CDRH3 loops of 49 targets in the Rosetta antibody benchmark. The performance of OSCAR-loop in a model environment is also discussed.
Affiliation(s)
- Shide Liang
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
- Department of Research and Development, Bio-Thera Solutions, Guangzhou 510530, P. R. China
- Chi Zhang
- School of Biological Sciences, University of Nebraska, Lincoln, Nebraska 68588, United States
- Mingfu Zhu
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
3
Dinata R, Baindara P. Laterosporulin25: A probiotically produced, novel defensin-like bacteriocin and its immunogenic properties. Int Immunopharmacol 2023; 121:110500. [PMID: 37352569 DOI: 10.1016/j.intimp.2023.110500] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/21/2023] [Revised: 06/01/2023] [Accepted: 06/11/2023] [Indexed: 06/25/2023]
Abstract
Although multiple vaccines have been developed against infectious diseases, the rapid emergence of new pathogens creates an urgent need for novel strategies to combat them. Antimicrobial peptides (AMPs) are excellent agents against infectious diseases, with multiple unique mechanisms of action against various pathogens. Apart from their direct applications, AMPs can also be developed as subunit vaccines or used as highly immunogenic carrier proteins with highly antigenic but non-immunogenic antigens. In the present study, we identified a novel defensin-like bacteriocin, laterosporulin25 (LS25), by genome mining of Brevibacillus laterosporus DSM25, a probiotic bacterial strain. Using immunoinformatic tools, we studied the immunogenic and physicochemical properties of LS25. LS25 is characterized as a defensin-like bacteriocin with 51 amino acids and a molecular weight of 5862.7 Da. The modeled tertiary structure of LS25 was docked with TLR3 and the TLR4-MD2 complex to confirm the facilitation of an induced immune response, which was further validated using molecular dynamics simulations and in silico immune stimulations. Overall, detailed immunoinformatics analysis suggested LS25 as a potential candidate for use as an adjuvant or carrier protein in subunit vaccine development; however, further in vitro and in vivo experiments are essential to validate its potential.
Affiliation(s)
- Roy Dinata
- Department of Zoology, Mizoram University, Aizawl, Mizoram 796004, India
- Piyush Baindara
- Department of Radiation Oncology, School of Medicine, University of Missouri, Columbia, MO 65211, USA
4
Krivacic C, Kundert K, Pan X, Pache RA, Liu L, Conchúir SO, Jeliazkov JR, Gray JJ, Thompson MC, Fraser JS, Kortemme T. Accurate positioning of functional residues with robotics-inspired computational protein design. Proc Natl Acad Sci U S A 2022; 119:e2115480119. [PMID: 35254891 PMCID: PMC8931229 DOI: 10.1073/pnas.2115480119] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 10/08/2021] [Accepted: 01/27/2022] [Indexed: 11/18/2022] Open
Abstract
Significance: Computational protein design promises to advance applications in medicine and biotechnology by creating proteins with many new and useful functions. However, new functions require the design of specific and often irregular atom-level geometries, which remains a major challenge. Here, we develop computational methods that design and predict local protein geometries with greater accuracy than existing methods. Then, as a proof of concept, we leverage these methods to design new protein conformations in the enzyme ketosteroid isomerase that change the protein's preference for a key functional residue. Our computational methods are openly accessible and can be applied to the design of other intricate geometries customized for new user-defined protein functions.
Affiliation(s)
- Cody Krivacic
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Kale Kundert
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
- Xingjie Pan
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Roland A. Pache
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Lin Liu
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Shane O Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218
- Michael C. Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- James S. Fraser
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
- Tanja Kortemme
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
5
In Silico Design and Validation of OvMANE1, a Chimeric Antigen for Human Onchocerciasis Diagnosis. Pathogens 2020; 9:pathogens9060495. [PMID: 32580355 PMCID: PMC7350323 DOI: 10.3390/pathogens9060495] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 05/22/2020] [Revised: 06/12/2020] [Accepted: 06/18/2020] [Indexed: 02/06/2023] Open
Abstract
The public health goal for onchocerciasis in Africa has advanced from control to elimination. In this light, accurate diagnosis is necessary to determine treatment endpoints and confirm elimination, as well as to conduct surveillance for the identification of any possible recrudescence of the disease. Currently, the monitoring of onchocerciasis elimination relies on the Ov-16 test. However, this test is unable to discriminate between past and active infections. Furthermore, about 15-25% of infected persons are reported to be negative for the Ov-16 test, giving a misleading sense of security to false-negative individuals who might continue to serve as reservoirs for infections. Therefore, we opted to design and validate a more sensitive and specific chimeric antigen (OvMANE1) for onchocerciasis diagnosis, using previously reported immunodominant peptides of O. volvulus, the parasite responsible for the disease. In silico analysis of OvMANE1 predicted it to be more antigenic than its individual peptides. We observed that OvMANE1 reacts specifically and differentially with sera from O. volvulus-infected and non-infected individuals, as well as with sera from communities with different levels of endemicity. Moreover, we found that total IgG, unlike the IgG4 subclass, responded positively to OvMANE1, strongly suggesting its complementarity to the Ov-16 diagnostic tool, which detects Ov-16 IgG4 antibodies. Overall, OvMANE1 exhibited the potential to be utilized in the development of specific diagnostic tools, based on both antibody capture and antigen capture reactions, which are indispensable for monitoring the progress of onchocerciasis elimination programs.
6
Hooper WF, Walcott BD, Wang X, Bystroff C. Fast design of arbitrary length loops in proteins using InteractiveRosetta. BMC Bioinformatics 2018; 19:337. [PMID: 30249181 PMCID: PMC6154894 DOI: 10.1186/s12859-018-2345-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 05/17/2018] [Accepted: 08/29/2018] [Indexed: 11/10/2022] Open
Abstract
Background: With increasing interest in ab initio protein design, there is a desire to be able to fully explore the design space of insertions and deletions. Nature inserts and deletes residues to optimize energy and function, but allowing variable-length indels in the context of an interactive protein design session presents challenges with regard to speed and accuracy.
Results: Here we present a new module (INDEL) for InteractiveRosetta which allows the user to specify a range of lengths for a desired indel, and which returns a set of low-energy backbones in a matter of seconds. To make the loop search fast, loop anchor points are geometrically hashed using Cα-Cα and Cβ-Cβ distances, and the hash is mapped to start and end points in a pre-compiled random access file of non-redundant protein backbone coordinates. Loops with superposable anchors are filtered for collisions and returned to InteractiveRosetta as poly-alanine for display and selective incorporation into the design template. Sidechains can then be added using RosettaDesign tools.
Conclusions: INDEL was able to find viable loops in 100% of 500 attempts for all lengths from 3 to 20 residues. INDEL has been applied to the task of designing a domain-swapping loop for T7-endonuclease I, changing its specificity from Holliday junctions to paranemic crossover (PX) DNA.
Electronic supplementary material: The online version of this article (10.1186/s12859-018-2345-5) contains supplementary material, which is available to authorized users.
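The anchor-hashing idea described in the Results can be illustrated with a minimal sketch. This is not INDEL's implementation: the bin width, the `fragments` record layout, and the function names are all hypothetical, and an in-memory dictionary stands in for the pre-compiled random access file.

```python
import math
from collections import defaultdict

BIN_WIDTH = 0.5  # Å; hypothetical bin size, not taken from the paper


def anchor_key(ca_start, ca_end, cb_start, cb_end, bin_width=BIN_WIDTH):
    """Discretize the Cα-Cα and Cβ-Cβ anchor distances into a hash key."""
    ca_bin = int(math.dist(ca_start, ca_end) // bin_width)
    cb_bin = int(math.dist(cb_start, cb_end) // bin_width)
    return (ca_bin, cb_bin)


def build_index(fragments):
    """Map each key to the (start, end) offsets of fragments with that geometry."""
    index = defaultdict(list)
    for frag in fragments:
        key = anchor_key(frag["ca_start"], frag["ca_end"],
                         frag["cb_start"], frag["cb_end"])
        index[key].append((frag["start"], frag["end"]))
    return index


def find_candidates(index, ca_start, ca_end, cb_start, cb_end):
    """Return offsets of stored fragments whose anchors fall in the same bin."""
    return index.get(anchor_key(ca_start, ca_end, cb_start, cb_end), [])


# Illustrative data only: one stored fragment and one query gap.
fragments = [{
    "start": 120, "end": 128,
    "ca_start": (0.0, 0.0, 0.0), "ca_end": (6.1, 0.0, 0.0),
    "cb_start": (0.0, 1.5, 0.0), "cb_end": (5.2, 1.5, 0.0),
}]
index = build_index(fragments)
hits = find_candidates(index, (0.0, 0.0, 0.0), (6.3, 0.0, 0.0),
                       (0.0, 1.5, 0.0), (5.4, 1.5, 0.0))
```

A lookup of this kind costs constant time per query, which is consistent with the seconds-scale search the abstract reports. A practical version would probably also probe neighboring bins, since two nearly identical distances can fall on opposite sides of a bin boundary.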
Affiliation(s)
- William F Hooper
- Emmes Corporation, Rockville, Washington, MD, USA
- Department of Biology, Rensselaer Polytechnic Institute, Troy, NY, USA
- Xing Wang
- Department of Chemistry and Chemical Biology, Rensselaer Polytechnic Institute, Troy, NY, USA
- Christopher Bystroff
- Department of Biology, Rensselaer Polytechnic Institute, Troy, NY, USA
- Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, USA
7
Investigation of immunogenic properties of Hemolin from silkworm, Bombyx mori as carrier protein: an immunoinformatic approach. Sci Rep 2018; 8:6957. [PMID: 29725106 PMCID: PMC5934409 DOI: 10.1038/s41598-018-25374-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 12/29/2017] [Accepted: 04/20/2018] [Indexed: 11/08/2022] Open
Abstract
Infectious diseases are the major cause of high mortality among infants and geriatric patients. Vaccines are the only weapon in our arsenal to defend ourselves against innumerable infectious diseases. Though a myriad of vaccines are available, countless people still die due to microbial infections. Subunit vaccines are an effective strategy of vaccine development, combining a highly immunogenic carrier protein with highly antigenic but non-immunogenic antigens (haptens). In this study, we attempted to use immunoinformatic tools for carrier protein development. Immunogenic mediators (T-cell, B-cell, and IFN-γ epitopes) and physicochemical properties of the hemolin protein of the silkworm, Bombyx mori, were studied. Hemolin was found to be non-allergenic and highly antigenic in nature. The refined tertiary structure of modelled hemolin was docked against TLR3 and the TLR4-MD2 complex. The molecular dynamics study emphasized the stable microscopic interaction between hemolin and the TLRs. In silico cloning and codon optimization were carried out for effective expression of hemolin in an E. coli expression system. The overall presence of Cytotoxic T Lymphocyte (CTL), Helper T Lymphocyte (HTL), and IFN-γ epitopes with high antigenicity indicates the potential of hemolin as a good candidate carrier protein.
8
Bansal N, Zheng Z, Song LF, Pei J, Merz KM. The Role of the Active Site Flap in Streptavidin/Biotin Complex Formation. J Am Chem Soc 2018; 140:5434-5446. [PMID: 29607642 DOI: 10.1021/jacs.8b00743] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 12/20/2022]
Abstract
Obtaining a detailed description of how active site flap motion affects substrate or ligand binding will advance structure-based drug design (SBDD) efforts on systems including the kinases, HSP90, HIV protease, ureases, etc. Through this understanding, we will be able to design better inhibitors and better proteins that have desired functions. Herein we address this issue by generating the relevant configurational states of a protein flap on the molecular energy landscape using an approach we call MTFlex-b and then following this with a procedure to estimate the free energy associated with the motion of the flap region. To illustrate our overall workflow, we explored the free energy changes in the streptavidin/biotin system upon introducing conformational flexibility in loop3-4 in the biotin unbound (apo) and bound (holo) states. The free energy surfaces were created using the Movable Type (MT) free energy method, and for further validation, we compared them to potential of mean force (PMF) free energy surfaces generated from MD simulations employing the FF99SBILDN and FF14SB force fields. We also estimated the free energy thermodynamic cycle using an ensemble of closed-like and open-like end states for the ligand unbound and bound states and estimated the binding free energy to be approximately -16.2 kcal/mol (experimental -18.3 kcal/mol). The good agreement of MTFlex-b, in combination with the MT method, with experiment and MD simulations supports the effectiveness of our strategy in obtaining unique insights into the motions in proteins that can then be used in a range of biological and biomedical applications.
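To put the 2.1 kcal/mol gap between the computed and experimental binding free energies in perspective, the following sketch converts each value to a dissociation constant with the standard relation ΔG = RT ln(Kd/c°). The temperature of 298.15 K and the 1 M standard state are assumptions made here, not values stated in the abstract, and the function name is illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def kd_from_binding_free_energy(dg_kcal_per_mol, temperature_k=298.15):
    """Dissociation constant in mol/L from a binding free energy (1 M standard state)."""
    return math.exp(dg_kcal_per_mol / (R_KCAL * temperature_k))


kd_computed = kd_from_binding_free_energy(-16.2)      # value estimated in the study
kd_experimental = kd_from_binding_free_energy(-18.3)  # experimental reference
fold_difference = kd_computed / kd_experimental
print(f"computed Kd     ~ {kd_computed:.2e} M")
print(f"experimental Kd ~ {kd_experimental:.2e} M")
print(f"fold difference ~ {fold_difference:.0f}")
```

Under these assumptions the two free energies correspond to dissociation constants that differ by roughly one and a half orders of magnitude, which shows how sensitive predicted affinities are to errors of a couple of kcal/mol.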
Affiliation(s)
- Nupur Bansal
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
- Zheng Zheng
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
- Lin Frank Song
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
- Jun Pei
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
- Kenneth M Merz
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
- Institute for Cyber Enabled Research, Michigan State University, 567 Wilson Road, East Lansing, Michigan 48824, United States
9
Bazzoli A, Karanicolas J. "Solvent hydrogen-bond occlusion": A new model of polar desolvation for biomolecular energetics. J Comput Chem 2017; 38:1321-1331. [PMID: 28318014 PMCID: PMC5407913 DOI: 10.1002/jcc.24740] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 09/11/2016] [Revised: 12/31/2016] [Accepted: 01/03/2017] [Indexed: 12/14/2022]
Abstract
Water engages in two important types of interactions near biomolecules: it forms ordered "cages" around exposed hydrophobic regions, and it participates in hydrogen bonds with surface polar groups. Both types of interaction are critical to biomolecular structure and function, but explicitly including an appropriate number of solvent molecules makes many applications computationally intractable. A number of implicit solvent models have been developed to address this problem, many of which treat these two solvation effects separately. Here, we describe a new model to capture polar solvation effects, called SHO ("solvent hydrogen-bond occlusion"); our model aims to directly evaluate the energetic penalty associated with displacing discrete first-shell water molecules near each solute polar group. We have incorporated SHO into the Rosetta energy function, and find that scoring protein structures with SHO provides superior performance in loop modeling, virtual screening, and protein structure prediction benchmarks. These improvements stem from the fact that SHO accurately identifies and penalizes polar groups that do not participate in hydrogen bonds, either with solvent or with other solute atoms ("unsatisfied" polar groups). We expect that in future, SHO will enable higher-resolution predictions for a variety of molecular modeling applications. © 2017 Wiley Periodicals, Inc.
Affiliation(s)
- Andrea Bazzoli
- Center for Computational Biology, University of Kansas, 2030 Becker Dr., Lawrence, KS 66045-7534
- Computational Chemical Biology Core, University of Kansas, 2030 Becker Dr., Lawrence, KS 66045-7534
- John Karanicolas
- Center for Computational Biology, University of Kansas, 2030 Becker Dr., Lawrence, KS 66045-7534
- Department of Molecular Biosciences, University of Kansas, 2030 Becker Dr., Lawrence, KS 66045-7534
- Program in Molecular Therapeutics, Fox Chase Cancer Center, Philadelphia, PA 19111-2497
10
Dakal TC, Kumar R, Ramotar D. Structural modeling of human organic cation transporters. Comput Biol Chem 2017; 68:153-163. [PMID: 28343125 DOI: 10.1016/j.compbiolchem.2017.03.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Received: 09/16/2016] [Revised: 02/01/2017] [Accepted: 03/11/2017] [Indexed: 12/12/2022]
Abstract
Human organic cation transporters (hOCTs) belong to the solute carrier (SLC) 22 family of membrane proteins, which play a central role in the transport of chemotherapeutic drugs for several clinical and pathological conditions, including cancer and diabetes. These transporters mediate drug transport; however, the precise mechanism of drug binding and transport is not yet fully understood, partly because no crystal structure is available. In this work, we used a multi-phase approach to compute 3D structural models of seven hOCTs starting from the primary protein sequence. Our structure modeling approach included 1) I-TASSER-based comparative sequence alignment, threading, and ab initio protein modeling; 2) comparison of the models with PSIPRED secondary structure predictions; 3) loop modeling for incongruent secondary structure in Chimera 1.10.1; 4) high-resolution structure simulation, refinement, and energy minimization using ModRefiner; and 5) validation of the structure models using PROCHECK at SAVEs. Structurally, the computed 3D models of hOCTs show a typical major facilitator superfamily (MFS) fold of twelve α-helical transmembrane domains, arranged so that hOCTs adopt a barrel-shaped structure with a large cleft that opens into the cytoplasm. The modeled 3D structures of all hOCTs closely resemble the human SLC2A3 (GLUT3) transporter (PDB ID: 5c65) and display an outward-open conformation and putative cyclic C1 protein symmetry. In addition, hOCTs have a large (>100 amino acids), unique extracellular loop between TMH1 and TMH2 with potential glycosylation sites (Asn-Xaa-Ser/Thr) and cysteine residues, both features indicative of a putative role in drug binding and uptake. There is an intracellular three/four-helix loop between TMH6 and TMH7 containing putative phosphorylation sites for precise regulation of hOCT function as drug transporters. There are nine loops of 4 to 11 amino acids in length that protrude from the membrane, both intracellularly and extracellularly, and connect adjacent TMHs. The 2D structure prediction showed an Nin-Cin topology for all hOCTs. In the absence of crystal structures of hOCTs, the 3D structural models computed in silico and presented herein can be used to study the mechanism of drug binding and transport by hOCTs.
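The Asn-Xaa-Ser/Thr glycosylation motif mentioned in this abstract can be located with a short pattern scan. The sketch below is illustrative rather than the authors' procedure: it additionally excludes proline at the Xaa position, a common refinement that the abstract does not state, and the test sequence is invented.

```python
import re

# Lookahead so overlapping motifs are all reported; Xaa is any residue except
# proline (an assumption added here, the abstract only says "Xaa").
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(sequence):
    """Return 1-based positions of the Asn in each Asn-Xaa-Ser/Thr motif."""
    return [m.start() + 1 for m in SEQUON.finditer(sequence.upper())]


# Invented example sequence, not an hOCT loop.
positions = find_sequons("MNGSAANPTQNAT")
print(positions)  # [2, 11]
```

A motif match only marks a potential site; whether it is glycosylated depends on context such as membrane topology, which is why the abstract ties these sites to the extracellular loop.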
Affiliation(s)
- Tikam Chand Dakal
- Maisonneuve-Rosemont Hospital, Research Center, Université de Montréal, Department of Medicine, 5415 Boul. de L'Assomption, Montréal, Québec H1T 2M4, Canada
- Rajender Kumar
- Architecture et Fonction des Macromolécules Biologiques (AFMB), Campus de Luminy, Aix-Marseille Université, Marseille, France
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research (NIPER), Sector 67, S.A.S. Nagar, 160 062, Punjab, India
- Dindial Ramotar
- Maisonneuve-Rosemont Hospital, Research Center, Université de Montréal, Department of Medicine, 5415 Boul. de L'Assomption, Montréal, Québec H1T 2M4, Canada
11
Tang K, Zhang J, Liang J. Distance-Guided Forward and Backward Chain-Growth Monte Carlo Method for Conformational Sampling and Structural Prediction of Antibody CDR-H3 Loops. J Chem Theory Comput 2016; 13:380-388. [PMID: 27996262 DOI: 10.1021/acs.jctc.6b00845] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 11/29/2022]
Abstract
Antibodies recognize antigens through the complementarity-determining regions (CDRs), formed by six hypervariable loops that are crucial for the diversity of antigen specificities. Among the six CDR loops, the H3 loop is the most challenging to predict because of its much higher variation in sequence length and identity, which results in a much larger and more complex structural space than for the other five loops. We developed a novel method based on chain-growth sequential Monte Carlo, called distance-guided sequential chain-growth Monte Carlo for H3 loops (DiSGro-H3). The new method samples protein chains in both forward and backward directions. It can efficiently generate low-energy, near-native H3 loop structures using the conformation types predicted from the sequences of H3 loops. DiSGro-H3 performs significantly better than another ab initio method, RosettaAntibody, in both sampling and prediction, while taking less computational time. It performs comparably to template-based methods. As an ab initio method, DiSGro-H3 offers satisfactory accuracy while being able to predict any H3 loop without templates.
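The general idea of growing a chain bead by bead while keeping the end anchor reachable can be shown with a toy sketch. This is not DiSGro-H3: it grows in the forward direction only, uses a bare Cα-like trace with a fixed 3.8 Å bond, accepts candidates by a simple reachability test instead of empirical distance distributions or an energy function, and closes the last two bonds analytically. All names and parameters are hypothetical.

```python
import math
import random


def _random_unit(rng):
    """Uniform random direction on the unit sphere (normalized Gaussian)."""
    while True:
        v = [rng.gauss(0.0, 1.0) for _ in range(3)]
        n = math.sqrt(sum(c * c for c in v))
        if n > 1e-9:
            return [c / n for c in v]


def grow_chain(start, target, n_bonds, bond=3.8, n_trials=30, seed=0):
    """Grow n_bonds fixed-length bonds from start so the last bead is target."""
    start, target = list(start), list(target)
    if n_bonds < 2 or math.dist(start, target) > n_bonds * bond:
        raise ValueError("target is not reachable with this many bonds")
    rng = random.Random(seed)
    chain = [start]
    # Sample every bead except the closure bead and the target itself.
    for placed in range(1, n_bonds - 1):
        bonds_left = n_bonds - placed
        feasible = []
        for _ in range(n_trials):
            u = _random_unit(rng)
            p = [c + bond * x for c, x in zip(chain[-1], u)]
            # Distance guidance: keep only beads from which the target
            # can still be reached with the remaining bonds.
            if math.dist(p, target) <= bonds_left * bond:
                feasible.append(p)
        if feasible:
            chain.append(rng.choice(feasible))
        else:
            # No trial was feasible: step straight toward the target.
            d = math.dist(chain[-1], target)
            chain.append([c + bond * (t - c) / d
                          for c, t in zip(chain[-1], target)])
    # Analytic closure: the next bead lies on the circle at distance `bond`
    # from both the current bead and the target.
    q = chain[-1]
    d = math.dist(q, target)
    if d < 1e-9:
        chain.append([c + bond * x for c, x in zip(q, _random_unit(rng))])
    else:
        axis = [(t - c) / d for c, t in zip(q, target)]
        while True:
            r = _random_unit(rng)
            dot = sum(a * b for a, b in zip(r, axis))
            perp = [a - dot * b for a, b in zip(r, axis)]
            n = math.sqrt(sum(c * c for c in perp))
            if n > 1e-6:
                break
        h = math.sqrt(max(bond * bond - (d / 2) ** 2, 0.0))
        chain.append([c + a * d / 2 + h * p / n
                      for c, a, p in zip(q, axis, perp)])
    chain.append(target)
    return chain


loop = grow_chain((0.0, 0.0, 0.0), (10.0, 0.0, 0.0), n_bonds=8)
```

Rejecting dead-end placements during growth is what keeps such samplers efficient: no time is spent extending conformations that cannot close. A method like the one in the abstract would additionally weight candidates and score the completed loops.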
Affiliation(s)
- Ke Tang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois 60607, United States
- Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida 32306, United States
- Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois 60607, United States
12
Abstract
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. © 2016 by John Wiley & Sons, Inc.
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
- Andrej Sali
- University of California at San Francisco, San Francisco, California
13
Topham CM, Barbe S, André I. An Atomistic Statistically Effective Energy Function for Computational Protein Design. J Chem Theory Comput 2016; 12:4146-68. [PMID: 27341125 DOI: 10.1021/acs.jctc.6b00090] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 11/28/2022]
Abstract
Shortcomings in the definition of effective free-energy surfaces of proteins are recognized to be a major contributory factor responsible for the low success rates of existing automated methods for computational protein design (CPD). The formulation of an atomistic statistically effective energy function (SEEF) suitable for a wide range of CPD applications and its derivation from structural data extracted from protein domains and protein-ligand complexes are described here. The proposed energy function comprises nonlocal atom-based and local residue-based SEEFs, which are coupled using a novel atom connectivity number factor to scale short-range, pairwise, nonbonded atomic interaction energies and a surface-area-dependent cavity energy term. This energy function was used to derive additional SEEFs describing the unfolded-state ensemble of any given residue sequence based on computed average energies for partially or fully solvent-exposed fragments in regions of irregular structure in native proteins. Relative thermal stabilities of 97 T4 bacteriophage lysozyme mutants were predicted from calculated energy differences for folded and unfolded states with an average unsigned error (AUE) of 0.84 kcal mol⁻¹ when compared to experiment. To demonstrate the utility of the energy function for CPD, further validation was carried out in tests of its capacity to recover cognate protein sequences and to discriminate native and near-native protein folds, loop conformers, and small-molecule ligand binding poses from non-native benchmark decoys. Experimental ligand binding free energies for a diverse set of 80 protein complexes could be predicted with an AUE of 2.4 kcal mol⁻¹ using an additional energy term to account for the loss in ligand configurational entropy upon binding. The atomistic SEEF is expected to improve the accuracy of residue-based coarse-grained SEEFs currently used in CPD and to extend the range of applications of extant atom-based protein statistical potentials.
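The average unsigned error (AUE) quoted in this abstract is the mean absolute difference between predicted and experimental values. A minimal sketch follows; the function name and the numbers are illustrative only and are not data from the paper.

```python
def average_unsigned_error(predicted, experimental):
    """Mean absolute deviation between paired predicted and experimental values."""
    if not predicted or len(predicted) != len(experimental):
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(p - e) for p, e in zip(predicted, experimental))
    return total / len(predicted)


# Made-up stability changes in kcal/mol, for illustration only.
predicted = [1.0, -2.0, 0.5]
experimental = [0.5, -1.0, 0.5]
aue = average_unsigned_error(predicted, experimental)
print(aue)  # 0.5
```

Because the differences are taken in absolute value, over- and under-predictions cannot cancel, so the AUE reports typical error size rather than systematic bias.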
Collapse
Affiliation(s)
- Christopher M Topham
- Université de Toulouse; INSA, UPS, INP; LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France; CNRS, UMR5504, F-31400 Toulouse, France; INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, F-31400 Toulouse, France
| | - Sophie Barbe
- Université de Toulouse; INSA, UPS, INP; LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France; CNRS, UMR5504, F-31400 Toulouse, France; INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, F-31400 Toulouse, France
| | - Isabelle André
- Université de Toulouse; INSA, UPS, INP; LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France; CNRS, UMR5504, F-31400 Toulouse, France; INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés, F-31400 Toulouse, France
| |
|
14
|
Webb B, Sali A. Comparative Protein Structure Modeling Using MODELLER. CURRENT PROTOCOLS IN BIOINFORMATICS 2016; 54:5.6.1-5.6.37. [PMID: 27322406 PMCID: PMC5031415 DOI: 10.1002/cpbi.3] [Citation(s) in RCA: 1890] [Impact Index Per Article: 236.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. © 2016 by John Wiley & Sons, Inc.
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
| | - Andrej Sali
- University of California at San Francisco, San Francisco, California
| |
|
15
|
Childers MC, Towse CL, Daggett V. The effect of chirality and steric hindrance on intrinsic backbone conformational propensities: tools for protein design. Protein Eng Des Sel 2016; 29:271-80. [PMID: 27284086 DOI: 10.1093/protein/gzw023] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2016] [Accepted: 05/11/2016] [Indexed: 01/30/2023] Open
Abstract
The conformational propensities of amino acids are an amalgamation of sequence effects, environmental effects and underlying intrinsic behavior. Many have attempted to investigate neighboring residue effects to aid in our understanding of protein folding and improve structure prediction efforts, especially with respect to difficult to characterize states, such as disordered or unfolded states. Host-guest peptide series are a useful tool in examining the propensities of the amino acids free from the surrounding protein structure. Here, we compare the distributions of the backbone dihedral angles (φ/ψ) of the 20 proteogenic amino acids in two different sequence contexts using the AAXAA and GGXGG host-guest pentapeptide series. We further examine their intrinsic behaviors across three environmental contexts: water at 298 K, water at 498 K, and 8 M urea at 298 K. The GGXGG systems provide the intrinsic amino acid propensities devoid of any conformational context. The alanine residues in the AAXAA series enforce backbone chirality, thereby providing a model of the intrinsic behavior of amino acids in a protein chain. Our results show modest differences in φ/ψ distributions due to the steric constraints of the Ala side chains, the magnitudes of which are dependent on the denaturing conditions. One of the strongest factors modulating φ/ψ distributions was the protonation of titratable side chains, and the largest differences observed were in the amino acid propensities for the rarely sampled αL region.
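The backbone dihedral angles (φ/ψ) discussed in this abstract are torsion angles defined by four consecutive backbone atoms. As a minimal sketch, a torsion angle can be computed from four Cartesian points as below; the function names and the test coordinates are hypothetical and are not taken from the paper.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral_degrees(p0, p1, p2, p3):
    """Torsion angle (degrees, in [-180, 180]) about the p1-p2 bond."""
    b0 = _sub(p0, p1)          # vector from p1 back to p0
    b1 = _sub(p2, p1)          # central bond
    b2 = _sub(p3, p2)          # vector from p2 on to p3
    norm = math.sqrt(_dot(b1, b1))
    b1 = (b1[0] / norm, b1[1] / norm, b1[2] / norm)
    # Remove the components of b0 and b2 that lie along the central bond.
    k0 = _dot(b0, b1)
    k2 = _dot(b2, b1)
    v = (b0[0] - k0 * b1[0], b0[1] - k0 * b1[1], b0[2] - k0 * b1[2])
    w = (b2[0] - k2 * b1[0], b2[1] - k2 * b1[1], b2[2] - k2 * b1[2])
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))


# Hypothetical points: a planar "cis" arrangement and a 90-degree twist.
cis = dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
twist = dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
trans = dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
```

For φ the four atoms would be C(i-1), N(i), Cα(i), C(i), and for ψ they would be N(i), Cα(i), C(i), N(i+1); collecting these pairs over a trajectory gives the φ/ψ distributions the abstract compares.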
Affiliation(s)
| | - Clare-Louise Towse
- Department of Bioengineering, University of Washington, Seattle, WA 98195-5013, USA
| | - Valerie Daggett
- Department of Bioengineering, University of Washington, Seattle, WA 98195-5013, USA
| |
|
16
|
Urquiza-Carvalho GA, Fragoso WD, Rocha GB. Assessment of semiempirical enthalpy of formation in solution as an effective energy function to discriminate native-like structures in protein decoy sets. J Comput Chem 2016; 37:1962-72. [DOI: 10.1002/jcc.24415] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Revised: 04/29/2016] [Accepted: 05/11/2016] [Indexed: 11/09/2022]
Affiliation(s)
- Gabriel Aires Urquiza-Carvalho
- Departamento de Química, CCEN, Universidade Federal da Paraíba, João Pessoa/PB, Caixa Postal 5093, CEP 58051-970, Brazil
| | - Wallace Duarte Fragoso
- Departamento de Química, CCEN, Universidade Federal da Paraíba, João Pessoa/PB, Caixa Postal 5093, CEP 58051-970, Brazil
| | - Gerd Bruno Rocha
- Departamento de Química, CCEN, Universidade Federal da Paraíba, João Pessoa/PB, Caixa Postal 5093, CEP 58051-970, Brazil
| |
|
17
|
Maadooliat M, Zhou L, Najibi SM, Gao X, Huang JZ. Collective Estimation of Multiple Bivariate Density Functions With Application to Angular-Sampling-Based Protein Loop Modeling. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2015.1099535] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
18
|
López-Blanco JR, Canosa-Valls AJ, Li Y, Chacón P. RCD+: Fast loop modeling server. Nucleic Acids Res 2016; 44:W395-400. [PMID: 27151199 PMCID: PMC4987936 DOI: 10.1093/nar/gkw395] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 04/28/2016] [Indexed: 11/12/2022] Open
Abstract
Modeling loops is a critical and challenging step in protein modeling and prediction. We have developed a quick online service (http://rcd.chaconlab.org) for ab initio loop modeling combining a coarse-grained conformational search with a full-atom refinement. Our original Random Coordinate Descent (RCD) loop closure algorithm has been greatly improved to enrich the sampling distribution towards near-native conformations. These improvements include a new workflow optimization, MPI-parallelization and fast backbone angle sampling based on neighbor-dependent Ramachandran probability distributions. The server starts by efficiently searching the vast conformational space from only the loop sequence information and the environment atomic coordinates. The generated closed loop models are subsequently ranked using a fast distance-orientation dependent energy filter. Top ranked loops are refined with the Rosetta energy function to obtain accurate all-atom predictions that can be interactively inspected in a user-friendly web interface. Using standard benchmarks, the average root mean squared deviation (RMSD) is 0.8 and 1.4 Å for 8- and 12-residue loops, respectively, in the challenging modeling scenario in which the side chains of the loop environment are fully remodeled. These results are not only very competitive with those obtained with public state-of-the-art methods, but they are also obtained ∼10-fold faster.
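The RMSD figures quoted in this abstract measure the distance between a predicted loop and a reference loop. A minimal sketch of the calculation follows, assuming both conformations are already expressed in the same coordinate frame (no superposition is performed here) and list matched atoms in the same order; the function name and coordinates are hypothetical, and the benchmark's exact atom selection is not specified in this listing.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean squared deviation between two matched lists of (x, y, z) points."""
    if len(coords_a) != len(coords_b) or len(coords_a) == 0:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        squared += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
    return math.sqrt(squared / len(coords_a))


# Hypothetical two-atom example: every atom is displaced by 1 Å along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
model = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
loop_rmsd = rmsd(reference, model)
```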
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| | - Alejandro Jesús Canosa-Valls
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| | - Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - Pablo Chacón
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| |
|
19
|
Perez A, MacCallum JL, Coutsias EA, Dill KA. Constraint methods that accelerate free-energy simulations of biomolecules. J Chem Phys 2015; 143:243143. [PMID: 26723628 PMCID: PMC4684272 DOI: 10.1063/1.4936911] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Accepted: 11/18/2015] [Indexed: 01/07/2023] Open
Abstract
Atomistic molecular dynamics simulations of biomolecules are critical for generating narratives about biological mechanisms. The power of atomistic simulations is that these are physics-based methods that satisfy Boltzmann's law, so they can be used to compute populations, dynamics, and mechanisms. But physical simulations are computationally intensive and do not scale well to the sizes of many important biomolecules. One way to speed up physical simulations is by coarse-graining the potential function. Another way is to harness structural knowledge, often by imposing spring-like restraints. But harnessing external knowledge in physical simulations is problematic because knowledge, data, or hunches have errors, noise, and combinatoric uncertainties. Here, we review recent principled methods for imposing restraints to speed up physics-based molecular simulations that promise to scale to larger biomolecules and motions.
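The "spring-like restraints" mentioned in this abstract are commonly written as harmonic penalties on a distance. The sketch below shows one widely used variant, a flat-bottom harmonic restraint that tolerates some error in the target distance. It is an illustrative assumption, not the specific formulation reviewed in the paper, and the function name, parameter names, and values are hypothetical.

```python
def flat_bottom_restraint_energy(distance, target, tolerance, force_constant):
    """Zero energy within `tolerance` of `target`; harmonic penalty beyond it."""
    excess = abs(distance - target) - tolerance
    if excess <= 0.0:
        return 0.0
    return 0.5 * force_constant * excess ** 2


# Hypothetical values: target 4.0 Å, tolerance 0.5 Å, k = 2.0 energy units per Å².
inside = flat_bottom_restraint_energy(4.3, 4.0, 0.5, 2.0)
outside = flat_bottom_restraint_energy(5.0, 4.0, 0.5, 2.0)
```

The flat region is one simple way to keep a noisy or uncertain piece of structural knowledge from distorting conformations that already agree with it within error.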
Affiliation(s)
- Alberto Perez
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Evangelos A Coutsias
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| |
|
20
|
Ó Conchúir S, Barlow KA, Pache RA, Ollikainen N, Kundert K, O'Meara MJ, Smith CA, Kortemme T. A Web Resource for Standardized Benchmark Datasets, Metrics, and Rosetta Protocols for Macromolecular Modeling and Design. PLoS One 2015; 10:e0130433. [PMID: 26335248 PMCID: PMC4559433 DOI: 10.1371/journal.pone.0130433] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 05/20/2015] [Indexed: 11/18/2022] Open
Abstract
The development and validation of computational macromolecular modeling and design methods depend on suitable benchmark datasets and informative metrics for comparing protocols. In addition, if a method is intended to be adopted broadly in diverse biological applications, there needs to be information on appropriate parameters for each protocol, as well as metrics describing the expected accuracy compared to experimental data. In certain disciplines, there exist established benchmarks and public resources where experts in a particular methodology are encouraged to supply their most efficient implementation of each particular benchmark. We aim to provide such a resource for protocols in macromolecular modeling and design. We present a freely accessible web resource (https://kortemmelab.ucsf.edu/benchmarks) to guide the development of protocols for protein modeling and design. The site provides benchmark datasets and metrics to compare the performance of a variety of modeling protocols using different computational sampling methods and energy functions, providing a "best practice" set of parameters for each method. Each benchmark has an associated downloadable benchmark capture archive containing the input files, analysis scripts, and tutorials for running the benchmark. The captures may be run with any suitable modeling method; we supply command lines for running the benchmarks using the Rosetta software suite. We have compiled initial benchmarks for the resource spanning three key areas: prediction of energetic effects of mutations, protein design, and protein structure prediction, each with associated state-of-the-art modeling protocols. With the help of the wider macromolecular modeling community, we hope to expand the variety of benchmarks included on the website and continue to evaluate new iterations of current methods as they become available.
Affiliation(s)
- Shane Ó Conchúir
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Kyle A. Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Roland A. Pache
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Noah Ollikainen
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Kale Kundert
- Graduate Program in Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Matthew J. O'Meara
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Colin A. Smith
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Tanja Kortemme
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Biophysics, University of California San Francisco, San Francisco, California, United States of America
| |
|
21
|
Tang K, Wong SWK, Liu JS, Zhang J, Liang J. Conformational sampling and structure prediction of multiple interacting loops in soluble and β-barrel membrane proteins using multi-loop distance-guided chain-growth Monte Carlo method. Bioinformatics 2015; 31:2646-52. [PMID: 25861965 DOI: 10.1093/bioinformatics/btv198] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Accepted: 04/03/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION: Loops in proteins are often involved in biochemical functions. Their irregularity and flexibility make experimental structure determination and computational modeling challenging. Most current loop modeling methods focus on modeling single loops. In protein structure prediction, multiple loops often need to be modeled simultaneously. As interactions among loops in spatial proximity can be rather complex, sampling the conformations of multiple interacting loops is a challenging task. RESULTS: In this study, we report a new method called multi-loop Distance-guided Sequential chain-Growth Monte Carlo (M-DiSGro) for prediction of the conformations of multiple interacting loops in proteins. Our method achieves an average RMSD of 1.93 Å for lowest energy conformations of 36 pairs of interacting protein loops with the total length ranging from 12 to 24 residues. We further constructed a data set containing proteins with 2, 3 and 4 interacting loops. For the most challenging target proteins with four loops, the average RMSD of the lowest energy conformations is 2.35 Å. Our method is also tested for predicting multiple loops in β-barrel membrane proteins. For outer-membrane protein G, the lowest energy conformation has an RMSD of 2.62 Å for the three extracellular interacting loops with a total length of 34 residues (12, 12 and 10 residues in each loop). AVAILABILITY AND IMPLEMENTATION: The software is freely available at: tanto.bioe.uic.edu/m-DiSGro. CONTACT: jinfeng@stat.fsu.edu or jliang@uic.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Ke Tang
- Richard and Loan Hill Department of Bioengineering, University of Illinois at Chicago, Chicago, IL
| | - Samuel W K Wong
- Department of Statistics, University of Florida, Gainesville, FL
| | - Jun S Liu
- Department of Statistics, Harvard University, Science Center, Cambridge, MA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, FL, USA
| | - Jie Liang
- Richard and Loan Hill Department of Bioengineering, University of Illinois at Chicago, Chicago, IL
| |
|
22
|
Park H, Lee GR, Heo L, Seok C. Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments. PLoS One 2014; 9:e113811. [PMID: 25419655 PMCID: PMC4242723 DOI: 10.1371/journal.pone.0113811] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2014] [Accepted: 10/30/2014] [Indexed: 11/19/2022] Open
Abstract
Protein loop modeling is a tool for predicting protein local structures of particular interest, providing opportunities for applications involving protein structure prediction and de novo protein design. Until recently, the majority of loop modeling methods have been developed and tested by reconstructing loops in frameworks of experimentally resolved structures. In many practical applications, however, the protein loops to be modeled are located in inaccurate structural environments. These include loops in model structures, low-resolution experimental structures, or experimental structures of different functional forms. Accordingly, discrepancies in the accuracy of the structural environment assumed in development of the method and that in practical applications present additional challenges to modern loop modeling methods. This study demonstrates a new strategy for employing a hybrid energy function combining physics-based and knowledge-based components to help tackle this challenge. The hybrid energy function is designed to combine the strengths of each energy component, simultaneously maintaining accurate loop structure prediction in a high-resolution framework structure and tolerating minor environmental errors in low-resolution structures. A loop modeling method based on global optimization of this new energy function is tested on loop targets situated in different levels of environmental errors, ranging from experimental structures to structures perturbed in backbone as well as side chains and template-based model structures. The new method performs comparably to force field-based approaches in loop reconstruction in crystal structures and better in loop prediction in inaccurate framework structures. This result suggests that higher-accuracy predictions would be possible for a broader range of applications. The web server for this method is available at http://galaxy.seoklab.org/loop with the PS2 option for the scoring function.
Affiliation(s)
- Hahnbeom Park
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Gyu Rie Lee
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Lim Heo
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| | - Chaok Seok
- Department of Chemistry, Seoul National University, Seoul, Republic of Korea
| |
|
23
|
Abstract
Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.
Affiliation(s)
- Benjamin Webb
- University of California at San Francisco, San Francisco, California
| | | |
|
24
|
Tang K, Zhang J, Liang J. Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method. PLoS Comput Biol 2014; 10:e1003539. [PMID: 24763317 PMCID: PMC3998890 DOI: 10.1371/journal.pcbi.1003539] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Accepted: 02/01/2014] [Indexed: 11/18/2022] Open
Abstract
Loops in proteins are flexible regions connecting regular secondary structures. They are often involved in protein functions through interacting with other molecules. The irregularity and flexibility of loops make their structures difficult to determine experimentally and challenging to model computationally. Conformation sampling and energy evaluation are the two key components in loop modeling. We have developed a new method for loop conformation sampling and prediction based on a chain growth sequential Monte Carlo sampling strategy, called Distance-guided Sequential chain-Growth Monte Carlo (DISGRO). With an energy function designed specifically for loops, our method can efficiently generate high quality loop conformations with low energy that are enriched with near-native loop structures. The average minimum global backbone RMSD for 1,000 conformations of 12-residue loops is 1.53 Å, with a lowest energy RMSD of 2.99 Å, and an average ensemble RMSD of 5.23 Å. A novel geometric criterion is applied to speed up calculations. The computational cost of generating 1,000 conformations for each of the x loops in a benchmark dataset is only about 10 cpu minutes for 12-residue loops, compared to ca. 180 cpu minutes using the FALCm method. Test results on benchmark datasets show that DISGRO performs comparably to or better than previous successful methods, while requiring far less computing time. DISGRO is especially effective in modeling longer loops (10-17 residues).
Affiliation(s)
- Ke Tang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
| | - Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
| |
|
25
|
Webb B, Eswar N, Fan H, Khuri N, Pieper U, Dong G, Sali A. Comparative Modeling of Drug Target Proteins. REFERENCE MODULE IN CHEMISTRY, MOLECULAR SCIENCES AND CHEMICAL ENGINEERING 2014. [PMCID: PMC7157477 DOI: 10.1016/b978-0-12-409547-2.11133-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this perspective, we begin by describing the comparative protein structure modeling technique and the accuracy of the corresponding models. We then discuss the significant role that comparative prediction plays in drug discovery. We focus on virtual ligand screening against comparative models and illustrate the state-of-the-art by a number of specific examples.
|
26
|
Das R. Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz. PLoS One 2013; 8:e74830. [PMID: 24204571 PMCID: PMC3804535 DOI: 10.1371/journal.pone.0074830] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 08/07/2013] [Indexed: 11/18/2022] Open
Abstract
Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. Current approaches, such as the protein local optimization protocol or kinematic inversion closure (KIC) Monte Carlo, involve stages that coarse-grain proteins, simplifying modeling but precluding a systematic search of all-atom configurations. This article introduces an alternative modeling strategy based on a ‘stepwise ansatz’, recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth ‘RNA-puzzle’ competition. 
These results establish all-atom enumeration as an unusually systematic approach to ab initio protein structure modeling that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic accuracy.
Affiliation(s)
- Rhiju Das
- Departments of Biochemistry and Physics, Stanford University, Stanford, California, United States of America
| |
|
27
|
Bhattacharya D, Cheng J. i3Drefine software for protein 3D structure refinement and its assessment in CASP10. PLoS One 2013; 8:e69648. [PMID: 23894517 PMCID: PMC3716612 DOI: 10.1371/journal.pone.0069648] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Accepted: 06/13/2013] [Indexed: 12/25/2022] Open
Abstract
Protein structure refinement refers to the process of improving the quality of protein structures during structure modeling to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in the 8th CASP experiment. During the 9th and recently concluded 10th CASP experiments, consistent growth in the number of refinement targets and participating groups has been witnessed. Yet protein structure refinement remains a largely unsolved problem, with the majority of participating groups in the CASP refinement category failing to consistently improve the quality of structures issued for refinement. To address this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with powerful all-atom composite physics and knowledge-based force fields and a hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as ‘MULTICOM-CONSTRUCT’) was ranked as the best method in the server section as per the official assessment of the CASP10 experiment. Here we provide the community with free access to the i3Drefine software, systematically analyse the performance of i3Drefine in strict blind mode on the refinement targets issued in the CASP10 refinement category, and compare it with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is the only fully automated server participating in CASP10 that exhibits consistent improvement over the initial structures in both global and local structural quality metrics. An executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/.
Affiliation(s)
- Debswapna Bhattacharya
- Department of Computer Science, University of Missouri, Columbia, Missouri, United States of America
| | - Jianlin Cheng
- Department of Computer Science, Informatics Institute, Bond Life Science Center, University of Missouri, Columbia, Missouri, United States of America
| |
|
28
|
MacDonald JT, Kelley LA, Freemont PS. Validating a Coarse-Grained Potential Energy Function through Protein Loop Modelling. PLoS One 2013; 8:e65770. [PMID: 23824634 PMCID: PMC3688807 DOI: 10.1371/journal.pone.0065770] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 04/26/2013] [Indexed: 12/02/2022] Open
Abstract
Coarse-grained (CG) methods for sampling protein conformational space have the potential to increase computational efficiency by reducing the degrees of freedom. The gain in computational efficiency of CG methods often comes at the expense of non-protein-like local conformational features. This could cause problems when transitioning to full atom models in a hierarchical framework. Here, a CG potential energy function was validated by applying it to the problem of loop prediction. A novel method to sample the conformational space of backbone atoms was benchmarked using a standard test set consisting of 351 distinct loops. This method used a sequence-independent CG potential energy function representing the protein using α-carbon positions only and sampling conformations with a Monte Carlo simulated annealing based protocol. Backbone atoms were added using a method previously described and then gradient minimised in the Rosetta force field. Despite the CG potential energy function being sequence-independent, the method performed similarly to methods that explicitly use either fragments of known protein backbones with similar sequences or residue-specific φ/ψ-maps to restrict the search space. The method was also able to predict with sub-Angstrom accuracy two out of seven loops from recently solved crystal structures of proteins with low sequence and structure similarity to previously deposited structures in the PDB. The ability to sample realistic loop conformations directly from a potential energy function enables the incorporation of additional geometric restraints and the use of more advanced sampling methods in a way that is not easily possible with fragment replacement methods, and also enables multi-scale simulations for protein design and protein structure prediction. These restraints could be derived from experimental data or could be design restraints in the case of computational protein design.
C++ source code is available for download from http://www.sbg.bio.ic.ac.uk/phyre2/PD2/.
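The Monte Carlo simulated annealing idea mentioned in this abstract can be illustrated with a minimal sketch. The toy one-dimensional energy, the step size, and the geometric cooling schedule below are hypothetical stand-ins chosen for illustration; they are not the PD2 potential or the authors' protocol.

```python
import math
import random


def toy_energy(x):
    """Hypothetical rugged 1-D landscape with its lowest basin near x = 2."""
    return (x - 2.0) ** 2 + 0.5 * math.sin(8.0 * x)


def metropolis_accept(delta_e, temperature, rng):
    """Metropolis criterion: always accept downhill moves, sometimes uphill ones."""
    if delta_e <= 0.0:
        return True
    return rng.random() < math.exp(-delta_e / temperature)


def simulated_annealing(energy, x0, t_start=5.0, t_end=0.01, steps=5000, seed=0):
    """Return the lowest-energy point visited while cooling from t_start to t_end."""
    rng = random.Random(seed)
    x, e = x0, energy(x0)
    best_x, best_e = x, e
    for i in range(steps):
        # Geometric cooling schedule (an assumption, not taken from the paper).
        t = t_start * (t_end / t_start) ** (i / (steps - 1))
        x_new = x + rng.uniform(-0.5, 0.5)
        e_new = energy(x_new)
        if metropolis_accept(e_new - e, t, rng):
            x, e = x_new, e_new
            if e < best_e:
                best_x, best_e = x, e
    return best_x, best_e


if __name__ == "__main__":
    x, e = simulated_annealing(toy_energy, x0=-5.0)
    print(f"best x = {x:.3f}, energy = {e:.3f}")
```

In a loop-modelling setting the scalar `x` would be replaced by the coordinates of the sampled atoms and `toy_energy` by the potential energy function; the acceptance rule itself stays the same.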
Affiliation(s)
- James T. MacDonald
- Division of Molecular Biosciences, Imperial College London, London, United Kingdom
- Lawrence A. Kelley
- Division of Molecular Biosciences, Imperial College London, London, United Kingdom
- Paul S. Freemont
- Division of Molecular Biosciences, Imperial College London, London, United Kingdom
|
29
|
Olson MA, Lee MS. Application of replica exchange umbrella sampling to protein structure refinement of nontemplate models. J Comput Chem 2013; 34:1785-93. [PMID: 23703032 DOI: 10.1002/jcc.23325] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 11/30/2012] [Revised: 03/12/2013] [Accepted: 04/21/2013] [Indexed: 12/30/2022]
Abstract
We provide an assessment of a computational strategy for protein structure refinement that combines self-guided Langevin dynamics with umbrella-potential biasing replica exchange using the radius of gyration as a coordinate (Rg-ReX). Eight structurally nonredundant proteins and their decoys were examined by sampling conformational space at room temperature using the CHARMM22/GBMV2 force field to generate the ensemble of structures. Two atomic statistical potentials (RWplus and DFIRE) were analyzed for structure identification and compared to the simulation force-field potential. The results show that, while the Rg-ReX simulations were able to sample conformational basins that were more structurally similar to the X-ray crystallographic structures than the starting first-order ranked decoys, the potentials failed to detect these basins from refinement. Of the three potential functions, RWplus yielded the highest accuracy for recognition of structures that refined to an average of nearly 20% increase in native contacts relative to the starting decoys. The overall performance of Rg-ReX is compared to an earlier study of applying temperature-based replica exchange to refine the same decoy sets and highlights the general challenge of consistently achieving the sampling and detection threshold of 70% fraction of native contacts.
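As a rough illustration of umbrella-biased replica exchange along the radius of gyration, the sketch below computes an unweighted Rg and the standard Metropolis swap probability between two umbrella windows held at the same temperature. The harmonic bias form, the force constant, and the value of kT are assumptions made for this example; the abstract does not specify them.

```python
import math


def radius_of_gyration(coords):
    """Unweighted radius of gyration of a list of (x, y, z) points."""
    n = len(coords)
    center = [sum(p[k] for p in coords) / n for k in range(3)]
    mean_sq = sum(
        sum((p[k] - center[k]) ** 2 for k in range(3)) for p in coords
    ) / n
    return math.sqrt(mean_sq)


def umbrella_energy(rg, rg_center, k_force):
    """Harmonic bias restraining Rg near rg_center (assumed functional form)."""
    return 0.5 * k_force * (rg - rg_center) ** 2


def exchange_probability(rg_i, rg_j, center_i, center_j, k_force, kT):
    """Probability of swapping configurations between windows i and j.

    Both replicas share the same unbiased force field and temperature, so
    only the bias energies enter the Metropolis ratio.
    """
    before = umbrella_energy(rg_i, center_i, k_force) + umbrella_energy(rg_j, center_j, k_force)
    after = umbrella_energy(rg_j, center_i, k_force) + umbrella_energy(rg_i, center_j, k_force)
    delta = (after - before) / kT
    if delta <= 0.0:
        return 1.0
    return math.exp(-delta)


if __name__ == "__main__":
    # Hypothetical values: window centers at 12 and 10, kT of about 0.6.
    print(exchange_probability(12.0, 10.0, 12.0, 10.0, k_force=1.0, kT=0.6))
```

A swap is always accepted when it moves each configuration closer to the center of the window it lands in, and is exponentially suppressed otherwise.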
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, USAMRIID, Frederick, Maryland 21702, USA.
|
30
|
Stein A, Kortemme T. Improvements to robotics-inspired conformational sampling in Rosetta. PLoS One 2013; 8:e63090. [PMID: 23704889 PMCID: PMC3660577 DOI: 10.1371/journal.pone.0063090] [Citation(s) in RCA: 140] [Impact Index Per Article: 12.7] [Received: 01/31/2013] [Accepted: 03/28/2013] [Indexed: 02/04/2023] Open
Abstract
To accurately predict protein conformations in atomic detail, a computational method must be capable of sampling models sufficiently close to the native structure. All-atom sampling is difficult because of the vast number of possible conformations and extremely rugged energy landscapes. Here, we test three sampling strategies to address these difficulties: conformational diversification, intensification of torsion and omega-angle sampling and parameter annealing. We evaluate these strategies in the context of the robotics-based kinematic closure (KIC) method for local conformational sampling in Rosetta on an established benchmark set of 45 12-residue protein segments without regular secondary structure. We quantify performance as the fraction of sub-Angstrom models generated. While improvements with individual strategies are only modest, the combination of intensification and annealing strategies into a new “next-generation KIC” method yields a four-fold increase over standard KIC in the median percentage of sub-Angstrom models across the dataset. Such improvements enable progress on more difficult problems, as demonstrated on longer segments, several of which could not be accurately remodeled with previous methods. Given its improved sampling capability, next-generation KIC should allow advances in other applications such as local conformational remodeling of multiple segments simultaneously, flexible backbone sequence design, and development of more accurate energy functions.
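The performance measure named in this abstract, the fraction of sub-Angstrom models, can be sketched as follows. The function names are hypothetical, and the RMSD is computed without superposition on the assumption that each model already sits in the frame of the native structure.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) points, no superposition."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = sum(
        (a[k] - b[k]) ** 2 for a, b in zip(coords_a, coords_b) for k in range(3)
    )
    return math.sqrt(total / len(coords_a))


def fraction_sub_angstrom(models, native, cutoff=1.0):
    """Fraction of models whose RMSD to the native segment is below cutoff (in Å)."""
    hits = sum(1 for model in models if rmsd(model, native) < cutoff)
    return hits / len(models)


if __name__ == "__main__":
    native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    close = [(0.5, 0.0, 0.0), (1.5, 0.0, 0.0)]  # shifted by 0.5 Å
    far = [(2.0, 0.0, 0.0), (3.0, 0.0, 0.0)]    # shifted by 2.0 Å
    print(fraction_sub_angstrom([close, far], native))
```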
Affiliation(s)
- Amelie Stein
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Tanja Kortemme
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
|
31
|
Zhu K, Day T. Ab initio structure prediction of the antibody hypervariable H3 loop. Proteins 2013; 81:1081-9. [DOI: 10.1002/prot.24240] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 10/23/2012] [Accepted: 12/06/2012] [Indexed: 12/25/2022]
|
32
|
Computational methods for high resolution prediction and refinement of protein structures. Curr Opin Struct Biol 2013; 23:177-84. [DOI: 10.1016/j.sbi.2013.01.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 12/14/2012] [Revised: 01/22/2013] [Accepted: 01/24/2013] [Indexed: 01/29/2023]
|
33
|
Chys P, Chacón P. Random Coordinate Descent with Spinor-matrices and Geometric Filters for Efficient Loop Closure. J Chem Theory Comput 2013; 9:1821-9. [PMID: 26587638 DOI: 10.1021/ct300977f] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Indexed: 12/21/2022]
Abstract
Protein loop closure constitutes a critical step in loop and protein modeling whereby geometrically feasible loops must be found between two given anchor residues. Here, a new analytic/iterative algorithm denoted random coordinate descent (RCD) to perform protein loop closure is described. The algorithm solves loop closure through minimization as in cyclic coordinate descent but selects bonds for optimization randomly, updates loop conformations by spinor-matrices, performs loop closure in both chain directions, and uses a set of geometric filters to yield efficient conformational sampling. Geometric filters allow one to detect clashes and constrain dihedral angles on the fly. The RCD algorithm is at least comparable to state of the art loop closure algorithms due to an excellent balance between efficiency and intrinsic sampling capability. Furthermore, its efficiency allows one to improve conformational sampling by increasing the sampling number without much penalty. Overall, RCD turns out to be accurate, fast, robust, and applicable over a wide range of loop lengths. Because of the versatility of RCD, it is a solid alternative for integration with current loop modeling strategies.
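The core idea of coordinate-descent loop closure with randomly selected joints can be sketched on a planar toy chain. This is a simplified two-dimensional analogue written for illustration only: it omits the spinor-matrix updates, bidirectional closure, and geometric filters described in the abstract, and all names and parameters are hypothetical.

```python
import math
import random


def forward(angles, lengths):
    """Joint positions of a planar chain anchored at the origin."""
    points = [(0.0, 0.0)]
    heading = 0.0
    for theta, length in zip(angles, lengths):
        heading += theta
        x, y = points[-1]
        points.append((x + length * math.cos(heading), y + length * math.sin(heading)))
    return points


def end_distance(angles, lengths, target):
    """Distance between the chain end and the target anchor."""
    ex, ey = forward(angles, lengths)[-1]
    return math.hypot(ex - target[0], ey - target[1])


def random_coordinate_descent(angles, lengths, target, steps=500, tol=1e-6, seed=0):
    """Close the chain onto target by optimizing one randomly chosen joint at a time."""
    rng = random.Random(seed)
    angles = list(angles)
    for _ in range(steps):
        if end_distance(angles, lengths, target) < tol:
            break
        i = rng.randrange(len(angles))  # random joint instead of a fixed sweep
        points = forward(angles, lengths)
        px, py = points[i]
        ex, ey = points[-1]
        # Rotate about joint i so the chain end points toward the target;
        # this is the distance-minimizing rotation for that single joint.
        to_end = math.atan2(ey - py, ex - px)
        to_target = math.atan2(target[1] - py, target[0] - px)
        angles[i] += to_target - to_end
    return angles


if __name__ == "__main__":
    lengths = [1.0] * 6
    start = [0.3] * 6
    closed = random_coordinate_descent(start, lengths, target=(3.0, 2.0))
    print(end_distance(start, lengths, (3.0, 2.0)), end_distance(closed, lengths, (3.0, 2.0)))
```

Each single-joint update cannot increase the end-to-target distance, which is why the iteration can only move toward closure; how fast it converges depends on the starting conformation.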
Affiliation(s)
- Pieter Chys
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
- Pablo Chacón
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
|
34
|
Li Y. Conformational sampling in template-free protein loop structure modeling: an overview. Comput Struct Biotechnol J 2013; 5:e201302003. [PMID: 24688696 PMCID: PMC3962101 DOI: 10.5936/csbj.201302003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Received: 10/19/2012] [Revised: 01/23/2013] [Accepted: 01/28/2013] [Indexed: 01/04/2023] Open
Abstract
Accurately modeling protein loops is an important step in predicting three-dimensional structures as well as in understanding the functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances in computational methods as well as the steadily increasing number of known structures available in the PDB. This mini review provides an overview of recent computational approaches for loop structure modeling. In particular, we focus on approaches to sampling loop conformation space, which is a critical step in obtaining high-resolution models with template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. Recent loop modeling results are also summarized.
Affiliation(s)
- Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
|
35
|
Miller EB, Murrett CS, Zhu K, Zhao S, Goldfeld DA, Bylund JH, Friesner RA. Prediction of Long Loops with Embedded Secondary Structure using the Protein Local Optimization Program. J Chem Theory Comput 2013; 9:1846-64. [PMID: 23814507 DOI: 10.1021/ct301083q] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 12/21/2022]
Abstract
Robust homology modeling to atomic-level accuracy requires, in the general case, successful prediction of protein loops containing small segments of secondary structure. Further, as loop prediction advances to success with larger loops, the exclusion of loops containing secondary structure becomes awkward. Here, we extend the applicability of the Protein Local Optimization Program (PLOP) to loops up to 17 residues in length that contain either helical or hairpin segments. In general, PLOP hierarchically samples conformational space and ranks candidate loops with a high-quality molecular mechanics force field. For loops identified to possess α-helical segments, we employ an alternative dihedral library composed of (ϕ,ψ) angles commonly found in helices. The alternative library is searched over a user-specified range of residues that define the helical bounds. The source of these helical bounds can be from popular secondary structure prediction software or from analysis of past loop predictions where a propensity to form a helix is observed. Due to the maturity of our energy model, the lowest energy loop across all experiments can be selected with an accuracy of sub-Ångström RMSD in 80% of cases, 1.0 to 1.5 Å RMSD in 14% of cases, and poorer than 1.5 Å RMSD in 6% of cases. The effectiveness of our current methods in predicting hairpin-containing loops is explored with hairpins up to 13 residues in length, again reaching an accuracy of sub-Ångström RMSD in 83% of cases, 1.0 to 1.5 Å RMSD in 10% of cases, and poorer than 1.5 Å RMSD in 7% of cases. Finally, we explore the effect of an imprecise surrounding environment, in which side chains, but not the backbone, are initially in perturbed geometries. In these cases, loops perturbed to 3 Å RMSD from the native environment were restored to their native conformation with sub-Ångström RMSD.
Affiliation(s)
- Edward B Miller
- Department of Chemistry, Columbia University, New York, New York
|
36
|
Goldfeld DA, Friesner RA. The protein local optimization program and G-protein-coupled receptors: loop restoration and applications to homology modeling. Methods Enzymol 2013; 522:1-20. [PMID: 23374177 DOI: 10.1016/b978-0-12-407865-9.00001-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/27/2022]
Abstract
The protein local optimization program (PLOP) uses sophisticated sampling algorithms and a highly refined physics-based energy function to restore loops within a protein structure. In this chapter, we highlight some of the recent successes we have had with PLOP restoring long loops in their native environment as well as the intra- and extracellular loops of four G-protein-coupled receptors. This includes the very long second extracellular loops of bovine rhodopsin and the turkey β1- and human β2-adrenergic receptors. We then provide an extremely detailed description of PLOP's algorithms, as well as a sample file and explicit keywords so that a new user can successfully run PLOP.
|
37
|
Leaver-Fay A, O'Meara MJ, Tyka M, Jacak R, Song Y, Kellogg EH, Thompson J, Davis IW, Pache RA, Lyskov S, Gray JJ, Kortemme T, Richardson JS, Havranek JJ, Snoeyink J, Baker D, Kuhlman B. Scientific benchmarks for guiding macromolecular energy function improvement. Methods Enzymol 2013; 523:109-43. [PMID: 23422428 DOI: 10.1016/b978-0-12-394292-0.00006-0] [Citation(s) in RCA: 159] [Impact Index Per Article: 14.5] [Indexed: 12/03/2022]
Abstract
Accurate energy functions are critical to macromolecular modeling and design. We describe new tools for identifying inaccuracies in energy functions and guiding their improvement, and illustrate the application of these tools to the improvement of the Rosetta energy function. The feature analysis tool identifies discrepancies between structures deposited in the PDB and low-energy structures generated by Rosetta; these likely arise from inaccuracies in the energy function. The optE tool optimizes the weights on the different components of the energy function by maximizing the recapitulation of a wide range of experimental observations. We use the tools to examine three proposed modifications to the Rosetta energy function: improving the unfolded state energy model (reference energies), using bicubic spline interpolation to generate knowledge-based torsional potentials, and incorporating the recently developed Dunbrack 2010 rotamer library (Shapovalov & Dunbrack, 2011).
Affiliation(s)
- Andrew Leaver-Fay
- Department of Biochemistry, University of North Carolina, Chapel Hill, North Carolina, USA.
|
38
|
Olson MA, Lee MS. Structure refinement of protein model decoys requires accurate side-chain placement. Proteins 2012; 81:469-78. [PMID: 23070940 DOI: 10.1002/prot.24204] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Received: 05/22/2012] [Revised: 09/18/2012] [Accepted: 10/02/2012] [Indexed: 11/10/2022]
Abstract
In this study, the application of temperature-based replica-exchange (T-ReX) simulations for structure refinement of decoys taken from the I-TASSER dataset was examined. A set of eight nonredundant proteins was investigated using self-guided Langevin dynamics (SGLD) with a generalized Born implicit solvent model to sample conformational space. For two of the protein test cases, a comparison of the SGLD/T-ReX method with that of a hybrid explicit/implicit solvent molecular dynamics T-ReX simulation model is provided. Additionally, the effect of side-chain placement among the starting decoy structures, using alternative rotamer conformations taken from the SCWRL4 modeling program, was investigated. The simulation results showed that, despite having near-native backbone conformations among the starting decoys, the determinant of their refinement is side-chain packing to a level that satisfies a minimum threshold of native contacts to allow efficient excursions toward the downhill refinement regime on the energy landscape. By repacking using SCWRL4 and by applying the RWplus statistical potential for structure identification, the SGLD/T-ReX simulations achieved refinement to an average of 38% increase in the number of native contacts relative to the original I-TASSER decoy sets and a 25% reduction in values of Cα root-mean-square deviation. The hybrid model succeeded in obtaining a sharper funnel to low-energy states for a modeled target than the implicit solvent SGLD model; yet, structure identification remained roughly the same. Without meeting a threshold of near-native packing of side chains, the T-ReX simulations degrade the accuracy of the decoys, and subsequently, refinement becomes tantamount to the protein folding problem.
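The native-contact measure that this abstract relies on can be sketched as below. The 8 Å distance cutoff, the minimum sequence separation of three residues, and the use of one point per residue are assumptions made for illustration; the abstract does not state the contact definition used by the authors.

```python
import math


def contact_pairs(coords, cutoff=8.0, min_separation=3):
    """Set of residue index pairs (i, j) closer than cutoff, skipping near neighbours."""
    pairs = set()
    for i in range(len(coords)):
        for j in range(i + min_separation, len(coords)):
            if math.dist(coords[i], coords[j]) < cutoff:
                pairs.add((i, j))
    return pairs


def fraction_native_contacts(model, native, cutoff=8.0, min_separation=3):
    """Fraction of the native contacts that are also present in the model."""
    native_pairs = contact_pairs(native, cutoff, min_separation)
    if not native_pairs:
        raise ValueError("native structure has no contacts under this definition")
    model_pairs = contact_pairs(model, cutoff, min_separation)
    return len(native_pairs & model_pairs) / len(native_pairs)


if __name__ == "__main__":
    # Hypothetical four-residue example with a single native contact (0, 3).
    native = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (3.8, 3.8, 0.0)]
    unfolded = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (30.0, 0.0, 0.0)]
    print(fraction_native_contacts(native, native), fraction_native_contacts(unfolded, native))
```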
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, USAMRIID, Frederick, Maryland 21702, USA.
|
39
|
Kuroda D, Shirai H, Jacobson MP, Nakamura H. Computer-aided antibody design. Protein Eng Des Sel 2012; 25:507-21. [PMID: 22661385 PMCID: PMC3449398 DOI: 10.1093/protein/gzs024] [Citation(s) in RCA: 169] [Impact Index Per Article: 14.1] [Received: 04/14/2012] [Revised: 04/14/2012] [Accepted: 04/19/2012] [Indexed: 11/12/2022] Open
Abstract
Recent clinical trials using antibodies with low toxicity and high efficiency have raised expectations for the development of next-generation protein therapeutics. However, the process of obtaining therapeutic antibodies remains time consuming and empirical. This review summarizes recent progress in the field of computer-aided antibody development, focusing mainly on antibody modeling, which is divided essentially into two parts: (i) modeling the antigen-binding site, also called the complementarity determining regions (CDRs), and (ii) predicting the relative orientations of the variable heavy (VH) and light (VL) chains. Among the six CDR loops, the greatest challenge is predicting the conformation of CDR-H3, which is the most important in antigen recognition. Further computational methods could be used in drug development based on crystal structures or homology models, including antibody-antigen docking and energy calculations with approximate potential functions. These methods should guide experimental studies to improve the affinities and physicochemical properties of antibodies. Finally, several successful examples of in silico structure-based antibody designs are reviewed. We also briefly review structure-based antigen or immunogen design, with application to rational vaccine development.
Affiliation(s)
- Daisuke Kuroda
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, Japan.
|
40
|
Goldfeld DA, Zhu K, Beuming T, Friesner RA. Loop prediction for a GPCR homology model: Algorithms and results. Proteins 2012; 81:214-28. [DOI: 10.1002/prot.24178] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Received: 05/22/2012] [Revised: 08/13/2012] [Accepted: 08/25/2012] [Indexed: 11/07/2022]
|
41
|
Flick J, Tristram F, Wenzel W. Modeling loop backbone flexibility in receptor-ligand docking simulations. J Comput Chem 2012; 33:2504-15. [DOI: 10.1002/jcc.23087] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Received: 02/05/2012] [Revised: 06/15/2012] [Accepted: 07/09/2012] [Indexed: 12/20/2022]
|
42
|
Abstract
The prediction of loop structures is considered one of the main challenges in the protein folding problem. Regardless of the dependence of the overall algorithm on the protein data bank, the flexibility of loop regions dictates the need for special attention to their structures. In this article, we present algorithms for loop structure prediction with fixed stem and flexible stem geometry. In the flexible stem geometry problem, only the secondary structure of three stem residues on either side of the loop is known. In the fixed stem geometry problem, the structure of the three stem residues on either side of the loop is also known. Initial loop structures are generated using a probability database for the flexible stem geometry problem, and using torsion angle dynamics for the fixed stem geometry problem. Three rotamer optimization algorithms are introduced to alleviate steric clashes between the generated backbone structures and the side chain rotamers. The structures are optimized by energy minimization using an all-atom force field. The optimized structures are clustered using a traveling salesman problem-based clustering algorithm. The structures in the densest clusters are then utilized to refine dihedral angle bounds on all amino acids in the loop. The entire procedure is carried out for a number of iterations, leading to improved structure prediction and refined dihedral angle bounds. The algorithms presented in this article have been tested on 3190 loops from the PDBSelect25 data set and on targets from the recently concluded CASP9 community-wide experiment.
Affiliation(s)
- A. Subramani
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544-5263, U.S.A
- C. A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544-5263, U.S.A
|
43
|
St-Pierre JF, Mousseau N. Large loop conformation sampling using the activation relaxation technique, ART-nouveau method. Proteins 2012; 80:1883-94. [PMID: 22488731 DOI: 10.1002/prot.24085] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 11/29/2011] [Revised: 03/19/2011] [Accepted: 03/30/2012] [Indexed: 12/25/2022]
Abstract
We present an adaptation of the ART-nouveau energy surface sampling method to the problem of loop structure prediction. This method, previously used to study protein folding pathways and peptide aggregation, is well suited to the problem of sampling the conformation space of large loops because it targets probable folding pathways instead of exhaustively sampling that space. The number of sampled conformations needed by ART-nouveau to find the global energy minimum for a loop was found to scale linearly with the sequence length of the loop for loops between 8 and about 20 amino acids. Because the computational cost of sampling each new conformation also scales linearly with the loop sequence length, we estimate that the total computational cost of sampling larger loops scales quadratically, compared to the exponential scaling of exhaustive search methods.
Affiliation(s)
- Jean-François St-Pierre
- Département de Physique and Regroupement Québécois sur les Matériaux de Pointe, Université de Montréal, CP 6128, Succursale Centre-Ville, Montréal, Québec, Canada H3C 3J7
|
44
|
Gront D, Kmiecik S, Blaszczyk M, Ekonomiuk D, Koliński A. Optimization of protein models. Wiley Interdiscip Rev Comput Mol Sci 2012. [DOI: 10.1002/wcms.1090] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Indexed: 12/25/2022]
Affiliation(s)
- Dominik Gront
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
- Sebastian Kmiecik
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
- Maciej Blaszczyk
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
- Dariusz Ekonomiuk
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
- Andrzej Koliński
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland
|
45
|
|
46
|
Skliros A, Zimmermann MT, Chakraborty D, Saraswathi S, Katebi AR, Leelananda SP, Kloczkowski A, Jernigan RL. The importance of slow motions for protein functional loops. Phys Biol 2012; 9:014001. [PMID: 22314977 DOI: 10.1088/1478-3975/9/1/014001] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Indexed: 11/12/2022]
Abstract
Loops in proteins that connect secondary structures, such as alpha-helices and beta-sheets, are often on the surface and may play a critical role in some functions of a protein. The mobility of loops is central to the motional freedom and flexibility requirements of active-site loops. The structures and behaviors of loops have not been studied much in the context of the whole structure and its overall motions, especially how these might be coupled. Here we investigate loop motions by using coarse-grained structures (Cα atoms only) to solve the motions of the system by applying Lagrange equations with elastic network models, to learn which loops move in an independent fashion and which move in coordination with domain motions, faster and slower, respectively. The normal modes of the system are calculated using eigen-decomposition of the stiffness matrix. The contribution of individual modes and groups of modes is investigated for their effects on all residues in each loop by using Fourier analyses. Our results indicate that, overall, functional sets of loops move in ways similar to the whole structure. However, only relatively few loops move in coordination with the dominant slow modes of motion, and these are often closely related to function.
Affiliation(s)
- Aris Skliros
- L. H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, IA 50011, USA. Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA 50011, USA
|
47
|
Tripathy C, Zeng J, Zhou P, Donald BR. Protein loop closure using orientational restraints from NMR data. Proteins 2012; 80:433-53. [PMID: 22161780 PMCID: PMC3305838 DOI: 10.1002/prot.23207] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Received: 03/24/2011] [Revised: 08/23/2011] [Accepted: 09/06/2011] [Indexed: 11/12/2022]
Abstract
Protein loops often play important roles in biological functions. Modeling loops accurately is crucial to determining the functional specificity of a protein. Despite the recent progress in loop prediction approaches, which led to a number of algorithms over the past decade, few rigorous algorithmic approaches exist to model protein loops using global orientational restraints, such as those obtained from residual dipolar coupling (RDC) data in solution nuclear magnetic resonance (NMR) spectroscopy. In this article, we present a novel, sparse data, RDC-based algorithm, which exploits the mathematical interplay between RDC-derived sphero-conics and protein kinematics, and formulates the loop structure determination problem as a system of low-degree polynomial equations that can be solved exactly, in closed-form. The polynomial roots, which encode the candidate conformations, are searched systematically, using provable pruning strategies that triage the vast majority of conformations, to enumerate or prune all possible loop conformations consistent with the data; therefore, completeness is ensured. Results on experimental RDC datasets for four proteins, including human ubiquitin, FF2, DinI, and GB3, demonstrate that our algorithm can compute loops with higher accuracy, a three- to six-fold improvement in backbone RMSD, versus those obtained by traditional structure determination protocols on the same data. Excellent results were also obtained on synthetic RDC datasets for protein loops of length 4, 8, and 12 used in previous studies. These results suggest that our algorithm can be successfully applied to determine protein loop conformations, and hence, will be useful in high-resolution protein backbone structure determination, including loops, from sparse NMR data.
Affiliation(s)
- Jianyang Zeng
- Department of Computer Science, Duke University, Durham, NC 27708, USA
- Pei Zhou
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA
- Bruce Randall Donald
- Department of Computer Science, Duke University, Durham, NC 27708, USA
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA
|
48
|
Toward ab initio refinement of protein X-ray crystal structures: interpreting and correlating structural fluctuations. Theor Chem Acc 2012. [DOI: 10.1007/s00214-011-1076-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 10/14/2022]
|
49
|
Abstract
Accurate all-atom energy functions are crucial for successful high-resolution protein structure prediction. In this chapter, we review both physics-based force fields and knowledge-based potentials used in protein modeling. Because it is important to calculate the energy as accurately as possible given the limitations imposed by sampling convergence, we discuss the different components of the energy and the force fields that represent them to varying degrees of detail and complexity. Force fields using Cartesian as well as torsion angle representations of protein geometry are covered. Since solvent is important for protein energetics, different aqueous and membrane solvation models for protein simulations are also described. Finally, we summarize recent progress in protein structure refinement using new force fields.
|
50
|
Wickstrom L, Gallicchio E, Levy RM. The linear interaction energy method for the prediction of protein stability changes upon mutation. Proteins 2011; 80:111-25. [PMID: 22038697 DOI: 10.1002/prot.23168] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Received: 05/02/2011] [Revised: 07/28/2011] [Accepted: 08/06/2011] [Indexed: 12/25/2022]
Abstract
The coupling of protein energetics and sequence changes is a critical aspect of computational protein design, as well as for the understanding of protein evolution, human disease, and drug resistance. To study the molecular basis for this coupling, computational tools must be sufficiently accurate and computationally inexpensive enough to handle large amounts of sequence data. We have developed a computational approach based on the linear interaction energy (LIE) approximation to predict the changes in the free-energy of the native state induced by a single mutation. This approach was applied to a set of 822 mutations in 10 proteins which resulted in an average unsigned error of 0.82 kcal/mol and a correlation coefficient of 0.72 between the calculated and experimental ΔΔG values. The method is able to accurately identify destabilizing hot spot mutations; however, it has difficulty in distinguishing between stabilizing and destabilizing mutations because of the distribution of stability changes for the set of mutations used to parameterize the model. In addition, the model also performs quite well in initial tests on a small set of double mutations. On the basis of these promising results, we can begin to examine the relationship between protein stability and fitness, correlated mutations, and drug resistance.
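The two accuracy measures quoted in this abstract, the average unsigned error and the correlation coefficient between calculated and experimental ΔΔG values, can be computed as in the sketch below. The function names and the sample numbers are hypothetical and serve only to show the arithmetic; they are not data from the study, and the correlation is assumed here to be the Pearson coefficient.

```python
import math


def average_unsigned_error(predicted, observed):
    """Mean absolute difference between predicted and observed values."""
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(predicted)


def pearson_r(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / math.sqrt(var_p * var_o)


if __name__ == "__main__":
    # Made-up ΔΔG values in kcal/mol, for illustration only.
    calculated = [0.4, 1.9, -0.3, 2.8]
    experimental = [0.9, 1.5, 0.2, 3.6]
    print(average_unsigned_error(calculated, experimental))
    print(pearson_r(calculated, experimental))
```

The two measures are complementary: a set of predictions can correlate perfectly with experiment while still carrying a large systematic error.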
Affiliation(s)
- Lauren Wickstrom
- Department of Chemistry and Chemical Biology, BioMaPS Institute for Quantitative Biology, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
|