1
|
Liang S, Zhang C, Zhu M. Ab Initio Prediction of 3-D Conformations for Protein Long Loops with High Accuracy and Applications to Antibody CDRH3 Modeling. J Chem Inf Model 2023; 63:7568-7577. [PMID: 38018130 DOI: 10.1021/acs.jcim.3c01051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
Residue-level potentials of mean force were widely used for protein backbone refinements to avoid simultaneous sampling of side-chain conformations. The interaction energy between the reduced side chains and backbone atoms was not considered explicitly. In this study, we developed novel methods to calculate the residue-atom interaction energy in combination with atomic and residue-level terms. The parameters were optimized step by step to remove the overcounting or overlap problem between different energy terms. The mixing energy functions were then used to evaluate the generated backbone conformations at the initial sampling stage of protein loop modeling (OSCAR-loop), including the interaction energy between the reduced loop residues and full atoms of the protein framework. The accuracies of top-ranked decoys were 1.18 and 2.81 Å for 8-residue and 12-residue loops, respectively. We then selected diverse decoys for side-chain modeling, backbone refinement, and energy minimization. The procedure was repeated multiple times to select one prediction with the lowest energy. Consequently, we obtained an accuracy of 0.74 Å for a prevailing test set of 12-residue loops, compared with >1.4 Å reported by other researchers. The OSCAR-loop was also effective for modeling the H3 loops of antibody complementary determining regions (CDRs) in the crystal environment. The prediction accuracy of OSCAR-loop (1.74 Å) was better than the accuracy of the Rosetta NGK method (3.11 Å) or those achieved by deep learning methods (>2.2 Å) for the CDRH3 loops of 49 targets in the Rosetta antibody benchmark. The performance of OSCAR-loop in a model environment was also discussed.
Collapse
Affiliation(s)
- Shide Liang
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
- Department of Research and Development, Bio-Thera Solutions, Guangzhou 510530, P. R. China
| | - Chi Zhang
- School of Biological Sciences, University of Nebraska, Lincoln, Nebraska 68588, United States
| | - Mingfu Zhu
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
| |
Collapse
|
2
|
Chen J, Woldring DR, Huang F, Huang X, Wei GW. Topological deep learning based deep mutational scanning. Comput Biol Med 2023; 164:107258. [PMID: 37506452 PMCID: PMC10528359 DOI: 10.1016/j.compbiomed.2023.107258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 06/28/2023] [Accepted: 07/08/2023] [Indexed: 07/30/2023]
Abstract
High-throughput deep mutational scanning (DMS) experiments have significantly impacted protein engineering, drug discovery, immunology, cancer biology, and evolutionary biology by enabling the systematic understanding of protein functions. However, the mutational space associated with proteins is astronomically large, making it overwhelming for current experimental capabilities. Therefore, alternative methods for DMS are imperative. We propose a topological deep learning (TDL) paradigm to facilitate in silico DMS. We utilize a new topological data analysis (TDA) technique based on the persistent spectral theory, also known as persistent Laplacian, to capture both topological invariants and the homotopic shape evolution of data. To validate our TDL-DMS model, we use SARS-CoV-2 datasets and show excellent accuracy and reliability for binding interface mutations. This finding is significant for SARS-CoV-2 variant forecasting and designing effective antibodies and vaccines. Our proposed model is expected to have a significant impact on drug discovery, vaccine design, precision medicine, and protein engineering.
Collapse
Affiliation(s)
- Jiahui Chen
- Department of Mathematical Sciences, University of Arkansas, Fayetteville, AR 72701, USA
| | - Daniel R Woldring
- Department of Chemical Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Faqing Huang
- Department of Chemistry and Biochemistry, University of Southern Mississippi, Hattiesburg, MS 39406, USA
| | - Xuefei Huang
- Department of Chemistry, Michigan State University, MI 48824, USA; Department of Biomedical Engineering, Michigan State University, East Lansing, MI 48824, USA; The Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA; Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA; Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
3
|
Kong L, Ju F, Zhang H, Sun S, Bu D. FALCON2: a web server for high-quality prediction of protein tertiary structures. BMC Bioinformatics 2021; 22:439. [PMID: 34525939 PMCID: PMC8444573 DOI: 10.1186/s12859-021-04353-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 09/01/2021] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN) and an ab initio prediction approach (called ProFOLD). Briefly speaking, ProALIGN aligns a target protein with templates through exploiting the patterns of context-specific alignment motifs and then builds the final structure with reference to the homologous templates. In contrast, ProFOLD uses an end-to-end neural network to estimate inter-residue distances of target proteins and builds structures that satisfy these distance constraints. These two approaches emphasize different characteristics of target proteins: ProALIGN exploits structure information of homologous templates of target proteins while ProFOLD exploits the co-evolutionary information carried by homologous protein sequences. Recent progress has shown that the combination of template-based modeling and ab initio approaches is promising. RESULTS In the study, we present FALCON2, a web server that integrates ProALIGN and ProFOLD to provide high-quality protein structure prediction service. For a target protein, FALCON2 executes ProALIGN and ProFOLD simultaneously to predict possible structures and selects the most likely one as the final prediction result. We evaluated FALCON2 on widely-used benchmarks, including 104 CASP13 (the 13th Critical Assessment of protein Structure Prediction) targets and 91 CASP14 targets. In-depth examination suggests that when high-quality templates are available, ProALIGN is superior to ProFOLD and in other cases, ProFOLD shows better performance. By integrating these two approaches with different emphasis, FALCON2 server outperforms the two individual approaches and also achieves state-of-the-art performance compared with existing approaches. CONCLUSIONS By integrating template-based modeling and ab initio approaches, FALCON2 provides an easy-to-use and high-quality protein structure prediction service for the community and we expect it to enable insights into a deep understanding of protein functions.
Collapse
Affiliation(s)
- Lupeng Kong
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, 100190 Beijing, China
- University of Chinese Academy of Sciences, 100049 Beijing, China
| | - Fusong Ju
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, 100190 Beijing, China
- University of Chinese Academy of Sciences, 100049 Beijing, China
| | - Haicang Zhang
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, 100190 Beijing, China
- University of Chinese Academy of Sciences, 100049 Beijing, China
| | - Shiwei Sun
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, 100190 Beijing, China
- University of Chinese Academy of Sciences, 100049 Beijing, China
| | - Dongbo Bu
- Key Lab of Intelligent Information Processing, Big-Data Academy, Institute of Computing Technology, Chinese Academy of Sciences, 100190 Beijing, China
- University of Chinese Academy of Sciences, 100049 Beijing, China
| |
Collapse
|
4
|
Zou T, Woodrum BW, Halloran N, Campitelli P, Bobkov AA, Ghirlanda G, Ozkan SB. Local Interactions That Contribute Minimal Frustration Determine Foldability. J Phys Chem B 2021; 125:2617-2626. [PMID: 33687216 DOI: 10.1021/acs.jpcb.1c00364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Earlier experiments suggest that the evolutionary information (conservation and coevolution) encoded in protein sequences is necessary and sufficient to specify the fold of a protein family. However, there is no computational work to quantify the effect of such evolutionary information on the folding process. Here we explore the role of early folding steps for sequences designed using coevolution and conservation through a combination of computational and experimental methods. We simulated a repertoire of native and designed WW domain sequences to analyze early local contact formation and found that the N-terminal β-hairpin turn would not form correctly due to strong non-native local contacts in unfoldable sequences. Through a maximum likelihood approach, we identified five local contacts that play a critical role in folding, suggesting that a small subset of amino acid pairs can be used to solve the "needle in the haystack" problem to design foldable sequences. Thus, using the contact probability of those five local contacts that form during the early stage of folding, we built a classification model that predicts the foldability of a WW sequence with 81% accuracy. This classification model was used to redesign WW domain sequences that could not fold due to frustration and make them foldable by introducing a few mutations that led to the stabilization of these critical local contacts. The experimental analysis shows that a redesigned sequence folds and binds to polyproline peptides with a similar affinity as those observed for native WW domains. Overall, our analysis shows that evolutionary-designed sequences should not only satisfy the folding stability but also ensure a minimally frustrated folding landscape.
Collapse
Affiliation(s)
- Taisong Zou
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| | - Brian W Woodrum
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Nicholas Halloran
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| | - Andrey A Bobkov
- Conrad Prebys Center for Chemical Genomics, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, California 92037, United States
| | - Giovanna Ghirlanda
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Sefika Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| |
Collapse
|
5
|
Kesharwani A, Chaurasia DK, Katara P. Repurposing of FDA approved drugs and their validation against potential drug targets for Salmonella enterica through molecular dynamics simulation. J Biomol Struct Dyn 2021; 40:6255-6271. [PMID: 33525976 DOI: 10.1080/07391102.2021.1880482] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Salmonella is a widely distributed pathogen causing infection of intestinal tract, typhoid, and paratyphoid fever. Number of drugs was developed against salmonella, but in the last few decades due to the emergence of drug resistant strains, most of these drugs became dormant. As a result Salmonellosis emerges as a trivial cause of human mortality worldwide; therefore, there is an urgent need for unexploited drug targets and drugs to treat Salmonellosis. As development of new drug molecules is very time consuming and costly, drug repurposing is in consideration as a better alternative. With the aim to identify a new drug molecule against the Salmonella through repurposing approach, we utilized 14 well reported druggable targets known to play a vital role in the life cycle of pathogens. These targets were used to screen DrugBank and got 53 FDA approved drugs against them. To find the interaction between considered target proteins and screened drugs, molecular docking was performed. Fourteen docked drug-target complexes with reasonable binding affinities were subjected to Molecular Dynamics Simulation (MDS) at 150 ns, using Amber18. At the end MMPBSA and MMGBSA calculations were performed for all stable complexes and finally, got 3 precise and favourable complexes, i.e. ArcB-Cefpiramide, MrcB-Cefoperazone, and PhoQ-Carindacillin. Rigorous structural and energetic analysis for these complexes validates the potential of drug molecules to act as therapeutic drugs against Salmonella enterica. With this study we hypothesize that the drugs Cefpiramide (DB00430), Cefoperazone (DB01329) and Carindacillin (DB09319) will be the good repurposed-drugs for the treatment of Salmonellosis.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Akanksha Kesharwani
- Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, Uttar Pradesh, India
| | - Dheeraj Kumar Chaurasia
- Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, Uttar Pradesh, India.,Supercomputing Facility for Bioinformatics & Computational Biology, Indian Institute of Technology Delhi, New Delhi, India
| | - Pramod Katara
- Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, Uttar Pradesh, India
| |
Collapse
|
6
|
Chen G, Seukep AJ, Guo M. Recent Advances in Molecular Docking for the Research and Discovery of Potential Marine Drugs. Mar Drugs 2020; 18:md18110545. [PMID: 33143025 PMCID: PMC7692358 DOI: 10.3390/md18110545] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 10/27/2020] [Accepted: 10/28/2020] [Indexed: 12/28/2022] Open
Abstract
Marine drugs have long been used and exhibit unique advantages in clinical practices. Among the marine drugs that have been approved by the Food and Drug Administration (FDA), the protein–ligand interactions, such as cytarabine–DNA polymerase, vidarabine–adenylyl cyclase, and eribulin–tubulin complexes, are the important mechanisms of action for their efficacy. However, the complex and multi-targeted components in marine medicinal resources, their bio-active chemical basis, and mechanisms of action have posed huge challenges in the discovery and development of marine drugs so far, which need to be systematically investigated in-depth. Molecular docking could effectively predict the binding mode and binding energy of the protein–ligand complexes and has become a major method of computer-aided drug design (CADD), hence this powerful tool has been widely used in many aspects of the research on marine drugs. This review introduces the basic principles and software of the molecular docking and further summarizes the applications of this method in marine drug discovery and design, including the early virtual screening in the drug discovery stage, drug target discovery, potential mechanisms of action, and the prediction of drug metabolism. In addition, this review would also discuss and prospect the problems of molecular docking, in order to provide more theoretical basis for clinical practices and new marine drug research and development.
Collapse
Affiliation(s)
- Guilin Chen
- Key Laboratory of Plant Germplasm Enhancement & Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; (G.C.); (A.J.S.)
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
- Innovation Academy for Drug Discovery and Development, Chinese Academy of Sciences, Shanghai 201203, China
| | - Armel Jackson Seukep
- Key Laboratory of Plant Germplasm Enhancement & Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; (G.C.); (A.J.S.)
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
- Innovation Academy for Drug Discovery and Development, Chinese Academy of Sciences, Shanghai 201203, China
- Department of Biomedical Sciences, Faculty of Health Sciences, University of Buea, P.O. Box 63 Buea, Cameroon
| | - Mingquan Guo
- Key Laboratory of Plant Germplasm Enhancement & Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; (G.C.); (A.J.S.)
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
- Innovation Academy for Drug Discovery and Development, Chinese Academy of Sciences, Shanghai 201203, China
- Correspondence: ; Tel.: +86-27-8770-0850
| |
Collapse
|
7
|
Jiménez KM, Morel A, Parada-Niño L, Alejandra González-Rodriguez M, Flórez S, Bolívar-Salazar D, Becerra-Bayona S, Aguirre-García A, Gómez-Murcia T, Fernanda Castillo L, Carlosama C, Ardila J, Vaiman D, Serrano N, Laissue P. Identifying new potential genetic biomarkers for HELLP syndrome using massive parallel sequencing. Pregnancy Hypertens 2020; 22:181-190. [PMID: 33059327 DOI: 10.1016/j.preghy.2020.09.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Revised: 07/20/2020] [Accepted: 09/05/2020] [Indexed: 12/19/2022]
Abstract
BACKGROUND Preeclampsia (PE) is a frequently occurring multisystemic disease affecting ~5% of pregnancies. PE patients may develop HELLP syndrome (haemolysis, elevated liver enzymes, and low platelet), a mother and foetus life-threatening condition. Research into HELLP's genetic origin has been relatively unsuccessful, mainly because normal placental function and blood pressure regulation involve the fine-regulation of hundreds of genes. OBJECTIVE To identify new genes and mutations constituting potential biomarkers for HELLP syndrome. STUDY DESIGN The present case-control study involved whole-exome sequencing of 79 unrelated HELLP women. Candidate variants were screened in a control population constituted by 176 individuals. Stringent bioinformatics filters were used for selecting potentially etiological sequence variants in a subset of 487 genes. We used robust in silico mutation modelling for predicting the potential effect on protein structure. RESULTS We identified numerous sequence variants in genes related to angiogenesis/coagulation/blood pressure regulation, cell differentiation/communication/adhesion, cell cycle and transcriptional gene regulation, extracellular matrix biology, lipid metabolism and immunological response. Five sequence variants generated premature stop codons in genes playing an essential role in placental physiology (STOX1, PDGFD, IGF2, MMP1 and DNAH11). Six variants (ERAP1- p.Ile915Thr, ERAP2- p.Leu837Ser, COMT-p.His192Gln, CSAD-p.Pro418Ser, CDH1- p.Ala298Thr and CCR2-p.Met249Lys) led to destabilisation of protein structure as they had significant energy and residue interaction-related changes. We identified at least two mutations in 57% of patients, arguing in favour of a polygenic origin for the HELLP syndrome. CONCLUSION Our results provide novel evidence regarding PE/HELLP's genetic origin, leading to new biomarkers, having potential clinical usefulness, being proposed.
Collapse
Affiliation(s)
- Karen Marcela Jiménez
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Adrien Morel
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Laura Parada-Niño
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - María Alejandra González-Rodriguez
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Stephanie Flórez
- Hospital Universitario Mayor Méderi, Universidad del Rosario, Bogotá, Colombia
| | - David Bolívar-Salazar
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | | | - Angel Aguirre-García
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia; Hospital Universitario Mayor Méderi, Universidad del Rosario, Bogotá, Colombia
| | - Tatiana Gómez-Murcia
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia; Hospital Universitario Mayor Méderi, Universidad del Rosario, Bogotá, Colombia
| | - Luisa Fernanda Castillo
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Carolina Carlosama
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Javier Ardila
- Hospital Universitario Mayor Méderi, Universidad del Rosario, Bogotá, Colombia
| | - Daniel Vaiman
- Inserm U1016, CNRS UMR8104, Institut Cochin, équipe FGTB, 24, rue du faubourg Saint-Jacques, 75014 Paris, France
| | - Norma Serrano
- Research Centre, Fundación Cardiovascular de Colombia (FCV), Bucaramanga, Colombia
| | - Paul Laissue
- Center For Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia; Inserm U1016, CNRS UMR8104, Institut Cochin, équipe FGTB, 24, rue du faubourg Saint-Jacques, 75014 Paris, France; Orphan Diseases Group, Biopas Laboratoires, Bogotá, Colombia.
| |
Collapse
|
8
|
Mitusińska K, Skalski T, Góra A. Simple Selection Procedure to Distinguish between Static and Flexible Loops. Int J Mol Sci 2020; 21:ijms21072293. [PMID: 32225102 PMCID: PMC7177474 DOI: 10.3390/ijms21072293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 03/22/2020] [Accepted: 03/24/2020] [Indexed: 12/02/2022] Open
Abstract
Loops are the most variable and unorganized elements of the secondary structure of proteins. Their ability to shift their shape can play a role in the binding of small ligands, enzymatic catalysis, or protein–protein interactions. Due to the loop flexibility, the positions of their residues in solved structures show the largest B-factors, or in a worst-case scenario can be unknown. Based on the loops’ movements’ timeline, they can be divided into slow (static) and fast (flexible). Although most of the loops that are missing in experimental structures belong to the flexible loops group, the computational tools for loop reconstruction use a set of static loop conformations to predict the missing part of the structure and evaluate the model. We believe that these two loop types can adopt different conformations and that using scoring functions appropriate for static loops is not sufficient for flexible loops. We showed that common model evaluation methods, are insufficient in the case of flexible solvent-exposed loops. Instead, we recommend using the potential energy to evaluate such loop models. We provide a novel model selection method based on a set of geometrical parameters to distinguish between flexible and static loops without the use of molecular dynamics simulations. We have also pointed out the importance of water network and interactions with the solvent for the flexible loop modeling.
Collapse
Affiliation(s)
- Karolina Mitusińska
- Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
| | - Tomasz Skalski
- Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
| | - Artur Góra
- Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
- Correspondence: ; Tel.: +48-322371659
| |
Collapse
|
9
|
Liu S, Xiang X, Gao X, Liu H. Neighborhood Preference of Amino Acids in Protein Structures and its Applications in Protein Structure Assessment. Sci Rep 2020; 10:4371. [PMID: 32152349 PMCID: PMC7062742 DOI: 10.1038/s41598-020-61205-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 02/24/2020] [Indexed: 12/02/2022] Open
Abstract
Amino acids form protein 3D structures in unique manners such that the folded structure is stable and functional under physiological conditions. Non-specific and non-covalent interactions between amino acids exhibit neighborhood preferences. Based on structural information from the protein data bank, a statistical energy function was derived to quantify amino acid neighborhood preferences. The neighborhood of one amino acid is defined by its contacting residues, and the energy function is determined by the neighboring residue types and relative positions. The neighborhood preference of amino acids was exploited to facilitate structural quality assessment, which was implemented in the neighborhood preference program NEPRE. The source codes are available via https://github.com/LiuLab-CSRC/NePre.
Collapse
Affiliation(s)
- Siyuan Liu
- Complex Systems Division, Beijing Computational Science Research Center, Haidian, Beijing, 100193, China
- School of Software Engineering, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Xilun Xiang
- Complex Systems Division, Beijing Computational Science Research Center, Haidian, Beijing, 100193, China
- School of Software Engineering, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Xiang Gao
- Complex Systems Division, Beijing Computational Science Research Center, Haidian, Beijing, 100193, China
- School of Software Engineering, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Haiguang Liu
- Complex Systems Division, Beijing Computational Science Research Center, Haidian, Beijing, 100193, China.
- Physics Department, Beijing Normal University, Haidian, Beijing, 100875, China.
| |
Collapse
|
10
|
A topology-based network tree for the prediction of protein-protein binding affinity changes following mutation. NAT MACH INTELL 2020; 2:116-123. [PMID: 34170981 PMCID: PMC7223817 DOI: 10.1038/s42256-020-0149-6] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 01/10/2020] [Indexed: 12/14/2022]
Abstract
The ability to predict protein-protein interactions is crucial to our understanding of a wide range of biological activities and functions in the human body, and for guiding drug discovery. Despite considerable efforts to develop suitable computational methods, predicting protein-protein interaction binding affinity changes following mutation (ΔΔG) remains a severe challenge. Algebraic topology, a champion in recent worldwide competitions for protein-ligand binding affinity predictions, is a promising approach to simplifying the complexity of biological structures. Here we introduce element- and site-specific persistent homology (a new branch of algebraic topology) to simplify the structural complexity of protein-protein complexes and embed crucial biological information into topological invariants. We also propose a new deep learning algorithm called NetTree to take advantage of convolutional neural networks and gradient-boosting trees. A topology-based network tree is constructed by integrating the topological representation and NetTree for predicting protein-protein interaction ΔΔG. Tests on major benchmark datasets indicate that the proposed topology-based network tree is an important improvement over the current state of the art in predicting ΔΔG.
Collapse
|
11
|
Haas J, Gumienny R, Barbato A, Ackermann F, Tauriello G, Bertoni M, Studer G, Smolinski A, Schwede T. Introducing "best single template" models as reference baseline for the Continuous Automated Model Evaluation (CAMEO). Proteins 2019; 87:1378-1387. [PMID: 31571280 DOI: 10.1002/prot.25815] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2019] [Revised: 09/10/2019] [Accepted: 09/13/2019] [Indexed: 12/17/2022]
Abstract
Critical blind assessment of structure prediction techniques is crucial for the scientific community to establish the state of the art, identify bottlenecks, and guide future developments. In Critical Assessment of Techniques in Structure Prediction (CASP), human experts assess the performance of participating methods in relation to the difficulty of the prediction task in a biennial experiment on approximately 100 targets. Yet, the development of automated computational modeling methods requires more frequent evaluation cycles and larger sets of data. The "Continuous Automated Model EvaluatiOn (CAMEO)" platform complements CASP by conducting fully automated blind prediction evaluations based on the weekly pre-release of sequences of those structures, which are going to be published in the next release of the Protein Data Bank (PDB). Each week, CAMEO publishes benchmarking results for predictions corresponding to a set of about 20 targets collected during a 4-day prediction window. CAMEO benchmarking data are generated consistently for all methods at the same point in time, enabling developers to cross-validate their method's performance, and referring to their results in publications. Many successful participants of CASP have used CAMEO-either by directly benchmarking their methods within the system or by comparing their own performance to CAMEO reference data. CAMEO offers a variety of scores reflecting different aspects of structure modeling, for example, binding site accuracy, homo-oligomer interface quality, or accuracy of local model confidence estimates. By introducing the "bestSingleTemplate" method based on structure superpositions as a reference for the accuracy of 3D modeling predictions, CAMEO facilitates objective comparison of techniques and fosters the development of advanced methods.
Collapse
Affiliation(s)
- Juergen Haas
- Computational Structural Biology, University of Basel, Switzerland
| | - Rafal Gumienny
- Computational Structural Biology, Swiss Institute of Bioinformatics, Switzerland
| | - Alessandro Barbato
- Computational Structural Biology, Universitat Basel Department Biozentrum, Switzerland
| | - Flavio Ackermann
- Computational Structural Biology, University of Basel, Switzerland
| | | | - Martino Bertoni
- Computational Structural Biology, Universitat Basel Department Biozentrum, Switzerland
| | - Gabriel Studer
- Computational Structural Biology, University of Basel, Switzerland
| | - Anna Smolinski
- Computational Structural Biology, University of Basel, Switzerland
| | - Torsten Schwede
- Computational Structural Biology, University of Basel, Switzerland
| |
Collapse
|
12
|
Ganesan P, Ramalingam R. Investigation of structural stability and functionality of homodimeric gramicidin towards peptide‐based drug: a molecular simulation approach. J Cell Biochem 2018; 120:4903-4911. [DOI: 10.1002/jcb.27765] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 09/06/2018] [Indexed: 12/17/2022]
Affiliation(s)
- Pavithrra Ganesan
- Bioinformatics Lab, Department of Biotechnology, School of Biosciences and Technology Vellore Institute of Technology (VIT) Vellore India
| | - Rajasekaran Ramalingam
- Bioinformatics Lab, Department of Biotechnology, School of Biosciences and Technology Vellore Institute of Technology (VIT) Vellore India
| |
Collapse
|
13
|
Zheng Z, Pei J, Bansal N, Liu H, Song LF, Merz KM. Generation of Pairwise Potentials Using Multidimensional Data Mining. J Chem Theory Comput 2018; 14:5045-5067. [PMID: 30183299 DOI: 10.1021/acs.jctc.8b00516] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The rapid development of molecular structural databases provides the chemistry community access to an enormous array of experimental data that can be used to build and validate computational models. Using radial distribution functions collected from experimentally available X-ray and NMR structures, a number of so-called statistical potentials have been developed over the years using the structural data mining strategy. These potentials have been developed within the context of the two-particle Kirkwood equation by extending its original use for isotropic monatomic systems to anisotropic biomolecular systems. However, the accuracy and the unclear physical meaning of statistical potentials have long formed the central arguments against such methods. In this work, we present a new approach to generate molecular energy functions using structural data mining. Instead of employing the Kirkwood equation and introducing the "reference state" approximation, we model the multidimensional probability distributions of the molecular system using graphical models and generate the target pairwise Boltzmann probabilities using the Bayesian field theory. Different from the current statistical potentials that mimic the "knowledge-based" PMF based on the 2-particle Kirkwood equation, the graphical-model-based structure-derived potential developed in this study focuses on the generation of lower-dimensional Boltzmann distributions of atoms through reduction of dimensionality. We have named this new scoring function GARF, and in this work we focus on the mathematical derivation of our novel approach followed by validation studies on its ability to predict protein-ligand interactions.
Collapse
Affiliation(s)
- Zheng Zheng
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Jun Pei
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Nupur Bansal
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Hao Liu
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Lin Frank Song
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| | - Kenneth M Merz
- Department of Chemistry , Michigan State University , 578 South Shaw Lane , East Lansing , Michigan 48824 , United States
| |
Collapse
|
14
|
Formation of a Secretion-Competent Protein Complex by a Dynamic Wrap-around Binding Mechanism. J Mol Biol 2018; 430:3157-3169. [DOI: 10.1016/j.jmb.2018.07.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Revised: 06/20/2018] [Accepted: 07/10/2018] [Indexed: 11/18/2022]
|
15
|
Zhang Y, Zhang Y, Zhang Y. Transcriptome-wide identification and competitive disruption of sacum-binding partners in human colorectal cancer. J Mol Graph Model 2018; 80:48-51. [PMID: 29328992 DOI: 10.1016/j.jmgm.2017.12.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Human sacum is regulatory adaptor protein involved in cellular signaling network of colorectal cancer. Molecular evidences suggest that the protein is integrated into oncogenic signaling network by binding to SH3-containing proteins through its proline-rich motifs. In this study, we have performed a transcriptome-wide analysis and identification of sacum-binding partners in the genome profile of human colorectal cancer. The sacum-binding potency of SH3-containing proteins found in colorectal cancer was investigated by using bioinformatics modeling and intermolecular binding analysis. With the protocol we were able to predict those high-affinity domain binders of the proline-rich peptides of human sacum in a high-throughput manner, and to analyze sequence-specific interaction in the domain-peptide recognition at molecular level. Consequently, a number of putative domain binders with both high affinity and specificity were identified, from which the Src SH3 domain was selected as a case study and tested for its binding activity towards the sacum peptides. We also designed two peptide variants that may have potent capability to competitively disrupt sacum interaction with its partners.
Collapse
Affiliation(s)
- Yinguang Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Capital Medical University, Beijing 100069, China
| | - Yongwang Zhang
- Beijing Union Second Pharmaceutical Factory, Beijing 102600, China
| | - Yuxiang Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Capital Medical University, Beijing 100069, China; Beijing Key Laboratory for Cancer Invasion and Metastasis Research, Capital Medical University, Beijing 100069, China.
| |
Collapse
|
16
|
Singh A, Kaushik R, Kuntal H, Jayaram B. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4938395. [PMID: 29688373 PMCID: PMC5852996 DOI: 10.1093/database/bay021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Accepted: 02/06/2018] [Indexed: 12/20/2022]
Abstract
The severity of malaria caused by Plasmodium vivax worldwide and its resistance against the available general antimalarial drugs has created an urgent need for a comprehensive insight into its biology and biochemistry for developing some novel potential vaccines and therapeutics. P.vivax comprises 5392 proteins mostly predicted, out of which 4211 are soluble proteins and 2205 of these belong to blood and liver stages of malarial cycle. Presently available public resources report functional annotation (gene ontology) of only 28% (627 proteins) of the enzymatic soluble proteins and experimental structures are determined for only 42 proteins P. vivax proteome. In this milieu of severe paucity of structural and functional data, we have generated structures of 2205 soluble proteins, validated them thoroughly, identified their binding pockets (including active sites) and annotated their function increasing the coverage from the existing 28% to 100%. We have pooled all this information together and created a database christened as PvaxDB, which furnishes extensive sequence, structure, ligand binding site and functional information. We believe PvaxDB could be helpful in identifying novel protein drug targets, expediting development of new drugs to combat malaria. This is also the first attempt to create a reliable comprehensive computational structural repository of all the soluble proteins of P. vivax. Database URL: http://www.scfbio-iitd.res.in/PvaxDB
Collapse
Affiliation(s)
- Ankita Singh
- Department of Bioinformatics, Banasthali Vidyapith, Banasthali 304022, Rajasthan, India.,Supercomputing Facility for Bioinformatics and Computational Biology, IIT Delhi, Delhi, India
| | - Rahul Kaushik
- Supercomputing Facility for Bioinformatics and Computational Biology, IIT Delhi, Delhi, India.,Kusuma School of Biological Sciences, IIT Delhi, Delhi, India
| | - Himani Kuntal
- Department of Bioinformatics, Banasthali Vidyapith, Banasthali 304022, Rajasthan, India
| | - B Jayaram
- Supercomputing Facility for Bioinformatics and Computational Biology, IIT Delhi, Delhi, India.,Kusuma School of Biological Sciences, IIT Delhi, Delhi, India.,Department of Chemistry, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, Delhi, India
| |
Collapse
|
17
|
Jiang L, Tang C, Rao J, Xue Q, Wu H, Wu D, Zhang A, Chen L, Shen Z, Lei L. Systematic identification of the druggable interactions between human protein kinases and naturally occurring compounds in endometriosis. Comput Biol Chem 2017; 71:136-143. [PMID: 29096379 DOI: 10.1016/j.compbiolchem.2017.10.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 10/08/2017] [Accepted: 10/14/2017] [Indexed: 01/31/2023]
Abstract
Diverse kinase signaling pathways have been involved in the pathogenesis of endometriosis (EM), which can be modulated either by directly targeting the hub kinases or by indirectly regulating marginal members in the pathways. Here, a systematic kinase-inhibitor interaction profile was created for 8 naturally occurring compounds against 20 human protein kinases. The compounds are all non-sterid that have been reported as pharmacologically active molecular entities potential for EM therapeutics, while the kinases were curated via gene ontology terms enriched from the gene co-citation network with EM. The resulting profile was analyzed at structural, energetic and dynamic levels to identify druggable kinase-compound interactions. The compounds Gossypol, Curcumin and EGCG showed a similar interaction profile across these kinases; they can bind tightly to the top-listed kinases in gene ontology, while the compounds Marrubiin, Apigenin and DIM were predicted to exhibit generally weak affinity for the 20 curated kinases. The JNK kinase, a MAPK family member, was identified as a putative candidate of druggable target for EM therapeutics; the inhibitory activity of eight naturally occurring compounds as well as a sophisticated kinase inhibitor SP600125 against the JNK was tested using enzymatic activity analysis. As might be expected, the Gossypol and EGCG were determined to have high inhibitory activity at namomolar level (IC50=55 and 94nM, respectively), which are comparable with or better than the positive control SP600125 (IC50=76nM), while other tested compounds exhibited weak inhibition (IC50>100nM) or bad potency (IC50=n.d.) against the kinase.
Collapse
Affiliation(s)
- Lai Jiang
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Chaoliang Tang
- Department of Anesthesiology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China; Department of Anesthesiology, Renmin Hospital of Wuhan University, Wuhan 430060, China
| | - Jie Rao
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Qing Xue
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Hao Wu
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Dabao Wu
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Aijun Zhang
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Ling Chen
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Zhen Shen
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China
| | - Lei Lei
- Department of Obstetrics and Gynecology, Anhui Provincial Hospital of Anhui Medical University, Hefei 230001, China.
| |
Collapse
|
18
|
Zhang Y, Zhang Y, Zhang Y. Transcriptome-wide identification and competitive disruption of sacum-binding partners in human colorectal cancer. J Mol Graph Model 2017; 77:259-262. [PMID: 28898789 DOI: 10.1016/j.jmgm.2017.08.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Revised: 08/23/2017] [Accepted: 08/31/2017] [Indexed: 11/19/2022]
Abstract
Human sacum is regulatory adaptor protein involved in cellular signaling network of colorectal cancer. Molecular evidence suggests that the protein is integrated into oncogenic signaling network by binding to SH3-containing proteins through its proline-rich motifs. In this study, we have performed a transcriptome-wide analysis and identification of sacum-binding partners in the genome profile of human colorectal cancer. The sacum-binding potency of SH3-containing proteins found in colorectal cancer was investigated by using bioinformatics modeling and intermolecular binding analysis. With the protocol we were able to predict those high-affinity domain binders of the proline-rich peptides of human sacum in a high-throughput manner, and to analyze sequence-specific interaction in the domain-peptide recognition at molecular level. Consequently, a number of putative domain binders with both high affinity and specificity were identified, from which the Src SH3 domain was selected as a case study and tested for its binding activity towards the sacum peptides. We also designed two peptide variants that may have potent capability to competitively disrupt sacum interaction with its partners.
Collapse
Affiliation(s)
- Yinguang Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Capital Medical University, Beijing 100069, China
| | - Yongwang Zhang
- Beijing Union Second Pharmaceutical Factory, Beijing 102600, China
| | - Yuxiang Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Capital Medical University, Beijing 100069, China; Beijing Key Laboratory for Cancer Invasion and Metastasis Research, Capital Medical University, Beijing 100069, China.
| |
Collapse
|
19
|
GalaxyDock BP2 score: a hybrid scoring function for accurate protein–ligand docking. J Comput Aided Mol Des 2017. [DOI: 10.1007/s10822-017-0030-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
20
|
Heo S, Lee J, Joo K, Shin HC, Lee J. Protein Loop Structure Prediction Using Conformational Space Annealing. J Chem Inf Model 2017; 57:1068-1078. [DOI: 10.1021/acs.jcim.6b00742] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Seungryong Heo
- School
of Systems Biomedical Science, Soongsil University, Seoul 06978, Korea
| | - Juyong Lee
- Laboratory
of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | | | - Hang-Cheol Shin
- School
of Systems Biomedical Science, Soongsil University, Seoul 06978, Korea
| | | |
Collapse
|
21
|
Marks C, Deane C. Antibody H3 Structure Prediction. Comput Struct Biotechnol J 2017; 15:222-231. [PMID: 28228926 PMCID: PMC5312500 DOI: 10.1016/j.csbj.2017.01.010] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Revised: 01/24/2017] [Accepted: 01/27/2017] [Indexed: 01/20/2023] Open
Abstract
Antibodies are proteins of the immune system that are able to bind to a huge variety of different substances, making them attractive candidates for therapeutic applications. Antibody structures have the potential to be useful during drug development, allowing the implementation of rational design procedures. The most challenging part of the antibody structure to experimentally determine or model is the H3 loop, which in addition is often the most important region in an antibody's binding site. This review summarises the approaches used so far in the pursuit of accurate computational H3 structure prediction.
Collapse
Affiliation(s)
- C. Marks
- Department of Statistics, University of Oxford, 24-29 St Giles', Oxford OX1 3LB, United Kingdom
| | | |
Collapse
|
22
|
Pang YP. FF12MC: A revised AMBER forcefield and new protein simulation protocol. Proteins 2016; 84:1490-516. [PMID: 27348292 PMCID: PMC5129589 DOI: 10.1002/prot.25094] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Revised: 06/16/2016] [Accepted: 06/18/2016] [Indexed: 12/25/2022]
Abstract
Specialized to simulate proteins in molecular dynamics (MD) simulations with explicit solvation, FF12MC is a combination of a new protein simulation protocol employing uniformly reduced atomic masses by tenfold and a revised AMBER forcefield FF99 with (i) shortened CH bonds, (ii) removal of torsions involving a nonperipheral sp(3) atom, and (iii) reduced 1-4 interaction scaling factors of torsions ϕ and ψ. This article reports that in multiple, distinct, independent, unrestricted, unbiased, isobaric-isothermal, and classical MD simulations FF12MC can (i) simulate the experimentally observed flipping between left- and right-handed configurations for C14-C38 of BPTI in solution, (ii) autonomously fold chignolin, CLN025, and Trp-cage with folding times that agree with the experimental values, (iii) simulate subsequent unfolding and refolding of these miniproteins, and (iv) achieve a robust Z score of 1.33 for refining protein models TMR01, TMR04, and TMR07. By comparison, the latest general-purpose AMBER forcefield FF14SB locks the C14-C38 bond to the right-handed configuration in solution under the same protein simulation conditions. Statistical survival analysis shows that FF12MC folds chignolin and CLN025 in isobaric-isothermal MD simulations 2-4 times faster than FF14SB under the same protein simulation conditions. These results suggest that FF12MC may be used for protein simulations to study kinetics and thermodynamics of miniprotein folding as well as protein structure and dynamics. Proteins 2016; 84:1490-1516. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yuan-Ping Pang
- Computer-Aided Molecular Design Laboratory, Mayo Clinic, Rochester, MN, 55905, USA.
| |
Collapse
|
23
|
Receptor residence time trumps drug-likeness and oral bioavailability in determining efficacy of complement C5a antagonists. Sci Rep 2016; 6:24575. [PMID: 27094554 PMCID: PMC4837355 DOI: 10.1038/srep24575] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 03/31/2016] [Indexed: 12/13/2022] Open
Abstract
Drug discovery and translation are normally based on optimizing efficacy by increasing receptor affinity, functional potency, drug-likeness (rule-of-five compliance) and oral bioavailability. Here we demonstrate that residence time of a compound on its receptor has an overriding influence on efficacy, exemplified for antagonists of inflammatory protein complement C5a that activates immune cells and promotes disease. Three equipotent antagonists (3D53, W54011, JJ47) of inflammatory responses to C5a (3 nM) were compared for drug-likeness, receptor affinity and antagonist potency in human macrophages, and anti-inflammatory efficacy in rats. Only the least drug-like antagonist (3D53) maintained potency in cells against higher C5a concentrations and had a much longer duration of action (t1/2 ~ 20 h) than W54011 or JJ47 (t1/2 ~ 1 -3 h) in inhibiting macrophage responses. The unusually long residence time of 3D53 on its receptor was mechanistically probed by molecular dynamics simulations, which revealed long-lasting interactions that trap the antagonist within the receptor. Despite negligible oral bioavailability, 3D53 was much more orally efficacious than W54011 or JJ47 in preventing repeated agonist insults to induce rat paw oedema over 24 h. Thus, residence time on a receptor can trump drug-likeness in determining efficacy, even oral efficacy, of pharmacological agents.
Collapse
|
24
|
Abuhammad A, Taha M. Innovative computer-aided methods for the discovery of new kinase ligands. Future Med Chem 2016; 8:509-526. [PMID: 27105126 DOI: 10.4155/fmc-2015-0003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 02/02/2016] [Indexed: 07/10/2024] Open
Abstract
Recent evidence points to significant roles played by protein kinases in cell signaling and cellular proliferation. Faulty protein kinases are involved in cancer, diabetes and chronic inflammation. Efforts are continuously carried out to discover new inhibitors for selected protein kinases. In this review, we discuss two new computer-aided methodologies we developed to mine virtual databases for new bioactive compounds. One method is ligand-based exploration of the pharmacophoric space of inhibitors of any particular biotarget followed by quantitative structure-activity relationship-based selection of the best pharmacophore(s). The second approach is structure-based assuming that potent ligands come into contact with binding site spots distinct from those contacted by weakly potent ligands. Both approaches yield pharmacophores useful as 3D search queries for the discovery of new bioactive (kinase) inhibitors.
Collapse
Affiliation(s)
- Areej Abuhammad
- Department of Pharmaceutical Sciences, Faculty of Pharmacy, The University of Jordan, 11942, Amman, Jordan
| | - Mutasem Taha
- Department of Pharmaceutical Sciences, Faculty of Pharmacy, The University of Jordan, 11942, Amman, Jordan
| |
Collapse
|
25
|
Abstract
Native proteins perform an amazing variety of biochemical functions, including enzymatic catalysis, and can engage in protein-protein and protein-DNA interactions that are essential for life. A key question is how special are these functional properties of proteins. Are they extremely rare, or are they an intrinsic feature? Comparison to the properties of compact conformations of artificially generated compact protein structures selected for thermodynamic stability but not any type of function, the artificial (ART) protein library, demonstrates that a remarkable number of the properties of native-like proteins are recapitulated. These include the complete set of small molecule ligand-binding pockets and most protein-protein interfaces. ART structures are predicted to be capable of weakly binding metabolites and cover a significant fraction of metabolic pathways, with the most enriched pathways including ancient ones such as glycolysis. Native-like active sites are also found in ART proteins. A small fraction of ART proteins are predicted to have strong protein-protein and protein-DNA interactions. Overall, it appears that biochemical function is an intrinsic feature of proteins which nature has significantly optimized during evolution. These studies raise questions as to the relative roles of specificity and promiscuity in the biochemical function and control of cells that need investigation.
Collapse
Affiliation(s)
- Jeffrey Skolnick
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Mu Gao
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
26
|
Sikdar S, Chakrabarti J, Ghosh M. Conformational thermodynamics guided structural reconstruction of biomolecular fragments. MOLECULAR BIOSYSTEMS 2016; 12:444-53. [DOI: 10.1039/c5mb00529a] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Conformational thermodynamics compares the modeling protocols to identify the conformation of the missing region leading to a suitable model for metal ion free (apo) skeletal muscle Troponin C.
Collapse
Affiliation(s)
- Samapan Sikdar
- Department of Chemical
- Biological and Macromolecular Sciences
- S. N. Bose National Centre for Basic Sciences
- Salt Lake
- India
| | - J. Chakrabarti
- Department of Chemical
- Biological and Macromolecular Sciences
- S. N. Bose National Centre for Basic Sciences
- Salt Lake
- India
| | - Mahua Ghosh
- Department of Chemical
- Biological and Macromolecular Sciences
- S. N. Bose National Centre for Basic Sciences
- Salt Lake
- India
| |
Collapse
|
27
|
Lee MS, Olson MA. Assessment of Detection and Refinement Strategies for de novo Protein Structures Using Force Field and Statistical Potentials. J Chem Theory Comput 2015; 3:312-24. [PMID: 26627174 DOI: 10.1021/ct600195f] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
De novo predictions of protein structures at high resolution are plagued by the problem of detecting the native conformation from false energy minima. In this work, we provide an assessment of various detection and refinement protocols on a small subset of the second-generation all-atom Rosetta decoy set (Tsai et al. Proteins 2003, 53, 76-87) using two potentials: the all-atom CHARMM PARAM22 force field combined with generalized Born/surface-area (GB-SA) implicit solvation and the DFIRE-AA statistical potential. Detection schemes included DFIRE-AA conformational scoring and energy minimization followed by scoring with both GB-SA and DFIRE-AA potentials. Refinement methods included short-time (1-ps) molecular dynamics simulations, temperature-based replica exchange molecular dynamics, and a new computational unfold/refold procedure. Refinement methods include temperature-based replica exchange molecular dynamics and a new computational unfold/refold procedure. Our results indicate that simple detection with only minimization is the best protocol for finding the most nativelike structures in the decoy set. The refinement techniques that we tested are generally unsuccessful in improving detection; however, they provide marginal improvements to some of the decoy structures. Future directions in the development of refinement techniques are discussed in the context of the limitations of the protocols evaluated in this study.
Collapse
Affiliation(s)
- Michael S Lee
- Computational and Information Sciences Directorate, U.S. Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, and Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702
| | - Mark A Olson
- Computational and Information Sciences Directorate, U.S. Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, and Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702
| |
Collapse
|
28
|
Perry SR, Xu W, Wirija A, Lim J, Yau MK, Stoermer MJ, Lucke AJ, Fairlie DP. Three Homology Models of PAR2 Derived from Different Templates: Application to Antagonist Discovery. J Chem Inf Model 2015; 55:1181-91. [PMID: 26000704 DOI: 10.1021/acs.jcim.5b00087] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Protease activated receptor 2 (PAR2) is an unusual G-protein coupled receptor (GPCR) involved in inflammation and metabolism. It is activated through cleavage of its N-terminus by proteases. The new N-terminus functions as a tethered ligand that folds back and intramolecularly activates PAR2, initiating multiple downstream signaling pathways. The only compounds reported to date to inhibit PAR2 activation are of moderate potency. Three structural models for PAR2 have been constructed based on sequence homology with known crystal structures for bovine rhodopsin, human ORL-1 (also called nociceptin/orphanin FQ receptor), and human PAR1. The three PAR2 model structures were compared and used to predict potential interactions with ligands. Virtual screening for ligands using the Chembridge database, and either ORL-1 or PAR1 derived PAR2 models led to identification of eight new small molecule PAR2 antagonists (IC50 10-100 μM). Notably, the most potent compound 1 (IC50 11 μM) was derived from the less homologous template protein, human ORL-1. The results suggest that virtual screening against multiple homology models of the same GPCR can produce structurally diverse antagonists and that this may be desirable even when some models have less sequence homology with the target protein.
Collapse
Affiliation(s)
- Samuel R Perry
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Weijun Xu
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Anna Wirija
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Junxian Lim
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Mei-Kwan Yau
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Martin J Stoermer
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Andrew J Lucke
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - David P Fairlie
- Division of Chemistry and Structural Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| |
Collapse
|
29
|
Tong J, Pei J, Otwinowski Z, Grishin NV. Refinement by shifting secondary structure elements improves sequence alignments. Proteins 2015; 83:411-27. [PMID: 25546158 DOI: 10.1002/prot.24746] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Revised: 11/25/2014] [Accepted: 12/10/2014] [Indexed: 01/09/2023]
Abstract
Constructing a model of a query protein based on its alignment to a homolog with experimentally determined spatial structure (the template) is still the most reliable approach to structure prediction. Alignment errors are the main bottleneck for homology modeling when the query is distantly related to the template. Alignment methods often misalign secondary structural elements by a few residues. Therefore, better alignment solutions can be found within a limited set of local shifts of secondary structures. We present a refinement method to improve pairwise sequence alignments by evaluating alignment variants generated by local shifts of template-defined secondary structures. Our method SFESA is based on a novel scoring function that combines the profile-based sequence score and the structure score derived from residue contacts in a template. Such a combined score frequently selects a better alignment variant among a set of candidate alignments generated by local shifts and leads to overall increase in alignment accuracy. Evaluation of several benchmarks shows that our refinement method significantly improves alignments made by automatic methods such as PROMALS, HHpred and CNFpred. The web server is available at http://prodata.swmed.edu/sfesa.
Collapse
Affiliation(s)
- Jing Tong
- Department of Biophysics, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, 75390; Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, 75390
| | | | | | | |
Collapse
|
30
|
Kabbani N, Nordman JC, Corgiat BA, Veltri DP, Shehu A, Seymour VA, Adams DJ. Are nicotinic acetylcholine receptors coupled to G proteins? Bioessays 2014; 35:1025-34. [PMID: 24185813 DOI: 10.1002/bies.201300082] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
It was, until recently, accepted that the two classes of acetylcholine (ACh) receptors are distinct in an important sense: muscarinic ACh receptors signal via heterotrimeric GTP binding proteins (G proteins), whereas nicotinic ACh receptors (nAChRs) open to allow flux of Na+, Ca2+, and K+ ions into the cell after activation. Here we present evidence of direct coupling between G proteins and nAChRs in neurons. Based on proteomic, biophysical, and functional evidence, we hypothesize that binding to G proteins modulates the activity and signaling of nAChRs in cells. It is important to note that while this hypothesis is new for the nAChR, it is consistent with known interactions between G proteins and structurally related ligand-gated ion channels. Therefore, it underscores an evolutionarily conserved metabotropic mechanism of G protein signaling via nAChR channels.
Collapse
|
31
|
Tang K, Zhang J, Liang J. Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method. PLoS Comput Biol 2014; 10:e1003539. [PMID: 24763317 PMCID: PMC3998890 DOI: 10.1371/journal.pcbi.1003539] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Accepted: 02/01/2014] [Indexed: 11/18/2022] Open
Abstract
Loops in proteins are flexible regions connecting regular secondary structures. They are often involved in protein functions through interacting with other molecules. The irregularity and flexibility of loops make their structures difficult to determine experimentally and challenging to model computationally. Conformation sampling and energy evaluation are the two key components in loop modeling. We have developed a new method for loop conformation sampling and prediction based on a chain growth sequential Monte Carlo sampling strategy, called Distance-guided Sequential chain-Growth Monte Carlo (DISGRO). With an energy function designed specifically for loops, our method can efficiently generate high quality loop conformations with low energy that are enriched with near-native loop structures. The average minimum global backbone RMSD for 1,000 conformations of 12-residue loops is 1:53 A° , with a lowest energy RMSD of 2:99 A° , and an average ensembleRMSD of 5:23 A° . A novel geometric criterion is applied to speed up calculations. The computational cost of generating 1,000 conformations for each of the x loops in a benchmark dataset is only about 10 cpu minutes for 12-residue loops, compared to ca 180 cpu minutes using the FALCm method. Test results on benchmark datasets show that DISGRO performs comparably or better than previous successful methods, while requiring far less computing time. DISGRO is especially effective in modeling longer loops (10-17 residues).
Collapse
Affiliation(s)
- Ke Tang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
- * E-mail: (JZ); (JL)
| | - Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois, United States of America
- * E-mail: (JZ); (JL)
| |
Collapse
|
32
|
Liang S, Zhang C, Zhou Y. LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J Comput Chem 2014; 35:335-41. [PMID: 24327406 PMCID: PMC4125323 DOI: 10.1002/jcc.23509] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Revised: 10/06/2013] [Accepted: 11/24/2013] [Indexed: 11/11/2022]
Abstract
Prediction of protein loop conformations without any prior knowledge (ab initio prediction) is an unsolved problem. Its solution will significantly impact protein homology and template-based modeling as well as ab initio protein-structure prediction. Here, we developed a coarse-grained, optimized scoring function for initial sampling and ranking of loop decoys. The resulting decoys are then further optimized in backbone and side-chain conformations and ranked by all-atom energy scoring functions. The final integrated technique called loop prediction by energy-assisted protocol achieved a median value of 2.1 Å root mean square deviation (RMSD) for 325 12-residue test loops and 2.0 Å RMSD for 45 12-residue loops from critical assessment of structure-prediction techniques (CASP) 10 target proteins with native core structures (backbone and side chains). If all side-chain conformations in protein cores were predicted in the absence of the target loop, loop-prediction accuracy only reduces slightly (0.2 Å difference in RMSD for 12-residue loops in the CASP target proteins). The accuracy obtained is about 1 Å RMSD or more improvement over other methods we tested. The executable file for a Linux system is freely available for academic users at http://sparks-lab.org.
Collapse
Affiliation(s)
- Shide Liang
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Chi Zhang
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, 68588, USA
| | - Yaoqi Zhou
- School of Informatics, Indiana University Purdue University at Indianapolis, Indianapolis, IN 46202, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
- Institute for Glycomics and School of Informatics and Communication Technology, Griffith University, Parklands Drive, Southport Qld 4222, Australia
| |
Collapse
|
33
|
Holtby D, Li SC, Li M. LoopWeaver: loop modeling by the weighted scaling of verified proteins. J Comput Biol 2014; 20:212-23. [PMID: 23461572 DOI: 10.1089/cmb.2012.0078] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Modeling loops is a necessary step in protein structure determination, even with experimental nuclear magnetic resonance (NMR) data, it is widely known to be difficult. Database techniques have the advantage of producing a higher proportion of predictions with subangstrom accuracy when compared with ab initio techniques, but the disadvantage of also producing a higher proportion of clashing or highly inaccurate predictions. We introduce LoopWeaver, a database method that uses multidimensional scaling to achieve better, clash-free placement of loops obtained from a database of protein structures. This allows us to maintain the above-mentioned advantage while avoiding the disadvantage. Test results show that we achieve significantly better results than all other methods, including Modeler, Loopy, SuperLooper, and Rapper, before refinement. With refinement, our results (LoopWeaver and Loopy consensus) are better than ROSETTA, with 0.42 Å RMSD on average for 206 length 6 loops, 0.64 Å local RMSD for 168 length 7 loops, 0.81Å RMSD for 117 length 8 loops, and 0.98 Å RMSD for length 9 loops, while ROSETTA has 0.55, 0.79, 1.16, 1.42, respectively, at the same average time limit (3 hours). When we allow ROSETTA to run for over a week, it approaches, but does not surpass, our accuracy.
Collapse
Affiliation(s)
- Daniel Holtby
- David R. Chariton School of Computer Science, University of Waterloo, Waterloo, Canada.
| | | | | |
Collapse
|
34
|
Dong GQ, Fan H, Schneidman-Duhovny D, Webb B, Sali A. Optimized atomic statistical potentials: assessment of protein interfaces and loops. Bioinformatics 2013; 29:3158-66. [PMID: 24078704 PMCID: PMC3842762 DOI: 10.1093/bioinformatics/btt560] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Revised: 08/13/2013] [Accepted: 09/22/2013] [Indexed: 01/16/2023] Open
Abstract
MOTIVATION Statistical potentials have been widely used for modeling whole proteins and their parts (e.g. sidechains and loops) as well as interactions between proteins, nucleic acids and small molecules. Here, we formulate the statistical potentials entirely within a statistical framework, avoiding questionable statistical mechanical assumptions and approximations, including a definition of the reference state. RESULTS We derive a general Bayesian framework for inferring statistically optimized atomic potentials (SOAP) in which the reference state is replaced with data-driven 'recovery' functions. Moreover, we restrain the relative orientation between two covalent bonds instead of a simple distance between two atoms, in an effort to capture orientation-dependent interactions such as hydrogen bonds. To demonstrate this general approach, we computed statistical potentials for protein-protein docking (SOAP-PP) and loop modeling (SOAP-Loop). For docking, a near-native model is within the top 10 scoring models in 40% of the PatchDock benchmark cases, compared with 23 and 27% for the state-of-the-art ZDOCK and FireDock scoring functions, respectively. Similarly, for modeling 12-residue loops in the PLOP benchmark, the average main-chain root mean square deviation of the best scored conformations by SOAP-Loop is 1.5 Å, close to the average root mean square deviation of the best sampled conformations (1.2 Å) and significantly better than that selected by Rosetta (2.1 Å), DFIRE (2.3 Å), DOPE (2.5 Å) and PLOP scoring functions (3.0 Å). Our Bayesian framework may also result in more accurate statistical potentials for additional modeling applications, thus affording better leverage of the experimentally determined protein structures. AVAILABILITY AND IMPLEMENTATION SOAP-PP and SOAP-Loop are available as part of MODELLER (http://salilab.org/modeller).
Collapse
Affiliation(s)
- Guang Qiang Dong
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, CA 94158, USA
| | | | | | | | | |
Collapse
|
35
|
Pieper U, Webb BM, Dong GQ, Schneidman-Duhovny D, Fan H, Kim SJ, Khuri N, Spill YG, Weinkam P, Hammel M, Tainer JA, Nilges M, Sali A. ModBase, a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res 2013; 42:D336-46. [PMID: 24271400 PMCID: PMC3965011 DOI: 10.1093/nar/gkt1144] [Citation(s) in RCA: 219] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains almost 30 million reliable models for domains in 4.7 million unique protein sequences. ModBase allows users to compute or update comparative models on demand, through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the AllosMod server for modeling ligand-induced protein dynamics (http://salilab.org/allosmod), the AllosMod-FoXS server for predicting a structural ensemble that fits an SAXS profile (http://salilab.org/allosmod-foxs), the FoXSDock server for protein–protein docking filtered by an SAXS profile (http://salilab.org/foxsdock), the SAXS Merge server for automatic merging of SAXS profiles (http://salilab.org/saxsmerge) and the Pose & Rank server for scoring protein–ligand complexes (http://salilab.org/poseandrank). In this update, we also highlight two applications of ModBase: a PSI:Biology initiative to maximize the structural coverage of the human alpha-helical transmembrane proteome and a determination of structural determinants of human immunodeficiency virus-1 protease specificity.
Collapse
Affiliation(s)
- Ursula Pieper
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, Office 503B, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, Office 503B, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA, Graduate Group in Biophysics, University of California at San Francisco, CA 94158, USA, Structural Bioinformatics Unit, Structural Biology and Chemistry department, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, France, Université Paris Diderot-Paris 7, école doctorale iViv, Paris Rive Gauche, 5 rue Thomas Mann, 75013 Paris, France, Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Department of Molecular Biology, Skaggs Institute of Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA, Life Sciences Division, Department of Molecular Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Khoury GA, Tamamis P, Pinnaduwage N, Smadbeck J, Kieslich CA, Floudas CA. Princeton_TIGRESS: protein geometry refinement using simulations and support vector machines. Proteins 2013; 82:794-814. [PMID: 24174311 DOI: 10.1002/prot.24459] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Revised: 10/18/2013] [Accepted: 10/22/2013] [Indexed: 12/30/2022]
Abstract
Protein structure refinement aims to perform a set of operations given a predicted structure to improve model quality and accuracy with respect to the native in a blind fashion. Despite the numerous computational approaches to the protein refinement problem reported in the previous three CASPs, an overwhelming majority of methods degrade models rather than improve them. We initially developed a method tested using blind predictions during CASP10 which was officially ranked in 5th place among all methods in the refinement category. Here, we present Princeton_TIGRESS, which when benchmarked on all CASP 7,8,9, and 10 refinement targets, simultaneously increased GDT_TS 76% of the time with an average improvement of 0.83 GDT_TS points per structure. The method was additionally benchmarked on models produced by top performing three-dimensional structure prediction servers during CASP10. The robustness of the Princeton_TIGRESS protocol was also tested for different random seeds. We make the Princeton_TIGRESS refinement protocol freely available as a web server at http://atlas.princeton.edu/refinement. Using this protocol, one can consistently refine a prediction to help bridge the gap between a predicted structure and the actual native structure.
Collapse
Affiliation(s)
- George A Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, 08540
| | | | | | | | | | | |
Collapse
|
37
|
Mutation induced structural variation in membrane proteins. Chem Res Chin Univ 2013. [DOI: 10.1007/s40242-013-2427-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
38
|
Han KQ, Wu G, Lv F. Development of QSAR-Improved Statistical Potential for the Structure-Based Analysis of ProteinPeptide Binding Affinities. Mol Inform 2013; 32:783-92. [DOI: 10.1002/minf.201300064] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 06/21/2013] [Indexed: 12/21/2022]
|
39
|
Pawlowski M, Bogdanowicz A, Bujnicki JM. QA-RecombineIt: a server for quality assessment and recombination of protein models. Nucleic Acids Res 2013; 41:W389-97. [PMID: 23700309 PMCID: PMC3692112 DOI: 10.1093/nar/gkt408] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
QA-RecombineIt provides a web interface to assess the quality of protein 3D structure models and to improve the accuracy of models by merging fragments of multiple input models. QA-RecombineIt has been developed for protein modelers who are working on difficult problems, have a set of different homology models and/or de novo models (from methods such as I-TASSER or ROSETTA) and would like to obtain one consensus model that incorporates the best parts into one structure that is internally coherent. An advanced mode is also available, in which one can modify the operation of the fragment recombination algorithm by manually identifying individual fragments or entire models to recombine. Our method produces up to 100 models that are expected to be on the average more accurate than the starting models. Therefore, our server may be useful for crystallographic protein structure determination, where protein models are used for Molecular Replacement to solve the phase problem. To address the latter possibility, a special feature was added to the QA-RecombineIt server. The QA-RecombineIt server can be freely accessed at http://iimcb.genesilico.pl/qarecombineit/.
Collapse
Affiliation(s)
- Marcin Pawlowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, Warsaw PL-02-109, Poland.
| | | | | |
Collapse
|
40
|
Chys P, Chacón P. Random Coordinate Descent with Spinor-matrices and Geometric Filters for Efficient Loop Closure. J Chem Theory Comput 2013; 9:1821-9. [PMID: 26587638 DOI: 10.1021/ct300977f] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Protein loop closure constitutes a critical step in loop and protein modeling whereby geometrically feasible loops must be found between two given anchor residues. Here, a new analytic/iterative algorithm denoted random coordinate descent (RCD) to perform protein loop closure is described. The algorithm solves loop closure through minimization as in cyclic coordinate descent but selects bonds for optimization randomly, updates loop conformations by spinor-matrices, performs loop closure in both chain directions, and uses a set of geometric filters to yield efficient conformational sampling. Geometric filters allow one to detect clashes and constrain dihedral angles on the fly. The RCD algorithm is at least comparable to state of the art loop closure algorithms due to an excellent balance between efficiency and intrinsic sampling capability. Furthermore, its efficiency allows one to improve conformational sampling by increasing the sampling number without much penalty. Overall, RCD turns out to be accurate, fast, robust, and applicable over a wide range of loop lengths. Because of the versatility of RCD, it is a solid alternative for integration with current loop modeling strategies.
Collapse
Affiliation(s)
- Pieter Chys
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Cientı́ficas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
| | - Pablo Chacón
- Structural Bioinformatics Group, Biological Chemical Physics Department, Institute of Physical Chemistry Rocasolano (IQFR), Consejo Superior de Investigaciones Cientı́ficas (CSIC), Calle de Serrano 119, Madrid 28006, Spain
| |
Collapse
|
41
|
Mirjalili V, Feig M. Protein Structure Refinement through Structure Selection and Averaging from Molecular Dynamics Ensembles. J Chem Theory Comput 2013; 9:1294-1303. [PMID: 23526422 PMCID: PMC3603382 DOI: 10.1021/ct300962x] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
A molecular dynamics (MD) simulation based protocol for structure refinement of template-based model predictions is described. The protocol involves the application of restraints, ensemble averaging of selected subsets, interpolation between initial and refined structures, and assessment of refinement success. It is found that sub-microsecond MD-based sampling when combined with ensemble averaging can produce moderate but consistent refinement for most systems in the CASP targets considered here.
Collapse
Affiliation(s)
- Vahid Mirjalili
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824; USA
- Department of Mechanical Engineering, Michigan State University, East Lansing, MI 48824; USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824; USA
- Department of Chemistry, Michigan State University, East Lansing, MI 48824; USA
| |
Collapse
|
42
|
Abstract
A major multidrug transporter, MDR1 (multidrug resistance 1), a member of the MFS (major facilitator superfamily), invariably contributes to an increased efflux of commonly used azoles and thus corroborates their direct involvement in MDR in Candida albicans. The Mdr1 protein has two transmembrane domains, each comprising six transmembrane helices, interconnected with extracellular loops and ICLs (intracellular loops). The introduction of deletions and insertions through mutagenesis was used to address the role of the largest interdomain ICL3 of the MDR1 protein. Most of the progressive deletants, when overexpressed, eliminated the drug resistance. Notably, restoration of the length of the ICL3 by insertional mutagenesis did not restore the functionality of the protein. Interestingly, most of the insertion and deletion variants of ICL3 became amenable to trypsinization, yielding peptide fragments. The homology model of the Mdr1 protein showed that the molecular surface-charge distribution was perturbed in most of the ICL3 mutant variants. Taken together, these results provide the first evidence that the CCL (central cytoplasmic loop) of the fungal MFS transporter of the DHA1 (drug/proton antiporter) family is critical for the function of MDR. Unlike other homologous proteins, ICL3 has no apparent role in imparting substrate specificity or in the recruitment of the transporter protein.
Collapse
|
43
|
St-Pierre JF, Mousseau N. Large loop conformation sampling using the activation relaxation technique, ART-nouveau method. Proteins 2012; 80:1883-94. [PMID: 22488731 DOI: 10.1002/prot.24085] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Revised: 03/19/2011] [Accepted: 03/30/2012] [Indexed: 12/25/2022]
Abstract
We present an adaptation of the ART-nouveau energy surface sampling method to the problem of loop structure prediction. This method, previously used to study protein folding pathways and peptide aggregation, is well suited to the problem of sampling the conformation space of large loops by targeting probable folding pathways instead of sampling exhaustively that space. The number of sampled conformations needed by ART nouveau to find the global energy minimum for a loop was found to scale linearly with the sequence length of the loop for loops between 8 and about 20 amino acids. Considering the linear scaling dependence of the computation cost on the loop sequence length for sampling new conformations, we estimate the total computational cost of sampling larger loops to scale quadratically compared to the exponential scaling of exhaustive search methods.
Collapse
Affiliation(s)
- Jean-François St-Pierre
- Département de Physique and Regroupement Québécois sur les Matériaux de Pointe, Université de Montréal, CP 6128, Succursale Centre-Ville, Montréal, Québec, Canada H3C 3J7
| | | |
Collapse
|
44
|
Liang S, Zhang C, Sarmiento J, Standley DM. Protein Loop Modeling with Optimized Backbone Potential Functions. J Chem Theory Comput 2012; 8:1820-7. [PMID: 26593673 DOI: 10.1021/ct300131p] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We represented protein backbone potential as a Fourier series. The parameters of the backbone dihedral potential were initialized to random values and optimized by Monte Carlo simulations so that generated native-like loop decoys had a lower energy than non-native decoys. The low energy regions of the optimized backbone potential were consistent with observed Ramachandran plots derived from crystal structures. The backbone potential was then used for the prediction of loop conformations (OSCAR-loop) combining with the previously described OSCAR force field, which has been shown to be very accurate in side chain modeling. As a result, the accuracy of OSCAR-loop was improved by local energy minimization based on the complete force field. The average accuracies were 0.40, 0.70, 1.10, 2.08, and 3.58 Å for 4, 6, 8, 10, and 12-residue loops, respectively, with each size being represented by 325 to 2809 targets. The accuracy was better than that of other loop modeling algorithms for short loops (<10 residues). For longer loops, the prediction accuracy was improved by concurrently sampling with a fragment-based method, Spanner. OSCAR-loop is available for download at http://sysimm.ifrec.osaka-u.ac.jp/OSCAR/ .
Collapse
Affiliation(s)
- Shide Liang
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| | - Chi Zhang
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska , Lincoln, Nebraska 68588, United States
| | - Jamica Sarmiento
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| | - Daron M Standley
- Systems Immunology Lab, Immunology Frontier Research Center, Osaka University , Suita, Osaka, 565-0871, Japan
| |
Collapse
|
45
|
|
46
|
Zhou H, Skolnick J. GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. Biophys J 2012; 101:2043-52. [PMID: 22004759 DOI: 10.1016/j.bpj.2011.09.012] [Citation(s) in RCA: 201] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Revised: 09/07/2011] [Accepted: 09/09/2011] [Indexed: 12/18/2022] Open
Abstract
An accurate scoring function is a key component for successful protein structure prediction. To address this important unsolved problem, we develop a generalized orientation and distance-dependent all-atom statistical potential. The new statistical potential, generalized orientation-dependent all-atom potential (GOAP), depends on the relative orientation of the planes associated with each heavy atom in interacting pairs. GOAP is a generalization of previous orientation-dependent potentials that consider only representative atoms or blocks of side-chain or polar atoms. GOAP is decomposed into distance- and angle-dependent contributions. The DFIRE distance-scaled finite ideal gas reference state is employed for the distance-dependent component of GOAP. GOAP was tested on 11 commonly used decoy sets containing 278 targets, and recognized 226 native structures as best from the decoys, whereas DFIRE recognized 127 targets. The major improvement comes from decoy sets that have homology-modeled structures that are close to native (all within ∼4.0 Å) or from the ROSETTA ab initio decoy set. For these two kinds of decoys, orientation-independent DFIRE or only side-chain orientation-dependent RWplus performed poorly. Although the OPUS-PSP block-based orientation-dependent, side-chain atom contact potential performs much better (recognizing 196 targets) than DFIRE, RWplus, and dDFIRE, it is still ∼15% worse than GOAP. Thus, GOAP is a promising advance in knowledge-based, all-atom statistical potentials. GOAP is available for download at http://cssb.biology.gatech.edu/GOAP.
Collapse
Affiliation(s)
- Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, USA
| | | |
Collapse
|
47
|
Adhikari AN, Peng J, Wilde M, Xu J, Freed KF, Sosnick TR. Modeling large regions in proteins: applications to loops, termini, and folding. Protein Sci 2012; 21:107-21. [PMID: 22095743 PMCID: PMC3323786 DOI: 10.1002/pro.767] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Revised: 11/02/2011] [Accepted: 11/06/2011] [Indexed: 11/10/2022]
Abstract
Template-based methods for predicting protein structure provide models for a significant portion of the protein but often contain insertions or chain ends (InsEnds) of indeterminate conformation. The local structure prediction "problem" entails modeling the InsEnds onto the rest of the protein. A well-known limit involves predicting loops of ≤12 residues in crystal structures. However, InsEnds may contain as many as ~50 amino acids, and the template-based model of the protein itself may be imperfect. To address these challenges, we present a free modeling method for predicting the local structure of loops and large InsEnds in both crystal structures and template-based models. The approach uses single amino acid torsional angle "pivot" moves of the protein backbone with a C(β) level representation. Nevertheless, our accuracy for loops is comparable to existing methods. We also apply a more stringent test, the blind structure prediction and refinement categories of the CASP9 tournament, where we improve the quality of several homology based models by modeling InsEnds as long as 45 amino acids, sizes generally inaccessible to existing loop prediction methods. Our approach ranks as one of the best in the CASP9 refinement category that involves improving template-based models so that they can function as molecular replacement models to solve the phase problem for crystallographic structure determination.
Collapse
Affiliation(s)
- Aashish N Adhikari
- Department of Chemistry, The University of ChicagoChicago, Illinois 60637
- The James Franck Institute, The University of ChicagoChicago, Illinois 60637
| | - Jian Peng
- Toyota Technological Institute at ChicagoChicago, Illinois 60637
| | - Michael Wilde
- Department of Biochemistry and Molecular Biology, The University of ChicagoChicago, Illinois 60637
| | - Jinbo Xu
- Toyota Technological Institute at ChicagoChicago, Illinois 60637
| | - Karl F Freed
- Department of Chemistry, The University of ChicagoChicago, Illinois 60637
- The James Franck Institute, The University of ChicagoChicago, Illinois 60637
- Computation Institute, The University of Chicago and Argonne National LaboratoryChicago, Illinois 60637
| | - Tobin R Sosnick
- Computation Institute, The University of Chicago and Argonne National LaboratoryChicago, Illinois 60637
- Department of Biochemistry and Molecular Biology, The University of ChicagoChicago, Illinois 60637
- Institute for Biophysical Dynamics, The University of ChicagoChicago, Illinois 60637
| |
Collapse
|
48
|
Li H, Zhou Y. FOLD HELICAL PROTEINS BY ENERGY MINIMIZATION IN DIHEDRAL SPACE AND A DFIRE-BASED STATISTICAL ENERGY FUNCTION. J Bioinform Comput Biol 2011; 3:1151-70. [PMID: 16278952 DOI: 10.1142/s0219720005001430] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2004] [Revised: 04/12/2005] [Accepted: 04/21/2005] [Indexed: 11/18/2022]
Abstract
Statistical energy functions are discrete (or stepwise) energy functions that lack van der Waals repulsion. As a result, they are often applied directly to a given structure (native or decoy) without further energy minimization being performed to the structure. However, the full benefit (or hidden defect) of an energy function cannot be revealed without energy minimization. This paper tests a recently developed, all-atom statistical energy function by energy minimization with a fixed secondary helical structure in dihedral space. This is accomplished by combining the statistical energy function based on a distance-scaled finite ideal-gas reference (DFIRE) state with a simple repulsive interaction and an improper torsion energy function. The energy function was used to minimize 2000 random initial structures of 41 small and medium-sized helical proteins in a dihedral space with a fixed helical region. Results indicate that near-native structures for most studied proteins can be obtained by minimization alone. The average minimum root-mean-squared distance (rmsd) from the native structure for all 41 proteins is 4.1 Å. The energy function (together with a simple clustering of similar structures) also makes a reasonable selection of near-native structures from minimized structures. The average rmsd value and the average rank for the best structure in the top five is 6.8 Å and 2.4, respectively. The accuracy of the structures sampled and the structure selections can be improved significantly with the removal of flexible terminal regions in rmsd calculations and in minimization and with the increase in the number of minimizations. The minimized structures form an excellent decoy set for testing other energy functions because most structures are well-packed with minimum hard-core overlaps with correct hydrophobic/hydrophilic partitioning. They are available online at .
Collapse
Affiliation(s)
- Hongzhi Li
- Howard Hughes Medical Institute Center for Single Molecule Biophysics, Department of Physiology & Biophysics, State University of New York at Buffalo, 124 Sherman Hall, Buffalo, New York 14214, USA.
| | | |
Collapse
|
49
|
Danielson ML, Lill MA. Predicting flexible loop regions that interact with ligands: the challenge of accurate scoring. Proteins 2011; 80:246-60. [PMID: 22072600 DOI: 10.1002/prot.23199] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2011] [Revised: 09/06/2011] [Accepted: 09/13/2011] [Indexed: 01/12/2023]
Abstract
Flexible loop regions play a critical role in the biological function of many proteins and have been shown to be involved in ligand binding. In the context of structure-based drug design, using or predicting an incorrect loop configuration can be detrimental to the study if the loop is capable of interacting with the ligand. Three protein systems, each with at least one flexible loop region in close proximity to the known binding site, were selected for loop prediction using the CorLps program; a six residue loop region from phosphoribosylglycinamide formyltransferase (GART), two nine residue loop regions from cytochrome P450 (CYP) 119, and an 11 residue loop region from enolase were selected for loop prediction. The results of this study indicate that the statistically based DFIRE scoring function implemented in the CorLps program did not accurately rank native-like predicted loop configurations in any protein system. In an attempt to improve the ranking of the native-like predicted loop configurations, the MM/GBSA and the optimized MM/GBSA-dsr scoring functions were used to re-rank the predicted loops with and without bound ligand. In general, single snapshot MM/GBSA scoring provided the best ranking of native-like loop configurations. Based on the scoring function analyses presented, the optimal ranking of native-like loop configurations is still a difficult challenge and the choice of the "best" scoring function appears to be system dependent.
Collapse
Affiliation(s)
- Matthew L Danielson
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
50
|
Shapovalov MV, Dunbrack RL. A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 2011; 19:844-58. [PMID: 21645855 DOI: 10.1016/j.str.2011.03.019] [Citation(s) in RCA: 636] [Impact Index Per Article: 48.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2010] [Revised: 03/19/2011] [Accepted: 03/22/2011] [Indexed: 11/15/2022]
Abstract
Rotamer libraries are used in protein structure determination, prediction, and design. The backbone-dependent rotamer library consists of rotamer frequencies, mean dihedral angles, and variances as a function of the backbone dihedral angles. Structure prediction and design methods that employ backbone flexibility would strongly benefit from smoothly varying probabilities and angles. A new version of the backbone-dependent rotamer library has been developed using adaptive kernel density estimates for the rotamer frequencies and adaptive kernel regression for the mean dihedral angles and variances. This formulation allows for evaluation of the rotamer probabilities, mean angles, and variances as a smooth and continuous function of phi and psi. Continuous probability density estimates for the nonrotameric degrees of freedom of amides, carboxylates, and aromatic side chains have been modeled as a function of the backbone dihedrals and rotamers of the remaining degrees of freedom. New backbone-dependent rotamer libraries at varying levels of smoothing are available from http://dunbrack.fccc.edu.
Collapse
Affiliation(s)
- Maxim V Shapovalov
- Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA
| | | |
Collapse
|