1
|
Liang S, Zhang C, Zhu M. Ab Initio Prediction of 3-D Conformations for Protein Long Loops with High Accuracy and Applications to Antibody CDRH3 Modeling. J Chem Inf Model 2023; 63:7568-7577. [PMID: 38018130 DOI: 10.1021/acs.jcim.3c01051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
Residue-level potentials of mean force were widely used for protein backbone refinements to avoid simultaneous sampling of side-chain conformations. The interaction energy between the reduced side chains and backbone atoms was not considered explicitly. In this study, we developed novel methods to calculate the residue-atom interaction energy in combination with atomic and residue-level terms. The parameters were optimized step by step to remove the overcounting or overlap problem between different energy terms. The mixing energy functions were then used to evaluate the generated backbone conformations at the initial sampling stage of protein loop modeling (OSCAR-loop), including the interaction energy between the reduced loop residues and full atoms of the protein framework. The accuracies of top-ranked decoys were 1.18 and 2.81 Å for 8-residue and 12-residue loops, respectively. We then selected diverse decoys for side-chain modeling, backbone refinement, and energy minimization. The procedure was repeated multiple times to select one prediction with the lowest energy. Consequently, we obtained an accuracy of 0.74 Å for a prevailing test set of 12-residue loops, compared with >1.4 Å reported by other researchers. The OSCAR-loop was also effective for modeling the H3 loops of antibody complementary determining regions (CDRs) in the crystal environment. The prediction accuracy of OSCAR-loop (1.74 Å) was better than the accuracy of the Rosetta NGK method (3.11 Å) or those achieved by deep learning methods (>2.2 Å) for the CDRH3 loops of 49 targets in the Rosetta antibody benchmark. The performance of OSCAR-loop in a model environment was also discussed.
Collapse
Affiliation(s)
- Shide Liang
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
- Department of Research and Development, Bio-Thera Solutions, Guangzhou 510530, P. R. China
| | - Chi Zhang
- School of Biological Sciences, University of Nebraska, Lincoln, Nebraska 68588, United States
| | - Mingfu Zhu
- Department of Computational Biology, 20n Bio Limited, Hangzhou 310018, P. R. China
| |
Collapse
|
2
|
Abstract
Membrane transporter proteins are divided into channels/pores and carriers and constitute protein families of physiological and pharmacological importance. Several presently used therapeutic compounds elucidate their effects by targeting membrane transporter proteins, including anti-arrhythmic, anesthetic, antidepressant, anxiolytic and diuretic drugs. The lack of three-dimensional structures of human transporters hampers experimental studies and drug discovery. In this chapter, the use of homology modeling for generating structural models of membrane transporter proteins is reviewed. The increasing number of atomic resolution structures available as templates, together with improvements in methods and algorithms for sequence alignments, secondary structure predictions, and model generation, in addition to the increase in computational power have increased the applicability of homology modeling for generating structural models of transporter proteins. Different pitfalls and hints for template selection, multiple-sequence alignments, generation and optimization, validation of the models, and the use of transporter homology models for structure-based virtual ligand screening are discussed.
Collapse
Affiliation(s)
- Ingebrigt Sylte
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway.
| | - Mari Gabrielsen
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Kurt Kristiansen
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
3
|
Wong SWK, Liu Z. Conformational variability of loops in the SARS-CoV-2 spike protein. Proteins 2021; 90:691-703. [PMID: 34661307 PMCID: PMC8662175 DOI: 10.1002/prot.26266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 10/05/2021] [Accepted: 10/12/2021] [Indexed: 11/07/2022]
Abstract
The SARS‐CoV‐2 spike (S) protein facilitates viral infection, and has been the focus of many structure determination efforts. Its flexible loop regions are known to be involved in protein binding and may adopt multiple conformations. This article identifies the S protein loops and studies their conformational variability based on the available Protein Data Bank structures. While most loops had essentially one stable conformation, 17 of 44 loop regions were observed to be structurally variable with multiple substantively distinct conformations based on a cluster analysis. Loop modeling methods were then applied to the S protein loop targets, and the prediction accuracies discussed in relation to the characteristics of the conformational clusters identified. Loops with multiple conformations were found to be challenging to model based on a single structural template.
Collapse
Affiliation(s)
- Samuel W. K. Wong
- Department of Statistics and Actuarial ScienceUniversity of WaterlooWaterlooCanada
| | - Zongjun Liu
- Department of Statistics and Actuarial ScienceUniversity of WaterlooWaterlooCanada
| |
Collapse
|
4
|
Mayer-Bacon C, Agboha N, Muscalli M, Freeland S. Evolution as a Guide to Designing xeno Amino Acid Alphabets. Int J Mol Sci 2021; 22:ijms22062787. [PMID: 33801827 PMCID: PMC8000707 DOI: 10.3390/ijms22062787] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 03/01/2021] [Accepted: 03/05/2021] [Indexed: 02/02/2023] Open
Abstract
Here, we summarize a line of remarkably simple, theoretical research to better understand the chemical logic by which life’s standard alphabet of 20 genetically encoded amino acids evolved. The connection to the theme of this Special Issue, “Protein Structure Analysis and Prediction with Statistical Scoring Functions”, emerges from the ways in which current bioinformatics currently lacks empirical science when it comes to xenoproteins composed largely or entirely of amino acids from beyond the standard genetic code. Our intent is to present new perspectives on existing data from two different frontiers in order to suggest fresh ways in which their findings complement one another. These frontiers are origins/astrobiology research into the emergence of the standard amino acid alphabet, and empirical xenoprotein synthesis.
Collapse
Affiliation(s)
- Christopher Mayer-Bacon
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA; (C.M.-B.); (N.A.)
| | - Neyiasuo Agboha
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA; (C.M.-B.); (N.A.)
| | - Mickey Muscalli
- Individualized Study Program, University of Maryland, Baltimore County, Baltimore, MD 21250, USA;
| | - Stephen Freeland
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA; (C.M.-B.); (N.A.)
- Individualized Study Program, University of Maryland, Baltimore County, Baltimore, MD 21250, USA;
- Correspondence:
| |
Collapse
|
5
|
Karami Y, Rey J, Postic G, Murail S, Tufféry P, de Vries SJ. DaReUS-Loop: a web server to model multiple loops in homology models. Nucleic Acids Res 2020; 47:W423-W428. [PMID: 31114872 PMCID: PMC6602439 DOI: 10.1093/nar/gkz403] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Revised: 04/20/2019] [Accepted: 05/06/2019] [Indexed: 02/07/2023] Open
Abstract
Loop regions in protein structures often have crucial roles, and they are much more variable in sequence and structure than other regions. In homology modeling, this leads to larger deviations from the homologous templates, and loop modeling of homology models remains an open problem. To address this issue, we have previously developed the DaReUS-Loop protocol, leading to significant improvement over existing methods. Here, a DaReUS-Loop web server is presented, providing an automated platform for modeling or remodeling loops in the context of homology models. This is the first web server accepting a protein with up to 20 loop regions, and modeling them all in parallel. It also provides a prediction confidence level that corresponds to the expected accuracy of the loops. DaReUS-Loop facilitates the analysis of the results through its interactive graphical interface and is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/DaReUS-Loop/.
Collapse
Affiliation(s)
- Yasaman Karami
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Julien Rey
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Guillaume Postic
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France.,Institut Français de Bioinformatique (IFB), UMS 3601-CNRS, Université Paris-Saclay, Orsay, France
| | - Samuel Murail
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France
| | - Pierre Tufféry
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Sjoerd J de Vries
- Sorbonne Paris Cité, Université Paris Diderot, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| |
Collapse
|
6
|
Pang WC, Ramli ANM, Hamid AAA. Comparative modelling studies of fruit bromelain using molecular dynamics simulation. J Mol Model 2020; 26:142. [PMID: 32417971 DOI: 10.1007/s00894-020-04398-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/28/2020] [Indexed: 12/25/2022]
Abstract
Fruit bromelain is a cysteine protease accumulated in pineapple fruits. This proteolytic enzyme has received high demand for industrial and therapeutic applications. In this study, fruit bromelain sequences QIM61759, QIM61760 and QIM61761 were retrieved from the National Center for Biotechnology Information (NCBI) Genbank Database. The tertiary structure of fruit bromelain QIM61759, QIM61760 and QIM61761 was generated by using MODELLER. The result revealed that the local stereochemical quality of the generated models was improved by using multiple templates during modelling process. Moreover, by comparing with the available papain model, structural analysis provides an insight on how pro-peptide functions as a scaffold in fruit bromelain folding and contributing to inactivation of mature protein. The structural analysis also disclosed the similarities and differences between these models. Lastly, thermal stability of fruit bromelain was studied. Molecular dynamics simulation of fruit bromelain structures at several selected temperatures demonstrated how fruit bromelain responds to elevation of temperature.
Collapse
Affiliation(s)
- Wei Cheng Pang
- Faculty of Industrial Science & Technology, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia
| | - Aizi Nor Mazila Ramli
- Faculty of Industrial Science & Technology, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia. .,Bio Aromatic Research Centre of Excellence, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia.
| | - Azzmer Azzar Abdul Hamid
- Department of Biotechnology, Kulliyyah of Science, International Islamic University Malaysia (IIUM), Bandar Indera Mahkota, 25200, Kuantan, Pahang, Malaysia.,Research Unit for Bioinformatics and Computational Biology (RUBIC), Kulliyyah of Science, International Islamic University Malaysia (IIUM), Bandar Indera Mahkota, 25200, Kuantan, Pahang, Malaysia
| |
Collapse
|
7
|
Kundert K, Kortemme T. Computational design of structured loops for new protein functions. Biol Chem 2019; 400:275-288. [PMID: 30676995 PMCID: PMC6530579 DOI: 10.1515/hsz-2018-0348] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 12/18/2018] [Indexed: 12/20/2022]
Abstract
The ability to engineer the precise geometries, fine-tuned energetics and subtle dynamics that are characteristic of functional proteins is a major unsolved challenge in the field of computational protein design. In natural proteins, functional sites exhibiting these properties often feature structured loops. However, unlike the elements of secondary structures that comprise idealized protein folds, structured loops have been difficult to design computationally. Addressing this shortcoming in a general way is a necessary first step towards the routine design of protein function. In this perspective, we will describe the progress that has been made on this problem and discuss how recent advances in the field of loop structure prediction can be harnessed and applied to the inverse problem of computational loop design.
Collapse
Affiliation(s)
- Kale Kundert
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| |
Collapse
|
8
|
Karami Y, Guyon F, De Vries S, Tufféry P. DaReUS-Loop: accurate loop modeling using fragments from remote or unrelated proteins. Sci Rep 2018; 8:13673. [PMID: 30209260 PMCID: PMC6135855 DOI: 10.1038/s41598-018-32079-w] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 08/31/2018] [Indexed: 11/08/2022] Open
Abstract
Despite efforts during the past decades, loop modeling remains a difficult part of protein structure modeling. Several approaches have been developed in the framework of crystal structures. However, for homology models, the modeling of loops is still far from being solved. We propose DaReUS-Loop, a data-based approach that identifies loop candidates mining the complete set of experimental structures available in the Protein Data Bank. Candidate filtering relies on local conformation profile-profile comparison, together with physico-chemical scoring. Applied to three different template-based test sets, DaReUS-Loop shows significant increase in the number of high-accuracy loops, and significant enhancement for modeling long loops. A special advantage is that our method proposes a prediction confidence score that correlates well with the expected accuracy of the loops. Strikingly, over 50% of successful loop models are derived from unrelated proteins, indicating that fragments under similar constraints tend to adopt similar structure, beyond mere homology.
Collapse
Affiliation(s)
- Yasaman Karami
- Molécules Thérapeutiques in silico, UMR-S973, Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, RPBS, 75013, Paris, France
| | - Frédéric Guyon
- Molécules Thérapeutiques in silico, UMR-S973, Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, RPBS, 75013, Paris, France
| | - Sjoerd De Vries
- Molécules Thérapeutiques in silico, UMR-S973, Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, RPBS, 75013, Paris, France.
| | - Pierre Tufféry
- Molécules Thérapeutiques in silico, UMR-S973, Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, RPBS, 75013, Paris, France.
| |
Collapse
|
9
|
Wong SWK, Liu JS, Kou SC. Exploring the conformational space for protein folding with sequential Monte Carlo. Ann Appl Stat 2018. [DOI: 10.1214/17-aoas1124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|