1
|
Soleymani F, Paquet E, Viktor HL, Michalowski W. Structure-based protein and small molecule generation using EGNN and diffusion models: A comprehensive review. Comput Struct Biotechnol J 2024; 23:2779-2797. [PMID: 39050782 PMCID: PMC11268121 DOI: 10.1016/j.csbj.2024.06.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 06/13/2024] [Accepted: 06/18/2024] [Indexed: 07/27/2024] Open
Abstract
Recent breakthroughs in deep learning have revolutionized protein sequence and structure prediction. These advancements are built on decades of protein design efforts, and are overcoming traditional time and cost limitations. Diffusion models, at the forefront of these innovations, significantly enhance design efficiency by automating knowledge acquisition. In the field of de novo protein design, the goal is to create entirely novel proteins with predetermined structures. Given the arbitrary positions of proteins in 3-D space, graph representations and their properties are widely used in protein generation studies. A critical requirement in protein modelling is maintaining spatial relationships under transformations (rotations, translations, and reflections). This property, known as equivariance, ensures that predicted protein characteristics adapt seamlessly to changes in orientation or position. Equivariant graph neural networks offer a solution to this challenge. By incorporating equivariant graph neural networks to learn the score of the probability density function in diffusion models, one can generate proteins with robust 3-D structural representations. This review examines the latest deep learning advancements, specifically focusing on frameworks that combine diffusion models with equivariant graph neural networks for protein generation.
Collapse
Affiliation(s)
- Farzan Soleymani
- Telfer School of Management, University of Ottawa, ON, K1N 6N5, Canada
| | - Eric Paquet
- National Research Council, 1200 Montreal Road, Ottawa, ON, K1A 0R6, Canada
- School of Electrical Engineering and Computer Science, University of Ottawa, ON, K1N 6N5, Canada
| | - Herna Lydia Viktor
- School of Electrical Engineering and Computer Science, University of Ottawa, ON, K1N 6N5, Canada
| | | |
Collapse
|
2
|
Gao T, Yan R, Fang N, He L, Duan Z, Wang J, Ye L, Hu S, Chen Y, Yuan S, Yan X, Yuan M. Alisol C 23-acetate might be a lead compound of potential lipase inhibitor from Alismatis Rhizoma: Screening, identification and molecular dynamics simulation. Int J Biol Macromol 2024; 278:134878. [PMID: 39168221 DOI: 10.1016/j.ijbiomac.2024.134878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Revised: 08/05/2024] [Accepted: 08/17/2024] [Indexed: 08/23/2024]
Abstract
Alismatis Rhizoma (AR), a traditional Chinese medicine for treating obesity in traditional Chinese medicine clinic, is recognized as a promising source of lead compounds of lipase inhibitors. Ultrafiltration centrifugal combined with liquid chromatography-mass spectrometry (UF-LC-MS) was used for screening potential lipase inhibitors from AR, and the result indicated the binding capacity between compound 7 and lipase (92.3 ± 1.28 %) was significantly higher than other triterpenoids, and was identified as alisol C 23-acetate. It exhibited a mixed-type inhibitory behavior with an IC50 value of 84.88 ± 1.03 μM. Subsequently, the binding pockets of alisol C 23-acetate to lipase were predicted, and their binding mechanism was explored with molecular simulation. Pocket 1 (active center) and pocket 4 might be the orthosteric and allosteric binding sites of alisol C 23-acetate to lipase, respectively. The interaction between alisol C 23-acetate and lipase was identified to involve key amino acid residues such as GLY-77, PHE-78, TYR-115, LEU-154, PRO-181, PHE-216, LEU-264, ASP-278, GLN-306, ARG-313, and VAL-426. Meanwhile, alisol C 23-acetate remained stable during the intestinal digestive but degraded in the gastric digestion. Overall, alisol C 23-acetate is expected to be the lead compound of lipase inhibitors for treating obesity.
Collapse
Affiliation(s)
- Tao Gao
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China
| | - Rui Yan
- Wanzhou Food and Drug Inspection Institute, Wanzhou 404100, China
| | - Nan Fang
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China
| | - Lingzhi He
- Wanzhou Food and Drug Inspection Institute, Wanzhou 404100, China
| | - Zhihao Duan
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China
| | - Jiyu Wang
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China
| | - Lin Ye
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China
| | | | - Yanger Chen
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China
| | - Shu Yuan
- College of Resources, Sichuan Agricultural University, Chengdu 611134, China
| | | | - Ming Yuan
- College of Life Science, Sichuan Agricultural University, Yaan 625014, China; State Key Laboratory Foundation of Crop Gene Exploration and Utilization in Southwest China, Sichuan Agricultural University, Chengdu 611130, China.
| |
Collapse
|
3
|
Maleki R, Khedri M, Rezvantalab S, Beheshtizadeh N. Investigation of pH-dependent Paclitaxel delivery mechanism employing Chitosan-Eudragit bioresponsive nanocarriers: a molecular dynamics simulation. J Biol Eng 2024; 18:49. [PMID: 39252122 PMCID: PMC11386078 DOI: 10.1186/s13036-024-00445-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 08/26/2024] [Indexed: 09/11/2024] Open
Abstract
Before embarking on any experimental research endeavor, it is advisable to do a mathematical computation and thoroughly examine the methodology. Despite the use of polymeric nanocarriers, the regulation of bioavailability and drug release at the disease site remains insufficient. Several effective methods have been devised to address this issue, including the creation of polymeric nanocarriers that can react to stimuli such as redox potential, temperature, pH, and light. The present study has been utilized all-atom molecular dynamics (AA-MD) and coarse-grained molecular dynamics (CG-MD) methods and illustrated the drug release mechanism, which is influenced by pH, for Chitosan-Eudragit bioresponsive nanocarriers. The aim of current work is to study the molecular mechanism and atomistic interactions of PAX delivery using a Chitosan-Eudragit carrier. The ability of Eudragit polymers to dissolve in various organic solvents employed in the process of solvent evaporation is a crucial benefit in enhancing the solubility of pharmaceuticals. This study investigated the use of Chitosan-Eudragit nanocarriers for delivering an anti-tumor drug, namely Paclitaxel (PAX). Upon analyzing several significant factors affecting the stability of the drug and nanocarrier, it has been shown that the level of stability is more significant in the neutral state than the acidic state. Furthermore, the system exhibits higher stability in the neutral state. The used Chitosan-Eudragit nanocarriers exhibit a stable structure under alkaline conditions, but undergo deformation and release their payloads under acidic conditions. It was demonstrated that the in silico analysis of anti-tumor drugs and carriers' integration could be quantified and validated by experimental results (from previous works) at an acceptable level.
Collapse
Affiliation(s)
- Reza Maleki
- Department of Chemical Technologies, Iranian Research Organization for Science and Technology (IROST), P.O. Box 33535111, Tehran, Iran.
| | - Mohammad Khedri
- Department of Chemical Engineering, Amirkabir University of Technology, 424 Hafez Avenue, Tehran, Iran
| | - Sima Rezvantalab
- Chemical Engineering Department, Urmia University of Technology, Urmia, 57166-419, Iran
| | - Nima Beheshtizadeh
- Department of Tissue Engineering, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran.
- Regenerative Medicine Group (REMED), Universal Scientific Education and Research Network (USERN), Tehran, Iran.
| |
Collapse
|
4
|
Pala D, Clark DE. Caught between a ROCK and a hard place: current challenges in structure-based drug design. Drug Discov Today 2024; 29:104106. [PMID: 39029868 DOI: 10.1016/j.drudis.2024.104106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 06/27/2024] [Accepted: 07/13/2024] [Indexed: 07/21/2024]
Abstract
The discipline of structure-based drug design (SBDD) is several decades old and it is tempting to think that the proliferation of experimental structures for many drug targets might make computer-aided drug design (CADD) straightforward. However, this is far from true. In this review, we illustrate some of the challenges that CADD scientists face every day in their work, even now. We use Rho-associated protein kinase (ROCK), and public domain structures and data, as an example to illustrate some of the challenges we have experienced during our project targeting this protein. We hope that this will help to prevent unrealistic expectations of what CADD can accomplish and to educate non-CADD scientists regarding the challenges still facing their CADD colleagues.
Collapse
Affiliation(s)
- Daniele Pala
- Medicinal Chemistry and Drug Design Technologies Department, Chiesi Farmaceutici S.p.A, Research Center, Largo Belloli 11/a, 43122 Parma, Italy
| | - David E Clark
- Charles River, 6-9 Spire Green Centre, Flex Meadow, Harlow CM19 5TR, UK.
| |
Collapse
|
5
|
Fiorucci L, Schiavina M, Felli IC, Pierattelli R, Ravera E. Are Protein Conformational Ensembles in Agreement with Experimental Data? A Geometrical Interpretation of the Problem. J Chem Inf Model 2024; 64:5392-5401. [PMID: 38959217 DOI: 10.1021/acs.jcim.4c00582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
The conformational variability of biological macromolecules can play an important role in their biological function. Therefore, understanding conformational variability is expected to be key for predicting the behavior of a particular molecule in the context of organism-wide studies. Several experimental methods have been developed and deployed for accessing this information, and computational methods are continuously updated for the profitable integration of different experimental sources. The outcome of this endeavor is conformational ensembles, which may vary significantly in properties and composition when different ensemble reconstruction methods are used, and this raises the issue of comparing the predicted ensembles against experimental data. In this article, we discuss a geometrical formulation to provide a framework for understanding the agreement of an ensemble prediction to the experimental observations.
Collapse
Affiliation(s)
- Letizia Fiorucci
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Marco Schiavina
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Isabella C Felli
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Roberta Pierattelli
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Enrico Ravera
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Florence Data Science, University of Florence, Viale G.B. Morgagni 59, 50134 Florence, Italy
| |
Collapse
|
6
|
Champion C, Lehner M, Smith AA, Ferrage F, Bolik-Coulon N, Riniker S. Unraveling motion in proteins by combining NMR relaxometry and molecular dynamics simulations: A case study on ubiquitin. J Chem Phys 2024; 160:104105. [PMID: 38465679 DOI: 10.1063/5.0188416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 02/20/2024] [Indexed: 03/12/2024] Open
Abstract
Nuclear magnetic resonance (NMR) relaxation experiments shine light onto the dynamics of molecular systems in the picosecond to millisecond timescales. As these methods cannot provide an atomically resolved view of the motion of atoms, functional groups, or domains giving rise to such signals, relaxation techniques have been combined with molecular dynamics (MD) simulations to obtain mechanistic descriptions and gain insights into the functional role of side chain or domain motion. In this work, we present a comparison of five computational methods that permit the joint analysis of MD simulations and NMR relaxation experiments. We discuss their relative strengths and areas of applicability and demonstrate how they may be utilized to interpret the dynamics in MD simulations with the small protein ubiquitin as a test system. We focus on the aliphatic side chains given the rigidity of the backbone of this protein. We find encouraging agreement between experiment, Markov state models built in the χ1/χ2 rotamer space of isoleucine residues, explicit rotamer jump models, and a decomposition of the motion using ROMANCE. These methods allow us to ascribe the dynamics to specific rotamer jumps. Simulations with eight different combinations of force field and water model highlight how the different metrics may be employed to pinpoint force field deficiencies. Furthermore, the presented comparison offers a perspective on the utility of NMR relaxation to serve as validation data for the prediction of kinetics by state-of-the-art biomolecular force fields.
Collapse
Affiliation(s)
- Candide Champion
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Marc Lehner
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Albert A Smith
- Institute for Medical Physics and Biophysics, Leipzig University, Härtelstrasse 16-18, 04107 Leipzig, Germany
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de Chimie, École normale supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Nicolas Bolik-Coulon
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
7
|
Claussen ER, Renfrew PD, Müller CL, Drew K. Scaffold Matcher: A CMA-ES based algorithm for identifying hotspot aligned peptidomimetic scaffolds. Proteins 2024; 92:343-355. [PMID: 37874196 PMCID: PMC10873094 DOI: 10.1002/prot.26619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 10/06/2023] [Indexed: 10/25/2023]
Abstract
The design of protein interaction inhibitors is a promising approach to address aberrant protein interactions that cause disease. One strategy in designing inhibitors is to use peptidomimetic scaffolds that mimic the natural interaction interface. A central challenge in using peptidomimetics as protein interaction inhibitors, however, is determining how best the molecular scaffold aligns to the residues of the interface it is attempting to mimic. Here we present the Scaffold Matcher algorithm that aligns a given molecular scaffold onto hotspot residues from a protein interaction interface. To optimize the degrees of freedom of the molecular scaffold we implement the covariance matrix adaptation evolution strategy (CMA-ES), a state-of-the-art derivative-free optimization algorithm in Rosetta. To evaluate the performance of the CMA-ES, we used 26 peptides from the FlexPepDock Benchmark and compared with three other algorithms in Rosetta, specifically, Rosetta's default minimizer, a Monte Carlo protocol of small backbone perturbations, and a Genetic algorithm. We test the algorithms' performance on their ability to align a molecular scaffold to a series of hotspot residues (i.e., constraints) along native peptides. Of the 4 methods, CMA-ES was able to find the lowest energy conformation for all 26 benchmark peptides. Additionally, as a proof of concept, we apply the Scaffold Match algorithm with CMA-ES to align a peptidomimetic oligooxopiperazine scaffold to the hotspot residues of the substrate of the main protease of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Our implementation of CMA-ES into Rosetta allows for an alternative optimization method to be used on macromolecular modeling problems with rough energy landscapes. Finally, our Scaffold Matcher algorithm allows for the identification of initial conformations of interaction inhibitors that can be further designed and optimized as high-affinity reagents.
Collapse
Affiliation(s)
- Erin R. Claussen
- Department of Biological Sciences, University of Illinois
at Chicago, Chicago, Il, 60607, USA
| | - P. Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, New
York, NY, 10010, USA
| | - Christian L. Müller
- Ludwig-Maximilians-Universität München
- Helmholtz Munich, München
- Center for Computational Mathematics, Flatiron Institute,
New York
| | - Kevin Drew
- Department of Biological Sciences, University of Illinois
at Chicago, Chicago, Il, 60607, USA
| |
Collapse
|
8
|
Riggi M, Torrez RM, Iwasa JH. 3D animation as a tool for integrative modeling of dynamic molecular mechanisms. Structure 2024; 32:122-130. [PMID: 38183978 PMCID: PMC10872329 DOI: 10.1016/j.str.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 11/01/2023] [Accepted: 12/12/2023] [Indexed: 01/08/2024]
Abstract
As the scientific community accumulates diverse data describing how molecular mechanisms occur, creating and sharing visual models that integrate the richness of this information has become increasingly important to help us explore, refine, and communicate our hypotheses. Three-dimensional (3D) animation is a powerful tool to capture dynamic hypotheses that are otherwise difficult or impossible to visualize using traditional 2D illustration techniques. This perspective discusses the current and future roles that 3D animation can play in the research sphere.
Collapse
Affiliation(s)
- Margot Riggi
- Department of Biochemistry, University of Utah, Salt Lake City, UT, USA
| | - Rachel M Torrez
- Department of Biochemistry, University of Utah, Salt Lake City, UT, USA
| | - Janet H Iwasa
- Department of Biochemistry, University of Utah, Salt Lake City, UT, USA.
| |
Collapse
|
9
|
Joy A, Biswas R. Significance of the Disulfide Bridge in the Structure and Stability of Metalloprotein Azurin. J Phys Chem B 2024; 128:973-984. [PMID: 38236012 DOI: 10.1021/acs.jpcb.3c07089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Metalloproteins make up a class of proteins that incorporate metal ions into their structures, enabling them to perform essential functions in biological systems, such as catalysis and electron transport. Azurin is one such metalloprotein with copper cofactor, having a β-barrel structure with exceptional thermal stability. The copper metal ion is coordinated at one end of the β-barrel structure, and there is a disulfide bond at the opposite end. In this study, we explore the effect of this disulfide bond in the high thermal stability of azurin by analyzing both the native S-S bonded and S-S nonbonded (S-S open) forms using temperature replica exchange molecular dynamics (REMD). Similar to experimental observations, we find a 35 K decrease in denaturation temperature for S-S open azurin compared to that of the native holo form (420 K). As observed in the case of native holo azurin, the unfolding process of the S-S open form also started with disruptions of the α-helix. The free energy surfaces of the unfolding process revealed that the denaturation event of the S-S open form progresses through different sets of conformational ensembles. Subsequently, we compared the stabilities of individual β-sheet strands of both the S-S bonded and the S-S nonbonded forms of azurin. Further, we examined the contacts between individual residues for the central structures from the free energy surfaces of the S-S nonbonded form. The microscopic origin of the lowering in the denaturation temperature is further supplemented by thermodynamic analysis.
Collapse
Affiliation(s)
- Albin Joy
- Department of Chemistry, Indian Institute of Technology Tirupati, Yerpedu, Tirupati, Andhra Pradesh, India 517619
| | - Rajib Biswas
- Department of Chemistry, Indian Institute of Technology Tirupati, Yerpedu, Tirupati, Andhra Pradesh, India 517619
| |
Collapse
|
10
|
Kou X, Su D, Pan F, Xu X, Meng Q, Ke Q. Molecular dynamics simulation techniques and their application to aroma compounds/cyclodextrin inclusion complexes: A review. Carbohydr Polym 2024; 324:121524. [PMID: 37985058 DOI: 10.1016/j.carbpol.2023.121524] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 10/16/2023] [Accepted: 10/22/2023] [Indexed: 11/22/2023]
Abstract
Homeostatic technologies play a crucial role in maintaining the quality and extending the service life of aroma compounds (ACs). Commercial cyclodextrins (CDs) are commonly used to form inclusion complexes (ICs) with ACs to enhance their solubility, stability, and morphology. The selection of suitable CDs and ACs is of paramount importance in this process. Molecular dynamics (MD) simulations provide an in-depth understanding of the interactions between ACs and CDs, aiding researchers in optimising the properties and effects of ICs. This review offers a systematic discussion of the application of MD simulations in ACs/CDs ICs, covering the establishment of the simulation process, parameter selection, model evaluation, and various application cases, along with their advantages and disadvantages. Additionally, this review summarises the major achievements and challenges of this method while identifying areas that require further exploration. These findings may contribute to a comprehensive understanding of the formation and stabilization mechanisms of ACs/CDs ICs and offer guidance for the selection and computational characterisation of CDs in the AC steady state.
Collapse
Affiliation(s)
- Xingran Kou
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology (Shanghai Research Institute of Fragrance & Flavour Industry), Shanghai Institute of Technology, Shanghai, China; Key Laboratory of Textile Science & Technology, Ministry of Education, College of Textiles, Donghua University, Shanghai, China
| | - Dongdong Su
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology (Shanghai Research Institute of Fragrance & Flavour Industry), Shanghai Institute of Technology, Shanghai, China
| | - Fei Pan
- State Key Laboratory of Resource Insects, Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China
| | - Xiwei Xu
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology (Shanghai Research Institute of Fragrance & Flavour Industry), Shanghai Institute of Technology, Shanghai, China
| | - Qingran Meng
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology (Shanghai Research Institute of Fragrance & Flavour Industry), Shanghai Institute of Technology, Shanghai, China.
| | - Qinfei Ke
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology (Shanghai Research Institute of Fragrance & Flavour Industry), Shanghai Institute of Technology, Shanghai, China; Key Laboratory of Textile Science & Technology, Ministry of Education, College of Textiles, Donghua University, Shanghai, China.
| |
Collapse
|
11
|
Durojaye OA, Ejaz U, Uzoeto HO, Fadahunsi AA, Opabunmi AO, Ekpo DE, Sedzro DM, Idris MO. CSC01 shows promise as a potential inhibitor of the oncogenic G13D mutant of KRAS: an in silico approach. Amino Acids 2023; 55:1745-1764. [PMID: 37500789 DOI: 10.1007/s00726-023-03304-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 07/11/2023] [Indexed: 07/29/2023]
Abstract
About 30% of malignant tumors include KRAS mutations, which are frequently required for the development and maintenance of malignancies. KRAS is now a top-priority cancer target as a result. After years of research, it is now understood that the oncogenic KRAS-G12C can be targeted. However, many other forms, such as the G13D mutant, are yet to be addressed. Here, we used a receptor-based pharmacophore modeling technique to generate potential inhibitors of the KRAS-G13D oncogenic mutant. Using a comprehensive virtual screening workflow model, top hits were selected, out of which CSC01 was identified as a promising inhibitor of the oncogenic KRAS mutant (G13D). The stability of CSC01 upon binding the switch II pocket was evaluated through an exhaustive molecular dynamics simulation study. The several post-simulation analyses conducted suggest that CSC01 formed a stable complex with KRAS-G13D. CSC01, through a dynamic protein-ligand interaction profiling analysis, was also shown to maintain strong interactions with the mutated aspartic acid residue throughout the simulation. Although binding free energy analysis through the umbrella sampling approach suggested that the affinity of CSC01 with the switch II pocket of KRAS-G13D is moderate, our DFT analysis showed that the stable interaction of the compound might be facilitated by the existence of favorable molecular electrostatic potentials. Furthermore, based on ADMET predictions, CSC01 demonstrated a satisfactory drug likeness and toxicity profile, making it an exemplary candidate for consideration as a potential KRAS-G13D inhibitor.
Collapse
Affiliation(s)
- Olanrewaju Ayodeji Durojaye
- MOE Key Laboratory of Membraneless Organelle and Cellular Dynamics, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230027, Anhui, China.
- School of Life Sciences, University of Science and Technology of China, Hefei, 230027, Anhui, China.
- Department of Chemical Sciences, Coal City University, Emene, EnuguState, Nigeria.
| | - Umer Ejaz
- MOE Key Laboratory of Membraneless Organelle and Cellular Dynamics, Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230027, Anhui, China
- School of Life Sciences, University of Science and Technology of China, Hefei, 230027, Anhui, China
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 210009, China
| | - Henrietta Onyinye Uzoeto
- Federal College of Dental Technology, Trans-Ekulu, Enugu State, Nigeria
- Department of Biological Sciences, Coal City University, Emene, Enugu State, Nigeria
| | - Adeola Abraham Fadahunsi
- Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME, 04469, USA
| | - Adebayo Oluwole Opabunmi
- RNA Medical Center, International Institutes of Medicine, Zhejiang University, Hangzhou, China
- Zhejiang University-University of Edinburgh Institute, Zhejiang University, Hangzhou, China
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Daniel Emmanuel Ekpo
- Institute of Biological Science and Technology, National Engineering Research Center for Non-Food Biorefinery, Guangxi Academy of Sciences, Nanning, 530007, China
- Department of Biochemistry, Faculty of Biological Sciences, University of Nigeria, 410001, Nsukka, Enugu State, Nigeria
| | - Divine Mensah Sedzro
- Wisconsin National Primate Research Center, University of Wisconsin Graduate School, 1220 Capitol Court, Madison, 53715, WI, USA.
| | - Mukhtar Oluwaseun Idris
- School of Life Sciences, University of Science and Technology of China, Hefei, 230027, Anhui, China.
| |
Collapse
|
12
|
Scrima S, Tiberti M, Ryde U, Lambrughi M, Papaleo E. Comparison of force fields to study the zinc-finger containing protein NPL4, a target for disulfiram in cancer therapy. BIOCHIMICA ET BIOPHYSICA ACTA. PROTEINS AND PROTEOMICS 2023; 1871:140921. [PMID: 37230374 DOI: 10.1016/j.bbapap.2023.140921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 05/16/2023] [Accepted: 05/19/2023] [Indexed: 05/27/2023]
Abstract
Molecular dynamics (MD) simulations are a powerful approach to studying the structure and dynamics of proteins related to health and disease. Advances in the MD field allow modeling proteins with high accuracy. However, modeling metal ions and their interactions with proteins is still challenging. NPL4 is a zinc-binding protein and works as a cofactor for p97 to regulate protein homeostasis. NPL4 is of biomedical importance and has been proposed as the target of disulfiram, a drug recently repurposed for cancer treatment. Experimental studies proposed that the disulfiram metabolites, bis-(diethyldithiocarbamate)‑copper and cupric ions, induce NPL4 misfolding and aggregation. However, the molecular details of their interactions with NPL4 and consequent structural effects are still elusive. Here, biomolecular simulations can help to shed light on the related structural details. To apply MD simulations to NPL4 and its interaction with copper the first important step is identifying a suitable force field to describe the protein in its zinc-bound states. We examined different sets of non-bonded parameters because we want to study the misfolding mechanism and cannot rule out that the zinc may detach from the protein during the process and copper replaces it. We investigated the force-field ability to model the coordination geometry of the metal ions by comparing the results from MD simulations with optimized geometries from quantum mechanics (QM) calculations using model systems of NPL4. Furthermore, we investigated the performance of a force field including bonded parameters to treat copper ions in NPL4 that we obtained based on QM calculations.
Collapse
Affiliation(s)
- Simone Scrima
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100 Copenhagen, Denmark; Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100 Copenhagen, Denmark
| | - Ulf Ryde
- Division of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, SE-221 00 Lund, Sweden
| | - Matteo Lambrughi
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100 Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100 Copenhagen, Denmark; Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800 Lyngby, Denmark.
| |
Collapse
|
13
|
Beheshtizadeh N, Salimi A, Golmohammadi M, Ansari JM, Azami M. In-silico engineering of RNA nanoplatforms to promote the diabetic wound healing. BMC Chem 2023; 17:52. [PMID: 37291669 PMCID: PMC10251717 DOI: 10.1186/s13065-023-00969-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Accepted: 05/30/2023] [Indexed: 06/10/2023] Open
Abstract
One of the most notable required features of wound healing is the enhancement of angiogenesis, which aids in the acceleration of regeneration. Poor angiogenesis during diabetic wound healing is linked to a shortage of pro-angiogenic or an increase in anti-angiogenic factors. As a result, a potential treatment method is to increase angiogenesis promoters and decrease suppressors. Incorporating microRNAs (miRNAs) and small interfering RNAs (siRNAs), two forms of quite small RNA molecules, is one way to make use of RNA interference. Several different types of antagomirs and siRNAs are now in the works to counteract the negative effects of miRNAs. The purpose of this research is to locate novel antagonists for miRNAs and siRNAs that target multiple genes to promote angiogenesis and wound healing in diabetic ulcers.In this context, we used gene ontology analysis by exploring across several datasets. Following data analysis, it was processed using a systems biology approach. The feasibility of incorporating the proposed siRNAs and miRNA antagomirs into polymeric bioresponsive nanocarriers for wound delivery was further investigated by means of a molecular dynamics (MD) simulation study. Among the three nanocarriers tested (Poly (lactic-co-glycolic acid) (PLGA), Polyethylenimine (PEI), and Chitosan (CTS), MD simulations show that the integration of PLGA/hsa-mir-422a is the most stable (total energy = -1202.62 KJ/mol, Gyration radius = 2.154 nm, and solvent-accessible surface area = 408.416 nm2). With values of -25.437 KJ/mol, 0.047 nm for the Gyration radius, and 204.563 nm2 for the SASA, the integration of the second siRNA/ Chitosan took the last place. The results of the systems biology and MD simulations show that the suggested RNA may be delivered through bioresponsive nanocarriers to speed up wound healing by boosting angiogenesis.
Collapse
Affiliation(s)
- Nima Beheshtizadeh
- Department of Tissue Engineering, School of Advanced Technologies in Medicine, Tehran University of Medical Sciences, Tehran, Iran.
- Regenerative Medicine group (REMED), Universal Scientific Education and Research Network (USERN), Tehran, Iran.
- Students? Scientific Research Center, Tehran University of Medical Sciences, Tehran, Iran.
| | - Alireza Salimi
- Regenerative Medicine group (REMED), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- Department of Advanced Technologies, School of Medicine, North Khorasan University of Medical Science, Bojnurd, Iran
| | - Mahsa Golmohammadi
- Department of Polymer Engineering and Color Technology, Amirkabir University of Technology, Tehran, Iran
| | - Javad Mohajer Ansari
- Regenerative Medicine group (REMED), Universal Scientific Education and Research Network (USERN), Tehran, Iran
- Department of Anatomy, School of Medicine, Hormozgan University of Medical Sciences, Jomhuri Eslami Blvd, Bandar Abbas, 7919915519, Iran
| | - Mahmoud Azami
- Department of Tissue Engineering, School of Advanced Technologies in Medicine, Tehran University of Medical Sciences, Tehran, Iran.
- Regenerative Medicine group (REMED), Universal Scientific Education and Research Network (USERN), Tehran, Iran.
| |
Collapse
|
14
|
Abstract
Metal cofactors are critical centers for different biochemical processes of metalloproteins, and often, this metal coordination renders additional structural stability. In this study, we explore the additional stability conferred by the copper ion on azurin by analyzing both the apo and holo forms using temperature replica exchange molecular dynamics (REMD) data. We find a 14 K decrease in denaturation temperature for apo (406 K) azurin relative to that of holo (420 K), indicating a copper ion-induced additional thermal stability for holo azurin. The unfolding of apo azurin begins with the melting of α-helix and β-sheet V, similar to that of holo form. β-Sheets IV, VII, and VIII are comparatively more stable than other β-strands and melt at higher temperatures. Similar to holo azurin, the strong hydrophobic interactions among the apolar residues in the protein core is the key factor that renders high stability to apo protein as well. We construct free energy surfaces at different temperatures to capture the major conformations along the unfolding basins of the protein. Using contact maps from different basins we show the changes in the interaction between different residues along the unfolding pathway. Furthermore, we compare the Cα root-mean-square fluctuations (Cα-RMSF) and B-factor of all residues of apo and holo forms to understand the flexibility of different regions. The concerted displacement of α-helix and β-sheets V and VI from the protein core is another distinction we observe for apo compared to the holo form, where β-sheet VI was relatively stable.
Collapse
Affiliation(s)
- Albin Joy
- Department of Chemistry, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| | - Rajib Biswas
- Department of Chemistry and Center for Atomic, Molecular and Optical Sciences & Technologies, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| |
Collapse
|
15
|
Mishra S, Rout M, Panda S, Singh SK, Sinha R, Dehury B, Pati S. An immunoinformatic approach towards development of a potent and effective multi-epitope vaccine against monkeypox virus (MPXV). J Biomol Struct Dyn 2023; 41:11714-11727. [PMID: 36591724 DOI: 10.1080/07391102.2022.2163426] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/22/2022] [Indexed: 01/03/2023]
Abstract
Monkeypox is a viral zoonotic disease, often transmitted to humans from animals. While the whole world is haggling with the COVID-19 pandemic, the emergence of the monkeypox virus (MPXV) arose as a new challenge to mankind. Till date, numerous cases related to the MPXV have been reported in several countries across the globe, but, its momentary distribution in the current time has left everyone in fright with increasing mortality and limited clinically approved treatments. Therefore, it is of immense importance to develop a potent and highly effective vaccine capable of inducing desired immunogenic responses against the highly contagious MPXV. Herein, using various immunoinformatic and computational biology tools, we made an attempt to develop a multi-epitope vaccine construct against the MPXV which is antigenic, non-allergen and non-toxic in nature and capable of exhibiting immunogenic behavior. The sequence of vaccine construct was designed using the proposed 4 MHC-I, 3 MHC-II and 4 B-cell epitopes linked with suitable adjuvant and linkers. The modeled structure of the vaccine construct was used to assess its interaction with the Toll-like Receptor 4 (TLR4) using ClusPro and HADDOCK. All-atoms molecular dynamics simulation of the MPXV vaccine construct-TLR4 complex followed by a high level of gene expression of the construct within the bacterial system affirmed its stability along with induction of immunogenic response within the host cell. Altogether, our immunoinformatic approach aid in the development of a stable chimeric vaccine construct against MPXV and needs further experimental validation for its immunological relevance and usefulness as a vaccine candidate.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Sarbani Mishra
- Bioinformatics Division, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| | - Madhusmita Rout
- Bioinformatics Division, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| | - Sunita Panda
- Mycology Laboratory, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| | - Susheel Kumar Singh
- Vaccine and Diagnostic Laboratory, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| | - Rohan Sinha
- Department of Computer Science and Engineering, National Institute of Technology Patna, Patna, Bihar, India
| | - Budheswar Dehury
- Bioinformatics Division, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| | - Sanghamitra Pati
- Bioinformatics Division, ICMR-Regional Medical Research Centre, Bhubaneswar, Odisha, India
| |
Collapse
|
16
|
Sala D, Del Alamo D, Mchaourab HS, Meiler J. Modeling of protein conformational changes with Rosetta guided by limited experimental data. Structure 2022; 30:1157-1168.e3. [PMID: 35597243 PMCID: PMC9357069 DOI: 10.1016/j.str.2022.04.013] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/08/2022] [Accepted: 04/25/2022] [Indexed: 11/24/2022]
Abstract
Conformational changes are an essential component of functional cycles of many proteins, but their characterization often requires an integrative structural biology approach. Here, we introduce and benchmark ConfChangeMover (CCM), a new method built into the widely used macromolecular modeling suite Rosetta that is tailored to model conformational changes in proteins using sparse experimental data. CCM can rotate and translate secondary structural elements and modify their backbone dihedral angles in regions of interest. We benchmarked CCM on soluble and membrane proteins with simulated Cα-Cα distance restraints and sparse experimental double electron-electron resonance (DEER) restraints, respectively. In both benchmarks, CCM outperformed state-of-the-art Rosetta methods, showing that it can model a diverse array of conformational changes. In addition, the Rosetta framework allows a wide variety of experimental data to be integrated with CCM, thus extending its capability beyond DEER restraints. This method will contribute to the biophysical characterization of protein dynamics.
Collapse
Affiliation(s)
- Davide Sala
- Institute for Drug Discovery, Leipzig University, Leipzig, Saxony 04103, Germany
| | - Diego Del Alamo
- Department of Chemistry, Vanderbilt University, Nashville, TN 37232, USA; Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN 37235, USA
| | - Hassane S Mchaourab
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN 37235, USA
| | - Jens Meiler
- Institute for Drug Discovery, Leipzig University, Leipzig, Saxony 04103, Germany; Department of Chemistry, Vanderbilt University, Nashville, TN 37232, USA.
| |
Collapse
|
17
|
Kabir KL, Ma B, Nussinov R, Shehu A. Fewer Dimensions, More Structures for Improved Discrete Models of Dynamics of Free versus Antigen-Bound Antibody. Biomolecules 2022; 12:biom12071011. [PMID: 35883567 PMCID: PMC9313177 DOI: 10.3390/biom12071011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 07/12/2022] [Accepted: 07/19/2022] [Indexed: 12/10/2022] Open
Abstract
Over the past decade, Markov State Models (MSM) have emerged as powerful methodologies to build discrete models of dynamics over structures obtained from Molecular Dynamics trajectories. The identification of macrostates for the MSM is a central decision that impacts the quality of the MSM but depends on both the selected representation of a structure and the clustering algorithm utilized over the featurized structures. Motivated by a large molecular system in its free and bound state, this paper investigates two directions of research, further reducing the representation dimensionality in a non-parametric, data-driven manner and including more structures in the computation. Rigorous evaluation of the quality of obtained MSMs via various statistical tests in a comparative setting firmly shows that fewer dimensions and more structures result in a better MSM. Many interesting findings emerge from the best MSM, advancing our understanding of the relationship between antibody dynamics and antibody–antigen recognition.
Collapse
Affiliation(s)
- Kazi Lutful Kabir
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA;
- Correspondence: ; Tel.: +1-571-201-5070
| | - Buyong Ma
- Engineering Research Center of Cell & Therapeutic Antibody School of Pharmacy, Shanghai Jiaotong University, Shanghai 200240, China;
| | - Ruth Nussinov
- Computational Structural Biology Section, Cancer Innovation Laboratory, Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA;
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA;
| |
Collapse
|
18
|
Data Size and Quality Matter: Generating Physically-Realistic Distance Maps of Protein Tertiary Structures. Biomolecules 2022; 12:biom12070908. [PMID: 35883464 PMCID: PMC9313347 DOI: 10.3390/biom12070908] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 06/14/2022] [Accepted: 06/23/2022] [Indexed: 02/01/2023] Open
Abstract
With the debut of AlphaFold2, we now can get a highly-accurate view of a reasonable equilibrium tertiary structure of a protein molecule. Yet, a single-structure view is insufficient and does not account for the high structural plasticity of protein molecules. Obtaining a multi-structure view of a protein molecule continues to be an outstanding challenge in computational structural biology. In tandem with methods formulated under the umbrella of stochastic optimization, we are now seeing rapid advances in the capabilities of methods based on deep learning. In recent work, we advance the capability of these models to learn from experimentally-available tertiary structures of protein molecules of varying lengths. In this work, we elucidate the important role of the composition of the training dataset on the neural network’s ability to learn key local and distal patterns in tertiary structures. To make such patterns visible to the network, we utilize a contact map-based representation of protein tertiary structure. We show interesting relationships between data size, quality, and composition on the ability of latent variable models to learn key patterns of tertiary structure. In addition, we present a disentangled latent variable model which improves upon the state-of-the-art variable autoencoder-based model in key, physically-realistic structural patterns. We believe this work opens up further avenues of research on deep learning-based models for computing multi-structure views of protein molecules.
Collapse
|
19
|
Tsai E, Gallage Dona HK, Tong X, Du P, Novak B, David R, Rick SW, Zhang D, Kumar R. Unraveling the Role of Charge Patterning in the Micellar Structure of Sequence-Defined Amphiphilic Peptoid Oligomers by Molecular Dynamics Simulations. Macromolecules 2022; 55:5197-5212. [PMID: 35784657 PMCID: PMC9245439 DOI: 10.1021/acs.macromol.2c00141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 05/26/2022] [Indexed: 11/28/2022]
Abstract
![]()
Electrostatic interactions
play a significant role in regulating
biological systems and have received increasing attention due to their
usefulness in designing advanced stimulus-responsive materials. Polypeptoids
are highly tunable N-substituted peptidomimetic polymers that lack
backbone hydrogen bonding and chirality. Therefore, polypeptoids are
suitable systems to study the effect of noncovalent interactions of
substituents without complications of backbone intramolecular and
intermolecular hydrogen bonding. In this study, all-atom molecular
dynamics (MD) simulations were performed on micelles formed by a series
of sequence-defined ionic polypeptoid block copolymers consisting
of a hydrophobic segment and a hydrophilic segment in an aqueous solution.
By combining the results from MD simulations and experimental small-angle
neutron scattering data, further insights were gained into the internal
structure of the formed polypeptoid micelles, which is not always
directly accessible from experiments. In addition, information was
gained into the physics of the noncovalent interactions responsible
for the self-assembly of weakly charged polypeptoids in an aqueous
solution. While the aggregation number is governed by electrostatic
repulsion of the negatively charged carboxylate (COO–) substituents on the polypeptoid chain within the micelle, MD simulations
indicate that the position of the charge on singly charged chains
mediates the shape of the micelle through the charge–dipole
interactions between the COO– substituent and the
surrounding water. Therefore, the polypeptoid micelles formed from
the single-charged series offer the possibility for tailorable micelle
shapes. In contrast, the polypeptoid micelles formed from the triple-charged
series are characterized by more pronounced electrostatic repulsion
that competes with more significant charge–sodium interactions,
making it difficult to predict the shape of the micelles. This work
has helped further develop design principles for the shape and structure
of self-assembled micelles by controlling the position of charged
moieties on the backbone of polypeptoid block copolymers.
Collapse
Affiliation(s)
- Erin Tsai
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | | | - Xinjie Tong
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | - Pu Du
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | - Brian Novak
- Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | - Rolf David
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | - Steven W. Rick
- Department of Chemistry, University of New Orleans, New Orleans, Louisiana 70148, United States
| | - Donghui Zhang
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| | - Revati Kumar
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, United States
- Center for Computation and Technology, Louisiana State University, Baton Rouge, Louisiana 70803, United States
| |
Collapse
|
20
|
Reif MM, Zacharias M. Improving the Potential of Mean Force and Nonequilibrium Pulling Simulations by Simultaneous Alchemical Modifications. J Chem Theory Comput 2022; 18:3873-3893. [PMID: 35653503 DOI: 10.1021/acs.jctc.1c01194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We present an approach combining alchemical modifications and physical-pathway methods to calculate absolute binding free energies. The employed physical-pathway method is either a stratified umbrella sampling to calculate a potential of mean force or nonequilibrium pulling. We devised two basic approaches: the simultaneous approach (S-approach), where, along the physical unbinding pathway, an alchemical transformation of ligand-protein interactions is installed and deinstalled, and the prior-plus-simultaneous approach (PPS-approach), where, prior to the physical-pathway simulation, an alchemical transformation of ligand-protein interactions is installed in the binding site and deinstalled during the physical-pathway simulation. Using a mutant of T4 lysozyme with a benzene ligand as an example, we show that installation and deinstallation of soft-core interactions concurrent with physical ligand unbinding (S-approach) allow successful potential of mean force calculations and nonequilibrium pulling simulations despite the problems posed by the occluded nature of the lysozyme binding pocket. Good agreement between the potential of the mean-force-based S-approach and double decoupling simulations as well as a remarkable efficiency and accuracy of the nonequilibrium-pulling-based S-approach is found. The latter turned out to be more compute-efficient than the potential of mean force calculation by approximately 70%. Furthermore, we illustrate the merits of reducing ligand-protein interactions prior to potential of mean force calculations using the murine double minute homologue protein MDM2 with a p53-derived peptide ligand (PPS-approach). Here, the problem of breaking strong interactions in the binding pocket is transferred to a prior alchemical transformation that reduces the free-energy barrier between the bound and unbound state in the potential of mean force. Besides, disentangling physical ligand displacement from the deinstallation of ligand-protein interactions was seen to allow a more uniform sampling of distance histograms in the umbrella sampling. In the future, physical ligand unbinding combined with simultaneous alchemical modifications may prove useful in the calculation of protein-protein binding free energies, where sampling problems posed by multiple, possibly sticky interactions and potential steric clashes can thus be reduced.
Collapse
Affiliation(s)
- Maria M Reif
- Center for Protein Assemblies (CPA), Physics Department, Chair of Theoretical Biophysics (T38), Technical University of Munich, Ernst-Otto-Fischer-Str. 8, Garching 85748, Germany
| | - Martin Zacharias
- Center for Protein Assemblies (CPA), Physics Department, Chair of Theoretical Biophysics (T38), Technical University of Munich, Ernst-Otto-Fischer-Str. 8, Garching 85748, Germany
| |
Collapse
|
21
|
Akhter N, Kabir KL, Chennupati G, Vangara R, Alexandrov BS, Djidjev H, Shehu A. Improved Protein Decoy Selection via Non-Negative Matrix Factorization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1670-1682. [PMID: 33400654 DOI: 10.1109/tcbb.2020.3049088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
A central challenge in protein modeling research and protein structure prediction in particular is known as decoy selection. The problem refers to selecting biologically-active/native tertiary structures among a multitude of physically-realistic structures generated by template-free protein structure prediction methods. Research on decoy selection is active. Clustering-based methods are popular, but they fail to identify good/near-native decoys on datasets where near-native decoys are severely under-sampled by a protein structure prediction method. Reasonable progress is reported by methods that additionally take into account the internal energy of a structure and employ it to identify basins in the energy landscape organizing the multitude of decoys. These methods, however, incur significant time costs for extracting basins from the landscape. In this paper, we propose a novel decoy selection method based on non-negative matrix factorization. We demonstrate that our method outperforms energy landscape-based methods. In particular, the proposed method addresses both the time cost issue and the challenge of identifying good decoys in a sparse dataset, successfully recognizing near-native decoys for both easy and hard protein targets.
Collapse
|
22
|
Moshawih S, Goh HP, Kifli N, Idris AC, Yassin H, Kotra V, Goh KW, Liew KB, Ming LC. Synergy between machine learning and natural products cheminformatics: Application to the lead discovery of anthraquinone derivatives. Chem Biol Drug Des 2022; 100:185-217. [PMID: 35490393 DOI: 10.1111/cbdd.14062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 04/15/2022] [Accepted: 04/23/2022] [Indexed: 11/28/2022]
Abstract
Cheminformatics utilizing machine learning (ML) techniques have opened up a new horizon in drug discovery. This is owing to vast chemical space expansion with rocketing numbers of expected hits and lead compounds that match druggable macromolecular targets, in particular from natural compounds. Due to the natural products' (NP) structural complexity, uniqueness, and diversity, they could occupy a bigger space in pharmaceuticals, allowing the industry to pursue more selective leads in the nanomolar range of binding affinity. ML is an essential part of each step of the drug design pipeline, such as target prediction, compound library preparation, and lead optimization. Notably, molecular mechanic and dynamic simulations, induced docking, and free energy perturbations are essential in predicting best binding poses, binding free energy values, and molecular mechanics force fields. Those applications have leveraged from artificial intelligence (AI), which decreases the computational costs required for such costly simulations. This review aimed to describe chemical space and compound libraries related to NPs. High-throughput screening utilized for fractionating NPs and high-throughput virtual screening and their strategies, and significance, are reviewed. Particular emphasis was given to AI approaches, ML tools, algorithms, and techniques, especially in drug discovery of macrocyclic compounds and approaches in computer-aided and ML-based drug discovery. Anthraquinone derivatives were discussed as a source of new lead compounds that can be developed using ML tools for diverse medicinal uses such as cancer, infectious diseases, and metabolic disorders. Furthermore, the power of principal component analysis in understanding relevant protein conformations, and molecular modeling of protein-ligand interaction were also presented. Apart from being a concise reference for cheminformatics, this review is a useful text to understand the application of ML-based algorithms to molecular dynamics simulation and in silico absorption, distribution, metabolism, excretion, and toxicity prediction.
Collapse
Affiliation(s)
- Said Moshawih
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Hui Poh Goh
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Nurolaini Kifli
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Azam Che Idris
- Faculty of Integrated Technologies, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Hayati Yassin
- Faculty of Integrated Technologies, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Vijay Kotra
- Faculty of Pharmacy, Quest International University, Perak, Malaysia
| | - Khang Wen Goh
- Faculty of Data Science and Information Technology, INTI International University, Nilai, Malaysia
| | - Kai Bin Liew
- Faculty of Pharmacy, University of Cyberjaya, Cyberjaya, Malaysia
| | - Long Chiau Ming
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| |
Collapse
|
23
|
Huang W, Ou X, Luo J. Inverse Boltzmann Iterative Multi-Scale Molecular Dynamics Study between Carbon Nanotubes and Amino Acids. Molecules 2022; 27:2785. [PMID: 35566140 PMCID: PMC9104776 DOI: 10.3390/molecules27092785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/23/2022] [Accepted: 04/26/2022] [Indexed: 11/17/2022] Open
Abstract
Our work uses Iterative Boltzmann Inversion (IBI) to study the coarse-grained interaction between 20 amino acids and the representative carbon nanotube CNT55L3. IBI is a multi-scale simulation method that has attracted the attention of many researchers in recent years. It can effectively modify the coarse-grained model derived from the Potential of Mean Force (PMF). IBI is based on the distribution result obtained by All-Atom molecular dynamics simulation; that is, the target distribution function and the PMF potential energy are extracted, and then, the initial potential energy extracted by the PMF is used to perform simulation iterations using IBI. Our research results have been through more than 100 iterations, and finally, the distribution obtained by coarse-grained molecular simulation (CGMD) can effectively overlap with the results of all-atom molecular dynamics simulation (AAMD). In addition, our work lays the foundation for the study of force fields for the simulation of the coarse-graining of super-large proteins and other important nanoparticles.
Collapse
Affiliation(s)
- Wanying Huang
- T-Life Research Center, State Key Laboratory of Surface Physics, Department of Physics, Fudan University, Shanghai 200433, China;
| | - Xinwen Ou
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong 999077, China;
| | - Junyan Luo
- Department of Physics, Zhejiang University of Science and Technology, Hangzhou 310023, China
| |
Collapse
|
24
|
Mlýnský V, Janeček M, Kührová P, Fröhlking T, Otyepka M, Bussi G, Banáš P, Šponer J. Toward Convergence in Folding Simulations of RNA Tetraloops: Comparison of Enhanced Sampling Techniques and Effects of Force Field Modifications. J Chem Theory Comput 2022; 18:2642-2656. [PMID: 35363478 DOI: 10.1021/acs.jctc.1c01222] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Atomistic molecular dynamics simulations represent an established technique for investigation of RNA structural dynamics. Despite continuous development, contemporary RNA simulations still suffer from suboptimal accuracy of empirical potentials (force fields, ffs) and sampling limitations. Development of efficient enhanced sampling techniques is important for two reasons. First, they allow us to overcome the sampling limitations, and second, they can be used to quantify ff imbalances provided they reach a sufficient convergence. Here, we study two RNA tetraloops (TLs), namely the GAGA and UUCG motifs. We perform extensive folding simulations and calculate folding free energies (ΔGfold°) with the aim to compare different enhanced sampling techniques and to test several modifications of the nonbonded terms extending the AMBER OL3 RNA ff. We demonstrate that replica-exchange solute tempering (REST2) simulations with 12-16 replicas do not show any sign of convergence even when extended to a timescale of 120 μs per replica. However, the combination of REST2 with well-tempered metadynamics (ST-MetaD) achieves good convergence on a timescale of 5-10 μs per replica, improving the sampling efficiency by at least 2 orders of magnitude. Effects of ff modifications on ΔGfold° energies were initially explored by the reweighting approach and then validated by new simulations. We tested several manually prepared variants of the gHBfix potential which improve stability of the native state of both TLs by ∼2 kcal/mol. This is sufficient to conveniently stabilize the folded GAGA TL while the UUCG TL still remains under-stabilized. Appropriate adjustment of van der Waals parameters for C-H···O5' base-phosphate interaction may further stabilize the native states of both TLs by ∼0.6 kcal/mol.
Collapse
Affiliation(s)
- Vojtěch Mlýnský
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Michal Janeček
- Department of Physical Chemistry, Faculty of Science, Palacký University, tř. 17 listopadu 12, 771 46 Olomouc, Czech Republic
| | - Petra Kührová
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Thorben Fröhlking
- Scuola Internazionale Superiore di Studi Avanzati, SISSA, via Bonomea 265, 34136 Trieste, Italy
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic.,IT4Innovations, VSB─Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, SISSA, via Bonomea 265, 34136 Trieste, Italy
| | - Pavel Banáš
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| |
Collapse
|
25
|
Joy A, Biswas R. Molecular Insight into the High Thermal Stability of Metalloprotein Azurin. J Phys Chem B 2022; 126:2496-2506. [PMID: 35324174 DOI: 10.1021/acs.jpcb.2c00622] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We investigate the events characterizing the steps of the unfolding pathway of blue copper metalloprotein azurin using replica exchange molecular dynamics (REMD). Our studies show that the unfolding of azurin begins with the melting of α-helix and β-sheets II and V. This is followed by the melting of other β-sheets and the exposure of hydrophobic protein core to the solvent, resulting in disruptions of its tertiary structure. Free energy surfaces constructed at different temperatures portray different basins that signify the stability of different melted structures in the unfolding process. The contact maps at different temperatures reveal that the strong hydrophobic interaction within the core of the protein is the vital force that renders high stability to this protein. Analysis of the individual β-sheets by looking into their amino acid sequence shows that β-sheets with charged side chains on the surface melt fast compared to others. The β-barrel of azurin is able to dynamically rearrange, and it helps the protein to preserve its hydrophobic core, holding back the native topology from melting fast. B-factor analysis shows that residues of β-sheets III, IV, and VII deviate less from their initial structure at the transition temperature.
Collapse
Affiliation(s)
- Albin Joy
- Department of Chemistry, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| | - Rajib Biswas
- Department of Chemistry and Center for Atomic, Molecular and Optical Sciences & Technologies, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| |
Collapse
|
26
|
Puthenveetil R, Christenson ET, Vinogradova O. New Horizons in Structural Biology of Membrane Proteins: Experimental Evaluation of the Role of Conformational Dynamics and Intrinsic Flexibility. MEMBRANES 2022; 12:227. [PMID: 35207148 PMCID: PMC8877495 DOI: 10.3390/membranes12020227] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 02/13/2022] [Accepted: 02/14/2022] [Indexed: 02/08/2023]
Abstract
A plethora of membrane proteins are found along the cell surface and on the convoluted labyrinth of membranes surrounding organelles. Since the advent of various structural biology techniques, a sub-population of these proteins has become accessible to investigation at near-atomic resolutions. The predominant bona fide methods for structure solution, X-ray crystallography and cryo-EM, provide high resolution in three-dimensional space at the cost of neglecting protein motions through time. Though structures provide various rigid snapshots, only an amorphous mechanistic understanding can be inferred from interpolations between these different static states. In this review, we discuss various techniques that have been utilized in observing dynamic conformational intermediaries that remain elusive from rigid structures. More specifically we discuss the application of structural techniques such as NMR, cryo-EM and X-ray crystallography in studying protein dynamics along with complementation by conformational trapping by specific binders such as antibodies. We finally showcase the strength of various biophysical techniques including FRET, EPR and computational approaches using a multitude of succinct examples from GPCRs, transporters and ion channels.
Collapse
Affiliation(s)
- Robbins Puthenveetil
- Section on Structural and Chemical Biology of Membrane Proteins, Neurosciences and Cellular and Structural Biology Division, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, 35A Convent Dr., Bethesda, MD 20892, USA
| | | | - Olga Vinogradova
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
27
|
Reif MM, Zacharias M. Computational Tools for Accurate Binding Free-Energy Prediction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2385:255-292. [PMID: 34888724 DOI: 10.1007/978-1-0716-1767-0_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
A quantitative thermodynamic understanding of the noncovalent association of (bio)molecules is of central importance in molecular life sciences. An important quantity characterizing (bio)molecular association is the binding affinity or absolute binding free energy. In recent years, the computational prediction of absolute binding free energies has evolved considerably in terms of accuracy, computational speed, and user-friendliness. In this chapter, we first give an overview of how absolute free energies are defined and how they can be determined with computational means. We proceed with an outline of the theoretical basis of the two most reliable methods, potential of mean force, and double decoupling calculations. In particular, we describe how the sampling problem can be alleviated by application of restraints. Finally, we provide step-by-step instructions of how to set up corresponding molecular simulations with a commonly employed molecular dynamics simulation engine.
Collapse
Affiliation(s)
- Maria M Reif
- Physics Department (T38), Technische Universität München, Garching, Germany
| | - Martin Zacharias
- Physics Department (T38), Technische Universität München, Garching, Germany.
| |
Collapse
|
28
|
Guo X, Du Y, Tadepalli S, Zhao L, Shehu A. Generating tertiary protein structures via interpretable graph variational autoencoders. BIOINFORMATICS ADVANCES 2021; 1:vbab036. [PMID: 36700110 PMCID: PMC9710582 DOI: 10.1093/bioadv/vbab036] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 11/07/2021] [Accepted: 11/17/2021] [Indexed: 01/28/2023]
Abstract
Motivation Modeling the structural plasticity of protein molecules remains challenging. Most research has focused on obtaining one biologically active structure. This includes the recent AlphaFold2 that has been hailed as a breakthrough for protein modeling. Computing one structure does not suffice to understand how proteins modulate their interactions and even evade our immune system. Revealing the structure space available to a protein remains challenging. Data-driven approaches that learn to generate tertiary structures are increasingly garnering attention. These approaches exploit the ability to represent tertiary structures as contact or distance maps and make direct analogies with images to harness convolution-based generative adversarial frameworks from computer vision. Since such opportunistic analogies do not allow capturing highly structured data, current deep models struggle to generate physically realistic tertiary structures. Results We present novel deep generative models that build upon the graph variational autoencoder framework. In contrast to existing literature, we represent tertiary structures as 'contact' graphs, which allow us to leverage graph-generative deep learning. Our models are able to capture rich, local and distal constraints and additionally compute disentangled latent representations that reveal the impact of individual latent factors. This elucidates what the factors control and makes our models more interpretable. Rigorous comparative evaluation along various metrics shows that the models, we propose advance the state-of-the-art. While there is still much ground to cover, the work presented here is an important first step, and graph-generative frameworks promise to get us to our goal of unraveling the exquisite structural complexity of protein molecules. Availability and implementation Code is available at https://github.com/anonymous1025/CO-VAE. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Xiaojie Guo
- Department of Information Sciences and Technology, George Mason University, Fairfax, VA 22030, USA
| | - Yuanqi Du
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA
| | - Sivani Tadepalli
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA
| | - Liang Zhao
- Department of Computer Science, Emory University, Atlanta, GA 30322, USA
| | - Amarda Shehu
- Department of Bioengineering, George Mason University, Fairfax, VA 22030, USA,To whom correspondence should be addressed.
| |
Collapse
|
29
|
Tian H, Jiang X, Trozzi F, Xiao S, Larson EC, Tao P. Explore Protein Conformational Space With Variational Autoencoder. Front Mol Biosci 2021; 8:781635. [PMID: 34869602 PMCID: PMC8633506 DOI: 10.3389/fmolb.2021.781635] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 10/28/2021] [Indexed: 12/02/2022] Open
Abstract
Molecular dynamics (MD) simulations have been actively used in the study of protein structure and function. However, extensive sampling in the protein conformational space requires large computational resources and takes a prohibitive amount of time. In this study, we demonstrated that variational autoencoders (VAEs), a type of deep learning model, can be employed to explore the conformational space of a protein through MD simulations. VAEs are shown to be superior to autoencoders (AEs) through a benchmark study, with low deviation between the training and decoded conformations. Moreover, we show that the learned latent space in the VAE can be used to generate unsampled protein conformations. Additional simulations starting from these generated conformations accelerated the sampling process and explored hidden spaces in the conformational landscape.
Collapse
Affiliation(s)
- Hao Tian
- Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Department of Chemistry, Southern Methodist University, Dallas, TX, United States
| | - Xi Jiang
- Department of Statistical Science, Southern Methodist University, Dallas, TX, United States
| | - Francesco Trozzi
- Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Department of Chemistry, Southern Methodist University, Dallas, TX, United States
| | - Sian Xiao
- Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Department of Chemistry, Southern Methodist University, Dallas, TX, United States
| | - Eric C. Larson
- Department of Computer Science, Southern Methodist University, Dallas, TX, United States
| | - Peng Tao
- Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Department of Chemistry, Southern Methodist University, Dallas, TX, United States
| |
Collapse
|
30
|
Li Z, Yao Y, Cheng X, Li W, Fei T. An in silico drug repositioning workflow for host-based antivirals. STAR Protoc 2021; 2:100653. [PMID: 34286288 PMCID: PMC8273420 DOI: 10.1016/j.xpro.2021.100653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Drug repositioning represents a cost- and time-efficient strategy for drug development. Artificial intelligence-based algorithms have been applied in drug repositioning by predicting drug-target interactions in an efficient and high throughput manner. Here, we present a workflow of in silico drug repositioning for host-based antivirals using specially defined targets, a refined list of drug candidates, and an easily implemented computational framework. The workflow described here can also apply to more general purposes, especially when given a user-defined druggable target gene set. For complete details on the use and execution of this protocol, please refer to Li et al. (2021).
Collapse
Affiliation(s)
- Zexu Li
- College of Life and Health Sciences, Northeastern University, Shenyang 110819, People’s Republic of China
- Key Laboratory of Data Analytics and Optimization for Smart Industry (Northeastern University), Ministry of Education, Shenyang 110819, People’s Republic of China
| | - Yingjia Yao
- College of Life and Health Sciences, Northeastern University, Shenyang 110819, People’s Republic of China
- Key Laboratory of Data Analytics and Optimization for Smart Industry (Northeastern University), Ministry of Education, Shenyang 110819, People’s Republic of China
| | - Xiaolong Cheng
- Center for Genetic Medicine Research, Children’s National Hospital, 111 Michigan Ave NW, Washington, DC 20010, USA
- Department of Genomics and Precision Medicine, George Washington University, 111 Michigan Ave NW, Washington, DC 20010, USA
| | - Wei Li
- Center for Genetic Medicine Research, Children’s National Hospital, 111 Michigan Ave NW, Washington, DC 20010, USA
- Department of Genomics and Precision Medicine, George Washington University, 111 Michigan Ave NW, Washington, DC 20010, USA
| | - Teng Fei
- College of Life and Health Sciences, Northeastern University, Shenyang 110819, People’s Republic of China
- Key Laboratory of Data Analytics and Optimization for Smart Industry (Northeastern University), Ministry of Education, Shenyang 110819, People’s Republic of China
| |
Collapse
|
31
|
Westermayr J, Marquetand P. Machine Learning for Electronically Excited States of Molecules. Chem Rev 2021; 121:9873-9926. [PMID: 33211478 PMCID: PMC8391943 DOI: 10.1021/acs.chemrev.0c00749] [Citation(s) in RCA: 167] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Indexed: 12/11/2022]
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna
Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data
Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
32
|
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
33
|
Williams AE, Hammer NI, Fortenberry RC, Reinemann DN. Tracking the Amide I and αCOO- Terminal ν(C=O) Raman Bands in a Family of l-Glutamic Acid-Containing Peptide Fragments: A Raman and DFT Study. Molecules 2021; 26:4790. [PMID: 34443382 PMCID: PMC8399447 DOI: 10.3390/molecules26164790] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/02/2021] [Accepted: 08/04/2021] [Indexed: 11/16/2022] Open
Abstract
The E-hook of β-tubulin plays instrumental roles in cytoskeletal regulation and function. The last six C-terminal residues of the βII isotype, a peptide of amino acid sequence EGEDEA, extend from the microtubule surface and have eluded characterization with classic X-ray crystallographic techniques. The band position of the characteristic amide I vibration of small peptide fragments is heavily dependent on the length of the peptide chain, the extent of intramolecular hydrogen bonding, and the overall polarity of the fragment. The dependence of the E residue's amide I ν(C=O) and the αCOO- terminal ν(C=O) bands on the neighboring side chain, the length of the peptide fragment, and the extent of intramolecular hydrogen bonding in the structure are investigated here via the EGEDEA peptide. The hexapeptide is broken down into fragments increasing in size from dipeptides to hexapeptides, including EG, ED, EA, EGE, EDE, DEA, EGED, EDEA, EGEDE, GEDEA, and, finally, EGEDEA, which are investigated with experimental Raman spectroscopy and density functional theory (DFT) computations to model the zwitterionic crystalline solids (in vacuo). The molecular geometries and Boltzmann sum of the simulated Raman spectra for a set of energetic minima corresponding to each peptide fragment are computed with full geometry optimizations and corresponding harmonic vibrational frequency computations at the B3LYP/6-311++G(2df,2pd) level of theory. In absence of the crystal structure, geometry sampling is performed to approximate solid phase behavior. Natural bond order (NBO) analyses are performed on each energetic minimum to quantify the magnitude of the intramolecular hydrogen bonds. The extent of the intramolecular charge transfer is dependent on the overall polarity of the fragment considered, with larger and more polar fragments exhibiting the greatest extent of intramolecular charge transfer. A steady blue shift arises when considering the amide I band position moving linearly from ED to EDE to EDEA to GEDEA and, finally, to EGEDEA. However, little variation is observed in the αCOO- ν(C=O) band position in this family of fragments.
Collapse
Affiliation(s)
- Ashley E. Williams
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS 38677, USA; (A.E.W.); (N.I.H.)
| | - Nathan I. Hammer
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS 38677, USA; (A.E.W.); (N.I.H.)
| | - Ryan C. Fortenberry
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS 38677, USA; (A.E.W.); (N.I.H.)
| | - Dana N. Reinemann
- Department of Biomedical Engineering, University of Mississippi, University, MS 38677, USA
- Department of Chemical Engineering, University of Mississippi, University, MS 38677, USA
| |
Collapse
|
34
|
Kingdon ADH, Alderwick LJ. Structure-based in silico approaches for drug discovery against Mycobacterium tuberculosis. Comput Struct Biotechnol J 2021; 19:3708-3719. [PMID: 34285773 PMCID: PMC8258792 DOI: 10.1016/j.csbj.2021.06.034] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 06/22/2021] [Accepted: 06/22/2021] [Indexed: 12/12/2022] Open
Abstract
Mycobacterium tuberculosis is the causative agent of TB and was estimated to cause 1.4 million death in 2019, alongside 10 million new infections. Drug resistance is a growing issue, with multi-drug resistant infections representing 3.3% of all new infections, hence novel antimycobacterial drugs are urgently required to combat this growing health emergency. Alongside this, increased knowledge of gene essentiality in the pathogenic organism and larger compound databases can aid in the discovery of new drug compounds. The number of protein structures, X-ray based and modelled, is increasing and now accounts for greater than > 80% of all predicted M. tuberculosis proteins; allowing novel targets to be investigated. This review will focus on structure-based in silico approaches for drug discovery, covering a range of complexities and computational demands, with associated antimycobacterial examples. This includes molecular docking, molecular dynamic simulations, ensemble docking and free energy calculations. Applications of machine learning onto each of these approaches will be discussed. The need for experimental validation of computational hits is an essential component, which is unfortunately missing from many current studies. The future outlooks of these approaches will also be discussed.
Collapse
Key Words
- CV, collective variable
- Docking
- Drug discovery
- In silico
- LIE, Linear Interaction Energy
- MD, Molecular Dynamic
- MDR, multi-drug resistant
- MMPB(GB)SA, Molecular Mechanics with Poisson Boltzmann (or generalised Born) and Surface Area solvation
- Machine learning
- Mt, Mycobacterium tuberculosis
- Mycobacterium tuberculosis
- PTC, peptidyl transferase centre
- RMSD, root-mean square-deviation
- Tuberculosis, TB
- cMD, Classical Molecular Dynamic
- cryo-EM, cryogenic electron microscopy
- ns, nanosecond
Collapse
Affiliation(s)
- Alexander D H Kingdon
- Institute of Microbiology and Infection, School of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
| | - Luke J Alderwick
- Institute of Microbiology and Infection, School of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
| |
Collapse
|
35
|
Sobieraj M, Setny P. Entropy-based distance cutoff for protein internal contact networks. Proteins 2021; 89:1333-1339. [PMID: 34053102 DOI: 10.1002/prot.26154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 05/17/2021] [Accepted: 05/19/2021] [Indexed: 11/05/2022]
Abstract
Protein structure networks (PSNs) have long been used to provide a coarse yet meaningful representation of protein structure, dynamics, and internal communication pathways. An important question is what criteria should be applied to construct the network so that to include relevant interresidue contacts while avoiding unnecessary connections. To address this issue, we systematically considered varying residue distance cutoff length and the probability threshold for contact formation to construct PSNs based on atomistic molecular dynamics in order to assess the amount of mutual information within the resulting representations. We found that the minimum in mutual information is universally achieved at the cutoff length of 5 Å, irrespective of the applied contact formation probability threshold in all considered, distinct proteins. Assuming that the optimal PSNs should be characterized by the least amount of redundancy, which corresponds to the minimum in mutual information, this finding suggests an objective criterion for cutoff distance and supports the existing preference towards its customary selection around 5 Å length, typically based to date on heuristic criteria.
Collapse
Affiliation(s)
- Marcin Sobieraj
- Center of New Technologies, University of Warsaw, Warsaw, Poland.,Department of Physics, University of Warsaw, Warsaw, Poland
| | - Piotr Setny
- Center of New Technologies, University of Warsaw, Warsaw, Poland
| |
Collapse
|
36
|
Arasteh S, Zhang BW, Levy RM. Protein Loop Conformational Free Energy Changes via an Alchemical Path without Reaction Coordinates. J Phys Chem Lett 2021; 12:4368-4377. [PMID: 33938761 PMCID: PMC8170697 DOI: 10.1021/acs.jpclett.1c00778] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
We introduce a method called restrain-free energy perturbation-release 2.0 (R-FEP-R 2.0) to estimate conformational free energy changes of protein loops via an alchemical path. R-FEP-R 2.0 is a generalization of the method called restrain-free energy perturbation-release (R-FEP-R) that can only estimate conformational free energy changes of protein side chains but not loops. The reorganization of protein loops is a central feature of many biological processes. Unlike other advanced sampling algorithms such as umbrella sampling and metadynamics, R-FEP-R and R-FEP-R 2.0 do not require predetermined collective coordinates and transition pathways that connect the two endpoint conformational states. The R-FEP-R 2.0 method was applied to estimate the conformational free energy change of a β-turn flip in the protein ubiquitin. The result obtained by R-FEP-R 2.0 agrees with the benchmarks very well. We also comment on problems commonly encountered when applying umbrella sampling to calculate protein conformational free energy changes.
Collapse
Affiliation(s)
- Shima Arasteh
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Bin W Zhang
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
37
|
Ozkan A, Sitharam M, Flores-Canales JC, Prabhu R, Kurnikova M. Baseline Comparisons of Complementary Sampling Methods for Assembly Driven by Short-Ranged Pair Potentials toward Fast and Flexible Hybridization. J Chem Theory Comput 2021; 17:1967-1987. [PMID: 33576635 DOI: 10.1021/acs.jctc.0c00945] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This work measures baseline sampling characteristics that highlight fundamental differences between sampling methods for assembly driven by short-ranged pair potentials. Such granular comparison is essential for fast, flexible, and accurate hybridization of complementary methods. Besides sampling speed, efficiency, and accuracy of uniform grid coverage, other sampling characteristics measured are (i) accuracy of covering narrow low energy regions that have low effective dimension (ii) ability to localize sampling to specific basins, and (iii) flexibility in sampling distributions. As a proof of concept, we compare a recently developed geometric methodology EASAL (Efficient Atlasing and Search of Assembly Landscapes) and the traditional Monte Carlo (MC) method for sampling the energy landscape of two assembling trans-membrane helices, driven by short-range pair potentials. By measuring the above-mentioned sampling characteristics, we demonstrate that EASAL provides localized and accurate coverage of crucial regions of the energy landscape of low effective dimension, under flexible sampling distributions, with much fewer samples and computational resources than MC sampling. EASAL's empirically validated theoretical guarantees permit credible extrapolation of these measurements and comparisons to arbitrary number and size of assembling units. Promising avenues for hybridizing the complementary advantages of the two methods are discussed.
Collapse
Affiliation(s)
- Aysegul Ozkan
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | - Meera Sitharam
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | | | - Rahul Prabhu
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | - Maria Kurnikova
- Chemistry Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
38
|
Jin Y, Johannissen LO, Hay S. Predicting new protein conformations from molecular dynamics simulation conformational landscapes and machine learning. Proteins 2021; 89:915-921. [PMID: 33629765 DOI: 10.1002/prot.26068] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 01/21/2021] [Accepted: 02/23/2021] [Indexed: 11/06/2022]
Abstract
Molecular dynamics (MD) simulations are a popular method of studying protein structure and function, but are unable to reliably sample all relevant conformational space in reasonable computational timescales. A range of enhanced sampling methods are available that can improve conformational sampling, but these do not offer a complete solution. We present here a proof-of-principle method of combining MD simulation with machine learning to explore protein conformational space. An autoencoder is used to map snapshots from MD simulations onto a user-defined conformational landscape defined by principal components analysis or specific structural features, and we show that we can predict, with useful accuracy, conformations that are not present in the training data. This method offers a new approach to the prediction of new low energy/physically realistic structures of conformationally dynamic proteins and allows an alternative approach to enhanced sampling of MD simulations.
Collapse
Affiliation(s)
- Yiming Jin
- Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Linus O Johannissen
- Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| | - Sam Hay
- Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| |
Collapse
|
39
|
Rahman T, Du Y, Zhao L, Shehu A. Generative Adversarial Learning of Protein Tertiary Structures. Molecules 2021; 26:molecules26051209. [PMID: 33668217 PMCID: PMC7956369 DOI: 10.3390/molecules26051209] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/13/2021] [Accepted: 02/16/2021] [Indexed: 12/15/2022] Open
Abstract
Protein molecules are inherently dynamic and modulate their interactions with different molecular partners by accessing different tertiary structures under physiological conditions. Elucidating such structures remains challenging. Current momentum in deep learning and the powerful performance of generative adversarial networks (GANs) in complex domains, such as computer vision, inspires us to investigate GANs on their ability to generate physically-realistic protein tertiary structures. The analysis presented here shows that several GAN models fail to capture complex, distal structural patterns present in protein tertiary structures. The study additionally reveals that mechanisms touted as effective in stabilizing the training of a GAN model are not all effective, and that performance based on loss alone may be orthogonal to performance based on the quality of generated datasets. A novel contribution in this study is the demonstration that Wasserstein GAN strikes a good balance and manages to capture both local and distal patterns, thus presenting a first step towards more powerful deep generative models for exploring a possibly very diverse set of structures supporting diverse activities of a protein molecule in the cell.
Collapse
Affiliation(s)
- Taseef Rahman
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA; (T.R.); (Y.D.)
| | - Yuanqi Du
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA; (T.R.); (Y.D.)
| | - Liang Zhao
- Department of Computer Science, Emory University, Atlanta, GA 30322, USA;
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA; (T.R.); (Y.D.)
- Center for Advancing Human-Machine Partnerships, George Mason University, Fairfax, VA 22030, USA
- Department of Bioengineering, George Mason University, Fairfax, VA 22030, USA
- School of Systems Biology, George Mason University, Manassas, VA 20110, USA
- Correspondence:
| |
Collapse
|
40
|
Mei L, Wu F, Hao G, Yang G. Protocol for hit-to-lead optimization of compounds by auto in silico ligand directing evolution (AILDE) approach. STAR Protoc 2021; 2:100312. [PMID: 33554146 PMCID: PMC7856476 DOI: 10.1016/j.xpro.2021.100312] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Hit-to-lead (H2L) optimization is crucial for drug design, which has become an increasing concern in medicinal chemistry. A virtual screening strategy of auto in silico ligand directing evolution (AILDE) has been developed to yield promising lead compounds rapidly and efficiently. The protocol includes instructions for fragment compound library construction, conformational sampling by molecular dynamics simulation, ligand modification by fragment growing, as well as the binding free energy prediction. For complete details on the use and execution of this protocol, please refer to Wu et al. (2020).
Collapse
Affiliation(s)
- Longcan Mei
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, China
| | - Fengxu Wu
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, China
| | - Gefei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, China.,State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University, Guiyang 550025, China
| | - Guangfu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, China.,Collaborative Innovation Center of Chemical Science and Engineering, Tianjin 300072, China
| |
Collapse
|
41
|
Rick SW, Schwing GJ, Summa CM. An Implementation of Replica Exchange with Dynamical Scaling for Efficient Large-Scale Simulations. J Chem Inf Model 2021; 61:810-818. [PMID: 33496583 DOI: 10.1021/acs.jcim.0c01236] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
An implementation of the replica exchange with dynamical scaling (REDS) method in the commonly used molecular dynamics program GROMACS is presented. REDS is a replica exchange method that requires fewer replicas than conventional replica exchange while still providing data over a range of temperatures and can be used in either constant volume or constant pressure ensembles. Details for running REDS simulations are given, and an application to the human islet amyloid polypeptide (hIAPP) 11-25 fragment shows that the model efficiently samples conformational space.
Collapse
Affiliation(s)
- Steven W Rick
- Department of Chemistry, University of New Orleans, New Orleans, Louisiana 70148, United States
| | - Gregory J Schwing
- Department of Computer Science, University of New Orleans, New Orleans, Louisiana 70148, United States
| | - Christopher M Summa
- Department of Computer Science, University of New Orleans, New Orleans, Louisiana 70148, United States
| |
Collapse
|
42
|
Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021; 47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]
Abstract
Enzymes are the natural catalysts that execute biochemical reactions upholding life. Their natural effectiveness has been fine-tuned as a result of millions of years of natural evolution. Such catalytic effectiveness has prompted the use of biocatalysts from multiple sources on different applications, including the industrial production of goods (food and beverages, detergents, textile, and pharmaceutics), environmental protection, and biomedical applications. Natural enzymes often need to be improved by protein engineering to optimize their function in non-native environments. Recent technological advances have greatly facilitated this process by providing the experimental approaches of directed evolution or by enabling computer-assisted applications. Directed evolution mimics the natural selection process in a highly accelerated fashion at the expense of arduous laboratory work and economic resources. Theoretical methods provide predictions and represent an attractive complement to such experiments by waiving their inherent costs. Computational techniques can be used to engineer enzymatic reactivity, substrate specificity and ligand binding, access pathways and ligand transport, and global properties like protein stability, solubility, and flexibility. Theoretical approaches can also identify hotspots on the protein sequence for mutagenesis and predict suitable alternatives for selected positions with expected outcomes. This review covers the latest advances in computational methods for enzyme engineering and presents many successful case studies.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Gaspar P Pinto
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.
| |
Collapse
|
43
|
Juárez-Jiménez J, Tew P, O Connor M, Llabrés S, Sage R, Glowacki D, Michel J. Combining Virtual Reality Visualization with Ensemble Molecular Dynamics to Study Complex Protein Conformational Changes. J Chem Inf Model 2020; 60:6344-6354. [PMID: 33180485 DOI: 10.1021/acs.jcim.0c00221] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Molecular dynamics (MD) simulations are increasingly used to elucidate relationships between protein structure, dynamics, and their biological function. Currently, it is extremely challenging to perform MD simulations of large-scale structural rearrangements in proteins that occur on millisecond timescales or beyond, as this requires very significant computational resources, or the use of cumbersome "collective variable" enhanced sampling protocols. Here, we describe a framework that combines ensemble MD simulations and virtual reality visualization (eMD-VR) to enable users to interactively generate realistic descriptions of large amplitude, millisecond timescale protein conformational changes in proteins. Detailed tests demonstrate that eMD-VR substantially decreases the computational cost of folding simulations of a WW domain, without the need to define collective variables a priori. We further show that eMD-VR generated pathways can be combined with Markov state models to describe the thermodynamics and kinetics of large-scale loop motions in the enzyme cyclophilin A. Our results suggest eMD-VR is a powerful tool for exploring protein energy landscapes in bioengineering efforts.
Collapse
Affiliation(s)
- Jordi Juárez-Jiménez
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, United Kingdom
| | - Philip Tew
- Interactive Scientific, Engine Shed, Station Approach, Bristol BS1 6QH, United Kingdom
| | - Michael O Connor
- Intangible Realities Laboratory, University of Bristol, Cantock's Close, Bristol BS8 1TS, United Kingdom.,Department of Computer Science, University of Bristol, Merchant Venture's Building, Bristol BS8 1UB, United Kingdom.,Centre for Computational Chemistry, School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, United Kingdom
| | - Salomé Llabrés
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, United Kingdom
| | - Rebecca Sage
- Interactive Scientific, Engine Shed, Station Approach, Bristol BS1 6QH, United Kingdom
| | - David Glowacki
- Intangible Realities Laboratory, University of Bristol, Cantock's Close, Bristol BS8 1TS, United Kingdom.,Department of Computer Science, University of Bristol, Merchant Venture's Building, Bristol BS8 1UB, United Kingdom.,Centre for Computational Chemistry, School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, United Kingdom
| | - Julien Michel
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, United Kingdom
| |
Collapse
|
44
|
Hoseini P, Zhao L, Shehu A. Generative deep learning for macromolecular structure and dynamics. Curr Opin Struct Biol 2020; 67:170-177. [PMID: 33338762 DOI: 10.1016/j.sbi.2020.11.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 11/16/2020] [Accepted: 11/23/2020] [Indexed: 01/06/2023]
Abstract
Much scientific enquiry across disciplines is founded upon a mechanistic treatment of dynamic systems that ties form to function. A highly visible instance of this is in molecular biology, where characterizing macromolecular structure and dynamics is central to a detailed, molecular-level understanding of biological processes in the living cell. The current computational paradigm utilizes optimization as the generative process for modeling both structure and structural dynamics. Computational biology researchers are now attempting to wield generative models employing deep neural networks as an alternative computational paradigm. In this review, we summarize such efforts. We highlight progress and shortcomings. More importantly, we expose challenges that macromolecular structure poses to deep generative models and take this opportunity to introduce the structural biology community to several recent advances in the deep learning community that promise a way forward.
Collapse
Affiliation(s)
- Pourya Hoseini
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA; Center for Advancing Human-Machine Partnerships, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA
| | - Liang Zhao
- Department of Computer Science, Emory University, 201 Dowman Dr, Atlanta, GA 30322, USA; Center for Advancing Human-Machine Partnerships, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA
| | - Amarda Shehu
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA; Center for Advancing Human-Machine Partnerships, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA.
| |
Collapse
|
45
|
Akhter N, Chennupati G, Djidjev H, Shehu A. Decoy selection for protein structure prediction via extreme gradient boosting and ranking. BMC Bioinformatics 2020; 21:189. [PMID: 33297949 PMCID: PMC7724862 DOI: 10.1186/s12859-020-3523-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 04/29/2020] [Indexed: 11/10/2022] Open
Abstract
Background Identifying one or more biologically-active/native decoys from millions of non-native decoys is one of the major challenges in computational structural biology. The extreme lack of balance in positive and negative samples (native and non-native decoys) in a decoy set makes the problem even more complicated. Consensus methods show varied success in handling the challenge of decoy selection despite some issues associated with clustering large decoy sets and decoy sets that do not show much structural similarity. Recent investigations into energy landscape-based decoy selection approaches show promises. However, lack of generalization over varied test cases remains a bottleneck for these methods. Results We propose a novel decoy selection method, ML-Select, a machine learning framework that exploits the energy landscape associated with the structure space probed through a template-free decoy generation. The proposed method outperforms both clustering and energy ranking-based methods, all the while consistently offering better performance on varied test-cases. Moreover, ML-Select shows promising results even for the decoy sets consisting of mostly low-quality decoys. Conclusions ML-Select is a useful method for decoy selection. This work suggests further research in finding more effective ways to adopt machine learning frameworks in achieving robust performance for decoy selection in template-free protein structure prediction.
Collapse
Affiliation(s)
- Nasrin Akhter
- Department of Computer Science, George Mason University, Fairfax, 22030, VA, USA
| | - Gopinath Chennupati
- Information Sciences (CCS-3) Group, Los Alamos National Laboratory, Bikini At al Rd., Los Alamos, 87545, USA.
| | - Hristo Djidjev
- Information Sciences (CCS-3) Group, Los Alamos National Laboratory, Bikini At al Rd., Los Alamos, 87545, USA
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, 22030, VA, USA.,Department of Bioengineering, George Mason University, Fairfax, 22030, VA, USA.,School of Systems Biology, George Mason University, Manassas, 20110, VA, USA
| |
Collapse
|
46
|
Strodel B. Amyloid aggregation simulations: challenges, advances and perspectives. Curr Opin Struct Biol 2020; 67:145-152. [PMID: 33279865 DOI: 10.1016/j.sbi.2020.10.019] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Accepted: 10/18/2020] [Indexed: 10/22/2022]
Abstract
In amyloid aggregation diseases soluble proteins coalesce into a wide array of undesirable structures, ranging through oligomers and prefibrillar assemblies to highly ordered amyloid fibrils and plaques. Explicit-solvent all-atom molecular dynamics (MD) simulations of amyloid aggregation have been performed for almost 20 years, revealing valuable information about this phenomenon. However, these simulations are challenged by three main problems. Firstly, current force fields modeling amyloid aggregation are insufficiently accurate. Secondly, the protein concentrations in MD simulations are usually orders of magnitude higher than those used in vitro or found in vivo, which has direct consequences on the aggregates that form. Finally, the third problem is the well-known time-scale limit of MD simulations. In this review I highlight recent approaches to overcome these three limitations.
Collapse
Affiliation(s)
- Birgit Strodel
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, 52425 Jülich, Germany; Institute of Theoretical and Computational Chemistry, Heinrich Heine University Düsseldorf, Universitätstrasse 1, 40225 Düsseldorf, Germany.
| |
Collapse
|
47
|
Cárdenas R, Martínez-Seoane J, Amero C. Combining Experimental Data and Computational Methods for the Non-Computer Specialist. Molecules 2020; 25:E4783. [PMID: 33081072 PMCID: PMC7594097 DOI: 10.3390/molecules25204783] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 08/25/2020] [Accepted: 08/28/2020] [Indexed: 01/01/2023] Open
Abstract
Experimental methods are indispensable for the study of the function of biological macromolecules, not just as static structures, but as dynamic systems that change conformation, bind partners, perform reactions, and respond to different stimulus. However, providing a detailed structural interpretation of the results is often a very challenging task. While experimental and computational methods are often considered as two different and separate approaches, the power and utility of combining both is undeniable. The integration of the experimental data with computational techniques can assist and enrich the interpretation, providing new detailed molecular understanding of the systems. Here, we briefly describe the basic principles of how experimental data can be combined with computational methods to obtain insights into the molecular mechanism and expand the interpretation through the generation of detailed models.
Collapse
Affiliation(s)
| | | | - Carlos Amero
- Laboratorio de Bioquímica y Resonancia Magnética Nuclear, Centro de Investigaciones Químicas, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos 62209, Mexico; (R.C.); (J.M.-S.)
| |
Collapse
|
48
|
Agarwal PK, Bernard DN, Bafna K, Doucet N. Enzyme dynamics: Looking beyond a single structure. ChemCatChem 2020; 12:4704-4720. [PMID: 33897908 PMCID: PMC8064270 DOI: 10.1002/cctc.202000665] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Indexed: 12/23/2022]
Abstract
Conventional understanding of how enzymes function strongly emphasizes the role of structure. However, increasing evidence clearly indicates that enzymes do not remain fixed or operate exclusively in or close to their native structure. Different parts of the enzyme (from individual residues to full domains) undergo concerted motions on a wide range of time-scales, including that of the catalyzed reaction. Information obtained on these internal motions and conformational fluctuations has so far uncovered and explained many aspects of enzyme mechanisms, which could not have been understood from a single structure alone. Although there is wide interest in understanding enzyme dynamics and its role in catalysis, several challenges remain. In addition to technical difficulties, the vast majority of investigations are performed in dilute aqueous solutions, where conditions are significantly different than the cellular milieu where a large number of enzymes operate. In this review, we discuss recent developments, several challenges as well as opportunities related to this topic. The benefits of considering dynamics as an integral part of the enzyme function can also enable new means of biocatalysis, engineering enzymes for industrial and medicinal applications.
Collapse
Affiliation(s)
- Pratul K. Agarwal
- Department of Physiological Sciences and High-Performance Computing Center, Oklahoma State University, Stillwater, Oklahoma 74078
- Arium BioLabs, 2519 Caspian Drive, Knoxville, Tennessee 37932
| | - David N. Bernard
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, 531 Boulevard des Prairies, Laval, Quebec, H7V 1B7, Canada
| | - Khushboo Bafna
- Department of Chemistry and Chemical Biology, and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Nicolas Doucet
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, 531 Boulevard des Prairies, Laval, Quebec, H7V 1B7, Canada
- PROTEO, the Quebec Network for Research on Protein Function, Structure, and Engineering, 1045 Avenue de la Médecine, Université Laval, Québec, QC, G1V 0A6, Canada
| |
Collapse
|
49
|
Neelamraju S, Wales DJ, Gosavi S. Protein energy landscape exploration with structure-based models. Curr Opin Struct Biol 2020; 64:145-151. [DOI: 10.1016/j.sbi.2020.07.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 06/30/2020] [Accepted: 07/15/2020] [Indexed: 12/11/2022]
|
50
|
Khairallah A, Ross CJ, Tastan Bishop Ö. Probing the Structural Dynamics of the Plasmodium falciparum Tunneling-Fold Enzyme 6-Pyruvoyl Tetrahydropterin Synthase to Reveal Allosteric Drug Targeting Sites. Front Mol Biosci 2020; 7:575196. [PMID: 33102524 PMCID: PMC7546909 DOI: 10.3389/fmolb.2020.575196] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 08/20/2020] [Indexed: 11/13/2022] Open
Abstract
The de novo folate synthesis pathway is a well-established drug target in the treatment of many infectious diseases. Antimalarial antifolate drugs have proven to be effective against malaria, however, rapid drug resistance has emerged on the two primary targeted enzymes: dihydrofolate reductase and dihydroptoreate synthase. The need to identify alternative antifolate drugs and novel metabolic targets is of imminent importance. The 6-pyruvol tetrahydropterin synthase (PTPS) enzyme belongs to the tunneling fold protein superfamily which is characterized by a distinct central tunnel/cavity. The enzyme catalyzes the second reaction step of the parasite’s de novo folate synthesis pathway and is responsible for the conversion of 7,8-dihydroneopterin to 6-pyruvoyl-tetrahydropterin. In this study, we examine the structural dynamics of Plasmodium falciparum PTPS using the anisotropic network model, to elucidate the collective motions that drive the function of the enzyme and identify potential sites for allosteric modulation of its binding properties. Based on our modal analysis, we identified key sites in the N-terminal domains and central helices which control the accessibility to the active site. Notably, the N-terminal domains were shown to regulate the open-to-closed transition of the tunnel, via a distinctive wringing motion that deformed the core of the protein. We, further, combined the dynamic analysis with motif discovery which revealed highly conserved motifs that are unique to the Plasmodium species and are located in the N-terminal domains and central helices. This provides essential structural information for the efficient design of drugs such as allosteric modulators that would have high specificity and low toxicity as they do not target the PTPS active site that is highly conserved in humans.
Collapse
Affiliation(s)
- Afrah Khairallah
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| | - Caroline J Ross
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| | - Özlem Tastan Bishop
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| |
Collapse
|