1
|
Nakao A, Harabuchi Y, Maeda S, Tsuda K. Leveraging algorithmic search in quantum chemical reaction path finding. Phys Chem Chem Phys 2022; 24:10305-10310. [PMID: 35437567 DOI: 10.1039/d2cp01079h] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Reaction path finding methods construct a graph connecting reactants and products in a quantum chemical energy landscape. They are useful in elucidating various reactions and provide footsteps for designing new reactions. Their enormous computational cost, however, limits their application to relatively simple reactions. This paper engages in accelerating reaction path finding by introducing the principles of algorithmic search. A new method called RRT/SC-AFIR is devised by combining rapidly exploring random tree (RRT) and single component artificial force induced reaction (SC-AFIR). Using 96 cores, our method succeeded in constructing a reaction graph for Fritsch-Buttenberg-Wiechell rearrangement within a time limit of 3 days, while the conventional methods could not. Our results illustrate that the algorithm theory provides refreshing and beneficial viewpoints on quantum chemical methodologies.
Collapse
Affiliation(s)
- Atsuyuki Nakao
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa 2778561, Japan.
| | - Yu Harabuchi
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo 001-0021, Japan.,JST ERATO Maeda Artificial Intelligence for Chemical Reaction Design and Discovery Project, Sapporo 060-0810, Japan.,Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
| | - Satoshi Maeda
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo 001-0021, Japan.,JST ERATO Maeda Artificial Intelligence for Chemical Reaction Design and Discovery Project, Sapporo 060-0810, Japan.,Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
| | - Koji Tsuda
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa 2778561, Japan. .,RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan.,Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, Tsukuba 305-0047, Japan
| |
Collapse
|
2
|
Low-Frequency Harmonic Perturbations Drive Protein Conformational Changes. Int J Mol Sci 2021; 22:ijms221910501. [PMID: 34638837 PMCID: PMC8508695 DOI: 10.3390/ijms221910501] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 09/18/2021] [Accepted: 09/26/2021] [Indexed: 02/01/2023] Open
Abstract
Protein dynamics has been investigated since almost half a century, as it is believed to constitute the fundamental connection between structure and function. Elastic network models (ENMs) have been widely used to predict protein dynamics, flexibility and the biological mechanism, from which remarkable results have been found regarding the prediction of protein conformational changes. Starting from the knowledge of the reference structure only, these conformational changes have been usually predicted either by looking at the individual mode shapes of vibrations (i.e., by considering the free vibrations of the ENM) or by applying static perturbations to the protein network (i.e., by considering a linear response theory). In this paper, we put together the two previous approaches and evaluate the complete protein response under the application of dynamic perturbations. Harmonic forces with random directions are applied to the protein ENM, which are meant to simulate the single frequency-dependent components of the collisions of the surrounding particles, and the protein response is computed by solving the dynamic equations in the underdamped regime, where mass, viscous damping and elastic stiffness contributions are explicitly taken into account. The obtained motion is investigated both in the coordinate space and in the sub-space of principal components (PCs). The results show that the application of perturbations in the low-frequency range is able to drive the protein conformational change, leading to remarkably high values of direction similarity. Eventually, this suggests that protein conformational change might be triggered by external collisions and favored by the inherent low-frequency dynamics of the protein structure.
Collapse
|
3
|
Afrasiabi F, Dehghanpoor R, Haspel N. Integrating Rigidity Analysis into the Exploration of Protein Conformational Pathways Using RRT* and MC. Molecules 2021; 26:molecules26082329. [PMID: 33923805 PMCID: PMC8073574 DOI: 10.3390/molecules26082329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/12/2021] [Accepted: 04/13/2021] [Indexed: 11/16/2022] Open
Abstract
To understand how proteins function on a cellular level, it is of paramount importance to understand their structures and dynamics, including the conformational changes they undergo to carry out their function. For the aforementioned reasons, the study of large conformational changes in proteins has been an interest to researchers for years. However, since some proteins experience rapid and transient conformational changes, it is hard to experimentally capture the intermediate structures. Additionally, computational brute force methods are computationally intractable, which makes it impossible to find these pathways which require a search in a high-dimensional, complex space. In our previous work, we implemented a hybrid algorithm that combines Monte-Carlo (MC) sampling and RRT*, a version of the Rapidly Exploring Random Trees (RRT) robotics-based method, to make the conformational exploration more accurate and efficient, and produce smooth conformational pathways. In this work, we integrated the rigidity analysis of proteins into our algorithm to guide the search to explore flexible regions. We demonstrate that rigidity analysis dramatically reduces the run time and accelerates convergence.
Collapse
|
4
|
ART-RRT: As-Rigid-As-Possible search for protein conformational transition paths. J Comput Aided Mol Des 2019; 33:705-727. [PMID: 31435895 DOI: 10.1007/s10822-019-00216-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 08/06/2019] [Indexed: 10/26/2022]
Abstract
The possible functions of a protein are strongly related to its structural rearrangements in the presence of other molecules or environmental changes. Hence, the evaluation of transition paths of proteins, which encodes conformational changes between stable states, is important since it may reveal the underlying mechanisms of the biochemical processes related to these motions. During the last few decades, different geometry-based methods have been proposed to predict such transition paths. However, in the cases where the solution requires complex motions, these methods, which typically constrain only locally the molecular structures, could produce physically irrelevant solutions involving self-intersection. Recently, we have proposed ART-RRT, an efficient method for finding ligand-unbinding pathways. It relies on the exploration of energy valleys in low-dimensional spaces, taking advantage of some mechanisms inspired from computer graphics to ensure the consistency of molecular structures. This article extends ART-RRT to the problem of finding probable conformational transition between two stable states for proteins. It relies on a bidirectional exploration rooted on the two end states and introduces an original strategy to attempt connections between the explored regions. The resulting method is able to produce at low computational cost biologically realistic paths free from self-intersection. These paths can serve as valuable input to other advanced methods for the study of proteins. A better understanding of conformational changes of proteins is important since it may reveal the underlying mechanisms of the biochemical processes related to such motions. Recently, the ART-RRT method has been introduced for finding ligand-unbinding pathways. This article presents an adaptation of the method for finding probable conformational transition between two stable states of a protein. The method is not only computationally cost-effective but also able to produce biologically realistic paths which are free from self-intersection.
Collapse
|
5
|
Morris D, Maximova T, Plaku E, Shehu A. Attenuating dependence on structural data in computing protein energy landscapes. BMC Bioinformatics 2019; 20:280. [PMID: 31167640 PMCID: PMC6551245 DOI: 10.1186/s12859-019-2822-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Background Nearly all cellular processes involve proteins structurally rearranging to accommodate molecular partners. The energy landscape underscores the inherent nature of proteins as dynamic molecules interconverting between structures with varying energies. In principle, reconstructing a protein’s energy landscape holds the key to characterizing the structural dynamics and its regulation of protein function. In practice, the disparate spatio-temporal scales spanned by the slow dynamics challenge both wet and dry laboratories. However, the growing number of deposited structures for proteins central to human biology presents an opportunity to infer the relevant dynamics via exploitation of the information encoded in such structures about equilibrium dynamics. Results Recent computational efforts using extrinsic modes of motion as variables have successfully reconstructed detailed energy landscapes of several medium-size proteins. Here we investigate the extent to which one can reconstruct the energy landscape of a protein in the absence of sufficient, wet-laboratory structural data. We do so by integrating intrinsic modes of motion extracted off a single structure in a stochastic optimization framework that supports the plug-and-play of different variable selection strategies. We demonstrate that, while knowledge of more wet-laboratory structures yields better-reconstructed landscapes, precious information can be obtained even when only one structural model is available. Conclusions The presented work shows that it is possible to reconstruct the energy landscape of a protein with reasonable detail and accuracy even when the structural information about the protein is limited to one structure. By attenuating the dependence on structural data of methods designed to compute protein energy landscapes, the work opens up interesting venues of research on structure-based inference of dynamics. Of particular interest are directions of research that will extend such inference to proteins with no experimentally-characterized structures.
Collapse
Affiliation(s)
- David Morris
- Department of Computer Science, George Mason University, Fairfax, 22030, VA, USA
| | - Tatiana Maximova
- Department of Computer Science, George Mason University, Fairfax, 22030, VA, USA
| | - Erion Plaku
- Department of Electrical Engineering and Computer Science, The Catholic University of America, Washington, 20064, D.C., USA
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, 22030, VA, USA. .,Department of Bioengineering, George Mason University, Fairfax, 22030, VA, USA. .,School of Systems Biology, George Mason University, Manassas, 20110, VA, USA.
| |
Collapse
|
6
|
Shao Q, Zhu W. Ligand binding effects on the activation of the EGFR extracellular domain. Phys Chem Chem Phys 2019; 21:8141-8151. [PMID: 30933195 DOI: 10.1039/c8cp07496h] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The epidermal growth factor receptor (EGFR) is one of the most common target proteins in anti-cancer therapy. The binding of the EGF ligand to the EGFR extracellular domain (EGFR-ECD) promotes its inactive-to-active conformational transition (activation) but the relevant detailed mechanism remains elusive still. Here, the structural characterization and energetics of the EGFR-ECD conformational transition with and without the binding of the EGF are quantitatively explored using an innovative enhanced sampling MD simulation method. Intriguingly, the EGF offers hydrophobic interactions (e.g., EGF residues of Tyr44 and Leu47) and electrostatic interactions (e.g., the EGF residues of Glu5, Asp11, Asp17, and Arg41) to play a dominant role in dragging domain III to close the ligand binding domain gap. Subsequently, the correlation between domains III and II is enhanced through salt-bridges among Glu376, Arg403, and Arg405 from domain III and Glu293, Glu295, and Arg300 from domain II. Finally, the structural bending of domain II is regulated to facilitate the disengagement of domain II from domain IV. In this regard, the functional conformational transition of EGFR-ECD is a consequence of the cooperative motion of protein domains driven by the EGF ligand binding. The present study shows a detailed scenario of the EGF induced activation of EGFR-ECD and provides valuable information for drug discovery targeting the EGFR.
Collapse
Affiliation(s)
- Qiang Shao
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China.
| | | |
Collapse
|
7
|
Sorzano COS, Jiménez A, Mota J, Vilas JL, Maluenda D, Martínez M, Ramírez-Aportela E, Majtner T, Segura J, Sánchez-García R, Rancel Y, del Caño L, Conesa P, Melero R, Jonic S, Vargas J, Cazals F, Freyberg Z, Krieger J, Bahar I, Marabini R, Carazo JM. Survey of the analysis of continuous conformational variability of biological macromolecules by electron microscopy. Acta Crystallogr F Struct Biol Commun 2019; 75:19-32. [PMID: 30605122 PMCID: PMC6317454 DOI: 10.1107/s2053230x18015108] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 10/26/2018] [Indexed: 11/10/2022] Open
Abstract
Single-particle analysis by electron microscopy is a well established technique for analyzing the three-dimensional structures of biological macromolecules. Besides its ability to produce high-resolution structures, it also provides insights into the dynamic behavior of the structures by elucidating their conformational variability. Here, the different image-processing methods currently available to study continuous conformational changes are reviewed.
Collapse
Affiliation(s)
| | - A. Jiménez
- National Center of Biotechnology (CSIC), Spain
| | - J. Mota
- National Center of Biotechnology (CSIC), Spain
| | - J. L. Vilas
- National Center of Biotechnology (CSIC), Spain
| | - D. Maluenda
- National Center of Biotechnology (CSIC), Spain
| | - M. Martínez
- National Center of Biotechnology (CSIC), Spain
| | | | - T. Majtner
- National Center of Biotechnology (CSIC), Spain
| | - J. Segura
- National Center of Biotechnology (CSIC), Spain
| | | | - Y. Rancel
- National Center of Biotechnology (CSIC), Spain
| | - L. del Caño
- National Center of Biotechnology (CSIC), Spain
| | - P. Conesa
- National Center of Biotechnology (CSIC), Spain
| | - R. Melero
- National Center of Biotechnology (CSIC), Spain
| | - S. Jonic
- Sorbonne Université, UMR CNRS 7590, Muséum National d’Histoire Naturelle, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | | | - F. Cazals
- Inria Sophia Antipolis – Méditerranée, France
| | | | | | | | | | | |
Collapse
|
8
|
Mechanical variations in proteins with large-scale motions highlight the formation of structural locks. J Struct Biol 2018; 203:195-204. [DOI: 10.1016/j.jsb.2018.05.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 05/18/2018] [Accepted: 05/22/2018] [Indexed: 12/18/2022]
|
9
|
Fonseca R, Budday D, van den Bedem H. Collision-free poisson motion planning in ultra high-dimensional molecular conformation spaces. J Comput Chem 2018; 39:711-720. [PMID: 29315667 DOI: 10.1002/jcc.25138] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 11/22/2017] [Accepted: 11/27/2017] [Indexed: 12/22/2022]
Abstract
The function of protein, RNA, and DNA is modulated by fast, dynamic exchanges between three-dimensional conformations. Conformational sampling of biomolecules with exact and nullspace inverse kinematics, using rotatable bonds as revolute joints and noncovalent interactions as holonomic constraints, can accurately characterize these native ensembles. However, sampling biomolecules remains challenging owing to their ultra-high dimensional configuration spaces, and the requirement to avoid (self-) collisions, which results in low acceptance rates. Here, we present two novel mechanisms to overcome these limitations. First, we introduce temporary constraints between near-colliding links. The resulting constraint varieties instantaneously redirect the search for collision-free conformations, and couple motions between distant parts of the linkage. Second, we adapt a randomized Poisson-disk motion planner, which prevents local oversampling and widens the search, to ultra-high dimensions. Tests on several model systems show that the sampling acceptance rate can increase from 16% to 70%, and that the conformational coverage in loop modeling measured as average closeness to existing loop conformations doubled. Correlated protein motions identified with our algorithm agree with those from MD simulations. © 2018 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Rasmus Fonseca
- Molecular and Cellular Physiology, Stanford University, Stanford, California.,Bioscience Division, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, California
| | - Dominik Budday
- Chair of Applied Dynamics, University of Erlangen-Nuremberg, Erlangen, 91058, Germany
| | - Henry van den Bedem
- Bioscience Division, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, California
| |
Collapse
|
10
|
Nguyen MK, Jaillet L, Redon S. ART-RRT: As-Rigid-As-Possible exploration of ligand unbinding pathways. J Comput Chem 2018; 39:665-678. [DOI: 10.1002/jcc.25132] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Revised: 10/30/2017] [Accepted: 11/20/2017] [Indexed: 01/09/2023]
Affiliation(s)
- Minh Khoa Nguyen
- Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP (Institute of Engineering Univ. Grenoble Alpes), LJK; 38000 Grenoble France
| | - Léonard Jaillet
- Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP (Institute of Engineering Univ. Grenoble Alpes), LJK; 38000 Grenoble France
| | - Stéphane Redon
- Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP (Institute of Engineering Univ. Grenoble Alpes), LJK; 38000 Grenoble France
| |
Collapse
|
11
|
Abella JR, Moll M, Kavraki LE. Maintaining and Enhancing Diversity of Sampled Protein Conformations in Robotics-Inspired Methods. J Comput Biol 2018; 25:3-20. [PMID: 29035572 PMCID: PMC5756939 DOI: 10.1089/cmb.2017.0164] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The ability to efficiently sample structurally diverse protein conformations allows one to gain a high-level view of a protein's energy landscape. Algorithms from robot motion planning have been used for conformational sampling, and several of these algorithms promote diversity by keeping track of "coverage" in conformational space based on the local sampling density. However, large proteins present special challenges. In particular, larger systems require running many concurrent instances of these algorithms, but these algorithms can quickly become memory intensive because they typically keep previously sampled conformations in memory to maintain coverage estimates. In addition, robotics-inspired algorithms depend on defining useful perturbation strategies for exploring the conformational space, which is a difficult task for large proteins because such systems are typically more constrained and exhibit complex motions. In this article, we introduce two methodologies for maintaining and enhancing diversity in robotics-inspired conformational sampling. The first method addresses algorithms based on coverage estimates and leverages the use of a low-dimensional projection to define a global coverage grid that maintains coverage across concurrent runs of sampling. The second method is an automatic definition of a perturbation strategy through readily available flexibility information derived from B-factors, secondary structure, and rigidity analysis. Our results show a significant increase in the diversity of the conformations sampled for proteins consisting of up to 500 residues when applied to a specific robotics-inspired algorithm for conformational sampling. The methodologies presented in this article may be vital components for the scalability of robotics-inspired approaches.
Collapse
Affiliation(s)
- Jayvee R Abella
- 1 Department of Computer Science, Rice University , Houston, Texas
| | - Mark Moll
- 1 Department of Computer Science, Rice University , Houston, Texas
| | - Lydia E Kavraki
- 1 Department of Computer Science, Rice University , Houston, Texas
| |
Collapse
|
12
|
Maximova T, Zhang Z, Carr DB, Plaku E, Shehu A. Sample-Based Models of Protein Energy Landscapes and Slow Structural Rearrangements. J Comput Biol 2018; 25:33-50. [DOI: 10.1089/cmb.2017.0158] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Affiliation(s)
- Tatiana Maximova
- Department of Computer Science, George Mason University, Fairfax, Virginia
| | - Zijing Zhang
- Department of Statistics, George Mason University, Fairfax, Virginia
| | - Daniel B. Carr
- Department of Statistics, George Mason University, Fairfax, Virginia
| | - Erion Plaku
- Department of Electrical Engineering and Computer Science, The Catholic University of America, Washington, D.C
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, Virginia
- Department of Bioengineering, George Mason University, Fairfax, Virginia
- School of Systems Biology, George Mason University, Manassas, Virginia
| |
Collapse
|
13
|
Abstract
Background Understanding protein structure and dynamics is essential for understanding their function. This is a challenging task due to the high complexity of the conformational landscapes of proteins and their rugged energy levels. In particular, it is important to detect highly populated regions which could correspond to intermediate structures or local minima. Results We present a hierarchical clustering and algebraic topology based method that detects regions of interest in protein conformational space. The method is based on several techniques. We use coarse grained protein conformational search, efficient robust dimensionality reduction and topological analysis via persistent homology as the main tools. We use two dimensionality reduction methods as well, robust Principal Component Analysis (PCA) and Isomap, to generate a reduced representation of the data while preserving most of the variance in the data. Conclusions Our hierarchical clustering method was able to produce compact, well separated clusters for all the tested examples.
Collapse
Affiliation(s)
- Nurit Haspel
- Department of Computer Science, University of Massachusetts Boston, 100 Morrissey Blvd., Boston, 02125, MA, USA.
| | - Dong Luo
- Department of Computer Science, University of Massachusetts Boston, 100 Morrissey Blvd., Boston, 02125, MA, USA
| | - Eduardo González
- Department of Mathematics, University of Massachusetts Boston, 100 Morrissey Blvd., Boston, 02125, MA, USA
| |
Collapse
|
14
|
Budday D, Fonseca R, Leyendecker S, van den Bedem H. Frustration-guided motion planning reveals conformational transitions in proteins. Proteins 2017; 85:1795-1807. [DOI: 10.1002/prot.25333] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 05/19/2017] [Accepted: 06/07/2017] [Indexed: 01/27/2023]
Affiliation(s)
- Dominik Budday
- Chair of Applied Dynamics, University of Erlangen-Nuremberg; Erlangen Germany
| | - Rasmus Fonseca
- Department of Molecular and Cellular Physiology; Stanford University; California Menlo Park
- Biosciences Division; SLAC National Accelerator Laboratory, Stanford University; California Menlo Park
| | - Sigrid Leyendecker
- Chair of Applied Dynamics, University of Erlangen-Nuremberg; Erlangen Germany
| | - Henry van den Bedem
- Biosciences Division; SLAC National Accelerator Laboratory, Stanford University; California Menlo Park
| |
Collapse
|
15
|
Sapin E, Carr DB, De Jong KA, Shehu A. Computing energy landscape maps and structural excursions of proteins. BMC Genomics 2016; 17 Suppl 4:546. [PMID: 27535545 PMCID: PMC5001232 DOI: 10.1186/s12864-016-2798-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Structural excursions of a protein at equilibrium are key to biomolecular recognition and function modulation. Protein modeling research is driven by the need to aid wet laboratories in characterizing equilibrium protein dynamics. In principle, structural excursions of a protein can be directly observed via simulation of its dynamics, but the disparate temporal scales involved in such excursions make this approach computationally impractical. On the other hand, an informative representation of the structure space available to a protein at equilibrium can be obtained efficiently via stochastic optimization, but this approach does not directly yield information on equilibrium dynamics. METHODS We present here a novel methodology that first builds a multi-dimensional map of the energy landscape that underlies the structure space of a given protein and then queries the computed map for energetically-feasible excursions between structures of interest. An evolutionary algorithm builds such maps with a practical computational budget. Graphical techniques analyze a computed multi-dimensional map and expose interesting features of an energy landscape, such as basins and barriers. A path searching algorithm then queries a nearest-neighbor graph representation of a computed map for energetically-feasible basin-to-basin excursions. RESULTS Evaluation is conducted on intrinsically-dynamic proteins of importance in human biology and disease. Visual statistical analysis of the maps of energy landscapes computed by the proposed methodology reveals features already captured in the wet laboratory, as well as new features indicative of interesting, unknown thermodynamically-stable and semi-stable regions of the equilibrium structure space. Comparison of maps and structural excursions computed by the proposed methodology on sequence variants of a protein sheds light on the role of equilibrium structure and dynamics in the sequence-function relationship. CONCLUSIONS Applications show that the proposed methodology is effective at locating basins in complex energy landscapes and computing basin-basin excursions of a protein with a practical computational budget. While the actual temporal scales spanned by a structural excursion cannot be directly obtained due to the foregoing of simulation of dynamics, hypotheses can be formulated regarding the impact of sequence mutations on protein function. These hypotheses are valuable in instigating further research in wet laboratories.
Collapse
Affiliation(s)
- Emmanuel Sapin
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Daniel B Carr
- Department of Statistics, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Kenneth A De Jong
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA.,Krasnow Institute for Advanced Study, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Amarda Shehu
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA. amarda.@gmu.edu.,Department of Bioengineering, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA. amarda.@gmu.edu.,School of Systems Biology, George Mason University, 10900 University Boulevard, Manassas, 20110, VA, USA. amarda.@gmu.edu
| |
Collapse
|
16
|
Kurkcuoglu Z, Bahar I, Doruker P. ClustENM: ENM-Based Sampling of Essential Conformational Space at Full Atomic Resolution. J Chem Theory Comput 2016; 12:4549-62. [PMID: 27494296 DOI: 10.1021/acs.jctc.6b00319] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Accurate sampling of conformational space and, in particular, the transitions between functional substates has been a challenge in molecular dynamic (MD) simulations of large biomolecular systems. We developed an Elastic Network Model (ENM)-based computational method, ClustENM, for sampling large conformational changes of biomolecules with various sizes and oligomerization states. ClustENM is an iterative method that combines ENM with energy minimization and clustering steps. It is an unbiased technique, which requires only an initial structure as input, and no information about the target conformation. To test the performance of ClustENM, we applied it to six biomolecular systems: adenylate kinase (AK), calmodulin, p38 MAP kinase, HIV-1 reverse transcriptase (RT), triosephosphate isomerase (TIM), and the 70S ribosomal complex. The generated ensembles of conformers determined at atomic resolution show good agreement with experimental data (979 structures resolved by X-ray and/or NMR) and encompass the subspaces covered in independent MD simulations for TIM, p38, and RT. ClustENM emerges as a computationally efficient tool for characterizing the conformational space of large systems at atomic detail, in addition to generating a representative ensemble of conformers that can be advantageously used in simulating substrate/ligand-binding events.
Collapse
Affiliation(s)
- Zeynep Kurkcuoglu
- Department of Chemical Engineering and Polymer Research Center, Bogazici University , Bebek 34342, Istanbul, Turkey
| | - Ivet Bahar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh , Pittsburgh, Pennsylvania 15213, United States
| | - Pemra Doruker
- Department of Chemical Engineering and Polymer Research Center, Bogazici University , Bebek 34342, Istanbul, Turkey
| |
Collapse
|
17
|
Ekenna C, Thomas S, Amato NM. Adaptive local learning in sampling based motion planning for protein folding. BMC SYSTEMS BIOLOGY 2016; 10 Suppl 2:49. [PMID: 27490494 PMCID: PMC4977477 DOI: 10.1186/s12918-016-0297-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
BACKGROUND Simulating protein folding motions is an important problem in computational biology. Motion planning algorithms, such as Probabilistic Roadmap Methods, have been successful in modeling the folding landscape. Probabilistic Roadmap Methods and variants contain several phases (i.e., sampling, connection, and path extraction). Most of the time is spent in the connection phase and selecting which variant to employ is a difficult task. Global machine learning has been applied to the connection phase but is inefficient in situations with varying topology, such as those typical of folding landscapes. RESULTS We develop a local learning algorithm that exploits the past performance of methods within the neighborhood of the current connection attempts as a basis for learning. It is sensitive not only to different types of landscapes but also to differing regions in the landscape itself, removing the need to explicitly partition the landscape. We perform experiments on 23 proteins of varying secondary structure makeup with 52-114 residues. We compare the success rate when using our methods and other methods. We demonstrate a clear need for learning (i.e., only learning methods were able to validate against all available experimental data) and show that local learning is superior to global learning producing, in many cases, significantly higher quality results than the other methods. CONCLUSIONS We present an algorithm that uses local learning to select appropriate connection methods in the context of roadmap construction for protein folding. Our method removes the burden of deciding which method to use, leverages the strengths of the individual input methods, and it is extendable to include other future connection methods.
Collapse
Affiliation(s)
- Chinwe Ekenna
- Department of Computer Science and Engineering, Texas A&M University, College Station, 77843 TX USA
| | - Shawna Thomas
- Department of Computer Science and Engineering, Texas A&M University, College Station, 77843 TX USA
| | - Nancy M. Amato
- Department of Computer Science and Engineering, Texas A&M University, College Station, 77843 TX USA
| |
Collapse
|
18
|
Kurkcuoglu Z, Doruker P. Ligand Docking to Intermediate and Close-To-Bound Conformers Generated by an Elastic Network Model Based Algorithm for Highly Flexible Proteins. PLoS One 2016; 11:e0158063. [PMID: 27348230 PMCID: PMC4922591 DOI: 10.1371/journal.pone.0158063] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2016] [Accepted: 06/09/2016] [Indexed: 01/03/2023] Open
Abstract
Incorporating receptor flexibility in small ligand-protein docking still poses a challenge for proteins undergoing large conformational changes. In the absence of bound structures, sampling conformers that are accessible by apo state may facilitate docking and drug design studies. For this aim, we developed an unbiased conformational search algorithm, by integrating global modes from elastic network model, clustering and energy minimization with implicit solvation. Our dataset consists of five diverse proteins with apo to complex RMSDs 4.7-15 Å. Applying this iterative algorithm on apo structures, conformers close to the bound-state (RMSD 1.4-3.8 Å), as well as the intermediate states were generated. Dockings to a sequence of conformers consisting of a closed structure and its "parents" up to the apo were performed to compare binding poses on different states of the receptor. For two periplasmic binding proteins and biotin carboxylase that exhibit hinge-type closure of two dynamics domains, the best pose was obtained for the conformer closest to the bound structure (ligand RMSDs 1.5-2 Å). In contrast, the best pose for adenylate kinase corresponded to an intermediate state with partially closed LID domain and open NMP domain, in line with recent studies (ligand RMSD 2.9 Å). The docking of a helical peptide to calmodulin was the most challenging case due to the complexity of its 15 Å transition, for which a two-stage procedure was necessary. The technique was first applied on the extended calmodulin to generate intermediate conformers; then peptide docking and a second generation stage on the complex were performed, which in turn yielded a final peptide RMSD of 2.9 Å. Our algorithm is effective in producing conformational states based on the apo state. This study underlines the importance of such intermediate states for ligand docking to proteins undergoing large transitions.
Collapse
Affiliation(s)
- Zeynep Kurkcuoglu
- Department of Chemical Engineering and Polymer Research Center, Bogazici University, Bebek, Istanbul, 34342, Turkey
- * E-mail: (ZK); (PD)
| | - Pemra Doruker
- Department of Chemical Engineering and Polymer Research Center, Bogazici University, Bebek, Istanbul, 34342, Turkey
- * E-mail: (ZK); (PD)
| |
Collapse
|
19
|
Abstract
SUMMARYIt is estimated that allergies afflict up to 40% of the world's population. A primary mediator for allergies is the aggregation of antigens and IgE antibodies bound to cell-surface receptors, FcεRI. Antibody/antigen aggregate formation causes stimulation of mast cells and basophils, initiating cellular degranulation and releasing immune mediators which produce an allergic or anaphylactic response. Understanding the shape and structure of these aggregates can provide critical insights into the allergic response. We have previously developed methods to geometrically model, simulate and analyze antibody aggregation inspired by rigid body robotic motion simulations. Our technique handles the large size and number of molecules involved in aggregation, providing an advantage over traditional simulations such as molecular dynamics (MD) and coarse-grained energetic models. In this paper, we study the impact of model resolution on simulations of geometric structures using both our previously developed Monte Carlo simulation and a novel application of rule-based modeling. These methods complement each other, the former providing explicit geometric detail and the latter providing a generic representation where multiple resolutions can be captured. Our exploration is focused on two antigens, a man-made antigen with three binding sites, DF3, and a common shrimp allergen (antigen), Pen a 1. We find that impact of resolution is minimal for DF3, a small globular antigen, but has a larger impact on Pen a 1, a rod-shaped molecule. The volume reduction caused by the loss in resolution allows more binding site accessibility, which can be quantified using a rule-based model with implicit geometric input. Clustering analysis of our simulation shows good correlation when compared with available experimental results. Moreover, collisions in all-atom reconstructions are negligible, at around 0.2% at 90% reduction.
Collapse
|
20
|
Fonseca R, van den Bedem H, Bernauer J. Probing RNA Native Conformational Ensembles with Structural Constraints. J Comput Biol 2016; 23:362-71. [PMID: 27028235 DOI: 10.1089/cmb.2015.0201] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Noncoding ribonucleic acids (RNA) play a critical role in a wide variety of cellular processes, ranging from regulating gene expression to post-translational modification and protein synthesis. Their activity is modulated by highly dynamic exchanges between three-dimensional conformational substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined by distance constraints in the tertiary structure. The dimensionality reduction enables efficient exploration of conformational space. We show that the conformational distributions obtained with our method broadly sample the conformational landscape observed in NMR experiments. Compared to normal mode analysis-based exploration, our procedure diffuses faster through the experimental ensemble while also accessing conformational substates to greater precision. Our results suggest that conformational sampling with a highly reduced but fully atomistic representation of noncoding RNA expresses key features of their dynamic nature.
Collapse
Affiliation(s)
- Rasmus Fonseca
- 1 INRIA Saclay-Île de France, Bâtiment Alan Turing, Campus de l'École Polytechnique , Palaiseau, France .,2 Laboratoire d'Informatique de l'École Polytechnique (LIX), CNRS UMR 7161, École Polytechnique , Palaiseau, France .,3 Department of Computer Science, University of Copenhagen , Nørre Campus, Copenhagen, Denmark
| | - Henry van den Bedem
- 4 Biosciences Division, SLAC National Accelerator Laboratory, Stanford University , Menlo Park, California
| | - Julie Bernauer
- 1 INRIA Saclay-Île de France, Bâtiment Alan Turing, Campus de l'École Polytechnique , Palaiseau, France .,2 Laboratoire d'Informatique de l'École Polytechnique (LIX), CNRS UMR 7161, École Polytechnique , Palaiseau, France
| |
Collapse
|
21
|
Molloy K, Shehu A. A General, Adaptive, Roadmap-Based Algorithm for Protein Motion Computation. IEEE Trans Nanobioscience 2016; 15:158-65. [PMID: 26863668 DOI: 10.1109/tnb.2016.2519246] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Precious information on protein function can be extracted from a detailed characterization of protein equilibrium dynamics. This remains elusive in wet and dry laboratories, as function-modulating transitions of a protein between functionally-relevant, thermodynamically-stable and meta-stable structural states often span disparate time scales. In this paper we propose a novel, robotics-inspired algorithm that circumvents time-scale challenges by drawing analogies between protein motion and robot motion. The algorithm adapts the popular roadmap-based framework in robot motion computation to handle the more complex protein conformation space and its underlying rugged energy surface. Given known structures representing stable and meta-stable states of a protein, the algorithm yields a time- and energy-prioritized list of transition paths between the structures, with each path represented as a series of conformations. The algorithm balances computational resources between a global search aimed at obtaining a global view of the network of protein conformations and their connectivity and a detailed local search focused on realizing such connections with physically-realistic models. Promising results are presented on a variety of proteins that demonstrate the general utility of the algorithm and its capability to improve the state of the art without employing system-specific insight.
Collapse
|
22
|
Shao Q. Enhanced conformational sampling technique provides an energy landscape view of large-scale protein conformational transitions. Phys Chem Chem Phys 2016; 18:29170-29182. [DOI: 10.1039/c6cp05634b] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
A novel in silico approach (NMA–ITS) is introduced to rapidly and effectively sample the configuration space and give quantitative data for exploring the conformational changes of proteins.
Collapse
Affiliation(s)
- Qiang Shao
- Drug Discovery and Design Center
- CAS Key Laboratory of Receptor Research
- Shanghai Institute of Materia Medica
- Chinese Academy of Sciences
- Shanghai
| |
Collapse
|
23
|
Ceres N, Lavery R. Improving the treatment of coarse-grain electrostatics: CVCEL. J Chem Phys 2015; 143:243118. [DOI: 10.1063/1.4933434] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- N. Ceres
- Bioinformatics: Structures and Interactions, Institut de Biologie et Chimie des Protéines, BMSSI UMR CNRS 5086/Université Lyon I, 7 Passage du Vercors, Lyon 69367, France
| | - R. Lavery
- Bioinformatics: Structures and Interactions, Institut de Biologie et Chimie des Protéines, BMSSI UMR CNRS 5086/Université Lyon I, 7 Passage du Vercors, Lyon 69367, France
| |
Collapse
|
24
|
Abstract
SUMMARYEvidence is emerging that the role of protein structure in disease needs to be rethought. Sequence mutations in proteins are often found to affect the rate at which a protein switches between structures. Modeling structural transitions in wildtype and variant proteins is central to understanding the molecular basis of disease. This paper investigates an efficient algorithmic realization of the stochastic roadmap simulation framework to model structural transitions in wildtype and variants of proteins implicated in human disorders. Our results indicate that the algorithm is able to extract useful information on the impact of mutations on protein structure and function.
Collapse
|
25
|
Abstract
The article reviews the significant contributions to, and the present status of, applications of computational methods for the characterization and prediction of protein-carbohydrate interactions. After a presentation of the specific features of carbohydrate modeling, along with a brief description of the experimental data and general features of carbohydrate-protein interactions, the survey provides a thorough coverage of the available computational methods and tools. At the quantum-mechanical level, the use of both molecular orbitals and density-functional theory is critically assessed. These are followed by a presentation and critical evaluation of the applications of semiempirical and empirical methods: QM/MM, molecular dynamics, free-energy calculations, metadynamics, molecular robotics, and others. The usefulness of molecular docking in structural glycobiology is evaluated by considering recent docking- validation studies on a range of protein targets. The range of applications of these theoretical methods provides insights into the structural, energetic, and mechanistic facets that occur in the course of the recognition processes. Selected examples are provided to exemplify the usefulness and the present limitations of these computational methods in their ability to assist in elucidation of the structural basis underlying the diverse function and biological roles of carbohydrates in their dialogue with proteins. These test cases cover the field of both carbohydrate biosynthesis and glycosyltransferases, as well as glycoside hydrolases. The phenomenon of (macro)molecular recognition is illustrated for the interactions of carbohydrates with such proteins as lectins, monoclonal antibodies, GAG-binding proteins, porins, and viruses.
Collapse
Affiliation(s)
- Serge Pérez
- Department of Molecular Pharmacochemistry, CNRS, University Grenoble-Alpes, Grenoble, France.
| | - Igor Tvaroška
- Department of Chemistry, Slovak Academy of Sciences, Bratislava, Slovak Republic; Department of Chemistry, Faculty of Natural Sciences, Constantine The Philosopher University, Nitra, Slovak Republic.
| |
Collapse
|