1
|
ART-RRT: As-Rigid-As-Possible search for protein conformational transition paths. J Comput Aided Mol Des 2019; 33:705-727. [PMID: 31435895 DOI: 10.1007/s10822-019-00216-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 08/06/2019] [Indexed: 10/26/2022]
Abstract
The possible functions of a protein are strongly related to its structural rearrangements in the presence of other molecules or environmental changes. Hence, the evaluation of transition paths of proteins, which encodes conformational changes between stable states, is important since it may reveal the underlying mechanisms of the biochemical processes related to these motions. During the last few decades, different geometry-based methods have been proposed to predict such transition paths. However, in the cases where the solution requires complex motions, these methods, which typically constrain only locally the molecular structures, could produce physically irrelevant solutions involving self-intersection. Recently, we have proposed ART-RRT, an efficient method for finding ligand-unbinding pathways. It relies on the exploration of energy valleys in low-dimensional spaces, taking advantage of some mechanisms inspired from computer graphics to ensure the consistency of molecular structures. This article extends ART-RRT to the problem of finding probable conformational transition between two stable states for proteins. It relies on a bidirectional exploration rooted on the two end states and introduces an original strategy to attempt connections between the explored regions. The resulting method is able to produce at low computational cost biologically realistic paths free from self-intersection. These paths can serve as valuable input to other advanced methods for the study of proteins. A better understanding of conformational changes of proteins is important since it may reveal the underlying mechanisms of the biochemical processes related to such motions. Recently, the ART-RRT method has been introduced for finding ligand-unbinding pathways. This article presents an adaptation of the method for finding probable conformational transition between two stable states of a protein. The method is not only computationally cost-effective but also able to produce biologically realistic paths which are free from self-intersection.
Collapse
|
2
|
Maximova T, Plaku E, Shehu A. Structure-Guided Protein Transition Modeling with a Probabilistic Roadmap Algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1783-1796. [PMID: 27411226 DOI: 10.1109/tcbb.2016.2586044] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Proteins are macromolecules in perpetual motion, switching between structural states to modulate their function. A detailed characterization of the precise yet complex relationship between protein structure, dynamics, and function requires elucidating transitions between functionally-relevant states. Doing so challenges both wet and dry laboratories, as protein dynamics involves disparate temporal scales. In this paper, we present a novel, sampling-based algorithm to compute transition paths. The algorithm exploits two main ideas. First, it leverages known structures to initialize its search and define a reduced conformation space for rapid sampling. This is key to address the insufficient sampling issue suffered by sampling-based algorithms. Second, the algorithm embeds samples in a nearest-neighbor graph where transition paths can be efficiently computed via queries. The algorithm adapts the probabilistic roadmap framework that is popular in robot motion planning. In addition to efficiently computing lowest-cost paths between any given structures, the algorithm allows investigating hypotheses regarding the order of experimentally-known structures in a transition event. This novel contribution is likely to open up new venues of research. Detailed analysis is presented on multiple-basin proteins of relevance to human disease. Multiscaling and the AMBER ff14SB force field are used to obtain energetically-credible paths at atomistic detail.
Collapse
|
3
|
Devaurs D, Antunes DA, Kavraki LE. Revealing Unknown Protein Structures Using Computational Conformational Sampling Guided by Experimental Hydrogen-Exchange Data. Int J Mol Sci 2018; 19:E3406. [PMID: 30384411 PMCID: PMC6280153 DOI: 10.3390/ijms19113406] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 10/19/2018] [Accepted: 10/25/2018] [Indexed: 11/17/2022] Open
Abstract
Both experimental and computational methods are available to gather information about a protein's conformational space and interpret changes in protein structure. However, experimentally observing and computationally modeling large proteins remain critical challenges for structural biology. Our work aims at addressing these challenges by combining computational and experimental techniques relying on each other to overcome their respective limitations. Indeed, despite its advantages, an experimental technique such as hydrogen-exchange monitoring cannot produce structural models because of its low resolution. Additionally, the computational methods that can generate such models suffer from the curse of dimensionality when applied to large proteins. Adopting a common solution to this issue, we have recently proposed a framework in which our computational method for protein conformational sampling is biased by experimental hydrogen-exchange data. In this paper, we present our latest application of this computational framework: generating an atomic-resolution structural model for an unknown protein state. For that, starting from an available protein structure, we explore the conformational space of this protein, using hydrogen-exchange data on this unknown state as a guide. We have successfully used our computational framework to generate models for three proteins of increasing size, the biggest one undergoing large-scale conformational changes.
Collapse
Affiliation(s)
- Didier Devaurs
- Department of Computer Science, Rice University, 6100 Main St, Houston, TX 77005, USA.
| | - Dinler A Antunes
- Department of Computer Science, Rice University, 6100 Main St, Houston, TX 77005, USA.
| | - Lydia E Kavraki
- Department of Computer Science, Rice University, 6100 Main St, Houston, TX 77005, USA.
| |
Collapse
|
4
|
Devaurs D, Papanastasiou M, Antunes DA, Abella JR, Moll M, Ricklin D, Lambris JD, Kavraki LE. Native State of Complement Protein C3d Analysed via Hydrogen Exchange and Conformational Sampling. INTERNATIONAL JOURNAL OF COMPUTATIONAL BIOLOGY AND DRUG DESIGN 2018; 11:90-113. [PMID: 30700993 PMCID: PMC6349257 DOI: 10.1504/ijcbdd.2018.090834] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Hydrogen/deuterium exchange detected by mass spectrometry (HDXMS) provides valuable information on protein structure and dynamics. Although HDX-MS data is often interpreted using crystal structures, it was suggested that conformational ensembles produced by molecular dynamics simulations yield more accurate interpretations. In this paper, we analyse the complement protein C3d by performing an HDX-MS experiment, and evaluate several interpretation methodologies using an existing prediction model to derive HDX-MS data from protein structure. To interpret and refine C3d's HDX-MS data, we look for a conformation (or conformational ensemble) of C3d that allows computationally replicating this data. We confirm that crystal structures are not a good choice and suggest that conformational ensembles produced by molecular dynamics simulations might not always be satisfactory either. Finally, we show that coarse-grained conformational sampling of C3d produces a conformation from which its HDX-MS data can be replicated and refined.
Collapse
Affiliation(s)
- Didier Devaurs
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Malvina Papanastasiou
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Broad Institute of MIT & Harvard, Cambridge, MA, USA
| | - Dinler A Antunes
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Jayvee R Abella
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Mark Moll
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Daniel Ricklin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Pharmaceutical Sciences, University of Basel, Basel, Switzerland
| | - John D Lambris
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Lydia E Kavraki
- Department of Computer Science, Rice University, Houston, TX, USA
| |
Collapse
|
5
|
Abella JR, Moll M, Kavraki LE. Maintaining and Enhancing Diversity of Sampled Protein Conformations in Robotics-Inspired Methods. J Comput Biol 2018; 25:3-20. [PMID: 29035572 PMCID: PMC5756939 DOI: 10.1089/cmb.2017.0164] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The ability to efficiently sample structurally diverse protein conformations allows one to gain a high-level view of a protein's energy landscape. Algorithms from robot motion planning have been used for conformational sampling, and several of these algorithms promote diversity by keeping track of "coverage" in conformational space based on the local sampling density. However, large proteins present special challenges. In particular, larger systems require running many concurrent instances of these algorithms, but these algorithms can quickly become memory intensive because they typically keep previously sampled conformations in memory to maintain coverage estimates. In addition, robotics-inspired algorithms depend on defining useful perturbation strategies for exploring the conformational space, which is a difficult task for large proteins because such systems are typically more constrained and exhibit complex motions. In this article, we introduce two methodologies for maintaining and enhancing diversity in robotics-inspired conformational sampling. The first method addresses algorithms based on coverage estimates and leverages the use of a low-dimensional projection to define a global coverage grid that maintains coverage across concurrent runs of sampling. The second method is an automatic definition of a perturbation strategy through readily available flexibility information derived from B-factors, secondary structure, and rigidity analysis. Our results show a significant increase in the diversity of the conformations sampled for proteins consisting of up to 500 residues when applied to a specific robotics-inspired algorithm for conformational sampling. The methodologies presented in this article may be vital components for the scalability of robotics-inspired approaches.
Collapse
Affiliation(s)
- Jayvee R Abella
- 1 Department of Computer Science, Rice University , Houston, Texas
| | - Mark Moll
- 1 Department of Computer Science, Rice University , Houston, Texas
| | - Lydia E Kavraki
- 1 Department of Computer Science, Rice University , Houston, Texas
| |
Collapse
|
6
|
Budday D, Fonseca R, Leyendecker S, van den Bedem H. Frustration-guided motion planning reveals conformational transitions in proteins. Proteins 2017; 85:1795-1807. [DOI: 10.1002/prot.25333] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 05/19/2017] [Accepted: 06/07/2017] [Indexed: 01/27/2023]
Affiliation(s)
- Dominik Budday
- Chair of Applied Dynamics, University of Erlangen-Nuremberg; Erlangen Germany
| | - Rasmus Fonseca
- Department of Molecular and Cellular Physiology; Stanford University; California Menlo Park
- Biosciences Division; SLAC National Accelerator Laboratory, Stanford University; California Menlo Park
| | - Sigrid Leyendecker
- Chair of Applied Dynamics, University of Erlangen-Nuremberg; Erlangen Germany
| | - Henry van den Bedem
- Biosciences Division; SLAC National Accelerator Laboratory, Stanford University; California Menlo Park
| |
Collapse
|
7
|
Devaurs D, Antunes DA, Papanastasiou M, Moll M, Ricklin D, Lambris JD, Kavraki LE. Coarse-Grained Conformational Sampling of Protein Structure Improves the Fit to Experimental Hydrogen-Exchange Data. Front Mol Biosci 2017; 4:13. [PMID: 28344973 PMCID: PMC5344923 DOI: 10.3389/fmolb.2017.00013] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2016] [Accepted: 02/24/2017] [Indexed: 11/13/2022] Open
Abstract
Monitoring hydrogen/deuterium exchange (HDX) undergone by a protein in solution produces experimental data that translates into valuable information about the protein's structure. Data produced by HDX experiments is often interpreted using a crystal structure of the protein, when available. However, it has been shown that the correspondence between experimental HDX data and crystal structures is often not satisfactory. This creates difficulties when trying to perform a structural analysis of the HDX data. In this paper, we evaluate several strategies to obtain a conformation providing a good fit to the experimental HDX data, which is a premise of an accurate structural analysis. We show that performing molecular dynamics simulations can be inadequate to obtain such conformations, and we propose a novel methodology involving a coarse-grained conformational sampling approach instead. By extensively exploring the intrinsic flexibility of a protein with this approach, we produce a conformational ensemble from which we extract a single conformation providing a good fit to the experimental HDX data. We successfully demonstrate the applicability of our method to four small and medium-sized proteins.
Collapse
Affiliation(s)
- Didier Devaurs
- Department of Computer Science, Rice UniversityHouston, TX, USA
| | | | - Malvina Papanastasiou
- Department of Pathology and Laboratory Medicine, University of PennsylvaniaPhiladelphia, PA, USA
- Broad Institute of MIT & HarvardCambridge, MA, USA
| | - Mark Moll
- Department of Computer Science, Rice UniversityHouston, TX, USA
| | - Daniel Ricklin
- Department of Pathology and Laboratory Medicine, University of PennsylvaniaPhiladelphia, PA, USA
- Department of Pharmaceutical Sciences, University of BaselBasel, Switzerland
| | - John D. Lambris
- Department of Pathology and Laboratory Medicine, University of PennsylvaniaPhiladelphia, PA, USA
| | | |
Collapse
|
8
|
Novinskaya A, Devaurs D, Moll M, Kavraki LE. Defining Low-Dimensional Projections to Guide Protein Conformational Sampling. J Comput Biol 2016; 24:79-89. [PMID: 27892695 DOI: 10.1089/cmb.2016.0144] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Exploring the conformational space of proteins is critical to characterize their functions. Numerous methods have been proposed to sample a protein's conformational space, including techniques developed in the field of robotics and known as sampling-based motion-planning algorithms (or sampling-based planners). However, these algorithms suffer from the curse of dimensionality when applied to large proteins. Many sampling-based planners attempt to mitigate this issue by keeping track of sampling density to guide conformational sampling toward unexplored regions of the conformational space. This is often done using low-dimensional projections as an indirect way to reduce the dimensionality of the exploration problem. However, how to choose an appropriate projection and how much it influences the planner's performance are still poorly understood issues. In this article, we introduce two methodologies defining low-dimensional projections that can be used by sampling-based planners for protein conformational sampling. The first method leverages information about a protein's flexibility to construct projections that can efficiently guide conformational sampling, when expert knowledge is available. The second method builds similar projections automatically, without expert intervention. We evaluate the projections produced by both methodologies on two conformational search problems involving three middle-size proteins. Our experiments demonstrate that (i) defining projections based on expert knowledge can benefit conformational sampling and (ii) automatically constructing such projections is a reasonable alternative.
Collapse
Affiliation(s)
| | - Didier Devaurs
- Department of Computer Science, Rice University , Houston, Texas
| | - Mark Moll
- Department of Computer Science, Rice University , Houston, Texas
| | - Lydia E Kavraki
- Department of Computer Science, Rice University , Houston, Texas
| |
Collapse
|
9
|
Taylor WR. Steric exclusion and constraint satisfaction in multi-scale coarse-grained simulations. Comput Biol Chem 2016; 64:297-312. [PMID: 27543766 PMCID: PMC5272901 DOI: 10.1016/j.compbiolchem.2016.06.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 06/07/2016] [Accepted: 06/08/2016] [Indexed: 11/26/2022]
Abstract
The method developed here provides a fast and flexible way to capture the structure of most macromolecules in a hierarchy of increasingly larger coarse-grained levels without losing the detailed low-level representation. Molecules can then be viewed using an integral graphical viewer or animated through a high-level application programming interface (API) in C++. Although much testing remains to be done, the system has the potential to be applied to very large dynamic systems including both protein and nucleic acids.
An algorithm is described for the interaction of a hierarchy of objects that seeks to circumvent a fundamental problem in coarse-grained modelling which is the loss of fine detail when components become bundled together. A “currants-in-jelly” model is developed that provides a flexible approach in which the contribution of the soft high-level objects (jelly-like) are employed to protect the underlying atomic structure (currants), while still allowing them to interact. Idealised chains were used to establish the parameters to achieve this degree of interaction over a hierarchy spanning four levels and in a more realistic example, the distortion experienced by a protein domain structure during collision was measured and the parameters refined. This model of steric repulsion was then combined with sets of predicted distance constraints, derived from correlated mutation analysis. Firstly, an integral trans-membrane protein was modelled in which the packing of the seven helices was refined but without topological rearrangement. Secondly, an RNA structure was ‘folded’ under the predicted constraints, starting only from its 2-dimensional secondary structure prediction.
Collapse
|
10
|
Antunes DA, Devaurs D, Kavraki LE. Understanding the challenges of protein flexibility in drug design. Expert Opin Drug Discov 2015; 10:1301-13. [DOI: 10.1517/17460441.2015.1094458] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|