1
|
Moradi S, Nowroozi A, Aryaei Nezhad M, Jalali P, Khosravi R, Shahlaei M. A review on description dynamics and conformational changes of proteins using combination of principal component analysis and molecular dynamics simulation. Comput Biol Med 2024; 183:109245. [PMID: 39388840 DOI: 10.1016/j.compbiomed.2024.109245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 09/22/2024] [Accepted: 10/01/2024] [Indexed: 10/12/2024]
Abstract
Understanding how proteins behave dynamically and undergo conformational changes is essential to comprehending their biological roles. This review article examines the potent tool of using Molecular Dynamics simulations in conjunction with Principal Component Analysis (PCA) to explore protein dynamics. Molecular dynamics data can be made easier to read by removing prominent patterns through the use of PCA, a sophisticated dimensionality reduction approach. Researchers can obtain critical insights into the fundamental principles governing protein function by using PCA on MD simulation data. We provide a systematic approach to PCA that includes data collection, input coordinate selection, and result interpretation. Protein collective movements and fundamental dynamics are made visible by PCA, which makes it possible to identify conformational substates that are crucial to function. By means of principal component analysis, scientists are able to observe and measure large-scale movements, like hinge bending and domain motions, as well as pinpoint areas of protein structural stiffness and flexibility. Moreover, PCA allows temporal separation, distinguishing slower global motions from faster local changes. A strong foundation for researching protein dynamics is provided by the combination of PCA and Molecular Dynamics simulations, which have applications in drug development and enhance our comprehension of intricate biological systems.
Collapse
Affiliation(s)
- Sajad Moradi
- Nano Drug Delivery Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Amin Nowroozi
- Pharmaceutical Sciences Research Center, Faculty of Pharmacy, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Mohammad Aryaei Nezhad
- Pharmaceutical Sciences Research Center, Faculty of Pharmacy, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Parvin Jalali
- Nano Drug Delivery Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Rasool Khosravi
- Pharmaceutical Sciences Research Center, Faculty of Pharmacy, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Mohsen Shahlaei
- Nano Drug Delivery Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran; Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran.
| |
Collapse
|
2
|
Ruzmetov T, Montes R, Sun J, Chen SH, Tang Z, Chang CEA. Binding Kinetics Toolkit for Analyzing Transient Molecular Conformations and Computing Free Energy Landscapes. J Phys Chem A 2022; 126:8761-8770. [DOI: 10.1021/acs.jpca.2c05499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Talant Ruzmetov
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| | - Ruben Montes
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| | - Jianan Sun
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| | - Si-Han Chen
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| | - Zhiye Tang
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| | - Chia-en A. Chang
- Department of Chemistry, University of California at Riverside, Riverside, California92521, United States
| |
Collapse
|
3
|
Principal Component Analysis and Related Methods for Investigating the Dynamics of Biological Macromolecules. J 2022. [DOI: 10.3390/j5020021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Principal component analysis (PCA) is used to reduce the dimensionalities of high-dimensional datasets in a variety of research areas. For example, biological macromolecules, such as proteins, exhibit many degrees of freedom, allowing them to adopt intricate structures and exhibit complex functions by undergoing large conformational changes. Therefore, molecular simulations of and experiments on proteins generate a large number of structure variations in high-dimensional space. PCA and many PCA-related methods have been developed to extract key features from such structural data, and these approaches have been widely applied for over 30 years to elucidate macromolecular dynamics. This review mainly focuses on the methodological aspects of PCA and related methods and their applications for investigating protein dynamics.
Collapse
|
4
|
Chen H, Ogden D, Pant S, Cai W, Tajkhorshid E, Moradi M, Roux B, Chipot C. A Companion Guide to the String Method with Swarms of Trajectories: Characterization, Performance, and Pitfalls. J Chem Theory Comput 2022; 18:1406-1422. [PMID: 35138832 PMCID: PMC8904302 DOI: 10.1021/acs.jctc.1c01049] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The string method with swarms of trajectories (SMwST) is an algorithm that identifies a physically meaningful transition pathway─a one-dimensional curve, embedded within a high-dimensional space of selected collective variables. The SMwST algorithm leans on a series of short, unbiased molecular dynamics simulations spawned at different locations of the discretized path, from whence an average dynamic drift is determined to evolve the string toward an optimal pathway. However conceptually simple in both its theoretical formulation and practical implementation, the SMwST algorithm is computationally intensive and requires a careful choice of parameters for optimal cost-effectiveness in applications to challenging problems in chemistry and biology. In this contribution, the SMwST algorithm is presented in a self-contained manner, discussing with a critical eye its theoretical underpinnings, applicability, inherent limitations, and use in the context of path-following free-energy calculations and their possible extension to kinetics modeling. Through multiple simulations of a prototypical polypeptide, combining the search of the transition pathway and the computation of the potential of mean force along it, several practical aspects of the methodology are examined with the objective of optimizing the computational effort, yet without sacrificing accuracy. In light of the results reported here, we propose some general guidelines aimed at improving the efficiency and reliability of the computed pathways and free-energy profiles underlying the conformational transitions at hand.
Collapse
Affiliation(s)
- Haochuan Chen
- Research Center for Analytical Sciences, College of Chemistry, Tianjin Key Laboratory of Biosensing and Molecular Recognition, Nankai University, Tianjin 300071, China
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche no 7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy Cedex, France
| | - Dylan Ogden
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, United States
| | - Shashank Pant
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Wensheng Cai
- Research Center for Analytical Sciences, College of Chemistry, Tianjin Key Laboratory of Biosensing and Molecular Recognition, Nankai University, Tianjin 300071, China
| | - Emad Tajkhorshid
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Biochemistry and Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Mahmoud Moradi
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, United States
| | - Benoît Roux
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, United States
| | - Christophe Chipot
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche no 7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy Cedex, France
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
5
|
Bartocci A, Gillet N, Jiang T, Szczepaniak F, Dumont E. Molecular Dynamics Approach for Capturing Calixarene-Protein Interactions: The Case of Cytochrome C. J Phys Chem B 2020; 124:11371-11378. [PMID: 33270456 DOI: 10.1021/acs.jpcb.0c08482] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Functionalized supramolecular cages are of growing importance in biology and biochemistry. They have recently been proposed as efficient auxiliaries to obtain high-resolution cocrystallized proteins. Here, we propose a molecular dynamics investigation of the supramolecular association of sulfonated calix-[8]-arenes to cytochrome c starting from initially distant proteins and ligands. We characterize two main binding sites for the sulfonated calixarene on the cytochrome c surface which are in perfect agreement with the previous experiments with regard to the structure (comparison with the X-ray structure PDB 6GD8) and the binding free energies [comparison between the molecular mechanics Poisson-Boltzmann surface area analysis and the isothermal titration calorimetry measurements]. The per-residue decomposition of the interaction energies reveals the detailed picture of this electrostatically driven association and notably the role of arginine R13 as a bridging residue between the two main anchoring sites. In addition, the analysis of the residue behavior by means of a supervised machine learning protocol unveils the formation of a hydrogen bond network far from the binding sites, increasing the rigidity of the protein. This study paves the way toward an automated procedure to predict the supramolecular protein-cage association, with the possibility of a computational screening of new promising derivatives for controlled protein assembly and protein surface recognition processes.
Collapse
Affiliation(s)
- Alessio Bartocci
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Natacha Gillet
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Tao Jiang
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Florence Szczepaniak
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Elise Dumont
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France.,Institut Universitaire de France, 5 Rue Descartes, 75005 Paris, France
| |
Collapse
|
6
|
Khalil K, Eldash O, Kumar A, Bayoumi M. Intelligent Fault-Prediction Assisted Self-Healing for Embryonic Hardware. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2020; 14:852-866. [PMID: 32746336 DOI: 10.1109/tbcas.2020.2995784] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This paper proposes novel methods for making embryonic bio-inspired hardware efficient against faults through self-healing, fault prediction, and fault-prediction assisted self-healing. The proposed self-healing recovers a faulty embryonic cell through innovative usage of healthy cells. Through experimentations, it is observed that self-healing is effective, but it takes a considerable amount of time for the hardware to recover from a fault that occurs suddenly without forewarning. To get over this problem of delay, novel deep learning-based formulations are proposed for fault predictions. The proposed self-healing technique is then deployed along with the proposed fault prediction methods to gauge the accuracy and delay of embryonic hardware. The proposed fault prediction and self-healing methods have been implemented in VHDL over FPGA. The proposed fault predictions achieve high accuracy with low training time. The accuracy is up to 99.36% with the training time of 2.16 min. The area overhead of the proposed self-healing method is 34%, and the fault recovery percentage is 75%. To the best of our knowledge, this is the first such work in embryonic hardware, and it is expected to open a new frontier in fault-prediction assisted self-healing for embryonic systems.
Collapse
|
7
|
Fleetwood O, Kasimova MA, Westerlund AM, Delemotte L. Molecular Insights from Conformational Ensembles via Machine Learning. Biophys J 2020; 118:765-780. [PMID: 31952811 PMCID: PMC7002924 DOI: 10.1016/j.bpj.2019.12.016] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 11/21/2019] [Accepted: 12/16/2019] [Indexed: 01/04/2023] Open
Abstract
Biomolecular simulations are intrinsically high dimensional and generate noisy data sets of ever-increasing size. Extracting important features from the data is crucial for understanding the biophysical properties of molecular processes, but remains a big challenge. Machine learning (ML) provides powerful dimensionality reduction tools. However, such methods are often criticized as resembling black boxes with limited human-interpretable insight. We use methods from supervised and unsupervised ML to efficiently create interpretable maps of important features from molecular simulations. We benchmark the performance of several methods, including neural networks, random forests, and principal component analysis, using a toy model with properties reminiscent of macromolecular behavior. We then analyze three diverse biological processes: conformational changes within the soluble protein calmodulin, ligand binding to a G protein-coupled receptor, and activation of an ion channel voltage-sensor domain, unraveling features critical for signal transduction, ligand binding, and voltage sensing. This work demonstrates the usefulness of ML in understanding biomolecular states and demystifying complex simulations.
Collapse
Affiliation(s)
- Oliver Fleetwood
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden
| | - Marina A Kasimova
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden
| | - Annie M Westerlund
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden
| | - Lucie Delemotte
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden.
| |
Collapse
|
8
|
Prajapati GK, Pandey B, Mishra AK, Baek KH, Pandey DM. Identification of GCC-box and TCC-box motifs in the promoters of differentially expressed genes in rice (Oryza sativa L.): Experimental and computational approaches. PLoS One 2019; 14:e0214964. [PMID: 31026257 PMCID: PMC6485614 DOI: 10.1371/journal.pone.0214964] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Accepted: 03/23/2019] [Indexed: 01/22/2023] Open
Abstract
The transcription factor selectively binds with the cis-regulatory elements of the promoter and regulates the differential expression of genes. In this study, we aimed to identify and validate the presence of GCC-box and TCC-box motifs in the promoters of upregulated differentially expressed genes (UR-DEGs) and downregulated differentially expressed genes (DR-DEGs) under anoxia using molecular beacon probe (MBP) based real-time PCR. The GCC-box motif was detected in UR-DEGs (DnaJ and 60S ribosomal protein L7 genes), whereas, the TCC-box was detected in DR-DEGs (DnaK and CPuORF11 genes). In addition, the mechanism of interaction of AP2/EREBP family transcription factor (LOC_Os03g22170) with GCC-box promoter motif present in DnaJ gene (LOC_Os06g09560) and 60S ribosomal protein L7 gene (LOC_Os08g42920); and TCC-box promoter motif of DnaK gene (LOC_Os02g48110) and CPuORF11 gene (LOC_Os02g01240) were explored using molecular dynamics (MD) simulations analysis including binding free energy calculations, principal component analyses, and free energy landscapes. The binding free energy analysis revealed that AP2/EREBP model residues such as Arg68, Arg72, Arg83, Lys87, and Arg90 were commonly involved in the formation of hydrogen bonds with GCC and TCC-box promoter motifs, suggesting that these residues are critical for strong interaction. The movement of the entire protein bound to DNA was restricted, confirming the stability of the complex. This study provides comprehensive binding information and a more detailed view of the dynamic interaction between proteins and DNA.
Collapse
Affiliation(s)
- Gopal Kumar Prajapati
- Department of Bio-Engineering, Birla Institute of Technology, Mesra, Ranchi, Jharkhand, India
| | - Bharati Pandey
- Department of Bio-Engineering, Birla Institute of Technology, Mesra, Ranchi, Jharkhand, India
| | - Awdhesh Kumar Mishra
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Republic of Korea
| | - Kwang-Hyun Baek
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Republic of Korea
- * E-mail: (DP); (KB)
| | - Dev Mani Pandey
- Department of Bio-Engineering, Birla Institute of Technology, Mesra, Ranchi, Jharkhand, India
- * E-mail: (DP); (KB)
| |
Collapse
|