Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bandyopadhyay S, Mondal J. A deep autoencoder framework for discovery of metastable ensembles in biomacromolecules. J Chem Phys 2021;155:114106. [PMID: 34551528 DOI: 10.1063/5.0059965] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Indexed: 01/01/2023] Open

Cited by Other Article(s)

Ishizone T, Matsunaga Y, Fuchigami S, Nakamura K. Representation of Protein Dynamics Disentangled by Time-Structure-Based Prior. J Chem Theory Comput 2024;20:436-450. [PMID: 38151233 DOI: 10.1021/acs.jctc.3c01025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/29/2023]

Adhikari S, Mondal J. Machine Learning Subtle Conformational Change due to Phosphorylation in Intrinsically Disordered Proteins. J Phys Chem B 2023;127:9433-9449. [PMID: 37905972 DOI: 10.1021/acs.jpcb.3c05136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/02/2023]

Abstract

Phosphorylation of intrinsically disordered proteins/regions (IDPs/IDRs) has a profound effect on biological functions such as cell signaling, protein folding or unfolding, and long-range allosteric effects. Here, we focus on two IDPs found in Saccharomyces cerevisiae, the 83-residue IDR of transcription factor Ash1 and the 92-residue N-terminal region of the CDK inhibitor Sic1, for which experimental measurements of average conformational properties, namely radius of gyration and structure factor, indicate negligible changes upon phosphorylation. We show that a judicious dissection of the conformational ensemble, combining unsupervised machine learning with extensive molecular dynamics (MD) trajectories, can highlight key differences and similarities between the phosphorylated and wild-type IDPs. In particular, we develop a Markov state model (MSM) using the latent-space dimensions of an autoencoder trained on multi-microsecond-long MD simulation trajectories. Examination of structural changes among the states, prior to and upon phosphorylation, captured several similarities and differences in their backbone contact maps, secondary structure, and torsion angles. Hydrogen bonding analysis revealed that phosphorylation not only increases the number of hydrogen bonds but also switches the pattern of hydrogen bonding between the backbone and side chain atoms with the phosphorylated residues. We also observe that although phosphorylation introduces salt bridges, there is a loss of the cation-π interaction. Phosphorylation also increased the probability of long-range hydrophobic contacts, enhanced interaction with water molecules, and improved the local structure of water, as evident from the geometric order parameters. These machine-learnt states provide insights that would otherwise be difficult to obtain experimentally and that are important for understanding the role of IDP phosphorylation in biological function.
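To illustrate the Markov-state-model step mentioned in this abstract, the sketch below assigns latent-space points to discrete states and estimates a row-normalized transition matrix at a fixed lag. It is a minimal sketch under stated assumptions, not the authors' pipeline: the latent coordinates are assumed to come from an already trained autoencoder, and the state centers, the nearest-center assignment, the lag, and the function names `assign_states` and `estimate_transition_matrix` are all hypothetical.

```python
def assign_states(latent_points, centers):
    """Map each latent-space point to the index of its nearest center."""
    def sq_dist(p, q):
        return sum((a - b) ** 2 for a, b in zip(p, q))
    return [min(range(len(centers)), key=lambda k: sq_dist(p, centers[k]))
            for p in latent_points]


def estimate_transition_matrix(dtraj, n_states, lag=1):
    """Count i -> j transitions separated by `lag` frames, then normalize each row."""
    counts = [[0.0] * n_states for _ in range(n_states)]
    for i, j in zip(dtraj[:-lag], dtraj[lag:]):
        counts[i][j] += 1.0
    matrix = []
    for row in counts:
        total = sum(row)
        # Rows of states that were never left stay all-zero instead of dividing by zero.
        matrix.append([c / total if total > 0 else 0.0 for c in row])
    return matrix


# Toy 2D latent trajectory hopping between two basins (made-up numbers).
latent = [(0.1, 0.0), (0.2, 0.1), (0.9, 1.0), (1.1, 0.9),
          (0.0, 0.1), (0.1, 0.2), (1.0, 1.0)]
centers = [(0.0, 0.0), (1.0, 1.0)]  # hypothetical cluster centers

dtraj = assign_states(latent, centers)
T = estimate_transition_matrix(dtraj, n_states=2, lag=1)
print(dtraj)
print(T)
```

In practice the lag would be chosen by checking that the implied timescales have converged, and a dedicated MSM library would usually replace these hand-written helpers.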

Lemcke S, Appeldorn JH, Wand M, Speck T. Toward a structural identification of metastable molecular conformations. J Chem Phys 2023;159:114105. [PMID: 37712784 DOI: 10.1063/5.0164145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2023] [Accepted: 08/21/2023] [Indexed: 09/16/2023] Open

Bandyopadhyay S, Mondal J. A deep encoder-decoder framework for identifying distinct ligand binding pathways. J Chem Phys 2023;158:2890463. [PMID: 37184003 DOI: 10.1063/5.0145197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2023] [Accepted: 04/25/2023] [Indexed: 05/16/2023] Open

Ahalawat N, Sahil M, Mondal J. Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning. J Chem Theory Comput 2023;19:2644-2657. [PMID: 37068044 DOI: 10.1021/acs.jctc.2c00932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/18/2023]

Abstract

A long-standing target in elucidating the biomolecular recognition process is the identification of binding-competent conformations of the receptor protein. However, protein conformational plasticity and the stochastic nature of the recognition processes often preclude the assignment of a specific protein conformation to an individual ligand-bound pose. Here, we demonstrate that a computational framework coined as RF-TICA-MD, which integrates an ensemble decision-tree-based Random Forest (RF) machine learning (ML) technique with an unsupervised dimension reduction approach time-structured independent component analysis (TICA), provides an efficient and unambiguous solution toward resolving protein conformational plasticity and the substrate binding process. In particular, we consider multimicrosecond-long molecular dynamics (MD) simulation trajectories of a ligand recognition process in solvent-inaccessible cavities of archetypal proteins T4 lysozyme and cytochrome P450cam. We show that in a scenario in which clear correspondence between protein conformation and binding-competent macrostates could not be obtained via an unsupervised dimension reduction approach, an a priori decision-tree-based supervised classification of the simulated recognition trajectories via RF would help characterize key amino acid residue pairs of the protein that are deemed sensitive for ligand binding. A subsequent unsupervised dimensional reduction of the selected residue pairs via TICA would then delineate a conformational landscape of protein which is able to demarcate ligand-bound poses from unbound ones. The proposed RF-TICA-MD approach is shown to be data agnostic and found to be robust when using other ML-based classification methods such as XGBoost. As a promising spinoff of the protocol, the framework is found to be capable of identifying distal protein locations which would be allosterically important for ligand binding and would characterize their roles in recognition pathways. 
A Python implementation of the proposed ML workflow is available on GitHub at https://github.com/navjeet0211/rf-tica-md.
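The abstract's second stage looks for slow collective motions among the selected residue pairs. As a rough illustration of that idea only, the sketch below ranks individual features by their lagged autocorrelation, the quantity that TICA maximizes over linear combinations of features. It is not the RF-TICA-MD workflow and not the code in the repository above: there is no Random Forest step and no linear combination of features, and the function names, the feature names `pair_A` and `pair_B`, and the toy data are all hypothetical.

```python
def lagged_autocorrelation(series, lag=1):
    """Normalized autocorrelation of a mean-free time series at the given lag."""
    n = len(series)
    mean = sum(series) / n
    x = [v - mean for v in series]
    head, tail = x[:-lag], x[lag:]
    num = sum(a * b for a, b in zip(head, tail))
    den = (sum(a * a for a in head) * sum(b * b for b in tail)) ** 0.5
    return num / den if den > 0 else 0.0


def rank_slow_features(features, lag=1):
    """Return feature names sorted from slowest (highest autocorrelation) to fastest."""
    scores = {name: lagged_autocorrelation(values, lag)
              for name, values in features.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy residue-pair distance series (made-up numbers): one slow switch, one fast flicker.
features = {
    "pair_A": [0, 0, 0, 0, 1, 1, 1, 1],
    "pair_B": [1, -1, 1, -1, 1, -1, 1, -1],
}
ranking = rank_slow_features(features, lag=1)
print(ranking)
```

A full TICA solves a generalized eigenvalue problem on the instantaneous and time-lagged covariance matrices, so it can also find slow motions spread across several features, which this per-feature score cannot.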

Dutta P, Sengupta N. Efficient Interrogation of the Kinetic Barriers Demarcating Catalytic States of a Tyrosine Kinase with Optimal Physical Descriptors and Mixture Models. Chemphyschem 2023;24:e202200595. [PMID: 36394126 DOI: 10.1002/cphc.202200595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/10/2022] [Revised: 11/16/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022]

Sahil M, Sarkar S, Mondal J. Long-time-step molecular dynamics can retard simulation of protein-ligand recognition process. Biophys J 2023;122:802-816. [PMID: 36726313 PMCID: PMC10027446 DOI: 10.1016/j.bpj.2023.01.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/08/2022] [Revised: 10/31/2022] [Accepted: 01/25/2023] [Indexed: 02/03/2023] Open

Abstract

Molecular dynamics (MD) simulation of biologically relevant processes at realistic time scale and atomistic precision is generally limited by prohibitively large computational cost, due to its restriction of using an ultrashort integration time step (1-2 fs). A popular numerical recipe to reduce the associated computational burden is adopting schemes that would allow relatively longer-time-step for MD propagation. Here, we explore the perceived potential of one of the most frequently used long-time-step protocols, namely the hydrogen mass repartitioning (HMR) approach, in alleviating the computational overhead associated with simulation of the kinetic process of protein-ligand recognition events. By repartitioning the mass of heavier atoms to their linked hydrogen atoms, HMR leverages around twofold longer time step than regular simulation, holding promise of significant performance boost. However, our probe into direct simulation of the protein-ligand recognition event, one of the computationally most challenging processes, shows that long-time-step HMR MD simulations do not necessarily translate to a computationally affordable solution. Our investigations spanning cumulative 176 μs in three independent proteins (T4 lysozyme, sensor domain of MopR, and galectin-3) show that long-time-step HMR-based MD simulations can catch the ligand in its act of recognizing the native cavity. But, as a major caveat, the ligand is found to require significantly longer time to identify buried native protein cavity in an HMR MD simulation than regular simulation, thereby defeating the purpose of its usage for performance upgrade. A molecular analysis shows that the longer time required by a ligand to recognize the protein in HMR is rooted in faster diffusion of the ligand, which reduces the survival probability of decisive on-pathway metastable intermediates, thereby slowing down the eventual recognition process at the native cavity. 
Together, the investigation stresses the need to carefully assess the pitfalls of long-time-step algorithms before using them to boost performance in biomolecular recognition simulations.
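The mass bookkeeping behind hydrogen mass repartitioning can be sketched as follows: each hydrogen is raised to a target mass and the difference is subtracted from the heavy atom it is bonded to, so the total mass of the system is unchanged. This is a minimal sketch under stated assumptions, not the protocol used in the paper: the target hydrogen mass of 3.024 Da is an assumed illustrative value, and the function name `repartition_hydrogen_mass` and the toy methyl group are hypothetical.

```python
def repartition_hydrogen_mass(masses, bonds, is_hydrogen, target_h_mass=3.024):
    """Shift mass from each heavy atom to its bonded hydrogens, conserving total mass."""
    new_masses = list(masses)
    for a, b in bonds:
        if is_hydrogen[a] == is_hydrogen[b]:
            continue  # only heavy-atom/hydrogen bonds are repartitioned
        h, heavy = (a, b) if is_hydrogen[a] else (b, a)
        delta = target_h_mass - new_masses[h]
        new_masses[h] += delta      # hydrogen becomes heavier
        new_masses[heavy] -= delta  # bonded heavy atom becomes lighter by the same amount
    return new_masses


# Toy methyl group: one carbon (index 0) bonded to three hydrogens (indices 1 to 3).
masses = [12.011, 1.008, 1.008, 1.008]
is_hydrogen = [False, True, True, True]
bonds = [(0, 1), (0, 2), (0, 3)]

hmr_masses = repartition_hydrogen_mass(masses, bonds, is_hydrogen)
print(hmr_masses)
print(sum(masses), sum(hmr_masses))
```

Heavier hydrogens slow the fastest bond vibrations, which is what permits the roughly twofold longer time step that the abstract describes.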

Tian H, Jiang X, Xiao S, La Force H, Larson EC, Tao P. LAST: Latent Space-Assisted Adaptive Sampling for Protein Trajectories. J Chem Inf Model 2023;63:67-75. [PMID: 36472885 PMCID: PMC9904845 DOI: 10.1021/acs.jcim.2c01213] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 12/12/2022]

Abstract

Molecular dynamics (MD) simulation is widely used to study protein conformations and dynamics. However, conventional simulation suffers from being trapped in some local energy minima that are hard to escape. Thus, most of the computational time is spent sampling in the already visited regions. This leads to an inefficient sampling process and further hinders the exploration of protein movements in affordable simulation time. The advancement of deep learning provides new opportunities for protein sampling. Variational autoencoders are a class of deep learning models to learn a low-dimensional representation (referred to as the latent space) that can capture the key features of the input data. Based on this characteristic, we proposed a new adaptive sampling method, latent space-assisted adaptive sampling for protein trajectories (LAST), to accelerate the exploration of protein conformational space. This method comprises cycles of (i) variational autoencoder training, (ii) seed structure selection on the latent space, and (iii) conformational sampling through additional MD simulations. The proposed approach is validated through the sampling of four structures of two protein systems: two metastable states of Escherichia coli adenosine kinase (ADK) and two native states of Vivid (VVD). In all four conformations, seed structures were shown to lie on the boundary of conformation distributions. Moreover, large conformational changes were observed in a shorter simulation time when compared with structural dissimilarity sampling (SDS) and conventional MD (cMD) simulations in both systems. In metastable ADK simulations, LAST explored two transition paths toward two stable states, while SDS explored only one and cMD neither. In VVD light state simulations, LAST was three times faster than cMD simulation with a similar conformational space. Overall, LAST is comparable to SDS and is a promising tool in adaptive sampling. 
The LAST method is publicly available at https://github.com/smu-tao-group/LAST to facilitate related research.
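Step (ii) of the cycle described in the abstract, seed structure selection on the latent space, can be sketched with one plausible rule: pick the frames whose latent coordinates lie farthest from the centroid of all sampled points, so that seeds sit on the boundary of the sampled distribution. The abstract does not state the actual selection criterion, so this rule is an assumption and not the LAST implementation in the repository above; the function name `select_seed_frames` and the toy latent coordinates are hypothetical.

```python
def select_seed_frames(latent_points, n_seeds=2):
    """Return indices of the frames farthest from the latent-space centroid."""
    n = len(latent_points)
    dim = len(latent_points[0])
    centroid = [sum(p[d] for p in latent_points) / n for d in range(dim)]

    def sq_dist(p):
        return sum((p[d] - centroid[d]) ** 2 for d in range(dim))

    # Sort frame indices from most to least distant, keep the top n_seeds.
    order = sorted(range(n), key=lambda i: sq_dist(latent_points[i]), reverse=True)
    return sorted(order[:n_seeds])


# Toy 2D latent embedding of six MD frames (made-up numbers).
latent = [(0.0, 0.0), (0.1, -0.1), (-0.1, 0.1),
          (2.0, 2.0), (-2.0, -1.5), (0.2, 0.0)]
seeds = select_seed_frames(latent, n_seeds=2)
print(seeds)
```

The selected frames would then serve as starting structures for the additional MD simulations in step (iii), after which the autoencoder is retrained on the enlarged data set.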

Chen H, Chipot C. Chasing collective variables using temporal data-driven strategies. QRB DISCOVERY 2023;4:e2. [PMID: 37564298 PMCID: PMC10411323 DOI: 10.1017/qrd.2022.23] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 10/11/2022] [Revised: 12/21/2022] [Accepted: 12/29/2022] [Indexed: 01/09/2023] Open

Rickert CA, Lieleg O. Machine learning approaches for biomolecular, biophysical, and biomaterials research. BIOPHYSICS REVIEWS 2022;3:021306. [PMID: 38505413 PMCID: PMC10914139 DOI: 10.1063/5.0082179] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 12/13/2021] [Accepted: 05/12/2022] [Indexed: 03/21/2024]