1
|
Srivastava G, Liu M, Ni X, Pu L, Brylinski M. Machine Learning Techniques to Infer Protein Structure and Function from Sequences: A Comprehensive Review. Methods Mol Biol 2025; 2867:79-104. [PMID: 39576576 DOI: 10.1007/978-1-0716-4196-5_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2024]
Abstract
The elucidation of protein structure and function plays a pivotal role in understanding biological processes and facilitating drug discovery. With the exponential growth of protein sequence data, machine learning techniques have emerged as powerful tools for predicting protein characteristics from sequences alone. This review provides a comprehensive overview of the importance and application of machine learning in inferring protein structure and function. We discuss various machine learning approaches, primarily focusing on convolutional neural networks and natural language processing, and their utilization in predicting protein secondary and tertiary structures, residue-residue contacts, protein function, and subcellular localization. Furthermore, we highlight the challenges associated with using machine learning techniques in this context, such as the availability of high-quality training datasets and the interpretability of models. We also delve into the latest progress in the field concerning the advancements made in the development of intricate deep learning architectures. Overall, this review underscores the significance of machine learning in advancing our understanding of protein structure and function, and its potential to revolutionize drug discovery and personalized medicine.
Collapse
Affiliation(s)
- Gopal Srivastava
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA
| | - Mengmeng Liu
- Division of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA, USA
| | - Xialong Ni
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA
| | - Limeng Pu
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, USA
| | - Michal Brylinski
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA.
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, USA.
| |
Collapse
|
2
|
Ruzmetov T, Hung TI, Jonnalagedda SP, Chen SH, Fasihianifard P, Guo Z, Bhanu B, Chang CEA. Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.05.592587. [PMID: 38979147 PMCID: PMC11230202 DOI: 10.1101/2024.05.05.592587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure-function relationship, and numerous transient but stable conformations of Intrinsically Disordered Proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand regulations and disease-related aggregations of IDPs is challenging both experimentally and computationally. In this paper we first introduce a deep learning-based model, termed Internal Coordinate Net (ICoN), which learns the physical principles of conformational changes from Molecular Dynamics (MD) simulation data. Second, we selected interpolating data points in the learned latent space that rapidly identify novel synthetic conformations with sophisticated and large-scale sidechains and backbone arrangements. Third, with the highly dynamic amyloid-β 1-42 (Aβ42) monomer, our deep learning model provided a comprehensive sampling of Aβ42's conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations with important interactions in atomistic details that are not included in the training data. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. This approach is highly transferable and can be used for any available data for training. The work also demonstrated the ability of deep learning to utilize learned natural atomistic motions in protein conformation sampling.
Collapse
|
3
|
Aupič J, Pokorná P, Ruthstein S, Magistrato A. Predicting Conformational Ensembles of Intrinsically Disordered Proteins: From Molecular Dynamics to Machine Learning. J Phys Chem Lett 2024; 15:8177-8186. [PMID: 39093570 DOI: 10.1021/acs.jpclett.4c01544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Intrinsically disordered proteins and regions (IDP/IDRs) are ubiquitous across all domains of life. Characterized by a lack of a stable tertiary structure, IDP/IDRs populate a diverse set of transiently formed structural states that can promiscuously adapt upon binding with specific interaction partners and/or certain alterations in environmental conditions. This malleability is foundational for their role as tunable interaction hubs in core cellular processes such as signaling, transcription, and translation. Tracing the conformational ensemble of an IDP/IDR and its perturbation in response to regulatory cues is thus paramount for illuminating its function. However, the conformational heterogeneity of IDP/IDRs poses several challenges. Here, we review experimental and computational methods devised to disentangle the conformational landscape of IDP/IDRs, highlighting recent computational advances that permit proteome-wide scans of IDP/IDRs conformations. We briefly evaluate selected computational methods using the disordered N-terminal of the human copper transporter 1 as a test case and outline further challenges in IDP/IDRs ensemble prediction.
Collapse
Affiliation(s)
- Jana Aupič
- CNR-IOM at International School for Advanced Studies (SISSA/ISAS), via Bonomea 265, 34136 Trieste, Italy
| | - Pavlína Pokorná
- CNR-IOM at International School for Advanced Studies (SISSA/ISAS), via Bonomea 265, 34136 Trieste, Italy
| | - Sharon Ruthstein
- Department of Chemistry, Faculty of Exact Sciences and the Institute for Nanotechnology and Advanced Materials (BINA), Bar-Ilan University, 5290002 Ramat-Gan, Israel
| | - Alessandra Magistrato
- CNR-IOM at International School for Advanced Studies (SISSA/ISAS), via Bonomea 265, 34136 Trieste, Italy
| |
Collapse
|
4
|
Xu Q, Yang M, Ji J, Weng J, Wang W, Xu X. Impact of Nonnative Interactions on the Binding Kinetics of Intrinsically Disordered p53 with MDM2: Insights from All-Atom Simulation and Markov State Model Analysis. J Chem Inf Model 2024; 64:5219-5231. [PMID: 38916177 DOI: 10.1021/acs.jcim.3c01833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Intrinsically disordered proteins (IDPs) lack a well-defined tertiary structure but are essential players in various biological processes. Their ability to undergo a disorder-to-order transition upon binding to their partners, known as the folding-upon-binding process, is crucial for their function. One classical example is the intrinsically disordered transactivation domain (TAD) of the tumor suppressor protein p53, which quickly forms a structured α-helix after binding to its partner MDM2, with clinical significance for cancer treatment. However, the contribution of nonnative interactions between the IDP and its partner to the rapid binding kinetics, as well as their interplay with native interactions, is not well understood at the atomic level. Here, we used molecular dynamics simulation and Markov state model (MSM) analysis to study the folding-upon-binding mechanism between p53-TAD and MDM2. Our results suggest that the system progresses from the nascent encounter complex to the well-structured encounter complex and finally reaches the native complex, following an induced-fit mechanism. We found that nonnative hydrophobic and hydrogen bond interactions, combined with native interactions, effectively stabilize the nascent and well-structured encounter complexes. Among the nonnative interactions, Leu25p53-Leu54MDM2 and Leu25p53-Phe55MDM2 are particularly noteworthy, as their interaction strength is close to the optimum. Evidently, strengthening or weakening these interactions could both adversely affect the binding kinetics. Overall, our findings suggest that nonnative interactions are evolutionarily optimized to accelerate the binding kinetics of IDPs in conjunction with native interactions.
Collapse
Affiliation(s)
- Qianjun Xu
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| | - Maohua Yang
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| | - Jie Ji
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| | - Jingwei Weng
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| | - Wenning Wang
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| | - Xin Xu
- Department of Chemistry, Institute of Biomedical Sciences and Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200438, China
| |
Collapse
|
5
|
Maiti S, Singh A, Maji T, Saibo NV, De S. Experimental methods to study the structure and dynamics of intrinsically disordered regions in proteins. Curr Res Struct Biol 2024; 7:100138. [PMID: 38707546 PMCID: PMC11068507 DOI: 10.1016/j.crstbi.2024.100138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 03/12/2024] [Accepted: 03/15/2024] [Indexed: 05/07/2024] Open
Abstract
Eukaryotic proteins often feature long stretches of amino acids that lack a well-defined three-dimensional structure and are referred to as intrinsically disordered proteins (IDPs) or regions (IDRs). Although these proteins challenge conventional structure-function paradigms, they play vital roles in cellular processes. Recent progress in experimental techniques, such as NMR spectroscopy, single molecule FRET, high speed AFM and SAXS, have provided valuable insights into the biophysical basis of IDP function. This review discusses the advancements made in these techniques particularly for the study of disordered regions in proteins. In NMR spectroscopy new strategies such as 13C detection, non-uniform sampling, segmental isotope labeling, and rapid data acquisition methods address the challenges posed by spectral overcrowding and low stability of IDPs. The importance of various NMR parameters, including chemical shifts, hydrogen exchange rates, and relaxation measurements, to reveal transient secondary structures within IDRs and IDPs are presented. Given the high flexibility of IDPs, the review outlines NMR methods for assessing their dynamics at both fast (ps-ns) and slow (μs-ms) timescales. IDPs exert their functions through interactions with other molecules such as proteins, DNA, or RNA. NMR-based titration experiments yield insights into the thermodynamics and kinetics of these interactions. Detailed study of IDPs requires multiple experimental techniques, and thus, several methods are described for studying disordered proteins, highlighting their respective advantages and limitations. The potential for integrating these complementary techniques, each offering unique perspectives, is explored to achieve a comprehensive understanding of IDPs.
Collapse
Affiliation(s)
| | - Aakanksha Singh
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Tanisha Maji
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Nikita V. Saibo
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Soumya De
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| |
Collapse
|
6
|
Arai M, Suetaka S, Ooka K. Dynamics and interactions of intrinsically disordered proteins. Curr Opin Struct Biol 2024; 84:102734. [PMID: 38039868 DOI: 10.1016/j.sbi.2023.102734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/27/2023] [Accepted: 10/30/2023] [Indexed: 12/03/2023]
Abstract
Intrinsically disordered proteins (IDPs) are widespread in eukaryotes and participate in a variety of important cellular processes. Numerous studies using state-of-the-art experimental and theoretical methods have advanced our understanding of IDPs and revealed that disordered regions engage in a large repertoire of intra- and intermolecular interactions through their conformational dynamics, thereby regulating many intracellular functions in concert with folded domains. The mechanisms by which IDPs interact with their partners are diverse, depending on their conformational propensities, and include induced fit, conformational selection, and their mixtures. In addition, IDPs are implicated in many diseases, and progress has been made in designing inhibitors of IDP-mediated interactions. Here we review these recent advances with a focus on the dynamics and interactions of IDPs.
Collapse
Affiliation(s)
- Munehito Arai
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan; Komaba Organization for Educational Excellence, College of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan; Department of Physics, Graduate School of Science, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan.
| | - Shunji Suetaka
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| | - Koji Ooka
- Komaba Organization for Educational Excellence, College of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| |
Collapse
|
7
|
Ko S, Kim J, Lim J, Lee SM, Park JY, Woo J, Scott-Nevros ZK, Kim JR, Yoon H, Kim D. Blanket antimicrobial resistance gene database with structural information, BOARDS, provides insights on historical landscape of resistance prevalence and effects of mutations in enzyme structure. mSystems 2024; 9:e0094323. [PMID: 38085058 PMCID: PMC10871167 DOI: 10.1128/msystems.00943-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/02/2023] [Indexed: 01/24/2024] Open
Abstract
Antimicrobial resistance (AMR) in pathogenic bacteria poses a significant threat to public health, yet there is still a need for development in the tools to deeply understand AMR genes based on genetic or structural information. In this study, we present an interactive web database named Blanket Overarching Antimicrobial-Resistance gene Database with Structural information (BOARDS, sbml.unist.ac.kr), a database that comprehensively includes 3,943 reported AMR gene information for 1,997 extended spectrum beta-lactamase (ESBL) and 1,946 other genes as well as a total of 27,395 predicted protein structures. These structures, which include both wild-type AMR genes and their mutants, were derived from 80,094 publicly available whole-genome sequences. In addition, we developed the rapid analysis and detection tool of antimicrobial-resistance (RADAR), a one-stop analysis pipeline to detect AMR genes across whole-genome sequencing (WGSs). By integrating BOARDS and RADAR, the AMR prevalence landscape for eight multi-drug resistant pathogens was reconstructed, leading to unexpected findings such as the pre-existence of the MCR genes before their official reports. Enzymatic structure prediction-based analysis revealed that the occurrence of mutations found in some ESBL genes was found to be closely related to the binding affinities with their antibiotic substrates. Overall, BOARDS can play a significant role in performing in-depth analysis on AMR.IMPORTANCEWhile the increasing antibiotic resistance (AMR) in pathogen has been a burden on public health, effective tools for deep understanding of AMR based on genetic or structural information remain limited. In this study, a blanket overarching antimicrobial-resistance gene database with structure information (BOARDS)-a web-based database that comprehensively collected AMR gene data with predictive protein structural information was constructed. Additionally, we report the development of a RADAR pipeline that can analyze whole-genome sequences as well. BOARDS, which includes sequence and structural information, has shown the historical landscape and prevalence of the AMR genes and can provide insight into single-nucleotide polymorphism effects on antibiotic degrading enzymes within protein structures.
Collapse
Affiliation(s)
- Seyoung Ko
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
- School of Life Sciences, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Jaehyung Kim
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Jaewon Lim
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Sang-Mok Lee
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Joon Young Park
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Jihoon Woo
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Zoe K. Scott-Nevros
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| | - Jong R. Kim
- School of Engineering and Digital Sciences, Nazarbayev University, Astan, Kazakhstan
| | - Hyunjin Yoon
- Department of Molecular Science and Technology, Ajou University, Suwon, South Korea
| | - Donghyuk Kim
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
- School of Life Sciences, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea
| |
Collapse
|
8
|
Shahrajabian MH, Sun W. Characterization of Intrinsically Disordered Proteins in Healthy and Diseased States by Nuclear Magnetic Resonance. Rev Recent Clin Trials 2024; 19:176-188. [PMID: 38409704 DOI: 10.2174/0115748871271420240213064251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/10/2023] [Accepted: 12/13/2023] [Indexed: 02/28/2024]
Abstract
INTRODUCTION Intrinsically Disordered Proteins (IDPs) are active in different cellular procedures like ordered assembly of chromatin and ribosomes, interaction with membrane, protein, and ligand binding, molecular recognition, binding, and transportation via nuclear pores, microfilaments and microtubules process and disassembly, protein functions, RNA chaperone, and nucleic acid binding, modulation of the central dogma, cell cycle, and other cellular activities, post-translational qualification and substitute splicing, and flexible entropic linker and management of signaling pathways. METHODS The intrinsic disorder is a precise structural characteristic that permits IDPs/IDPRs to be involved in both one-to-many and many-to-one signaling. IDPs/IDPRs also exert some dynamical and structural ordering, being much less constrained in their activities than folded proteins. Nuclear magnetic resonance (NMR) spectroscopy is a major technique for the characterization of IDPs, and it can be used for dynamic and structural studies of IDPs. RESULTS AND CONCLUSION This review was carried out to discuss intrinsically disordered proteins and their different goals, as well as the importance and effectiveness of NMR in characterizing intrinsically disordered proteins in healthy and diseased states.
Collapse
Affiliation(s)
- Mohamad Hesam Shahrajabian
- National Key Laboratory of Agricultural Microbiology, Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Wenli Sun
- National Key Laboratory of Agricultural Microbiology, Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
9
|
Patel KN, Chavda D, Manna M. Molecular Docking of Intrinsically Disordered Proteins: Challenges and Strategies. Methods Mol Biol 2024; 2780:165-201. [PMID: 38987470 DOI: 10.1007/978-1-0716-3985-6_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Intrinsically disordered proteins (IDPs) are a novel class of proteins that have established a significant importance and attention within a very short period of time. These proteins are essentially characterized by their inherent structural disorder, encoded mainly by their amino acid sequences. The profound abundance of IDPs and intrinsically disordered regions (IDRs) in the biological world delineates their deep-rooted functionality. IDPs and IDRs convey such extensive functionality through their unique dynamic nature, which enables them to carry out huge number of multifaceted biomolecular interactions and make them "interaction hub" of the cellular systems. Additionally, with such widespread functions, their misfunctioning is also intimately associated with multiple diseases. Thus, understanding the dynamic heterogeneity of various IDPs along with their interactions with respective binding partners is an important field with immense potentials in biomolecular research. In this context, molecular docking-based computational approaches have proven to be remarkable in case of ordered proteins. Molecular docking methods essentially model the biomolecular interactions in both structural and energetic terms and use this information to characterize the putative interactions between the two participant molecules. However, direct applications of the conventional docking methods to study IDPs are largely limited by their structural heterogeneity and demands for unique IDP-centric strategies. Thus, in this chapter, we have presented an overview of current methodologies for successful docking operations involving IDPs and IDRs. These specialized methods majorly include the ensemble-based and fragment-based approaches with their own benefits and limitations. More recently, artificial intelligence and machine learning-assisted approaches are also used to significantly reduce the complexity and computational burden associated with various docking applications. Thus, this chapter aims to provide a comprehensive summary of major challenges and recent advancements of molecular docking approaches in the IDP field for their better utilization and greater applicability.Asp (D).
Collapse
Affiliation(s)
- Keyur N Patel
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India
| | - Dhruvil Chavda
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India
| | - Moutusi Manna
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India.
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India.
| |
Collapse
|
10
|
Kobeissy F, Goli M, Yadikar H, Shakkour Z, Kurup M, Haidar MA, Alroumi S, Mondello S, Wang KK, Mechref Y. Advances in neuroproteomics for neurotrauma: unraveling insights for personalized medicine and future prospects. Front Neurol 2023; 14:1288740. [PMID: 38073638 PMCID: PMC10703396 DOI: 10.3389/fneur.2023.1288740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 11/01/2023] [Indexed: 02/12/2024] Open
Abstract
Neuroproteomics, an emerging field at the intersection of neuroscience and proteomics, has garnered significant attention in the context of neurotrauma research. Neuroproteomics involves the quantitative and qualitative analysis of nervous system components, essential for understanding the dynamic events involved in the vast areas of neuroscience, including, but not limited to, neuropsychiatric disorders, neurodegenerative disorders, mental illness, traumatic brain injury, chronic traumatic encephalopathy, and other neurodegenerative diseases. With advancements in mass spectrometry coupled with bioinformatics and systems biology, neuroproteomics has led to the development of innovative techniques such as microproteomics, single-cell proteomics, and imaging mass spectrometry, which have significantly impacted neuronal biomarker research. By analyzing the complex protein interactions and alterations that occur in the injured brain, neuroproteomics provides valuable insights into the pathophysiological mechanisms underlying neurotrauma. This review explores how such insights can be harnessed to advance personalized medicine (PM) approaches, tailoring treatments based on individual patient profiles. Additionally, we highlight the potential future prospects of neuroproteomics, such as identifying novel biomarkers and developing targeted therapies by employing artificial intelligence (AI) and machine learning (ML). By shedding light on neurotrauma's current state and future directions, this review aims to stimulate further research and collaboration in this promising and transformative field.
Collapse
Affiliation(s)
- Firas Kobeissy
- Department of Neurobiology, School of Medicine, Neuroscience Institute, Atlanta, GA, United States
| | - Mona Goli
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, United States
| | - Hamad Yadikar
- Department of Biological Sciences Faculty of Science, Kuwait University, Safat, Kuwait
| | - Zaynab Shakkour
- Department of Pathology and Anatomical Sciences, University of Missouri School of Medicine, Columbia, MO, United States
| | - Milin Kurup
- Alabama College of Osteopathic Medicine, Dothan, AL, United States
| | | | - Shahad Alroumi
- Department of Biological Sciences Faculty of Science, Kuwait University, Safat, Kuwait
| | - Stefania Mondello
- Department of Biomedical and Dental Sciences and Morphofunctional Imaging, University of Messina, Messina, Italy
| | - Kevin K. Wang
- Department of Neurobiology, School of Medicine, Neuroscience Institute, Atlanta, GA, United States
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, United States
| |
Collapse
|
11
|
Urrutia R, Espejo D, Evens N, Guerra M, Sühn T, Boese A, Hansen C, Fuentealba P, Illanes A, Poblete V. Clustering Methods for Vibro-Acoustic Sensing Features as a Potential Approach to Tissue Characterisation in Robot-Assisted Interventions. SENSORS (BASEL, SWITZERLAND) 2023; 23:9297. [PMID: 38067671 PMCID: PMC10708300 DOI: 10.3390/s23239297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 11/06/2023] [Accepted: 11/11/2023] [Indexed: 12/18/2023]
Abstract
This article provides a comprehensive analysis of the feature extraction methods applied to vibro-acoustic signals (VA signals) in the context of robot-assisted interventions. The primary objective is to extract valuable information from these signals to understand tissue behaviour better and build upon prior research. This study is divided into three key stages: feature extraction using the Cepstrum Transform (CT), Mel-Frequency Cepstral Coefficients (MFCCs), and Fast Chirplet Transform (FCT); dimensionality reduction employing techniques such as Principal Component Analysis (PCA), t-Distributed Stochastic Neighbour Embedding (t-SNE), and Uniform Manifold Approximation and Projection (UMAP); and, finally, classification using a nearest neighbours classifier. The results demonstrate that using feature extraction techniques, especially the combination of CT and MFCC with dimensionality reduction algorithms, yields highly efficient outcomes. The classification metrics (Accuracy, Recall, and F1-score) approach 99%, and the clustering metric is 0.61. The performance of the CT-UMAP combination stands out in the evaluation metrics.
Collapse
Affiliation(s)
- Robin Urrutia
- Instituto de Acústica, Facultad de Ciencias de la Ingeniería, Universidad Austral de Chile, Valdivia 5111187, Chile; (R.U.); (V.P.)
- Audio Mining Laboratory (AuMiLab), Instituto de Acústica, Universidad Austral de Chile, Valdivia 5111187, Chile;
| | - Diego Espejo
- Audio Mining Laboratory (AuMiLab), Instituto de Acústica, Universidad Austral de Chile, Valdivia 5111187, Chile;
| | - Natalia Evens
- Instituto de Anatomia, Histologia y Patologia, Facultad de Medicina, Universidad Austral de Chile, Valdivia 5111187, Chile; (N.E.); (M.G.)
| | - Montserrat Guerra
- Instituto de Anatomia, Histologia y Patologia, Facultad de Medicina, Universidad Austral de Chile, Valdivia 5111187, Chile; (N.E.); (M.G.)
| | - Thomas Sühn
- Department of Orthopaedic Surgery, Otto-von-Guericke University Magdeburg, 39120 Magdeburg, Germany;
- SURAG Medical GmbH, 39118 Magdeburg, Germany;
| | - Axel Boese
- INKA Innovation Laboratory for Image Guided Therapy, Otto-von-Guericke University Magdeburg, 39120 Magdeburg, Germany
| | - Christian Hansen
- Research Campus STIMULATE, Otto-von-Guericke University Magdeburg, 39106 Magdeburg, Germany;
| | - Patricio Fuentealba
- Instituto de Electricidad y Electrónica, Facultad de Ciencias de la Ingeniería, Universidad Austral de Chile, Valdivia 5111187, Chile;
| | | | - Victor Poblete
- Instituto de Acústica, Facultad de Ciencias de la Ingeniería, Universidad Austral de Chile, Valdivia 5111187, Chile; (R.U.); (V.P.)
- Audio Mining Laboratory (AuMiLab), Instituto de Acústica, Universidad Austral de Chile, Valdivia 5111187, Chile;
| |
Collapse
|
12
|
Varadi M, Velankar S. The impact of AlphaFold Protein Structure Database on the fields of life sciences. Proteomics 2023; 23:e2200128. [PMID: 36382391 DOI: 10.1002/pmic.202200128] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 11/07/2022] [Accepted: 11/08/2022] [Indexed: 09/06/2023]
Abstract
Arguably, 2020 was the year of high-accuracy protein structure predictions, with AlphaFold 2.0 achieving previously unseen accuracy in the Critical Assessment of Protein Structure Prediction (CASP). In 2021, DeepMind and EMBL-EBI developed the AlphaFold Protein Structure Database to make an unprecedented number of reliable protein structure predictions easily accessible to the broad scientific community. We provide a brief overview and describe the latest developments in the AlphaFold database. We highlight how the fields of data services, bioinformatics, structural biology, and drug discovery are directly affected by the influx of protein structure data. We also show examples of cutting-edge research that took advantage of the AlphaFold database. It is apparent that connections between various fields through protein structures are now possible, but the amount of data poses new challenges. Finally, we give an outlook regarding the future direction of the database, both in terms of data sets and new functionalities.
Collapse
Affiliation(s)
- Mihaly Varadi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Sameer Velankar
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| |
Collapse
|
13
|
Zheng LE, Barethiya S, Nordquist E, Chen J. Machine Learning Generation of Dynamic Protein Conformational Ensembles. Molecules 2023; 28:4047. [PMID: 37241789 PMCID: PMC10220786 DOI: 10.3390/molecules28104047] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 05/04/2023] [Accepted: 05/09/2023] [Indexed: 05/28/2023] Open
Abstract
Machine learning has achieved remarkable success across a broad range of scientific and engineering disciplines, particularly its use for predicting native protein structures from sequence information alone. However, biomolecules are inherently dynamic, and there is a pressing need for accurate predictions of dynamic structural ensembles across multiple functional levels. These problems range from the relatively well-defined task of predicting conformational dynamics around the native state of a protein, which traditional molecular dynamics (MD) simulations are particularly adept at handling, to generating large-scale conformational transitions connecting distinct functional states of structured proteins or numerous marginally stable states within the dynamic ensembles of intrinsically disordered proteins. Machine learning has been increasingly applied to learn low-dimensional representations of protein conformational spaces, which can then be used to drive additional MD sampling or directly generate novel conformations. These methods promise to greatly reduce the computational cost of generating dynamic protein ensembles, compared to traditional MD simulations. In this review, we examine recent progress in machine learning approaches towards generative modeling of dynamic protein ensembles and emphasize the crucial importance of integrating advances in machine learning, structural data, and physical principles to achieve these ambitious goals.
Collapse
Affiliation(s)
- Li-E Zheng
- Department of Gynecology, The First Affiliated Hospital of Fujian Medical University, Fuzhou 350005, China;
| | - Shrishti Barethiya
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (S.B.); (E.N.)
| | - Erik Nordquist
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (S.B.); (E.N.)
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (S.B.); (E.N.)
| |
Collapse
|
14
|
Zhang O, Haghighatlari M, Li J, Liu ZH, Namini A, Teixeira JMC, Forman-Kay JD, Head-Gordon T. Learning to evolve structural ensembles of unfolded and disordered proteins using experimental solution data. J Chem Phys 2023; 158:174113. [PMID: 37144719 PMCID: PMC10163956 DOI: 10.1063/5.0141474] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 04/11/2023] [Indexed: 05/06/2023] Open
Abstract
The structural characterization of proteins with a disorder requires a computational approach backed by experiments to model their diverse and dynamic structural ensembles. The selection of conformational ensembles consistent with solution experiments of disordered proteins highly depends on the initial pool of conformers, with currently available tools limited by conformational sampling. We have developed a Generative Recurrent Neural Network (GRNN) that uses supervised learning to bias the probability distributions of torsions to take advantage of experimental data types such as nuclear magnetic resonance J-couplings, nuclear Overhauser effects, and paramagnetic resonance enhancements. We show that updating the generative model parameters according to the reward feedback on the basis of the agreement between experimental data and probabilistic selection of torsions from learned distributions provides an alternative to existing approaches that simply reweight conformers of a static structural pool for disordered proteins. Instead, the biased GRNN, DynamICE, learns to physically change the conformations of the underlying pool of the disordered protein to those that better agree with experiments.
Collapse
Affiliation(s)
- Oufan Zhang
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | - Mojtaba Haghighatlari
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | - Jie Li
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | | | - Ashley Namini
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5S 1A8, Canada
| | | | | | | |
Collapse
|
15
|
Dixit R, Khambhati K, Supraja KV, Singh V, Lederer F, Show PL, Awasthi MK, Sharma A, Jain R. Application of machine learning on understanding biomolecule interactions in cellular machinery. BIORESOURCE TECHNOLOGY 2023; 370:128522. [PMID: 36565819 DOI: 10.1016/j.biortech.2022.128522] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 12/17/2022] [Accepted: 12/20/2022] [Indexed: 06/17/2023]
Abstract
Machine learning (ML) applications have become ubiquitous in all fields of research including protein science and engineering. Apart from protein structure and mutation prediction, scientists are focusing on knowledge gaps with respect to the molecular mechanisms involved in protein binding and interactions with other components in the experimental setups or the human body. Researchers are working on several wet-lab techniques and generating data for a better understanding of concepts and mechanics involved. The information like biomolecular structure, binding affinities, structure fluctuations and movements are enormous which can be handled and analyzed by ML. Therefore, this review highlights the significance of ML in understanding the biomolecular interactions while assisting in various fields of research such as drug discovery, nanomedicine, nanotoxicity and material science. Hence, the way ahead would be to force hand-in hand of laboratory work and computational techniques.
Collapse
Affiliation(s)
- Rewati Dixit
- Waste Treatment Laboratory, Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Haus-khas, New Delhi 110016, India
| | - Khushal Khambhati
- Department of Biosciences, School of Science, Indrashil University, Rajpur, Mehsana 382715, Gujarat, India
| | - Kolli Venkata Supraja
- Waste Treatment Laboratory, Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Haus-khas, New Delhi 110016, India
| | - Vijai Singh
- Department of Biosciences, School of Science, Indrashil University, Rajpur, Mehsana 382715, Gujarat, India
| | - Franziska Lederer
- Helmholtz-Zentrum Dresden-Rossendorf, Helmholtz Institute Freiberg for Resource Technology, Bautzner landstrasse 400, 01328 Dresden, Germany
| | - Pau-Loke Show
- Zhejiang Provincial Key Laboratory for Subtropical Water Environment and Marine Biological Resources Protection, Wenzhou University, Wenzhou 325035, China; Department of Sustainable Engineering, Saveetha School of Engineering, SIMATS, Chennai 602105, India; Department of Chemical and Environmental Engineering, University of Nottingham, Malaysia, 43500 Semenyih, Selangor Darul Ehsan, Malaysia
| | - Mukesh Kumar Awasthi
- College of Natural Resources and Environment, Northwest A&F University, Yangling 712100, China
| | - Abhinav Sharma
- Institute Theory of Polymers, Leibniz Institute for Polymer Research, Hohe Strasse 6, 01069 Dresden, Germany
| | - Rohan Jain
- Helmholtz-Zentrum Dresden-Rossendorf, Helmholtz Institute Freiberg for Resource Technology, Bautzner landstrasse 400, 01328 Dresden, Germany.
| |
Collapse
|
16
|
Teixeira JMC, Liu ZH, Namini A, Li J, Vernon RM, Krzeminski M, Shamandy AA, Zhang O, Haghighatlari M, Yu L, Head-Gordon T, Forman-Kay JD. IDPConformerGenerator: A Flexible Software Suite for Sampling the Conformational Space of Disordered Protein States. J Phys Chem A 2022; 126:5985-6003. [PMID: 36030416 PMCID: PMC9465686 DOI: 10.1021/acs.jpca.2c03726] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 08/08/2022] [Indexed: 11/29/2022]
Abstract
The power of structural information for informing biological mechanisms is clear for stable folded macromolecules, but similar structure-function insight is more difficult to obtain for highly dynamic systems such as intrinsically disordered proteins (IDPs) which must be described as structural ensembles. Here, we present IDPConformerGenerator, a flexible, modular open-source software platform for generating large and diverse ensembles of disordered protein states that builds conformers that obey geometric, steric, and other physical restraints on the input sequence. IDPConformerGenerator samples backbone phi (φ), psi (ψ), and omega (ω) torsion angles of relevant sequence fragments from loops and secondary structure elements extracted from folded protein structures in the RCSB Protein Data Bank and builds side chains from robust Monte Carlo algorithms using expanded rotamer libraries. IDPConformerGenerator has many user-defined options enabling variable fractional sampling of secondary structures, supports Bayesian models for assessing the agreement of IDP ensembles for consistency with experimental data, and introduces a machine learning approach to transform between internal and Cartesian coordinates with reduced error. IDPConformerGenerator will facilitate the characterization of disordered proteins to ultimately provide structural insights into these states that have key biological functions.
Collapse
Affiliation(s)
- João M. C. Teixeira
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Zi Hao Liu
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Ashley Namini
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Jie Li
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Robert M. Vernon
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Mickaël Krzeminski
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Alaa A. Shamandy
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
| | - Oufan Zhang
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Mojtaba Haghighatlari
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Lei Yu
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Teresa Head-Gordon
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Julie D. Forman-Kay
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
17
|
Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, Artificial Intelligence (AI), and Allostery. J Phys Chem B 2022; 126:6372-6383. [PMID: 35976160 PMCID: PMC9442638 DOI: 10.1021/acs.jpcb.2c04346] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/03/2022] [Indexed: 02/08/2023]
Abstract
AlphaFold has burst into our lives. A powerful algorithm that underscores the strength of biological sequence data and artificial intelligence (AI). AlphaFold has appended projects and research directions. The database it has been creating promises an untold number of applications with vast potential impacts that are still difficult to surmise. AI approaches can revolutionize personalized treatments and usher in better-informed clinical trials. They promise to make giant leaps toward reshaping and revamping drug discovery strategies, selecting and prioritizing combinations of drug targets. Here, we briefly overview AI in structural biology, including in molecular dynamics simulations and prediction of microbiota-human protein-protein interactions. We highlight the advancements accomplished by the deep-learning-powered AlphaFold in protein structure prediction and their powerful impact on the life sciences. At the same time, AlphaFold does not resolve the decades-long protein folding challenge, nor does it identify the folding pathways. The models that AlphaFold provides do not capture conformational mechanisms like frustration and allostery, which are rooted in ensembles, and controlled by their dynamic distributions. Allostery and signaling are properties of populations. AlphaFold also does not generate ensembles of intrinsically disordered proteins and regions, instead describing them by their low structural probabilities. Since AlphaFold generates single ranked structures, rather than conformational ensembles, it cannot elucidate the mechanisms of allosteric activating driver hotspot mutations nor of allosteric drug resistance. However, by capturing key features, deep learning techniques can use the single predicted conformation as the basis for generating a diverse ensemble.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational
Structural Biology Section, Frederick National
Laboratory for Cancer Research, Frederick, Maryland 21702, United States
- Department
of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Mingzhen Zhang
- Computational
Structural Biology Section, Frederick National
Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| | - Yonglan Liu
- Cancer
Innovation Laboratory, National Cancer Institute, Frederick, Maryland 21702, United States
| | - Hyunbum Jang
- Computational
Structural Biology Section, Frederick National
Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| |
Collapse
|
18
|
Gupta A, Dey S, Hicks A, Zhou HX. Artificial intelligence guided conformational mining of intrinsically disordered proteins. Commun Biol 2022; 5:610. [PMID: 35725761 PMCID: PMC9209487 DOI: 10.1038/s42003-022-03562-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/07/2022] [Indexed: 12/29/2022] Open
Abstract
Artificial intelligence recently achieved the breakthrough of predicting the three-dimensional structures of proteins. The next frontier is presented by intrinsically disordered proteins (IDPs), which, representing 30% to 50% of proteomes, readily access vast conformational space. Molecular dynamics (MD) simulations are promising in sampling IDP conformations, but only at extremely high computational cost. Here, we developed generative autoencoders that learn from short MD simulations and generate full conformational ensembles. An encoder represents IDP conformations as vectors in a reduced-dimensional latent space. The mean vector and covariance matrix of the training dataset are calculated to define a multivariate Gaussian distribution, from which vectors are sampled and fed to a decoder to generate new conformations. The ensembles of generated conformations cover those sampled by long MD simulations and are validated by small-angle X-ray scattering profile and NMR chemical shifts. This work illustrates the vast potential of artificial intelligence in conformational mining of IDPs.
Collapse
Affiliation(s)
- Aayush Gupta
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Souvik Dey
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Alan Hicks
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Huan-Xiang Zhou
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, 60607, USA.
- Department of Physics, University of Illinois at Chicago, Chicago, IL, 60607, USA.
| |
Collapse
|
19
|
Kulkarni P, Bhattacharya S, Achuthan S, Behal A, Jolly MK, Kotnala S, Mohanty A, Rangarajan G, Salgia R, Uversky V. Intrinsically Disordered Proteins: Critical Components of the Wetware. Chem Rev 2022; 122:6614-6633. [PMID: 35170314 PMCID: PMC9250291 DOI: 10.1021/acs.chemrev.1c00848] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Despite the wealth of knowledge gained about intrinsically disordered proteins (IDPs) since their discovery, there are several aspects that remain unexplored and, hence, poorly understood. A living cell is a complex adaptive system that can be described as a wetware─a metaphor used to describe the cell as a computer comprising both hardware and software and attuned to logic gates─capable of "making" decisions. In this focused Review, we discuss how IDPs, as critical components of the wetware, influence cell-fate decisions by wiring protein interaction networks to keep them minimally frustrated. Because IDPs lie between order and chaos, we explore the possibility that they can be modeled as attractors. Further, we discuss how the conformational dynamics of IDPs manifests itself as conformational noise, which can potentially amplify transcriptional noise to stochastically switch cellular phenotypes. Finally, we explore the potential role of IDPs in prebiotic evolution, in forming proteinaceous membrane-less organelles, in the origin of multicellularity, and in protein conformation-based transgenerational inheritance of acquired characteristics. Together, these ideas provide a new conceptual framework to discern how IDPs may perform critical biological functions despite their lack of structure.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Supriyo Bhattacharya
- Integrative Genomics Core, City of Hope National Medical Center, Duarte, CA, USA
| | - Srisairam Achuthan
- Division of Research Informatics, Center for Informatics, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Amita Behal
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Sourabh Kotnala
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
- Center for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Vladimir Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Institutskiy pereulok, 9, Dolgoprudny, Moscow region 141700, Russia
| |
Collapse
|
20
|
Kulkarni P, Leite VBP, Roy S, Bhattacharyya S, Mohanty A, Achuthan S, Singh D, Appadurai R, Rangarajan G, Weninger K, Orban J, Srivastava A, Jolly MK, Onuchic JN, Uversky VN, Salgia R. Intrinsically disordered proteins: Ensembles at the limits of Anfinsen's dogma. BIOPHYSICS REVIEWS 2022; 3:011306. [PMID: 38505224 PMCID: PMC10903413 DOI: 10.1063/5.0080512] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 02/17/2022] [Indexed: 03/21/2024]
Abstract
Intrinsically disordered proteins (IDPs) are proteins that lack rigid 3D structure. Hence, they are often misconceived to present a challenge to Anfinsen's dogma. However, IDPs exist as ensembles that sample a quasi-continuum of rapidly interconverting conformations and, as such, may represent proteins at the extreme limit of the Anfinsen postulate. IDPs play important biological roles and are key components of the cellular protein interaction network (PIN). Many IDPs can interconvert between disordered and ordered states as they bind to appropriate partners. Conformational dynamics of IDPs contribute to conformational noise in the cell. Thus, the dysregulation of IDPs contributes to increased noise and "promiscuous" interactions. This leads to PIN rewiring to output an appropriate response underscoring the critical role of IDPs in cellular decision making. Nonetheless, IDPs are not easily tractable experimentally. Furthermore, in the absence of a reference conformation, discerning the energy landscape representation of the weakly funneled IDPs in terms of reaction coordinates is challenging. To understand conformational dynamics in real time and decipher how IDPs recognize multiple binding partners with high specificity, several sophisticated knowledge-based and physics-based in silico sampling techniques have been developed. Here, using specific examples, we highlight recent advances in energy landscape visualization and molecular dynamics simulations to discern conformational dynamics and discuss how the conformational preferences of IDPs modulate their function, especially in phenotypic switching. Finally, we discuss recent progress in identifying small molecules targeting IDPs underscoring the potential therapeutic value of IDPs. Understanding structure and function of IDPs can not only provide new insight on cellular decision making but may also help to refine and extend Anfinsen's structure/function paradigm.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Vitor B. P. Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista (UNESP), São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal 741246, India
| | - Supriyo Bhattacharyya
- Translational Bioinformatics, Center for Informatics, Department of Computational and Quantitative Medicine, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Srisairam Achuthan
- Center for Informatics, Division of Research Informatics, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Divyoj Singh
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Rajeswari Appadurai
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | | - Anand Srivastava
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Jose N. Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, USA
| | | | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| |
Collapse
|
21
|
Clyde A, Galanie S, Kneller DW, Ma H, Babuji Y, Blaiszik B, Brace A, Brettin T, Chard K, Chard R, Coates L, Foster I, Hauner D, Kertesz V, Kumar N, Lee H, Li Z, Merzky A, Schmidt JG, Tan L, Titov M, Trifan A, Turilli M, Van Dam H, Chennubhotla SC, Jha S, Kovalevsky A, Ramanathan A, Head MS, Stevens R. High-Throughput Virtual Screening and Validation of a SARS-CoV-2 Main Protease Noncovalent Inhibitor. J Chem Inf Model 2022; 62:116-128. [PMID: 34793155 PMCID: PMC8610012 DOI: 10.1021/acs.jcim.1c00851] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Indexed: 12/27/2022]
Abstract
Despite the recent availability of vaccines against the acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the search for inhibitory therapeutic agents has assumed importance especially in the context of emerging new viral variants. In this paper, we describe the discovery of a novel noncovalent small-molecule inhibitor, MCULE-5948770040, that binds to and inhibits the SARS-Cov-2 main protease (Mpro) by employing a scalable high-throughput virtual screening (HTVS) framework and a targeted compound library of over 6.5 million molecules that could be readily ordered and purchased. Our HTVS framework leverages the U.S. supercomputing infrastructure achieving nearly 91% resource utilization and nearly 126 million docking calculations per hour. Downstream biochemical assays validate this Mpro inhibitor with an inhibition constant (Ki) of 2.9 μM (95% CI 2.2, 4.0). Furthermore, using room-temperature X-ray crystallography, we show that MCULE-5948770040 binds to a cleft in the primary binding site of Mpro forming stable hydrogen bond and hydrophobic interactions. We then used multiple μs-time scale molecular dynamics (MD) simulations and machine learning (ML) techniques to elucidate how the bound ligand alters the conformational states accessed by Mpro, involving motions both proximal and distal to the binding site. Together, our results demonstrate how MCULE-5948770040 inhibits Mpro and offers a springboard for further therapeutic design.
Collapse
Affiliation(s)
- Austin Clyde
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Stephanie Galanie
- Biosciences Division, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37831, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Daniel W. Kneller
- Neutron Scattering Division, Oak Ridge
National Laboratory, Oak Ridge, Tennessee 37831, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Heng Ma
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Yadu Babuji
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Ben Blaiszik
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Alexander Brace
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Thomas Brettin
- Computing Environment and Life Sciences Directorate,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Kyle Chard
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Ryan Chard
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Leighton Coates
- Neutron Scattering Division, Oak Ridge
National Laboratory, Oak Ridge, Tennessee 37831, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Ian Foster
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Darin Hauner
- Computational Biology Group, Biological Science Division,
Pacific Northwest National Laboratory, Richland, Washington
99352, United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Vlimos Kertesz
- Neutron Scattering Division, Oak Ridge
National Laboratory, Oak Ridge, Tennessee 37831, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Neeraj Kumar
- Computational Biology Group, Biological Science Division,
Pacific Northwest National Laboratory, Richland, Washington
99352, United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Hyungro Lee
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Zhuozhao Li
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Andre Merzky
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Jurgen G. Schmidt
- Bioscience Division, Los Alamos National
Laboratory, Los Alamos, New Mexico 87545, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Li Tan
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Mikhail Titov
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Anda Trifan
- University of Illinois at
Urbana-Champaign, Champaign, Illinois 61820, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Matteo Turilli
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- Computational Science Initiative,
Brookhaven National Laboratory, Upton, New York 11973,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Hubertus Van Dam
- Computational Science Initiative,
Brookhaven National Laboratory, Upton, New York 11973,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Srinivas C. Chennubhotla
- Department of Computational and Systems
Biology, University of Pittsburgh, Pittsburgh, Pennsylvania
15260, United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Shantenu Jha
- Department of Electrical and Computer Engineering,
Rutgers University, Piscataway, New Jersey 08854,
United States
- Computational Science Initiative,
Brookhaven National Laboratory, Upton, New York 11973,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Andrey Kovalevsky
- Second Target Station, Oak Ridge
National Laboratory, Oak Ridge, Tennessee 37831, United
States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Arvind Ramanathan
- Data Science and Learning Division,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Martha S. Head
- Joint Institute for Biological Sciences,
Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| | - Rick Stevens
- Department of Computer Science,
University of Chicago, Chicago, Illinois 60615,
United States
- Computing Environment and Life Sciences Directorate,
Argonne National Laboratory, Lemont, Illinois 60439,
United States
- National Virtual Biotechnology
Laboratory, Washington, District of Columbia 20585, United
States
| |
Collapse
|
22
|
Varadi M, Anyango S, Armstrong D, Berrisford J, Choudhary P, Deshpande M, Nadzirin N, Nair SS, Pravda L, Tanweer A, Al-Lazikani B, Andreini C, Barton GJ, Bednar D, Berka K, Blundell T, Brock KP, Carazo JM, Damborsky J, David A, Dey S, Dunbrack R, Recio JF, Fraternali F, Gibson T, Helmer-Citterich M, Hoksza D, Hopf T, Jakubec D, Kannan N, Krivak R, Kumar M, Levy ED, London N, Macias JR, Srivatsan MM, Marks DS, Martens L, McGowan SA, McGreig JE, Modi V, Parra RG, Pepe G, Piovesan D, Prilusky J, Putignano V, Radusky LG, Ramasamy P, Rausch AO, Reuter N, Rodriguez LA, Rollins NJ, Rosato A, Rubach P, Serrano L, Singh G, Skoda P, Sorzano COS, Stourac J, Sulkowska JI, Svobodova R, Tichshenko N, Tosatto SCE, Vranken W, Wass MN, Xue D, Zaidman D, Thornton J, Sternberg M, Orengo C, Velankar S. PDBe-KB: collaboratively defining the biological context of structural data. Nucleic Acids Res 2022; 50:D534-D542. [PMID: 34755867 PMCID: PMC8728252 DOI: 10.1093/nar/gkab988] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/01/2021] [Accepted: 10/14/2021] [Indexed: 12/15/2022] Open
Abstract
The Protein Data Bank in Europe - Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive.
Collapse
|
23
|
Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, Yuan D, Stroe O, Wood G, Laydon A, Žídek A, Green T, Tunyasuvunakool K, Petersen S, Jumper J, Clancy E, Green R, Vora A, Lutfi M, Figurnov M, Cowie A, Hobbs N, Kohli P, Kleywegt G, Birney E, Hassabis D, Velankar S. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res 2022; 50:D439-D444. [PMID: 34791371 PMCID: PMC8728224 DOI: 10.1093/nar/gkab1061] [Citation(s) in RCA: 4001] [Impact Index Per Article: 1333.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 10/14/2021] [Accepted: 10/19/2021] [Indexed: 12/16/2022] Open
Abstract
The AlphaFold Protein Structure Database (AlphaFold DB, https://alphafold.ebi.ac.uk) is an openly accessible, extensive database of high-accuracy protein-structure predictions. Powered by AlphaFold v2.0 of DeepMind, it has enabled an unprecedented expansion of the structural coverage of the known protein-sequence space. AlphaFold DB provides programmatic access to and interactive visualization of predicted atomic coordinates, per-residue and pairwise model-confidence estimates and predicted aligned errors. The initial release of AlphaFold DB contains over 360,000 predicted structures across 21 model-organism proteomes, which will soon be expanded to cover most of the (over 100 million) representative sequences from the UniRef90 data set.
Collapse
Affiliation(s)
- Mihaly Varadi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Stephen Anyango
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Mandar Deshpande
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Sreenath Nair
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Cindy Natassia
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Galabina Yordanova
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - David Yuan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Oana Stroe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Gemma Wood
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Gerard Kleywegt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | | | - Sameer Velankar
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| |
Collapse
|
24
|
Montepietra D, Cecconi C, Brancolini G. Combining enhanced sampling and deep learning dimensionality reduction for the study of the heat shock protein B8 and its pathological mutant K141E. RSC Adv 2022; 12:31996-32011. [PMID: 36380940 PMCID: PMC9641792 DOI: 10.1039/d2ra04913a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Accepted: 10/28/2022] [Indexed: 11/11/2022] Open
Abstract
The biological functions of proteins closely depend on their conformational dynamics. This aspect is especially relevant for intrinsically disordered proteins (IDP) for which structural ensembles often offer more useful representations than individual conformations. Here we employ extensive enhanced sampling temperature replica-exchange atomistic simulations (TREMD) and deep learning dimensionality reduction to study the conformational ensembles of the human heat shock protein B8 and its pathological mutant K141E, for which no experimental 3D structures are available. First, we combined homology modelling with TREMD to generate high-dimensional data sets of 3D structures. Then, we employed a recently developed machine learning based post-processing algorithm, EncoderMap, to project the large conformational data sets into meaningful two-dimensional maps that helped us interpret the data and extract the most significant conformations adopted by both proteins during TREMD. These studies provide the first 3D structural characterization of HSPB8 and reveal the effects of the pathogenic K141E mutation on its conformational ensembles. In particular, this missense mutation appears to increase the compactness of the protein and its structural variability, at the same time rearranging the hydrophobic patches exposed on the protein surface. These results offer the possibility of rationalizing the pathogenic effects of the K141E mutation in terms of conformational changes. The study provides the first 3D structural characterization of HSPB8 and its K141E mutant: extensive TREMD are combined with a deep learning algorithm to rationalize the disordered ensemble of structures adopted by each variant.![]()
Collapse
Affiliation(s)
- Daniele Montepietra
- Department of Physics, Computer Science and Mathematics, University of Modena and Reggio Emilia, Via Campi 213/A, 41100 Modena, Italy
- Istituto Nanoscienze – CNR-NANO, Center S3, Via G. Campi 213/A, 41100 Modena, Italy
| | - Ciro Cecconi
- Department of Physics, Computer Science and Mathematics, University of Modena and Reggio Emilia, Via Campi 213/A, 41100 Modena, Italy
- Istituto Nanoscienze – CNR-NANO, Center S3, Via G. Campi 213/A, 41100 Modena, Italy
| | - Giorgia Brancolini
- Istituto Nanoscienze – CNR-NANO, Center S3, Via G. Campi 213/A, 41100 Modena, Italy
| |
Collapse
|
25
|
Sawade K, Peter C. Multiscale simulations of protein and membrane systems. Curr Opin Struct Biol 2021; 72:203-208. [PMID: 34953308 DOI: 10.1016/j.sbi.2021.11.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2021] [Revised: 11/01/2021] [Accepted: 11/10/2021] [Indexed: 02/07/2023]
Abstract
Classical multiscale simulations are perfectly suited to investigate biological soft matter systems. Owing to the bridging between microscopically realistic and lower-resolution models or the integration of a hierarchy of subsystems, one gets access to biologically relevant system sizes and timescales. In recent years, increasingly complex systems and processes have come into focus such as multidomain proteins, phase separation processes in biopolymer solutions, multicomponent biomembranes, or multiprotein complexes up to entire viruses. The review shows factors that have contributed to this progress - from improved models to machine-learning-based analysis and scale-bridging methods.
Collapse
Affiliation(s)
- Kevin Sawade
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, 78 457, Konstanz, Germany
| | - Christine Peter
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, 78 457, Konstanz, Germany.
| |
Collapse
|
26
|
Sacquin-Mora S, Prévost C. When Order Meets Disorder: Modeling and Function of the Protein Interface in Fuzzy Complexes. Biomolecules 2021; 11:1529. [PMID: 34680162 PMCID: PMC8533853 DOI: 10.3390/biom11101529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/11/2021] [Accepted: 10/12/2021] [Indexed: 11/30/2022] Open
Abstract
The degree of proteins structural organization ranges from highly structured, compact folding to intrinsic disorder, where each degree of self-organization corresponds to specific functions: well-organized structural motifs in enzymes offer a proper environment for precisely positioned functional groups to participate in catalytic reactions; at the other end of the self-organization spectrum, intrinsically disordered proteins act as binding hubs via the formation of multiple, transient and often non-specific interactions. This review focusses on cases where structurally organized proteins or domains associate with highly disordered protein chains, leading to the formation of interfaces with varying degrees of fuzziness. We present a review of the computational methods developed to provide us with information on such fuzzy interfaces, and how they integrate experimental information. The discussion focusses on two specific cases, microtubules and homologous recombination nucleoprotein filaments, where a network of intrinsically disordered tails exerts regulatory function in recruiting partner macromolecules, proteins or DNA and tuning the atomic level association. Notably, we show how computational approaches such as molecular dynamics simulations can bring new knowledge to help bridging the gap between experimental analysis, that mostly concerns ensemble properties, and the behavior of individual disordered protein chains that contribute to regulation functions.
Collapse
Affiliation(s)
- Sophie Sacquin-Mora
- CNRS, Laboratoire de Biochimie Théorique, UPR9080, Université de Paris, 13 Rue Pierre et Marie Curie, 75005 Paris, France;
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, PSL Research University, 75006 Paris, France
| | - Chantal Prévost
- CNRS, Laboratoire de Biochimie Théorique, UPR9080, Université de Paris, 13 Rue Pierre et Marie Curie, 75005 Paris, France;
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, PSL Research University, 75006 Paris, France
| |
Collapse
|
27
|
Fatafta H, Samantray S, Sayyed-Ahmad A, Coskuner-Weber O, Strodel B. Molecular simulations of IDPs: From ensemble generation to IDP interactions leading to disorder-to-order transitions. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2021; 183:135-185. [PMID: 34656328 DOI: 10.1016/bs.pmbts.2021.06.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/26/2023]
Abstract
Intrinsically disordered proteins (IDPs) lack a well-defined three-dimensional structure but do exhibit some dynamical and structural ordering. The structural plasticity of IDPs indicates that entropy-driven motions are crucial for their function. Many IDPs undergo function-related disorder-to-order transitions upon by their interaction with specific binding partners. Approaches that are based on both experimental and theoretical tools enable the biophysical characterization of IDPs. Molecular simulations provide insights into IDP structural ensembles and disorder-to-order transition mechanisms. However, such studies depend strongly on the chosen force field parameters and simulation techniques. In this chapter, we provide an overview of IDP characteristics, review all-atom force fields recently developed for IDPs, and present molecular dynamics-based simulation methods that allow IDP ensemble generation as well as the characterization of disorder-to-order transitions. In particular, we introduce metadynamics, replica exchange molecular dynamics simulations, and also kinetic models resulting from Markov State modeling, and provide various examples for the successful application of these simulation methods to IDPs.
Collapse
Affiliation(s)
- Hebah Fatafta
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany
| | - Suman Samantray
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany; AICES Graduate School, RWTH Aachen University, Aachen, Germany
| | | | - Orkid Coskuner-Weber
- Molecular Biotechnology, Turkish-German University, Sahinkaya Caddesi, Istanbul, Turkey
| | - Birgit Strodel
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany; Institute of Theoretical and Computational Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
| |
Collapse
|