Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Degiacomi MT. Coupling Molecular Dynamics and Deep Learning to Mine Protein Conformational Space. Structure 2019;27:1034-1040.e3. [PMID: 31031199 DOI: 10.1016/j.str.2019.03.018] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Received: 07/03/2018] [Revised: 01/25/2019] [Accepted: 03/25/2019] [Indexed: 01/09/2023]

Cited by Other Article(s)

Ray Chaudhuri N, Ghosh Dastidar S. Adaptive Workflows of Machine Learning Illuminate the Sequential Operation Mechanism of the TAK1's Allosteric Network. Biochemistry 2024;63:1474-1492. [PMID: 38743619 DOI: 10.1021/acs.biochem.3c00643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/16/2024]

Abstract

Allostery is a fundamental mechanism driving biomolecular processes that holds significant therapeutic concern. Our study rigorously investigates how two distinct machine-learning algorithms uniquely classify two already close-to-active DFG-in states of TAK1, differing just by the presence or absence of its allosteric activator TAB1, from an ensemble mixture of conformations (obtained from 2.4 μs molecular dynamics (MD) simulations). The novelty, however, lies in understanding the deeper algorithmic potentials to systematically derive a diverse set of differential residue connectivity features that reconstruct the essential mechanistic architecture for TAK1-TAB1 allostery in such a close-to-active biochemical scenario. While the recursive, random forest-based workflow displays the potential of conducting discretized, hierarchical derivation of allosteric features, a multilayer perceptron-based approach gains considerable efficacy in revealing fluid connected patterns of features when hybridized with mutual information scoring. Interestingly, both pipelines benchmark similar directions of functional conformational changes for TAK1's activation. The findings significantly advance the depth of mechanistic understanding by highlighting crucial activation signatures along a directed C-lobe → activation loop → ATP pocket channel of information flow, including (1) the αF-αE biterminal alignments and (2) the "catalytic" drift of the activation loop toward kinase active site. Besides, some novel allosteric hotspots (K253, Y206, N189, etc.) are further recognized as TAB1 sensors, transducers, and responders, including a benchmark E70 mutation site, precisely mapping the important structural segments for sequential allosteric execution. 
Hence, our work demonstrates how to navigate through greater structural depths and dimensions of dynamic allosteric machineries just by leveraging standard ML methods in suitable streamlined workflows adaptive to the specific system and objectives.

Basciu A, Athar M, Kurt H, Neville C, Malloci G, Muredda FC, Bosin A, Ruggerone P, Bonvin AMJJ, Vargiu AV. Predicting binding events in very flexible, allosteric, multi-domain proteins. bioRxiv 2024:2024.06.02.597018. [PMID: 38895346 PMCID: PMC11185556 DOI: 10.1101/2024.06.02.597018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/21/2024]

Lensink MF, Brysbaert G, Raouraoua N, Bates PA, Giulini M, Honorato RV, van Noort C, Teixeira JMC, Bonvin AMJJ, Kong R, Shi H, Lu X, Chang S, Liu J, Guo Z, Chen X, Morehead A, Roy RS, Wu T, Giri N, Quadir F, Chen C, Cheng J, Del Carpio CA, Ichiishi E, Rodriguez‐Lumbreras LA, Fernandez‐Recio J, Harmalkar A, Chu L, Canner S, Smanta R, Gray JJ, Li H, Lin P, He J, Tao H, Huang S, Roel‐Touris J, Jimenez‐Garcia B, Christoffer CW, Jain AJ, Kagaya Y, Kannan H, Nakamura T, Terashi G, Verburgt JC, Zhang Y, Zhang Z, Fujuta H, Sekijima M, Kihara D, Khan O, Kotelnikov S, Ghani U, Padhorny D, Beglov D, Vajda S, Kozakov D, Negi SS, Ricciardelli T, Barradas‐Bautista D, Cao Z, Chawla M, Cavallo L, Oliva R, Yin R, Cheung M, Guest JD, Lee J, Pierce BG, Shor B, Cohen T, Halfon M, Schneidman‐Duhovny D, Zhu S, Yin R, Sun Y, Shen Y, Maszota‐Zieleniak M, Bojarski KK, Lubecka EA, Marcisz M, Danielsson A, Dziadek L, Gaardlos M, Gieldon A, Liwo A, Samsonov SA, Slusarz R, Zieba K, Sieradzan AK, Czaplewski C, Kobayashi S, Miyakawa Y, Kiyota Y, Takeda‐Shitaka M, Olechnovic K, Valancauskas L, Dapkunas J, Venclovas C, Wallner B, Yang L, Hou C, He X, Guo S, Jiang S, Ma X, Duan R, Qui L, Xu X, Zou X, Velankar S, Wodak SJ. Impact of AlphaFold on structure prediction of protein complexes: The CASP15-CAPRI experiment. Proteins 2023;91:1658-1683. [PMID: 37905971 PMCID: PMC10841881 DOI: 10.1002/prot.26609] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 07/07/2023] [Revised: 09/22/2023] [Accepted: 09/28/2023] [Indexed: 11/02/2023]

Affiliation(s)

Marc F. Lensink Univ. Lille, CNRS, UMR8576 – UGSF – Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France
Guillaume Brysbaert Univ. Lille, CNRS, UMR8576 – UGSF – Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France
Nessim Raouraoua Univ. Lille, CNRS, UMR8576 – UGSF – Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France
Paul A. Bates Biomolecular Modeling Laboratory, The Francis Crick Institute, London, UK
Marco Giulini Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht, The Netherlands
Rodrigo V. Honorato Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht, The Netherlands
Charlotte van Noort Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht, The Netherlands
Joao M. C. Teixeira Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht, The Netherlands
Alexandre M. J. J. Bonvin Bijvoet Center for Biomolecular Research, Faculty of Science – Chemistry, Utrecht University, Utrecht, The Netherlands
Ren Kong Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Hang Shi Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Xufeng Lu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Shan Chang Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Jian Liu Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Zhiye Guo Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Xiao Chen Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Alex Morehead Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Raj S. Roy Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Tianqi Wu Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Nabin Giri Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Farhan Quadir Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Chen Chen Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Jianlin Cheng Dept. of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Carlos A. Del Carpio Choju‐Medical Institute, Fukushimuta Hospital, Toyohashi‐City, Aichi‐ken, Japan
Eichiro Ichiishi International University of Health and Welfare (IUHV Hospital), Nasushiobara‐City, Japan
Luis A. Rodriguez‐Lumbreras Instituto de Ciencias de la Vida y del Vino (ICVV), CSIC ‐ Universidad de La Rioja ‐ Gobierno de La Rioja, Logrono, Spain; Barcelona Supercomputing Center (BSC), Barcelona, Spain
Juan Fernandez‐Recio Instituto de Ciencias de la Vida y del Vino (ICVV), CSIC ‐ Universidad de La Rioja ‐ Gobierno de La Rioja, Logrono, Spain; Barcelona Supercomputing Center (BSC), Barcelona, Spain
Ameya Harmalkar Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Lee‐Shin Chu Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Sam Canner Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Rituparna Smanta Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Jeffrey J. Gray Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA; Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
Hao Li School of Physics, Huazhong University of Science and Technology, Wuhan, China
Peicong Lin School of Physics, Huazhong University of Science and Technology, Wuhan, China
Jiahua He School of Physics, Huazhong University of Science and Technology, Wuhan, China
Huanyu Tao School of Physics, Huazhong University of Science and Technology, Wuhan, China
Sheng‐You Huang School of Physics, Huazhong University of Science and Technology, Wuhan, China
Jorge Roel‐Touris Protein Design and Modeling Lab, Dept. of Structural Biology, Molecular Biology Institute of Barcelona (IBMB‐CSIC), Barcelona, Spain
Brian Jimenez‐Garcia Zymvol Biomodeling, Barcelona, Spain
Charles W. Christoffer Dept. of Computer Science, Purdue University, West Lafayette, Indiana, USA
Anika J. Jain Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Yuki Kagaya Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Harini Kannan Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA; Dept. of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
Tsukasa Nakamura Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Genki Terashi Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Jacob C. Verburgt Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Yuanyuan Zhang Dept. of Computer Science, Purdue University, West Lafayette, Indiana, USA
Zicong Zhang Dept. of Computer Science, Purdue University, West Lafayette, Indiana, USA
Hayato Fujuta Dept. of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
Masakazu Sekijima Dept. of Computer Science, Tokyo Institute of Technology, Yokohama, Japan
Daisuke Kihara Dept. of Computer Science, Purdue University, West Lafayette, Indiana, USA; Dept. of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Omeir Khan Boston University, Boston, Massachusetts, USA
Sergei Kotelnikov Stony Brook University, New York City, New York, USA
Usman Ghani Boston University, Boston, Massachusetts, USA
Dzmitry Padhorny Stony Brook University, New York City, New York, USA
Dmitri Beglov Boston University, Boston, Massachusetts, USA
Sandor Vajda Boston University, Boston, Massachusetts, USA
Dima Kozakov Stony Brook University, New York City, New York, USA
Surendra S. Negi Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, Texas, USA
Tiziana Ricciardelli King Abdullah University of Science and Technology (KAUST), Saudi Arabia
Didier Barradas‐Bautista King Abdullah University of Science and Technology (KAUST), Saudi Arabia
Zhen Cao King Abdullah University of Science and Technology (KAUST), Saudi Arabia
Mohit Chawla King Abdullah University of Science and Technology (KAUST), Saudi Arabia
Luigi Cavallo King Abdullah University of Science and Technology (KAUST), Saudi Arabia; Department of Chemistry and Biology, University of Salerno, Fisciano, Italy
Romina Oliva University of Naples “Parthenope”, Naples, Italy
Rui Yin University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA; Dept. of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
Melyssa Cheung University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA; Dept. of Chemistry and Biochemistry, University of Maryland, College Park, Maryland, USA
Johnathan D. Guest University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA; Dept. of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
Jessica Lee University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA; Dept. of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
Brian G. Pierce University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA; Dept. of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
Ben Shor School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Tomer Cohen School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Matan Halfon School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Dina Schneidman‐Duhovny School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Shaowen Zhu Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas, USA
Rujie Yin Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas, USA
Yuanfei Sun Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas, USA
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas, USA; Department of Computer Science and Engineering, Texas A&M University, College Station, Texas, USA; Institute of Biosciences and Technology and Department of Translational Medical Sciences, Texas A&M University, Houston, Texas, USA
Martyna Maszota‐Zieleniak University of Gdansk, Gdansk, Poland
Krzysztof K. Bojarski Technical University of Gdansk, Gdansk, Poland
Emilia A. Lubecka Technical University of Gdansk, Gdansk, Poland
Mateusz Marcisz University of Gdansk, Gdansk, Poland
Annemarie Danielsson University of Gdansk, Gdansk, Poland
Lukasz Dziadek University of Gdansk, Gdansk, Poland
Margrethe Gaardlos University of Gdansk, Gdansk, Poland
Artur Gieldon University of Gdansk, Gdansk, Poland
Adam Liwo University of Gdansk, Gdansk, Poland
Sergey A. Samsonov University of Gdansk, Gdansk, Poland
Rafal Slusarz University of Gdansk, Gdansk, Poland
Karolina Zieba University of Gdansk, Gdansk, Poland
Adam K. Sieradzan University of Gdansk, Gdansk, Poland
Cezary Czaplewski University of Gdansk, Gdansk, Poland
Shinpei Kobayashi School of Pharmacy, Kitasato University, Minato‐ku, Tokyo, Japan
Yuta Miyakawa School of Pharmacy, Kitasato University, Minato‐ku, Tokyo, Japan
Yasuomi Kiyota School of Pharmacy, Kitasato University, Minato‐ku, Tokyo, Japan
Mayuko Takeda‐Shitaka School of Pharmacy, Kitasato University, Minato‐ku, Tokyo, Japan
Kliment Olechnovic Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Lukas Valancauskas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Justas Dapkunas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Ceslovas Venclovas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Bjorn Wallner Bioinformatics Division, Department of Physics, Chemistry, and Biology, Linkoping University, Linköping, Sweden
Lin Yang National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin, China; School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, New South Wales, Australia
Chengyu Hou School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, China
Xiaodong He National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin, China; Shenzhen STRONG Advanced Materials Research Institute Col, Ltd, Shenzhen, People's Republic of China
Shuai Guo National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin, China
Shenda Jiang National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin, China
Xiaoliang Ma National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin, China
Rui Duan Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Liming Qui Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Xianjin Xu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Xiaoqin Zou Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA; Dept. of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA; Dept. of Biochemistry, University of Missouri, Columbia, Missouri, USA; Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA
Sameer Velankar Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL‐EBI), Hinxton, Cambridge, UK
Shoshana J. Wodak VIB‐VUB Center for Structural Biology, Brussels, Belgium

López-Correa JM, König C, Vellido A. GPCR molecular dynamics forecasting using recurrent neural networks. Sci Rep 2023;13:20995. [PMID: 38017062 PMCID: PMC10684758 DOI: 10.1038/s41598-023-48346-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2023] [Accepted: 11/25/2023] [Indexed: 11/30/2023]

Zhu J, Li Z, Tong H, Lu Z, Zhang N, Wei T, Chen HF. Phanto-IDP: compact model for precise intrinsically disordered protein backbone generation and enhanced sampling. Brief Bioinform 2023;25:bbad429. [PMID: 38018910 PMCID: PMC10783862 DOI: 10.1093/bib/bbad429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2023] [Revised: 09/21/2023] [Accepted: 11/05/2023] [Indexed: 11/30/2023]

Abstract

The biological function of proteins is determined not only by their static structures but also by the dynamic properties of their conformational ensembles. Numerous high-accuracy static structure prediction tools have been recently developed based on deep learning; however, there remains a lack of efficient and accurate methods for exploring protein dynamic conformations. Traditionally, studies concerning protein dynamics have relied on molecular dynamics (MD) simulations, which incur significant computational costs for all-atom precision and struggle to adequately sample conformational spaces with high energy barriers. To overcome these limitations, various enhanced sampling techniques have been developed to accelerate sampling in MD. Traditional enhanced sampling approaches like replica exchange molecular dynamics (REMD) and frontier expansion sampling (FEXS) often follow the MD simulation approach and still cost a lot of computational resources and time. Variational autoencoders (VAEs), as a classic deep generative model, are not restricted by potential energy landscapes and can explore conformational spaces more efficiently than traditional methods. However, VAEs often face challenges in generating reasonable conformations for complex proteins, especially intrinsically disordered proteins (IDPs), which limits their application as an enhanced sampling method. In this study, we presented a novel deep learning model (named Phanto-IDP) that utilizes a graph-based encoder to extract protein features and a transformer-based decoder combined with variational sampling to generate highly accurate protein backbones. Ten IDPs and four structured proteins were used to evaluate the sampling ability of Phanto-IDP. 
The results demonstrate that Phanto-IDP has high fidelity and diversity in the generated conformation ensembles, making it a suitable tool for enhancing the efficiency of MD simulation, generating broader protein conformational space and a continuous protein transition path.

Affiliation(s)

Junjie Zhu State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Zhengxin Li State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Haowei Tong State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Zhouyu Lu State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Ningjie Zhang State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Ting Wei State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
Hai-Feng Chen State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China

Chen SH, Weiss KL, Stanley C, Bhowmik D. Structural characterization of an intrinsically disordered protein complex using integrated small-angle neutron scattering and computing. Protein Sci 2023;32:e4772. [PMID: 37646172 PMCID: PMC10503416 DOI: 10.1002/pro.4772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Revised: 08/22/2023] [Accepted: 08/27/2023] [Indexed: 09/01/2023]

Zhang L, Wang S, Hou J, Si D, Zhu J, Cao R. ComplexQA: a deep graph learning approach for protein complex structure assessment. Brief Bioinform 2023;24:bbad287. [PMID: 37930021 DOI: 10.1093/bib/bbad287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2023] [Revised: 05/09/2023] [Accepted: 07/24/2023] [Indexed: 11/07/2023]

Chen Z, Liu N, Huang Y, Min X, Zeng X, Ge S, Zhang J, Xia N. PointDE: Protein Docking Evaluation Using 3D Point Cloud Neural Network. IEEE/ACM Trans Comput Biol Bioinform 2023;20:3128-3138. [PMID: 37220029 DOI: 10.1109/tcbb.2023.3279019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/25/2023]

Xiao S, Song Z, Tian H, Tao P. Assessments of Variational Autoencoder in Protein Conformation Exploration. J Comput Biophys Chem 2023;22:489-501. [PMID: 38826699 PMCID: PMC11138204 DOI: 10.1142/s2737416523500217] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 06/04/2024]

Kubečka J, Knattrup Y, Engsvang M, Jensen AB, Ayoubi D, Wu H, Christiansen O, Elm J. Current and future machine learning approaches for modeling atmospheric cluster formation. Nat Comput Sci 2023;3:495-503. [PMID: 38177415 DOI: 10.1038/s43588-023-00435-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/21/2022] [Accepted: 03/16/2023] [Indexed: 01/06/2024]

Zheng LE, Barethiya S, Nordquist E, Chen J. Machine Learning Generation of Dynamic Protein Conformational Ensembles. Molecules 2023;28:4047. [PMID: 37241789 PMCID: PMC10220786 DOI: 10.3390/molecules28104047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2023] [Revised: 05/04/2023] [Accepted: 05/09/2023] [Indexed: 05/28/2023]

Wodak SJ, Vajda S, Lensink MF, Kozakov D, Bates PA. Critical Assessment of Methods for Predicting the 3D Structure of Proteins and Protein Complexes. Annu Rev Biophys 2023;52:183-206. [PMID: 36626764 PMCID: PMC10885158 DOI: 10.1146/annurev-biophys-102622-084607] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Indexed: 01/11/2023]

Zhang O, Haghighatlari M, Li J, Liu ZH, Namini A, Teixeira JMC, Forman-Kay JD, Head-Gordon T. Learning to evolve structural ensembles of unfolded and disordered proteins using experimental solution data. J Chem Phys 2023;158:174113. [PMID: 37144719 PMCID: PMC10163956 DOI: 10.1063/5.0141474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2023] [Accepted: 04/11/2023] [Indexed: 05/06/2023]

Sun F, Kadupitiya J, Jadhao V. Probing Accuracy-Speedup Tradeoff in Machine Learning Surrogates for Molecular Dynamics Simulations. J Chem Theory Comput 2023. [PMID: 37094180 DOI: 10.1021/acs.jctc.2c01282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/26/2023]

Zhu JJ, Zhang NJ, Wei T, Chen HF. Enhancing Conformational Sampling for Intrinsically Disordered and Ordered Proteins by Variational Autoencoder. Int J Mol Sci 2023;24:ijms24086896. [PMID: 37108059 PMCID: PMC10138423 DOI: 10.3390/ijms24086896] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 02/03/2023] [Revised: 03/26/2023] [Accepted: 03/27/2023] [Indexed: 04/29/2023]

Abstract

Intrinsically disordered proteins (IDPs), which have no fixed three-dimensional structure under physiological conditions, account for more than 50% of the human proteome and are closely associated with tumors, cardiovascular diseases, and neurodegeneration. Because of their conformational diversity, conventional experimental methods of structural biology, such as NMR, X-ray diffraction, and cryo-EM, are unable to capture their conformational ensembles. Molecular dynamics (MD) simulation can sample dynamic conformations at the atomic level and has become an effective method for studying the structure and function of IDPs. However, its high computational cost prevents MD simulation from being widely used for IDP conformational sampling. In recent years, significant progress in artificial intelligence has made it possible to solve the conformational reconstruction problem for IDPs with fewer computational resources. Here, based on short MD simulations of different IDP systems, we use variational autoencoders (VAEs) to achieve generative reconstruction of IDP structures and to include a wider range of sampled conformations from longer simulations. Compared with generative autoencoders (AEs), VAEs add an inference layer between the encoder and decoder in the latent space, which can cover the conformational landscape of IDPs more comprehensively and achieve the effect of enhanced sampling. Through experimental verification, the Cα RMSD between VAE-generated and MD-sampled conformations in the 5 IDP test systems was significantly lower than that of the AE, and the Spearman correlation coefficient on the structure was higher than that of the AE. VAEs also achieve excellent performance on structured proteins. In summary, VAEs can be used to effectively sample protein structures.

Ramírez-Palacios C, Marrink SJ. Super High-Throughput Screening of Enzyme Variants by Spectral Graph Convolutional Neural Networks. J Chem Theory Comput 2023. [PMID: 36961994 PMCID: PMC10373491 DOI: 10.1021/acs.jctc.2c01227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/26/2023]

Dutagaci B, Duan B, Qiu C, Kaplan CD, Feig M. Characterization of RNA polymerase II trigger loop mutations using molecular dynamics simulations and machine learning. PLoS Comput Biol 2023;19:e1010999. [PMID: 36947548 PMCID: PMC10069792 DOI: 10.1371/journal.pcbi.1010999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/15/2022] [Revised: 04/03/2023] [Accepted: 03/06/2023] [Indexed: 03/23/2023]

Banerjee A, Saha S, Tvedt NC, Yang LW, Bahar I. Mutually beneficial confluence of structure-based modeling of protein dynamics and machine learning methods. Curr Opin Struct Biol 2023;78:102517. [PMID: 36587424 PMCID: PMC10038760 DOI: 10.1016/j.sbi.2022.102517] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Received: 08/18/2022] [Revised: 11/19/2022] [Accepted: 11/22/2022] [Indexed: 12/31/2022]

Protein Function Analysis through Machine Learning. Biomolecules 2022;12:biom12091246. [PMID: 36139085 PMCID: PMC9496392 DOI: 10.3390/biom12091246] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open

Ton AT, Pandey M, Smith JR, Ban F, Fernandez M, Cherkasov A. Targeting SARS-CoV-2 Papain-Like Protease in the Post-Vaccine Era. Trends Pharmacol Sci 2022;43:906-919. [PMID: 36114026 PMCID: PMC9399131 DOI: 10.1016/j.tips.2022.08.008] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 08/10/2022] [Accepted: 08/19/2022] [Indexed: 11/29/2022]

Fu X, Bates PA. Application of deep learning methods: From molecular modelling to patient classification. Exp Cell Res 2022;418:113278. [PMID: 35810775 DOI: 10.1016/j.yexcr.2022.113278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 06/16/2022] [Accepted: 07/05/2022] [Indexed: 11/28/2022]

Villalobos-Alva J, Ochoa-Toledo L, Villalobos-Alva MJ, Aliseda A, Pérez-Escamirosa F, Altamirano-Bustamante NF, Ochoa-Fernández F, Zamora-Solís R, Villalobos-Alva S, Revilla-Monsalve C, Kemper-Valverde N, Altamirano-Bustamante MM. Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field. Front Bioeng Biotechnol 2022;10:788300. [PMID: 35875501 PMCID: PMC9301016 DOI: 10.3389/fbioe.2022.788300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 05/25/2022] [Indexed: 11/23/2022] Open

Abstract

Proteins are some of the most fascinating and challenging molecules in the universe, and they pose a big challenge for artificial intelligence. The implementation of machine learning/AI in protein science gives rise to a world of knowledge adventures in the workhorse of the cell and proteome homeostasis, which are essential for making life possible. This opens up epistemic horizons thanks to a coupling of human tacit–explicit knowledge with machine learning power, the benefits of which are already tangible, such as important advances in protein structure prediction. Moreover, the driving force behind the protein processes of self-organization, adjustment, and fitness requires a space corresponding to gigabytes of life data in its order of magnitude. There are many tasks such as novel protein design, protein folding pathways, and synthetic metabolic routes, as well as protein-aggregation mechanisms, pathogenesis of protein misfolding and disease, and proteostasis networks that are currently unexplored or unrevealed. In this systematic review and biochemical meta-analysis, we aim to contribute to bridging the gap between what we call binomial artificial intelligence (AI) and protein science (PS), a growing research enterprise with exciting and promising biotechnological and biomedical applications. We undertake our task by exploring “the state of the art” in AI and machine learning (ML) applications to protein science in the scientific literature to address some critical research questions in this domain, including: What kind of tasks are already explored by ML approaches to protein sciences? What are the most common ML algorithms and databases used? What is the situational diagnostic of the AI–PS inter-field? What do ML processing steps have in common? We also formulate novel questions such as: Is it possible to discover what the rules of protein evolution are with the binomial AI–PS? How do protein folding pathways evolve? What are the rules that dictate the folds? What are the minimal nuclear protein structures? How do protein aggregates form and why do they exhibit different toxicities? What are the structural properties of amyloid proteins? How can we design an effective proteostasis network to deal with misfolded proteins? We are a cross-functional group of scientists from several academic disciplines, and we have conducted the systematic review using a variant of the PICO and PRISMA approaches. The search was carried out in four databases (PubMed, Bireme, OVID, and EBSCO Web of Science), resulting in 144 research articles. After three rounds of quality screening, 93 articles were finally selected for further analysis. A summary of our findings is as follows: regarding AI applications, there are mainly four types: 1) genomics, 2) protein structure and function, 3) protein design and evolution, and 4) drug design. In terms of the ML algorithms and databases used, supervised learning was the most common approach (85%). As for the databases used for the ML models, PDB and UniprotKB/Swissprot were the most common ones (21 and 8%, respectively). Moreover, we identified that approximately 63% of the articles organized their results into three steps, which we labeled pre-process, process, and post-process. A few studies combined data from several databases or created their own databases after the pre-process. Our main finding is that, as of today, there are no research road maps serving as guides to address gaps in our knowledge of the AI–PS binomial. All research efforts to collect, integrate multidimensional data features, and then analyze and validate them are, so far, uncoordinated and scattered throughout the scientific literature without a clear epistemic goal or connection between the studies. Therefore, our main contribution to the scientific literature is to offer a road map to help solve problems in drug design, protein structures, design, and function prediction while also presenting the “state of the art” on research in the AI–PS binomial until February 2021. Thus, we pave the way toward future advances in the synthetic redesign of novel proteins and protein networks and artificial metabolic pathways, learning lessons from nature for the welfare of humankind. Many of the novel proteins and metabolic pathways are currently non-existent in nature, nor are they used in the chemical industry or biomedical field.

Affiliation(s)

Jalil Villalobos-Alva Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Luis Ochoa-Toledo Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Mario Javier Villalobos-Alva Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Atocha Aliseda Instituto de Investigaciones Filosóficas, Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Fernando Pérez-Escamirosa Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Nelly F. Altamirano-Bustamante Instituto Nacional de Pediatría, Mexico City, Mexico
Francine Ochoa-Fernández Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Ricardo Zamora-Solís Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Sebastián Villalobos-Alva Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Cristina Revilla-Monsalve Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Nicolás Kemper-Valverde Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Myriam M. Altamirano-Bustamante Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico *Correspondence: Myriam M. Altamirano-Bustamante,

Li Y, Guo Y, Cheng H, Zeng X, Zhang X, Sang P, Chen B, Yang L. Deciphering gp120 sequence variation and structural dynamics in HIV neutralization phenotype by molecular dynamics simulations and graph machine learning. Proteins 2022;90:1413-1424. [DOI: 10.1002/prot.26322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 01/21/2022] [Accepted: 02/10/2022] [Indexed: 02/04/2023]

Gupta A, Dey S, Hicks A, Zhou HX. Artificial intelligence guided conformational mining of intrinsically disordered proteins. Commun Biol 2022;5:610. [PMID: 35725761 PMCID: PMC9209487 DOI: 10.1038/s42003-022-03562-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/07/2022] [Indexed: 12/29/2022] Open

Taneishi K, Tsuchiya Y. Structure-based analyses of gut microbiome-related proteins by neural networks and molecular dynamics simulations. Curr Opin Struct Biol 2022;73:102336. [DOI: 10.1016/j.sbi.2022.102336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 11/18/2021] [Accepted: 01/14/2022] [Indexed: 11/03/2022]

Li C, Liu J, Chen J, Yuan Y, Yu J, Gou Q, Guo Y, Pu X. An Interpretable Convolutional Neural Network Framework for Analyzing Molecular Dynamics Trajectories: a Case Study on Functional States for G-Protein-Coupled Receptors. J Chem Inf Model 2022;62:1399-1410. [PMID: 35257580 DOI: 10.1021/acs.jcim.2c00085] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Ketkaew R, Creazzo F, Luber S. Machine Learning-Assisted Discovery of Hidden States in Expanded Free Energy Space. J Phys Chem Lett 2022;13:1797-1805. [PMID: 35171614 DOI: 10.1021/acs.jpclett.1c04004] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Ruban A, Saccon F. Chlorophyll a De-Excitation Pathways in the LHCII antenna. J Chem Phys 2022;156:070902. [DOI: 10.1063/5.0073825] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Basciu A, Callea L, Motta S, Bonvin AM, Bonati L, Vargiu AV. No dance, no partner! A tale of receptor flexibility in docking and virtual screening. VIRTUAL SCREENING AND DRUG DOCKING 2022. [DOI: 10.1016/bs.armc.2022.08.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Protein-Protein Docking: Past, Present, and Future. Protein J 2021;41:1-26. [PMID: 34787783 DOI: 10.1007/s10930-021-10031-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2021] [Indexed: 10/19/2022]

Tian H, Jiang X, Trozzi F, Xiao S, Larson EC, Tao P. Explore Protein Conformational Space With Variational Autoencoder. Front Mol Biosci 2021;8:781635. [PMID: 34869602 PMCID: PMC8633506 DOI: 10.3389/fmolb.2021.781635] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 10/28/2021] [Indexed: 12/02/2022] Open

Tam C, Kumar A, Zhang KYJ. NbX: Machine Learning-Guided Re-Ranking of Nanobody-Antigen Binding Poses. Pharmaceuticals (Basel) 2021;14:ph14100968. [PMID: 34681192 PMCID: PMC8537642 DOI: 10.3390/ph14100968] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 09/17/2021] [Accepted: 09/21/2021] [Indexed: 12/02/2022] Open

Dongre AV, Das S, Bellur A, Kumar S, Chandrashekarmath A, Karmakar T, Balaram P, Balasubramanian S, Balaram H. Structural basis for the hyperthermostability of an archaeal enzyme induced by succinimide formation. Biophys J 2021;120:3732-3746. [PMID: 34302792 DOI: 10.1016/j.bpj.2021.07.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 06/18/2021] [Accepted: 07/19/2021] [Indexed: 10/20/2022] Open

Kulichenko M, Smith JS, Nebgen B, Li YW, Fedik N, Boldyrev AI, Lubbers N, Barros K, Tretiak S. The Rise of Neural Networks for Materials and Chemical Dynamics. J Phys Chem Lett 2021;12:6227-6243. [PMID: 34196559 DOI: 10.1021/acs.jpclett.1c01357] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Nunes-Alves A, Ormersbach F, Wade RC. Prediction of the Drug-Target Binding Kinetics for Flexible Proteins by Comparative Binding Energy Analysis. J Chem Inf Model 2021;61:3708-3721. [PMID: 34197096 DOI: 10.1021/acs.jcim.1c00639] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

There is growing consensus that the optimization of the kinetic parameters for drug-protein binding leads to improved drug efficacy. Therefore, computational methods have been developed to predict kinetic rates and to derive quantitative structure-kinetic relationships (QSKRs). Many of these methods are based on crystal structures of ligand-protein complexes. However, a drawback is that each ligand-protein complex is usually treated as having a single structure. Here, we present a modification of COMparative BINding Energy (COMBINE) analysis, which uses the structures of ligand-protein complexes to predict binding parameters. We introduce the option of using multiple structures to describe each ligand-protein complex in COMBINE analysis and apply this to study the effects of protein flexibility on the derivation of dissociation rate constants (k_off) for inhibitors of p38 mitogen-activated protein (MAP) kinase, which has a flexible binding site. Multiple structures were obtained for each ligand-protein complex by performing docking to an ensemble of protein configurations obtained from molecular dynamics simulations. Coefficients to scale ligand-protein interaction energies determined from energy-minimized structures of ligand-protein complexes were obtained by partial least squares regression, and they allowed for the computation of k_off values. The QSKR model obtained using single, energy-minimized crystal structures for each ligand-protein complex had higher predictive power than the QSKR model obtained with multiple structures from ensemble docking. However, incorporation of ligand-protein flexibility helped to highlight additional ligand-protein interactions that lead to longer residence times, such as interactions with residues Arg67 and Asp168, which are close to the ligand in many crystal structures. These results show that COMBINE analysis is a promising method to guide the design of compounds that bind to flexible proteins with improved binding kinetics.

Kingdon ADH, Alderwick LJ. Structure-based in silico approaches for drug discovery against Mycobacterium tuberculosis. Comput Struct Biotechnol J 2021;19:3708-3719. [PMID: 34285773 PMCID: PMC8258792 DOI: 10.1016/j.csbj.2021.06.034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 06/22/2021] [Accepted: 06/22/2021] [Indexed: 12/12/2022] Open

Wang X, Flannery ST, Kihara D. Protein Docking Model Evaluation by Graph Neural Networks. Front Mol Biosci 2021;8:647915. [PMID: 34113650 PMCID: PMC8185212 DOI: 10.3389/fmolb.2021.647915] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Accepted: 04/26/2021] [Indexed: 12/03/2022] Open

Ward MD, Zimmerman MI, Meller A, Chung M, Swamidass SJ, Bowman GR. Deep learning the structural determinants of protein biochemical properties by comparing structural ensembles with DiffNets. Nat Commun 2021;12:3023. [PMID: 34021153 PMCID: PMC8140102 DOI: 10.1038/s41467-021-23246-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 04/16/2021] [Indexed: 12/05/2022] Open

Plante A, Weinstein H. Ligand-Dependent Conformational Transitions in Molecular Dynamics Trajectories of GPCRs Revealed by a New Machine Learning Rare Event Detection Protocol. Molecules 2021;26:molecules26103059. [PMID: 34065494 PMCID: PMC8161244 DOI: 10.3390/molecules26103059] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Revised: 05/11/2021] [Accepted: 05/11/2021] [Indexed: 01/14/2023] Open

Dechant PP, He YH. Machine-learning a virus assembly fitness landscape. PLoS One 2021;16:e0250227. [PMID: 33951035 PMCID: PMC8099058 DOI: 10.1371/journal.pone.0250227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 04/01/2021] [Indexed: 02/05/2023] Open

Jin Y, Johannissen LO, Hay S. Predicting new protein conformations from molecular dynamics simulation conformational landscapes and machine learning. Proteins 2021;89:915-921. [PMID: 33629765 DOI: 10.1002/prot.26068] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 01/21/2021] [Accepted: 02/23/2021] [Indexed: 11/06/2022]

Wang B, Su Z, Wu Y. Characterizing the function of domain linkers in regulating the dynamics of multi-domain fusion proteins by microsecond molecular dynamics simulations and artificial intelligence. Proteins 2021;89:884-895. [PMID: 33620752 DOI: 10.1002/prot.26066] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 01/20/2021] [Accepted: 02/20/2021] [Indexed: 11/12/2022]

Rahman T, Du Y, Zhao L, Shehu A. Generative Adversarial Learning of Protein Tertiary Structures. Molecules 2021;26:molecules26051209. [PMID: 33668217 PMCID: PMC7956369 DOI: 10.3390/molecules26051209] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/13/2021] [Accepted: 02/16/2021] [Indexed: 12/15/2022] Open

Rognan D. Modeling Protein-Ligand Interactions: Are We Ready for Deep Learning? SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11521-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Harmalkar A, Gray JJ. Advances to tackle backbone flexibility in protein docking. Curr Opin Struct Biol 2020;67:178-186. [PMID: 33360497 DOI: 10.1016/j.sbi.2020.11.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 11/18/2020] [Accepted: 11/25/2020] [Indexed: 12/11/2022]

Meng F, Liang Z, Zhao K, Luo C. Drug design targeting active posttranslational modification protein isoforms. Med Res Rev 2020;41:1701-1750. [PMID: 33355944 DOI: 10.1002/med.21774] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 11/29/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022]

Abstract

Modern drug design aims to discover novel lead compounds with attractive chemical profiles to enable further exploration of the intersection of chemical space and biological space. Identification of small molecules with good ligand efficiency, high activity, and selectivity is crucial toward developing effective and safe drugs. However, finding this intersection is one of the most challenging tasks in the pharmaceutical industry, as chemical space is almost infinite and continuous, whereas the biological space is very limited and discrete. This bottleneck potentially limits the discovery of molecules with desirable properties for lead optimization. Herein, we present a new direction that leverages the target space of posttranslational modification (PTM) protein isoforms to inspire drug design, termed "Post-translational Modification Inspired Drug Design (PTMI-DD)." PTMI-DD aims to extend the intersections of chemical space and biological space. We further rationalized and highlighted the importance of PTM protein isoforms and their roles in various diseases and biological functions. We then laid out a few directions to elaborate the PTMI-DD in drug design, including discovering covalent binding inhibitors mimicking PTMs, targeting PTM protein isoforms with binding sites distinct from those of their wild-type counterparts, targeting protein-protein interactions involving PTMs, and hijacking protein degeneration by ubiquitination for PTM protein isoforms. These directions will lead to a significant expansion of the biological space and/or increase the tractability of compounds, primarily due to precisely targeting PTM protein isoforms or complexes which are highly relevant to biological functions. Importantly, this new avenue will further enrich the personalized treatment opportunity through precision medicine targeting PTM isoforms.

Instantaneous generation of protein hydration properties from static structures. Commun Chem 2020;3:188. [PMID: 36703451 PMCID: PMC9814540 DOI: 10.1038/s42004-020-00435-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 11/10/2020] [Indexed: 01/29/2023] Open

Bartocci A, Gillet N, Jiang T, Szczepaniak F, Dumont E. Molecular Dynamics Approach for Capturing Calixarene-Protein Interactions: The Case of Cytochrome C. J Phys Chem B 2020;124:11371-11378. [PMID: 33270456 DOI: 10.1021/acs.jpcb.0c08482] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Gkeka P, Stoltz G, Barati Farimani A, Belkacemi Z, Ceriotti M, Chodera JD, Dinner AR, Ferguson AL, Maillet JB, Minoux H, Peter C, Pietrucci F, Silveira A, Tkatchenko A, Trstanova Z, Wiewiora R, Lelièvre T. Machine Learning Force Fields and Coarse-Grained Variables in Molecular Dynamics: Application to Materials and Biological Systems. J Chem Theory Comput 2020;16:4757-4775. [PMID: 32559068 PMCID: PMC8312194 DOI: 10.1021/acs.jctc.0c00355] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Affiliation(s)

Paraskevi Gkeka Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France
Gabriel Stoltz CERMICS, Ecole des Ponts, Marne-la-Vallée, France Matherials Project-Team, Inria Paris, 75012 Paris, France
Amir Barati Farimani Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
Zineb Belkacemi Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France CERMICS, Ecole des Ponts, Marne-la-Vallée, France
Michele Ceriotti Laboratory of Computational Science and Modelling, Institute of Materials, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
John D Chodera Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
Aaron R Dinner Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
Andrew L Ferguson Pritzker School of Molecular Engineering, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
Jean-Bernard Maillet CEA-DAM, DIF, 91297 Arpajon Cedex, France
Hervé Minoux Integrated Drug Discovery, Sanofi R&D, 94403 Vitry-sur-Seine, France
Christine Peter University of Konstanz, 78457 Konstanz, Germany
Fabio Pietrucci UMR CNRS 7590, MNHN, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, Sorbonne Université, 75005 Paris, France
Ana Silveira Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
Alexandre Tkatchenko Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
Zofia Trstanova School of Mathematics, The University of Edinburgh, Edinburgh EH9 3FD, U.K
Rafal Wiewiora Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
Tony Lelièvre CERMICS, Ecole des Ponts, Marne-la-Vallée, France Matherials Project-Team, Inria Paris, 75012 Paris, France

Verkhivker GM, Agajanian S, Hu G, Tao P. Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning. Front Mol Biosci 2020;7:136. [PMID: 32733918 PMCID: PMC7363947 DOI: 10.3389/fmolb.2020.00136] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 06/08/2020] [Indexed: 12/12/2022] Open

Abstract

Allosteric regulation is a common mechanism employed by complex biomolecular systems for regulation of activity and adaptability in the cellular environment, serving as an effective molecular tool for cellular communication. As an intrinsic but elusive property, allostery is a ubiquitous phenomenon where binding or disturbing of a distal site in a protein can functionally control its activity and is considered as the "second secret of life." The fundamental biological importance and complexity of these processes require a multi-faceted platform of synergistically integrated approaches for prediction and characterization of allosteric functional states, atomistic reconstruction of allosteric regulatory mechanisms and discovery of allosteric modulators. The unifying theme and overarching goal of allosteric regulation studies in recent years have been integration between emerging experimental and computational approaches and technologies to advance quantitative characterization of allosteric mechanisms in proteins. Despite significant advances, the quantitative characterization and reliable prediction of functional allosteric states, interactions, and mechanisms continue to present highly challenging problems in the field. In this review, we discuss simulation-based multiscale approaches, experiment-informed Markovian models, and network modeling of allostery and information-theoretical approaches that can describe the thermodynamics and hierarchy of allosteric states and the molecular basis of allosteric mechanisms. The wealth of structural and functional information along with diversity and complexity of allosteric mechanisms in therapeutically important protein families have provided a well-suited platform for development of data-driven research strategies. Data-centric integration of chemistry, biology and computer science using artificial intelligence technologies has gained significant momentum and is at the forefront of many cross-disciplinary efforts. We discuss new developments in the machine learning field and the emergence of deep learning and deep reinforcement learning applications in modeling of molecular mechanisms and allosteric proteins. The experiment-guided integrated approaches empowered by recent advances in multiscale modeling, network science, and machine learning can lead to more reliable prediction of allosteric regulatory mechanisms and discovery of allosteric modulators for therapeutically important protein targets.