1
|
Mustali J, Yasuda I, Hirano Y, Yasuoka K, Gautieri A, Arai N. Unsupervised deep learning for molecular dynamics simulations: a novel analysis of protein-ligand interactions in SARS-CoV-2 M pro. RSC Adv 2023; 13:34249-34261. [PMID: 38019981 PMCID: PMC10663885 DOI: 10.1039/d3ra06375e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 11/06/2023] [Indexed: 12/01/2023] Open
Abstract
Molecular dynamics (MD) simulations, which are central to drug discovery, offer detailed insights into protein-ligand interactions. However, analyzing large MD datasets remains a challenge. Current machine-learning solutions are predominantly supervised and have data labelling and standardisation issues. In this study, we adopted an unsupervised deep-learning framework, previously benchmarked for rigid proteins, to study the more flexible SARS-CoV-2 main protease (Mpro). We ran MD simulations of Mpro with various ligands and refined the data by focusing on binding-site residues and time frames in stable protein conformations. The optimal descriptor chosen was the distance between the residues and the center of the binding pocket. Using this approach, a local dynamic ensemble was generated and fed into our neural network to compute Wasserstein distances across system pairs, revealing ligand-induced conformational differences in Mpro. Dimensionality reduction yielded an embedding map that correlated ligand-induced dynamics and binding affinity. Notably, the high-affinity compounds showed pronounced effects on the protein's conformations. We also identified the key residues that contributed to these differences. Our findings emphasize the potential of combining unsupervised deep learning with MD simulations to extract valuable information and accelerate drug discovery.
Collapse
Affiliation(s)
- Jessica Mustali
- Department of Electronics, Information and Bioengineering, Politecnico di Milano Italy
| | - Ikki Yasuda
- Department of Mechanical Engineering, Keio University Japan
| | | | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University Japan
| | - Alfonso Gautieri
- Department of Electronics, Information and Bioengineering, Politecnico di Milano Italy
| | - Noriyoshi Arai
- Department of Mechanical Engineering, Keio University Japan
| |
Collapse
|
2
|
Meli R, Morris GM, Biggin PC. Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review. FRONTIERS IN BIOINFORMATICS 2022; 2:885983. [PMID: 36187180 PMCID: PMC7613667 DOI: 10.3389/fbinf.2022.885983] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 05/11/2022] [Indexed: 01/01/2023] Open
Abstract
The rapid and accurate in silico prediction of protein-ligand binding free energies or binding affinities has the potential to transform drug discovery. In recent years, there has been a rapid growth of interest in deep learning methods for the prediction of protein-ligand binding affinities based on the structural information of protein-ligand complexes. These structure-based scoring functions often obtain better results than classical scoring functions when applied within their applicability domain. Here we review structure-based scoring functions for binding affinity prediction based on deep learning, focussing on different types of architectures, featurization strategies, data sets, methods for training and evaluation, and the role of explainable artificial intelligence in building useful models for real drug-discovery applications.
Collapse
Affiliation(s)
- Rocco Meli
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | - Garrett M. Morris
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Philip C. Biggin
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
3
|
Radakovich N, Nagy M, Nazha A. Artificial Intelligence in Hematology: Current Challenges and Opportunities. Curr Hematol Malig Rep 2020; 15:203-210. [PMID: 32239350 DOI: 10.1007/s11899-020-00575-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
PURPOSE OF REVIEW Artificial intelligence (AI), and in particular its subcategory machine learning, is finding an increasing number of applications in medicine, driven in large part by an abundance of data and powerful, accessible tools that have made AI accessible to a larger circle of investigators. RECENT FINDINGS AI has been employed in the analysis of hematopathological, radiographic, laboratory, genomic, pharmacological, and chemical data to better inform diagnosis, prognosis, treatment planning, and foundational knowledge related to benign and malignant hematology. As more widespread implementation of clinical AI nears, attention has also turned to the effects this will have on other areas in medicine. AI offers many promising tools to clinicians broadly, and specifically in the practice of hematology. Ongoing research into its various applications will likely result in an increasing utilization of AI by a broader swath of clinicians.
Collapse
Affiliation(s)
- Nathan Radakovich
- Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, Cleveland, OH, USA
| | - Matthew Nagy
- Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, Cleveland, OH, USA
| | - Aziz Nazha
- Center for Clinical Artificial Intelligence, Cleveland Clinic, Cleveland, OH, USA.
- Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Desk R35 9500 Euclid Ave., Cleveland, OH, 44195, USA.
| |
Collapse
|
4
|
Parks CD, Gaieb Z, Chiu M, Yang H, Shao C, Walters WP, Jansen JM, McGaughey G, Lewis RA, Bembenek SD, Ameriks MK, Mirzadegan T, Burley SK, Amaro RE, Gilson MK. D3R grand challenge 4: blind prediction of protein-ligand poses, affinity rankings, and relative binding free energies. J Comput Aided Mol Des 2020; 34:99-119. [PMID: 31974851 PMCID: PMC7261493 DOI: 10.1007/s10822-020-00289-y] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 01/13/2020] [Indexed: 12/11/2022]
Abstract
The Drug Design Data Resource (D3R) aims to identify best practice methods for computer aided drug design through blinded ligand pose prediction and affinity challenges. Herein, we report on the results of Grand Challenge 4 (GC4). GC4 focused on proteins beta secretase 1 and Cathepsin S, and was run in an analogous manner to prior challenges. In Stage 1, participant ability to predict the pose and affinity of BACE1 ligands were assessed. Following the completion of Stage 1, all BACE1 co-crystal structures were released, and Stage 2 tested affinity rankings with co-crystal structures. We provide an analysis of the results and discuss insights into determined best practice methods.
Collapse
Affiliation(s)
- Conor D Parks
- Drug Design Data Resource, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Zied Gaieb
- Drug Design Data Resource, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Michael Chiu
- Drug Design Data Resource, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Huanwang Yang
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, New Brunswick, NJ, 08903, USA
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Chenghua Shao
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, New Brunswick, NJ, 08903, USA
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, 92093, USA
| | | | - Johanna M Jansen
- Novartis Institutes for BioMedical Research, Emeryville, CA, 94608, USA
| | | | - Richard A Lewis
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002, Basel, Switzerland
| | | | | | | | - Stephen K Burley
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, New Brunswick, NJ, 08903, USA
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Rommie E Amaro
- Drug Design Data Resource, University of California, San Diego, La Jolla, CA, 92093, USA.
- Department of Chemistry and Biochemistry, UC San Diego, La Jolla, CA, 92093-0340, USA.
| | - Michael K Gilson
- Drug Design Data Resource, University of California, San Diego, La Jolla, CA, 92093, USA.
- Skaggs School of Pharmacy and Pharmaceutical Sciences, UC San Diego, 9500 Gilman Drive, MC0751, La Jolla, CA, 92093, USA.
| |
Collapse
|