1
|
Čivić J, McFarlane NR, Masschelein J, Harvey JN. Exploring the selectivity of cytochrome P450 for enhanced novel anticancer agent synthesis. Faraday Discuss 2024; 252:69-88. [PMID: 38855920 DOI: 10.1039/d4fd00004h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Cytochrome P450 monooxygenases are an extensive and unique class of enzymes, which can regio- and stereo-selectively functionalise hydrocarbons by way of oxidation reactions. These enzymes are naturally occurring but have also been extensively applied in a synthesis context, where they are used as efficient biocatalysts. Recently, a biosynthetic pathway where a cytochrome P450 monooxygenase catalyses a critical step of the pathway was uncovered, leading to the production of a number of products that display high antitumour potency. In this work, we use computational techniques to gain insight into the factors that determine the relative yields of the different products. We use conformational search algorithms to understand the substrate stereochemistry. On a machine-learned 3D protein structure, we use molecular docking to obtain a library of favourable poses for substrate-protein interaction. With molecular dynamics, we investigate the most favourable poses for reactivity on a molecular level, allowing us to investigate which protein-substrate interactions favour a given product and thus gain insight into the product selectivity.
Collapse
Affiliation(s)
- Janko Čivić
- Department of Chemistry, KU Leuven, Celestijnenlaan 200F, B-3001 Leuven, Belgium.
| | - Neil R McFarlane
- Department of Chemistry, KU Leuven, Celestijnenlaan 200F, B-3001 Leuven, Belgium.
| | - Joleen Masschelein
- Department of Biology, Vlaams Instituut voor Biotechnologie VIB-KU Leuven Center for Microbiology, Leuven, Belgium
| | - Jeremy N Harvey
- Department of Chemistry, KU Leuven, Celestijnenlaan 200F, B-3001 Leuven, Belgium.
| |
Collapse
|
2
|
Metz TO, Chang CH, Gautam V, Anjum A, Tian S, Wang F, Colby SM, Nunez JR, Blumer MR, Edison AS, Fiehn O, Jones DP, Li S, Morgan ET, Patti GJ, Ross DH, Shapiro MR, Williams AJ, Wishart DS. Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.30.605945. [PMID: 39131324 PMCID: PMC11312557 DOI: 10.1101/2024.07.30.605945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence - the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match to an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an in silico analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.
Collapse
Affiliation(s)
- Thomas O. Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Christine H. Chang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Afia Anjum
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Fei Wang
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
- Alberta Machine Intelligence Institute, Edmonton, AB, Canada
| | - Sean M. Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Jamie R. Nunez
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madison R. Blumer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Arthur S. Edison
- Department of Biochemistry & Molecular Biology, Complex Carbohydrate Research Center and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Oliver Fiehn
- West Coast Metabolomics Center, University of California Davis, Davis, CA, USA
| | - Dean P. Jones
- Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, Georgia, USA
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Edward T. Morgan
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Gary J. Patti
- Center for Mass Spectrometry and Metabolic Tracing, Department of Chemistry, Department of Medicine, Washington University, Saint Louis, Missouri, USA
| | - Dylan H. Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madelyn R. Shapiro
- Artificial Intelligence & Data Analytics Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Antony J. Williams
- U.S. Environmental Protection Agency, Office of Research & Development, Center for Computational Toxicology & Exposure (CCTE), Research Triangle Park, NC USA
| | - David S. Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
3
|
Lai Y, Koelmel JP, Walker DI, Price EJ, Papazian S, Manz KE, Castilla-Fernández D, Bowden JA, Nikiforov V, David A, Bessonneau V, Amer B, Seethapathy S, Hu X, Lin EZ, Jbebli A, McNeil BR, Barupal D, Cerasa M, Xie H, Kalia V, Nandakumar R, Singh R, Tian Z, Gao P, Zhao Y, Froment J, Rostkowski P, Dubey S, Coufalíková K, Seličová H, Hecht H, Liu S, Udhani HH, Restituito S, Tchou-Wong KM, Lu K, Martin JW, Warth B, Godri Pollitt KJ, Klánová J, Fiehn O, Metz TO, Pennell KD, Jones DP, Miller GW. High-Resolution Mass Spectrometry for Human Exposomics: Expanding Chemical Space Coverage. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:12784-12822. [PMID: 38984754 PMCID: PMC11271014 DOI: 10.1021/acs.est.4c01156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/11/2024]
Abstract
In the modern "omics" era, measurement of the human exposome is a critical missing link between genetic drivers and disease outcomes. High-resolution mass spectrometry (HRMS), routinely used in proteomics and metabolomics, has emerged as a leading technology to broadly profile chemical exposure agents and related biomolecules for accurate mass measurement, high sensitivity, rapid data acquisition, and increased resolution of chemical space. Non-targeted approaches are increasingly accessible, supporting a shift from conventional hypothesis-driven, quantitation-centric targeted analyses toward data-driven, hypothesis-generating chemical exposome-wide profiling. However, HRMS-based exposomics encounters unique challenges. New analytical and computational infrastructures are needed to expand the analysis coverage through streamlined, scalable, and harmonized workflows and data pipelines that permit longitudinal chemical exposome tracking, retrospective validation, and multi-omics integration for meaningful health-oriented inferences. In this article, we survey the literature on state-of-the-art HRMS-based technologies, review current analytical workflows and informatic pipelines, and provide an up-to-date reference on exposomic approaches for chemists, toxicologists, epidemiologists, care providers, and stakeholders in health sciences and medicine. We propose efforts to benchmark fit-for-purpose platforms for expanding coverage of chemical space, including gas/liquid chromatography-HRMS (GC-HRMS and LC-HRMS), and discuss opportunities, challenges, and strategies to advance the burgeoning field of the exposome.
Collapse
Affiliation(s)
- Yunjia Lai
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| | - Jeremy P. Koelmel
- Department
of Environmental Health Sciences, Yale School
of Public Health, New Haven, Connecticut 06520, United States
| | - Douglas I. Walker
- Gangarosa
Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, Georgia 30322, United States
| | - Elliott J. Price
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Stefano Papazian
- Department
of Environmental Science, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
- National
Facility for Exposomics, Metabolomics Platform, Science for Life Laboratory, Stockholm University, Solna 171 65, Sweden
| | - Katherine E. Manz
- Department
of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Delia Castilla-Fernández
- Department
of Food Chemistry and Toxicology, Faculty of Chemistry, University of Vienna, 1010 Vienna, Austria
| | - John A. Bowden
- Center for
Environmental and Human Toxicology, Department of Physiological Sciences,
College of Veterinary Medicine, University
of Florida, Gainesville, Florida 32611, United States
| | | | - Arthur David
- Univ Rennes,
Inserm, EHESP, Irset (Institut de recherche en santé, environnement
et travail) − UMR_S, 1085 Rennes, France
| | - Vincent Bessonneau
- Univ Rennes,
Inserm, EHESP, Irset (Institut de recherche en santé, environnement
et travail) − UMR_S, 1085 Rennes, France
| | - Bashar Amer
- Thermo
Fisher Scientific, San Jose, California 95134, United States
| | | | - Xin Hu
- Gangarosa
Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, Georgia 30322, United States
| | - Elizabeth Z. Lin
- Department
of Environmental Health Sciences, Yale School
of Public Health, New Haven, Connecticut 06520, United States
| | - Akrem Jbebli
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Brooklynn R. McNeil
- Biomarkers
Core Laboratory, Irving Institute for Clinical and Translational Research, Columbia University Irving Medical Center, New York, New York 10032, United States
| | - Dinesh Barupal
- Department
of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States
| | - Marina Cerasa
- Institute
of Atmospheric Pollution Research, Italian National Research Council, 00015 Monterotondo, Rome, Italy
| | - Hongyu Xie
- Department
of Environmental Science, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Vrinda Kalia
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| | - Renu Nandakumar
- Biomarkers
Core Laboratory, Irving Institute for Clinical and Translational Research, Columbia University Irving Medical Center, New York, New York 10032, United States
| | - Randolph Singh
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| | - Zhenyu Tian
- Department
of Chemistry and Chemical Biology, Northeastern
University, Boston, Massachusetts 02115, United States
| | - Peng Gao
- Department
of Environmental and Occupational Health, and Department of Civil
and Environmental Engineering, University
of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
- UPMC Hillman
Cancer Center, Pittsburgh, Pennsylvania 15232, United States
| | - Yujia Zhao
- Institute
for Risk Assessment Sciences, Utrecht University, Utrecht 3584CM, The Netherlands
| | | | | | - Saurabh Dubey
- Biomarkers
Core Laboratory, Irving Institute for Clinical and Translational Research, Columbia University Irving Medical Center, New York, New York 10032, United States
| | - Kateřina Coufalíková
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Hana Seličová
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Helge Hecht
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Sheng Liu
- Department
of Environmental Health Sciences, Yale School
of Public Health, New Haven, Connecticut 06520, United States
| | - Hanisha H. Udhani
- Biomarkers
Core Laboratory, Irving Institute for Clinical and Translational Research, Columbia University Irving Medical Center, New York, New York 10032, United States
| | - Sophie Restituito
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| | - Kam-Meng Tchou-Wong
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| | - Kun Lu
- Department
of Environmental Sciences and Engineering, Gillings School of Global
Public Health, The University of North Carolina
at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Jonathan W. Martin
- Department
of Environmental Science, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
- National
Facility for Exposomics, Metabolomics Platform, Science for Life Laboratory, Stockholm University, Solna 171 65, Sweden
| | - Benedikt Warth
- Department
of Food Chemistry and Toxicology, Faculty of Chemistry, University of Vienna, 1010 Vienna, Austria
| | - Krystal J. Godri Pollitt
- Department
of Environmental Health Sciences, Yale School
of Public Health, New Haven, Connecticut 06520, United States
| | - Jana Klánová
- RECETOX,
Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech Republic
| | - Oliver Fiehn
- West Coast
Metabolomics Center, University of California−Davis, Davis, California 95616, United States
| | - Thomas O. Metz
- Biological
Sciences Division, Pacific Northwest National
Laboratory, Richland, Washington 99354, United States
| | - Kurt D. Pennell
- School
of Engineering, Brown University, Providence, Rhode Island 02912, United States
| | - Dean P. Jones
- Department
of Medicine, School of Medicine, Emory University, Atlanta, Georgia 30322, United States
| | - Gary W. Miller
- Department
of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York 10032, United States
| |
Collapse
|
4
|
Öeren M, Hunt PA, Wharrick CE, Tabatabaei Ghomi H, Segall MD. Predicting routes of phase I and II metabolism based on quantum mechanics and machine learning. Xenobiotica 2024; 54:379-393. [PMID: 37966132 DOI: 10.1080/00498254.2023.2284251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/13/2023] [Indexed: 11/16/2023]
Abstract
Unexpected metabolism could lead to the failure of many late-stage drug candidates or even the withdrawal of approved drugs. Thus, it is critical to predict and study the dominant routes of metabolism in the early stages of research.We describe the development and validation of a 'WhichEnzyme' model that accurately predicts the enzyme families most likely to be responsible for a drug-like molecule's metabolism. Furthermore, we combine this model with our previously published regioselectivity models for Cytochromes P450, Aldehyde Oxidases, Flavin-containing Monooxygenases, UDP-glucuronosyltransferases and Sulfotransferases - the most important Phase I and Phase II drug metabolising enzymes - and a 'WhichP450' model that predicts the Cytochrome P450 isoform(s) responsible for a compound's metabolism.The regioselectivity models are based on a mechanistic understanding of these enzymes' actions and use quantum mechanical simulations with machine learning methods to accurately predict sites of metabolism and the resulting metabolites. We train heuristics based on the outputs of the 'WhichEnzyme', 'WhichP450', and regioselectivity models to determine the most likely routes of metabolism and metabolites to be observed experimentally.Finally, we demonstrate that this combination delivers high sensitivity in identifying experimentally reported metabolites and higher precision than other methods for predicting in vivo metabolite profiles.
Collapse
Affiliation(s)
- Mario Öeren
- Optibrium Limited, Cambridge Innovation Park, Cambridge, UK
| | - Peter A Hunt
- Optibrium Limited, Cambridge Innovation Park, Cambridge, UK
| | | | | | | |
Collapse
|
5
|
Groff L, Williams A, Shah I, Patlewicz G. MetSim: Integrated Programmatic Access and Pathway Management for Xenobiotic Metabolism Simulators. Chem Res Toxicol 2024; 37:685-697. [PMID: 38598715 PMCID: PMC11325951 DOI: 10.1021/acs.chemrestox.3c00398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Xenobiotic metabolism is a key consideration in evaluating the hazards and risks posed by environmental chemicals. A number of software tools exist that are capable of simulating metabolites, but each reports its predictions in a different format and with varying levels of detail. This makes comparing the performance and coverage of the tools a practical challenge. To address this shortcoming, we developed a metabolic simulation framework called MetSim, which comprises three main components. A graph-based schema was developed to allow metabolism information to be harmonized. The schema was implemented in MongoDB to store and retrieve metabolic graphs for subsequent analysis. MetSim currently includes an application programming interface for four metabolic simulators: BioTransformer, the OECD Toolbox, EPA's chemical transformation simulator (CTS), and tissue metabolism simulator (TIMES). Lastly, MetSim provides functions to help evaluate simulator performance for specific data sets. In this study, a set of 112 drugs with 432 reported metabolites were compiled, and predictions were made using the 4 simulators. Fifty-nine of the 112 drugs were taken from the Small Molecule Pathway Database, with the remainder sourced from the literature. The human models within BioTransformer and CTS (Phase I only) and the rat models within TIMES and the OECD Toolbox (Phase I only) were used to make predictions for the chemicals in the data set. The recall and precision (recall, precision) ranked in order of highest recall for each individual tool were CTS (0.54, 0.017), BioTransformer (0.50, 0.008), Toolbox in vitro (0.40, 0.144), TIMES in vivo (0.40, 0.133), Toolbox in vivo (0.40, 0.118), and TIMES in vitro (0.39, 0.128). Combining all of the model predictions together increased the overall recall (0.73, 0.008). MetSim enabled insights into the performance and coverage of in silico metabolic simulators to be more efficiently derived, which in turn should aid future efforts to evaluate other data sets.
Collapse
Affiliation(s)
- Louis Groff
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Antony Williams
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Imran Shah
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Grace Patlewicz
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| |
Collapse
|
6
|
Yang H, Liu J, Chen K, Cong S, Cai S, Li Y, Jia Z, Wu H, Lou T, Wei Z, Yang X, Xiao H. D-CyPre: a machine learning-based tool for accurate prediction of human CYP450 enzyme metabolic sites. PeerJ Comput Sci 2024; 10:e2040. [PMID: 38855237 PMCID: PMC11157575 DOI: 10.7717/peerj-cs.2040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 04/15/2024] [Indexed: 06/11/2024]
Abstract
The advancement of graph neural networks (GNNs) has made it possible to accurately predict metabolic sites. Despite the combination of GNNs with XGBOOST showing impressive performance, this technology has not yet been applied in the realm of metabolic site prediction. Previous metabolic site prediction tools focused on bonds and atoms, regardless of the overall molecular skeleton. This study introduces a novel tool, named D-CyPre, that amalgamates atom, bond, and molecular skeleton information via two directed message-passing neural networks (D-MPNN) to predict the metabolic sites of the nine cytochrome P450 enzymes using XGBOOST. In D-CyPre Precision Mode, the model produces fewer, but more accurate results (Jaccard score: 0.497, F1: 0.660, and precision: 0.737 in the test set). In D-CyPre Recall Mode, the model produces less accurate, but more comprehensive results (Jaccard score: 0.506, F1: 0.669, and recall: 0.720 in the test set). In the test set of 68 reactants, D-CyPre outperformed BioTransformer on all isoenzymes and CyProduct on most isoenzymes (5/9). For the subtypes where D-CyPre outperformed CyProducts, the Jaccard score and F1 scores increased by 24% and 16% in Precision Mode (4/9) and 19% and 12% in Recall Mode (5/9), respectively, relative to the second-best CyProduct. Overall, D-CyPre provides more accurate prediction results for human CYP450 enzyme metabolic sites.
Collapse
Affiliation(s)
- Haolan Yang
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Jie Liu
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Kui Chen
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Shiyu Cong
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Shengnan Cai
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Yueting Li
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Zhixin Jia
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Hao Wu
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Tianyu Lou
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Zuying Wei
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Xiaoqin Yang
- School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| | - Hongbin Xiao
- Beijing University of Chinese Medicine, Research Center of Chinese Medicine Analysis and Transformation, Beijing, China
| |
Collapse
|
7
|
Scholz VA, Stork C, Frericks M, Kirchmair J. Computational prediction of the metabolites of agrochemicals formed in rats. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 895:165039. [PMID: 37355108 DOI: 10.1016/j.scitotenv.2023.165039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 06/17/2023] [Accepted: 06/19/2023] [Indexed: 06/26/2023]
Abstract
Today, computational tools for the prediction of the metabolite structures of xenobiotics are widely available and employed in small-molecule research. Reflecting the availability of measured data, these in silico tools are trained and validated primarily on drug metabolism data. In this work, we assessed the capacity of five leading metabolite structure predictors to represent the metabolism of agrochemicals observed in rats. More specifically, we tested the ability of SyGMa, GLORY, GLORYx, BioTransformer 3.0, and MetaTrans to correctly predict and rank the experimentally observed metabolites of a set of 85 parent compounds. We found that the models were able to recover about one to two-thirds of the experimentally observed first-generation, second-generation and third-generation metabolites, confirming their value in applications such as metabolite identification. However, precision was low for all investigated tools and did not exceed approximately 18 % for the pool of first-generation metabolites and 2 % for the pool of compounds representing the first three generations of metabolites. The variance in prediction success rates was high across the individual metabolic maps, meaning that outcomes depend strongly on the specific compound under investigation. We also found that the predictions for individual parent compounds differed strongly between the tools, particularly between those built on orthogonal technologies (e.g., rule-based and end-to-end machine learning approaches). This renders ensemble model strategies promising for improving success rates. Overall, the results of this benchmark study show that there is still considerable room for the improvement of metabolite structure predictors left. Our discussion points out several avenues to progress. The bottleneck in method development certainly has been, and will remain, for the foreseeable future, the limited quantity and quality of available measured data on small-molecule metabolism.
Collapse
Affiliation(s)
- Vincent-Alexander Scholz
- Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria; Vienna Doctoral School of Pharmaceutical, Nutritional and Sport Sciences (PhaNuSpo), University of Vienna, 1090 Vienna, Austria
| | | | | | - Johannes Kirchmair
- Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria; Christian Doppler Laboratory for Molecular Informatics in the Biosciences, Department for Pharmaceutical Sciences, University of Vienna, 1090 Vienna, Austria.
| |
Collapse
|
8
|
Feng Y, Gong C, Zhu J, Liu G, Tang Y, Li W. Prediction of Sites of Metabolism of CYP3A4 Substrates Utilizing Docking-Derived Geometric Features. J Chem Inf Model 2023. [PMID: 37336765 DOI: 10.1021/acs.jcim.3c00549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023]
Abstract
Cytochrome P450 3A4 (CYP3A4) is one of the major drug-metabolizing enzymes in the human body and is responsible for the metabolism of ∼50% of clinically used drugs. Therefore, the identification of the compound's sites of metabolism (SOMs) mediated by CYP3A4 is of utmost importance in the early stage of drug discovery and development. Herein, docking-based approaches incorporating geometric features were used for SOMs prediction of CYP3A4 substrates. The cross-docking poses of a relatively large data set containing 474 substrates were analyzed in depth, and a widely observed geometric pattern called the close proximity of SOMs was derived from the poses. On the basis of the close proximity, several structure-based models have been constructed, which demonstrated better performance than those structure-based models using the criterion of Fe-SOM distance. For further improving the prediction performance, the structure-based models were also combined with the well-known ligand-based model SMARTCyp. One combined model exhibited good performance on the SOMs prediction of an external substrate set containing kinase inhibitors, PROTACs, approved drugs, and some lead compounds.
Collapse
Affiliation(s)
- Yanjun Feng
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Changda Gong
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Jieyu Zhu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| |
Collapse
|
9
|
Tran TTV, Tayara H, Chong KT. Artificial Intelligence in Drug Metabolism and Excretion Prediction: Recent Advances, Challenges, and Future Perspectives. Pharmaceutics 2023; 15:1260. [PMID: 37111744 PMCID: PMC10143484 DOI: 10.3390/pharmaceutics15041260] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 04/07/2023] [Accepted: 04/14/2023] [Indexed: 04/29/2023] Open
Abstract
Drug metabolism and excretion play crucial roles in determining the efficacy and safety of drug candidates, and predicting these processes is an essential part of drug discovery and development. In recent years, artificial intelligence (AI) has emerged as a powerful tool for predicting drug metabolism and excretion, offering the potential to speed up drug development and improve clinical success rates. This review highlights recent advances in AI-based drug metabolism and excretion prediction, including deep learning and machine learning algorithms. We provide a list of public data sources and free prediction tools for the research community. We also discuss the challenges associated with the development of AI models for drug metabolism and excretion prediction and explore future perspectives in the field. We hope this will be a helpful resource for anyone who is researching in silico drug metabolism, excretion, and pharmacokinetic properties.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea;
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University—Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
10
|
Hu B, Zhao X, Wang E, Zhou J, Li J, Chen J, Du G. Efficient heterologous expression of cytochrome P450 enzymes in microorganisms for the biosynthesis of natural products. Crit Rev Biotechnol 2023; 43:227-241. [PMID: 35129020 DOI: 10.1080/07388551.2022.2029344] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Natural products, a chemically and structurally diverse class of molecules, possess a wide spectrum of biological activities, have been used therapeutically for millennia, and have provided many lead compounds for the development of synthetic drugs. Cytochrome P450 enzymes (P450s, CYP) are widespread in nature and are involved in the biosynthesis of many natural products. P450s are heme-containing enzymes that use molecular oxygen and the hydride donor NAD(P)H (coupled via enzymic redox partners) to catalyze the insertion of oxygen into C-H bonds in a regio- and stereo-selective manner, effecting hydroxylation and several other reactions. With the rapid development of systems biology, numerous novel P450s have been identified for the biosynthesis of natural products, but there are still several challenges to the efficient heterologous expression of active P450s. This review covers recent developments in P450 research and development, including the properties and functions of P450s, discovery and mining of novel P450s, modification and screening of P450 mutants, improved heterologous expression of P450s in microbial hosts, efficient whole-cell transformation with P450s, and current applications of P450s for the biosynthesis of natural products. This resource provides a solid foundation for the application of highly active and stable P450s in microbial cell factories to biosynthesize natural products.
Collapse
Affiliation(s)
- Baodong Hu
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China
| | - Xinrui Zhao
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China
| | - Endao Wang
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China
| | - Jingwen Zhou
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China
| | - Jianghua Li
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China
| | - Jian Chen
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China
| | - Guocheng Du
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, Wuxi, Jiangsu, China.,Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, Jiangsu, China
| |
Collapse
|
11
|
Metabolite annotation from knowns to unknowns through knowledge-guided multi-layer metabolic networking. Nat Commun 2022; 13:6656. [PMID: 36333358 PMCID: PMC9636193 DOI: 10.1038/s41467-022-34537-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 10/27/2022] [Indexed: 11/06/2022] Open
Abstract
Liquid chromatography - mass spectrometry (LC-MS) based untargeted metabolomics allows to measure both known and unknown metabolites in the metabolome. However, unknown metabolite annotation is a major challenge in untargeted metabolomics. Here, we develop an approach, namely, knowledge-guided multi-layer network (KGMN), to enable global metabolite annotation from knowns to unknowns in untargeted metabolomics. The KGMN approach integrates three-layer networks, including knowledge-based metabolic reaction network, knowledge-guided MS/MS similarity network, and global peak correlation network. To demonstrate the principle, we apply KGMN in an in vitro enzymatic reaction system and different biological samples, with ~100-300 putative unknowns annotated in each data set. Among them, >80% unknown metabolites are corroborated with in silico MS/MS tools. Finally, we validate 5 metabolites that are absent in common MS/MS libraries through repository mining and synthesis of chemical standards. Together, the KGMN approach enables efficient unknown annotations, and substantially advances the discovery of recurrent unknown metabolites for common biological samples from model organisms, towards deciphering dark matter in untargeted metabolomics.
Collapse
|
12
|
Wishart DS, Tian S, Allen D, Oler E, Peters H, Lui V, Gautam V, Djoumbou-Feunang Y, Greiner R, Metz T. BioTransformer 3.0-a web server for accurately predicting metabolic transformation products. Nucleic Acids Res 2022; 50:W115-W123. [PMID: 35536252 PMCID: PMC9252798 DOI: 10.1093/nar/gkac313] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 04/11/2022] [Accepted: 05/04/2022] [Indexed: 11/15/2022] Open
Abstract
BioTransformer 3.0 (https://biotransformer.ca) is a freely available web server that supports accurate, rapid and comprehensive in silico metabolism prediction. It combines machine learning approaches with a rule-based system to predict small-molecule metabolism in human tissues, the human gut as well as the external environment (soil and water microbiota). Simply stated, BioTransformer takes a molecular structure as input (SMILES or SDF) and outputs an interactively sortable table of the predicted metabolites or transformation products (SMILES, PNG images) along with the enzymes that are predicted to be responsible for those reactions and richly annotated downloadable files (CSV and JSON). The entire process typically takes less than a minute. Previous versions of BioTransformer focused exclusively on predicting the metabolism of xenobiotics (such as plant natural products, drugs, cosmetics and other synthetic compounds) using a limited number of pre-defined steps and somewhat limited rule-based methods. BioTransformer 3.0 uses much more sophisticated methods and incorporates new databases, new constraints and new prediction modules to not only more accurately predict the metabolic transformation products of exogenous xenobiotics but also the transformation products of endogenous metabolites, such as amino acids, peptides, carbohydrates, organic acids, and lipids. BioTransformer 3.0 can also support customized sequential combinations of these transformations along with multiple iterations to simulate multi-step human biotransformation events. Performance tests indicate that BioTransformer 3.0 is 40–50% more accurate, far less prone to combinatorial ‘explosions’ and much more comprehensive in terms of metabolite coverage/capabilities than previous versions of BioTransformer.
Collapse
Affiliation(s)
- David S Wishart
- To whom correspondence should be addressed. Tel: +1 780 492 8574;
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Dana Allen
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Eponine Oler
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Harrison Peters
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Vicki W Lui
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | | | - Russell Greiner
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2E8, Canada
- Alberta Machine Intelligence Institute, University of Alberta, Edmonton, AB T6G 2E8, Canada
| | - Thomas O Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| |
Collapse
|
13
|
Kaitoh K, Yamanishi Y. Scaffold-Retained Structure Generator to Exhaustively Create Molecules in an Arbitrary Chemical Space. J Chem Inf Model 2022; 62:2212-2225. [DOI: 10.1021/acs.jcim.1c01130] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Kazuma Kaitoh
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan
| | - Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan
| |
Collapse
|
14
|
Zheng S, Wang L, Xiong J, Liang G, Xu Y, Lin F. Consensus Prediction of Human Gut Microbiota-Mediated Metabolism Susceptibility for Small Molecules by Machine Learning, Structural Alerts, and Dietary Compounds-Based Average Similarity Methods. J Chem Inf Model 2022; 62:1078-1099. [PMID: 35156807 DOI: 10.1021/acs.jcim.1c00948] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The human gut microbiota (HGM) colonizing human gastrointestinal tract (HGT) confers a repertoire of dynamic and unique metabolic capacities that are not possessed by the host and therefore is tentatively perceived as an alternative metabolic ″organ″ besides the liver in the host. Nevertheless, the significant contribution of HGM to the overall human metabolism is often overlooked in the modern drug discovery pipeline. Hence, a systematic evaluation of HGM-mediated drug metabolism is gradually important, and its computational prediction becomes increasingly necessary. In this work, a new data set containing both the HGM-mediated metabolism susceptible (HGMMS) and insusceptible (HGMMI) compounds (329 vs 320) was manually curated. Based on this data set, the first machine learning (ML) model, a new structural alerts (SA) model, and the K-nearest neighboring dietary compounds-based average similarity (AS) model were proposed to directly predict the HGM-mediated metabolism susceptibility for small molecules, and exhibit promising performance on three independent test sets. Finally, consensus prediction (ML/SA/AS) for DrugBank molecules revealed an intriguing phenomenon that a typical Michael acceptor ″α,β-unsaturated carbonyl group″ is a very common warhead for the design of covalent inhibitors and inclined to be metabolized by HGM in anaerobic HGT to generate the reduced metabolite without the reactive warhead, which could be a new concern to medicinal chemists. To the best of our knowledge, we gleaned the first HGMMS/HGMMI data set, developed the first HGMMS/HGMMI classification model, implemented a relatively comprehensive program based on ML/SA/AS approaches, and found a new phenomenon on the HGM-mediated deactivation of an extensively used warhead for covalent inhibitors.
Collapse
Affiliation(s)
- Suqing Zheng
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China.,Chemical Biology Research Center, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China
| | - Lei Wang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China
| | - Jun Xiong
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China
| | - Guang Liang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China.,Chemical Biology Research Center, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China
| | - Yong Xu
- Center of Chemical Biology, Guangzhou Institute of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, Guangdong 510530, P.R. China
| | - Fu Lin
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang 325035, P.R. China
| |
Collapse
|
15
|
Ertl P, Gerebtzoff G, Lewis RA, Muenkler H, Schneider N, Sirockin F, Stiefl N, Tosco P. Chemical reactivity prediction: current methods and different application areas. Mol Inform 2021; 41:e2100277. [PMID: 34964302 DOI: 10.1002/minf.202100277] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 12/28/2021] [Indexed: 11/10/2022]
Abstract
The ability to predict chemical reactivity of a molecule is highly desirable in drug discovery, both ex vivo (synthetic route planning, formulation, stability) and in vivo: metabolic reactions determine pharmacodynamics, pharmacokinetics and potential toxic effects, and early assessment of liabilities is vital to reduce attrition rates in later stages of development. Quantum mechanics offer a precise description of the interactions between electrons and orbitals in the breaking and forming of new bonds. Modern algorithms and faster computers have allowed the study of more complex systems in a punctual and accurate fashion, and answers for chemical questions around stability and reactivity can now be provided. Through machine learning, predictive models can be built out of descriptors derived from quantum mechanics and cheminformatics, even in the absence of experimental data to train on. In this article, current progress on computational reactivity prediction is reviewed: applications to problems in drug design, such as modelling of metabolism and covalent inhibition, are highlighted and unmet challenges are posed.
Collapse
Affiliation(s)
| | | | - Richard A Lewis
- Computer-Aided Drug Design, Eli Lilly and Company Limited, Windlesham, SWITZERLAND
| | - Hagen Muenkler
- Novartis Institutes for BioMedical Research Inc, SWITZERLAND
| | | | | | | | - Paolo Tosco
- Novartis Institutes for BioMedical Research Inc, SWITZERLAND
| |
Collapse
|