1
|
Brennan RJ, Jenkinson S, Brown A, Delaunois A, Dumotier B, Pannirselvam M, Rao M, Ribeiro LR, Schmidt F, Sibony A, Timsit Y, Sales VT, Armstrong D, Lagrutta A, Mittlestadt SW, Naven R, Peri R, Roberts S, Vergis JM, Valentin JP. The state of the art in secondary pharmacology and its impact on the safety of new medicines. Nat Rev Drug Discov 2024; 23:525-545. [PMID: 38773351 DOI: 10.1038/s41573-024-00942-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/05/2024] [Indexed: 05/23/2024]
Abstract
Secondary pharmacology screening of investigational small-molecule drugs for potentially adverse off-target activities has become standard practice in pharmaceutical research and development, and regulatory agencies are increasingly requesting data on activity against targets with recognized adverse effect relationships. However, the screening strategies and target panels used by pharmaceutical companies may vary substantially. To help identify commonalities and differences, as well as to highlight opportunities for further optimization of secondary pharmacology assessment, we conducted a broad-ranging survey across 18 companies under the auspices of the DruSafe leadership group of the International Consortium for Innovation and Quality in Pharmaceutical Development. Based on our analysis of this survey and discussions and additional research within the group, we present here an overview of the current state of the art in secondary pharmacology screening. We discuss best practices, including additional safety-associated targets not covered by most current screening panels, and present approaches for interpreting and reporting off-target activities. We also provide an assessment of the safety impact of secondary pharmacology screening, and a perspective on opportunities and challenges in this rapidly developing field.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Mohan Rao
- Janssen Research & Development, San Diego, CA, USA
- Neurocrine Biosciences, San Diego, CA, USA
| | - Lyn Rosenbrier Ribeiro
- UCB Biopharma, Braine-l'Alleud, Belgium
- AstraZeneca, Cambridge, UK
- Grunenthal, Berkshire, UK
| | | | | | - Yoav Timsit
- Novartis Biomedical Research, Cambridge, MA, USA
- Blueprint Medicines, Cambridge, MA, USA
| | | | - Duncan Armstrong
- Novartis Biomedical Research, Cambridge, MA, USA
- Armstrong Pharmacology, Macclesfield, UK
| | | | | | - Russell Naven
- Takeda Pharmaceuticals, Cambridge, MA, USA
- Novartis Biomedical Research, Cambridge, MA, USA
| | - Ravikumar Peri
- Takeda Pharmaceuticals, Cambridge, MA, USA
- Alexion Pharmaceuticals, Wilmington, DE, USA
| | - Sonia Roberts
- Roche Pharma Research and Early Development, Roche Innovation Center, Basel, Switzerland
| | - James M Vergis
- Faegre Drinker Biddle and Reath, LLP, Washington, DC, USA
| | | |
Collapse
|
2
|
Sawada R, Sakajiri Y, Shibata T, Yamanishi Y. Predicting therapeutic and side effects from drug binding affinities to human proteome structures. iScience 2024; 27:110032. [PMID: 38868195 PMCID: PMC11167438 DOI: 10.1016/j.isci.2024.110032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Revised: 04/08/2024] [Accepted: 05/16/2024] [Indexed: 06/14/2024] Open
Abstract
Evaluation of the binding affinities of drugs to proteins is a crucial process for identifying drug pharmacological actions, but it requires three dimensional structures of proteins. Herein, we propose novel computational methods to predict the therapeutic indications and side effects of drug candidate compounds from the binding affinities to human protein structures on a proteome-wide scale. Large-scale docking simulations were performed for 7,582 drugs with 19,135 protein structures revealed by AlphaFold (including experimentally unresolved proteins), and machine learning models on the proteome-wide binding affinity score (PBAS) profiles were constructed. We demonstrated the usefulness of the method for predicting the therapeutic indications for 559 diseases and side effects for 285 toxicities. The method enabled to predict drug indications for which the related protein structures had not been experimentally determined and to successfully extract proteins eliciting the side effects. The proposed method will be useful in various applications in drug discovery.
Collapse
Affiliation(s)
- Ryusuke Sawada
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, Iizuka, Japan
- Department of Pharmacology, Okayama University Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama, Japan
| | - Yuko Sakajiri
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, Iizuka, Japan
- Graduate School of Informatics, Nagoya University, Chikusa, Nagoya, Japan
| | - Tomokazu Shibata
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, Iizuka, Japan
| | - Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, Iizuka, Japan
- Graduate School of Informatics, Nagoya University, Chikusa, Nagoya, Japan
| |
Collapse
|
3
|
Toni E, Ayatollahi H, Abbaszadeh R, Fotuhi Siahpirani A. Machine Learning Techniques for Predicting Drug-Related Side Effects: A Scoping Review. Pharmaceuticals (Basel) 2024; 17:795. [PMID: 38931462 PMCID: PMC11206653 DOI: 10.3390/ph17060795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 06/28/2024] Open
Abstract
BACKGROUND Drug safety relies on advanced methods for timely and accurate prediction of side effects. To tackle this requirement, this scoping review examines machine-learning approaches for predicting drug-related side effects with a particular focus on chemical, biological, and phenotypical features. METHODS This was a scoping review in which a comprehensive search was conducted in various databases from 1 January 2013 to 31 December 2023. RESULTS The results showed the widespread use of Random Forest, k-nearest neighbor, and support vector machine algorithms. Ensemble methods, particularly random forest, emphasized the significance of integrating chemical and biological features in predicting drug-related side effects. CONCLUSIONS This review article emphasized the significance of considering a variety of features, datasets, and machine learning algorithms for predicting drug-related side effects. Ensemble methods and Random Forest showed the best performance and combining chemical and biological features improved prediction. The results suggested that machine learning techniques have some potential to improve drug development and trials. Future work should focus on specific feature types, selection techniques, and graph-based methods for even better prediction.
Collapse
Affiliation(s)
- Esmaeel Toni
- Medical Informatics, Student Research Committee, Iran University of Medical Sciences, Tehran, Iran 14496-14535;
| | - Haleh Ayatollahi
- Medical Informatics, Health Management and Economics Research Center, Health Management Research Institute, Iran University of Medical Sciences, Tehran, Iran 1996-713883
| | - Reza Abbaszadeh
- Pediatric Cardiology, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran 19956-14331;
| | - Alireza Fotuhi Siahpirani
- Systems Biology and Bioinformatics, Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran 14176-14411;
| |
Collapse
|
4
|
Miyajima I, Yoshikawa A, Sahashi K, Seki C, Nagai Y, Watabe H, Shidahara M. DOCK-PET: database of CNS kinetic parameters in the healthy human brain for existing PET tracers. Ann Nucl Med 2024:10.1007/s12149-024-01947-z. [PMID: 38814564 DOI: 10.1007/s12149-024-01947-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 04/24/2024] [Indexed: 05/31/2024]
Abstract
PURPOSE Information about developed positron emission tomography (PET) tracers and obtained clinical PET images is publicly available in a database. However, findings regarding the kinetic parameters of PET tracers are yet to be summarized. Therefore, in this study, we created an open-access database of central nervous system (CNS) kinetic parameters in the healthy human brain for existing PET tracers (DOCK-PET). METHODS Our database includes information on the kinetic parameters and compounds of existing CNS-PET tracers. The kinetic parameter dataset comprises the analysis methods, VT, BPND, K parameters, relevant literature, and study details. The list of PET tracers and kinetic parameter information was compiled through keyword-based searches of PubMed and the Molecular Imaging and Contrast Agent Database (MICAD). The kinetic parameters obtained, including VT, BPND, and K parameters, were reorganized based on the defined brain anatomical regions. All data were rigorously double-checked before being summarized in Microsoft Excel and JavaScript Object Notation (JSON) formats. RESULTS Of the 247 PET tracers identified through searches using the PubMed and MICAD websites, the kinetic parameters of 120 PET tracers were available. Among the 120 PET tracers, compound structures with chemical and physical properties were obtained from the PubChem website or the ChemDraw software. Furthermore, the affinity information of the 104 PET tracers was gathered from PubChem or extensive literature surveys of the 120 PET tracers. CONCLUSIONS We developed a comprehensive open-access database, DOCK-PET, that includes both kinetic parameters of healthy humans and compound information for existing CNS-PET tracers.
Collapse
Affiliation(s)
- Itsuki Miyajima
- Department of Quantum Science and Energy Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Ayano Yoshikawa
- Department of Quantum Science and Energy Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Kyosei Sahashi
- Department of Quantum Science and Energy Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Chie Seki
- Advanced Neuroimaging Center, Institute for Quantum Medical Science, National Institutes for Quantum Science and Technology, Chiba, Japan
| | - Yuji Nagai
- Advanced Neuroimaging Center, Institute for Quantum Medical Science, National Institutes for Quantum Science and Technology, Chiba, Japan
| | - Hiroshi Watabe
- Division of Radiation Protection and Nuclear Safety, Research Center for Accelerator and Radioisotope Science, Tohoku University, Sendai, Japan
| | - Miho Shidahara
- Department of Quantum Science and Energy Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan.
| |
Collapse
|
5
|
Cichońska A, Ravikumar B, Rahman R. AI for targeted polypharmacology: The next frontier in drug discovery. Curr Opin Struct Biol 2024; 84:102771. [PMID: 38215530 DOI: 10.1016/j.sbi.2023.102771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/30/2023] [Accepted: 12/20/2023] [Indexed: 01/14/2024]
Abstract
In drug discovery, targeted polypharmacology, i.e., targeting multiple molecular targets with a single drug, is redefining therapeutic design to address complex diseases. Pre-selected pharmacological profiles, as exemplified in kinase drugs, promise enhanced efficacy and reduced toxicity. Historically, many of such drugs were discovered serendipitously, limiting predictability and efficacy, but currently artificial intelligence (AI) offers a transformative solution. Machine learning and deep learning techniques enable modeling protein structures, generating novel compounds, and decoding their polypharmacological effects, opening an avenue for more systematic and predictive multi-target drug design. This review explores the use of AI in identifying synergistic co-targets and delineating them from anti-targets that lead to adverse effects, and then discusses advances in AI-enabled docking, generative chemistry, and proteochemometric modeling of proteome-wide compound interactions, in the context of polypharmacology. We also provide insights into challenges ahead.
Collapse
|
6
|
Tremmel R, Hofmann U, Haag M, Schaeffeler E, Schwab M. Circulating Biomarkers Instead of Genotyping to Establish Metabolizer Phenotypes. Annu Rev Pharmacol Toxicol 2024; 64:65-87. [PMID: 37585662 DOI: 10.1146/annurev-pharmtox-032023-121106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Pharmacogenomics (PGx) enables personalized treatment for the prediction of drug response and to avoid adverse drug reactions. Currently, PGx mainly relies on the genetic information of absorption, distribution, metabolism, and excretion (ADME) targets such as drug-metabolizing enzymes or transporters to predict differences in the patient's phenotype. However, there is evidence that the phenotype-genotype concordance is limited. Thus, we discuss different phenotyping strategies using exogenous xenobiotics (e.g., drug cocktails) or endogenous compounds for phenotype prediction. In particular, minimally invasive approaches focusing on liquid biopsies offer great potential to preemptively determine metabolic and transport capacities. Early studies indicate that ADME phenotyping using exosomes released from the liver is reliable. In addition, pharmacometric modeling and artificial intelligence improve phenotype prediction. However, further prospective studies are needed to demonstrate the clinical utility of individualized treatment based on phenotyping strategies, not only relying on genetics. The present review summarizes current knowledge and limitations.
Collapse
Affiliation(s)
- Roman Tremmel
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany;
- University of Tuebingen, Tuebingen, Germany
| | - Ute Hofmann
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany;
- University of Tuebingen, Tuebingen, Germany
| | - Mathias Haag
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany;
- University of Tuebingen, Tuebingen, Germany
| | - Elke Schaeffeler
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany;
- University of Tuebingen, Tuebingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies," University of Tuebingen, Tuebingen, Germany
| | - Matthias Schwab
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany;
- University of Tuebingen, Tuebingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies," University of Tuebingen, Tuebingen, Germany
- Departments of Clinical Pharmacology, and Pharmacy and Biochemistry, University of Tuebingen, Tuebingen, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center Heidelberg (DKFZ), Partner Site, Tübingen, Germany
| |
Collapse
|
7
|
Yu Z, Wu Z, Zhou M, Chen L, Li W, Liu G, Tang Y. mtADENet: A novel interpretable method integrating multiple types of network-based inference approaches for prediction of adverse drug events. Comput Biol Med 2024; 168:107831. [PMID: 38081118 DOI: 10.1016/j.compbiomed.2023.107831] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 11/23/2023] [Accepted: 12/04/2023] [Indexed: 01/10/2024]
Abstract
Identification of adverse drug events (ADEs) is crucial to reduce human health risks and accelerate drug safety assessment. ADEs are mainly caused by unintended interactions with primary or additional targets (off-targets). In this study, we proposed a novel interpretable method named mtADENet, which integrates multiple types of network-based inference approaches for ADE prediction. Different from phenotype-based methods, mtADENet introduced computational target profiles predicted by network-based methods to bridge the gap between chemical structures and ADEs, and hence can not only predict ADEs for drugs and novel compounds within or outside the drug-ADE association network, but also provide insights for the elucidation of molecular mechanisms of the ADEs caused by drugs. We constructed a series of network-based prediction models for 23 ADE categories. These models achieved high AUC values ranging from 0.865 to 0.942 in 10-fold cross validation. The best model further showed high performance on four external validation sets, which outperformed two previous network-based methods. To show the practical value of mtADENet, we performed case studies on developmental neurotoxicity and cardio-oncology, and over 50 % of predicted ADEs and targets for drugs and novel compounds were validated by literature. Moreover, mtADENet is freely available at our web server named NetInfer (http://lmmd.ecust.edu.cn/netinfer/). In summary, mtADENet would be a powerful tool for ADE prediction and drug safety assessment in drug discovery and development.
Collapse
Affiliation(s)
- Zhuohang Yu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Zengrui Wu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China.
| | - Moran Zhou
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Long Chen
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China.
| |
Collapse
|
8
|
Dauner DG, Leal E, Adam TJ, Zhang R, Farley JF. Evaluation of four machine learning models for signal detection. Ther Adv Drug Saf 2023; 14:20420986231219472. [PMID: 38157242 PMCID: PMC10752114 DOI: 10.1177/20420986231219472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 11/17/2023] [Indexed: 01/03/2024] Open
Abstract
Background Logistic regression-based signal detection algorithms have benefits over disproportionality analysis due to their ability to handle potential confounders and masking factors. Feature exploration and developing alternative machine learning algorithms can further strengthen signal detection. Objectives Our objective was to compare the signal detection performance of logistic regression, gradient-boosted trees, random forest and support vector machine models utilizing Food and Drug Administration adverse event reporting system data. Design Cross-sectional study. Methods The quarterly data extract files from 1 October 2017 through 31 December 2020 were downloaded. Due to an imbalanced outcome, two training sets were used: one stratified on the outcome variable and another using Synthetic Minority Oversampling Technique (SMOTE). A crude model and a model with tuned hyperparameters were developed for each algorithm. Model performance was compared against a reference set using accuracy, precision, F1 score, recall, the receiver operating characteristic area under the curve (ROCAUC), and the precision-recall curve area under the curve (PRCAUC). Results Models trained on the balanced training set had higher accuracy, F1 score and recall compared to models trained on the SMOTE training set. When using the balanced training set, logistic regression, gradient-boosted trees, random forest and support vector machine models obtained similar performance evaluation metrics. The gradient-boosted trees hyperparameter tuned model had the highest ROCAUC (0.646) and the random forest crude model had the highest PRCAUC (0.839) when using the balanced training set. Conclusion All models trained on the balanced training set performed similarly. Logistic regression models had higher accuracy, precision and recall. Logistic regression, random forest and gradient-boosted trees hyperparameter tuned models had a PRCAUC ⩾ 0.8. All models had an ROCAUC ⩾ 0.5. Including both disproportionality analysis results and additional case report information in models resulted in higher performance evaluation metrics than disproportionality analysis alone.
Collapse
Affiliation(s)
- Daniel G. Dauner
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota Duluth, 232 Life Science, 1110 Kirby Drive, Duluth, MN 55812, USA
| | - Eleazar Leal
- Department of Computer Science, Swenson College of Science and Engineering, University of Minnesota Duluth, Duluth, MN, USA
| | - Terrence J. Adam
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, Institute for Health Informatics, University of Minnesota, Minneapolis, MN, USA
| | - Rui Zhang
- Division of Computational Health Sciences, Department of Surgery, University of Minnesota, Minneapolis, MN, USA
| | - Joel F. Farley
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
9
|
Yu Z, Wu Z, Zhou M, Cao K, Li W, Liu G, Tang Y. EDC-Predictor: A Novel Strategy for Prediction of Endocrine-Disrupting Chemicals by Integrating Pharmacological and Toxicological Profiles. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:18013-18025. [PMID: 37053516 DOI: 10.1021/acs.est.2c08558] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Identification of endocrine-disrupting chemicals (EDCs) is crucial in the reduction of human health risks. However, it is hard to do so because of the complex mechanisms of the EDCs. In this study, we propose a novel strategy named EDC-Predictor to integrate pharmacological and toxicological profiles for the prediction of EDCs. Different from conventional methods that only focus on a few nuclear receptors (NRs), EDC-Predictor considers more targets. It uses computational target profiles from network-based and machine learning-based methods to characterize compounds, including both EDCs and non-EDCs. The best model constructed by these target profiles outperformed those models by molecular fingerprints. In a case study to predict NR-related EDCs, EDC-Predictor showed a wider applicability domain and higher accuracy than four previous tools. Another case study further demonstrated that EDC-Predictor could predict EDCs targeting other proteins rather than NRs. Finally, a free web server was developed to make EDC prediction easier (http://lmmd.ecust.edu.cn/edcpred/). In summary, EDC-Predictor would be a powerful tool in EDC prediction and drug safety assessment.
Collapse
Affiliation(s)
- Zhuohang Yu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Zengrui Wu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Moran Zhou
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Kangjia Cao
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| |
Collapse
|
10
|
Ekins S, Lane TR, Urbina F, Puhl AC. In silico ADME/tox comes of age: twenty years later. Xenobiotica 2023:1-7. [PMID: 37539466 PMCID: PMC10850432 DOI: 10.1080/00498254.2023.2245049] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/01/2023] [Accepted: 08/02/2023] [Indexed: 08/05/2023]
Abstract
In the early 2000s pharmaceutical drug discovery was beginning to use computational approaches for absorption, distribution, metabolism, excretion and toxicity (ADME/Tox, also known as ADMET) prediction. This emphasis on prediction was an effort to reduce the risk of later stage failures from ADME/Tox.Much has been written in the intervening twenty plus years and significant expenditure has occurred in companies developing these in silico capabilities which can be gleaned from publications. It is therefore an appropriate time to briefly reflect on what was proposed then and what the reality is today.20 years ago, we tended to optimise bioactivity and perhaps one ADME/Tox property at a time. Previously pharmaceutical companies needed a whole infrastructure for models - in silico and in vitro experts, IT, champions on a project team, educators and management support. Now we are in the age of generative de novo design where bioactivity and many ADME/Tox properties can be optimised and large language model technologies are available.There are also some challenges such as the focus on very large molecules which may be outside of current ADME/Tox models.We provide an opportunity to look forward with the increasing public data for ADME/Tox as well as expanded types of algorithms available.
Collapse
Affiliation(s)
- Sean Ekins
- Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, NC 27606, USA
| | - Thomas R. Lane
- Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, NC 27606, USA
| | - Fabio Urbina
- Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, NC 27606, USA
| | - Ana C. Puhl
- Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, NC 27606, USA
| |
Collapse
|
11
|
Dauner DG, Zhang R, Adam TJ, Leal E, Heitlage V, Farley JF. Performance of subgrouped proportional reporting ratios in the US Food and Drug Administration (FDA) adverse event reporting system. Expert Opin Drug Saf 2023; 22:589-597. [PMID: 36800190 DOI: 10.1080/14740338.2023.2182289] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 02/06/2023] [Indexed: 02/18/2023]
Abstract
BACKGROUND Many signal detection algorithms give the same weight to information from all products and patients, which may result in signals being masked or false positives being flagged as potential signals. Subgrouped analysis can be used to help correct for this. RESEARCH DESIGN AND METHODS The publicly available US Food and Drug Administration Adverse Event Reporting System quarterly data extract files from 1 January 2015 through 30 September 2017 were utilized. A proportional reporting ratio (PRR) analysis subgrouped by either age, sex, ADE report type, seriousness of ADE, or reporter was compared to the crude PRR analysis using sensitivity, specificity, precision, and c-statistic. RESULTS Subgrouping by age (n = 78, 34.5% increase), sex (n = 67, 15.5% increase), and reporter (n = 64, 10.3% increase) identified more signals than the crude analysis. Subgrouping by either age or sex increased both the sensitivity and precision. Subgrouping by report type or seriousness resulted in fewer signals (n = 50, -13.8% for both). Subgrouped analyses had higher c-statistic values, with age having the highest (0.468). CONCLUSIONS Subgrouping by either age or sex produced more signals with higher sensitivity and precision than the crude PRR analysis. Subgrouping by these variables can unmask potentially important associations.
Collapse
Affiliation(s)
- Daniel G Dauner
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota, Minneapolis, Minnesota, USA
| | - Rui Zhang
- Division of Computational Health Sciences, Department of Surgery, University of Minnesota, Minneapolis, Minnesota, USA
| | - Terrence J Adam
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Eleazar Leal
- Department of Computer Science, Swenson College of Science and Engineering, University of Minnesota, Duluth, Minnesota, USA
| | - Viviene Heitlage
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota, Minneapolis, Minnesota, USA
| | - Joel F Farley
- Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
12
|
Early Detection of Adverse Drug Reaction Signals by Association Rule Mining Using Large-Scale Administrative Claims Data. Drug Saf 2023; 46:371-389. [PMID: 36828947 PMCID: PMC10113351 DOI: 10.1007/s40264-023-01278-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/06/2023] [Indexed: 02/26/2023]
Abstract
INTRODUCTION Adverse drug reactions (ADRs) are a leading cause of mortality worldwide and should be detected promptly to reduce health risks to patients. A data-mining approach using large-scale medical records might be a useful method for the early detection of ADRs. Many studies have analyzed medical records to detect ADRs; however, most of them have focused on a narrow range of ADRs, limiting their usefulness. OBJECTIVE This study aimed to identify methods for the early detection of a wide range of ADR signals. METHODS First, to evaluate the performance in signal detection of ADRs by data-mining, we attempted to create a gold standard based on clinical evidence. Second, association rule mining (ARM) was applied to patient symptoms and medications registered in claims data, followed by evaluating ADR signal detection performance. RESULTS We created a new gold standard consisting of 92 positive and 88 negative controls. In the assessment of ARM using claims data, the areas under the receiver-operating characteristic curve and the precision-recall curve were 0.80 and 0.83, respectively. If the detection criteria were defined as lift > 1, conviction > 1, and p-value < 0.05, ARM could identify 156 signals, of which 90 were true positive controls (sensitivity: 0.98, specificity: 0.25). Evaluation of the capability of ARM with short periods of data revealed that ARM could detect a greater number of positive controls than the conventional analysis method. CONCLUSIONS ARM of claims data may be effective in the early detection of a wide range of ADR signals.
Collapse
|
13
|
Das P, Mazumder DH. An extensive survey on the use of supervised machine learning techniques in the past two decades for prediction of drug side effects. Artif Intell Rev 2023; 56:1-28. [PMID: 36819660 PMCID: PMC9930028 DOI: 10.1007/s10462-023-10413-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2023] [Indexed: 02/19/2023]
Abstract
Approved drugs for sale must be effective and safe, implying that the drug's advantages outweigh its known harmful side effects. Side effects (SE) of drugs are one of the common reasons for drug failure that may halt the whole drug discovery pipeline. The side effects might vary from minor concerns like a runny nose to potentially life-threatening issues like liver damage, heart attack, and death. Therefore, predicting the side effects of the drug is vital in drug development, discovery, and design. Supervised machine learning-based side effects prediction task has recently received much attention since it reduces time, chemical waste, design complexity, risk of failure, and cost. The advancement of supervised learning approaches for predicting side effects have emerged as essential computational tools. Supervised machine learning technique provides early information on drug side effects to develop an effective drug based on drug properties. Still, there are several challenges to predicting drug side effects. Thus, a near-exhaustive survey is carried out in this paper on the use of supervised machine learning approaches employed in drug side effects prediction tasks in the past two decades. In addition, this paper also summarized the drug descriptor required for the side effects prediction task, commonly utilized drug properties sources, computational models, and their performances. Finally, the research gap, open problems, and challenges for the further supervised learning-based side effects prediction task have been discussed.
Collapse
Affiliation(s)
- Pranab Das
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| | - Dilwar Hussain Mazumder
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| |
Collapse
|
14
|
Kumar M, Nguyen TPN, Kaur J, Singh TG, Soni D, Singh R, Kumar P. Opportunities and challenges in application of artificial intelligence in pharmacology. Pharmacol Rep 2023; 75:3-18. [PMID: 36624355 PMCID: PMC9838466 DOI: 10.1007/s43440-022-00445-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 12/23/2022] [Accepted: 12/25/2022] [Indexed: 01/11/2023]
Abstract
Artificial intelligence (AI) is a machine science that can mimic human behaviour like intelligent analysis of data. AI functions with specialized algorithms and integrates with deep and machine learning. Living in the digital world can generate a huge amount of medical data every day. Therefore, we need an automated and reliable evaluation tool that can make decisions more accurately and faster. Machine learning has the potential to learn, understand and analyse the data used in healthcare systems. In the last few years, AI is known to be employed in various fields in pharmaceutical science especially in pharmacological research. It helps in the analysis of preclinical (laboratory animals) and clinical (in human) trial data. AI also plays important role in various processes such as drug discovery/manufacturing, diagnosis of big data for disease identification, personalized treatment, clinical trial research, radiotherapy, surgical robotics, smart electronic health records, and epidemic outbreak prediction. Moreover, AI has been used in the evaluation of biomarkers and diseases. In this review, we explain various models and general processes of machine learning and their role in pharmacological science. Therefore, AI with deep learning and machine learning could be relevant in pharmacological research.
Collapse
Affiliation(s)
- Mandeep Kumar
- Department of Pharmacy, Unit of Pharmacology and Toxicology, University of Genoa, Genoa, Italy
| | - T P Nhung Nguyen
- Department of Pharmacy, Unit of Pharmacology and Toxicology, University of Genoa, Genoa, Italy
- Department of Pharmacy, Da Nang University of Medical Technology and Pharmacy, Da Nang, Vietnam
| | - Jasleen Kaur
- Department of Pharmacology and Toxicology, National Institute of Pharmaceutical Education and Research (NIPER), Lucknow, Uttar Pradesh, 226002, India
| | | | - Divya Soni
- Department of Pharmacology, Central University of Punjab, Ghudda, Bathinda, Punjab, 151401, India
| | - Randhir Singh
- Department of Pharmacology, Central University of Punjab, Ghudda, Bathinda, Punjab, 151401, India
| | - Puneet Kumar
- Department of Pharmacology, Central University of Punjab, Ghudda, Bathinda, Punjab, 151401, India.
| |
Collapse
|
15
|
McNair D. Artificial Intelligence and Machine Learning for Lead-to-Candidate Decision-Making and Beyond. Annu Rev Pharmacol Toxicol 2023; 63:77-97. [PMID: 35679624 DOI: 10.1146/annurev-pharmtox-051921-023255] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The use of artificial intelligence (AI) and machine learning (ML) in pharmaceutical research and development has to date focused on research: target identification; docking-, fragment-, and motif-based generation of compound libraries; modeling of synthesis feasibility; rank-ordering likely hits according to structural and chemometric similarity to compounds having known activity and affinity to the target(s); optimizing a smaller library for synthesis and high-throughput screening; and combining evidence from screening to support hit-to-lead decisions. Applying AI/ML methods to lead optimization and lead-to-candidate (L2C) decision-making has shown slower progress, especially regarding predicting absorption, distribution, metabolism, excretion, and toxicology properties. The present review surveys reasons why this is so, reports progress that has occurred in recent years, and summarizes some of the issues that remain. Effective AI/ML tools to derisk L2C and later phases of development are important to accelerate the pharmaceutical development process, ameliorate escalating development costs, and achieve greater success rates.
Collapse
Affiliation(s)
- Douglas McNair
- Global Health, Integrated Development, Bill & Melinda Gates Foundation, Seattle, Washington, USA;
| |
Collapse
|
16
|
Dalal S, Onyema EM, Malik A. Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy. World J Gastroenterol 2022; 28:6551-6563. [PMID: 36569269 PMCID: PMC9782838 DOI: 10.3748/wjg.v28.i46.6551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 07/27/2022] [Accepted: 11/21/2022] [Indexed: 12/08/2022] Open
Abstract
BACKGROUND Liver disease indicates any pathology that can harm or destroy the liver or prevent it from normal functioning. The global community has recently witnessed an increase in the mortality rate due to liver disease. This could be attributed to many factors, among which are human habits, awareness issues, poor healthcare, and late detection. To curb the growing threats from liver disease, early detection is critical to help reduce the risks and improve treatment outcome. Emerging technologies such as machine learning, as shown in this study, could be deployed to assist in enhancing its prediction and treatment.
AIM To present a more efficient system for timely prediction of liver disease using a hybrid eXtreme Gradient Boosting model with hyperparameter tuning with a view to assist in early detection, diagnosis, and reduction of risks and mortality associated with the disease.
METHODS The dataset used in this study consisted of 416 people with liver problems and 167 with no such history. The data were collected from the state of Andhra Pradesh, India, through https://www.kaggle.com/datasets/uciml/indian-liver-patient-records. The population was divided into two sets depending on the disease state of the patient. This binary information was recorded in the attribute "is_patient".
RESULTS The results indicated that the chi-square automated interaction detection and classification and regression trees models achieved an accuracy level of 71.36% and 73.24%, respectively, which was much better than the conventional method. The proposed solution would assist patients and physicians in tackling the problem of liver disease and ensuring that cases are detected early to prevent it from developing into cirrhosis (scarring) and to enhance the survival of patients. The study showed the potential of machine learning in health care, especially as it concerns disease prediction and monitoring.
CONCLUSION This study contributed to the knowledge of machine learning application to health and to the efforts toward combating the problem of liver disease. However, relevant authorities have to invest more into machine learning research and other health technologies to maximize their potential.
Collapse
Affiliation(s)
- Surjeet Dalal
- Department of CSE, Amity University, Gurugram 122413, Haryana, India
| | - Edeh Michael Onyema
- Department of Mathematics and Computer Science, Coal City University, Enugu 400102, Nigeria
| | - Amit Malik
- Department of CSE, SRM University, Delhi-NCR, Sonipat 131001, Haryana, India
| |
Collapse
|
17
|
Pandey M, Radaeva M, Mslati H, Garland O, Fernandez M, Ester M, Cherkasov A. Ligand Binding Prediction Using Protein Structure Graphs and Residual Graph Attention Networks. Molecules 2022; 27:molecules27165114. [PMID: 36014351 PMCID: PMC9416537 DOI: 10.3390/molecules27165114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 08/03/2022] [Accepted: 08/09/2022] [Indexed: 11/25/2022] Open
Abstract
Computational prediction of ligand–target interactions is a crucial part of modern drug discovery as it helps to bypass high costs and labor demands of in vitro and in vivo screening. As the wealth of bioactivity data accumulates, it provides opportunities for the development of deep learning (DL) models with increasing predictive powers. Conventionally, such models were either limited to the use of very simplified representations of proteins or ineffective voxelization of their 3D structures. Herein, we present the development of the PSG-BAR (Protein Structure Graph-Binding Affinity Regression) approach that utilizes 3D structural information of the proteins along with 2D graph representations of ligands. The method also introduces attention scores to selectively weight protein regions that are most important for ligand binding. Results: The developed approach demonstrates the state-of-the-art performance on several binding affinity benchmarking datasets. The attention-based pooling of protein graphs enables identification of surface residues as critical residues for protein–ligand binding. Finally, we validate our model predictions against an experimental assay on a viral main protease (Mpro)—the hallmark target of SARS-CoV-2 coronavirus.
Collapse
Affiliation(s)
- Mohit Pandey
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
| | - Mariia Radaeva
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
| | - Hazem Mslati
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
| | - Olivia Garland
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
| | - Michael Fernandez
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
| | - Martin Ester
- School of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| | - Artem Cherkasov
- Vancouver Prostate Centre, Department of Urologic Sciences, University of British Columbia, Vancouver, BC V6T 1Z2, Canada
- Correspondence:
| |
Collapse
|
18
|
Galletti C, Aguirre-Plans J, Oliva B, Fernandez-Fuentes N. Prediction of Adverse Drug Reaction Linked to Protein Targets Using Network-Based Information and Machine Learning. FRONTIERS IN BIOINFORMATICS 2022; 2:906644. [PMID: 36304303 PMCID: PMC9580901 DOI: 10.3389/fbinf.2022.906644] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 06/02/2022] [Indexed: 11/17/2022] Open
Abstract
Drug discovery attrition rates, particularly at advanced clinical trial stages, are high because of unexpected adverse drug reactions (ADR) elicited by novel drug candidates. Predicting undesirable ADRs produced by the modulation of certain protein targets would contribute to developing safer drugs, thereby reducing economic losses associated with high attrition rates. As opposed to the more traditional drug-centric approach, we propose a target-centric approach to predict associations between protein targets and ADRs. The implementation of the predictor is based on a machine learning classifier that integrates a set of eight independent network-based features. These include a network diffusion-based score, identification of protein modules based on network clustering algorithms, functional similarity among proteins, network distance to proteins that are part of safety panels used in preclinical drug development, set of network descriptors in the form of degree and betweenness centrality measurements, and conservation. This diverse set of descriptors were used to generate predictors based on different machine learning classifiers ranging from specific models for individual ADR to higher levels of abstraction as per MEDDRA hierarchy such as system organ class. The results obtained from the different machine-learning classifiers, namely, support vector machine, random forest, and neural network were further analyzed as a meta-predictor exploiting three different voting systems, namely, jury vote, consensus vote, and red flag, obtaining different models for each of the ADRs in analysis. The level of accuracy of the predictors justifies the identification of problematic protein targets both at the level of individual ADR as well as a set of related ADRs grouped in common system organ classes. As an example, the prediction of ventricular tachycardia achieved an accuracy and precision of 0.83 and 0.90, respectively, and a Matthew correlation coefficient of 0.70. We believe that this approach is a good complement to the existing methodologies devised to foresee potential liabilities in preclinical drug discovery. The method is available through the DocTOR utility at GitHub (https://github.com/cristian931/DocTOR).
Collapse
Affiliation(s)
- Cristiano Galletti
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Barcelona, Spain
| | - Joaquim Aguirre-Plans
- Department of Physics, Network Science Institute, Northeastern University, Boston, MA, United States
| | - Baldo Oliva
- Department of Experimental and Health Sciences, Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
| | - Narcis Fernandez-Fuentes
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Barcelona, Spain
- *Correspondence: Narcis Fernandez-Fuentes,
| |
Collapse
|
19
|
Königs C, Friedrichs M, Dietrich T. The heterogeneous pharmacological medical biochemical network PharMeBINet. Sci Data 2022; 9:393. [PMID: 35821017 PMCID: PMC9276653 DOI: 10.1038/s41597-022-01510-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/22/2022] [Indexed: 12/04/2022] Open
Abstract
Heterogeneous biomedical pharmacological databases are important for multiple fields in bioinformatics. Hetionet is a freely available database combining diverse entities and relationships from 29 public resources. Therefore, it is used as the basis for this project. 19 additional pharmacological medical and biological databases such as CTD, DrugBank, and ClinVar are parsed and integrated into Neo4j. Afterwards, the information is merged into the Hetionet structure. Different mapping methods are used such as external identification systems or name mapping. The resulting open-source Neo4j database PharMeBINet has 2,869,407 different nodes with 66 labels and 15,883,653 relationships with 208 edge types. It is a heterogeneous database containing interconnected information on ADRs, diseases, drugs, genes, gene variations, proteins, and more. Relationships between these entities represent drug-drug interactions or drug-causes-ADR relations, to name a few. It has much potential for developing further data analyses including machine learning applications. A web application for accessing the database is free to use for everyone and available at https://pharmebi.net. Additionally, the database is deposited on Zenodo at 10.5281/zenodo.6578218. Measurement(s) | data integration objective | Technology Type(s) | database creation objective |
Collapse
Affiliation(s)
- Cassandra Königs
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany.
| | - Marcel Friedrichs
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany
| | - Theresa Dietrich
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany
| |
Collapse
|
20
|
Khoury P, Srinivasan R, Kakumanu S, Ochoa S, Keswani A, Sparks R, Rider NL. A Framework for Augmented Intelligence in Allergy and Immunology Practice and Research—A Work Group Report of the AAAAI Health Informatics, Technology, and Education Committee. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY: IN PRACTICE 2022; 10:1178-1188. [PMID: 35300959 PMCID: PMC9205719 DOI: 10.1016/j.jaip.2022.01.047] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 01/19/2022] [Accepted: 01/20/2022] [Indexed: 10/18/2022]
Abstract
Artificial and augmented intelligence (AI) and machine learning (ML) methods are expanding into the health care space. Big data are increasingly used in patient care applications, diagnostics, and treatment decisions in allergy and immunology. How these technologies will be evaluated, approved, and assessed for their impact is an important consideration for researchers and practitioners alike. With the potential of ML, deep learning, natural language processing, and other assistive methods to redefine health care usage, a scaffold for the impact of AI technology on research and patient care in allergy and immunology is needed. An American Academy of Asthma Allergy and Immunology Health Information Technology and Education subcommittee workgroup was convened to perform a scoping review of AI within health care as well as the specialty of allergy and immunology to address impacts on allergy and immunology practice and research as well as potential challenges including education, AI governance, ethical and equity considerations, and potential opportunities for the specialty. There are numerous potential clinical applications of AI in allergy and immunology that range from disease diagnosis to multidimensional data reduction in electronic health records or immunologic datasets. For appropriate application and interpretation of AI, specialists should be involved in the design, validation, and implementation of AI in allergy and immunology. Challenges include incorporation of data science and bioinformatics into training of future allergists-immunologists.
Collapse
|
21
|
Manley PW, Huth F, Moussaoui S, Schoepfer J. A kinase inhibitor which specifically targets the ABL myristate pocket (STAMP), but unlike asciminib crosses the blood–brain barrier. Bioorg Med Chem Lett 2022; 59:128577. [DOI: 10.1016/j.bmcl.2022.128577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 01/03/2022] [Accepted: 01/15/2022] [Indexed: 11/25/2022]
|
22
|
Yu Z, Wu Z, Li W, Liu G, Tang Y. ADENet: a novel network-based inference method for prediction of drug adverse events. Brief Bioinform 2022; 23:6510157. [PMID: 35039845 DOI: 10.1093/bib/bbab580] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Revised: 12/02/2021] [Accepted: 12/19/2021] [Indexed: 11/13/2022] Open
Abstract
Identification of adverse drug events (ADEs) is crucial to reduce human health risks and improve drug safety assessment. With an increasing number of biological and medical data, computational methods such as network-based methods were proposed for ADE prediction with high efficiency and low cost. However, previous network-based methods rely on the topological information of known drug-ADE networks, and hence cannot make predictions for novel compounds without any known ADE. In this study, we introduced chemical substructures to bridge the gap between the drug-ADE network and novel compounds, and developed a novel network-based method named ADENet, which can predict potential ADEs for not only drugs within the drug-ADE network, but also novel compounds outside the network. To show the performance of ADENet, we collected drug-ADE associations from a comprehensive database named MetaADEDB and constructed a series of network-based prediction models. These models obtained high area under the receiver operating characteristic curve values ranging from 0.871 to 0.947 in 10-fold cross-validation. The best model further showed high performance in external validation, which outperformed a previous network-based and a recent deep learning-based method. Using several approved drugs as case studies, we found that 32-54% of the predicted ADEs can be validated by the literature, indicating the practical value of ADENet. Moreover, ADENet is freely available at our web server named NetInfer (http://lmmd.ecust.edu.cn/netinfer). In summary, our method would provide a promising tool for ADE prediction and drug safety assessment in drug discovery and development.
Collapse
Affiliation(s)
- Zhuohang Yu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Zengrui Wu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| |
Collapse
|
23
|
Zhao Y, Yu Y, Wang H, Li Y, Deng Y, Jiang G, Luo Y. Machine Learning in Causal Inference: Application in Pharmacovigilance. Drug Saf 2022; 45:459-476. [PMID: 35579811 PMCID: PMC9114053 DOI: 10.1007/s40264-022-01155-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2022] [Indexed: 01/28/2023]
Abstract
Monitoring adverse drug events or pharmacovigilance has been promoted by the World Health Organization to assure the safety of medicines through a timely and reliable information exchange regarding drug safety issues. We aim to discuss the application of machine learning methods as well as causal inference paradigms in pharmacovigilance. We first reviewed data sources for pharmacovigilance. Then, we examined traditional causal inference paradigms, their applications in pharmacovigilance, and how machine learning methods and causal inference paradigms were integrated to enhance the performance of traditional causal inference paradigms. Finally, we summarized issues with currently mainstream correlation-based machine learning models and how the machine learning community has tried to address these issues by incorporating causal inference paradigms. Our literature search revealed that most existing data sources and tasks for pharmacovigilance were not designed for causal inference. Additionally, pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. We highlight several currently trending directions or gaps to integrate causal inference with machine learning in pharmacovigilance research. Finally, our literature search revealed that the adoption of causal paradigms can mitigate known issues with machine learning models. We foresee that the pharmacovigilance domain can benefit from the progress in the machine learning field.
Collapse
Affiliation(s)
- Yiqing Zhao
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yue Yu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Hanyin Wang
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yikuan Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yu Deng
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA.
| |
Collapse
|
24
|
Bassan A, Alves VM, Amberg A, Anger LT, Auerbach S, Beilke L, Bender A, Cronin MT, Cross KP, Hsieh JH, Greene N, Kemper R, Kim MT, Mumtaz M, Noeske T, Pavan M, Pletz J, Russo DP, Sabnis Y, Schaefer M, Szabo DT, Valentin JP, Wichard J, Williams D, Woolley D, Zwickl C, Myatt GJ. In silico approaches in organ toxicity hazard assessment: current status and future needs in predicting liver toxicity. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2021; 20:100187. [PMID: 35340402 PMCID: PMC8955833 DOI: 10.1016/j.comtox.2021.100187] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Abstract
Hepatotoxicity is one of the most frequently observed adverse effects resulting from exposure to a xenobiotic. For example, in pharmaceutical research and development it is one of the major reasons for drug withdrawals, clinical failures, and discontinuation of drug candidates. The development of faster and cheaper methods to assess hepatotoxicity that are both more sustainable and more informative is critically needed. The biological mechanisms and processes underpinning hepatotoxicity are summarized and experimental approaches to support the prediction of hepatotoxicity are described, including toxicokinetic considerations. The paper describes the increasingly important role of in silico approaches and highlights challenges to the adoption of these methods including the lack of a commonly agreed upon protocol for performing such an assessment and the need for in silico solutions that take dose into consideration. A proposed framework for the integration of in silico and experimental information is provided along with a case study describing how computational methods have been used to successfully respond to a regulatory question concerning non-genotoxic impurities in chemically synthesized pharmaceuticals.
Collapse
Affiliation(s)
- Arianna Bassan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Vinicius M. Alves
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Alexander Amberg
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | - Scott Auerbach
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Lisa Beilke
- Toxicology Solutions Inc., San Diego, CA, USA
| | - Andreas Bender
- AI and Data Analytics, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Cambridge, UK
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW
| | - Mark T.D. Cronin
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | | | - Jui-Hua Hsieh
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Nigel Greene
- Data Science and AI, DSM, IMED Biotech Unit, AstraZeneca, Boston, USA
| | - Raymond Kemper
- Nuvalent, One Broadway, 14th floor, Cambridge, MA, 02142, USA
| | - Marlene T. Kim
- US Food and Drug Administration, Center for Drug Evaluation and Research, Silver Spring, MD, 20993, USA
| | - Moiz Mumtaz
- Office of the Associate Director for Science (OADS), Agency for Toxic Substances and Disease, Registry, US Department of Health and Human Services, Atlanta, GA, USA
| | - Tobias Noeske
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Manuela Pavan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Julia Pletz
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | - Daniel P. Russo
- Department of Chemistry, Rutgers University, Camden, NJ 08102, USA
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Yogesh Sabnis
- UCB Biopharma SRL, Chemin du Foriest – B-1420 Braine-l’Alleud, Belgium
| | - Markus Schaefer
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | | | - Joerg Wichard
- Bayer AG, Genetic Toxicology, Müllerstr. 178, 13353 Berlin, Germany
| | - Dominic Williams
- Functional & Mechanistic Safety, Clinical Pharmacology & Safety Sciences, AstraZeneca, Darwin Building 310, Cambridge Science Park, Milton Rd, Cambridge CB4 0FZ, UK
| | - David Woolley
- ForthTox Limited, PO Box 13550, Linlithgow, EH49 7YU, UK
| | - Craig Zwickl
- Transendix LLC, 1407 Moores Manor, Indianapolis, IN 46229, USA
| | - Glenn J. Myatt
- Instem, 1393 Dublin Road, Columbus, OH 43215. USA
- Corresponding author. (G.J. Myatt)
| |
Collapse
|
25
|
Galletti C, Bota PM, Oliva B, Fernandez-Fuentes N. Mining drug-target and drug-adverse drug reaction databases to identify target-adverse drug reaction relationships. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6408542. [PMID: 34679164 PMCID: PMC8533369 DOI: 10.1093/database/baab068] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 10/04/2021] [Accepted: 10/07/2021] [Indexed: 01/05/2023]
Abstract
The level of attrition on drug discovery, particularly at advanced stages, is very high due to unexpected adverse drug reactions (ADRs) caused by drug candidates, and thus, being able to predict undesirable responses when modulating certain protein targets would contribute to the development of safer drugs and have important economic implications. On the one hand, there are a number of databases that compile information of drug-target interactions. On the other hand, there are a number of public resources that compile information on drugs and ADR. It is therefore possible to link target and ADRs using drug entities as connecting elements. Here, we present T-ARDIS (Target-Adverse Reaction Database Integrated Search) database, a resource that provides comprehensive information on proteins and associated ADRs. By combining the information from drug-protein and drug-ADR databases, we statistically identify significant associations between proteins and ADRs. Besides describing the relationship between proteins and ADRs, T-ARDIS provides detailed description about proteins along with the drug and adverse reaction information. Currently T-ARDIS contains over 3000 ADR and 248 targets for a total of more 17 000 pairwise interactions. Each entry can be retrieved through multiple search terms including target Uniprot ID, gene name, adverse effect and drug name. Ultimately, the T-ARDIS database has been created in response to the increasing interest in identifying early in the drug development pipeline potentially problematic protein targets whose modulation could result in ADRs. Database URL: http://www.bioinsilico.org/T-ARDIS.
Collapse
Affiliation(s)
- Cristiano Galletti
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Carrer Laura 13, Vic, Catalonia 08500, Spain
| | - Patricia Mirela Bota
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Carrer Laura 13, Vic, Catalonia 08500, Spain.,Department of Experimental and Health Sciences, Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Catalonia 08003, Spain
| | - Baldo Oliva
- Department of Experimental and Health Sciences, Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Catalonia 08003, Spain
| | - Narcis Fernandez-Fuentes
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Carrer Laura 13, Vic, Catalonia 08500, Spain
| |
Collapse
|
26
|
Zhang X, Sumathipala M, Zitnik M. Population-scale identification of differential adverse events before and during a pandemic. NATURE COMPUTATIONAL SCIENCE 2021; 1:666-677. [PMID: 38217191 PMCID: PMC10766557 DOI: 10.1038/s43588-021-00138-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 09/08/2021] [Indexed: 01/15/2024]
Abstract
Adverse patient safety events, unintended injuries resulting from medical therapy, were associated with 110,000 deaths in the United States in 2019. A nationwide pandemic (such as COVID-19) further challenges the ability of healthcare systems to ensure safe medication use and the pandemic's effects on safety events remain poorly understood. Here, we investigate drug safety events across demographic groups before and during a pandemic using a dataset of 1,425,371 reports involving 2,821 drugs and 7,761 adverse events. Among 64 adverse events identified by our analyses, we find 54 increased in frequency during the pandemic, despite a 4.4% decrease in the total number of reports. Out of 53 adverse events with a pre-pandemic gender gap, 33 have seen their gap increase with the pandemic onset. We find that the number of adverse events with an increased reporting ratio is higher in adults (by 16.8%) than in older patients. Our findings have implications for safe medication use and preventable healthcare inequality in public health emergencies.
Collapse
Affiliation(s)
- Xiang Zhang
- Department of Biomedical Informatics, Harvard Medical School, Harvard University, Boston, MA, USA
| | - Marissa Sumathipala
- Department of Biomedical Informatics, Harvard Medical School, Harvard University, Boston, MA, USA
| | - Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Harvard University, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Harvard Data Science Initiative, Cambridge, MA, USA.
| |
Collapse
|
27
|
Seo S, Kim Y, Han HJ, Son WC, Hong ZY, Sohn I, Shim J, Hwang C. Predicting Successes and Failures of Clinical Trials With Outer Product-Based Convolutional Neural Network. Front Pharmacol 2021; 12:670670. [PMID: 34220508 PMCID: PMC8242994 DOI: 10.3389/fphar.2021.670670] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 05/24/2021] [Indexed: 12/04/2022] Open
Abstract
Despite several improvements in the drug development pipeline over the past decade, drug failures due to unexpected adverse effects have rapidly increased at all stages of clinical trials. To improve the success rate of clinical trials, it is necessary to identify potential loser drug candidates that may fail at clinical trials. Therefore, we need to develop reliable models for predicting the outcomes of clinical trials of drug candidates, which have the potential to guide the drug discovery process. In this study, we propose an outer product–based convolutional neural network (OPCNN) model which integrates effectively chemical features of drugs and target-based features. The validation results via 10-fold cross-validations on the dataset used for a data-driven approach PrOCTOR proved that our OPCNN model performs quite well in terms of accuracy, F1-score, Matthews correlation coefficient (MCC), precision, recall, area under the curve (AUC) of the receiver operating characteristic, and area under the precision–recall curve (AUPRC). In particular, the proposed OPCNN model showed the best performance in terms of MCC, which is widely used in biomedicine as a performance metric and is a more reliable statistical measure. Through 10-fold cross-validation experiments, the accuracy of the OPCNN model is as high as 0.9758, F1 score is as high as 0.9868, the MCC reaches 0.8451, the precision is as high as 0.9889, the recall is as high as 0.9893, the AUC is as high as 0.9824, and the AUPRC is as high as 0.9979. The results proved that our OPCNN model shows significantly good prediction performance on outcomes of clinical trials and it can be quite helpful in early drug discovery.
Collapse
Affiliation(s)
- Sangwoo Seo
- Department of Data and Knowledge Service Engineering, Dankook University, Gyeonggido, Korea
| | - Youngmin Kim
- Department of Statistics, Dankook University, Gyeonggido, Korea
| | - Hyo-Jeong Han
- Department of Pathology, College of Medicine, University of Ulsan, Asan Medical Center, Seoul, Korea
| | - Woo Chan Son
- Department of Pathology, College of Medicine, University of Ulsan, Asan Medical Center, Seoul, Korea
| | | | | | - Jooyong Shim
- Department of Statistics, Institute of Statistical Information, Inje University, Gyeongsangnamdo, Korea
| | - Changha Hwang
- Department of Data and Knowledge Service Engineering, Dankook University, Gyeonggido, Korea.,Department of Statistics, Dankook University, Gyeonggido, Korea
| |
Collapse
|
28
|
Ji Y, Lotfollahi M, Wolf FA, Theis FJ. Machine learning for perturbational single-cell omics. Cell Syst 2021; 12:522-537. [PMID: 34139164 DOI: 10.1016/j.cels.2021.05.016] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 05/04/2021] [Accepted: 05/19/2021] [Indexed: 12/18/2022]
Abstract
Cell biology is fundamentally limited in its ability to collect complete data on cellular phenotypes and the wide range of responses to perturbation. Areas such as computer vision and speech recognition have addressed this problem of characterizing unseen or unlabeled conditions with the combined advances of big data, deep learning, and computing resources in the past 5 years. Similarly, recent advances in machine learning approaches enabled by single-cell data start to address prediction tasks in perturbation response modeling. We first define objectives in learning perturbation response in single-cell omics; survey existing approaches, resources, and datasets (https://github.com/theislab/sc-pert); and discuss how a perturbation atlas can enable deep learning models to construct an informative perturbation latent space. We then examine future avenues toward more powerful and explainable modeling using deep neural networks, which enable the integration of disparate information sources and an understanding of heterogeneous, complex, and unseen systems.
Collapse
Affiliation(s)
- Yuge Ji
- Institute of Computational Biology, Helmholtz Center Munich, Munich, Germany; Department of Mathematics, Technical University of Munich, Munich, Germany
| | - Mohammad Lotfollahi
- Institute of Computational Biology, Helmholtz Center Munich, Munich, Germany; TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - F Alexander Wolf
- Institute of Computational Biology, Helmholtz Center Munich, Munich, Germany; Cellarity, Cambridge, MA, USA
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz Center Munich, Munich, Germany; Department of Mathematics, Technical University of Munich, Munich, Germany; Cellarity, Cambridge, MA, USA.
| |
Collapse
|
29
|
Remodelling structure-based drug design using machine learning. Emerg Top Life Sci 2021; 5:13-27. [PMID: 33825834 DOI: 10.1042/etls20200253] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 03/17/2021] [Accepted: 03/30/2021] [Indexed: 12/13/2022]
Abstract
To keep up with the pace of rapid discoveries in biomedicine, a plethora of research endeavors had been directed toward Rational Drug Development that slowly gave way to Structure-Based Drug Design (SBDD). In the past few decades, SBDD played a stupendous role in identification of novel drug-like molecules that are capable of altering the structures and/or functions of the target macromolecules involved in different disease pathways and networks. Unfortunately, post-delivery drug failures due to adverse drug interactions have constrained the use of SBDD in biomedical applications. However, recent technological advancements, along with parallel surge in clinical research have led to the concomitant establishment of other powerful computational techniques such as Artificial Intelligence (AI) and Machine Learning (ML). These leading-edge tools with the ability to successfully predict side-effects of a wide range of drugs have eventually taken over the field of drug design. ML, a subset of AI, is a robust computational tool that is capable of data analysis and analytical model building with minimal human intervention. It is based on powerful algorithms that use huge sets of 'training data' as inputs to predict new output values, which improve iteratively through experience. In this review, along with a brief discussion on the evolution of the drug discovery process, we have focused on the methodologies pertaining to the technological advancements of machine learning. This review, with specific examples, also emphasises the tremendous contributions of ML in the field of biomedicine, while exploring possibilities for future developments.
Collapse
|
30
|
Ietswaart R, Gyori BM, Bachman JA, Sorger PK, Churchman LS. GeneWalk identifies relevant gene functions for a biological context using network representation learning. Genome Biol 2021; 22:55. [PMID: 33526072 PMCID: PMC7852222 DOI: 10.1186/s13059-021-02264-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 01/05/2021] [Indexed: 12/13/2022] Open
Abstract
A bottleneck in high-throughput functional genomics experiments is identifying the most important genes and their relevant functions from a list of gene hits. Gene Ontology (GO) enrichment methods provide insight at the gene set level. Here, we introduce GeneWalk ( github.com/churchmanlab/genewalk ) that identifies individual genes and their relevant functions critical for the experimental setting under examination. After the automatic assembly of an experiment-specific gene regulatory network, GeneWalk uses representation learning to quantify the similarity between vector representations of each gene and its GO annotations, yielding annotation significance scores that reflect the experimental context. By performing gene- and condition-specific functional analysis, GeneWalk converts a list of genes into data-driven hypotheses.
Collapse
Affiliation(s)
- Robert Ietswaart
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA
| | - Benjamin M Gyori
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, 02115, USA
| | - John A Bachman
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, 02115, USA
| | - Peter K Sorger
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, 02115, USA
| | - L Stirling Churchman
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA.
| |
Collapse
|
31
|
Zhou Y, Li S, Zhao Y, Guo M, Liu Y, Li M, Wen Z. Quantitative Structure-Activity Relationship (QSAR) Model for the Severity Prediction of Drug-Induced Rhabdomyolysis by Using Random Forest. Chem Res Toxicol 2021; 34:514-521. [PMID: 33393765 DOI: 10.1021/acs.chemrestox.0c00347] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Drug-induced rhabdomyolysis (DIR) is a rare and potentially life-threatening muscle injury that is characterized by low incidence and high risk. To our best knowledge, the performance of the current predictive models for the early detection of DIR is suboptimal because of the scarcity and dispersion of DIR cases. Therefore, on the basis of the curated drug information from the Drug-Induced Rhabdomyolysis Atlas (DIRA) database, we proposed a random forest (RF) model to predict the DIR severity of the marketed drugs. Compared with the state-of-art methods, our proposed model outperformed extreme gradient boosting, support vector machine, and logistic regression in distinguishing the Most-DIR concern drugs from the No-DIR concern drugs (Matthews correlation coefficient (MCC) and recall rate of our model were 0.46 and 0.81, respectively). Our model was subsequently applied to predicting the potentially serious DIR for 1402 drugs, which were reported to cause DIR by the postmarketing DIR surveillance data in the FDA Spontaneous Adverse Events Reporting System (FAERS). As a result, 62.7% (94) of drugs ranked in the top 150 drugs with the Most-DIR concerns in FAERS can be identified by our model. The top four drugs (odds ratio >30) including acepromazine, rapacuronium, oxyphenbutazone, and naringenin were correctly predicted by our model. In conclusion, the RF model can well predict the Most-DIR concern drug only based on the chemical structure information and can be a facilitated tool for early DIR detection.
Collapse
Affiliation(s)
- Yifan Zhou
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Shihai Li
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Yiru Zhao
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610064, China
| | - Mingkun Guo
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Yuan Liu
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Menglong Li
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Zhining Wen
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China.,Medical Big Data Center, Sichuan University, Chengdu, Sichuan 610064, China
| |
Collapse
|
32
|
Smit IA, Afzal AM, Allen CHG, Svensson F, Hanser T, Bender A. Systematic Analysis of Protein Targets Associated with Adverse Events of Drugs from Clinical Trials and Postmarketing Reports. Chem Res Toxicol 2020; 34:365-384. [PMID: 33351593 DOI: 10.1021/acs.chemrestox.0c00294] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Adverse drug reactions (ADRs) are undesired effects of medicines that can harm patients and are a significant source of attrition in drug development. ADRs are anticipated by routinely screening drugs against secondary pharmacology protein panels. However, there is still a lack of quantitative information on the links between these off-target proteins and the reporting of ADRs in humans. Here, we present a systematic analysis of associations between measured and predicted in vitro bioactivities of drugs and adverse events (AEs) in humans from two sources of data: the Side Effect Resource, derived from clinical trials, and the Food and Drug Administration Adverse Event Reporting System, derived from postmarketing surveillance. The ratio of a drug's therapeutic unbound plasma concentration over the drug's in vitro potency against a given protein was used to select proteins most likely to be relevant to in vivo effects. In examining individual target bioactivities as predictors of AEs, we found a trade-off between the positive predictive value and the fraction of drugs with AEs that can be detected. However, considering sets of multiple targets for the same AE can help identify a greater fraction of AE-associated drugs. Of the 45 targets with statistically significant associations to AEs, 30 are included on existing safety target panels. The remaining 15 targets include 9 carbonic anhydrases, of which CA5B is significantly associated with cholestatic jaundice. We include the full quantitative data on associations between measured and predicted in vitro bioactivities and AEs in humans in this work, which can be used to make a more informed selection of safety profiling targets.
Collapse
Affiliation(s)
- Ines A Smit
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Avid M Afzal
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Chad H G Allen
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Fredrik Svensson
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Thierry Hanser
- Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds LS11 5PS, United Kingdom
| | - Andreas Bender
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
33
|
Giblin KA, Basili D, Afzal AM, Rosenbrier-Ribeiro L, Greene N, Barrett I, Hughes SJ, Bender A. New Associations between Drug-Induced Adverse Events in Animal Models and Humans Reveal Novel Candidate Safety Targets. Chem Res Toxicol 2020; 34:438-451. [PMID: 33338378 DOI: 10.1021/acs.chemrestox.0c00311] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
To improve our ability to extrapolate preclinical toxicity to humans, there is a need to understand and quantify the concordance of adverse events (AEs) between animal models and clinical studies. In the present work, we discovered 3011 statistically significant associations between preclinical and clinical AEs caused by drugs reported in the PharmaPendium database of which 2952 were new associations between toxicities encoded by different Medical Dictionary for Regulatory Activities terms across species. To find plausible and testable candidate off-target drug activities for the derived associations, we investigated the genetic overlap between the genes linked to both a preclinical and a clinical AE and the protein targets found to interact with one or more drugs causing both AEs. We discuss three associations from the analysis in more detail for which novel candidate off-target drug activities could be identified, namely, the association of preclinical mutagenicity readouts with clinical teratospermia and ovarian failure, the association of preclinical reflexes abnormal with clinical poor-quality sleep, and the association of preclinical psychomotor hyperactivity with clinical drug withdrawal syndrome. Our analysis successfully identified a total of 77% of known safety targets currently tested in in vitro screening panels plus an additional 431 genes which were proposed for investigation as future safety targets for different clinical toxicities. This work provides new translational toxicity relationships beyond AE term-matching, the results of which can be used for risk profiling of future new chemical entities for clinical studies and for the development of future in vitro safety panels.
Collapse
Affiliation(s)
- Kathryn A Giblin
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom.,Medicinal Chemistry, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Danilo Basili
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Avid M Afzal
- Data Sciences and Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Lyn Rosenbrier-Ribeiro
- Safety Platforms, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Nigel Greene
- Data Science and Artificial Intelligence, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Boston, Massachusetts 02451, United States
| | - Ian Barrett
- Data Sciences and Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Samantha J Hughes
- Medicinal Chemistry, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
34
|
Giangreco NP, Elias JE, Tatonetti NP. No population left behind: Improving paediatric drug safety using informatics and systems biology. Br J Clin Pharmacol 2020; 88:1464-1470. [PMID: 33332641 PMCID: PMC8209126 DOI: 10.1111/bcp.14705] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 10/26/2020] [Accepted: 12/05/2020] [Indexed: 12/12/2022] Open
Abstract
Adverse drugs effects (ADEs) in children are common and may result in disability and death. The current paediatric drug safety landscape, including clinical trials, is limited as it rarely includes children and relies on extrapolation from adults. Children are not small adults but go through an evolutionarily conserved and physiologically dynamic process of growth and maturation. Novel quantitative approaches, integrating observations from clinical trials and drug safety databases with dynamic mechanisms, can be used to systematically identify ADEs unique to childhood. In this perspective, we discuss three critical research directions using systems biology methodologies and novel informatics to improve paediatric drug safety, namely child versus adult drug safety profiles, age-dependent drug toxicities and genetic susceptibility of ADEs across childhood. We argue that a data-driven framework that leverages observational data, biomedical knowledge and systems biology modelling will reveal previously unknown mechanisms of pediatric adverse drug events and lead to improved paediatric drug safety.
Collapse
Affiliation(s)
- Nicholas P Giangreco
- Department of Biomedical Informatics and Systems Biology, Columbia University, New York, NY, USA
| | - Jonathan E Elias
- Department of Pediatrics, Instructor in Pediatrics, Assistant Medical Director of Information Services, Weill Cornell Medical & NYP Weill Cornell Medical Center, New York, NY, USA
| | - Nicholas P Tatonetti
- Department of Biomedical Informatics and Systems Biology, Columbia University, New York, NY, USA
| |
Collapse
|