1
|
Novel drug targets in 2023. Nat Rev Drug Discov 2024; 23:330. [PMID: 38565953 DOI: 10.1038/d41573-024-00057-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
|
2
|
Node-degree aware edge sampling mitigates inflated classification performance in biomedical random walk-based graph representation learning. BIOINFORMATICS ADVANCES 2024; 4:vbae036. [PMID: 38577542 PMCID: PMC10994718 DOI: 10.1093/bioadv/vbae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 01/31/2024] [Accepted: 02/29/2024] [Indexed: 04/06/2024]
Abstract
Motivation Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes. For this reason, classification procedures are forced to assume that the vast majority of unlabeled edges are negative. Existing approaches to sampling negative edges for training and evaluating classifiers do so by uniformly sampling pairs of nodes. Results We show here that this sampling strategy typically leads to sets of positive and negative examples with imbalanced node degree distributions. Using representative heterogeneous biomedical knowledge graph and random walk-based graph machine learning, we show that this strategy substantially impacts classification performance. If users of graph machine-learning models apply the models to prioritize examples that are drawn from approximately the same distribution as the positive examples are, then performance of models as estimated in the validation phase may be artificially inflated. We present a degree-aware node sampling approach that mitigates this effect and is simple to implement. Availability and implementation Our code and data are publicly available at https://github.com/monarch-initiative/negativeExampleSelection.
Collapse
|
3
|
Overview of the Knowledge Management Center for Illuminating the Druggable Genome. Drug Discov Today 2024; 29:103882. [PMID: 38218214 PMCID: PMC10939799 DOI: 10.1016/j.drudis.2024.103882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/22/2023] [Accepted: 01/09/2024] [Indexed: 01/15/2024]
Abstract
The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations. Based on this knowledge, proteins are classified into different Target Development Levels (TDLs) for identification of understudied targets. Additional work by the KMC focuses on enriching target knowledge and producing DrugCentral and other data visualization tools for expanding investigation of understudied targets.
Collapse
|
4
|
Artificial Intelligence for Drug Discovery: Are We There Yet? Annu Rev Pharmacol Toxicol 2024; 64:527-550. [PMID: 37738505 DOI: 10.1146/annurev-pharmtox-040323-040828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/24/2023]
Abstract
Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes. This review discusses the use of AI in the three pillars of drug discovery: diseases, targets, and therapeutic modalities, with a focus on small-molecule drugs. AI technologies, such as generative chemistry, machine learning, and multiproperty optimization, have enabled several compounds to enter clinical trials. The scientific community must carefully vet known information to address the reproducibility crisis. The full potential of AI in drug discovery can only be realized with sufficient ground truth and appropriate human intervention at later pipeline stages.
Collapse
|
5
|
Correction: Exploring DrugCentral: from molecular structures to clinical effects. J Comput Aided Mol Des 2023; 38:2. [PMID: 38040935 PMCID: PMC10692018 DOI: 10.1007/s10822-023-00545-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2023]
|
6
|
Exploring DrugCentral: from molecular structures to clinical effects. J Comput Aided Mol Des 2023; 37:681-694. [PMID: 37707619 PMCID: PMC10692006 DOI: 10.1007/s10822-023-00529-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/14/2023] [Indexed: 09/15/2023]
Abstract
DrugCentral, accessible at https://drugcentral.org , is an open-access online drug information repository. It covers over 4950 drugs, incorporating structural, physicochemical, and pharmacological details to support drug discovery, development, and repositioning. With around 20,000 bioactivity data points, manual curation enhances information from several major digital sources. Approximately 724 mechanism-of-action (MoA) targets offer updated drug target insights. The platform captures clinical data: over 14,300 on- and off-label uses, 27,000 contraindications, and around 340,000 adverse drug events from pharmacovigilance reports. DrugCentral encompasses information from molecular structures to marketed formulations, providing a comprehensive pharmaceutical reference. Users can easily navigate basic drug information and key features, making DrugCentral a versatile, unique resource. Furthermore, we present a use-case example where we utilize experimentally determined data from DrugCentral to support drug repurposing. A minimum activity threshold t should be considered against novel targets to repurpose a drug. Analyzing 1156 bioactivities for human MoA targets suggests a general threshold of 1 µM: t = 6 when expressed as - log[Activity(M)]). This applies to 87% of the drugs. Moreover, t can be refined empirically based on water solubility (S): t = 3 - logS, for logS < - 3. Alongside the drug repurposing classification scheme, which considers intellectual property rights, market exclusivity protections, and market accessibility, DrugCentral provides valuable data to prioritize candidates for drug repurposing programs efficiently.
Collapse
|
7
|
Abstract
Molecular complexity (MC) lacks a universal definition, but various studies address it in contexts ranging from ligand-receptor interactions to DNA sequencing, with the overarching emphasis being its significance in synthetic organic chemistry and pharmaceutical research. Efforts to quantify MC in drug discovery have been numerous, but a unified approach remains challenging. Strategies based on graph theory, information theory, and substructural feature counts employed to gauge MC are often correlated to molecular weight (MW). Herbert Waldmann and his team introduced a new MC metric called the spacial score (SPS), which is based on factors like atom hybridization and stereoisomeric considerations. While SPS and its normalized version, nSPS, correlate with the natural product likeness score, they do not align with traditional chemical properties. We examined nSPS trends for approved drugs and found no significant changes in MC over eight decades, nor did nSPS capture drug innovation during that period. Furthermore, our analysis indicates that while the majority of approved drugs have an nSPS value between 10 and 20, this metric does not correlate with key drug properties like target bioactivity and oral bioavailability. Mirroring a chemist's intuitive sense of chemical complexity, nSPS addresses the need for a precise empirical tool while a universal definition of MC remains elusive.
Collapse
|
8
|
Toxicology knowledge graph for structural birth defects. COMMUNICATIONS MEDICINE 2023; 3:98. [PMID: 37460679 DOI: 10.1038/s43856-023-00329-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 06/29/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Birth defects are functional and structural abnormalities that impact about 1 in 33 births in the United States. They have been attributed to genetic and other factors such as drugs, cosmetics, food, and environmental pollutants during pregnancy, but for most birth defects there are no known causes. METHODS To further characterize associations between small molecule compounds and their potential to induce specific birth abnormalities, we gathered knowledge from multiple sources to construct a reproductive toxicity Knowledge Graph (ReproTox-KG) with a focus on associations between birth defects, drugs, and genes. Specifically, we gathered data from drug/birth-defect associations from co-mentions in published abstracts, gene/birth-defect associations from genetic studies, drug- and preclinical-compound-induced gene expression changes in cell lines, known drug targets, genetic burden scores for human genes, and placental crossing scores for small molecules. RESULTS Using ReproTox-KG and semi-supervised learning (SSL), we scored >30,000 preclinical small molecules for their potential to cross the placenta and induce birth defects, and identified >500 birth-defect/gene/drug cliques that can be used to explain molecular mechanisms for drug-induced birth defects. The ReproTox-KG can be accessed via a web-based user interface available at https://maayanlab.cloud/reprotox-kg . This site enables users to explore the associations between birth defects, approved and preclinical drugs, and all human genes. CONCLUSIONS ReproTox-KG provides a resource for exploring knowledge about the molecular mechanisms of birth defects with the potential of predicting the likelihood of genes and preclinical small molecules to induce birth defects.
Collapse
|
9
|
Novel drug targets in 2022. Nat Rev Drug Discov 2023:10.1038/d41573-023-00068-y. [PMID: 37138052 DOI: 10.1038/d41573-023-00068-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
|
10
|
Abstract
In the wake of recent COVID-19 pandemics scientists around the world rushed to deliver numerous CADD (Computer-Aided Drug Discovery) methods and tools that could be reliably used to discover novel drug candidates against the SARS-CoV-2 virus. With that, there emerged a trend of a significant democratization of CADD that contributed to the rapid development of various COVID-19 drug candidates currently undergoing different stages of validation. On the other hand, this democratization also inadvertently led to the surge rapidly performed molecular docking studies to nominate multiple scores of novel drug candidates supported by computational arguments only. Albeit driven by best intentions, most of such studies also did not follow best practices in the field that require experience and expertise learned through multiple rigorously designed benchmarking studies and rigorous experimental validation. In this Viewpoint we reflect on recent disbalance between small number of rigorous and comprehensive studies and the proliferation of purely computational studies enabled by the ease of docking software availability. We further elaborate on the hyped oversale of CADD methods' ability to rapidly yield viable drug candidates and reiterate the critical importance of rigor and adherence to the best practices of CADD in view of recent emergence of AI and Big Data in the field.
Collapse
|
11
|
Illuminating the druggable genome through patent bioactivity data. PeerJ 2023; 11:e15153. [PMID: 37151295 PMCID: PMC10162037 DOI: 10.7717/peerj.15153] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 03/10/2023] [Indexed: 05/09/2023] Open
Abstract
The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.surechembl.org/), according to how likely they were to contain bioactivity data for potent small molecules on less-studied targets, based on the classification developed by the Illuminating the Druggable Genome (IDG) project. The overall goal was to select a smaller number of patents that could be manually curated and incorporated into the ChEMBL database. Using relatively simple annotation and filtering pipelines, we have been able to identify a substantial number of patents containing quantitative bioactivity data for understudied targets that had not previously been reported in the peer-reviewed medicinal chemistry literature. We quantify the added value of such methods in terms of the numbers of targets that are so identified, and provide some specific illustrative examples. Our work underlines the potential value in searching the patent corpus in addition to the more traditional peer-reviewed literature. The small molecules found in these patents, together with their measured activity against the targets, are now accessible via the ChEMBL database.
Collapse
|
12
|
Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform 2022; 14:82. [PMID: 36461094 PMCID: PMC9716667 DOI: 10.1186/s13321-022-00661-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open
Abstract
We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries. During the meeting, applications, challenges, and opportunities in drug discovery, de novo drug design, ADME-Tox (absorption, distribution, metabolism, excretion and toxicity) property predictions, organic chemistry, peptides, and antibiotic resistance were discussed. The program along with the recordings of all sessions are freely available at https://www.difacquim.com/english/events/2022-colloquium/ .
Collapse
|
13
|
Interpretable deep learning translation of GWAS and multi-omics findings to identify pathobiology and drug repurposing in Alzheimer's disease. Cell Rep 2022; 41:111717. [PMID: 36450252 PMCID: PMC9837836 DOI: 10.1016/j.celrep.2022.111717] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 09/01/2022] [Accepted: 11/02/2022] [Indexed: 12/03/2022] Open
Abstract
Translating human genetic findings (genome-wide association studies [GWAS]) to pathobiology and therapeutic discovery remains a major challenge for Alzheimer's disease (AD). We present a network topology-based deep learning framework to identify disease-associated genes (NETTAG). We leverage non-coding GWAS loci effects on quantitative trait loci, enhancers and CpG islands, promoter regions, open chromatin, and promoter flanking regions under the protein-protein interactome. Via NETTAG, we identified 156 AD-risk genes enriched in druggable targets. Combining network-based prediction and retrospective case-control observations with 10 million individuals, we identified that usage of four drugs (ibuprofen, gemfibrozil, cholecalciferol, and ceftriaxone) is associated with reduced likelihood of AD incidence. Gemfibrozil (an approved lipid regulator) is significantly associated with 43% reduced risk of AD compared with simvastatin using an active-comparator design (95% confidence interval 0.51-0.63, p < 0.0001). In summary, NETTAG offers a deep learning methodology that utilizes GWAS and multi-genomic findings to identify pathobiology and drug repurposing in AD.
Collapse
|
14
|
Agent-based modeling predicts RAC1 is critical for ovarian cancer metastasis. Mol Biol Cell 2022; 33:ar138. [PMID: 36200848 PMCID: PMC9727804 DOI: 10.1091/mbc.e21-11-0540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Experimental and computational studies pinpoint rate-limiting step(s) in metastasis governed by Rac1. Using ovarian cancer cell and animal models, Rac1 expression was manipulated, and quantitative measurements of cell-cell and cell-substrate adhesion, cell invasion, mesothelial clearance, and peritoneal tumor growth discriminated the tumor behaviors most highly influenced by Rac1. The experimental data were used to parameterize an agent-based computational model simulating peritoneal niche colonization, intravasation, and hematogenous metastasis to distant organs. Increased ovarian cancer cell survival afforded by the more rapid adhesion and intravasation upon Rac1 overexpression is predicted to increase the numbers of and the rates at which tumor cells are disseminated to distant sites. Surprisingly, crowding of cancer cells along the blood vessel was found to decrease the numbers of cells reaching a distant niche irrespective of Rac1 overexpression or knockdown, suggesting that sites for tumor cell intravasation are rate limiting and become accessible if cells intravasate rapidly or are displaced due to diminished viability. Modeling predictions were confirmed through animal studies of Rac1-dependent metastasis to the lung. Collectively, the experimental and modeling approaches identify cell adhesion, rapid intravasation, and survival in the blood as parameters in the ovarian metastatic cascade that are most critically dependent on Rac1.
Collapse
|
15
|
AlphaFold illuminates half of the dark human proteins. Curr Opin Struct Biol 2022; 74:102372. [PMID: 35439658 PMCID: PMC10669925 DOI: 10.1016/j.sbi.2022.102372] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/02/2022] [Accepted: 03/13/2022] [Indexed: 01/05/2023]
Abstract
We investigate the use of confidence scores to evaluate the accuracy of a given AlphaFold (AF2) protein model for drug discovery. Prediction of accuracy is improved by not considering confidence scores below 80 due to the effects of disorder. On a set of recent crystal structures, 95% are likely to have accurate folds. Conformational discordance in the training set has a much more significant effect on accuracy than sequence divergence. We propose criteria for models and residues that are possibly useful for virtual screening. Based on these criteria, AF2 provides models for half of understudied (dark) human proteins and two-thirds of residues in those models.
Collapse
|
16
|
Novel drug targets in 2021. Nat Rev Drug Discov 2022; 21:328. [PMID: 35361900 DOI: 10.1038/d41573-022-00057-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
17
|
Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration. Database (Oxford) 2022; 2022:6554833. [PMID: 35348648 PMCID: PMC9216524 DOI: 10.1093/database/baac019] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 02/14/2022] [Accepted: 03/11/2022] [Indexed: 12/04/2022]
Abstract
The scientific knowledge about which genes are involved in which diseases grows rapidly, which makes it difficult to keep up with new publications and genetics datasets. The DISEASES database aims to provide a comprehensive overview by systematically integrating and assigning confidence scores to evidence for disease–gene associations from curated databases, genome-wide association studies (GWAS) and automatic text mining of the biomedical literature. Here, we present a major update to this resource, which greatly increases the number of associations from all these sources. This is especially true for the text-mined associations, which have increased by at least 9-fold at all confidence cutoffs. We show that this dramatic increase is primarily due to adding full-text articles to the text corpus, secondarily due to improvements to both the disease and gene dictionaries used for named entity recognition, and only to a very small extent due to the growth in number of PubMed abstracts. DISEASES now also makes use of a new GWAS database, Target Illumination by GWAS Analytics, which considerably increased the number of GWAS-derived disease–gene associations. DISEASES itself is also integrated into several other databases and resources, including GeneCards/MalaCards, Pharos/Target Central Resource Database and the Cytoscape stringApp. All data in DISEASES are updated on a weekly basis and is available via a web interface at https://diseases.jensenlab.org, from where it can also be downloaded under open licenses. Database URL: https://diseases.jensenlab.org
Collapse
|
18
|
State of the Art and Uses for the Biopharmaceutics Drug Disposition Classification System (BDDCS): New Additions, Revisions, and Citation References. AAPS J 2022; 24:37. [PMID: 35199251 PMCID: PMC8865883 DOI: 10.1208/s12248-022-00687-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 01/24/2022] [Indexed: 12/16/2022] Open
Abstract
The Biopharmaceutics Drug Disposition Classification system (BDDCS) is a four-class approach based on water solubility and extent of metabolism/permeability rate. Based on the BDDCS class to which a drug is assigned, it is possible to predict the role of metabolic enzymes and transporters on the drug disposition of a new molecular entity (NME) prior to its administration to animals or humans. Here, we report a total of 1475 drugs and active metabolites to which the BDDCS is applied. Of these, 379 are new entries, and 1096 are revisions of former classification studies with the addition of references for the approved maximum dose strength, extent of the systemically available drug excreted unchanged in the urine, and lowest solubility over the pH range 1.0–6.8 when such information is available in the literature. We detail revised class assignments of previously misclassified drugs and the literature analyses to classify new drugs. We review the process of solubility assessment for NMEs prior to drug dosing in humans and approved dose classification, as well as the comparison of Biopharmaceutics Classification System (BCS) versus BDDCS assignment. We detail the uses of BDDCS in predicting, prior to dosing animals or humans, disposition characteristics, potential brain penetration, food effect, and drug-induced liver injury (DILI) potential. This work provides an update on the current status of the BDDCS and its uses in the drug development process.
Collapse
|
19
|
A Comprehensive COVID-19 Daily News and Medical Literature Briefing to Inform Health Care and Policy in New Mexico: Implementation Study. JMIR MEDICAL EDUCATION 2022; 8:e23845. [PMID: 35142625 PMCID: PMC8908195 DOI: 10.2196/23845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 04/29/2021] [Accepted: 02/09/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND On March 11, 2020, the New Mexico Governor declared a public health emergency in response to the COVID-19 pandemic. The New Mexico medical advisory team contacted University of New Mexico (UNM) faculty to form a team to consolidate growing information on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its disease to facilitate New Mexico's pandemic management. Thus, faculty, physicians, staff, graduate students, and medical students created the "UNM Global Health COVID-19 Intelligence Briefing." OBJECTIVE In this paper, we sought to (1) share how to create an informative briefing to guide public policy and medical practice and manage information overload with rapidly evolving scientific evidence; (2) determine the qualitative usefulness of the briefing to its readers; and (3) determine the qualitative effect this project has had on virtual medical education. METHODS Microsoft Teams was used for manual and automated capture of COVID-19 articles and composition of briefings. Multilevel triaging saved impactful articles to be reviewed, and priority was placed on randomized controlled studies, meta-analyses, systematic reviews, practice guidelines, and information on health care and policy response to COVID-19. The finalized briefing was disseminated by email, a listserv, and posted on the UNM digital repository. A survey was sent to readers to determine briefing usefulness and whether it led to policy or medical practice changes. Medical students, unable to partake in direct patient care, proposed to the School of Medicine that involvement in the briefing should count as course credit, which was approved. The maintenance of medical student involvement in the briefings as well as this publication was led by medical students. RESULTS An average of 456 articles were assessed daily. The briefings reached approximately 1000 people by email and listserv directly, with an unknown amount of forwarding. Digital repository tracking showed 5047 downloads across 116 countries as of July 5, 2020. The survey found 108 (95%) of 114 participants gained relevant knowledge, 90 (79%) believed it decreased misinformation, 27 (24%) used the briefing as their primary source of information, and 90 (79%) forwarded it to colleagues. Specific and impactful public policy decisions were informed based on the briefing. Medical students reported that the project allowed them to improve on their scientific literature assessment, stay current on the pandemic, and serve their community. CONCLUSIONS The COVID-19 briefings succeeded in informing and guiding New Mexico policy and clinical practice. The project received positive feedback from the community and was shown to decrease information burden and misinformation. The virtual platforms allowed for the continuation of medical education. Variability in subject matter expertise was addressed with training, standardized article selection criteria, and collaborative editing led by faculty.
Collapse
|
20
|
Abstract
In the event of an outbreak due to an emerging pathogen, time is of the essence to contain or to mitigate the spread of the disease. Drug repositioning is one of the strategies that has the potential to deliver therapeutics relatively quickly. The SARS-CoV-2 pandemic has shown that integrating critical data resources to drive drug-repositioning studies, involving host-host, host-pathogen, and drug-target interactions, remains a time-consuming effort that translates to a delay in the development and delivery of a life-saving therapy. Here, we describe a workflow we designed for a semiautomated integration of rapidly emerging data sets that can be generally adopted in a broad network pharmacology research setting. The workflow was used to construct a COVID-19 focused multimodal network that integrates 487 host-pathogen, 63 278 host-host protein, and 1221 drug-target interactions. The resultant Neo4j graph database named "Neo4COVID19" is made publicly accessible via a web interface and via API calls based on the Bolt protocol. Details for accessing the database are provided on a landing page (https://neo4covid19.ncats.io/). We believe that our Neo4COVID19 database will be a valuable asset to the research community and will catalyze the discovery of therapeutics to fight COVID-19.
Collapse
|
21
|
Abstract
The Illuminating the Druggable Genome (IDG) consortium is a National Institutes of Health (NIH) Common Fund program designed to enhance our knowledge of under-studied proteins, more specifically, proteins unannotated within the three most commonly drug-targeted protein families: G-protein coupled receptors, ion channels, and protein kinases. Since 2014, the IDG Knowledge Management Center (IDG-KMC) has generated several open-access datasets and resources that jointly serve as a highly translational machine-learning-ready knowledgebase focused on human protein-coding genes and their products. The goal of the IDG-KMC is to develop comprehensive integrated knowledge for the druggable genome to illuminate the uncharacterized or poorly annotated portion of the druggable genome. The tools derived from the IDG-KMC provide either user-friendly visualizations or ways to impute the knowledge about potential targets using machine learning strategies. In the following protocols, we describe how to use each web-based tool to accelerate illumination in under-studied proteins. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Interacting with the Pharos user interface Basic Protocol 2: Accessing the data in Harmonizome Basic Protocol 3: The ARCHS4 resource Basic Protocol 4: Making predictions about gene function with PrismExp Basic Protocol 5: Using Geneshot to illuminate knowledge about under-studied targets Basic Protocol 6: Exploring under-studied targets with TIN-X Basic Protocol 7: Interacting with the DrugCentral user interface Basic Protocol 8: Estimating Anti-SARS-CoV-2 activities with DrugCentral REDIAL-2020 Basic Protocol 9: Drug Set Enrichment Analysis using Drugmonizome Basic Protocol 10: The Drugmonizome-ML Appyter Basic Protocol 11: The Harmonizome-ML Appyter Basic Protocol 12: GWAS target illumination with TIGA Basic Protocol 13: Prioritizing kinases for lists of proteins and phosphoproteins with KEA3 Basic Protocol 14: Converting PubMed searches to drug sets with the DrugShot Appyter.
Collapse
|
22
|
Supervised learning with word embeddings derived from PubMed captures latent knowledge about protein kinases and cancer. NAR Genom Bioinform 2021; 3:lqab113. [PMID: 34888523 PMCID: PMC8652379 DOI: 10.1093/nargab/lqab113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 10/14/2021] [Accepted: 11/24/2021] [Indexed: 11/17/2022] Open
Abstract
Inhibiting protein kinases (PKs) that cause cancers has been an important topic in cancer therapy for years. So far, almost 8% of >530 PKs have been targeted by FDA-approved medications, and around 150 protein kinase inhibitors (PKIs) have been tested in clinical trials. We present an approach based on natural language processing and machine learning to investigate the relations between PKs and cancers, predicting PKs whose inhibition would be efficacious to treat a certain cancer. Our approach represents PKs and cancers as semantically meaningful 100-dimensional vectors based on word and concept neighborhoods in PubMed abstracts. We use information about phase I-IV trials in ClinicalTrials.gov to construct a training set for random forest classification. Our results with historical data show that associations between PKs and specific cancers can be predicted years in advance with good accuracy. Our tool can be used to predict the relevance of inhibiting PKs for specific cancers and to support the design of well-focused clinical trials to discover novel PKIs for cancer therapy.
Collapse
|
23
|
TIGA: target illumination GWAS analytics. Bioinformatics 2021; 37:3865-3873. [PMID: 34086846 PMCID: PMC11025677 DOI: 10.1093/bioinformatics/btab427] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 05/12/2021] [Accepted: 06/03/2021] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION Genome-wide association studies can reveal important genotype-phenotype associations; however, data quality and interpretability issues must be addressed. For drug discovery scientists seeking to prioritize targets based on the available evidence, these issues go beyond the single study. RESULTS Here, we describe rational ranking, filtering and interpretation of inferred gene-trait associations and data aggregation across studies by leveraging existing curation and harmonization efforts. Each gene-trait association is evaluated for confidence, with scores derived solely from aggregated statistics, linking a protein-coding gene and phenotype. We propose a method for assessing confidence in gene-trait associations from evidence aggregated across studies, including a bibliometric assessment of scientific consensus based on the iCite relative citation ratio, and meanRank scores, to aggregate multivariate evidence.This method, intended for drug target hypothesis generation, scoring and ranking, has been implemented as an analytical pipeline, available as open source, with public datasets of results, and a web application designed for usability by drug discovery scientists. AVAILABILITY AND IMPLEMENTATION Web application, datasets and source code via https://unmtid-shinyapps.net/tiga/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
24
|
A critical overview of computational approaches employed for COVID-19 drug discovery. Chem Soc Rev 2021; 50:9121-9151. [PMID: 34212944 PMCID: PMC8371861 DOI: 10.1039/d0cs01065k] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Indexed: 01/18/2023]
Abstract
COVID-19 has resulted in huge numbers of infections and deaths worldwide and brought the most severe disruptions to societies and economies since the Great Depression. Massive experimental and computational research effort to understand and characterize the disease and rapidly develop diagnostics, vaccines, and drugs has emerged in response to this devastating pandemic and more than 130 000 COVID-19-related research papers have been published in peer-reviewed journals or deposited in preprint servers. Much of the research effort has focused on the discovery of novel drug candidates or repurposing of existing drugs against COVID-19, and many such projects have been either exclusively computational or computer-aided experimental studies. Herein, we provide an expert overview of the key computational methods and their applications for the discovery of COVID-19 small-molecule therapeutics that have been reported in the research literature. We further outline that, after the first year the COVID-19 pandemic, it appears that drug repurposing has not produced rapid and global solutions. However, several known drugs have been used in the clinic to cure COVID-19 patients, and a few repurposed drugs continue to be considered in clinical trials, along with several novel clinical candidates. We posit that truly impactful computational tools must deliver actionable, experimentally testable hypotheses enabling the discovery of novel drugs and drug combinations, and that open science and rapid sharing of research results are critical to accelerate the development of novel, much needed therapeutics for COVID-19.
Collapse
|
25
|
Abstract
Pharos is an integrated web-based informatics platform for the analysis of data aggregated by the Illuminating the Druggable Genome (IDG) Knowledge Management Center, an NIH Common Fund initiative. The current version of Pharos (as of October 2019) spans 20,244 proteins in the human proteome, 19,880 disease and phenotype associations, and 226,829 ChEMBL compounds. This resource not only collates and analyzes data from over 60 high-quality resources to generate these types, but also uses text indexing to find less apparent connections between targets, and has recently begun to collaborate with institutions that generate data and resources. Proteins are ranked according to a knowledge-based classification system, which can help researchers to identify less studied "dark" targets that could be potentially further illuminated. This is an important process for both drug discovery and target validation, as more knowledge can accelerate target identification, and previously understudied proteins can serve as novel targets in drug discovery. Two basic protocols illustrate the levels of detail available for targets and several methods of finding targets of interest. An Alternate Protocol illustrates the difference in available knowledge between less and more studied targets. © 2020 by John Wiley & Sons, Inc. Basic Protocol 1: Search for a target and view details Alternate Protocol: Search for dark target and view details Basic Protocol 2: Filter a target list to get refined results.
Collapse
|
26
|
COVIDomic: A multi-modal cloud-based platform for identification of risk factors associated with COVID-19 severity. PLoS Comput Biol 2021; 17:e1009183. [PMID: 34260589 PMCID: PMC8312936 DOI: 10.1371/journal.pcbi.1009183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 07/26/2021] [Accepted: 06/14/2021] [Indexed: 01/08/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19) is an acute infection of the respiratory tract that emerged in December 2019 in Wuhan, China. It was quickly established that both the symptoms and the disease severity may vary from one case to another and several strains of SARS-CoV-2 have been identified. To gain a better understanding of the wide variety of SARS-CoV-2 strains and their associated symptoms, thousands of SARS-CoV-2 genomes have been sequenced in dozens of countries. In this article, we introduce COVIDomic, a multi-omics online platform designed to facilitate the analysis and interpretation of the large amount of health data collected from patients with COVID-19. The COVIDomic platform provides a comprehensive set of bioinformatic tools for the multi-modal metatranscriptomic data analysis of COVID-19 patients to determine the origin of the coronavirus strain and the expected severity of the disease. An integrative analytical workflow, which includes microbial pathogens community analysis, COVID-19 genetic epidemiology and patient stratification, allows to analyze the presence of the most common microbial organisms, their antibiotic resistance, the severity of the infection and the set of the most probable geographical locations from which the studied strain could have originated. The online platform integrates a user friendly interface which allows easy visualization of the results. We envision this tool will not only have immediate implications for management of the ongoing COVID-19 pandemic, but will also improve our readiness to respond to other infectious outbreaks.
Collapse
|
27
|
Abstract
Despite decades of intensive search for compounds that modulate the activity of particular protein targets, a large proportion of the human kinome remains as yet undrugged. Effective approaches are therefore required to map the massive space of unexplored compound-kinase interactions for novel and potent activities. Here, we carry out a crowdsourced benchmarking of predictive algorithms for kinase inhibitor potencies across multiple kinase families tested on unpublished bioactivity data. We find the top-performing predictions are based on various models, including kernel learning, gradient boosting and deep learning, and their ensemble leads to a predictive accuracy exceeding that of single-dose kinase activity assays. We design experiments based on the model predictions and identify unexpected activities even for under-studied kinases, thereby accelerating experimental mapping efforts. The open-source prediction algorithms together with the bioactivities between 95 compounds and 295 kinases provide a resource for benchmarking prediction algorithms and for extending the druggable kinome.
Collapse
|
28
|
|
29
|
|
30
|
InContext: curation of medical context for drug indications. J Biomed Semantics 2021; 12:2. [PMID: 33579375 PMCID: PMC7881657 DOI: 10.1186/s13326-021-00234-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2018] [Accepted: 01/21/2021] [Indexed: 11/10/2022] Open
Abstract
Accurate and precise information about the therapeutic uses (indications) of a drug is essential for applications in drug repurposing and precision medicine. Leading online drug resources such as DrugCentral and DrugBank provide rich information about various properties of drugs, including their indications. However, because indications in such databases are often partly automatically mined, some may prove to be inaccurate or imprecise. Particularly challenging for text mining methods is the task of distinguishing between general disease mentions in drug product labels and actual indications for the drug. For this, the qualifying medical context of the disease mentions in the text should be studied. Some examples include contraindications, co-prescribed drugs and target patient qualifications. No existing indication curation efforts attempt to capture such information in a precise way. Here we fill this gap by presenting a novel curation protocol for extracting indications and machine processable annotations of contextual information about the therapeutic use of a drug. We implemented the protocol on a reference set of FDA-approved drug product labels on the DailyMed website to curate indications for 150 anti-cancer and cardiovascular drugs. The resulting corpus - InContext - focuses on anti-cancer and cardiovascular drugs because of the heightened societal interest in cancer and heart disease. In order to understand how InContext relates with existing reputable drug indication databases, we analysed it’s overlap with a state-of-the-art indications database - LabeledIn - as well as a reputable online drug compendium - DrugCentral. We found that 40% of indications sampled from DrugCentral (and 23% from LabeledIn) respectively, could not be accounted for in InContext. This raises questions about the veracity of indications not appearing in InContext. The additional contextual information curated by InContext about disease mentions in drug SPLs provides a foundation for more precise, structured and formal representations of knowledge related to drug therapeutic use, in order to increase accuracy and agreement of drug indication extraction methods for in silico drug repurposing.
Collapse
|
31
|
DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Res 2021; 49:D1160-D1169. [PMID: 33151287 PMCID: PMC7779058 DOI: 10.1093/nar/gkaa997] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/18/2022] Open
Abstract
DrugCentral is a public resource (http://drugcentral.org) that serves the scientific community by providing up-to-date drug information, as described in previous papers. The current release includes 109 newly approved (October 2018 through March 2020) active pharmaceutical ingredients in the US, Europe, Japan and other countries; and two molecular entities (e.g. mefuparib) of interest for COVID19. New additions include a set of pharmacokinetic properties for ∼1000 drugs, and a sex-based separation of side effects, processed from FAERS (FDA Adverse Event Reporting System); as well as a drug repositioning prioritization scheme based on the market availability and intellectual property rights forFDA approved drugs. In the context of the COVID19 pandemic, we also incorporated REDIAL-2020, a machine learning platform that estimates anti-SARS-CoV-2 activities, as well as the 'drugs in news' feature offers a brief enumeration of the most interesting drugs at the present moment. The full database dump and data files are available for download from the DrugCentral web portal.
Collapse
|
32
|
TCRD and Pharos 2021: mining the human proteome for disease biology. Nucleic Acids Res 2021; 49:D1334-D1346. [PMID: 33156327 PMCID: PMC7778974 DOI: 10.1093/nar/gkaa993] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/13/2022] Open
Abstract
In 2014, the National Institutes of Health (NIH) initiated the Illuminating the Druggable Genome (IDG) program to identify and improve our understanding of poorly characterized proteins that can potentially be modulated using small molecules or biologics. Two resources produced from these efforts are: The Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/) and Pharos (https://pharos.nih.gov/), a web interface to browse the TCRD. The ultimate goal of these resources is to highlight and facilitate research into currently understudied proteins, by aggregating a multitude of data sources, and ranking targets based on the amount of data available, and presenting data in machine learning ready format. Since the 2017 release, both TCRD and Pharos have produced two major releases, which have incorporated or expanded an additional 25 data sources. Recently incorporated data types include human and viral-human protein-protein interactions, protein-disease and protein-phenotype associations, and drug-induced gene signatures, among others. These aggregated data have enabled us to generate new visualizations and content sections in Pharos, in order to empower users to find new areas of study in the druggable genome.
Collapse
|
33
|
Virtual and In Vitro Antiviral Screening Revive Therapeutic Drugs for COVID-19. ACS Pharmacol Transl Sci 2020; 3:1278-1292. [PMID: 33330842 PMCID: PMC7571299 DOI: 10.1021/acsptsci.0c00131] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Indexed: 02/08/2023]
Abstract
The urgent need for a cure for early phase COVID-19 infected patients critically underlines drug repositioning strategies able to efficiently identify new and reliable treatments by merging computational, experimental, and pharmacokinetic expertise. Here we report new potential therapeutics for COVID-19 identified with a combined virtual and experimental screening strategy and selected among already approved drugs. We used hydroxychloroquine (HCQ), one of the most studied drugs in current clinical trials, as a reference template to screen for structural similarity against a library of almost 4000 approved drugs. The top-ranked drugs, based on structural similarity to HCQ, were selected for in vitro antiviral assessment. Among the selected drugs, both zuclopenthixol and nebivolol efficiently block SARS-CoV-2 infection with EC50 values in the low micromolar range, as confirmed by independent experiments. The anti-SARS-CoV-2 potential of ambroxol, amodiaquine, and its active metabolite (N-monodesethyl amodiaquine) is also discussed. In trying to understand the "hydroxychloroquine" mechanism of action, both pK a and the HCQ aromatic core may play a role. Further, we show that the amodiaquine metabolite and, to a lesser extent, zuclopenthixol and nebivolol are active in a SARS-CoV-2 titer reduction assay. Given the need for improved efficacy and safety, we propose zuclopenthixol, nebivolol, and amodiaquine as potential candidates for clinical trials against the early phase of the SARS-CoV-2 infection and discuss their potential use as adjuvant to the current (i.e., remdesivir and favipiravir) COVID-19 therapeutics.
Collapse
|
34
|
Abstract
With an estimated prevalence of 463 million affected, type 2 diabetes represents a major challenge to health care systems worldwide. Analyzing the plasma proteomes of individuals with type 2 diabetes may illuminate hitherto unknown functional mechanisms underlying disease pathology. We assessed the associations between type 2 diabetes and >1,000 plasma proteins in the Cooperative Health Research in the Region of Augsburg (KORA) F4 cohort (n = 993, 110 cases), with subsequent replication in the third wave of the Nord-Trøndelag Health Study (HUNT3) cohort (n = 940, 149 cases). We computed logistic regression models adjusted for age, sex, BMI, smoking status, and hypertension. Additionally, we investigated associations with incident type 2 diabetes and performed two-sample bidirectional Mendelian randomization (MR) analysis to prioritize our results. Association analysis of prevalent type 2 diabetes revealed 24 replicated proteins, of which 8 are novel. Proteins showing association with incident type 2 diabetes were aminoacylase-1, growth hormone receptor, and insulin-like growth factor-binding protein 2. Aminoacylase-1 was associated with both prevalent and incident type 2 diabetes. MR analysis yielded nominally significant causal effects of type 2 diabetes on cathepsin Z and rennin, both known to have roles in the pathophysiological pathways of cardiovascular disease, and of sex hormone-binding globulin on type 2 diabetes. In conclusion, our high-throughput proteomics study replicated previously reported type 2 diabetes-protein associations and identified new candidate proteins possibly involved in the pathogenesis of type 2 diabetes.
Collapse
|
35
|
A Workflow of Integrated Resources to Catalyze Network Pharmacology Driven COVID-19 Research. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.11.04.369041. [PMID: 33173863 PMCID: PMC7654851 DOI: 10.1101/2020.11.04.369041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
MOTIVATION In the event of an outbreak due to an emerging pathogen, time is of the essence to contain or to mitigate the spread of the disease. Drug repositioning is one of the strategies that has the potential to deliver therapeutics relatively quickly. The SARS-CoV-2 pandemic has shown that integrating critical data resources to drive drug-repositioning studies, involving host-host, hostpathogen and drug-target interactions, remains a time-consuming effort that translates to a delay in the development and delivery of a life-saving therapy. RESULTS Here, we describe a workflow we designed for a semi-automated integration of rapidly emerging datasets that can be generally adopted in a broad network pharmacology research setting. The workflow was used to construct a COVID-19 focused multimodal network that integrates 487 host-pathogen, 74,805 host-host protein and 1,265 drug-target interactions. The resultant Neo4j graph database named "Neo4COVID19" is accessible via a web interface and via API calls based on the Bolt protocol. We believe that our Neo4COVID19 database will be a valuable asset to the research community and will catalyze the discovery of therapeutics to fight COVID-19. AVAILABILITY https://neo4covid19.ncats.io.
Collapse
|
36
|
REDIAL-2020: A suite of machine learning models to estimate Anti-SARS-CoV-2 activities. CHEMRXIV : THE PREPRINT SERVER FOR CHEMISTRY 2020:12915779. [PMID: 33200119 PMCID: PMC7668752 DOI: 10.26434/chemrxiv.12915779] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Revised: 09/16/2020] [Indexed: 11/09/2022]
Abstract
Strategies for drug discovery and repositioning are an urgent need with respect to COVID-19. We developed "REDIAL-2020", a suite of machine learning models for estimating small molecule activity from molecular structure, for a range of SARS-CoV-2 related assays. Each classifier is based on three distinct types of descriptors (fingerprint, physicochemical, and pharmacophore) for parallel model development. These models were trained using high throughput screening data from the NCATS COVID19 portal (https://opendata.ncats.nih.gov/covid19/index.html), with multiple categorical machine learning algorithms. The "best models" are combined in an ensemble consensus predictor that outperforms single models where external validation is available. This suite of machine learning models is available through the DrugCentral web portal (http://drugcentral.org/Redial). Acceptable input formats are: drug name, PubChem CID, or SMILES; the output is an estimate of anti-SARS-CoV-2 activities. The web application reports estimated activity across three areas (viral entry, viral replication, and live virus infectivity) spanning six independent models, followed by a similarity search that displays the most similar molecules to the query among experimentally determined data. The ML models have 60% to 74% external predictivity, based on three separate datasets. Complementing the NCATS COVID19 portal, REDIAL-2020 can serve as a rapid online tool for identifying active molecules for COVID-19 treatment. The source code and specific models are available through Github (https://github.com/sirimullalab/redial-2020), or via Docker Hub (https://hub.docker.com/r/sirimullalab/redial-2020) for users preferring a containerized version.
Collapse
|
37
|
Abstract
Drug repositioning aims to reuse "old" drugs to treat diseases outside their approved indication(s). Composition-of-matter patents and FDA exclusivities can hinder the immediate availability of some drugs to be repositioned (repurposed). Here, we analyze data from the FDA Orange Book and use current on-market patent validity and exclusivities to classify drugs into on-patent (ONP), off-patent (OFP), and off-market (OFM) sets. In the absence of an unanimously accepted definition for small molecules, these sets include organic molecules and peptides with molecular weight between 100 and 1250, which resulted in 237 ONP drugs, 320 OFM, and 996 OFP drugs, respectively. We discuss the differences between the three categories in terms of primary molecular properties, chemical diversity, mechanism-of-action target classes, and therapeutic areas and comment on the enrichment of OFP drugs in the near future. Given the intellectual property landscape, and in the absence of specific property rights, we suggest that drugs should be prioritized as follows, to improve the repositioning strategy: (i) OFP, (ii) OFM, and (iii) ONP, respectively.
Collapse
|
38
|
|
39
|
|
40
|
Abstract
Prediction of chemical bioactivity and physical properties has been one of the most important applications of statistical and more recently, machine learning and artificial intelligence methods in chemical sciences. This field of research, broadly known as quantitative structure-activity relationships (QSAR) modeling, has developed many important algorithms and has found a broad range of applications in physical organic and medicinal chemistry in the past 55+ years. This Perspective summarizes recent technological advances in QSAR modeling but it also highlights the applicability of algorithms, modeling methods, and validation practices developed in QSAR to a wide range of research areas outside of traditional QSAR boundaries including synthesis planning, nanotechnology, materials science, biomaterials, and clinical informatics. As modern research methods generate rapidly increasing amounts of data, the knowledge of robust data-driven modelling methods professed within the QSAR field can become essential for scientists working both within and outside of chemical research. We hope that this contribution highlighting the generalizable components of QSAR modeling will serve to address this challenge.
Collapse
|
41
|
Will Artificial Intelligence for Drug Discovery Impact Clinical Pharmacology? Clin Pharmacol Ther 2020; 107:780-785. [PMID: 31957003 PMCID: PMC7158211 DOI: 10.1002/cpt.1795] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 01/09/2020] [Indexed: 01/05/2023]
Abstract
As the field of artificial intelligence and machine learning (AI/ML) for drug discovery is rapidly advancing, we address the question "What is the impact of recent AI/ML trends in the area of Clinical Pharmacology?" We address difficulties and AI/ML developments for target identification, their use in generative chemistry for small molecule drug discovery, and the potential role of AI/ML in clinical trial outcome evaluation. We briefly discuss current trends in the use of AI/ML in health care and the impact of AI/ML context of the daily practice of clinical pharmacologists.
Collapse
|
42
|
Diabetes mellitus risk for 102 drugs and drug combinations used in patients with bipolar disorder. Psychoneuroendocrinology 2020; 112:104511. [PMID: 31744781 DOI: 10.1016/j.psyneuen.2019.104511] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Revised: 08/28/2019] [Accepted: 11/07/2019] [Indexed: 12/18/2022]
Abstract
OBJECTIVE To compare the largest set of bipolar disorder pharmacotherapies to date (102 drugs and drug combinations) for risk of diabetes mellitus (DM). METHODS The IBM MarketScan® database was used to retrospectively analyze data on 565,253 adults with bipolar disorder without prior glucose metabolism-related diagnoses. The pharmacotherapies compared were lithium, mood-stabilizing anticonvulsants, antipsychotics, and antidepressants (monotherapy and multi-class polypharmacy). Cox regression modeling included fixed pre-treatment covariates and time-varying drug exposure covariates to estimate the hazard ratio (HR) of each treatment versus "No drug". RESULTS The annual incidence of new-onset diabetes during the exposure period was 3.09 % (22,951 patients). The HR of drug-dependent DM ranged from 0.79 to 2.37. One-third of the studied pharmacotherapies, including most of the antipsychotic-containing regimens, had a significantly higher risk of DM compared to "No drug". A significantly lower DM risk was associated with lithium, lamotrigine, oxcarbazepine and bupropion monotherapies, selective serotonin reuptake inhibitors (SSRI) mono-class therapy and several drug combinations containing bupropion and an SSRI. As additional drugs were combined in more complex polypharmacy, higher HRs were consistently observed. CONCLUSIONS There is an increased risk of diabetes mellitus associated with antipsychotic and psychotropic polypharmacy use in bipolar disorder. The evidence of a lower-than-baseline risk of DM with lamotrigine, oxcarbazepine, lithium, and bupropion monotherapy should be further investigated.
Collapse
|
43
|
Abstract
A lack of robust knowledge of the number of rare diseases and the number of people affected by them limits the development of approaches to ameliorate the substantial cumulative burden of rare diseases. Here, we call for coordinated efforts to more precisely define rare diseases.
Collapse
|
44
|
Abstract
Motivation Drug discovery investigations need to incorporate network pharmacology concepts while navigating the complex landscape of drug-target and target-target interactions. This task requires solutions that integrate high-quality biomedical data, combined with analytic and predictive workflows as well as efficient visualization. SmartGraph is an innovative platform that utilizes state-of-the-art technologies such as a Neo4j graph-database, Angular web framework, RxJS asynchronous event library and D3 visualization to accomplish these goals. Results The SmartGraph framework integrates high quality bioactivity data and biological pathway information resulting in a knowledgebase comprised of 420,526 unique compound-target interactions defined between 271,098 unique compounds and 2018 targets. SmartGraph then performs bioactivity predictions based on the 63,783 Bemis-Murcko scaffolds extracted from these compounds. Through several use-cases, we illustrate the use of SmartGraph to generate hypotheses for elucidating mechanism-of-action, drug-repurposing and off-target prediction. Availability https://smartgraph.ncats.io/.
Collapse
|
45
|
Abstract
Correction for ‘QSAR without borders’ by Eugene N. Muratov et al., Chem. Soc. Rev., 2020, DOI: 10.1039/d0cs00098a.
Collapse
|
46
|
Therapies for rare diseases: therapeutic modalities, progress and challenges ahead. Nat Rev Drug Discov 2019; 19:93-111. [PMID: 31836861 DOI: 10.1038/s41573-019-0049-9] [Citation(s) in RCA: 155] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2019] [Indexed: 12/26/2022]
Abstract
Most rare diseases still lack approved treatments despite major advances in research providing the tools to understand their molecular basis, as well as legislation providing regulatory and economic incentives to catalyse the development of specific therapies. Addressing this translational gap is a multifaceted challenge, for which a key aspect is the selection of the optimal therapeutic modality for translating advances in rare disease knowledge into potential medicines, known as orphan drugs. With this in mind, we discuss here the technological basis and rare disease applicability of the main therapeutic modalities, including small molecules, monoclonal antibodies, protein replacement therapies, oligonucleotides and gene and cell therapies, as well as drug repurposing. For each modality, we consider its strengths and limitations as a platform for rare disease therapy development and describe clinical progress so far in developing drugs based on it. We also discuss selected overarching topics in the development of therapies for rare diseases, such as approval statistics, engagement of patients in the process, regulatory pathways and digital tools.
Collapse
|
47
|
Can BDDCS illuminate targets in drug design? Drug Discov Today 2019; 24:2299-2306. [PMID: 31585170 DOI: 10.1016/j.drudis.2019.09.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 09/18/2019] [Accepted: 09/25/2019] [Indexed: 12/19/2022]
Abstract
The fact that pharmacokinetic (PK) properties of drugs influence their interaction with protein targets is a principle known for decades. The same cannot be said for the opposite, namely that targets influence the PK properties of drugs. Evidence confirming this possibility is introduced here for the first time, as we show that certain protein families have a clear preference for drugs with specific PK properties. We investigate this by cross-referencing 'druggable target' annotations for >1000 US Food and Drug Administration (FDA)-approved drugs with their PK profile, as defined by the Biopharmaceutics Drug Disposition Classification System (BDDCS) criteria, and then examine the BDDCS preference for several major target protein families and therapeutic categories. Our findings suggest a novel way to conduct drug discovery by focusing PK profiles at the very early stage of target selection.
Collapse
|
48
|
Exploring the dark genome: implications for precision medicine. Mamm Genome 2019; 30:192-200. [PMID: 31270560 DOI: 10.1007/s00335-019-09809-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 06/15/2019] [Indexed: 01/08/2023]
Abstract
The increase in the number of both patients and healthcare practitioners who grew up using the Internet and computers (so-called "digital natives") is likely to impact the practice of precision medicine, and requires novel platforms for data integration and mining, as well as contextualized information retrieval. The "Illuminating the Druggable Genome Knowledge Management Center" (IDG KMC) quantifies data availability from a wide range of chemical, biological, and clinical resources, and has developed platforms that can be used to navigate understudied proteins (the "dark genome"), and their potential contribution to specific pathologies. Using the "Target Importance and Novelty Explorer" (TIN-X) highlights the role of LRRC10 (a dark gene) in dilated cardiomyopathy. Combining mouse and human phenotype data leads to increased strength of evidence, which is discussed for four additional dark genes: SLX4IP and its role in glucose metabolism, the role of HSF2BP in coronary artery disease, the involvement of ELFN1 in attention-deficit hyperactivity disorder and the role of VPS13D in mouse neural tube development and its confirmed role in childhood onset movement disorders. The workflow and tools described here are aimed at guiding further experimental research, particularly within the context of precision medicine.
Collapse
|
49
|
The human endogenous metabolome as a pharmacology baseline for drug discovery. Drug Discov Today 2019; 24:1806-1820. [PMID: 31226432 DOI: 10.1016/j.drudis.2019.06.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 05/17/2019] [Accepted: 06/12/2019] [Indexed: 01/01/2023]
Abstract
We have limited understanding of the variation in in vitro affinities of drugs for their targets. An analysis of a highly curated set of 815 interactions between 566 drugs and 129 primary targets reveals that 71% of drug-target affinities have values above that of the corresponding endogenous ligand, 96% of them fitting within a range of two orders of magnitude. Our findings suggest that the evolutionary optimised affinity of endogenous ligands for their native proteins can serve as a baseline for the primary pharmacology of drugs. We show that the degree of off-target selectivity and safety risks of drugs derived from their secondary pharmacology depend very much on that baseline. Thus, we propose a new approach for estimating safety margins.
Collapse
|
50
|
Comparison of 71 bipolar disorder pharmacotherapies for kidney disorder risk: The potential hazards of polypharmacy. J Affect Disord 2019; 252:201-211. [PMID: 30986735 DOI: 10.1016/j.jad.2019.04.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 02/13/2019] [Accepted: 04/06/2019] [Indexed: 11/17/2022]
Abstract
BACKGROUND This study compared the largest set of bipolar disorder pharmacotherapies to date (71 drugs and drug combinations) for risk of kidney disorders (KDs). METHODS This retrospective observational study used the IBM MarketScan® database to analyze data on 591,052 adults with bipolar disorder without prior nephropathy, for onset of KDs (of "moderate" or "high" severity) following psychopharmacotherapy (lithium, mood stabilizing anticonvulsants [MSAs], antipsychotics, antidepressants), or "No drug". Cox regression models included fixed pre-treatment covariates and time-varying drug exposure covariates to estimate the hazard ratio (HR) of each treatment versus "No drug". RESULTS Newly observed KD occurred in 14,713 patients. No regimen had significantly lower risk of KDs than "No drug". The HR estimates ranged 0.86-2.66 for "all" KDs and 0.87-5.30 for "severe" KDs. As additional drugs were combined to compare more complex polypharmacies, higher HRs were consistently observed. Most regimens containing lithium, MSAs, or antipsychotics had a higher risk than "No drug" (p < 0.05). The risk for "all" and "severe" KDs was highest respectively on monoamine oxidase inhibitors (MAOIs) (HR = 2.66, p = 5.73 × 10-5), and a lithium-containing four-class combination (HR = 5.30, p = 2.46 × 10-9). The HR for lithium monotherapy was 1.82 (p = 4.73 × 10-17) for "severe" KDs. LIMITATIONS The limitations inherent for an observational study were non-randomized assignment of patients to treatment groups, non-standardization of diagnostic decisions, and non-uniform quality of data collection. No correction was made for medication dosage. CONCLUSIONS The findings support literature concerns about lithium nephrotoxicity and highlight the potential risks of MAOIs, MSAs, antipsychotics and psychotropic polypharmacy.
Collapse
|