Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

203
(from Reference Citation Analysis)

Article PDFs (37)

Cited by > 0 (185)

Searched Name

Tudor I Oprea

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Statistics

Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Category

Show more Refine

Number	Citation Analysis
1	Novel drug targets in 2023. Nat Rev Drug Discov 2024;23:330. [PMID: 38565953 DOI: 10.1038/d41573-024-00057-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024] Abstract Collapse Key Words Collapse MESH Headings Humans Drug Discovery Drug Delivery Systems Molecular Targeted Therapy Drug Development/methods Collapse Grants Collapse
2	Node-degree aware edge sampling mitigates inflated classification performance in biomedical random walk-based graph representation learning. BIOINFORMATICS ADVANCES 2024;4:vbae036. [PMID: 38577542 PMCID: PMC10994718 DOI: 10.1093/bioadv/vbae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 01/31/2024] [Accepted: 02/29/2024] [Indexed: 04/06/2024] Abstract Motivation Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes. For this reason, classification procedures are forced to assume that the vast majority of unlabeled edges are negative. Existing approaches to sampling negative edges for training and evaluating classifiers do so by uniformly sampling pairs of nodes. Results We show here that this sampling strategy typically leads to sets of positive and negative examples with imbalanced node degree distributions. Using representative heterogeneous biomedical knowledge graph and random walk-based graph machine learning, we show that this strategy substantially impacts classification performance. If users of graph machine-learning models apply the models to prioritize examples that are drawn from approximately the same distribution as the positive examples are, then performance of models as estimated in the validation phase may be artificially inflated. We present a degree-aware node sampling approach that mitigates this effect and is simple to implement. Availability and implementation Our code and data are publicly available at https://github.com/monarch-initiative/negativeExampleSelection. Collapse Key Words Collapse MESH Headings Collapse Grants National Institutes of Health National Cancer Institute Collapse
3	Overview of the Knowledge Management Center for Illuminating the Druggable Genome. Drug Discov Today 2024;29:103882. [PMID: 38218214 PMCID: PMC10939799 DOI: 10.1016/j.drudis.2024.103882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/22/2023] [Accepted: 01/09/2024] [Indexed: 01/15/2024] Abstract The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations. Based on this knowledge, proteins are classified into different Target Development Levels (TDLs) for identification of understudied targets. Additional work by the KMC focuses on enriching target knowledge and producing DrugCentral and other data visualization tools for expanding investigation of understudied targets. Collapse Key Words database druggable genome knowledge management pharos Collapse MESH Headings Humans Knowledge Management Genome Proteome Databases, Factual Informatics Collapse Grants U24 CA224370 NCI NIH HHS ZIC TR000410 Intramural NIH HHS Collapse
4	Artificial Intelligence for Drug Discovery: Are We There Yet? Annu Rev Pharmacol Toxicol 2024;64:527-550. [PMID: 37738505 DOI: 10.1146/annurev-pharmtox-040323-040828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/24/2023] Abstract Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes. This review discusses the use of AI in the three pillars of drug discovery: diseases, targets, and therapeutic modalities, with a focus on small-molecule drugs. AI technologies, such as generative chemistry, machine learning, and multiproperty optimization, have enabled several compounds to enter clinical trials. The scientific community must carefully vet known information to address the reproducibility crisis. The full potential of AI in drug discovery can only be realized with sufficient ground truth and appropriate human intervention at later pipeline stages. Collapse Key Words autoencoders deep learning explainable AI generative chemistry knowledge graphs machine learning multiproperty optimization small-molecule drug discovery target identification Collapse MESH Headings Animals Humans Artificial Intelligence Reproducibility of Results Drug Discovery Physicians Technology Collapse Grants Collapse
5	Correction: Exploring DrugCentral: from molecular structures to clinical effects. J Comput Aided Mol Des 2023;38:2. [PMID: 38040935 PMCID: PMC10692018 DOI: 10.1007/s10822-023-00545-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2023] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
6	Exploring DrugCentral: from molecular structures to clinical effects. J Comput Aided Mol Des 2023;37:681-694. [PMID: 37707619 PMCID: PMC10692006 DOI: 10.1007/s10822-023-00529-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/14/2023] [Indexed: 09/15/2023] Abstract DrugCentral, accessible at https://drugcentral.org , is an open-access online drug information repository. It covers over 4950 drugs, incorporating structural, physicochemical, and pharmacological details to support drug discovery, development, and repositioning. With around 20,000 bioactivity data points, manual curation enhances information from several major digital sources. Approximately 724 mechanism-of-action (MoA) targets offer updated drug target insights. The platform captures clinical data: over 14,300 on- and off-label uses, 27,000 contraindications, and around 340,000 adverse drug events from pharmacovigilance reports. DrugCentral encompasses information from molecular structures to marketed formulations, providing a comprehensive pharmaceutical reference. Users can easily navigate basic drug information and key features, making DrugCentral a versatile, unique resource. Furthermore, we present a use-case example where we utilize experimentally determined data from DrugCentral to support drug repurposing. A minimum activity threshold t should be considered against novel targets to repurpose a drug. Analyzing 1156 bioactivities for human MoA targets suggests a general threshold of 1 µM: t = 6 when expressed as - log[Activity(M)]). This applies to 87% of the drugs. Moreover, t can be refined empirically based on water solubility (S): t = 3 - logS, for logS < - 3. Alongside the drug repurposing classification scheme, which considers intellectual property rights, market exclusivity protections, and market accessibility, DrugCentral provides valuable data to prioritize candidates for drug repurposing programs efficiently. Collapse Key Words Bioactivity Databases Drug repurposing Drug targets Drugs Pharmacology Collapse MESH Headings Humans Molecular Structure Drug-Related Side Effects and Adverse Reactions Drug Repositioning Drug Discovery Drug Delivery Systems Collapse Grants U24 CA224370 NCI NIH HHS U24 CA224370 NIH HHS National Institutes of Health Collapse
7	Molecular Complexity: You Know It When You See It. J Med Chem 2023;66:12710-12714. [PMID: 37675804 PMCID: PMC10544322 DOI: 10.1021/acs.jmedchem.3c01507] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Indexed: 09/08/2023] Abstract Molecular complexity (MC) lacks a universal definition, but various studies address it in contexts ranging from ligand-receptor interactions to DNA sequencing, with the overarching emphasis being its significance in synthetic organic chemistry and pharmaceutical research. Efforts to quantify MC in drug discovery have been numerous, but a unified approach remains challenging. Strategies based on graph theory, information theory, and substructural feature counts employed to gauge MC are often correlated to molecular weight (MW). Herbert Waldmann and his team introduced a new MC metric called the spacial score (SPS), which is based on factors like atom hybridization and stereoisomeric considerations. While SPS and its normalized version, nSPS, correlate with the natural product likeness score, they do not align with traditional chemical properties. We examined nSPS trends for approved drugs and found no significant changes in MC over eight decades, nor did nSPS capture drug innovation during that period. Furthermore, our analysis indicates that while the majority of approved drugs have an nSPS value between 10 and 20, this metric does not correlate with key drug properties like target bioactivity and oral bioavailability. Mirroring a chemist's intuitive sense of chemical complexity, nSPS addresses the need for a precise empirical tool while a universal definition of MC remains elusive. Collapse Key Words Collapse MESH Headings Drug Discovery Biological Products Molecular Weight Collapse Grants U24 CA224370 NCI NIH HHS National Cancer Institute Collapse
8	Toxicology knowledge graph for structural birth defects. COMMUNICATIONS MEDICINE 2023;3:98. [PMID: 37460679 DOI: 10.1038/s43856-023-00329-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 06/29/2023] [Indexed: 07/20/2023] Open Abstract BACKGROUND Birth defects are functional and structural abnormalities that impact about 1 in 33 births in the United States. They have been attributed to genetic and other factors such as drugs, cosmetics, food, and environmental pollutants during pregnancy, but for most birth defects there are no known causes. METHODS To further characterize associations between small molecule compounds and their potential to induce specific birth abnormalities, we gathered knowledge from multiple sources to construct a reproductive toxicity Knowledge Graph (ReproTox-KG) with a focus on associations between birth defects, drugs, and genes. Specifically, we gathered data from drug/birth-defect associations from co-mentions in published abstracts, gene/birth-defect associations from genetic studies, drug- and preclinical-compound-induced gene expression changes in cell lines, known drug targets, genetic burden scores for human genes, and placental crossing scores for small molecules. RESULTS Using ReproTox-KG and semi-supervised learning (SSL), we scored >30,000 preclinical small molecules for their potential to cross the placenta and induce birth defects, and identified >500 birth-defect/gene/drug cliques that can be used to explain molecular mechanisms for drug-induced birth defects. The ReproTox-KG can be accessed via a web-based user interface available at https://maayanlab.cloud/reprotox-kg . This site enables users to explore the associations between birth defects, approved and preclinical drugs, and all human genes. CONCLUSIONS ReproTox-KG provides a resource for exploring knowledge about the molecular mechanisms of birth defects with the potential of predicting the likelihood of genes and preclinical small molecules to induce birth defects. Collapse Key Words Collapse MESH Headings Collapse Grants OT2OD030160 U.S. Department of Health & Human Services \| NIH \| NIH Office of the Director (OD) OT2OD030546 U.S. Department of Health & Human Services \| NIH \| NIH Office of the Director (OD) OT2OD030162 U.S. Department of Health & Human Services \| NIH \| NIH Office of the Director (OD) OT2OD030162 U.S. Department of Health & Human Services \| NIH \| NIH Office of the Director (OD) Collapse
9	Novel drug targets in 2022. Nat Rev Drug Discov 2023:10.1038/d41573-023-00068-y. [PMID: 37138052 DOI: 10.1038/d41573-023-00068-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
10	Surely you are joking, Mr Docking! Chem Soc Rev 2023;52:872-878. [PMID: 36644974 DOI: 10.1039/d2cs00948j] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Abstract In the wake of recent COVID-19 pandemics scientists around the world rushed to deliver numerous CADD (Computer-Aided Drug Discovery) methods and tools that could be reliably used to discover novel drug candidates against the SARS-CoV-2 virus. With that, there emerged a trend of a significant democratization of CADD that contributed to the rapid development of various COVID-19 drug candidates currently undergoing different stages of validation. On the other hand, this democratization also inadvertently led to the surge rapidly performed molecular docking studies to nominate multiple scores of novel drug candidates supported by computational arguments only. Albeit driven by best intentions, most of such studies also did not follow best practices in the field that require experience and expertise learned through multiple rigorously designed benchmarking studies and rigorous experimental validation. In this Viewpoint we reflect on recent disbalance between small number of rigorous and comprehensive studies and the proliferation of purely computational studies enabled by the ease of docking software availability. We further elaborate on the hyped oversale of CADD methods' ability to rapidly yield viable drug candidates and reiterate the critical importance of rigor and adherence to the best practices of CADD in view of recent emergence of AI and Big Data in the field. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
11	Illuminating the druggable genome through patent bioactivity data. PeerJ 2023;11:e15153. [PMID: 37151295 PMCID: PMC10162037 DOI: 10.7717/peerj.15153] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 03/10/2023] [Indexed: 05/09/2023] Open Abstract The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.surechembl.org/), according to how likely they were to contain bioactivity data for potent small molecules on less-studied targets, based on the classification developed by the Illuminating the Druggable Genome (IDG) project. The overall goal was to select a smaller number of patents that could be manually curated and incorporated into the ChEMBL database. Using relatively simple annotation and filtering pipelines, we have been able to identify a substantial number of patents containing quantitative bioactivity data for understudied targets that had not previously been reported in the peer-reviewed medicinal chemistry literature. We quantify the added value of such methods in terms of the numbers of targets that are so identified, and provide some specific illustrative examples. Our work underlines the potential value in searching the patent corpus in addition to the more traditional peer-reviewed literature. The small molecules found in these patents, together with their measured activity against the targets, are now accessible via the ChEMBL database. Collapse Key Words Bioactive compounds Drug targets Druggable genome Patents Small molecules Understudied targets Collapse MESH Headings Collapse Grants Collapse
12	Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform 2022;14:82. [PMID: 36461094 PMCID: PMC9716667 DOI: 10.1186/s13321-022-00661-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open Abstract We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries. During the meeting, applications, challenges, and opportunities in drug discovery, de novo drug design, ADME-Tox (absorption, distribution, metabolism, excretion and toxicity) property predictions, organic chemistry, peptides, and antibiotic resistance were discussed. The program along with the recordings of all sessions are freely available at https://www.difacquim.com/english/events/2022-colloquium/ . Collapse Key Words ADME profile Antibiotic resistance Artificial intelligence Career development Drug discovery Ligand-based drug design Machine learning Natural products Peptides Structure-based drug design Virtual screening Collapse MESH Headings Collapse Grants Collapse
13	Interpretable deep learning translation of GWAS and multi-omics findings to identify pathobiology and drug repurposing in Alzheimer's disease. Cell Rep 2022;41:111717. [PMID: 36450252 PMCID: PMC9837836 DOI: 10.1016/j.celrep.2022.111717] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 09/01/2022] [Accepted: 11/02/2022] [Indexed: 12/03/2022] Open Abstract Translating human genetic findings (genome-wide association studies [GWAS]) to pathobiology and therapeutic discovery remains a major challenge for Alzheimer's disease (AD). We present a network topology-based deep learning framework to identify disease-associated genes (NETTAG). We leverage non-coding GWAS loci effects on quantitative trait loci, enhancers and CpG islands, promoter regions, open chromatin, and promoter flanking regions under the protein-protein interactome. Via NETTAG, we identified 156 AD-risk genes enriched in druggable targets. Combining network-based prediction and retrospective case-control observations with 10 million individuals, we identified that usage of four drugs (ibuprofen, gemfibrozil, cholecalciferol, and ceftriaxone) is associated with reduced likelihood of AD incidence. Gemfibrozil (an approved lipid regulator) is significantly associated with 43% reduced risk of AD compared with simvastatin using an active-comparator design (95% confidence interval 0.51-0.63, p < 0.0001). In summary, NETTAG offers a deep learning methodology that utilizes GWAS and multi-genomic findings to identify pathobiology and drug repurposing in AD. Collapse Key Words AD Alzheimer’s disease CP: Neuroscience EHR GWAS deep learning drug repurposing drug target electronic health record gemfibrozil genome-wide association studies multi-omics pathobiology protein-protein Interactome Collapse MESH Headings Humans Genome-Wide Association Study Drug Repositioning Alzheimer Disease/drug therapy Alzheimer Disease/genetics Deep Learning Gemfibrozil Retrospective Studies Collapse Grants K08 AG065463 NIA NIH HHS UL1 TR001422 NCATS NIH HHS U01 AG073323 NIA NIH HHS R01 AG066707 NIA NIH HHS R01 AG076448 NIA NIH HHS P20 GM109025 NIGMS NIH HHS P30 AG072977 NIA NIH HHS P30 AG072959 NIA NIH HHS P20 AG068053 NIA NIH HHS R56 AG074001 NIA NIH HHS U24 AG021886 NIA NIH HHS Collapse
14	Agent-based modeling predicts RAC1 is critical for ovarian cancer metastasis. Mol Biol Cell 2022;33:ar138. [PMID: 36200848 PMCID: PMC9727804 DOI: 10.1091/mbc.e21-11-0540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open Abstract Experimental and computational studies pinpoint rate-limiting step(s) in metastasis governed by Rac1. Using ovarian cancer cell and animal models, Rac1 expression was manipulated, and quantitative measurements of cell-cell and cell-substrate adhesion, cell invasion, mesothelial clearance, and peritoneal tumor growth discriminated the tumor behaviors most highly influenced by Rac1. The experimental data were used to parameterize an agent-based computational model simulating peritoneal niche colonization, intravasation, and hematogenous metastasis to distant organs. Increased ovarian cancer cell survival afforded by the more rapid adhesion and intravasation upon Rac1 overexpression is predicted to increase the numbers of and the rates at which tumor cells are disseminated to distant sites. Surprisingly, crowding of cancer cells along the blood vessel was found to decrease the numbers of cells reaching a distant niche irrespective of Rac1 overexpression or knockdown, suggesting that sites for tumor cell intravasation are rate limiting and become accessible if cells intravasate rapidly or are displaced due to diminished viability. Modeling predictions were confirmed through animal studies of Rac1-dependent metastasis to the lung. Collectively, the experimental and modeling approaches identify cell adhesion, rapid intravasation, and survival in the blood as parameters in the ovarian metastatic cascade that are most critically dependent on Rac1. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
15	AlphaFold illuminates half of the dark human proteins. Curr Opin Struct Biol 2022;74:102372. [PMID: 35439658 PMCID: PMC10669925 DOI: 10.1016/j.sbi.2022.102372] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/02/2022] [Accepted: 03/13/2022] [Indexed: 01/05/2023] Abstract We investigate the use of confidence scores to evaluate the accuracy of a given AlphaFold (AF2) protein model for drug discovery. Prediction of accuracy is improved by not considering confidence scores below 80 due to the effects of disorder. On a set of recent crystal structures, 95% are likely to have accurate folds. Conformational discordance in the training set has a much more significant effect on accuracy than sequence divergence. We propose criteria for models and residues that are possibly useful for virtual screening. Based on these criteria, AF2 provides models for half of understudied (dark) human proteins and two-thirds of residues in those models. Collapse Key Words Artificial intelligence Drug discovery Model evaluation Protein folding Understudied proteins Virtual screening Collapse MESH Headings Drug Discovery Furylfuramide Humans Protein Folding Proteins/chemistry Collapse Grants R35 GM134864 NIGMS NIH HHS U24 CA224370 NCI NIH HHS Collapse
16	Novel drug targets in 2021. Nat Rev Drug Discov 2022;21:328. [PMID: 35361900 DOI: 10.1038/d41573-022-00057-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
17	Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration. Database (Oxford) 2022;2022:6554833. [PMID: 35348648 PMCID: PMC9216524 DOI: 10.1093/database/baac019] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 02/14/2022] [Accepted: 03/11/2022] [Indexed: 12/04/2022] Abstract The scientific knowledge about which genes are involved in which diseases grows rapidly, which makes it difficult to keep up with new publications and genetics datasets. The DISEASES database aims to provide a comprehensive overview by systematically integrating and assigning confidence scores to evidence for disease–gene associations from curated databases, genome-wide association studies (GWAS) and automatic text mining of the biomedical literature. Here, we present a major update to this resource, which greatly increases the number of associations from all these sources. This is especially true for the text-mined associations, which have increased by at least 9-fold at all confidence cutoffs. We show that this dramatic increase is primarily due to adding full-text articles to the text corpus, secondarily due to improvements to both the disease and gene dictionaries used for named entity recognition, and only to a very small extent due to the growth in number of PubMed abstracts. DISEASES now also makes use of a new GWAS database, Target Illumination by GWAS Analytics, which considerably increased the number of GWAS-derived disease–gene associations. DISEASES itself is also integrated into several other databases and resources, including GeneCards/MalaCards, Pharos/Target Central Resource Database and the Cytoscape stringApp. All data in DISEASES are updated on a weekly basis and is available via a web interface at https://diseases.jensenlab.org, from where it can also be downloaded under open licenses. Database URL: https://diseases.jensenlab.org Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
18	State of the Art and Uses for the Biopharmaceutics Drug Disposition Classification System (BDDCS): New Additions, Revisions, and Citation References. AAPS J 2022;24:37. [PMID: 35199251 PMCID: PMC8865883 DOI: 10.1208/s12248-022-00687-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 01/24/2022] [Indexed: 12/16/2022] Open Abstract The Biopharmaceutics Drug Disposition Classification system (BDDCS) is a four-class approach based on water solubility and extent of metabolism/permeability rate. Based on the BDDCS class to which a drug is assigned, it is possible to predict the role of metabolic enzymes and transporters on the drug disposition of a new molecular entity (NME) prior to its administration to animals or humans. Here, we report a total of 1475 drugs and active metabolites to which the BDDCS is applied. Of these, 379 are new entries, and 1096 are revisions of former classification studies with the addition of references for the approved maximum dose strength, extent of the systemically available drug excreted unchanged in the urine, and lowest solubility over the pH range 1.0–6.8 when such information is available in the literature. We detail revised class assignments of previously misclassified drugs and the literature analyses to classify new drugs. We review the process of solubility assessment for NMEs prior to drug dosing in humans and approved dose classification, as well as the comparison of Biopharmaceutics Classification System (BCS) versus BDDCS assignment. We detail the uses of BDDCS in predicting, prior to dosing animals or humans, disposition characteristics, potential brain penetration, food effect, and drug-induced liver injury (DILI) potential. This work provides an update on the current status of the BDDCS and its uses in the drug development process. Collapse Key Words BCS BDDCS DILI dose number extent of metabolism food effects solubility Collapse MESH Headings Collapse Grants Collapse
19	A Comprehensive COVID-19 Daily News and Medical Literature Briefing to Inform Health Care and Policy in New Mexico: Implementation Study. JMIR MEDICAL EDUCATION 2022;8:e23845. [PMID: 35142625 PMCID: PMC8908195 DOI: 10.2196/23845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 04/29/2021] [Accepted: 02/09/2022] [Indexed: 06/14/2023] Abstract BACKGROUND On March 11, 2020, the New Mexico Governor declared a public health emergency in response to the COVID-19 pandemic. The New Mexico medical advisory team contacted University of New Mexico (UNM) faculty to form a team to consolidate growing information on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its disease to facilitate New Mexico's pandemic management. Thus, faculty, physicians, staff, graduate students, and medical students created the "UNM Global Health COVID-19 Intelligence Briefing." OBJECTIVE In this paper, we sought to (1) share how to create an informative briefing to guide public policy and medical practice and manage information overload with rapidly evolving scientific evidence; (2) determine the qualitative usefulness of the briefing to its readers; and (3) determine the qualitative effect this project has had on virtual medical education. METHODS Microsoft Teams was used for manual and automated capture of COVID-19 articles and composition of briefings. Multilevel triaging saved impactful articles to be reviewed, and priority was placed on randomized controlled studies, meta-analyses, systematic reviews, practice guidelines, and information on health care and policy response to COVID-19. The finalized briefing was disseminated by email, a listserv, and posted on the UNM digital repository. A survey was sent to readers to determine briefing usefulness and whether it led to policy or medical practice changes. Medical students, unable to partake in direct patient care, proposed to the School of Medicine that involvement in the briefing should count as course credit, which was approved. The maintenance of medical student involvement in the briefings as well as this publication was led by medical students. RESULTS An average of 456 articles were assessed daily. The briefings reached approximately 1000 people by email and listserv directly, with an unknown amount of forwarding. Digital repository tracking showed 5047 downloads across 116 countries as of July 5, 2020. The survey found 108 (95%) of 114 participants gained relevant knowledge, 90 (79%) believed it decreased misinformation, 27 (24%) used the briefing as their primary source of information, and 90 (79%) forwarded it to colleagues. Specific and impactful public policy decisions were informed based on the briefing. Medical students reported that the project allowed them to improve on their scientific literature assessment, stay current on the pandemic, and serve their community. CONCLUSIONS The COVID-19 briefings succeeded in informing and guiding New Mexico policy and clinical practice. The project received positive feedback from the community and was shown to decrease information burden and misinformation. The virtual platforms allowed for the continuation of medical education. Variability in subject matter expertise was addressed with training, standardized article selection criteria, and collaborative editing led by faculty. Collapse Key Words COVID-19 New Mexico SARS-CoV-2 daily report epidemics global health medical education pandemic policy Collapse MESH Headings Collapse Grants D43 TW005884 FIC NIH HHS D43 TW010543 FIC NIH HHS K43 TW011581 FIC NIH HHS T32 MH018399 NIMH NIH HHS Collapse
20	A Workflow of Integrated Resources to Catalyze Network Pharmacology Driven COVID-19 Research. J Chem Inf Model 2022;62:718-729. [PMID: 35057621 PMCID: PMC10790216 DOI: 10.1021/acs.jcim.1c00431] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Abstract In the event of an outbreak due to an emerging pathogen, time is of the essence to contain or to mitigate the spread of the disease. Drug repositioning is one of the strategies that has the potential to deliver therapeutics relatively quickly. The SARS-CoV-2 pandemic has shown that integrating critical data resources to drive drug-repositioning studies, involving host-host, host-pathogen, and drug-target interactions, remains a time-consuming effort that translates to a delay in the development and delivery of a life-saving therapy. Here, we describe a workflow we designed for a semiautomated integration of rapidly emerging data sets that can be generally adopted in a broad network pharmacology research setting. The workflow was used to construct a COVID-19 focused multimodal network that integrates 487 host-pathogen, 63 278 host-host protein, and 1221 drug-target interactions. The resultant Neo4j graph database named "Neo4COVID19" is made publicly accessible via a web interface and via API calls based on the Bolt protocol. Details for accessing the database are provided on a landing page (https://neo4covid19.ncats.io/). We believe that our Neo4COVID19 database will be a valuable asset to the research community and will catalyze the discovery of therapeutics to fight COVID-19. Collapse Key Words Collapse MESH Headings COVID-19 Drug Repositioning Humans Network Pharmacology Pandemics SARS-CoV-2 Workflow Collapse Grants U54 CA189205 NCI NIH HHS ZIC TR000242 Intramural NIH HHS Collapse
21	Getting Started with the IDG KMC Datasets and Tools. Curr Protoc 2022;2:e355. [PMID: 35085427 PMCID: PMC10789444 DOI: 10.1002/cpz1.355] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Abstract The Illuminating the Druggable Genome (IDG) consortium is a National Institutes of Health (NIH) Common Fund program designed to enhance our knowledge of under-studied proteins, more specifically, proteins unannotated within the three most commonly drug-targeted protein families: G-protein coupled receptors, ion channels, and protein kinases. Since 2014, the IDG Knowledge Management Center (IDG-KMC) has generated several open-access datasets and resources that jointly serve as a highly translational machine-learning-ready knowledgebase focused on human protein-coding genes and their products. The goal of the IDG-KMC is to develop comprehensive integrated knowledge for the druggable genome to illuminate the uncharacterized or poorly annotated portion of the druggable genome. The tools derived from the IDG-KMC provide either user-friendly visualizations or ways to impute the knowledge about potential targets using machine learning strategies. In the following protocols, we describe how to use each web-based tool to accelerate illumination in under-studied proteins. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Interacting with the Pharos user interface Basic Protocol 2: Accessing the data in Harmonizome Basic Protocol 3: The ARCHS4 resource Basic Protocol 4: Making predictions about gene function with PrismExp Basic Protocol 5: Using Geneshot to illuminate knowledge about under-studied targets Basic Protocol 6: Exploring under-studied targets with TIN-X Basic Protocol 7: Interacting with the DrugCentral user interface Basic Protocol 8: Estimating Anti-SARS-CoV-2 activities with DrugCentral REDIAL-2020 Basic Protocol 9: Drug Set Enrichment Analysis using Drugmonizome Basic Protocol 10: The Drugmonizome-ML Appyter Basic Protocol 11: The Harmonizome-ML Appyter Basic Protocol 12: GWAS target illumination with TIGA Basic Protocol 13: Prioritizing kinases for lists of proteins and phosphoproteins with KEA3 Basic Protocol 14: Converting PubMed searches to drug sets with the DrugShot Appyter. Collapse Key Words bioinformatics data visualization disease ontology drug discovery drug targets druggable genome web applications Collapse MESH Headings COVID-19 Databases, Genetic Genome Humans Machine Learning Proteins SARS-CoV-2 Collapse Grants U24 CA224370 NCI NIH HHS U24TR002278 NIH HHS U54 HL127624 NHLBI NIH HHS U24 TR002278 NCATS NIH HHS OT2 OD030546 NIH HHS U24CA224370 NIH HHS U01 CA239108 NCI NIH HHS U24CA224260 NIH HHS U24 CA224260 NCI NIH HHS U54HL127624 NIH HHS OT2OD030546 NIH HHS U01CA239108 NIH HHS Collapse
22	Supervised learning with word embeddings derived from PubMed captures latent knowledge about protein kinases and cancer. NAR Genom Bioinform 2021;3:lqab113. [PMID: 34888523 PMCID: PMC8652379 DOI: 10.1093/nargab/lqab113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 10/14/2021] [Accepted: 11/24/2021] [Indexed: 11/17/2022] Open Abstract Inhibiting protein kinases (PKs) that cause cancers has been an important topic in cancer therapy for years. So far, almost 8% of >530 PKs have been targeted by FDA-approved medications, and around 150 protein kinase inhibitors (PKIs) have been tested in clinical trials. We present an approach based on natural language processing and machine learning to investigate the relations between PKs and cancers, predicting PKs whose inhibition would be efficacious to treat a certain cancer. Our approach represents PKs and cancers as semantically meaningful 100-dimensional vectors based on word and concept neighborhoods in PubMed abstracts. We use information about phase I-IV trials in ClinicalTrials.gov to construct a training set for random forest classification. Our results with historical data show that associations between PKs and specific cancers can be predicted years in advance with good accuracy. Our tool can be used to predict the relevance of inhibiting PKs for specific cancers and to support the design of well-focused clinical trials to discover novel PKIs for cancer therapy. Collapse Key Words Collapse MESH Headings Collapse Grants P30 CA034196 NCI NIH HHS U01 CA239108 NCI NIH HHS U24 CA224370 NCI NIH HHS NIH NCI DOE Collapse
23	TIGA: target illumination GWAS analytics. Bioinformatics 2021;37:3865-3873. [PMID: 34086846 PMCID: PMC11025677 DOI: 10.1093/bioinformatics/btab427] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 05/12/2021] [Accepted: 06/03/2021] [Indexed: 12/31/2022] Open Abstract MOTIVATION Genome-wide association studies can reveal important genotype-phenotype associations; however, data quality and interpretability issues must be addressed. For drug discovery scientists seeking to prioritize targets based on the available evidence, these issues go beyond the single study. RESULTS Here, we describe rational ranking, filtering and interpretation of inferred gene-trait associations and data aggregation across studies by leveraging existing curation and harmonization efforts. Each gene-trait association is evaluated for confidence, with scores derived solely from aggregated statistics, linking a protein-coding gene and phenotype. We propose a method for assessing confidence in gene-trait associations from evidence aggregated across studies, including a bibliometric assessment of scientific consensus based on the iCite relative citation ratio, and meanRank scores, to aggregate multivariate evidence.This method, intended for drug target hypothesis generation, scoring and ranking, has been implemented as an analytical pipeline, available as open source, with public datasets of results, and a web application designed for usability by drug discovery scientists. AVAILABILITY AND IMPLEMENTATION Web application, datasets and source code via https://unmtid-shinyapps.net/tiga/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Genotype Genome-Wide Association Study Lighting Polymorphism, Single Nucleotide Phenotype Collapse Grants U24 CA224370 NCI NIH HHS U54 CA189205 NCI NIH HHS US National Institutes of Health Illuminating the Druggable Genome Knowledge Management Center Novo Nordisk Foundation Collapse
24	A critical overview of computational approaches employed for COVID-19 drug discovery. Chem Soc Rev 2021;50:9121-9151. [PMID: 34212944 PMCID: PMC8371861 DOI: 10.1039/d0cs01065k] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Indexed: 01/18/2023] Abstract COVID-19 has resulted in huge numbers of infections and deaths worldwide and brought the most severe disruptions to societies and economies since the Great Depression. Massive experimental and computational research effort to understand and characterize the disease and rapidly develop diagnostics, vaccines, and drugs has emerged in response to this devastating pandemic and more than 130 000 COVID-19-related research papers have been published in peer-reviewed journals or deposited in preprint servers. Much of the research effort has focused on the discovery of novel drug candidates or repurposing of existing drugs against COVID-19, and many such projects have been either exclusively computational or computer-aided experimental studies. Herein, we provide an expert overview of the key computational methods and their applications for the discovery of COVID-19 small-molecule therapeutics that have been reported in the research literature. We further outline that, after the first year the COVID-19 pandemic, it appears that drug repurposing has not produced rapid and global solutions. However, several known drugs have been used in the clinic to cure COVID-19 patients, and a few repurposed drugs continue to be considered in clinical trials, along with several novel clinical candidates. We posit that truly impactful computational tools must deliver actionable, experimentally testable hypotheses enabling the discovery of novel drugs and drug combinations, and that open science and rapid sharing of research results are critical to accelerate the development of novel, much needed therapeutics for COVID-19. Collapse Key Words Collapse MESH Headings Antiviral Agents/therapeutic use COVID-19/virology Clinical Trials as Topic Computer Simulation Drug Design Drug Discovery/methods Drug Repositioning Humans Pandemics SARS-CoV-2/drug effects COVID-19 Drug Treatment Collapse Grants U01 CA207160 NCI NIH HHS U24 CA224370 NCI NIH HHS R01 GM132826 NIGMS NIH HHS R44 GM122196 NIGMS NIH HHS U24 TR002278 NCATS NIH HHS U01 CA239108 NCI NIH HHS Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung National Institutes of Health Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México National Science Foundation Collapse
25	How to Illuminate the Druggable Genome Using Pharos. ACTA ACUST UNITED AC 2021;69:e92. [PMID: 31898878 DOI: 10.1002/cpbi.92] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Abstract Pharos is an integrated web-based informatics platform for the analysis of data aggregated by the Illuminating the Druggable Genome (IDG) Knowledge Management Center, an NIH Common Fund initiative. The current version of Pharos (as of October 2019) spans 20,244 proteins in the human proteome, 19,880 disease and phenotype associations, and 226,829 ChEMBL compounds. This resource not only collates and analyzes data from over 60 high-quality resources to generate these types, but also uses text indexing to find less apparent connections between targets, and has recently begun to collaborate with institutions that generate data and resources. Proteins are ranked according to a knowledge-based classification system, which can help researchers to identify less studied "dark" targets that could be potentially further illuminated. This is an important process for both drug discovery and target validation, as more knowledge can accelerate target identification, and previously understudied proteins can serve as novel targets in drug discovery. Two basic protocols illustrate the levels of detail available for targets and several methods of finding targets of interest. An Alternate Protocol illustrates the difference in available knowledge between less and more studied targets. © 2020 by John Wiley & Sons, Inc. Basic Protocol 1: Search for a target and view details Alternate Protocol: Search for dark target and view details Basic Protocol 2: Filter a target list to get refined results. Collapse Key Words bioinformatics dark genome disease drug discovery drug targets phenotype proteins target validation Collapse MESH Headings Collapse Grants Collapse
26	COVIDomic: A multi-modal cloud-based platform for identification of risk factors associated with COVID-19 severity. PLoS Comput Biol 2021;17:e1009183. [PMID: 34260589 PMCID: PMC8312936 DOI: 10.1371/journal.pcbi.1009183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 07/26/2021] [Accepted: 06/14/2021] [Indexed: 01/08/2023] Open Abstract Coronavirus disease 2019 (COVID-19) is an acute infection of the respiratory tract that emerged in December 2019 in Wuhan, China. It was quickly established that both the symptoms and the disease severity may vary from one case to another and several strains of SARS-CoV-2 have been identified. To gain a better understanding of the wide variety of SARS-CoV-2 strains and their associated symptoms, thousands of SARS-CoV-2 genomes have been sequenced in dozens of countries. In this article, we introduce COVIDomic, a multi-omics online platform designed to facilitate the analysis and interpretation of the large amount of health data collected from patients with COVID-19. The COVIDomic platform provides a comprehensive set of bioinformatic tools for the multi-modal metatranscriptomic data analysis of COVID-19 patients to determine the origin of the coronavirus strain and the expected severity of the disease. An integrative analytical workflow, which includes microbial pathogens community analysis, COVID-19 genetic epidemiology and patient stratification, allows to analyze the presence of the most common microbial organisms, their antibiotic resistance, the severity of the infection and the set of the most probable geographical locations from which the studied strain could have originated. The online platform integrates a user friendly interface which allows easy visualization of the results. We envision this tool will not only have immediate implications for management of the ongoing COVID-19 pandemic, but will also improve our readiness to respond to other infectious outbreaks. Collapse Key Words Collapse MESH Headings COVID-19/epidemiology COVID-19/genetics COVID-19/physiopathology COVID-19/virology Cloud Computing Computational Biology/methods Humans Risk Factors SARS-CoV-2/genetics Severity of Illness Index User-Computer Interface Collapse Grants Collapse
27	Crowdsourced mapping of unexplored target space of kinase inhibitors. Nat Commun 2021;12:3307. [PMID: 34083538 PMCID: PMC8175708 DOI: 10.1038/s41467-021-23165-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Accepted: 04/15/2021] [Indexed: 12/31/2022] Open Abstract Despite decades of intensive search for compounds that modulate the activity of particular protein targets, a large proportion of the human kinome remains as yet undrugged. Effective approaches are therefore required to map the massive space of unexplored compound-kinase interactions for novel and potent activities. Here, we carry out a crowdsourced benchmarking of predictive algorithms for kinase inhibitor potencies across multiple kinase families tested on unpublished bioactivity data. We find the top-performing predictions are based on various models, including kernel learning, gradient boosting and deep learning, and their ensemble leads to a predictive accuracy exceeding that of single-dose kinase activity assays. We design experiments based on the model predictions and identify unexpected activities even for under-studied kinases, thereby accelerating experimental mapping efforts. The open-source prediction algorithms together with the bioactivities between 95 compounds and 295 kinases provide a resource for benchmarking prediction algorithms and for extending the druggable kinome. Collapse Key Words kinases cheminformatics machine learning Collapse MESH Headings Algorithms Benchmarking Crowdsourcing Databases, Pharmaceutical Deep Learning Drug Discovery Drug Evaluation, Preclinical Humans Kinetics Machine Learning Models, Biological Models, Chemical Protein Kinase Inhibitors/chemistry Protein Kinase Inhibitors/pharmacokinetics Protein Kinase Inhibitors/pharmacology Protein Kinases/chemistry Protein Kinases/metabolism Proteomics Regression Analysis Collapse Grants C42454/A28596 Cancer Research UK U24 CA224370 NCI NIH HHS U24 TR002278 NCATS NIH HHS U24 DK116204 NIDDK NIH HHS Wellcome Trust U54 OD020353 NIH HHS U01 CA239108 NCI NIH HHS Academy of Finland (Suomen Akatemia) Cancer Research UK (CRUK) Ministry of Health and Care Services \| Helse Sør-Øst RHF (Southern and Eastern Norway Regional Health Authority) Collapse
28	A machine learning platform to estimate anti-SARS-CoV-2 activities. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00335-w] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
29	Novel drug targets in 2020. Nat Rev Drug Discov 2021;20:333. [PMID: 33824485 DOI: 10.1038/d41573-021-00057-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
30	InContext: curation of medical context for drug indications. J Biomed Semantics 2021;12:2. [PMID: 33579375 PMCID: PMC7881657 DOI: 10.1186/s13326-021-00234-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2018] [Accepted: 01/21/2021] [Indexed: 11/10/2022] Open Abstract Accurate and precise information about the therapeutic uses (indications) of a drug is essential for applications in drug repurposing and precision medicine. Leading online drug resources such as DrugCentral and DrugBank provide rich information about various properties of drugs, including their indications. However, because indications in such databases are often partly automatically mined, some may prove to be inaccurate or imprecise. Particularly challenging for text mining methods is the task of distinguishing between general disease mentions in drug product labels and actual indications for the drug. For this, the qualifying medical context of the disease mentions in the text should be studied. Some examples include contraindications, co-prescribed drugs and target patient qualifications. No existing indication curation efforts attempt to capture such information in a precise way. Here we fill this gap by presenting a novel curation protocol for extracting indications and machine processable annotations of contextual information about the therapeutic use of a drug. We implemented the protocol on a reference set of FDA-approved drug product labels on the DailyMed website to curate indications for 150 anti-cancer and cardiovascular drugs. The resulting corpus - InContext - focuses on anti-cancer and cardiovascular drugs because of the heightened societal interest in cancer and heart disease. In order to understand how InContext relates with existing reputable drug indication databases, we analysed it’s overlap with a state-of-the-art indications database - LabeledIn - as well as a reputable online drug compendium - DrugCentral. We found that 40% of indications sampled from DrugCentral (and 23% from LabeledIn) respectively, could not be accounted for in InContext. This raises questions about the veracity of indications not appearing in InContext. The additional contextual information curated by InContext about disease mentions in drug SPLs provides a foundation for more precise, structured and formal representations of knowledge related to drug therapeutic use, in order to increase accuracy and agreement of drug indication extraction methods for in silico drug repurposing. Collapse Key Words Data quality Drug indications Drug repurposing Ontologies Semantic similarity Collapse MESH Headings Collapse Grants Collapse
31	DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Res 2021;49:D1160-D1169. [PMID: 33151287 PMCID: PMC7779058 DOI: 10.1093/nar/gkaa997] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/18/2022] Open Abstract DrugCentral is a public resource (http://drugcentral.org) that serves the scientific community by providing up-to-date drug information, as described in previous papers. The current release includes 109 newly approved (October 2018 through March 2020) active pharmaceutical ingredients in the US, Europe, Japan and other countries; and two molecular entities (e.g. mefuparib) of interest for COVID19. New additions include a set of pharmacokinetic properties for ∼1000 drugs, and a sex-based separation of side effects, processed from FAERS (FDA Adverse Event Reporting System); as well as a drug repositioning prioritization scheme based on the market availability and intellectual property rights forFDA approved drugs. In the context of the COVID19 pandemic, we also incorporated REDIAL-2020, a machine learning platform that estimates anti-SARS-CoV-2 activities, as well as the 'drugs in news' feature offers a brief enumeration of the most interesting drugs at the present moment. The full database dump and data files are available for download from the DrugCentral web portal. Collapse Key Words Collapse MESH Headings Antiviral Agents/adverse effects Antiviral Agents/pharmacokinetics Antiviral Agents/therapeutic use COVID-19/epidemiology COVID-19/virology Databases, Pharmaceutical/statistics & numerical data Drug Approval/methods Drug Approval/statistics & numerical data Drug Discovery/methods Drug Discovery/statistics & numerical data Drug Repositioning/methods Drug Repositioning/statistics & numerical data Epidemics Europe Humans Information Storage and Retrieval/methods Internet Japan SARS-CoV-2/drug effects SARS-CoV-2/physiology United States COVID-19 Drug Treatment Collapse Grants P30 CA118100 NCI NIH HHS U24 CA224370 NCI NIH HHS UL1 TR001449 NCATS NIH HHS National Institutes of Health National Cancer Institute NIH NCATS Clinical and Translational Science Center for UNM Intramural Research Program, Division of Preclinical Innovation, NIH NCATS National Science Foundation Collapse
32	TCRD and Pharos 2021: mining the human proteome for disease biology. Nucleic Acids Res 2021;49:D1334-D1346. [PMID: 33156327 PMCID: PMC7778974 DOI: 10.1093/nar/gkaa993] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/13/2022] Open Abstract In 2014, the National Institutes of Health (NIH) initiated the Illuminating the Druggable Genome (IDG) program to identify and improve our understanding of poorly characterized proteins that can potentially be modulated using small molecules or biologics. Two resources produced from these efforts are: The Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/) and Pharos (https://pharos.nih.gov/), a web interface to browse the TCRD. The ultimate goal of these resources is to highlight and facilitate research into currently understudied proteins, by aggregating a multitude of data sources, and ranking targets based on the amount of data available, and presenting data in machine learning ready format. Since the 2017 release, both TCRD and Pharos have produced two major releases, which have incorporated or expanded an additional 25 data sources. Recently incorporated data types include human and viral-human protein-protein interactions, protein-disease and protein-phenotype associations, and drug-induced gene signatures, among others. These aggregated data have enabled us to generate new visualizations and content sections in Pharos, in order to empower users to find new areas of study in the druggable genome. Collapse Key Words Collapse MESH Headings Animals Anticonvulsants/chemistry Anticonvulsants/therapeutic use Antiviral Agents/chemistry Antiviral Agents/therapeutic use Biological Products/chemistry Biological Products/therapeutic use Data Mining/statistics & numerical data Databases, Factual Genome, Human Host-Pathogen Interactions/drug effects Host-Pathogen Interactions/genetics Humans Internet Machine Learning/statistics & numerical data Mice Mice, Knockout Molecular Targeted Therapy/methods Neurodegenerative Diseases/classification Neurodegenerative Diseases/drug therapy Neurodegenerative Diseases/genetics Neurodegenerative Diseases/virology Protein Interaction Mapping Proteome/agonists Proteome/antagonists & inhibitors Proteome/genetics Proteome/metabolism Proteomics/methods Small Molecule Libraries/chemistry Small Molecule Libraries/therapeutic use Software Virus Diseases/classification Virus Diseases/drug therapy Virus Diseases/genetics Virus Diseases/virology Collapse Grants U24 CA224370 NCI NIH HHS U24 TR002278 NCATS NIH HHS National Institutes of Health Novo Nordisk Foundation Intramural Research Program, Division of Preclinical Innovation, NIH NCATS Collapse
33	Virtual and In Vitro Antiviral Screening Revive Therapeutic Drugs for COVID-19. ACS Pharmacol Transl Sci 2020;3:1278-1292. [PMID: 33330842 PMCID: PMC7571299 DOI: 10.1021/acsptsci.0c00131] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Indexed: 02/08/2023] Abstract The urgent need for a cure for early phase COVID-19 infected patients critically underlines drug repositioning strategies able to efficiently identify new and reliable treatments by merging computational, experimental, and pharmacokinetic expertise. Here we report new potential therapeutics for COVID-19 identified with a combined virtual and experimental screening strategy and selected among already approved drugs. We used hydroxychloroquine (HCQ), one of the most studied drugs in current clinical trials, as a reference template to screen for structural similarity against a library of almost 4000 approved drugs. The top-ranked drugs, based on structural similarity to HCQ, were selected for in vitro antiviral assessment. Among the selected drugs, both zuclopenthixol and nebivolol efficiently block SARS-CoV-2 infection with EC50 values in the low micromolar range, as confirmed by independent experiments. The anti-SARS-CoV-2 potential of ambroxol, amodiaquine, and its active metabolite (N-monodesethyl amodiaquine) is also discussed. In trying to understand the "hydroxychloroquine" mechanism of action, both pK a and the HCQ aromatic core may play a role. Further, we show that the amodiaquine metabolite and, to a lesser extent, zuclopenthixol and nebivolol are active in a SARS-CoV-2 titer reduction assay. Given the need for improved efficacy and safety, we propose zuclopenthixol, nebivolol, and amodiaquine as potential candidates for clinical trials against the early phase of the SARS-CoV-2 infection and discuss their potential use as adjuvant to the current (i.e., remdesivir and favipiravir) COVID-19 therapeutics. Collapse Key Words covid-19 drug repositioning drug repurposing virtual screening zuclopenthixol nebivolol amodiaquine Collapse MESH Headings Collapse Grants R01 AI130473 NIAID NIH HHS R01 AI160662 NIAID NIH HHS U24 CA224370 NCI NIH HHS Collapse
34	Deciphering the Plasma Proteome of Type 2 Diabetes. Diabetes 2020;69:2766-2778. [PMID: 32928870 PMCID: PMC7679779 DOI: 10.2337/db20-0296] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 08/28/2020] [Indexed: 12/17/2022] Abstract With an estimated prevalence of 463 million affected, type 2 diabetes represents a major challenge to health care systems worldwide. Analyzing the plasma proteomes of individuals with type 2 diabetes may illuminate hitherto unknown functional mechanisms underlying disease pathology. We assessed the associations between type 2 diabetes and >1,000 plasma proteins in the Cooperative Health Research in the Region of Augsburg (KORA) F4 cohort (n = 993, 110 cases), with subsequent replication in the third wave of the Nord-Trøndelag Health Study (HUNT3) cohort (n = 940, 149 cases). We computed logistic regression models adjusted for age, sex, BMI, smoking status, and hypertension. Additionally, we investigated associations with incident type 2 diabetes and performed two-sample bidirectional Mendelian randomization (MR) analysis to prioritize our results. Association analysis of prevalent type 2 diabetes revealed 24 replicated proteins, of which 8 are novel. Proteins showing association with incident type 2 diabetes were aminoacylase-1, growth hormone receptor, and insulin-like growth factor-binding protein 2. Aminoacylase-1 was associated with both prevalent and incident type 2 diabetes. MR analysis yielded nominally significant causal effects of type 2 diabetes on cathepsin Z and rennin, both known to have roles in the pathophysiological pathways of cardiovascular disease, and of sex hormone-binding globulin on type 2 diabetes. In conclusion, our high-throughput proteomics study replicated previously reported type 2 diabetes-protein associations and identified new candidate proteins possibly involved in the pathogenesis of type 2 diabetes. Collapse Key Words Collapse MESH Headings Biomarkers/blood Blood Proteins/genetics Blood Proteins/metabolism Diabetes Mellitus, Type 2/blood Gene Expression Regulation Genome-Wide Association Study Humans Proteomics/methods Collapse Grants U01 CA239108 NCI NIH HHS U24 CA224370 NCI NIH HHS U24 TR002278 NCATS NIH HHS Collapse
35	A Workflow of Integrated Resources to Catalyze Network Pharmacology Driven COVID-19 Research. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.11.04.369041. [PMID: 33173863 PMCID: PMC7654851 DOI: 10.1101/2020.11.04.369041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Abstract MOTIVATION In the event of an outbreak due to an emerging pathogen, time is of the essence to contain or to mitigate the spread of the disease. Drug repositioning is one of the strategies that has the potential to deliver therapeutics relatively quickly. The SARS-CoV-2 pandemic has shown that integrating critical data resources to drive drug-repositioning studies, involving host-host, hostpathogen and drug-target interactions, remains a time-consuming effort that translates to a delay in the development and delivery of a life-saving therapy. RESULTS Here, we describe a workflow we designed for a semi-automated integration of rapidly emerging datasets that can be generally adopted in a broad network pharmacology research setting. The workflow was used to construct a COVID-19 focused multimodal network that integrates 487 host-pathogen, 74,805 host-host protein and 1,265 drug-target interactions. The resultant Neo4j graph database named "Neo4COVID19" is accessible via a web interface and via API calls based on the Bolt protocol. We believe that our Neo4COVID19 database will be a valuable asset to the research community and will catalyze the discovery of therapeutics to fight COVID-19. AVAILABILITY https://neo4covid19.ncats.io. Collapse Key Words sars-cov-2 covid-19 network pharmacology graph database neo4j data integration drug repositioning Collapse MESH Headings Collapse Grants U54 CA189205 NCI NIH HHS Collapse
36	REDIAL-2020: A suite of machine learning models to estimate Anti-SARS-CoV-2 activities. CHEMRXIV : THE PREPRINT SERVER FOR CHEMISTRY 2020:12915779. [PMID: 33200119 PMCID: PMC7668752 DOI: 10.26434/chemrxiv.12915779] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Revised: 09/16/2020] [Indexed: 11/09/2022] Abstract Strategies for drug discovery and repositioning are an urgent need with respect to COVID-19. We developed "REDIAL-2020", a suite of machine learning models for estimating small molecule activity from molecular structure, for a range of SARS-CoV-2 related assays. Each classifier is based on three distinct types of descriptors (fingerprint, physicochemical, and pharmacophore) for parallel model development. These models were trained using high throughput screening data from the NCATS COVID19 portal (https://opendata.ncats.nih.gov/covid19/index.html), with multiple categorical machine learning algorithms. The "best models" are combined in an ensemble consensus predictor that outperforms single models where external validation is available. This suite of machine learning models is available through the DrugCentral web portal (http://drugcentral.org/Redial). Acceptable input formats are: drug name, PubChem CID, or SMILES; the output is an estimate of anti-SARS-CoV-2 activities. The web application reports estimated activity across three areas (viral entry, viral replication, and live virus infectivity) spanning six independent models, followed by a similarity search that displays the most similar molecules to the query among experimentally determined data. The ML models have 60% to 74% external predictivity, based on three separate datasets. Complementing the NCATS COVID19 portal, REDIAL-2020 can serve as a rapid online tool for identifying active molecules for COVID-19 treatment. The source code and specific models are available through Github (https://github.com/sirimullalab/redial-2020), or via Docker Hub (https://hub.docker.com/r/sirimullalab/redial-2020) for users preferring a containerized version. Collapse Key Words Collapse MESH Headings Collapse Grants U24 CA224370 NCI NIH HHS Collapse
37	Off-Patent Drug Repositioning. J Chem Inf Model 2020;60:5746-5753. [PMID: 32877182 DOI: 10.1021/acs.jcim.0c00826] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Abstract Drug repositioning aims to reuse "old" drugs to treat diseases outside their approved indication(s). Composition-of-matter patents and FDA exclusivities can hinder the immediate availability of some drugs to be repositioned (repurposed). Here, we analyze data from the FDA Orange Book and use current on-market patent validity and exclusivities to classify drugs into on-patent (ONP), off-patent (OFP), and off-market (OFM) sets. In the absence of an unanimously accepted definition for small molecules, these sets include organic molecules and peptides with molecular weight between 100 and 1250, which resulted in 237 ONP drugs, 320 OFM, and 996 OFP drugs, respectively. We discuss the differences between the three categories in terms of primary molecular properties, chemical diversity, mechanism-of-action target classes, and therapeutic areas and comment on the enrichment of OFP drugs in the near future. Given the intellectual property landscape, and in the absence of specific property rights, we suggest that drugs should be prioritized as follows, to improve the repositioning strategy: (i) OFP, (ii) OFM, and (iii) ONP, respectively. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
38	Artificial intelligence, drug repurposing and peer review. Nat Biotechnol 2020;38:1127-1131. [DOI: 10.1038/s41587-020-0686-x] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
39	Novel drug targets in 2019. Nat Rev Drug Discov 2020;19:300. [PMID: 32273590 DOI: 10.1038/d41573-020-00052-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
40	QSAR without borders. Chem Soc Rev 2020;49:3525-3564. [PMID: 32356548 PMCID: PMC8008490 DOI: 10.1039/d0cs00098a] [Citation(s) in RCA: 305] [Impact Index Per Article: 76.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Abstract Prediction of chemical bioactivity and physical properties has been one of the most important applications of statistical and more recently, machine learning and artificial intelligence methods in chemical sciences. This field of research, broadly known as quantitative structure-activity relationships (QSAR) modeling, has developed many important algorithms and has found a broad range of applications in physical organic and medicinal chemistry in the past 55+ years. This Perspective summarizes recent technological advances in QSAR modeling but it also highlights the applicability of algorithms, modeling methods, and validation practices developed in QSAR to a wide range of research areas outside of traditional QSAR boundaries including synthesis planning, nanotechnology, materials science, biomaterials, and clinical informatics. As modern research methods generate rapidly increasing amounts of data, the knowledge of robust data-driven modelling methods professed within the QSAR field can become essential for scientists working both within and outside of chemical research. We hope that this contribution highlighting the generalizable components of QSAR modeling will serve to address this challenge. Collapse Key Words Collapse MESH Headings Algorithms Animals Artificial Intelligence Chemistry, Pharmaceutical/methods Databases, Factual Drug Design Drug-Related Side Effects and Adverse Reactions/metabolism History, 20th Century History, 21st Century Humans Models, Molecular Pharmaceutical Preparations/chemistry Quantitative Structure-Activity Relationship Quantum Theory Reproducibility of Results Collapse Grants U01 CA207160 NCI NIH HHS U01 CA239108 NCI NIH HHS U24 CA224370 NCI NIH HHS U24 TR002278 NCATS NIH HHS Collapse
41	Will Artificial Intelligence for Drug Discovery Impact Clinical Pharmacology? Clin Pharmacol Ther 2020;107:780-785. [PMID: 31957003 PMCID: PMC7158211 DOI: 10.1002/cpt.1795] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 01/09/2020] [Indexed: 01/05/2023] Abstract As the field of artificial intelligence and machine learning (AI/ML) for drug discovery is rapidly advancing, we address the question "What is the impact of recent AI/ML trends in the area of Clinical Pharmacology?" We address difficulties and AI/ML developments for target identification, their use in generative chemistry for small molecule drug discovery, and the potential role of AI/ML in clinical trial outcome evaluation. We briefly discuss current trends in the use of AI/ML in health care and the impact of AI/ML context of the daily practice of clinical pharmacologists. Collapse Key Words Collapse MESH Headings Animals Artificial Intelligence/trends Clinical Trials as Topic/methods Drug Discovery/methods Drug Discovery/trends Humans Pharmacology, Clinical/methods Pharmacology, Clinical/trends Collapse Grants U24 TR002278 NIH HHS U24 CA224370 NCI NIH HHS P30 CA118100 NIH HHS U01 CA239108 NIH HHS U24 TR002278 NCATS NIH HHS U24 CA224370 NIH HHS U01 CA239108 NCI NIH HHS NIH Collapse
42	Diabetes mellitus risk for 102 drugs and drug combinations used in patients with bipolar disorder. Psychoneuroendocrinology 2020;112:104511. [PMID: 31744781 DOI: 10.1016/j.psyneuen.2019.104511] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Revised: 08/28/2019] [Accepted: 11/07/2019] [Indexed: 12/18/2022] Abstract OBJECTIVE To compare the largest set of bipolar disorder pharmacotherapies to date (102 drugs and drug combinations) for risk of diabetes mellitus (DM). METHODS The IBM MarketScan® database was used to retrospectively analyze data on 565,253 adults with bipolar disorder without prior glucose metabolism-related diagnoses. The pharmacotherapies compared were lithium, mood-stabilizing anticonvulsants, antipsychotics, and antidepressants (monotherapy and multi-class polypharmacy). Cox regression modeling included fixed pre-treatment covariates and time-varying drug exposure covariates to estimate the hazard ratio (HR) of each treatment versus "No drug". RESULTS The annual incidence of new-onset diabetes during the exposure period was 3.09 % (22,951 patients). The HR of drug-dependent DM ranged from 0.79 to 2.37. One-third of the studied pharmacotherapies, including most of the antipsychotic-containing regimens, had a significantly higher risk of DM compared to "No drug". A significantly lower DM risk was associated with lithium, lamotrigine, oxcarbazepine and bupropion monotherapies, selective serotonin reuptake inhibitors (SSRI) mono-class therapy and several drug combinations containing bupropion and an SSRI. As additional drugs were combined in more complex polypharmacy, higher HRs were consistently observed. CONCLUSIONS There is an increased risk of diabetes mellitus associated with antipsychotic and psychotropic polypharmacy use in bipolar disorder. The evidence of a lower-than-baseline risk of DM with lamotrigine, oxcarbazepine, lithium, and bupropion monotherapy should be further investigated. Collapse Key Words Bipolar Diabetes Drug Glucose Polypharmacy Treatment Collapse MESH Headings Collapse Grants Collapse
43	How many rare diseases are there? Nat Rev Drug Discov 2020;19:77-78. [PMID: 32020066 PMCID: PMC7771654 DOI: 10.1038/d41573-019-00180-y] [Citation(s) in RCA: 163] [Impact Index Per Article: 40.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Abstract A lack of robust knowledge of the number of rare diseases and the number of people affected by them limits the development of approaches to ameliorate the substantial cumulative burden of rare diseases. Here, we call for coordinated efforts to more precisely define rare diseases. Collapse Key Words Collapse MESH Headings Humans Phenotype Rare Diseases/classification Rare Diseases/diagnosis Rare Diseases/epidemiology Rare Diseases/therapy Collapse Grants P30 CA118100 NCI NIH HHS U24 CA224370 NCI NIH HHS R24 OD011883 NIH HHS U24 TR002278 NCATS NIH HHS RM1 HG010860 NHGRI NIH HHS U41 HG006627 NHGRI NIH HHS U24 HG009650 NHGRI NIH HHS U41 HG009650 NHGRI NIH HHS U01 CA239108 NCI NIH HHS Collapse
44	SmartGraph: a network pharmacology investigation platform. J Cheminform 2020;12:5. [PMID: 33430980 PMCID: PMC6974502 DOI: 10.1186/s13321-020-0409-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Accepted: 01/07/2020] [Indexed: 11/18/2022] Open Abstract Motivation Drug discovery investigations need to incorporate network pharmacology concepts while navigating the complex landscape of drug-target and target-target interactions. This task requires solutions that integrate high-quality biomedical data, combined with analytic and predictive workflows as well as efficient visualization. SmartGraph is an innovative platform that utilizes state-of-the-art technologies such as a Neo4j graph-database, Angular web framework, RxJS asynchronous event library and D3 visualization to accomplish these goals. Results The SmartGraph framework integrates high quality bioactivity data and biological pathway information resulting in a knowledgebase comprised of 420,526 unique compound-target interactions defined between 271,098 unique compounds and 2018 targets. SmartGraph then performs bioactivity predictions based on the 63,783 Bemis-Murcko scaffolds extracted from these compounds. Through several use-cases, we illustrate the use of SmartGraph to generate hypotheses for elucidating mechanism-of-action, drug-repurposing and off-target prediction. Availability https://smartgraph.ncats.io/. Collapse Key Words Bioactivity prediction Network perturbation Network pharmacology Network visualization Pathway analysis Potent chemical pattern Protein–protein interactions (PPIs) Scaffold Target deconvolution neo4j Collapse MESH Headings Collapse Grants Collapse
45	Correction: QSAR without borders. Chem Soc Rev 2020;49:3716. [DOI: 10.1039/d0cs90041a] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Abstract Correction for ‘QSAR without borders’ by Eugene N. Muratov et al., Chem. Soc. Rev., 2020, DOI: 10.1039/d0cs00098a. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
46	Therapies for rare diseases: therapeutic modalities, progress and challenges ahead. Nat Rev Drug Discov 2019;19:93-111. [PMID: 31836861 DOI: 10.1038/s41573-019-0049-9] [Citation(s) in RCA: 155] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2019] [Indexed: 12/26/2022] Abstract Most rare diseases still lack approved treatments despite major advances in research providing the tools to understand their molecular basis, as well as legislation providing regulatory and economic incentives to catalyse the development of specific therapies. Addressing this translational gap is a multifaceted challenge, for which a key aspect is the selection of the optimal therapeutic modality for translating advances in rare disease knowledge into potential medicines, known as orphan drugs. With this in mind, we discuss here the technological basis and rare disease applicability of the main therapeutic modalities, including small molecules, monoclonal antibodies, protein replacement therapies, oligonucleotides and gene and cell therapies, as well as drug repurposing. For each modality, we consider its strengths and limitations as a platform for rare disease therapy development and describe clinical progress so far in developing drugs based on it. We also discuss selected overarching topics in the development of therapies for rare diseases, such as approval statistics, engagement of patients in the process, regulatory pathways and digital tools. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
47	Can BDDCS illuminate targets in drug design? Drug Discov Today 2019;24:2299-2306. [PMID: 31585170 DOI: 10.1016/j.drudis.2019.09.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 09/18/2019] [Accepted: 09/25/2019] [Indexed: 12/19/2022] Abstract The fact that pharmacokinetic (PK) properties of drugs influence their interaction with protein targets is a principle known for decades. The same cannot be said for the opposite, namely that targets influence the PK properties of drugs. Evidence confirming this possibility is introduced here for the first time, as we show that certain protein families have a clear preference for drugs with specific PK properties. We investigate this by cross-referencing 'druggable target' annotations for >1000 US Food and Drug Administration (FDA)-approved drugs with their PK profile, as defined by the Biopharmaceutics Drug Disposition Classification System (BDDCS) criteria, and then examine the BDDCS preference for several major target protein families and therapeutic categories. Our findings suggest a novel way to conduct drug discovery by focusing PK profiles at the very early stage of target selection. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
48	Exploring the dark genome: implications for precision medicine. Mamm Genome 2019;30:192-200. [PMID: 31270560 DOI: 10.1007/s00335-019-09809-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 06/15/2019] [Indexed: 01/08/2023] Abstract The increase in the number of both patients and healthcare practitioners who grew up using the Internet and computers (so-called "digital natives") is likely to impact the practice of precision medicine, and requires novel platforms for data integration and mining, as well as contextualized information retrieval. The "Illuminating the Druggable Genome Knowledge Management Center" (IDG KMC) quantifies data availability from a wide range of chemical, biological, and clinical resources, and has developed platforms that can be used to navigate understudied proteins (the "dark genome"), and their potential contribution to specific pathologies. Using the "Target Importance and Novelty Explorer" (TIN-X) highlights the role of LRRC10 (a dark gene) in dilated cardiomyopathy. Combining mouse and human phenotype data leads to increased strength of evidence, which is discussed for four additional dark genes: SLX4IP and its role in glucose metabolism, the role of HSF2BP in coronary artery disease, the involvement of ELFN1 in attention-deficit hyperactivity disorder and the role of VPS13D in mouse neural tube development and its confirmed role in childhood onset movement disorders. The workflow and tools described here are aimed at guiding further experimental research, particularly within the context of precision medicine. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
49	The human endogenous metabolome as a pharmacology baseline for drug discovery. Drug Discov Today 2019;24:1806-1820. [PMID: 31226432 DOI: 10.1016/j.drudis.2019.06.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 05/17/2019] [Accepted: 06/12/2019] [Indexed: 01/01/2023] Abstract We have limited understanding of the variation in in vitro affinities of drugs for their targets. An analysis of a highly curated set of 815 interactions between 566 drugs and 129 primary targets reveals that 71% of drug-target affinities have values above that of the corresponding endogenous ligand, 96% of them fitting within a range of two orders of magnitude. Our findings suggest that the evolutionary optimised affinity of endogenous ligands for their native proteins can serve as a baseline for the primary pharmacology of drugs. We show that the degree of off-target selectivity and safety risks of drugs derived from their secondary pharmacology depend very much on that baseline. Thus, we propose a new approach for estimating safety margins. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
50	Comparison of 71 bipolar disorder pharmacotherapies for kidney disorder risk: The potential hazards of polypharmacy. J Affect Disord 2019;252:201-211. [PMID: 30986735 DOI: 10.1016/j.jad.2019.04.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 02/13/2019] [Accepted: 04/06/2019] [Indexed: 11/17/2022] Abstract BACKGROUND This study compared the largest set of bipolar disorder pharmacotherapies to date (71 drugs and drug combinations) for risk of kidney disorders (KDs). METHODS This retrospective observational study used the IBM MarketScan® database to analyze data on 591,052 adults with bipolar disorder without prior nephropathy, for onset of KDs (of "moderate" or "high" severity) following psychopharmacotherapy (lithium, mood stabilizing anticonvulsants [MSAs], antipsychotics, antidepressants), or "No drug". Cox regression models included fixed pre-treatment covariates and time-varying drug exposure covariates to estimate the hazard ratio (HR) of each treatment versus "No drug". RESULTS Newly observed KD occurred in 14,713 patients. No regimen had significantly lower risk of KDs than "No drug". The HR estimates ranged 0.86-2.66 for "all" KDs and 0.87-5.30 for "severe" KDs. As additional drugs were combined to compare more complex polypharmacies, higher HRs were consistently observed. Most regimens containing lithium, MSAs, or antipsychotics had a higher risk than "No drug" (p < 0.05). The risk for "all" and "severe" KDs was highest respectively on monoamine oxidase inhibitors (MAOIs) (HR = 2.66, p = 5.73 × 10^-5), and a lithium-containing four-class combination (HR = 5.30, p = 2.46 × 10^-⁹). The HR for lithium monotherapy was 1.82 (p = 4.73 × 10^-17) for "severe" KDs. LIMITATIONS The limitations inherent for an observational study were non-randomized assignment of patients to treatment groups, non-standardization of diagnostic decisions, and non-uniform quality of data collection. No correction was made for medication dosage. CONCLUSIONS The findings support literature concerns about lithium nephrotoxicity and highlight the potential risks of MAOIs, MSAs, antipsychotics and psychotropic polypharmacy. Collapse Key Words Bipolar Drug Kidney Pharmacotherapy Psychiatric Renal Collapse MESH Headings Adult Anticonvulsants/adverse effects Antidepressive Agents/adverse effects Antimanic Agents/adverse effects Antipsychotic Agents/adverse effects Bipolar Disorder/drug therapy Female Humans Kidney Diseases/chemically induced Male Middle Aged Polypharmacy Psychotropic Drugs/adverse effects Retrospective Studies Risk Factors Collapse Grants Collapse