Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tyrchan C, Evertsson E. Matched Molecular Pair Analysis in Short: Algorithms, Applications and Limitations. Comput Struct Biotechnol J 2016;15:86-90. [PMID: 28066532 PMCID: PMC5198793 DOI: 10.1016/j.csbj.2016.12.003] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Revised: 12/08/2016] [Accepted: 12/09/2016] [Indexed: 12/02/2022] Open

For:	Tyrchan C, Evertsson E. Matched Molecular Pair Analysis in Short: Algorithms, Applications and Limitations. Comput Struct Biotechnol J 2016;15:86-90. [PMID: 28066532 PMCID: PMC5198793 DOI: 10.1016/j.csbj.2016.12.003] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Revised: 12/08/2016] [Accepted: 12/09/2016] [Indexed: 12/02/2022] Open

Number

Cited by Other Article(s)

Yi J, Shi S, Fu L, Yang Z, Nie P, Lu A, Wu C, Deng Y, Hsieh C, Zeng X, Hou T, Cao D. OptADMET: a web-based tool for substructure modifications to improve ADMET properties of lead compounds. Nat Protoc 2024;19:1105-1121. [PMID: 38263521 DOI: 10.1038/s41596-023-00942-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 10/27/2023] [Indexed: 01/25/2024]

Seyedtabib M, Kamyari N. Predicting polypharmacy in half a million adults in the Iranian population: comparison of machine learning algorithms. BMC Med Inform Decis Mak 2023;23:84. [PMID: 37147615 PMCID: PMC10161984 DOI: 10.1186/s12911-023-02177-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 04/21/2023] [Indexed: 05/07/2023] Open

Abstract

BACKGROUND

Polypharmacy (PP) is increasingly common in Iran, and contributes to the substantial burden of drug-related morbidity, increasing the potential for drug interactions and potentially inappropriate medications. Machine learning algorithms (ML) can be employed as an alternative solution for the prediction of PP. Therefore, our study aimed to compare several ML algorithms to predict the PP using the health insurance claims data and choose the best-performing algorithm as a predictive tool for decision-making.

METHODS

This population-based cross-sectional study was performed between April 2021 and March 2022. After feature selection, information about 550 thousand patients were obtained from National Center for Health Insurance Research (NCHIR). Afterwards, several ML algorithms were trained to predict PP. Finally, to assess the models' performance, the metrics derived from the confusion matrix were calculated.

RESULTS

The study sample comprised 554 133 adults with a median (IQR) age of 51 years (40 - 62) that nested in 27 cities within the Khuzestan province of Iran. Most of the patients were female (62.5%), married (63.5%), and employed (83.2%) during the last year. The prevalence of PP in all populations was about 36.0%. After performing the feature selection, out of 23 features, the number of prescriptions, Insurance coverage for prescription drugs, and hypertension were found as the top three predictors. Experimental results showed that Random Forest (RF) performed better than other ML algorithms with recall, specificity, accuracy, precision and F1-score of 63.92%, 89.92%, 79.99%, 63.92% and 63.92% respectively.

CONCLUSION

It was found that ML provides a reasonable level of accuracy in predicting polypharmacy. Therefore, the prediction models based on ML, especially the RF algorithm, performed better than other methods for predicting PP in Iranian people in terms of the performance criteria.

Collapse

Carneiro J, Magalhães RP, de la Oliva Roque VM, Simões M, Pratas D, Sousa SF. TargIDe: a machine-learning workflow for target identification of molecules with antibiofilm activity against Pseudomonas aeruginosa. J Comput Aided Mol Des 2023;37:265-278. [PMID: 37085636 DOI: 10.1007/s10822-023-00505-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 04/12/2023] [Indexed: 04/23/2023]

Hoover AJ, Spale M, Lahue B, Bitton DA. Matcher: An Open-Source Application for Translating Large Structure/Property Data Sets into Insights for Drug Design. J Chem Inf Model 2023;63:1852-1857. [PMID: 36977316 DOI: 10.1021/acs.jcim.3c00015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]

Wellawatte GP, Gandhi HA, Seshadri A, White AD. A Perspective on Explanations of Molecular Prediction Models. J Chem Theory Comput 2023;19:2149-2160. [PMID: 36972469 PMCID: PMC10134429 DOI: 10.1021/acs.jctc.2c01235] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]

Tysinger EP, Rai BK, Sinitskiy AV. Can We Quickly Learn to "Translate" Bioactive Molecules with Transformer Models? J Chem Inf Model 2023;63:1734-1744. [PMID: 36914216 DOI: 10.1021/acs.jcim.2c01618] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]

Fromer JC, Coley CW. Computer-aided multi-objective optimization in small molecule discovery. PATTERNS (NEW YORK, N.Y.) 2023;4:100678. [PMID: 36873904 PMCID: PMC9982302 DOI: 10.1016/j.patter.2023.100678] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]

Yang L, Jin C, Yang G, Bing Z, Huang L, Niu Y, Yang L. Transformer-based deep learning method for optimizing ADMET properties of lead compounds. Phys Chem Chem Phys 2023;25:2377-2385. [PMID: 36597997 DOI: 10.1039/d2cp05332b] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]

Hermann MR, Tautermann CS, Sieger P, Grundl MA, Weber A. BIreactive: Expanding the Scope of Reactivity Predictions to Propynamides. Pharmaceuticals (Basel) 2023;16:ph16010116. [PMID: 36678612 PMCID: PMC9866037 DOI: 10.3390/ph16010116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/22/2022] [Accepted: 12/31/2022] [Indexed: 01/15/2023] Open

Dai X, Xu Y, Qiu H, Qian X, Lin M, Luo L, Zhao Y, Huang D, Zhang Y, Chen Y, Liu H, Jiang Y. KID: A Kinase-Focused Interaction Database and Its Application in the Construction of Kinase-Focused Molecule Databases. J Chem Inf Model 2022;62:6022-6034. [PMID: 36447388 DOI: 10.1021/acs.jcim.2c00908] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

Affiliation(s)

Xiaowen Dai Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yuan Xu Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Haodi Qiu Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Xu Qian Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Mingde Lin Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Lin Luo Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yang Zhao Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Dingfang Huang Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yanmin Zhang Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yadong Chen Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Haichun Liu Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yulei Jiang Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China

Collapse

Santos CEMD, Dorta DJ, de Oliveira DP. Setting limits for N-nitrosamines in drugs: A defined approach based on read-across and structure-activity relationship for N-nitrosopiperazine impurities. Regul Toxicol Pharmacol 2022;136:105288. [DOI: 10.1016/j.yrtph.2022.105288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Revised: 10/14/2022] [Accepted: 11/06/2022] [Indexed: 11/15/2022]

Natural and Synthetic Xanthone Derivatives Counteract Oxidative Stress via Nrf2 Modulation in Inflamed Human Macrophages. Int J Mol Sci 2022;23:ijms232113319. [PMID: 36362104 PMCID: PMC9659273 DOI: 10.3390/ijms232113319] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/25/2022] [Accepted: 10/26/2022] [Indexed: 11/06/2022] Open

Kwapien K, Nittinger E, He J, Margreitter C, Voronov A, Tyrchan C. Implications of Additivity and Nonadditivity for Machine Learning and Deep Learning Models in Drug Design. ACS OMEGA 2022;7:26573-26581. [PMID: 35936431 PMCID: PMC9352238 DOI: 10.1021/acsomega.2c02738] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 07/08/2022] [Indexed: 05/20/2023]

Park S, Han H, Kim H, Choi S. Machine Learning Applications for Chemical Reactions. Chem Asian J 2022;17:e202200203. [PMID: 35471772 PMCID: PMC9401034 DOI: 10.1002/asia.202200203] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/26/2022] [Indexed: 11/30/2022]

Lou C, Yang H, Wang J, Huang M, Li W, Liu G, Lee PW, Tang Y. IDL-PPBopt: A Strategy for Prediction and Optimization of Human Plasma Protein Binding of Compounds via an Interpretable Deep Learning Method. J Chem Inf Model 2022;62:2788-2799. [PMID: 35607907 DOI: 10.1021/acs.jcim.2c00297] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Alqahtani A. Application of Artificial Intelligence in Discovery and Development of Anticancer and Antidiabetic Therapeutic Agents. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2022;2022:6201067. [PMID: 35509623 PMCID: PMC9060979 DOI: 10.1155/2022/6201067] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 03/17/2022] [Accepted: 04/05/2022] [Indexed: 11/18/2022]

He J, Nittinger E, Tyrchan C, Czechtizky W, Patronov A, Bjerrum EJ, Engkvist O. Transformer-based molecular optimization beyond matched molecular pairs. J Cheminform 2022;14:18. [PMID: 35346368 PMCID: PMC8962145 DOI: 10.1186/s13321-022-00599-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 03/11/2022] [Indexed: 11/11/2022] Open

Abstract

Molecular optimization aims to improve the drug profile of a starting molecule. It is a fundamental problem in drug discovery but challenging due to (i) the requirement of simultaneous optimization of multiple properties and (ii) the large chemical space to explore. Recently, deep learning methods have been proposed to solve this task by mimicking the chemist’s intuition in terms of matched molecular pairs (MMPs). Although MMPs is a widely used strategy by medicinal chemists, it offers limited capability in terms of exploring the space of structural modifications, therefore does not cover the complete space of solutions. Often more general transformations beyond the nature of MMPs are feasible and/or necessary, e.g. simultaneous modifications of the starting molecule at different places including the core scaffold. This study aims to provide a general methodology that offers more general structural modifications beyond MMPs. In particular, the same Transformer architecture is trained on different datasets. These datasets consist of a set of molecular pairs which reflect different types of transformations. Beyond MMP transformation, datasets reflecting general structural changes are constructed from ChEMBL based on two approaches: Tanimoto similarity (allows for multiple modifications) and scaffold matching (allows for multiple modifications but keep the scaffold constant) respectively. We investigate how the model behavior can be altered by tailoring the dataset while using the same model architecture. Our results show that the models trained on differently prepared datasets transform a given starting molecule in a way that it reflects the nature of the dataset used for training the model. These models could complement each other and unlock the capability for the chemists to pursue different options for improving a starting molecule.

Collapse

Jiménez-Luna J, Skalic M, Weskamp N. Benchmarking Molecular Feature Attribution Methods with Activity Cliffs. J Chem Inf Model 2022;62:274-283. [PMID: 35019265 DOI: 10.1021/acs.jcim.1c01163] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

Pal S, Pogány P, Lumley JA. Molecule Ideation Using Matched Molecular Pairs. Methods Mol Biol 2022;2390:503-521. [PMID: 34731485 DOI: 10.1007/978-1-0716-1787-8_23] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Tse EG, Aithani L, Anderson M, Cardoso-Silva J, Cincilla G, Conduit GJ, Galushka M, Guan D, Hallyburton I, Irwin BWJ, Kirk K, Lehane AM, Lindblom JCR, Lui R, Matthews S, McCulloch J, Motion A, Ng HL, Öeren M, Robertson MN, Spadavecchio V, Tatsis VA, van Hoorn WP, Wade AD, Whitehead TM, Willis P, Todd MH. An Open Drug Discovery Competition: Experimental Validation of Predictive Models in a Series of Novel Antimalarials. J Med Chem 2021;64:16450-16463. [PMID: 34748707 DOI: 10.1021/acs.jmedchem.1c00313] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Affiliation(s)

Edwin G Tse School of Pharmacy, University College London, London WC1N 1AX, U.K
Laksh Aithani Exscientia Ltd., The Schrödinger Building, Oxford Science Park, Oxford OX4 4GE, U.K
Mark Anderson Drug Discovery Unit, Division of Biological Chemistry and Drug Discovery, School of Life Sciences, University of Dundee, Dundee DD1 5EH, U.K
Jonathan Cardoso-Silva Department of Informatics, Faculty of Natural and Mathematical Sciences, King's College London, London WC2B 4BG, U.K
Giovanni Cincilla Molomics, Barcelona Science Park, Barcelona 08028, Spain
Gareth J Conduit Intellegens Ltd., Eagle Labs, Chesterton Road, Cambridge CB4 3AZ, U.K.,Theory of Condensed Matter Group, Cavendish Laboratories, University of Cambridge, Cambridge CB3 0HE, U.K
Mykola Galushka Auromind Ltd, 126 Eglantine Avenue, Belfast BT9 6EU, U.K
Davy Guan School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
Irene Hallyburton Drug Discovery Unit, Division of Biological Chemistry and Drug Discovery, School of Life Sciences, University of Dundee, Dundee DD1 5EH, U.K
Benedict W J Irwin Theory of Condensed Matter Group, Cavendish Laboratories, University of Cambridge, Cambridge CB3 0HE, U.K.,Optibrium Ltd. Blenheim House, Denny End Road, Cambridge CB25 9QE, U.K
Kiaran Kirk Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
Adele M Lehane Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
Julia C R Lindblom Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
Raymond Lui School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
Slade Matthews School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
James McCulloch Kellerberrin, 6 Wharf Rd, Balmain, Sydney, NSW 2041, Australia
Alice Motion School of Chemistry, The University of Sydney, Sydney, NSW 2006, Australia
Ho Leung Ng Department of Biochemistry and Molecular Biophysics, Kansas State University, Manhattan Kansas 66506, United States
Mario Öeren Optibrium Ltd. Blenheim House, Denny End Road, Cambridge CB25 9QE, U.K
Murray N Robertson Strathclyde Institute Of Pharmacy And Biomedical Sciences, University of Strathclyde, Glasgow G4 ORE, U.K
Vito Spadavecchio Interlinked Therapeutics LLC, Portland, Oregon 97214, United States
Vasileios A Tatsis Exscientia Ltd., The Schrödinger Building, Oxford Science Park, Oxford OX4 4GE, U.K
Willem P van Hoorn Exscientia Ltd., The Schrödinger Building, Oxford Science Park, Oxford OX4 4GE, U.K
Alexander D Wade Theory of Condensed Matter Group, Cavendish Laboratories, University of Cambridge, Cambridge CB3 0HE, U.K
Thomas M Whitehead Intellegens Ltd., Eagle Labs, Chesterton Road, Cambridge CB4 3AZ, U.K
Paul Willis Medicines for Malaria Venture, PO Box 1826, 20 rte de Pre-Bois, 1215 Geneva 15, Switzerland
Matthew H Todd School of Pharmacy, University College London, London WC1N 1AX, U.K

Collapse

Tayara H, Abdelbaky I, To Chong K. Recent omics-based computational methods for COVID-19 drug discovery and repurposing. Brief Bioinform 2021;22:6355836. [PMID: 34423353 DOI: 10.1093/bib/bbab339] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 07/09/2021] [Indexed: 12/22/2022] Open

Naveja JJ, Vogt M. Automatic Identification of Analogue Series from Large Compound Data Sets: Methods and Applications. Molecules 2021;26:5291. [PMID: 34500724 PMCID: PMC8433811 DOI: 10.3390/molecules26175291] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 08/27/2021] [Accepted: 08/28/2021] [Indexed: 01/21/2023] Open

Tamura S, Jasial S, Miyao T, Funatsu K. Interpretation of Ligand-Based Activity Cliff Prediction Models Using the Matched Molecular Pair Kernel. Molecules 2021;26:molecules26164916. [PMID: 34443503 PMCID: PMC8401777 DOI: 10.3390/molecules26164916] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/09/2021] [Accepted: 08/10/2021] [Indexed: 11/16/2022] Open

Tynes M, Gao W, Burrill DJ, Batista ER, Perez D, Yang P, Lubbers N. Pairwise Difference Regression: A Machine Learning Meta-algorithm for Improved Prediction and Uncertainty Quantification in Chemical Search. J Chem Inf Model 2021;61:3846-3857. [PMID: 34347460 DOI: 10.1021/acs.jcim.1c00670] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Gupta R, Srivastava D, Sahu M, Tiwari S, Ambasta RK, Kumar P. Artificial intelligence to deep learning: machine intelligence approach for drug discovery. Mol Divers 2021;25:1315-1360. [PMID: 33844136 PMCID: PMC8040371 DOI: 10.1007/s11030-021-10217-3] [Citation(s) in RCA: 253] [Impact Index Per Article: 84.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 03/22/2021] [Indexed: 02/06/2023]

Abstract

Drug designing and development is an important area of research for pharmaceutical companies and chemical scientists. However, low efficacy, off-target delivery, time consumption, and high cost impose a hurdle and challenges that impact drug design and discovery. Further, complex and big data from genomics, proteomics, microarray data, and clinical trials also impose an obstacle in the drug discovery pipeline. Artificial intelligence and machine learning technology play a crucial role in drug discovery and development. In other words, artificial neural networks and deep learning algorithms have modernized the area. Machine learning and deep learning algorithms have been implemented in several drug discovery processes such as peptide synthesis, structure-based virtual screening, ligand-based virtual screening, toxicity prediction, drug monitoring and release, pharmacophore modeling, quantitative structure-activity relationship, drug repositioning, polypharmacology, and physiochemical activity. Evidence from the past strengthens the implementation of artificial intelligence and deep learning in this field. Moreover, novel data mining, curation, and management techniques provided critical support to recently developed modeling algorithms. In summary, artificial intelligence and deep learning advancements provide an excellent opportunity for rational drug design and discovery process, which will eventually impact mankind. The primary concern associated with drug design and development is time consumption and production cost. Further, inefficiency, inaccurate target delivery, and inappropriate dosage are other hurdles that inhibit the process of drug delivery and development. With advancements in technology, computer-aided drug design integrating artificial intelligence algorithms can eliminate the challenges and hurdles of traditional drug design and development. Artificial intelligence is referred to as superset comprising machine learning, whereas machine learning comprises supervised learning, unsupervised learning, and reinforcement learning. Further, deep learning, a subset of machine learning, has been extensively implemented in drug design and development. The artificial neural network, deep neural network, support vector machines, classification and regression, generative adversarial networks, symbolic learning, and meta-learning are examples of the algorithms applied to the drug design and discovery process. Artificial intelligence has been applied to different areas of drug design and development process, such as from peptide synthesis to molecule design, virtual screening to molecular docking, quantitative structure-activity relationship to drug repositioning, protein misfolding to protein-protein interactions, and molecular pathway identification to polypharmacology. Artificial intelligence principles have been applied to the classification of active and inactive, monitoring drug release, pre-clinical and clinical development, primary and secondary drug screening, biomarker development, pharmaceutical manufacturing, bioactivity identification and physiochemical properties, prediction of toxicity, and identification of mode of action.

Collapse

Shan J, Ji C. MolOpt: A Web Server for Drug Design using Bioisosteric Transformation. Curr Comput Aided Drug Des 2021;16:460-466. [PMID: 31272357 DOI: 10.2174/1573409915666190704093400] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 05/12/2019] [Accepted: 06/13/2019] [Indexed: 01/03/2023]

Cappel D, Mozziconacci JC, Braun T, Steinbrecher T. Performance of Relative Binding Free Energy Calculations on an Automatically Generated Dataset of Halogen-Deshalogen Matched Molecular Pairs. J Chem Inf Model 2021;61:3421-3430. [PMID: 34170707 DOI: 10.1021/acs.jcim.1c00290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Lester CC, Yan G. A matched molecular pair (MMP) approach for selecting analogs suitable for structure activity relationship (SAR)-based read across. Regul Toxicol Pharmacol 2021;124:104966. [PMID: 34044089 DOI: 10.1016/j.yrtph.2021.104966] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 03/12/2021] [Accepted: 05/19/2021] [Indexed: 11/26/2022]

James SA, Yam WK. Sub-structure-based screening and molecular docking studies of potential enteroviruses inhibitors. Comput Biol Chem 2021;92:107499. [PMID: 33932782 DOI: 10.1016/j.compbiolchem.2021.107499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 04/21/2021] [Indexed: 11/15/2022]

Molecular optimization by capturing chemist's intuition using deep neural networks. J Cheminform 2021;13:26. [PMID: 33743817 PMCID: PMC7980633 DOI: 10.1186/s13321-021-00497-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 02/22/2021] [Indexed: 01/08/2023] Open

Abstract

A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation problem in natural language processing, where in our case, a molecule is translated into a molecule with optimized properties based on the SMILES representation. Typically, chemists would use their intuition to suggest chemical transformations for the starting molecule being optimized. A widely used strategy is the concept of matched molecular pairs where two molecules differ by a single transformation. We seek to capture the chemist’s intuition from matched molecular pairs using machine translation models. Specifically, the sequence-to-sequence model with attention mechanism, and the Transformer model are employed to generate molecules with desirable properties. As a proof of concept, three ADMET properties are optimized simultaneously: logD, solubility, and clearance, which are important properties of a drug. Since desirable properties often vary from project to project, the user-specified desirable property changes are incorporated into the input as an additional condition together with the starting molecules being optimized. Thus, the models can be guided to generate molecules satisfying the desirable properties. Additionally, we compare the two machine translation models based on the SMILES representation, with a graph-to-graph translation model HierG2G, which has shown the state-of-the-art performance in molecular optimization. Our results show that the Transformer can generate more molecules with desirable properties by making small modifications to the given starting molecules, which can be intuitive to chemists. A further enrichment of diverse molecules can be achieved by using an ensemble of models.

Collapse

Awale M, Hert J, Guasch L, Riniker S, Kramer C. The Playbooks of Medicinal Chemistry Design Moves. J Chem Inf Model 2021;61:729-742. [PMID: 33522806 DOI: 10.1021/acs.jcim.0c01143] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Design, synthesis and stepwise optimization of nitrile-based inhibitors of cathepsins B and L. Bioorg Med Chem 2021;29:115827. [PMID: 33254069 DOI: 10.1016/j.bmc.2020.115827] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 10/21/2020] [Accepted: 10/22/2020] [Indexed: 12/14/2022]

Siramshetty VB, Shah P, Kerns E, Nguyen K, Yu KR, Kabir M, Williams J, Neyra J, Southall N, Nguyễn ÐT, Xu X. Retrospective assessment of rat liver microsomal stability at NCATS: data and QSAR models. Sci Rep 2020;10:20713. [PMID: 33244000 PMCID: PMC7693334 DOI: 10.1038/s41598-020-77327-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 11/04/2020] [Indexed: 11/09/2022] Open

Lumley JA, Desai P, Wang J, Cahya S, Zhang H. The Derivation of a Matched Molecular Pairs Based ADME/Tox Knowledge Base for Compound Optimization. J Chem Inf Model 2020;60:4757-4771. [PMID: 32975944 DOI: 10.1021/acs.jcim.0c00583] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Abstract

Matched Molecular Pairs (MMP) analysis is a well-established technique for Structure Activity and Property Analysis (SAR and SPR). Summarizing multiple MMPs that describe the same structural change into a single chemical transform can be a powerful tool for prediction (termed Transform from here on). This is particularly useful in the area of Absorption, Distribution, Metabolism, and Elimination (ADME) analysis that is less influenced by 3D structural binding effects. The creation of a knowledge database containing many of these Transforms across typical ADME assays promises to be a powerful approach to aid multidimensional optimization. We present a detailed workflow for the derivation of such a database. We include details of an MMP fragmentation algorithm with associated statistical summarization methods for the derivation of Transforms. This is made freely available as part of the LillyMol software package. We describe the application of this method to several ADME/Tox (Toxicity) assay data sets and highlight multiple cases where the impact of traditional medicinal chemistry Transforms is contradicted by MMP data. We also describe the internal software interface used by medicinal chemists to aid the design of new compounds via automated suggestion. This approach utilizes the matched pairs database to "suggest" improved compounds in an automated design scenario. A nonvisual script-based version of the automated suggestions code with an associated set of described chemical Transforms is also made freely available along with this paper and as part of the LillyMol software package. Finally, we contrast this knowledge database against a larger database of all MMPs derived from a 2 million compound diversity set and a subset of MMPs seen in historical discovery projects. The comparison against all transforms in the diversity collection highlights the very low coverage of the transform database as compared to all possible transforms involving 15 atom fragments. The comparison against a smaller subset of Transforms seen on internal Medicinal Chemistry projects shows better coverage of the transform database for a small set of common medicinal chemistry strategies. Within the context of all possible transforms available to a medicinal chemistry project team, the challenge remains to move beyond mere idea generation from past projects toward high quality prediction for novel ADME/Tox modulating Transforms.

Collapse

Baker CM, Kidley NJ, Papachristos K, Hotson M, Carson R, Gravestock D, Pouliot M, Harrison J, Dowling A. Tautomer Standardization in Chemical Databases: Deriving Business Rules from Quantum Chemistry. J Chem Inf Model 2020;60:3781-3791. [PMID: 32644790 DOI: 10.1021/acs.jcim.0c00232] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Vanhaelen Q, Lin YC, Zhavoronkov A. The Advent of Generative Chemistry. ACS Med Chem Lett 2020;11:1496-1505. [PMID: 32832015 PMCID: PMC7429972 DOI: 10.1021/acsmedchemlett.0c00088] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 07/14/2020] [Indexed: 12/12/2022] Open

Optimization strategy of single-digit nanomolar cross-class inhibitors of mammalian and protozoa cysteine proteases. Bioorg Chem 2020;101:104039. [PMID: 32629285 DOI: 10.1016/j.bioorg.2020.104039] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2020] [Revised: 06/19/2020] [Accepted: 06/19/2020] [Indexed: 01/04/2023]

Arús-Pous J, Patronov A, Bjerrum EJ, Tyrchan C, Reymond JL, Chen H, Engkvist O. SMILES-based deep generative scaffold decorator for de-novo drug design. J Cheminform 2020;12:38. [PMID: 33431013 PMCID: PMC7260788 DOI: 10.1186/s13321-020-00441-8] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 05/16/2020] [Indexed: 12/21/2022] Open

Abstract

Molecular generative models trained with small sets of molecules represented as SMILES strings can generate large regions of the chemical space. Unfortunately, due to the sequential nature of SMILES strings, these models are not able to generate molecules given a scaffold (i.e., partially-built molecules with explicit attachment points). Herein we report a new SMILES-based molecular generative architecture that generates molecules from scaffolds and can be trained from any arbitrary molecular set. This approach is possible thanks to a new molecular set pre-processing algorithm that exhaustively slices all possible combinations of acyclic bonds of every molecule, combinatorically obtaining a large number of scaffolds with their respective decorations. Moreover, it serves as a data augmentation technique and can be readily coupled with randomized SMILES to obtain even better results with small sets. Two examples showcasing the potential of the architecture in medicinal and synthetic chemistry are described: First, models were trained with a training set obtained from a small set of Dopamine Receptor D2 (DRD2) active modulators and were able to meaningfully decorate a wide range of scaffolds and obtain molecular series predicted active on DRD2. Second, a larger set of drug-like molecules from ChEMBL was selectively sliced using synthetic chemistry constraints (RECAP rules). In this case, the resulting scaffolds with decorations were filtered only to allow those that included fragment-like decorations. This filtering process allowed models trained with this dataset to selectively decorate diverse scaffolds with fragments that were generally predicted to be synthesizable and attachable to the scaffold using known synthetic approaches. In both cases, the models were already able to decorate molecules using specific knowledge without the need to add it with other techniques, such as reinforcement learning. We envision that this architecture will become a useful addition to the already existent architectures for de novo molecular generation.

Collapse

Awale M, Riniker S, Kramer C. Matched Molecular Series Analysis for ADME Property Prediction. J Chem Inf Model 2020;60:2903-2914. [PMID: 32369360 DOI: 10.1021/acs.jcim.0c00269] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Willems H, De Cesco S, Svensson F. Computational Chemistry on a Budget: Supporting Drug Discovery with Limited Resources. J Med Chem 2020;63:10158-10169. [DOI: 10.1021/acs.jmedchem.9b02126] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Keeley A, Petri L, Ábrányi-Balogh P, Keserű GM. Covalent fragment libraries in drug discovery. Drug Discov Today 2020;25:983-996. [PMID: 32298798 DOI: 10.1016/j.drudis.2020.03.016] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 03/07/2020] [Accepted: 03/23/2020] [Indexed: 12/20/2022]

Mapping the S1 and S1' subsites of cysteine proteases with new dipeptidyl nitrile inhibitors as trypanocidal agents. PLoS Negl Trop Dis 2020;14:e0007755. [PMID: 32163418 PMCID: PMC7067379 DOI: 10.1371/journal.pntd.0007755] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Accepted: 01/30/2020] [Indexed: 12/24/2022] Open

Landry ML, Crawford JJ. LogD Contributions of Substituents Commonly Used in Medicinal Chemistry. ACS Med Chem Lett 2020;11:72-76. [PMID: 31938466 DOI: 10.1021/acsmedchemlett.9b00489] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 12/11/2019] [Indexed: 12/18/2022] Open

Advancing Drug Discovery via Artificial Intelligence. Trends Pharmacol Sci 2019;40:592-604. [DOI: 10.1016/j.tips.2019.06.004] [Citation(s) in RCA: 164] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 05/23/2019] [Accepted: 06/11/2019] [Indexed: 01/15/2023]

Koutsoukas A, Chang G, Keefer CE. In-Silico Extraction of Design Ideas Using MMPA-by-QSAR and its Application on ADME Endpoints. J Chem Inf Model 2018;59:477-485. [DOI: 10.1021/acs.jcim.8b00520] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Georgi V, Schiele F, Berger BT, Steffen A, Marin Zapata PA, Briem H, Menz S, Preusse C, Vasta JD, Robers MB, Brands M, Knapp S, Fernández-Montalván A. Binding Kinetics Survey of the Drugged Kinome. J Am Chem Soc 2018;140:15774-15782. [PMID: 30362749 DOI: 10.1021/jacs.8b08048] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Affiliation(s)

Victoria Georgi Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany.,Structural Genomics Consortium, Institute for Pharmaceutical Chemistry , Johann Wolfgang Goethe-University , Max-von-Laue-Straße 9 , 60438 Frankfurt am Main , Germany
Felix Schiele Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Benedict-Tilman Berger Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany.,Structural Genomics Consortium, Institute for Pharmaceutical Chemistry , Johann Wolfgang Goethe-University , Max-von-Laue-Straße 9 , 60438 Frankfurt am Main , Germany.,Structural Genomics Consortium, Buchmann Institute for Molecular Life Sciences , Johann Wolfgang Goethe-University , Max-von-Laue-Straße 15 , 60438 Frankfurt am Main , Germany
Andreas Steffen Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Paula A Marin Zapata Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Hans Briem Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Stephan Menz Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Cornelia Preusse Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
James D Vasta Promega Corporation , 2800 Woods Hollow Road , Fitchburg , Wisconsin 53711 , United States
Matthew B Robers Promega Corporation , 2800 Woods Hollow Road , Fitchburg , Wisconsin 53711 , United States
Michael Brands Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany
Stefan Knapp Structural Genomics Consortium, Institute for Pharmaceutical Chemistry , Johann Wolfgang Goethe-University , Max-von-Laue-Straße 9 , 60438 Frankfurt am Main , Germany.,Structural Genomics Consortium, Buchmann Institute for Molecular Life Sciences , Johann Wolfgang Goethe-University , Max-von-Laue-Straße 15 , 60438 Frankfurt am Main , Germany
Amaury Fernández-Montalván Bayer AG, Drug Discovery, Pharmaceuticals , Müllerstraße 178 , 13353 Berlin , Germany

Collapse

The convergence of artificial intelligence and chemistry for improved drug discovery. Future Med Chem 2018;10:2573-2576. [DOI: 10.4155/fmc-2018-0161] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Can we accelerate medicinal chemistry by augmenting the chemist with Big Data and artificial intelligence? Drug Discov Today 2018;23:1373-1384. [DOI: 10.1016/j.drudis.2018.03.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 02/27/2018] [Accepted: 03/20/2018] [Indexed: 12/18/2022]

Dalke A, Hert J, Kramer C. mmpdb: An Open-Source Matched Molecular Pair Platform for Large Multiproperty Data Sets. J Chem Inf Model 2018;58:902-910. [DOI: 10.1021/acs.jcim.8b00173] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Ehmki ESR, Rarey M. Exploring Structure-Activity Relationships with Three-Dimensional Matched Molecular Pairs-A Review. ChemMedChem 2018;13:482-489. [PMID: 29211343 DOI: 10.1002/cmdc.201700628] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 11/27/2017] [Indexed: 11/10/2022]