Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mitchell JBO. Machine learning methods in chemoinformatics. Wiley Interdiscip Rev Comput Mol Sci 2014;4:468-481. [PMID: 25285160 PMCID: PMC4180928 DOI: 10.1002/wcms.1183] [Citation(s) in RCA: 238] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

For:	Mitchell JBO. Machine learning methods in chemoinformatics. Wiley Interdiscip Rev Comput Mol Sci 2014;4:468-481. [PMID: 25285160 PMCID: PMC4180928 DOI: 10.1002/wcms.1183] [Citation(s) in RCA: 238] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Number

Cited by Other Article(s)

Lane TR, Urbina F, Rank L, Gerlach J, Riabova O, Lepioshkin A, Kazakova E, Vocat A, Tkachenko V, Cole S, Makarov V, Ekins S. Machine Learning Models for Mycobacterium tuberculosisIn Vitro Activity: Prediction and Target Visualization. Mol Pharm 2022;19:674-689. [PMID: 34964633 PMCID: PMC9121329 DOI: 10.1021/acs.molpharmaceut.1c00791] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Smart Materials Prediction: Applying Machine Learning to Lithium Solid-State Electrolyte. MATERIALS 2022;15:ma15031157. [PMID: 35161101 PMCID: PMC8840428 DOI: 10.3390/ma15031157] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/23/2022] [Accepted: 01/31/2022] [Indexed: 11/24/2022]

Jung K, Corrigan N, Wong EHH, Boyer C. Bioactive Synthetic Polymers. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2022;34:e2105063. [PMID: 34611948 DOI: 10.1002/adma.202105063] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 08/13/2021] [Indexed: 05/21/2023]

Unsupervised Representation Learning for Proteochemometric Modeling. Int J Mol Sci 2021;22:ijms222312882. [PMID: 34884688 PMCID: PMC8657702 DOI: 10.3390/ijms222312882] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/25/2021] [Accepted: 11/26/2021] [Indexed: 11/18/2022] Open

Hueffel JA, Sperger T, Funes-Ardoiz I, Ward JS, Rissanen K, Schoenebeck F. Accelerated dinuclear palladium catalyst identification through unsupervised machine learning. Science 2021;374:1134-1140. [PMID: 34822285 DOI: 10.1126/science.abj0999] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Yang Y, Yao K, Repasky MP, Leswing K, Abel R, Shoichet BK, Jerome SV. Efficient Exploration of Chemical Space with Docking and Deep Learning. J Chem Theory Comput 2021;17:7106-7119. [PMID: 34592101 DOI: 10.1021/acs.jctc.1c00810] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Machine Learning Applied to the Modeling of Pharmacological and ADMET Endpoints. Methods Mol Biol 2021. [PMID: 34731464 DOI: 10.1007/978-1-0716-1787-8_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023]

Williams W, Zeng L, Gensch T, Sigman MS, Doyle AG, Anslyn EV. The Evolution of Data-Driven Modeling in Organic Chemistry. ACS CENTRAL SCIENCE 2021;7:1622-1637. [PMID: 34729406 PMCID: PMC8554870 DOI: 10.1021/acscentsci.1c00535] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Indexed: 05/14/2023]

Haywood AL, Redshaw J, Hanson-Heine MWD, Taylor A, Brown A, Mason AM, Gärtner T, Hirst JD. Kernel Methods for Predicting Yields of Chemical Reactions. J Chem Inf Model 2021;62:2077-2092. [PMID: 34699222 DOI: 10.1021/acs.jcim.1c00699] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Chakraborty P, Mandal R, Garg N, Sundararaju B. Recent advances in transition metal-catalyzed asymmetric electrocatalysis. Coord Chem Rev 2021. [DOI: 10.1016/j.ccr.2021.214065] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Lahnsteiner M, Caldera M, Moura HM, Cerrón-Infantes DA, Roeser J, Konegger T, Thomas A, Menche J, Unterlass MM. Hydrothermal polymerization of porous aromatic polyimide networks and machine learning-assisted computational morphology evolution interpretation. JOURNAL OF MATERIALS CHEMISTRY. A 2021;9:19754-19769. [PMID: 34589226 PMCID: PMC8439099 DOI: 10.1039/d1ta01253c] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 08/18/2021] [Indexed: 06/13/2023]

Abstract

We report on the hydrothermal polymerization (HTP) of polyimide (PI) networks using the medium H2O and the comonomers 1,3,5-tris(4-aminophenyl)benzene (TAPB) and pyromellitic acid (PMA). Full condensation is obtained at minimal reaction times of only 2 h at 200 °C. The PI networks are obtained as monoliths and feature thermal stabilities of >500 °C, and in several cases even up to 595 °C. The monoliths are built up by networks of densely packed, near-monodisperse spherical particles and annealed microfibers, and show three types of porosity: (i) intrinsic inter-segment ultramicroporosity (<0.8 nm) of the PI networks composing the particles (∼3-5 μm), (ii) interstitial voids between the particles (0.1-2 μm), and (iii) monolith cell porosity (∽10-100 μm), as studied via low pressure gas physisorption and Hg intrusion porosimetry analyses. This unique hierarchical porosity generates an outstandingly high specific pore volume of 7250 mm3 g-1. A large-scale micromorphological study screening the reaction parameters time, temperature, and the absence/presence of the additive acetic acid was performed. Through expert interpretation of hundreds of scanning electron microscopy (SEM) images of the products of these experiments, we devise a hypothesis for morphology formation and evolution: a monomer salt is initially formed and subsequently transformed to overall eight different fiber, pearl chain, and spherical morphologies, composed of PI and, at long reaction times (>48 h), also PI/SiO2 hybrids that form through reaction with the reaction vessel. Moreover, we have developed a computational image analysis pipeline that deciphers the complex morphologies of these SEM images automatically and also allows for formulating a hypothesis of morphology development in HTP that is in good agreement with the manual morphology analysis. Finally, we upscaled the HTP of PI(TAPB-PMA) and processed the resulting powder into dense cylindrical specimen by green solvent-free warm-pressing, showing that one can follow the full route from the synthesis of these PI networks to a final material without employing harmful solvents.

Collapse

Affiliation(s)

Marianne Lahnsteiner Technische Universität Wien, Institute of Materials Chemistry Getreidemarkt 9/165 1060 Vienna Austria Technische Universität Wien, Institute of Applied Synthetic Chemistry Getreidemarkt 9/163 1060 Vienna Austria CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria
Michael Caldera CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria Max F. Perutz Labs, Campus Vienna Biocenter 5 Dr.-Bohr-Gasse 9 1030 Vienna Austria
Hipassia M Moura Technische Universität Wien, Institute of Materials Chemistry Getreidemarkt 9/165 1060 Vienna Austria Technische Universität Wien, Institute of Applied Synthetic Chemistry Getreidemarkt 9/163 1060 Vienna Austria CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria Universität Konstanz, Department of Chemistry, Solid State Chemistry Universitätsstrasse 10 D-78464 Konstanz Germany
D Alonso Cerrón-Infantes Technische Universität Wien, Institute of Materials Chemistry Getreidemarkt 9/165 1060 Vienna Austria Technische Universität Wien, Institute of Applied Synthetic Chemistry Getreidemarkt 9/163 1060 Vienna Austria CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria Universität Konstanz, Department of Chemistry, Solid State Chemistry Universitätsstrasse 10 D-78464 Konstanz Germany
Jérôme Roeser Technische Universität Berlin, Institute of Chemistry Str. des 17. Juni 115 10623 Berlin Germany
Thomas Konegger Technische Universität Wien, Institute of Chemical Technologies and Analytics Getreidemarkt 9/164 1060 Vienna Austria
Arne Thomas Technische Universität Berlin, Institute of Chemistry Str. des 17. Juni 115 10623 Berlin Germany
Jörg Menche CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria Max F. Perutz Labs, Campus Vienna Biocenter 5 Dr.-Bohr-Gasse 9 1030 Vienna Austria
Miriam M Unterlass Technische Universität Wien, Institute of Materials Chemistry Getreidemarkt 9/165 1060 Vienna Austria Technische Universität Wien, Institute of Applied Synthetic Chemistry Getreidemarkt 9/163 1060 Vienna Austria CeMM - Research Center for Molecular Medicine of the Austrian Academy of Sciences Lazarettgasse 14, AKH BT 25.3 1090 Vienna Austria Universität Konstanz, Department of Chemistry, Solid State Chemistry Universitätsstrasse 10 D-78464 Konstanz Germany

Collapse

Keith JA, Vassilev-Galindo V, Cheng B, Chmiela S, Gastegger M, Müller KR, Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem Rev 2021;121:9816-9872. [PMID: 34232033 PMCID: PMC8391798 DOI: 10.1021/acs.chemrev.1c00107] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Indexed: 12/23/2022]

Machine Learning in Chemical Product Engineering: The State of the Art and a Guide for Newcomers. Processes (Basel) 2021. [DOI: 10.3390/pr9081456] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Xiong G, Shen C, Yang Z, Jiang D, Liu S, Lu A, Chen X, Hou T, Cao D. Featurization strategies for protein–ligand interactions and their applications in scoring function development. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Kerner J, Dogan A, von Recum H. Machine learning and big data provide crucial insight for future biomaterials discovery and research. Acta Biomater 2021;130:54-65. [PMID: 34087445 DOI: 10.1016/j.actbio.2021.05.053] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/24/2021] [Accepted: 05/25/2021] [Indexed: 02/06/2023]

Abstract

Machine learning have been widely adopted in a variety of fields including engineering, science, and medicine revolutionizing how data is collected, used, and stored. Their implementation has led to a drastic increase in the number of computational models for the prediction of various numerical, categorical, or association events given input variables. We aim to examine recent advances in the use of machine learning when applied to the biomaterial field. Specifically, quantitative structure properties relationships offer the unique ability to correlate microscale molecular descriptors to larger macroscale material properties. These new models can be broken down further into four categories: regression, classification, association, and clustering. We examine recent approaches and new uses of machine learning in the three major categories of biomaterials: metals, polymers, and ceramics for rapid property prediction and trend identification. While current research is promising, limitations in the form of lack of standardized reporting and available databases complicates the implementation of described models. Herein, we hope to provide a snapshot of the current state of the field and a beginner's guide to navigating the intersection of biomaterials research and machine learning. STATEMENT OF SIGNIFICANCE: Machine learning and its methods have found a variety of uses beyond the field of computer science but have largely been neglected by those in realm of biomaterials. Through the use of more computational methods, biomaterials development can be expediated while reducing the need for standard trial and error methods. Within, we introduce four basic models that readers can potentially apply to their current research as well as current applications within the field. Furthermore, we hope that this article may act as a "call to action" for readers to realize and address the current lack of implementation within the biomaterials field.

Collapse

Liu Y, Zhou Q, Cui G. Machine Learning Boosting the Development of Advanced Lithium Batteries. SMALL METHODS 2021;5:e2100442. [PMID: 34927866 DOI: 10.1002/smtd.202100442] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 06/22/2021] [Indexed: 06/14/2023]

Tan Z, Li Y, Shi W, Yang S. A Multitask Approach to Learn Molecular Properties. J Chem Inf Model 2021;61:3824-3834. [PMID: 34289687 DOI: 10.1021/acs.jcim.1c00646] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Fritz F, Preissner R, Banerjee P. VirtualTaste: a web server for the prediction of organoleptic properties of chemical compounds. Nucleic Acids Res 2021;49:W679-W684. [PMID: 33905509 PMCID: PMC8262722 DOI: 10.1093/nar/gkab292] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 04/07/2021] [Accepted: 04/09/2021] [Indexed: 12/30/2022] Open

Kashyap K, Siddiqi MI. Recent trends in artificial intelligence-driven identification and development of anti-neurodegenerative therapeutic agents. Mol Divers 2021;25:1517-1539. [PMID: 34282519 DOI: 10.1007/s11030-021-10274-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 07/05/2021] [Indexed: 12/12/2022]

Wiesinger H, Wang Z, Hellweg S. Deep Dive into Plastic Monomers, Additives, and Processing Aids. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:9339-9351. [PMID: 34154322 DOI: 10.1021/acs.est.1c00976] [Citation(s) in RCA: 157] [Impact Index Per Article: 52.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Green AJ, Mohlenkamp MJ, Das J, Chaudhari M, Truong L, Tanguay RL, Reif DM. Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology. PLoS Comput Biol 2021;17:e1009135. [PMID: 34214078 PMCID: PMC8301607 DOI: 10.1371/journal.pcbi.1009135] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 07/23/2021] [Accepted: 05/31/2021] [Indexed: 12/01/2022] Open

Heng T, Yang D, Wang R, Zhang L, Lu Y, Du G. Progress in Research on Artificial Intelligence Applied to Polymorphism and Cocrystal Prediction. ACS OMEGA 2021;6:15543-15550. [PMID: 34179597 PMCID: PMC8223226 DOI: 10.1021/acsomega.1c01330] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 05/28/2021] [Indexed: 06/13/2023]

Deng D, Chen X, Zhang R, Lei Z, Wang X, Zhou F. XGraphBoost: Extracting Graph Neural Network-Based Features for a Better Prediction of Molecular Properties. J Chem Inf Model 2021;61:2697-2705. [PMID: 34009965 DOI: 10.1021/acs.jcim.0c01489] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Abstract

Determining the properties of chemical molecules is essential for screening candidates similar to a specific drug. These candidate molecules are further evaluated for their target binding affinities, side effects, target missing probabilities, etc. Conventional machine learning algorithms demonstrated satisfying prediction accuracies of molecular properties. A molecule cannot be directly loaded into a machine learning model, and a set of engineered features needs to be designed and calculated from a molecule. Such hand-crafted features rely heavily on the experiences of the investigating researchers. The concept of graph neural networks (GNNs) was recently introduced to describe the chemical molecules. The features may be automatically and objectively extracted from the molecules through various types of GNNs, e.g., GCN (graph convolution network), GGNN (gated graph neural network), DMPNN (directed message passing neural network), etc. However, the training of a stable GNN model requires a huge number of training samples and a large amount of computing power, compared with the conventional machine learning strategies. This study proposed the integrated framework XGraphBoost to extract the features using a GNN and build an accurate prediction model of molecular properties using the classifier XGBoost. The proposed framework XGraphBoost fully inherits the merits of the GNN-based automatic molecular feature extraction and XGBoost-based accurate prediction performance. Both classification and regression problems were evaluated using the framework XGraphBoost. The experimental results strongly suggest that XGraphBoost may facilitate the efficient and accurate predictions of various molecular properties. The source code is freely available to academic users at https://github.com/chenxiaowei-vincent/XGraphBoost.git.

Collapse

GPCR_LigandClassify.py; a rigorous machine learning classifier for GPCR targeting compounds. Sci Rep 2021;11:9510. [PMID: 33947911 PMCID: PMC8097070 DOI: 10.1038/s41598-021-88939-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 04/12/2021] [Indexed: 02/02/2023] Open

An AY, Choi KYG, Baghela AS, Hancock REW. An Overview of Biological and Computational Methods for Designing Mechanism-Informed Anti-biofilm Agents. Front Microbiol 2021;12:640787. [PMID: 33927701 PMCID: PMC8076610 DOI: 10.3389/fmicb.2021.640787] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Accepted: 03/23/2021] [Indexed: 12/29/2022] Open

Gallarati S, Fabregat R, Laplaza R, Bhattacharjee S, Wodrich MD, Corminboeuf C. Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts. Chem Sci 2021;12:6879-6889. [PMID: 34123316 PMCID: PMC8153079 DOI: 10.1039/d1sc00482d] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/01/2021] [Indexed: 12/12/2022] Open

Affiliation(s)

Simone Gallarati Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
Raimon Fabregat Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
Rubén Laplaza Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
Sinjini Bhattacharjee Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland Indian Institute of Science Education and Research Dr Homi Bhabha Rd, Ward No. 8, NCL Colony, Pashan Pune Maharashtra 411008 India
Matthew D Wodrich Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
Clemence Corminboeuf Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland National Center for Computational Design and Discovery of Novel Materials (MARVEL), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland

Collapse

Aghdam SA, Brown AMV. Deep learning approaches for natural product discovery from plant endophytic microbiomes. ENVIRONMENTAL MICROBIOME 2021;16:6. [PMID: 33758794 PMCID: PMC7972023 DOI: 10.1186/s40793-021-00375-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 02/21/2021] [Indexed: 05/10/2023]

Sifain AE, Rice BM, Yalkowsky SH, Barnes BC. Machine learning transition temperatures from 2D structure. J Mol Graph Model 2021;105:107848. [PMID: 33667863 DOI: 10.1016/j.jmgm.2021.107848] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/11/2021] [Accepted: 01/19/2021] [Indexed: 10/22/2022]

Karuth A, Alesadi A, Xia W, Rasulev B. Predicting glass transition of amorphous polymers by application of cheminformatics and molecular dynamics simulations. POLYMER 2021. [DOI: 10.1016/j.polymer.2021.123495] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Ding J, Xu N, Nguyen MT, Qiao Q, Shi Y, He Y, Shao Q. Machine learning for molecular thermodynamics. Chin J Chem Eng 2021. [DOI: 10.1016/j.cjche.2020.10.044] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

McComb M, Bies R, Ramanathan M. Machine learning in pharmacometrics: Opportunities and challenges. Br J Clin Pharmacol 2021;88:1482-1499. [PMID: 33634893 DOI: 10.1111/bcp.14801] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 02/08/2021] [Accepted: 02/12/2021] [Indexed: 12/13/2022] Open

Jiang D, Wu Z, Hsieh CY, Chen G, Liao B, Wang Z, Shen C, Cao D, Wu J, Hou T. Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models. J Cheminform 2021;13:12. [PMID: 33597034 PMCID: PMC7888189 DOI: 10.1186/s13321-020-00479-8] [Citation(s) in RCA: 162] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 11/26/2020] [Indexed: 12/31/2022] Open

Minias A, Żukowska L, Lechowicz E, Gąsior F, Knast A, Podlewska S, Zygała D, Dziadek J. Early Drug Development and Evaluation of Putative Antitubercular Compounds in the -Omics Era. Front Microbiol 2021;11:618168. [PMID: 33603720 PMCID: PMC7884339 DOI: 10.3389/fmicb.2020.618168] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 12/30/2020] [Indexed: 12/14/2022] Open

Affiliation(s)

Alina Minias Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
Lidia Żukowska Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland BioMedChem Doctoral School of the University of Lodz and the Institutes of the Polish Academy of Sciences in Lodz, Lodz, Poland
Ewelina Lechowicz Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland Institute of Microbiology, Biotechnology and Immunology, Faculty of Biology and Environmental Protection, University of Lodz, Lodz, Poland
Filip Gąsior Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland BioMedChem Doctoral School of the University of Lodz and the Institutes of the Polish Academy of Sciences in Lodz, Lodz, Poland
Agnieszka Knast Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland Institute of Molecular and Industrial Biotechnology, Faculty of Biotechnology and Food Sciences, Lodz University of Technology, Lodz, Poland
Sabina Podlewska Department of Technology and Biotechnology of Drugs, Jagiellonian University Medical College, Krakow, Poland Maj Institute of Pharmacology, Polish Academy of Sciences, Krakow, Poland
Daria Zygała Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland Institute of Microbiology, Biotechnology and Immunology, Faculty of Biology and Environmental Protection, University of Lodz, Lodz, Poland
Jarosław Dziadek Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland

Collapse

Espinoza GZ, Angelo RM, Oliveira PR, Honorio KM. Evaluating Deep Learning models for predicting ALK-5 inhibition. PLoS One 2021;16:e0246126. [PMID: 33508008 PMCID: PMC7842961 DOI: 10.1371/journal.pone.0246126] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 01/14/2021] [Indexed: 11/18/2022] Open

Predictive Models for the Binary Diffusion Coefficient at Infinite Dilution in Polar and Nonpolar Fluids. MATERIALS 2021;14:ma14030542. [PMID: 33498723 PMCID: PMC7866074 DOI: 10.3390/ma14030542] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 01/07/2021] [Accepted: 01/19/2021] [Indexed: 12/03/2022]

Shamsara J. Evaluation of the performance of various machine learning methods on the discrimination of the active compounds. Chem Biol Drug Des 2021;97:930-943. [DOI: 10.1111/cbdd.13819] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 12/10/2020] [Accepted: 12/21/2020] [Indexed: 12/12/2022]

Chemoinformatics and QSAR. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Extended Regression Modeling of the Toxicity of Phenol Derivatives to <i>Tetrahymena pyriformis</i> Using the Electronic-Structure Informatics Descriptor. JOURNAL OF COMPUTER AIDED CHEMISTRY 2021. [DOI: 10.2751/jcac.22.17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Yuan J, Liu X, Wang S, Chang C, Zeng Q, Song Z, Jin Y, Zeng Q, Sun G, Ruan S, Greenwell C, Abramov YA. Virtual coformer screening by a combined machine learning and physics-based approach. CrystEngComm 2021. [DOI: 10.1039/d1ce00587a] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Affiliation(s)

Jiuchuang Yuan XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Xuetao Liu XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogeomics, Peking University Shenzhen Graduate School, Shenzhen, 518055 China
Simin Wang XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Chao Chang XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Qiao Zeng XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Zhengtian Song XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Yingdi Jin XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Qun Zeng XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Guangxu Sun XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Shigang Ruan XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Chandler Greenwell XtalPi Inc, Cambridge, Massachusetts 02142, USA
Yuriy A. Abramov XtalPi Inc, Cambridge, Massachusetts 02142, USA Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, USA

Collapse

Wang MWH, Goodman JM, Allen TEH. Machine Learning in Predictive Toxicology: Recent Applications and Future Directions for Classification Models. Chem Res Toxicol 2020;34:217-239. [PMID: 33356168 DOI: 10.1021/acs.chemrestox.0c00316] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Lane TR, Foil DH, Minerali E, Urbina F, Zorn KM, Ekins S. Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery. Mol Pharm 2020;18:403-415. [PMID: 33325717 DOI: 10.1021/acs.molpharmaceut.0c01013] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Abstract

Machine learning methods are attracting considerable attention from the pharmaceutical industry for use in drug discovery and applications beyond. In recent studies, we and others have applied multiple machine learning algorithms and modeling metrics and, in some cases, compared molecular descriptors to build models for individual targets or properties on a relatively small scale. Several research groups have used large numbers of datasets from public databases such as ChEMBL in order to evaluate machine learning methods of interest to them. The largest of these types of studies used on the order of 1400 datasets. We have now extracted well over 5000 datasets from CHEMBL for use with the ECFP6 fingerprint and in comparison of our proprietary software Assay Central with random forest, k-nearest neighbors, support vector classification, naïve Bayesian, AdaBoosted decision trees, and deep neural networks (three layers). Model performance was assessed using an array of fivefold cross-validation metrics including area-under-the-curve, F1 score, Cohen's kappa, and Matthews correlation coefficient. Based on ranked normalized scores for the metrics or datasets, all methods appeared comparable, while the distance from the top indicated that Assay Central and support vector classification were comparable. Unlike prior studies which have placed considerable emphasis on deep neural networks (deep learning), no advantage was seen in this case. If anything, Assay Central may have been at a slight advantage as the activity cutoff for each of the over 5000 datasets representing over 570,000 unique compounds was based on Assay Central performance, although support vector classification seems to be a strong competitor. We also applied Assay Central to perform prospective predictions for the toxicity targets PXR and hERG to further validate these models. This work appears to be the largest scale comparison of these machine learning algorithms to date. Future studies will likely evaluate additional databases, descriptors, and machine learning algorithms and further refine the methods for evaluating and comparing such models.

Collapse

Sun W, Braatz RD. Opportunities in tensorial data analytics for chemical and biological manufacturing processes. Comput Chem Eng 2020. [DOI: 10.1016/j.compchemeng.2020.107099] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

de Albuquerque S, Cianni L, de Vita D, Duque C, Gomes ASM, Gomes P, Laughton C, Leitão A, Montanari CA, Montanari R, Ribeiro JFR, da Silva JS, Teixeira C. Molecular design aided by random forests and synthesis of potent trypanocidal agents as cruzain inhibitors for Chagas disease treatment. Chem Biol Drug Des 2020;96:948-960. [PMID: 33058457 DOI: 10.1111/cbdd.13663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 12/13/2019] [Accepted: 12/23/2019] [Indexed: 11/30/2022]

Yang S, Ye Q, Ding J, Yin, Lu A, Chen X, Hou T, Cao D. Current advances in ligand‐based target prediction. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1504] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Fine J, Kuan-Yu Liu J, Beck A, Alzarieni KZ, Ma X, Boulos VM, Kenttämaa HI, Chopra G. Graph-based machine learning interprets and predicts diagnostic isomer-selective ion-molecule reactions in tandem mass spectrometry. Chem Sci 2020;11:11849-11858. [PMID: 34094414 PMCID: PMC8162943 DOI: 10.1039/d0sc02530e] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Capecchi A, Reymond JL. Assigning the Origin of Microbial Natural Products by Chemical Space Map and Machine Learning. Biomolecules 2020;10:E1385. [PMID: 32998475 PMCID: PMC7600738 DOI: 10.3390/biom10101385] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 09/22/2020] [Accepted: 09/25/2020] [Indexed: 12/20/2022] Open

Lentelink NJ, Palkovits S. Transfer Learning as Tool to Enhance Predictions of Molecular Properties Based on 2D Projections. ADVANCED THEORY AND SIMULATIONS 2020. [DOI: 10.1002/adts.202000148] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Shamsara J. A Random Forest Model to Predict the Activity of a Large Set of Soluble Epoxide Hydrolase Inhibitors Solely Based on a Set of Simple Fragmental Descriptors. Comb Chem High Throughput Screen 2020;22:555-569. [PMID: 31622216 DOI: 10.2174/1386207322666191016110232] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 08/02/2019] [Accepted: 09/19/2019] [Indexed: 01/10/2023]

Zadorozhnii PV, Kiselev VV, Kharchenko AV. In silico toxicity evaluation of Salubrinal and its analogues. Eur J Pharm Sci 2020;155:105538. [PMID: 32889087 DOI: 10.1016/j.ejps.2020.105538] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 08/14/2020] [Accepted: 08/30/2020] [Indexed: 02/06/2023]

100

Achary PGR. Applications of Quantitative Structure-Activity Relationships (QSAR) based Virtual Screening in Drug Design: A Review. Mini Rev Med Chem 2020;20:1375-1388. [DOI: 10.2174/1389557520666200429102334] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 11/07/2019] [Accepted: 11/08/2019] [Indexed: 12/18/2022]

Abstract The scientists, and the researchers around the globe generate tremendous amount of information everyday; for instance, so far more than 74 million molecules are registered in Chemical Abstract Services. According to a recent study, at present we have around 1060 molecules, which are classified as new drug-like molecules. The library of such molecules is now considered as ‘dark chemical space’ or ‘dark chemistry.’ Now, in order to explore such hidden molecules scientifically, a good number of live and updated databases (protein, cell, tissues, structure, drugs, etc.) are available today. The synchronization of the three different sciences: ‘genomics’, proteomics and ‘in-silico simulation’ will revolutionize the process of drug discovery. The screening of a sizable number of drugs like molecules is a challenge and it must be treated in an efficient manner. Virtual screening (VS) is an important computational tool in the drug discovery process; however, experimental verification of the drugs also equally important for the drug development process. The quantitative structure-activity relationship (QSAR) analysis is one of the machine learning technique, which is extensively used in VS techniques. QSAR is well-known for its high and fast throughput screening with a satisfactory hit rate. The QSAR model building involves (i) chemo-genomics data collection from a database or literature (ii) Calculation of right descriptors from molecular representation (iii) establishing a relationship (model) between biological activity and the selected descriptors (iv) application of QSAR model to predict the biological property for the molecules. All the hits obtained by the VS technique needs to be experimentally verified. The present mini-review highlights: the web-based machine learning tools, the role of QSAR in VS techniques, successful applications of QSAR based VS leading to the drug discovery and advantages and challenges of QSAR. Collapse