1
|
Lindley S, Lu Y, Shukla D. The Experimentalist's Guide to Machine Learning for Small Molecule Design. ACS APPLIED BIO MATERIALS 2024; 7:657-684. [PMID: 37535819 PMCID: PMC10880109 DOI: 10.1021/acsabm.3c00054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 07/17/2023] [Indexed: 08/05/2023]
Abstract
Initially part of the field of artificial intelligence, machine learning (ML) has become a booming research area since branching out into its own field in the 1990s. After three decades of refinement, ML algorithms have accelerated scientific developments across a variety of research topics. The field of small molecule design is no exception, and an increasing number of researchers are applying ML techniques in their pursuit of discovering, generating, and optimizing small molecule compounds. The goal of this review is to provide simple, yet descriptive, explanations of some of the most commonly utilized ML algorithms in the field of small molecule design along with those that are highly applicable to an experimentally focused audience. The algorithms discussed here span across three ML paradigms: supervised learning, unsupervised learning, and ensemble methods. Examples from the published literature will be provided for each algorithm. Some common pitfalls of applying ML to biological and chemical data sets will also be explained, alongside a brief summary of a few more advanced paradigms, including reinforcement learning and semi-supervised learning.
Collapse
Affiliation(s)
- Sarah
E. Lindley
- Department
of Bioengineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| | - Yiyang Lu
- Department
of Chemical and Biomolecular Engineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| | - Diwakar Shukla
- Department
of Bioengineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Department
of Chemical and Biomolecular Engineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Center
for Biophysics & Computational Biology, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Department
of Plant Biology, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| |
Collapse
|
2
|
Zajičková M, Moncoľ J, Šesták S, Kóňa J, Koóš M, Bella M. Synthesis of 4a-Carba- d-lyxofuranose Derivatives and Their Evaluation as Inhibitors of GH38 α-Mannosidases. European J Org Chem 2019. [DOI: 10.1002/ejoc.201801586] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Mária Zajičková
- Centre for Glycomics; Institute of Chemistry; Slovak Academy of Sciences; Dúbravská cesta 9 SK-845 38 Bratislava Slovakia
| | - Ján Moncoľ
- Department of Inorganic Chemistry; Institute of Chemistry; Faculty of Chemical and Food Technology; Radlinského 9 SK-812 37 Bratislava Slovakia
| | - Sergej Šesták
- Centre for Glycomics; Institute of Chemistry; Slovak Academy of Sciences; Dúbravská cesta 9 SK-845 38 Bratislava Slovakia
| | - Juraj Kóňa
- Centre for Glycomics; Institute of Chemistry; Slovak Academy of Sciences; Dúbravská cesta 9 SK-845 38 Bratislava Slovakia
| | - Miroslav Koóš
- Centre for Glycomics; Institute of Chemistry; Slovak Academy of Sciences; Dúbravská cesta 9 SK-845 38 Bratislava Slovakia
| | - Maroš Bella
- Centre for Glycomics; Institute of Chemistry; Slovak Academy of Sciences; Dúbravská cesta 9 SK-845 38 Bratislava Slovakia
| |
Collapse
|
3
|
Karawajczyk A, Orrling KM, de Vlieger JSB, Rijnders T, Tzalis D. The European Lead Factory: A Blueprint for Public-Private Partnerships in Early Drug Discovery. Front Med (Lausanne) 2017; 3:75. [PMID: 28154815 PMCID: PMC5243859 DOI: 10.3389/fmed.2016.00075] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 12/23/2016] [Indexed: 11/17/2022] Open
Abstract
The European Lead Factory (ELF) is a public–private partnership (PPP) that provides researchers in Europe with a unique platform for translation of innovative biology and chemistry into high-quality starting points for drug discovery. It combines an exceptional collection of small molecules, high-throughput screening (HTS) infrastructure, and hit follow-up capabilities to advance research projects from both private companies and publicly funded researchers. By active interactions with the wider European life science community, ELF connects and unites bright ideas, talent, and experience from several disciplines. As a result, ELF is a unique, collaborative lead generation engine that has so far resulted in >4,500 hit compounds with a defined biological activity from 83 successfully completed HTS and hit evaluation campaigns. The PPP has also produced more than 120,000 novel innovative library compounds that complement the 327,000 compounds contributed by the participating pharmaceutical companies. Intrinsic to its setup, ELF enables breakthroughs in areas with unmet medical and societal needs, where no individual entity would be able to create a comparable impact in such a short time.
Collapse
|
4
|
Fernandes RA, Kattanguru P, Gholap SP, Chaudhari DA. Recent advances in the Overman rearrangement: synthesis of natural products and valuable compounds. Org Biomol Chem 2017; 15:2672-2710. [DOI: 10.1039/c6ob02625g] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
This review documents the reports since 2005 on the Overman rearrangement, an important C–N bond forming reaction that has been profoundly used in the synthesis of natural products, synthetic intermediates, building blocks and valuable compounds.
Collapse
Affiliation(s)
- Rodney A. Fernandes
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai 400076
- India
| | - Pullaiah Kattanguru
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai 400076
- India
| | - Sachin P. Gholap
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai 400076
- India
| | - Dipali A. Chaudhari
- Department of Chemistry
- Indian Institute of Technology Bombay
- Mumbai 400076
- India
| |
Collapse
|
5
|
The chemistry of the carbon-transition metal double and triple bond: Annual survey covering the year 2015. Coord Chem Rev 2016. [DOI: 10.1016/j.ccr.2016.08.007] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
6
|
Lo Re D, Jones L, Giralt E, Murphy P. Synthesis of an Orthogonally Protected Polyhydroxylated Cyclopentene from l-Sorbose. Chem Asian J 2016; 11:2035-40. [PMID: 27304425 DOI: 10.1002/asia.201600736] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Indexed: 11/06/2022]
Abstract
The use of l-sorbose in the synthesis of functionalized cyclopentene derivatives was accomplished. These cyclopentene derivatives are related to those found in naturally occurring jatrophane frameworks and in other bioactive compounds. The formation of allyl α-l-sorbopyranoside was a key synthetic step. Regioselective introduction of protecting groups was followed by the hydrolysis of the allyl glycoside to furnish a fully protected acyclic l-sorbose derivative. This acyclic intermediate was subsequently used to give an orthogonally protected polyhydroxylated cyclopentene, which has potential for further synthesis of bioactive compounds. The protected cyclopentene itself showed a clear cytotoxic activity when tested against a panel of human cancer cell lines (HT29, LS174T, SW620, A549, and HeLa cells).
Collapse
Affiliation(s)
- Daniele Lo Re
- School of Chemistry, National University of Ireland, Galway, University Road, Galway, Ireland.
| | - Leigh Jones
- School of Chemistry, National University of Ireland, Galway, University Road, Galway, Ireland
| | - Ernest Giralt
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, C/Baldiri Reixac 10, Barcelona, E-08028, Spain.,Department of Organic Chemistry, University of Barcelona, Marti i Franques 1-11, Barcelona, E-08028, Spain
| | - Paul Murphy
- School of Chemistry, National University of Ireland, Galway, University Road, Galway, Ireland.
| |
Collapse
|
7
|
Karawajczyk A, Giordanetto F, Benningshof J, Hamza D, Kalliokoski T, Pouwer K, Morgentin R, Nelson A, Müller G, Piechot A, Tzalis D. Expansion of chemical space for collaborative lead generation and drug discovery: the European Lead Factory Perspective. Drug Discov Today 2015; 20:1310-6. [PMID: 26429298 DOI: 10.1016/j.drudis.2015.09.009] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Revised: 09/02/2015] [Accepted: 09/11/2015] [Indexed: 01/06/2023]
Abstract
High-throughput screening (HTS) represents a major cornerstone of drug discovery. The availability of an innovative, relevant and high-quality compound collection to be screened often dictates the final fate of a drug discovery campaign. Given that the chemical space to be sampled in research programs is practically infinite and sparsely populated, significant efforts and resources need to be invested in the generation and maintenance of a competitive compound collection. The European Lead Factory (ELF) project is addressing this challenge by leveraging the diverse experience and know-how of academic groups and small and medium enterprises (SMEs) engaged in synthetic and/or medicinal chemistry. Here, we describe the novelty, diversity, structural complexity, physicochemical characteristics and overall attractiveness of this first batch of ELF compounds for HTS purposes.
Collapse
Affiliation(s)
- Anna Karawajczyk
- Taros Chemicals GmbH & Co. KG, Emil-Figge-Str. 76a, 44227 Dortmund, Germany
| | | | | | - Daniel Hamza
- Sygnature Discovery, BioCity, Nottingham NG1 1GF, UK
| | - Tuomo Kalliokoski
- Lead Discovery Center GmbH, Otto-Hahn-Strabe 15, 44227 Dortmund, Germany
| | - Kees Pouwer
- Syncom BV, Kadijk 3, 9747 AT Groningen, The Netherlands
| | | | - Adam Nelson
- School of Chemistry, University of Leeds, Leeds LS2 9JT, UK; Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, UK
| | - Gerhard Müller
- Mercachem, Kerkenbos 1013, 6546 BB Nijmegen, The Netherlands
| | - Alexander Piechot
- Taros Chemicals GmbH & Co. KG, Emil-Figge-Str. 76a, 44227 Dortmund, Germany
| | - Dimitrios Tzalis
- Taros Chemicals GmbH & Co. KG, Emil-Figge-Str. 76a, 44227 Dortmund, Germany.
| |
Collapse
|