Reference Citation Analysis

For: Shim E, Kammeraad JA, Xu Z, Tewari A, Cernak T, Zimmerman PM. Predicting reaction conditions from limited data through active transfer learning. Chem Sci 2022;13:6655-6668. [PMID: 35756521 PMCID: PMC9172577 DOI: 10.1039/d1sc06932b] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 12/10/2021] [Accepted: 05/10/2022] [Indexed: 12/30/2022] Open

Cited by Other Article(s)

Xu J, Ye X, Lv Z, Chen YH, Wang XS. The Role of Base in Reaction Performance of Photochemical Synthesis of Thiazoles: An Integrated Theoretical and Experimental Study. Chemistry 2024;30:e202304279. [PMID: 38409580 DOI: 10.1002/chem.202304279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2023] [Revised: 02/25/2024] [Accepted: 02/26/2024] [Indexed: 02/28/2024]

Wang JY, Stevens JM, Kariofillis SK, Tom MJ, Golden DL, Li J, Tabora JE, Parasram M, Shields BJ, Primer DN, Hao B, Del Valle D, DiSomma S, Furman A, Zipp GG, Melnikov S, Paulson J, Doyle AG. Identifying general reaction conditions by bandit optimization. Nature 2024;626:1025-1033. [PMID: 38418912 DOI: 10.1038/s41586-024-07021-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Accepted: 01/03/2024] [Indexed: 03/02/2024]

Affiliation(s)

Jason Y Wang: Department of Chemistry, Princeton University, Princeton, NJ, USA; Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA
Jason M Stevens: Chemical Process Development, Bristol Myers Squibb, Summit, NJ, USA
Stavros K Kariofillis: Department of Chemistry, Princeton University, Princeton, NJ, USA; Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA; Department of Chemistry, Columbia University, New York, NY, USA
Mai-Jan Tom: Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA
Dung L Golden: Chemical Process Development, Bristol Myers Squibb, Summit, NJ, USA
Jun Li: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
Jose E Tabora: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
Marvin Parasram: Department of Chemistry, Princeton University, Princeton, NJ, USA; Department of Chemistry, New York University, New York, NY, USA
Benjamin J Shields: Department of Chemistry, Princeton University, Princeton, NJ, USA; Molecular Structure and Design, Bristol Myers Squibb, Cambridge, MA, USA
David N Primer: Chemical Process Development, Bristol Myers Squibb, Summit, NJ, USA; Loxo Oncology at Lilly, Louisville, CO, USA
Bo Hao: Janssen Research and Development, Spring House, PA, USA
David Del Valle: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
Stacey DiSomma: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
Ariel Furman: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
G Greg Zipp: Discovery Synthesis, Bristol Myers Squibb, Princeton, NJ, USA
Sergey Melnikov: Spectrix Analytical Services, North Haven, CT, USA
James Paulson: Chemical Process Development, Bristol Myers Squibb, New Brunswick, NJ, USA
Abigail G Doyle: Department of Chemistry, Princeton University, Princeton, NJ, USA; Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA

Wang X, Hsieh CY, Yin X, Wang J, Li Y, Deng Y, Jiang D, Wu Z, Du H, Chen H, Li Y, Liu H, Wang Y, Luo P, Hou T, Yao X. Generic Interpretable Reaction Condition Predictions with Open Reaction Condition Datasets and Unsupervised Learning of Reaction Center. RESEARCH (WASHINGTON, D.C.) 2023;6:0231. [PMID: 37849643 PMCID: PMC10578430 DOI: 10.34133/research.0231] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/01/2023] [Accepted: 08/29/2023] [Indexed: 10/19/2023]

Affiliation(s)

Xiaorui Wang: Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao, 999078, China; CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Chang-Yu Hsieh: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
Xiaodan Yin: Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao, 999078, China; CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Jike Wang: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Yuquan Li: College of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, 730000, China
Yafeng Deng: CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Dejun Jiang: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Zhenxing Wu: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
Hongyan Du: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
Hongming Chen: Center of Chemistry and Chemical Biology, Guangzhou Regenerative Medicine and Health Guangdong Laboratory, Guangzhou 510530, China
Yun Li: College of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, 730000, China
Huanxiang Liu: Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China
Yuwei Wang: College of Pharmacy, Shaanxi University of Chinese Medicine, Xianyang, Shaanxi, 712044, China
Pei Luo: Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao, 999078, China
Tingjun Hou: Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
Xiaojun Yao: Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China

Rinehart NI, Saunthwal RK, Wellauer J, Zahrt AF, Schlemper L, Shved AS, Bigler R, Fantasia S, Denmark SE. A machine-learning tool to predict substrate-adaptive conditions for Pd-catalyzed C-N couplings. Science 2023;381:965-972. [PMID: 37651532 DOI: 10.1126/science.adg2114] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 12/08/2022] [Accepted: 08/01/2023] [Indexed: 09/02/2023]

Shim E, Tewari A, Cernak T, Zimmerman PM. Machine Learning Strategies for Reaction Development: Toward the Low-Data Limit. J Chem Inf Model 2023;63:3659-3668. [PMID: 37312524 PMCID: PMC11163943 DOI: 10.1021/acs.jcim.3c00577] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 06/15/2023]

Faurschou NV, Taaning RH, Pedersen CM. Substrate specific closed-loop optimization of carbohydrate protective group chemistry using Bayesian optimization and transfer learning. Chem Sci 2023;14:6319-6329. [PMID: 37325141 PMCID: PMC10266441 DOI: 10.1039/d3sc01261a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/08/2023] [Accepted: 05/12/2023] [Indexed: 06/17/2023] Open

Capaldo L, Wen Z, Noël T. A field guide to flow chemistry for synthetic organic chemists. Chem Sci 2023;14:4230-4247. [PMID: 37123197 PMCID: PMC10132167 DOI: 10.1039/d3sc00992k] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Received: 02/22/2023] [Accepted: 03/15/2023] [Indexed: 03/17/2023] Open

Chen Y, Ou Y, Zheng P, Huang Y, Ge F, Dral PO. Benchmark of general-purpose machine learning-based quantum mechanical method AIQM1 on reaction barrier heights. J Chem Phys 2023;158:074103. [PMID: 36813722 DOI: 10.1063/5.0137101] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Indexed: 02/01/2023] Open

Singh S, Sunoj RB. Molecular Machine Learning for Chemical Catalysis: Prospects and Challenges. Acc Chem Res 2023;56:402-412. [PMID: 36715248 DOI: 10.1021/acs.accounts.2c00801] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Indexed: 01/31/2023]

Abstract

Conspectus: In the domain of reaction development, one aims to obtain higher efficacies as measured in terms of yield and/or selectivities. During the empirical cycles, an admixture of outcomes from low to high yields/selectivities is expected. While it is not easy to identify all of the factors that might impact the reaction efficiency, complex and nonlinear dependence on the nature of reactants, catalysts, solvents, etc. is quite likely. Developmental stages of newer reactions would typically offer a few hundreds of samples with variations in participating molecules and/or reaction conditions. These "observations" and their "output" can be harnessed as valuable labeled data for developing molecular machine learning (ML) models. Once a robust ML model is built for a specific reaction under development, it can predict the reaction outcome for any new choice of substrates/catalyst in a few seconds/minutes and thus can expedite the identification of promising candidates for experimental validation. Recent years have witnessed impressive applications of ML in the molecular world, most of them aimed at predicting important chemical or biological properties. We believe that an integration of effective ML workflows can be made richly beneficial to reaction discovery.

As with any new technology, direct adaptation of ML as used in well-developed domains, such as natural language processing (NLP) and image recognition, is unlikely to succeed in reaction discovery. Some of the challenges stem from ineffective featurization of the molecular space, unavailability of quality data and its distribution, in making the right choice of ML model and its technically robust deployment. It shall be noted that there is no universal ML model suitable for an inherently high-dimensional problem such as chemical reactions. Given these backgrounds, rendering ML tools conducive for reactions is an exciting as well as challenging endeavor at the same time. With the increased availability of efficient ML algorithms, we focused on tapping their potential for small-data reaction discovery (a few hundreds to thousands of samples).

In this Account, we describe both feature engineering and feature learning approaches for molecular ML as applied to diverse reactions of high contemporary interest. Among these, catalytic asymmetric hydrogenation of imines/alkenes, β-C(sp³)-H bond functionalization, and relay Heck reaction employed a feature engineering approach using the quantum-chemically derived physical organic descriptors as the molecular features, all designed to predict the enantioselectivity. The selection of molecular features to customize it for a reaction of interest is described, along with emphasizing the chemical insights that could be gathered through the use of such features. Feature learning methods for predicting the yield of Buchwald-Hartwig cross-coupling, deoxyfluorination of alcohols, and enantioselectivity of N,S-acetal formation are found to offer excellent predictions. We propose a transfer learning protocol, wherein an ML model such as a language model is trained on a large number of molecules (10⁵-10⁶) and fine-tuned on a focused library of target task reactions, as an effective alternative for small-data reaction discovery (10²-10³ reactions). The exploitation of deep neural network latent space as a method for generative tasks to identify useful substrates for a reaction is demonstrated as a promising strategy.

Seifrid M, Pollice R, Aguilar-Granda A, Morgan Chan Z, Hotta K, Ser CT, Vestfrid J, Wu TC, Aspuru-Guzik A. Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving Lab. Acc Chem Res 2022;55:2454-2466. [PMID: 35948428 PMCID: PMC9454899 DOI: 10.1021/acs.accounts.2c00220] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Received: 04/11/2022] [Indexed: 01/19/2023]

Abstract

We must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take months to years. However, data-driven automated laboratories, or self-driving laboratories, can significantly accelerate molecular and materials discovery. Recently, substantial advancements have been made in the areas of machine learning and optimization algorithms that have allowed researchers to extract valuable knowledge from multidimensional data sets. Machine learning models can be trained on large data sets from the literature or databases, but their performance can often be hampered by a lack of negative results or metadata. In contrast, data generated by self-driving laboratories can be information-rich, containing precise details of the experimental conditions and metadata. Consequently, much larger amounts of high-quality data are gathered in self-driving laboratories. When placed in open repositories, this data can be used by the research community to reproduce experiments, for more in-depth analysis, or as the basis for further investigation. Accordingly, high-quality open data sets will increase the accessibility and reproducibility of science, which is sorely needed.

In this Account, we describe our efforts to build a self-driving lab for the development of a new class of materials: organic semiconductor lasers (OSLs). Since they have only recently been demonstrated, little is known about the molecular and material design rules for thin-film, electrically-pumped OSL devices as compared to other technologies such as organic light-emitting diodes or organic photovoltaics. To realize high-performing OSL materials, we are developing a flexible system for automated synthesis via iterative Suzuki-Miyaura cross-coupling reactions. This automated synthesis platform is directly coupled to the analysis and purification capabilities. Subsequently, the molecules of interest can be transferred to an optical characterization setup. We are currently limited to optical measurements of the OSL molecules in solution. However, material properties are ultimately most important in the solid state (e.g., as a thin-film device). To that end and for a different scientific goal, we are developing a self-driving lab for inorganic thin-film materials focused on the oxygen evolution reaction.

While the future of self-driving laboratories is very promising, numerous challenges still need to be overcome. These challenges can be split into cognition and motor function. Generally, the cognitive challenges are related to optimization with constraints or unexpected outcomes for which general algorithmic solutions have yet to be developed. A more practical challenge that could be resolved in the near future is that of software control and integration because few instrument manufacturers design their products with self-driving laboratories in mind. Challenges in motor function are largely related to handling heterogeneous systems, such as dispensing solids or performing extractions. As a result, it is critical to understand that adapting experimental procedures that were designed for human experimenters is not as simple as transferring those same actions to an automated system, and there may be more efficient ways to achieve the same goal in an automated fashion. Accordingly, for self-driving laboratories, we need to carefully rethink the translation of manual experimental protocols.
