1
|
Beck AG, Fine J, Aggarwal P, Regalado EL, Levorse D, De Jesus Silva J, Sherer EC. Machine learning models and performance dependency on 2D chemical descriptor space for retention time prediction of pharmaceuticals. J Chromatogr A 2024; 1730:465109. [PMID: 38968662 DOI: 10.1016/j.chroma.2024.465109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/07/2024]
Abstract
The predictive modeling of liquid chromatography methods can be an invaluable asset, potentially saving countless hours of labor while also reducing solvent consumption and waste. Tasks such as physicochemical screening and preliminary method screening systems where large amounts of chromatography data are collected from fast and routine operations are particularly well suited for both leveraging large datasets and benefiting from predictive models. Therefore, the generation of predictive models for retention time is an active area of development. However, for these predictive models to gain acceptance, researchers first must have confidence in model performance and the computational cost of building them should be minimal. In this study, a simple and cost-effective workflow for the development of machine learning models to predict retention time using only Molecular Operating Environment 2D descriptors as input for support vector regression is developed. Furthermore, we investigated the relative performance of models based on molecular descriptor space by utilizing uniform manifold approximation and projection and clustering with Gaussian mixture models to identify chemically distinct clusters. Results outlined herein demonstrate that local models trained on clusters in chemical space perform equivalently when compared to models trained on all data. Through 10-fold cross-validation on a comprehensive set containing 67,950 of our company's proprietary analytes, these models achieved coefficients of determination of 0.84 and 3 % error in terms of retention time. This promising statistical significance is found to translate from cross-validation to prospective prediction on an external test set of pharmaceutically relevant analytes. The observed equivalency of global and local modeling of large datasets is retained with METLIN's SMRT dataset, thereby confirming the wider applicability of the developed machine learning workflows for global models.
Collapse
Affiliation(s)
- Armen G Beck
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Jonathan Fine
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Pankaj Aggarwal
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA.
| | - Erik L Regalado
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Dorothy Levorse
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | | | - Edward C Sherer
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| |
Collapse
|
2
|
Wen TL, Bai JH, Bao MM, Qin Y, Su Y, Guo YL. Ultrasonic sample introduction combined with flame assisted thermal ionization: Pretreatment-free direct mass spectrometry analysis for fraction collecting tubes of preparative liquid chromatography. Talanta 2023; 259:124508. [PMID: 37043878 DOI: 10.1016/j.talanta.2023.124508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 03/18/2023] [Accepted: 03/30/2023] [Indexed: 04/14/2023]
Abstract
Ultrasonic sample introduction combined with flame assisted thermal ionization mass spectrometry (USI-FATI-MS) was developed to monitor the fractions of preparative liquid chromatography. Recently, ultrasound-based sample introduction techniques have achieved great advance in the field of high-throughput analysis. However, it is still a challenge to directly apply these existing techniques to the analysis of macro volume samples (mL level). In this work, ultrasonic sample introduction combined with flame assisted thermal ionization was used for pretreatment-free direct mass spectrometry analysis of micro to macro volume samples (μL-mL level). Utilizing this unique design of ultrasonic sample introduction, liquid sample in the container can be quickly atomized to the gas phase without contact. Then, due to the flame assisted thermal ionization source, desolvation and ionization of the sample droplets will occur immediately. USI-FATI-MS has shown excellent sensitivity, repeatability and great compatibility to solvents and compounds with a wide range of polarity. As a proof of concept, USI-FATI-MS has been applied for rapid monitoring and identification of purified synthetic and natural products in fractions.
Collapse
Affiliation(s)
- Tian-Lun Wen
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China; National Center for Organic Mass Spectrometry in Shanghai, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai, 200032, China
| | - Jia-Hui Bai
- National Center for Organic Mass Spectrometry in Shanghai, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai, 200032, China
| | - Ming-Mai Bao
- National Center for Organic Mass Spectrometry in Shanghai, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai, 200032, China
| | - Yong Qin
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China
| | - Yue Su
- Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China.
| | - Yin-Long Guo
- National Center for Organic Mass Spectrometry in Shanghai, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai, 200032, China.
| |
Collapse
|
3
|
Hettiarachchi K, Streckfuss E, Sanzone JR, Wang J, Hayes M, Kong M, Greshock TJ. Microscale Purification with Direct Charged Aerosol Detector Quantitation Using Selective Online One- or Two-Dimensional Liquid Chromatography. Anal Chem 2022; 94:8309-8316. [DOI: 10.1021/acs.analchem.2c00750] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Kanaka Hettiarachchi
- Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| | - Eric Streckfuss
- Discovery Chemistry, Merck & Co., Inc., 770 Sumneytown Pike, West Point, Pennsylvania 19486, United States
| | - Jillian R. Sanzone
- External Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| | - Jun Wang
- Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| | - Michael Hayes
- Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| | - May Kong
- Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| | - Thomas J. Greshock
- Discovery Chemistry, Merck & Co., Inc., 213. E. Grand Avenue, South San Francisco, California 94080, United States
| |
Collapse
|
4
|
Guintu C, Kwok M, Hanlon JJ, Spalding TA, Wolff K, Yin H, Kuhen K, Sasher K, Calvin P, Jiang S, Zhou Y, Isbell JJ. Just-in-Time Purification: An Effective Solution for Cherry-Picking and Purifying Active Compounds from Large Legacy Libraries. ACTA ACUST UNITED AC 2016; 11:933-9. [PMID: 17092919 DOI: 10.1177/1087057106294289] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Many companies possess a compound collection consisting of purified compounds and of unpurified products from combinatorial libraries. Using commercial and proprietary compounds as examples, this report provides clear examples of the significant impact purification can have on the activity observed for a compound and highlights the need to retest the purified compounds prior to creating structure-activity relationships. Crude mixtures made with commercial compounds led to an increase in the number of false positives in the SXR-GAL4 assay as compared with their pure and purified counterparts. An examination of proprietary compounds in an HIV assay resulted in the purification of 61 active crude synthetic mixtures. Of these 61 compounds, 32 were 5-fold less active and 2 were 5-fold more active after purification. This report details a semiautomated process developed and implemented for cherry-picking, tracking, and selectively purifying compounds found active in high-throughput screening campaigns.
Collapse
Affiliation(s)
- Christina Guintu
- Genomics Institute of the Novartis Research Foundation, San Diego, California 92121, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Liu M, Chen K, Christian D, Fatima T, Pissarnitski N, Streckfuss E, Zhang C, Xia L, Borges S, Shi Z, Vachal P, Tata J, Athanasopoulos J. High-throughput purification platform in support of drug discovery. ACS COMBINATORIAL SCIENCE 2012; 14:51-9. [PMID: 22032344 DOI: 10.1021/co200138h] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The application of parallel synthesis is an efficient approach to explore the chemical space and to rapidly develop meaningful structure activity relationship (SAR) data for drug discovery programs. However, the effectiveness of the parallel synthesis requires a high throughput purification workflow that can process a large number of crude samples within a meaningful time frame. This paper describes a high throughput purification platform that has been adopted at Merck's Rahway research site. The platform includes the evaluation of crude samples, mass-directed HPLC purification, fraction analysis, compound registration, final compound purity assessment and assay distribution. Assisting with the sample tracking and the data management is the internally designed laboratory information management system, Light Automation Framework (LAF). Using this process and the tools described herein, the group has successfully achieved purities of 95% or greater for 90% of samples.
Collapse
Affiliation(s)
- Min Liu
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Kuanchang Chen
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Denny Christian
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Tazeen Fatima
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | | | - Eric Streckfuss
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Chaowei Zhang
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Lei Xia
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Scott Borges
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Zhicai Shi
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Petr Vachal
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | - James Tata
- Merck & Company, 126 Lincoln Ave, Rahway, New Jersey 07065, United States
| | | |
Collapse
|
6
|
Janzen WP, Popa-Burke IG. Advances in improving the quality and flexibility of compound management. ACTA ACUST UNITED AC 2009; 14:444-51. [PMID: 19483148 DOI: 10.1177/1087057109335262] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The process of drug discovery has evolved considerably since the advent of high-throughput screening (HTS) in the 1980s. Experts and opinion leaders today are agreeing that the current trend in the field is a focus on increasing overall quality (target, screening, and compounds), use of multiple screening approaches for lead discovery, and more flexibility in the process. The associated need for increased flexibility and quality control to support existing HTS paradigms as well as lower throughput approaches such as fragment screening, computational chemistry, focused library building, and centralized lead optimization support has required an evolution in compound management (CM, aka sample management or library management). Although there is much less published peer-reviewed data in CM, due to its historical links to HTS, it has followed very similar trends. In recent years, the focus in CM has been increasingly in compound quality and increased flexibility of the process, as opposed to number of compounds dispensed and speed of dispensing, which were standard metrics and indicators used not so long ago. Ideally, to screen the highest quality sample for every assay, one would start with a correct identity and pure solid, make a correct concentration solution in water or water-soluble/assay-compatible solvent that would allow 100% solubilization, and screen it immediately in a biological assay. Neither CM nor screening has advanced sufficiently to deliver this ideal scenario, but many significant advancements have been made in recent years both in terms of quality of compounds in stores and flexibility of the process, which will be reviewed herein.
Collapse
Affiliation(s)
- William P Janzen
- Center for Integrative Chemical Biology and Drug Discovery, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, USA
| | | |
Collapse
|
7
|
Scott WL, O'Donnell MJ. Distributed Drug Discovery, Part 1: linking academia and combinatorial chemistry to find drug leads for developing world diseases. JOURNAL OF COMBINATORIAL CHEMISTRY 2009; 11:3-13. [PMID: 19105724 PMCID: PMC2651689 DOI: 10.1021/cc800183m] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2008] [Indexed: 02/08/2023]
Affiliation(s)
- William L Scott
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202-3274, USA.
| | | |
Collapse
|
8
|
Guth O, Krewer D, Freudenberg B, Paulitz C, Hauser M, Ilg K. Automated Modular Preparative HPLC-MS Purification Laboratory with Enhanced Efficiency. ACTA ACUST UNITED AC 2008; 10:875-82. [DOI: 10.1021/cc800095v] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Oliver Guth
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| | - Dietmar Krewer
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| | - Björn Freudenberg
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| | - Christian Paulitz
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| | - Manfred Hauser
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| | - Kerstin Ilg
- Bayer CropScience AG, D-40789 Monheim, and D-65926 Frankfurt am Main, Germany
| |
Collapse
|
9
|
Espada A, Molina-Martin M, Dage J, Kuo MS. Application of LC/MS and related techniques to high-throughput drug discovery. Drug Discov Today 2008; 13:417-23. [DOI: 10.1016/j.drudis.2008.03.005] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2008] [Revised: 03/06/2008] [Accepted: 03/10/2008] [Indexed: 11/16/2022]
|
10
|
Zhou J, Wang YH, Chu J, Gou BQ, Zhuang YP, Zhang SL, Yuan ZY. Penicillin G acylase purification with the aid of high-throughput screening approach. ACTA ACUST UNITED AC 2008. [DOI: 10.1016/j.jcice.2007.12.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
11
|
Isbell J. Changing Requirements of Purification as Drug Discovery Programs Evolve from Hit Discovery. ACTA ACUST UNITED AC 2008; 10:150-7. [DOI: 10.1021/cc700152v] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- John Isbell
- Genomics Institute of the Novartis Research Foundation (GNF), 10675 John Jay Hopkins Drive, San Diego, California 92121
| |
Collapse
|
12
|
Koppitz M, Eis K. Automated medicinal chemistry. Drug Discov Today 2006; 11:561-8. [PMID: 16713909 DOI: 10.1016/j.drudis.2006.04.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2005] [Revised: 03/17/2006] [Accepted: 04/04/2006] [Indexed: 10/24/2022]
Abstract
With the advent of high throughput technologies in biological screening in the 1980s, providing sufficient numbers of small molecules for screening became a bottleneck in the drug discovery process. Combinatorial chemistry was the first attempt by chemists to address this issue. However, since its first applications, combinatorial chemistry has evolved rapidly into diverse fields. This review will focus on the evolution and the current status of what we refer to today as automated medicinal chemistry.
Collapse
Affiliation(s)
- Marcus Koppitz
- Schering AG, Medicinal Chemistry, 13342 Berlin, Germany.
| | | |
Collapse
|
13
|
Zhang X, Picariello W, Hosein N, Towle M, Goetzinger W. A systematic investigation of recovery in preparative reverse phase high performance liquid chromatography/mass spectrometry. J Chromatogr A 2006; 1119:147-55. [PMID: 16387320 DOI: 10.1016/j.chroma.2005.12.038] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2005] [Revised: 12/01/2005] [Accepted: 12/02/2005] [Indexed: 10/25/2022]
Abstract
In this paper we report a systematic recovery study based on reversed phase high performance liquid chromatography (RP-HPLC) separation and mass spectrometric (MS) based fractionation. Factors including a compound's physicochemical properties, column mass loading and presence of impurities were investigated through commercially available compounds. Results suggest that the delay time between MS peak detection and fraction collection, fraction detector's signal-to-noise ratio and compound's base peak width in the chromatogram have the biggest impacts on purification recovery. In an effort to assess sample recovery within our high throughput purification process, re-purification was performed on four compound libraries that were synthesized in-house. Reproducible recoveries (>80%) were achieved in all tests.
Collapse
Affiliation(s)
- Xu Zhang
- ArQule, Inc., Woburn, MA 01801, USA
| | | | | | | | | |
Collapse
|
14
|
Ari N, Westling L, Isbell J. Cherry-Picking in an Orchard: Unattended LC/MS Analysis from an Autosampler with >32,000 Samples Online. ACTA ACUST UNITED AC 2006; 11:318-22. [PMID: 16490775 DOI: 10.1177/1087057105285837] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The 1536-well microplate format has widely supplanted the 384-well microplate format for high-throughput screening and for IC50 assays. Previously, liquid chromatography/mass spectrometry (LC/MS) analyses of such samples required manual transfers of the wells of interest from a 1536-well plate into a 384-well plate. Because this manual transfer introduced a source of potential error, it became clear that amore appropriate solutionwould be to sample directly from the 1536-well plates. Currently, commercially available 1536-well plate autosamplers are not compatible with Waters LC/MS systems. The authors have modified their CTC PAL autosampler to support injection from up to twenty-four 1536-well plates. This allows them to cherry-pick any sample from up to 36,864 wells on the autosampler. Because of its success at this Institute, sampling from 1536-well plates has not only become the preferredmethod for LC/MS analysis from IC50 plates but also become the standard format used for the handling of and the sampling from large combinatorial libraries.
Collapse
Affiliation(s)
- Nader Ari
- The Genomics Institute of the Novartis Research Foundation, San Diego, California 92121, USA.
| | | | | |
Collapse
|
15
|
Current literature in mass spectrometry. JOURNAL OF MASS SPECTROMETRY : JMS 2005; 40:1390-401. [PMID: 16237664 DOI: 10.1002/jms.811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
|