1
|
Boelrijk J, Molenaar SRA, Bos TS, Dahlseid TA, Ensing B, Stoll DR, Forré P, Pirok BWJ. Enhancing LC×LC separations through multi-task Bayesian optimization. J Chromatogr A 2024; 1726:464941. [PMID: 38749274 DOI: 10.1016/j.chroma.2024.464941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 04/17/2024] [Accepted: 04/22/2024] [Indexed: 05/23/2024]
Abstract
Method development in comprehensive two-dimensional liquid chromatography (LC×LC) is a challenging process. The interdependencies between the two dimensions and the possibility of incorporating complex gradient profiles, such as multi-segmented gradients or shifting gradients, make trial-and-error method development time-consuming and highly dependent on user experience. Retention modeling and Bayesian optimization (BO) have been proposed as solutions to mitigate these issues. However, both approaches have their strengths and weaknesses. On the one hand, retention modeling, which approximates true retention behavior, depends on effective peak tracking and accurate retention time and width predictions, which are increasingly challenging for complex samples and advanced gradient assemblies. On the other hand, Bayesian optimization may require many experiments when dealing with many adjustable parameters, as in LC×LC. Therefore, in this work, we investigate the use of multi-task Bayesian optimization (MTBO), a method that can combine information from both retention modeling and experimental measurements. The algorithm was first tested and compared with BO using a synthetic retention modeling test case, where it was shown that MTBO finds better optima with fewer method-development iterations than conventional BO. Next, the algorithm was tested on the optimization of a method for a pesticide sample and we found that the algorithm was able to improve upon the initial scanning experiments. Multi-task Bayesian optimization is a promising technique in situations where modeling retention is challenging, and the high number of adjustable parameters and/or limited optimization budget makes traditional Bayesian optimization impractical.
Collapse
Affiliation(s)
- Jim Boelrijk
- AI4Science Lab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands; AMLab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands.
| | - Stef R A Molenaar
- Division of Bioanalytical Chemistry, Amsterdam Institute of Molecular and Life Sciences, De Boelelaan 1085, Amsterdam, 1081 HV, The Netherlands; Analytical Chemistry Group, Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands
| | - Tijmen S Bos
- Division of Bioanalytical Chemistry, Amsterdam Institute of Molecular and Life Sciences, De Boelelaan 1085, Amsterdam, 1081 HV, The Netherlands; Analytical Chemistry Group, Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands
| | - Tina A Dahlseid
- Department of Chemistry, Gustavus Adolphus College, Saint Peter, MN 56082, United States
| | - Bernd Ensing
- AI4Science Lab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands; Computational Chemistry Group, Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands
| | - Dwight R Stoll
- Department of Chemistry, Gustavus Adolphus College, Saint Peter, MN 56082, United States
| | - Patrick Forré
- AI4Science Lab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands; AMLab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands
| | - Bob W J Pirok
- AI4Science Lab, Informatics Institute, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands; Analytical Chemistry Group, Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, Science Park 904, 1098 XH, The Netherlands.
| |
Collapse
|
2
|
Kensert A, Desmet G, Cabooter D. A perspective on the use of deep deterministic policy gradient reinforcement learning for retention time modeling in reversed-phase liquid chromatography. J Chromatogr A 2024; 1713:464570. [PMID: 38101304 DOI: 10.1016/j.chroma.2023.464570] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/04/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023]
Abstract
Artificial intelligence and machine learning techniques are increasingly used for different tasks related to method development in liquid chromatography. In this study, the possibilities of a reinforcement learning algorithm, more specifically a deep deterministic policy gradient algorithm, are evaluated for the selection of scouting runs for retention time modeling. As a theoretical exercise, it is investigated whether such an algorithm can be trained to select scouting runs for any compound of interest allowing to retrieve its correct retention parameters for the three-parameter Neue-Kuss retention model. It is observed that three scouting runs are generally sufficient to retrieve the retention parameters with an accuracy (mean relative percentage error MRPE) of 1 % or less. When given the opportunity to select additional scouting runs, this does not lead to a significantly improved accuracy. It is also observed that the agent tends to give preference to isocratic scouting runs for retention time modeling, and is only motivated towards selecting gradient scouting runs when penalized (strongly) for large analysis/gradient times. This seems to reinforce the general power and usefulness of isocratic scouting runs for retention time modeling. Finally, the best results (lowest MRPE) are obtained when the agent manages to retrieve retention time data for % ACN at elution of the compound under consideration that spread the entire relevant range of ACN (5 % ACN to 95 % ACN) as well as possible, i.e., resulting in retention data at a low, intermediate and high % ACN. Based on the obtained results, we believe reinforcement learning holds great potential to automate and rationalize method development in liquid chromatography in the future.
Collapse
Affiliation(s)
- Alexander Kensert
- University of Leuven (KU Leuven), Department for Pharmaceutical and Pharmacological Sciences, Pharmaceutical Analysis, Herestraat 49, 3000 Leuven, Belgium; Vrije Universiteit Brussel, Department of Chemical Engineering, Pleinlaan 2, 1050 Brussel, Belgium
| | - Gert Desmet
- Vrije Universiteit Brussel, Department of Chemical Engineering, Pleinlaan 2, 1050 Brussel, Belgium
| | - Deirdre Cabooter
- University of Leuven (KU Leuven), Department for Pharmaceutical and Pharmacological Sciences, Pharmaceutical Analysis, Herestraat 49, 3000 Leuven, Belgium.
| |
Collapse
|
3
|
Rutan SC, Cash K, Stoll DR. Experimental design and re-parameterization of the Neue-Kuss model for accurate and precise prediction of isocratic retention factors from gradient measurements in reversed phase liquid chromatography. J Chromatogr A 2023; 1711:464443. [PMID: 37890376 DOI: 10.1016/j.chroma.2023.464443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 10/06/2023] [Accepted: 10/09/2023] [Indexed: 10/29/2023]
Abstract
The present work describes a re-parameterization of the Neue Kuss (NK) model for describing retention in liquid chromatography, and this re-parameterized model is used to fit a large set of isocratic retention measurements with improved convergence properties relative to the original parameterization of the model. Next, an experimental design for retention measurements using mobile phase gradient elution conditions is proposed for the purpose of obtaining accurate and precise NK parameters. Simulated retention data for mobile phase gradient elution conditions with two different levels of noise, as well as an essentially zero noise level were fit with the re-parameterized model. The results showed that the re-parameterized fits yielded average (absolute value) prediction errors for the parameters at the highest noise level of 7.2 % for S1,ref, 18 % for S2,ref and 6.2 % for kref (the re-parameterized NK model parameters). These errors were significantly smaller than those for the original parameterization of the NK model, where the errors were 23 % for S1, 25 % for S2 and 160 % for kw (the original NK model parameters). Furthermore, isocratic retention factors predicted using these model parameters were found to have an average magnitude of error of 0.51 % for the re-parameterized model, as opposed to 6800 % for the model with the original parameterization. A further test of this approach was carried out for independent experimental measurements for five solutes on a C18 column. The average magnitude of error of the isocratic retention factors predicted from parameters obtained from fits of gradient data was 1.6 %, provided that the range of organic solvent compositions that the solute sampled in the mobile phase gradient experiments was consistent with the isocratic experiments. These results indicate that the re-parameterization of the NK model allows for significant improvements in the fitting process, and that the proposed experimental design allows for NK parameters to be extracted from mobile phase gradient experiments, with prediction accuracies of isocratic retention factors on the order of 1-2 %.
Collapse
Affiliation(s)
- Sarah C Rutan
- Department of Chemistry, Box 842006, Virginia Commonwealth University, Richmond, VA 23284-2006, USA.
| | - Kathryn Cash
- Department of Chemistry, Gustavus Adolphus College, 800 West College Avenue, Saint Peter, MN 56082, USA
| | - Dwight R Stoll
- Department of Chemistry, Gustavus Adolphus College, 800 West College Avenue, Saint Peter, MN 56082, USA
| |
Collapse
|
4
|
Performance of global retention models in the optimisation of the chromatographic separation (I): Simple multi-analyte samples. J Chromatogr A 2023; 1689:463756. [PMID: 36610184 DOI: 10.1016/j.chroma.2022.463756] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 12/23/2022] [Accepted: 12/23/2022] [Indexed: 12/31/2022]
Abstract
Conventional retention models lead to accurate descriptions of the elution behaviour from the fitting of data for single solutes or from a set of solutes, one by one. However, the simultaneous fitting of several solutes through a regression process that separates the contributions of column and solvent from those of each solute is also possible. The result is a global retention model constituted by a set of equations with some common parameters (those associated with column and solvent), whereas others, specific to each solute, differ for each equation. This work explores the possibilities, advantages, and limitations of global models when they are applied to the optimisation of chromatographic resolution. A set constituted by 13 drugs (diuretics and β-blockers) and a training experimental design of seven multi-linear gradients are considered. Since standards for all compounds were available, the optimisation based on global models could be compared with the conventional optimisation, which is based on individual models. In their current state, global models do not predict changes in elution order, but they do allow for incorporating additional solutes (e.g., new analytes or matrix peaks) with only one new experiment. This possibility is explored by extending the model for the 13 analytes to include 26 peaks associated with a contamination in the injector. The combination of individual and global models allows an optimisation where the effects of matrix peaks on the separation of analytes can be integrated.
Collapse
|
5
|
den Uijl MJ, Roeland T, Bos TS, Schoenmakers PJ, van Bommel MR, Pirok BW. Assessing the feasibility of stationary-phase-assisted modulation for two-dimensional liquid-chromatography separations. J Chromatogr A 2022; 1679:463388. [DOI: 10.1016/j.chroma.2022.463388] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 07/18/2022] [Accepted: 07/21/2022] [Indexed: 02/06/2023]
|
6
|
Chromatographic fingerprint-based analysis of extracts of green tea, lemon balm and linden: I. Development of global retention models without the use of standards. J Chromatogr A 2022; 1672:463060. [DOI: 10.1016/j.chroma.2022.463060] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 04/09/2022] [Accepted: 04/12/2022] [Indexed: 11/17/2022]
|
7
|
Brau T, Pirok B, Rutan S, Stoll D. Accuracy of retention model parameters obtained from retention data in liquid chromatography. J Sep Sci 2022; 45:3241-3255. [PMID: 35304809 DOI: 10.1002/jssc.202100911] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 03/02/2022] [Accepted: 03/14/2022] [Indexed: 11/10/2022]
Abstract
In liquid chromatography (LC), it is often very useful to have an accurate model of the retention factor, k, over a wide range of isocratic elution conditions. In principle, the parameters of a retention model can be obtained by fitting either isocratic or gradient retention factor data. However, in spite of many of our own attempts to accurately predict isocratic k values using retention models trained with gradient retention data, this has not worked in our hands. In the present study we have used synthetic isocratic and gradient retention data for small molecules under reversed-phase LC conditions. This allows us to discover challenges associated with predicting isocratic k's without the confounding influences of experimental issues that are difficult to model or eliminate. The results indicate that it is not currently possible to consistently predict isocratic retention factors for small molecules with accuracies better than 10%, even when using synthetic gradient retention data. Two distinct challenges in fitting gradient retention data were identified: 1) a lack of 'uniqueness' in the parameters; and 2) an inability to find the global optimum fit in a complex fitting landscape. Working with experimental data where measurement noise is unavoidable will only make the accuracy worse. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
| | - Bob Pirok
- Gustavus Adolphus College.,Van 't Hoff Institute for Molecular Sciences
| | | | | |
Collapse
|
8
|
Gisbert-Alonso A, Navarro-Huerta JA, Torres-Lapasió JR, García-Alvarez-Coque MC. Testing experimental designs in liquid chromatography (II): Influence of the design geometry on the prediction performance of retention models. J Chromatogr A 2021; 1654:462458. [PMID: 34399141 DOI: 10.1016/j.chroma.2021.462458] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 07/26/2021] [Accepted: 08/02/2021] [Indexed: 11/25/2022]
Abstract
In liquid chromatography, the reliability of predictions carried out with retention models depends critically on the quality of the training experimental design. The search of the best design is more complex when gradient runs are used instead of isocratic experiments. In Part I of this work (JCA 1624 (2020) 461180), a general methodology based on the error propagation theory was developed and validated for assessing the quality of training designs involving gradients. The treatment relates the mathematical properties of a retention model with the geometry of the training designs and their subsequent predictions. In that work, only five usual designs were considered. Part II investigates in detail the effects on predictions when the features of the training design (number and distribution of the experiments, initial and final modifier content, gradient slope(s), and location of gradient nodes and pulses) are varied systematically. Several groups of related designs containing one or more isocratic steps, linear or multi-linear gradients, or mixed isocratic/gradient runs, among others (in total 38 designs) were evaluated. Box and whiskers and triple plots of expected relative uncertainties were used to evidence the differences in prediction performance. The purpose was to give recommendations to construct designs with good prediction performance. The best designs sample (considering all runs) concentrations as diverse as possible, at any gradient time.
Collapse
Affiliation(s)
- A Gisbert-Alonso
- Department of Analytical Chemistry, Faculty of Chemistry, Universitat de València, C/ Dr. Moliner 50, 46100 Burjassot, Spain
| | - J A Navarro-Huerta
- Department of Analytical Chemistry, Faculty of Chemistry, Universitat de València, C/ Dr. Moliner 50, 46100 Burjassot, Spain
| | - J R Torres-Lapasió
- Department of Analytical Chemistry, Faculty of Chemistry, Universitat de València, C/ Dr. Moliner 50, 46100 Burjassot, Spain.
| | - M C García-Alvarez-Coque
- Department of Analytical Chemistry, Faculty of Chemistry, Universitat de València, C/ Dr. Moliner 50, 46100 Burjassot, Spain
| |
Collapse
|
9
|
Rutan SC, Jeong LN, Carr PW, Stoll DR, Weber SG. Closed form approximations to predict retention times and peak widths in gradient elution under conditions of sample volume overload and sample solvent mismatch. J Chromatogr A 2021; 1653:462376. [PMID: 34293516 DOI: 10.1016/j.chroma.2021.462376] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 06/24/2021] [Accepted: 06/26/2021] [Indexed: 10/21/2022]
Abstract
Closed form expressions for the prediction of retention times and peak widths for gradient liquid chromatography are particularly useful in understanding, rationalizing and optimizing separations. These expressions are obtained by integrating differential equations, in conjunction with a model of the variation of the retention factor as a function of mobile phase composition. Two of these models, the linear solvent strength (LSS) model and the Neue-Kuss (NK) model are explored in the present work. Here, we expand on these closed form expressions to account for effects of sample volume overload and a mismatch between the sample solvent and the initial mobile phase composition for the gradient. We show that there have been errors in expressions reported in the literature, and we have evaluated the accuracy of the predictions from the closed form expressions reported here using a recently developed liquid chromatography simulator. The expressions assume a constant plate height and consider elution across four zones of the gradient profile - elution in the sample solvent, elution in the initial (isocratic) mobile phase caused by the gradient delay volume, elution during a linear gradient, and elution post-gradient at the final (isocratic) mobile phase composition. The expressions generally give reasonably accurate predictions for retention times and peak widths, except for cases where the solute elutes during transitions between the different zones. The average magnitude of the prediction errors for retention time and peak width relative to simulation were 0.093% and 0.40% for the LSS expressions for ten amphetamine solutes at 36 different separation conditions, and 0.22% and 1.8% for the NK expressions for eight alkylbenzene solutes at 36 different separation conditions, respectively.
Collapse
Affiliation(s)
- Sarah C Rutan
- Department of Chemistry, Box 842006, Virginia Commonwealth University, Richmond, VA 23284-2006, USA.
| | - Lena N Jeong
- Department of Chemistry, Box 842006, Virginia Commonwealth University, Richmond, VA 23284-2006, USA
| | - Peter W Carr
- Department of Chemistry, Smith and Kolthoff Halls, University of Minnesota, 207 Pleasant Street SE, Minneapolis, MN 55455, USA
| | - Dwight R Stoll
- Department of Chemistry, Gustavus Adolphus College, 800 West College Avenue, Saint Peter, MN 56082, USA
| | - Stephen G Weber
- Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, PA, 15260, USA
| |
Collapse
|
10
|
Comparison of the Fitting Performance of Retention Models and Elution Strength Behaviour in Hydrophilic-Interaction and Reversed-Phase Liquid Chromatography. SEPARATIONS 2021. [DOI: 10.3390/separations8040054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Hydrophilic interaction liquid chromatography (HILIC) is able to separate from polar to highly polar solutes, using similar eluents to those in the reversed-phase mode (RPLC) and a polar stationary phase, where water is adsorbed onto its surface. It is widely accepted that multiple modes of interaction take place in the HILIC environment, which can be far more complex than the interactions in an RPLC column. The behaviour in HILIC should be adequately modelled to predict the retention with optimisation purposes and improve the understanding on retention mechanisms, as is the case for RPLC. In this work, the prediction performance of several retention models is studied for seven HILIC columns (underivatised silica, and silica containing diol, amino and sulfobetaine functional groups, together with three columns recently manufactured with neutral, anionic, and cationic character), using uracil and six polar nucleosides (adenosine, cytidine, guanosine, thymidine, uridine, and xanthosine) as probe compounds. The results in HILIC are compared with those that were offered by the elution of several polar sulphonamides and diuretics analysed with two C18 columns (Chromolith Speed ROD and Zorbax Eclipse XDB). It is shown that eight retention models, which only consider partitioning or both partitioning and adsorption, give similar good accuracy in predictions for both HILIC and RPLC columns. However, the study on the elution strength behaviour, at varying mobile phase composition, reveals similarities (or differences) between RPLC and HILIC columns of diverse nature. The particular behaviour for the HILIC and RPLC columns was also revealed when the retention, in both modes, was fitted to a model that describes the change in the elution strength with the modifier concentration.
Collapse
|