1
|
Gaida M, Stefanuto PH, Focant JF. Theoretical modeling and machine learning-based data processing workflows in comprehensive two-dimensional gas chromatography-A review. J Chromatogr A 2023; 1711:464467. [PMID: 37871505 DOI: 10.1016/j.chroma.2023.464467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 10/15/2023] [Accepted: 10/17/2023] [Indexed: 10/25/2023]
Abstract
In recent years, comprehensive two-dimensional gas chromatography (GC × GC) has been gradually gaining prominence as a preferred method for the analysis of complex samples due to its higher peak capacity and resolution power compared to conventional gas chromatography (GC). Nonetheless, to fully benefit from the capabilities of GC × GC, a holistic approach to method development and data processing is essential for a successful and informative analysis. Method development enables the fine-tuning of the chromatographic separation, resulting in high-quality data. While generating such data is pivotal, it does not necessarily guarantee that meaningful information will be extracted from it. To this end, the first part of this manuscript reviews the importance of theoretical modeling in achieving good optimization of the separation conditions, ultimately improving the quality of the chromatographic separation. Multiple theoretical modeling approaches are discussed, with a special focus on thermodynamic-based modeling. The second part of this review highlights the importance of establishing robust data processing workflows, with a special emphasis on the use of advanced data processing tools such as, Machine Learning (ML) algorithms. Three widely used ML algorithms are discussed: Random Forest (RF), Support Vector Machine (SVM), and Partial Least Square-Discriminate Analysis (PLS-DA), highlighting their role in discovery-based analysis.
Collapse
Affiliation(s)
- Meriem Gaida
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Pierre-Hugues Stefanuto
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Jean-François Focant
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| |
Collapse
|
2
|
Retention Indices for Naturally-Occurring Chiral and Achiral Compounds on Common Gas Chromatography Chiral Stationary Phases. RESULTS IN CHEMISTRY 2022. [DOI: 10.1016/j.rechem.2022.100659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
3
|
Mao H, Jiang M. Modeling of the first dimensional peak with two modulated sub-peaks in comprehensive two-dimensional gas chromatography. Anal Bioanal Chem 2022; 415:2425-2434. [PMID: 35915249 DOI: 10.1007/s00216-022-04245-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 07/04/2022] [Accepted: 07/21/2022] [Indexed: 11/25/2022]
Abstract
According to previous published works, precise modeling of the first dimensional (1D) peak in comprehensive two-dimensional gas chromatography (GC × GC) requires at least 3 modulated sub-peaks (MSP). This requirement is sometimes difficult to meet, e.g., in case of undersampling modulation. In the present work, the feasibility of modeling of the 1D peak with only 2 MSP was demonstrated. The effects of modulation phase (ϕ), modulation period (PM), the peak width (1σ), and the peak shape of the original 1D peak on the accuracy of the proposed method were explored. When employing PM ranging from 6 s ~ 3 s to modulate original peaks with 1σ = 1.2 s ~ 0.6 s, the maximal error of the modeled 1tR is 1.08 s, which is far less than the error generated by employing the largest MSP to estimate the 1tR. The deviation of modeled 1tR increases with the increase of peak shape distortion, and this deviation is ≤ 0.67 s when tailing factor (Tf) in the range of 0.8 to 1.5. The application of the proposed method was demonstrated by assisting identification of a monoterpene in Myrrh sample. The proposed approach could improve the accuracy in calculation of 1tR or 1I and enhance the reliability of compound identification in GC × GC analysis with undersampling modulation.
Collapse
Affiliation(s)
- Hui Mao
- School of Information Engineering, Wuhan Business University, #816 Dongfeng Avenue, Wuhan, Hubei, 430010, People's Republic of China
| | - Ming Jiang
- School of Pharmacy, Tongji Medical College, Huazhong University of Science & Technology, #13 Hangkong Road, Wuhan, Hubei, 430030, People's Republic of China.
| |
Collapse
|
4
|
A unique data analysis framework and open source benchmark data set for the analysis of comprehensive two-dimensional gas chromatography software. J Chromatogr A 2020; 1635:461721. [PMID: 33246680 DOI: 10.1016/j.chroma.2020.461721] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Revised: 11/05/2020] [Accepted: 11/09/2020] [Indexed: 12/28/2022]
Abstract
Comprehensive two-dimensional gas chromatography (GC × GC) is amongst the most powerful separation technologies currently existing. Since its advent in early 1990, it has become an established method which is readily available. However, one of its most challenging aspects, especially in hyphenation with mass spectrometry is the high amount of chemical information it provides for each measurement. The GC × GC community agrees that there, the highest demand for action is found. In response, the number of software packages allowing for in-depth data processing of GC × GC data has risen over the last couple of years. These packages provide sophisticated tools and algorithms allowing for more streamlined data evaluation. However, these tools/algorithms and their respective specific functionalities differ drastically within the available software packages and might result in various levels of findings if not appropriately implemented by the end users. This study focuses on two main objectives. First, to propose a data analysis framework and second to propose an open-source dataset for benchmarking software options and their specificities. Thus, allowing for an unanimous and comprehensive evaluation of GC × GC software. Thereby, the benchmark data includes a set of standard compound measurements and a set of chocolate aroma profiles. On this foundation, eight readily available GC × GC software packages were anonymously investigated for fundamental and advanced functionalities such as retention and detection device derived parameters, revealing differences in the determination of e.g. retention times and mass spectra.
Collapse
|
5
|
Boegelsack N, Sandau C, McMartin DW, Withey JM, O'Sullivan G. Development of retention time indices for comprehensive multidimensional gas chromatography and application to ignitable liquid residue mapping in wildfire investigations. J Chromatogr A 2020; 1635:461717. [PMID: 33254004 DOI: 10.1016/j.chroma.2020.461717] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 11/07/2020] [Accepted: 11/09/2020] [Indexed: 01/07/2023]
Abstract
In this study, we introduce a simple three-step workflow for a universally applicable RI system, to be used in GC×GC analysis of ignitable liquid residue (ILR) for arson investigations. The proposed RI system applies a combination of two well-established GC RI systems: non-isothermal Kovats (K) index in the first dimension and Lee (L) index in the second dimension. The proposed KLI RI system showed very good correlations when compared with predicted values and existing RI systems (r2 = 0.97 in first dimension, r2 = 0.99 in second dimension) and was valid for a wide range of analyte concentrations and operational settings (coefficient of variance (CV) < 1% in first dimension, < 10% in second dimension). Using the KLI RI, an ILR classification contour map was created to assist with the identification of ILR types within ASTM E1618. The contour map was successfully applied to neat fuels and a fire scene sample, highlighting the application to wildfire investigation. Standardizing the RI process and establishing acceptable error margins allows the exploration and comparison of comprehensive data generated from GC×GC analysis of ILRs regardless of location, time, or system, further enhancing comprehensive and tenable chemometric analyses of samples. Overall, the KLI workflow was inexpensive, quick to apply, and user-friendly with its simple 3-step design.
Collapse
Affiliation(s)
- Nadin Boegelsack
- Department of Earth and Environmental Sciences, Mount Royal University, 4825 Mount Royal Gate SW, Calgary, AB Canada, T3E 6K6; Department of Civil, Geological and Environmental Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK Canada, S7N 5A9.
| | - Court Sandau
- Department of Earth and Environmental Sciences, Mount Royal University, 4825 Mount Royal Gate SW, Calgary, AB Canada, T3E 6K6; Chemistry Matters Inc., 104-1240 Kensington Rd NW Suite 405, Calgary, AB Canada, T2N 3P7
| | - Dena W McMartin
- Department of Civil, Geological and Environmental Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK Canada, S7N 5A9
| | - Jonathan M Withey
- Department of Chemistry and Physics, Mount Royal University, 4825 Mount Royal Gate SW, Calgary, AB Canada, T3E 6K6
| | - Gwen O'Sullivan
- Department of Earth and Environmental Sciences, Mount Royal University, 4825 Mount Royal Gate SW, Calgary, AB Canada, T3E 6K6
| |
Collapse
|
6
|
Jaramillo R, Dorman FL. Thermodynamic modeling of comprehensive two dimensional gas chromatography isovolatility curves for second dimension retention indices based analyte identification. J Chromatogr A 2020; 1622:461111. [DOI: 10.1016/j.chroma.2020.461111] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Revised: 04/02/2020] [Accepted: 04/03/2020] [Indexed: 10/24/2022]
|
7
|
Veenaas C, Ripszam M, Haglund P. Analysis of volatile organic compounds in indoor environments using thermal desorption with comprehensive two-dimensional gas chromatography and high-resolution time-of-flight mass spectrometry. J Sep Sci 2020; 43:1489-1498. [PMID: 32052921 DOI: 10.1002/jssc.201901103] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 01/16/2020] [Accepted: 02/10/2020] [Indexed: 01/31/2023]
Abstract
Building-related health effects are frequently observed. Several factors have been listed as possible causes including temperature, humidity, light conditions, presence of particulate matter, and microorganisms or volatile organic compounds. To be able to link exposure to specific volatile organic compounds to building-related health effects, powerful and comprehensive analytical methods are required. For this purpose, we developed an active air sampling method that utilizes dual-bed tubes loaded with TENAX-TA and Carboxen-1000 adsorbents to sample two parallel air samples of 4 L each. For the comprehensive volatile organic compounds analysis, an automated thermal desorption comprehensive two-dimensional gas chromatography high-resolution time-of-flight mass spectrometry method was developed and used. It allowed targeted analysis of approximately 90 known volatile organic compounds with relative standard deviations below 25% for the vast majority of target volatile organic compounds. It also allowed semiquantification (no matching standards) of numerous nontarget air contaminants using the same data set. The nontarget analysis workflow included peak finding, background elimination, feature alignment, detection frequency filtering, and tentative identification. Application of the workflow to air samples from 68 indoor environments at a large hospital complex resulted in a comprehensive volatile organic compound characterization, including 178 single compounds and 13 hydrocarbon groups.
Collapse
Affiliation(s)
| | | | - Peter Haglund
- Department of Chemistry, Umeå University, Umeå, Sweden
| |
Collapse
|
8
|
|
9
|
Amaral MSS, Nolvachai Y, Marriott PJ. Comprehensive Two-Dimensional Gas Chromatography Advances in Technology and Applications: Biennial Update. Anal Chem 2019; 92:85-104. [DOI: 10.1021/acs.analchem.9b05412] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Michelle S. S. Amaral
- Australian Centre for Research on Separation Science, School of Chemistry, Monash University, Wellington Road, Clayton, Victoria 3800, Australia
| | - Yada Nolvachai
- Australian Centre for Research on Separation Science, School of Chemistry, Monash University, Wellington Road, Clayton, Victoria 3800, Australia
| | - Philip J. Marriott
- Australian Centre for Research on Separation Science, School of Chemistry, Monash University, Wellington Road, Clayton, Victoria 3800, Australia
| |
Collapse
|
10
|
Jiang M. Facile Approach for Calculation of Second Dimensional Retention Indices in Comprehensive Two Dimensional Gas Chromatography with Single Injection. Anal Chem 2019; 91:4085-4091. [DOI: 10.1021/acs.analchem.8b05717] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Ming Jiang
- School of Pharmacy, Tongji Medical College, Huazhong University of Science & Technology, #13 Hangkong Road, Wuhan, Hubei 430030, PR China
| |
Collapse
|
11
|
The strength in numbers: comprehensive characterization of house dust using complementary mass spectrometric techniques. Anal Bioanal Chem 2019; 411:1957-1977. [PMID: 30830245 PMCID: PMC6458998 DOI: 10.1007/s00216-019-01615-6] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Revised: 12/20/2018] [Accepted: 01/15/2019] [Indexed: 11/18/2022]
Abstract
Untargeted analysis of a composite house dust sample has been performed as part of a collaborative effort to evaluate the progress in the field of suspect and nontarget screening and build an extensive database of organic indoor environment contaminants. Twenty-one participants reported results that were curated by the organizers of the collaborative trial. In total, nearly 2350 compounds were identified (18%) or tentatively identified (25% at confidence level 2 and 58% at confidence level 3), making the collaborative trial a success. However, a relatively small share (37%) of all compounds were reported by more than one participant, which shows that there is plenty of room for improvement in the field of suspect and nontarget screening. An even a smaller share (5%) of the total number of compounds were detected using both liquid chromatography–mass spectrometry (LC-MS) and gas chromatography–mass spectrometry (GC-MS). Thus, the two MS techniques are highly complementary. Most of the compounds were detected using LC with electrospray ionization (ESI) MS and comprehensive 2D GC (GC×GC) with atmospheric pressure chemical ionization (APCI) and electron ionization (EI), respectively. Collectively, the three techniques accounted for more than 75% of the reported compounds. Glycols, pharmaceuticals, pesticides, and various biogenic compounds dominated among the compounds reported by LC-MS participants, while hydrocarbons, hydrocarbon derivatives, and chlorinated paraffins and chlorinated biphenyls were primarily reported by GC-MS participants. Plastics additives, flavor and fragrances, and personal care products were reported by both LC-MS and GC-MS participants. It was concluded that the use of multiple analytical techniques was required for a comprehensive characterization of house dust contaminants. Further, several recommendations are given for improved suspect and nontarget screening of house dust and other indoor environment samples, including the use of open-source data processing tools. One of the tools allowed provisional identification of almost 500 compounds that had not been reported by participants. ![]()
Collapse
|
12
|
Retention-time prediction in comprehensive two-dimensional gas chromatography to aid identification of unknown contaminants. Anal Bioanal Chem 2018; 410:7931-7941. [PMID: 30361914 PMCID: PMC6244764 DOI: 10.1007/s00216-018-1415-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 09/27/2018] [Accepted: 10/02/2018] [Indexed: 11/29/2022]
Abstract
Comprehensive two-dimensional (2D) gas chromatography (GC×GC) coupled to mass spectrometry (MS, GC×GC-MS), which enhances selectivity compared to GC-MS analysis, can be used for non-directed analysis (non-target screening) of environmental samples. Additional tools that aid in identifying unknown compounds are needed to handle the large amount of data generated. These tools include retention indices for characterizing relative retention of compounds and prediction of such. In this study, two quantitative structure–retention relationship (QSRR) approaches for prediction of retention times (1tR and 2tR) and indices (linear retention indices (LRIs) and a new polyethylene glycol–based retention index (PEG-2I)) in GC × GC were explored, and their predictive power compared. In the first method, molecular descriptors combined with partial least squares (PLS) analysis were used to predict times and indices. In the second method, the commercial software package ChromGenius (ACD/Labs), based on a “federation of local models,” was employed. Overall, the PLS approach exhibited better accuracy than the ChromGenius approach. Although average errors for the LRI prediction via ChromGenius were slightly lower, PLS was superior in all other cases. The average deviations between the predicted and the experimental value were 5% and 3% for the 1tR and LRI, and 5% and 12% for the 2tR and PEG-2I, respectively. These results are comparable to or better than those reported in previous studies. Finally, the developed model was successfully applied to an independent dataset and led to the discovery of 12 wrongly assigned compounds. The results of the present work represent the first-ever prediction of the PEG-2I. ᅟ ![]()
Collapse
|
13
|
Prodhan MAI, Sleman AA, Kim S, McClain C, Zhang X. Generalization of Reference System for Calculating the Second Dimension Retention Index in GC × GC–MS. JOURNAL OF ANALYSIS AND TESTING 2018. [DOI: 10.1007/s41664-018-0074-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|