Poole CF. The effect of descriptor database selection on the physicochemical characterization and prediction of water-air, octanol-air and octanol-water partition constants using the solvation parameter model.
J Chromatogr A 2023;
1706:464213. [PMID:
37567000 DOI:
10.1016/j.chroma.2023.464213]
[Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/10/2023] [Accepted: 07/11/2023] [Indexed: 08/13/2023]
Abstract
The distribution of neutral compounds in biphasic separation systems can be described by the solvation parameter model using six solute properties, or descriptors. These descriptors (McGowan's characteristic volume, excess molar refraction, dipolarity/polarizability, hydrogen-bond acidity and basicity, and the gas-liquid partition constant on n-hexadecane at 298.15 K) are curated in two publicly accessible databases for hundreds (WSU compound descriptor database) or thousands (Abraham compound descriptor database). These databases were developed independently using different approaches resulting in descriptor values that vary for many compounds. Previously, it was shown that the two descriptor databases are not interchangeable, and the WSU descriptor database consistently demonstrated improved model performance for chromatographic systems where the uncertainty in the dependent variable was minimized by suitable quality control and calibration procedures. In this report we wish to evaluate whether the same conclusions are true for models with a dependent variable containing significant measurement uncertainty. To evaluate this hypothesis, we assembled databases for water-air, octanol-air, and octanol-water partition constants reported by multiple laboratories using various measurement methods. It was found that database selection has little effect on model quality or model predictive capability but significantly affects the assignment of the contribution of individual intermolecular interactions to the dependent variable. The latter information is database specific, and a quantitative comparison of system constants should be restricted to models using the same compound descriptor database.
Collapse