Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 37 (from Reference Citation Analysis)

Article PDFs (5)

Cited by > 0 (21)

Searched Name: Raquel Rodríguez-Pérez


Trunzer M, Teigão J, Huth F, Poller B, Desrayaud S, Rodríguez-Pérez R, Faller B. Improving In Vitro-In Vivo Extrapolation of Clearance Using Rat Liver Microsomes for Highly Plasma Protein-Bound Molecules. Drug Metab Dispos 2024;52:345-354. [PMID: 38360916 DOI: 10.1124/dmd.123.001597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2023] [Revised: 02/07/2024] [Accepted: 02/12/2024] [Indexed: 02/17/2024] Open

Abstract

It is common practice in drug discovery and development to predict in vivo hepatic clearance from in vitro incubations with liver microsomes or hepatocytes using the well-stirred model (WSM). When applying the WSM to a set of approximately 3000 Novartis research compounds, 73% of neutral and basic compounds (extended clearance classification system [ECCS] class 2) were well-predicted within 3-fold. In contrast, only 44% (ECCS class 1A) or 34% (ECCS class 1B) of acids were predicted within 3-fold. To test the hypothesis that the higher degree of plasma protein binding for acids contributes to the in vitro-in vivo correlation (IVIVC) disconnect, 68 proprietary compounds were incubated with rat liver microsomes in the presence and absence of 5% plasma. A minor impact of plasma on clearance IVIVC was found for moderately bound compounds (fraction unbound in plasma [fup] ≥1%). However, addition of plasma significantly improved the IVIVC for highly bound compounds (fup <1%), as indicated by an increase of the average fold error from 0.10 to 0.36. Correlating fup with the scaled unbound intrinsic clearance ratio in the presence or absence of plasma allowed the establishment of an empirical, nonlinear correction equation that depends on fup. Taken together, estimation of the metabolic clearance of highly bound compounds was enhanced by the addition of plasma to microsomal incubations. For standard incubations in buffer only, application of an empirical correction provided improved clearance predictions. SIGNIFICANCE STATEMENT: Application of the well-stirred liver model for clearance in vitro-in vivo extrapolation (IVIVE) in rat generally underpredicts the clearance of acids, and the strong protein binding of acids is suspected to be one responsible factor. Unbound intrinsic in vitro clearance (CLint,u) determinations using rat liver microsomes supplemented with 5% plasma resulted in an improved IVIVE. An empirical equation was derived that can be applied to correct CLint,u values as a function of the fraction unbound in plasma (fup) and the measured CLint in buffer.
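
For orientation, the well-stirred model named in this abstract is commonly written as CLh = Qh · fu · CLint,u / (Qh + fu · CLint,u). The sketch below illustrates that textbook form together with a simple "within 3-fold" agreement check. It is not the authors' workflow: the function names, units, and example numbers are assumptions made for illustration, and the paper's empirical fup-dependent correction is not reproduced because the abstract does not state it.

```python
def well_stirred_clearance(q_h: float, fu: float, cl_int_u: float) -> float:
    """Hepatic clearance from the textbook well-stirred model.

    q_h      -- hepatic blood flow (assumed units: mL/min/kg)
    fu       -- fraction unbound, between 0 and 1 (hypothetical input)
    cl_int_u -- scaled unbound intrinsic clearance, same units as q_h
    """
    if q_h <= 0 or not 0 <= fu <= 1 or cl_int_u < 0:
        raise ValueError("inputs out of range")
    return q_h * fu * cl_int_u / (q_h + fu * cl_int_u)


def within_fold(predicted: float, observed: float, fold: float = 3.0) -> bool:
    """Return True if predicted/observed lies in [1/fold, fold]."""
    if predicted <= 0 or observed <= 0:
        raise ValueError("clearances must be positive")
    ratio = predicted / observed
    return 1.0 / fold <= ratio <= fold


# Illustrative numbers only; none of them come from the paper.
predicted = well_stirred_clearance(q_h=60.0, fu=0.1, cl_int_u=600.0)
print(predicted, within_fold(predicted, observed=15.0))
```

Because fu multiplies CLint,u in both numerator and denominator, a very small fu drives the predicted clearance down sharply, which is consistent with the underprediction for highly bound acids that the abstract describes.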

Fluetsch A, Trunzer M, Gerebtzoff G, Rodríguez-Pérez R. Deep Learning Models Compared to Experimental Variability for the Prediction of CYP3A4 Time-Dependent Inhibition. Chem Res Toxicol 2024;37:549-560. [PMID: 38501689 DOI: 10.1021/acs.chemrestox.3c00305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/20/2024]

Abstract

Most drugs are mainly metabolized by cytochrome P450 (CYP450), which can lead to drug-drug interactions (DDI). Specifically, time-dependent inhibition (TDI) of the CYP3A4 isoenzyme has been associated with clinically relevant DDI. To overcome potential DDI issues, high-throughput in vitro assays were established to assess the TDI of CYP3A4 during the discovery and lead optimization phases. However, in silico machine learning models would enable an earlier and larger-scale assessment of potential TDI liabilities. For CYP inhibition, most modeling efforts have focused on highly imbalanced and small data sets. Moreover, assay variability is rarely considered, although it is key to understanding a model's quality and suitability for decision-making. In this work, machine learning models were built for the prediction of TDI of CYP3A4, evaluated prospectively, and compared to the variability of the experimental assay. Different modeling strategies were investigated to assess their influence on the model's performance. Through multitask learning, additional data sets were leveraged for model building, coming from public databases, in-house CYP-related assays, or other pharmaceutical companies (federated learning). Apart from the numerical prediction of inactivation rates of CYP3A4 TDI, three-class predictions were carried out, giving a negative (inactivation rate kobs < 0.01 min⁻¹), weak positive (0.01 ≤ kobs ≤ 0.025 min⁻¹), or positive (kobs > 0.025 min⁻¹) output. The final multitask graph neural network model achieved misclassification rates of 8 and 7% for positive and negative TDI, respectively. Importantly, the presented deep learning-based predictions had a similar precision to the reproducibility of in vitro experiments and thus offered great opportunities for drug design, early derisking of DDI potential, and selection of experiments. To facilitate CYP inhibition modeling efforts in the public domain, the developed model was used to annotate ∼16 000 publicly available structures, and a surrogate data set is shared as Supporting Information.
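
The three-class scheme in this abstract can be written down directly from the stated thresholds. The following is a minimal sketch of that binning only; the function name and the string labels are assumptions for illustration, and it says nothing about how the authors' model produces kobs values.

```python
def classify_tdi(kobs_per_min: float) -> str:
    """Bin a CYP3A4 inactivation rate (kobs, in 1/min) into three classes.

    Thresholds follow the abstract: negative below 0.01, weak positive
    from 0.01 to 0.025 inclusive, positive above 0.025.
    """
    if kobs_per_min < 0:
        raise ValueError("kobs must be non-negative")
    if kobs_per_min < 0.01:
        return "negative"
    if kobs_per_min <= 0.025:
        return "weak positive"
    return "positive"


# Hypothetical rates, chosen only to hit each class.
for rate in (0.004, 0.01, 0.025, 0.04):
    print(rate, classify_tdi(rate))
```

Both boundary values (0.01 and 0.025) fall into the weak positive class, matching the inclusive inequalities given in the abstract.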

Fluetsch A, Di Lascio E, Gerebtzoff G, Rodríguez-Pérez R. Adapting Deep Learning QSPR Models to Specific Drug Discovery Projects. Mol Pharm 2024;21:1817-1826. [PMID: 38373038 DOI: 10.1021/acs.molpharmaceut.3c01124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/20/2024]

Lluch-Bernal M, Pedrosa M, Domínguez-Ortega J, Colque-Bayona M, Correa-Borit J, Phillips-Anglés E, Gómez-Traseira C, Quirce S, Rodríguez-Pérez R. Sensitization to Quercus ilex pollen is clinically relevant in patients with seasonal pollen allergy. J Investig Allergol Clin Immunol 2024;34:0. [PMID: 38381081 DOI: 10.18176/jiaci.0998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/22/2024] Open

Narváez-Fernández E, Pose K, Caballero ML, Rodríguez-Pérez R, Quirce S. Occupational asthma and food allergy due to soybean in a bakery worker. J Investig Allergol Clin Immunol 2023;34:0. [PMID: 37905416 DOI: 10.18176/jiaci.0958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/02/2023] Open

Rodríguez-Pérez R, Del Pozuelo S, Pulido E, Brigido C, Carretero P, Caballero ML. Chemiluminescence-based IgE dot-blot assay to diagnose a case of anaphylaxis caused by Prontosan. J Investig Allergol Clin Immunol 2023;34:0. [PMID: 37669080 DOI: 10.18176/jiaci.0933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/06/2023] Open

Amara K, Rodríguez-Pérez R, Jiménez-Luna J. Explaining compound activity predictions with a substructure-aware loss for graph neural networks. J Cheminform 2023;15:67. [PMID: 37491407 PMCID: PMC10369817 DOI: 10.1186/s13321-023-00733-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/04/2023] [Accepted: 07/08/2023] [Indexed: 07/27/2023] Open

Di Lascio E, Gerebtzoff G, Rodríguez-Pérez R. Systematic Evaluation of Local and Global Machine Learning Models for the Prediction of ADME Properties. Mol Pharm 2023;20:1758-1767. [PMID: 36745394 DOI: 10.1021/acs.molpharmaceut.2c00962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/07/2023]

Abstract

Machine learning (ML) has become an indispensable tool to predict absorption, distribution, metabolism, and excretion (ADME) properties in pharmaceutical research. ML algorithms are trained on molecular structures and corresponding ADME assay data to develop quantitative structure-property relationship (QSPR) models. Traditional QSPR models were trained on compound sets of limited size. With the advent of more complex ML algorithms and data availability, training sets have become larger and more diverse. The most common training approaches consist of training a model either with a small set of similar compounds, namely, compounds designed for the same drug discovery project or chemical series (local model approach), or with a larger set of diverse compounds (global model approach). Global models are built with all experimental data available for an assay, combining compound data from different projects and disease areas. Despite the ML progress made so far, the choice of the appropriate data composition for building ML models is still unclear. Herein, a systematic evaluation of local and global ML models was performed for 10 different experimental assays and 112 drug discovery projects. Results show a consistently superior performance of global models for ADME property predictions. Diagnostic analyses were also carried out to investigate the influence of training set size, structural diversity, and data shift on the relative performance of local and global ML models. Training set size and structural diversity did not have an impact on the relative performance of the methods. Instead, data shift helped to identify the projects with larger performance differences between local and global models. Results presented in this work can be leveraged to improve ML-based ADME property predictions and thus decision-making in drug discovery projects.

Rodríguez-Pérez R, Trunzer M, Schneider N, Faller B, Gerebtzoff G. Multispecies Machine Learning Predictions of In Vitro Intrinsic Clearance with Uncertainty Quantification Analyses. Mol Pharm 2023;20:383-394. [PMID: 36437712 DOI: 10.1021/acs.molpharmaceut.2c00680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

Bajorath J, Chávez-Hernández AL, Duran-Frigola M, Fernández-de Gortari E, Gasteiger J, López-López E, Maggiora GM, Medina-Franco JL, Méndez-Lucio O, Mestres J, Miranda-Quintana RA, Oprea TI, Plisson F, Prieto-Martínez FD, Rodríguez-Pérez R, Rondón-Villarreal P, Saldívar-Gonzalez FI, Sánchez-Cruz N, Valli M. Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform 2022;14:82. [PMID: 36461094 PMCID: PMC9716667 DOI: 10.1186/s13321-022-00661-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/09/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open

Affiliation(s)

Jürgen Bajorath: Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53113 Bonn, Germany (grid.10388.32; ISNI 0000 0001 2240 3300)
Ana L. Chávez-Hernández: DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510 Mexico City, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001)
Miquel Duran-Frigola: Ersilia Open Source Initiative, Cambridge, UK; Joint IRB-BSC-CRG Programme in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain (grid.7722.0; ISNI 0000 0001 1811 6966)
Eli Fernández-de Gortari: Nanosafety Laboratory, International Iberian Nanotechnology Laboratory, 4715-330 Braga, Portugal (grid.420330.6; ISNI 0000 0004 0521 6935)
Johann Gasteiger: Computer-Chemie-Centrum, University of Erlangen-Nuremberg, Erlangen, Germany (grid.5330.5; ISNI 0000 0001 2107 3311)
Edgar López-López: DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510 Mexico City, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001); Department of Pharmacology, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV), 07360 Mexico City, Mexico (grid.512574.0)
Gerald M. Maggiora: BIO5 Institute, University of Arizona, Tucson, AZ 85721, USA (grid.134563.6; ISNI 0000 0001 2168 186X)
José L. Medina-Franco: DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510 Mexico City, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001)
Oscar Méndez-Lucio: Recursion Pharmaceuticals, Salt Lake City, USA (grid.505135.7)
Jordi Mestres: Chemotargets SL, Baldiri Reixac 4, Parc Cientific de Barcelona (PCB), 08028 Barcelona, Catalonia, Spain (grid.5841.8; ISNI 0000 0004 1937 0247); Research Group on Systems Pharmacology, Research Program on Biomedical Informatics (GRIB), IMIM Hospital del Mar Medical Research Institute and University Pompeu Fabra, Parc de Recerca Biomedica (PRBB), 08003 Barcelona, Catalonia, Spain (grid.20522.37; ISNI 0000 0004 1767 9005)
Ramón Alain Miranda-Quintana: Department of Chemistry, University of Florida, Gainesville, FL 32603, USA (grid.15276.37; ISNI 0000 0004 1936 8091)
Tudor I. Oprea: Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA (grid.266832.b; ISNI 0000 0001 2188 8502); Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at Gothenburg University, 40530 Gothenburg, Sweden (grid.8761.8; ISNI 0000 0000 9919 9582); Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark (grid.5254.6; ISNI 0000 0001 0674 042X); Present Address: Roivant Discovery Sciences, Inc., 451 D Street, Boston, MA 02210, USA
Fabien Plisson: Department of Biotechnology and Biochemistry, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV-IPN), Irapuato Unit, 36824 Irapuato, Gto, Mexico (grid.512574.0)
Fernando D. Prieto-Martínez: Chemistry Institute, National Autonomous University of Mexico, 04510 Mexico City, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001)
Raquel Rodríguez-Pérez: Novartis Institutes for Biomedical Research, 4002 Basel, Switzerland (grid.419481.1; ISNI 0000 0001 1515 9979)
Paola Rondón-Villarreal: Universidad de Santander, Facultad de Ciencias Médicas y de la Salud, Instituto de Investigación Masira, Calle 70 No. 55-210, 680003 Santander, Bucaramanga, Colombia (grid.442204.4; ISNI 0000 0004 0486 1035)
Fernanda I. Saldívar-Gonzalez: DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510 Mexico City, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001)
Norberto Sánchez-Cruz: Chemotargets SL, Baldiri Reixac 4, Parc Cientific de Barcelona (PCB), 08028 Barcelona, Catalonia, Spain (grid.5841.8; ISNI 0000 0004 1937 0247); Instituto de Química, Unidad Mérida, Universidad Nacional Autónoma de México, Carretera Mérida-Tetiz Km. 4.5, Yucatán, 97357 Ucú, Mexico (grid.9486.3; ISNI 0000 0001 2159 0001)
Marilia Valli: Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University-UNESP, Araraquara, Brazil (grid.410543.7; ISNI 0000 0001 2188 478X)

Mastropietro A, Pasculli G, Feldmann C, Rodríguez-Pérez R, Bajorath J. EdgeSHAPer: Bond-Centric Shapley Value-Based Explanation Method for Graph Neural Networks. iScience 2022;25:105043. [PMID: 36134335 PMCID: PMC9483788 DOI: 10.1016/j.isci.2022.105043] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 06/21/2022] [Revised: 08/17/2022] [Accepted: 08/25/2022] [Indexed: 11/29/2022] Open

Hamzic S, Lewis R, Desrayaud S, Soylu C, Fortunato M, Gerebtzoff G, Rodríguez-Pérez R. Predicting In Vivo Compound Brain Penetration Using Multi-task Graph Neural Networks. J Chem Inf Model 2022;62:3180-3190. [PMID: 35738004 DOI: 10.1021/acs.jcim.2c00412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]

Rodríguez-Pérez R, Miljković F, Bajorath J. Machine Learning in Chemoinformatics and Medicinal Chemistry. Annu Rev Biomed Data Sci 2022;5:43-65. [PMID: 35440144 DOI: 10.1146/annurev-biodatasci-122120-124216] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 11/09/2022]

Rodríguez-Pérez R, Bajorath J. Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery. J Comput Aided Mol Des 2022;36:355-362. [PMID: 35304657 PMCID: PMC9325859 DOI: 10.1007/s10822-022-00442-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 02/08/2022] [Accepted: 02/15/2022] [Indexed: 11/05/2022]

Rodríguez-Pérez R, Carretero P, Brigido C, Nin-Valencia A, Carpio-Hernández D, Tomás M, Quirce S, Caballero ML. The new Api m 11.0301 isoallergen from Apis mellifera is a food allergen from honey. J Investig Allergol Clin Immunol 2022;32:492-493. [PMID: 35234637 DOI: 10.18176/jiaci.0799] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 11/20/2022] Open

Rodríguez-Pérez R, Bajorath J. Explainable Machine Learning for Property Predictions in Compound Optimization. J Med Chem 2021;64:17744-17752. [PMID: 34902252 DOI: 10.1021/acs.jmedchem.1c01789] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Indexed: 11/28/2022]

Miljković F, Rodríguez-Pérez R, Bajorath J. Impact of Artificial Intelligence on Compound Discovery, Design, and Synthesis. ACS Omega 2021;6:33293-33299. [PMID: 34926881 PMCID: PMC8674916 DOI: 10.1021/acsomega.1c05512] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 10/04/2021] [Accepted: 11/18/2021] [Indexed: 05/17/2023]

Rodríguez-Pérez R, Bajorath J. Feature importance correlation from machine learning indicates functional relationships between proteins and similar compound binding characteristics. Sci Rep 2021;11:14245. [PMID: 34244588 PMCID: PMC8270985 DOI: 10.1038/s41598-021-93771-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/15/2021] [Accepted: 06/30/2021] [Indexed: 11/08/2022] Open

Galati S, Yonchev D, Rodríguez-Pérez R, Vogt M, Tuccinardi T, Bajorath J. Predicting Isoform-Selective Carbonic Anhydrase Inhibitors via Machine Learning and Rationalizing Structural Features Important for Selectivity. ACS Omega 2021;6:4080-4089. [PMID: 33585783 PMCID: PMC7876851 DOI: 10.1021/acsomega.0c06153] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 12/17/2020] [Accepted: 01/14/2021] [Indexed: 05/03/2023]

Rodríguez-Pérez R, Miljković F, Bajorath J. Assessing the information content of structural and protein-ligand interaction representations for the classification of kinase inhibitor binding modes via machine learning and active learning. J Cheminform 2020;12:36. [PMID: 33431025 PMCID: PMC7245824 DOI: 10.1186/s13321-020-00434-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 12/21/2019] [Accepted: 04/27/2020] [Indexed: 12/27/2022] Open

Abstract

For kinase inhibitors, X-ray crystallography has revealed different types of binding modes. Currently, more than 2000 kinase inhibitors with known binding modes are available, which makes it possible to derive and test machine learning models for the prediction of inhibitors with different binding modes. We have addressed this prediction task to evaluate and compare the information content of distinct molecular representations including protein–ligand interaction fingerprints (IFPs) and compound structure-based structural fingerprints (i.e., atom environment/fragment fingerprints). IFPs were designed to capture binding mode-specific interaction patterns at different resolution levels. Accurate predictions of kinase inhibitor binding modes were achieved with random forests using both representations. The performance of IFPs was consistently superior to atom environment fingerprints, albeit only by less than 10%. An active learning strategy applying information entropy-based selection of training instances was applied as a diagnostic approach to assess the relative information content of distinct representations. IFPs were found to capture more binding mode-relevant information than atom environment fingerprints, leading to highly predictive models even when training instances were randomly selected. By contrast, for atom environment fingerprints, the derivation of accurate models via active learning depended on entropy-based selection of informative training compounds. Notably, higher information content of IFPs confirmed by active learning only resulted in small improvements in global prediction accuracy compared to models derived using atom environment fingerprints. For practical applications, prediction of binding modes of new kinase inhibitors on the basis of chemical structure is highly attractive.
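
The entropy-based selection of training instances mentioned in this abstract can be illustrated with generic uncertainty sampling: compute the Shannon entropy of each candidate's predicted class probabilities and pick the most uncertain candidates next. This is a sketch of the general idea only; the abstract does not specify the authors' exact protocol, and the function names and the example probabilities are assumptions.

```python
import math
from typing import List, Sequence


def shannon_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in bits) of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def select_most_uncertain(pool_probs: List[Sequence[float]], k: int) -> List[int]:
    """Return indices of the k pool items with the highest predictive entropy.

    pool_probs holds one class-probability vector per unlabeled compound,
    for example the vote fractions of a random forest (hypothetical input).
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: shannon_entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical predicted probabilities for three unlabeled compounds.
pool = [[0.9, 0.1], [0.5, 0.5], [0.99, 0.01]]
print(select_most_uncertain(pool, k=2))
```

Replacing the entropy ranking with a random draw of k indices gives the random-selection baseline that the abstract contrasts with entropy-based selection.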

Rodríguez-Pérez R, Bajorath J. Interpretation of Compound Activity Predictions from Complex Machine Learning Models Using Local Approximations and Shapley Values. J Med Chem 2019;63:8761-8777. [PMID: 31512867 DOI: 10.1021/acs.jmedchem.9b01101] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Indexed: 01/27/2023]

Miljković F, Rodríguez-Pérez R, Bajorath J. Machine Learning Models for Accurate Prediction of Kinase Inhibitors with Different Binding Modes. J Med Chem 2019;63:8738-8748. [PMID: 31469557 DOI: 10.1021/acs.jmedchem.9b00867] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Indexed: 12/13/2022]

Rodríguez-Pérez R, Bajorath J. Prediction of Compound Profiling Matrices, Part II: Relative Performance of Multitask Deep Learning and Random Forest Classification on the Basis of Varying Amounts of Training Data. ACS Omega 2018;3:12033-12040. [PMID: 30320286 PMCID: PMC6175492 DOI: 10.1021/acsomega.8b01682] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 07/17/2018] [Accepted: 09/12/2018] [Indexed: 05/28/2023]

Rodríguez-Pérez R, Fernández L, Marco S. Overoptimism in cross-validation when using partial least squares-discriminant analysis for omics data: a systematic study. Anal Bioanal Chem 2018;410:5981-5992. [PMID: 29959482 DOI: 10.1007/s00216-018-1217-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Received: 04/11/2018] [Revised: 06/13/2018] [Accepted: 06/21/2018] [Indexed: 01/29/2023]

Abstract

Advances in analytical instrumentation have provided the possibility of examining thousands of genes, peptides, or metabolites in parallel. However, the costly and time-consuming data acquisition process causes a generalized lack of samples. From a data analysis perspective, omics data are characterized by high dimensionality and small sample counts. In many scenarios, the analytical aim is to differentiate between two different conditions or classes by combining an analytical method with a tailored qualitative predictive model built from available examples collected in a dataset. For this purpose, partial least squares-discriminant analysis (PLS-DA) is frequently employed in omics research. Recently, there has been growing concern about the uncritical use of this method, since it is prone to overfitting and may aggravate problems of false discoveries. In many applications involving a small number of subjects or samples, predictive model performance estimation is based only on cross-validation (CV) results, with a strong preference for reporting results using leave one out (LOO). The combination of PLS-DA for high-dimensionality data and small sample conditions, together with a weak validation methodology, is a recipe for unreliable estimations of model performance. In this work, we present a systematic study of the impact of the dataset size, the dimensionality, and the CV technique used on PLS-DA overoptimism when performance estimation is done in cross-validation. Firstly, by using synthetic data generated from the same probability distribution and with randomly assigned binary labels, we have obtained a dataset where the true classification rate (CR) is 50%. As expected, our results confirm that internal validation provides overoptimistic estimations of the classification accuracy (i.e., overfitting). We have characterized the CR estimator in terms of bias and variance depending on the internal CV technique used and the sample to dimensionality ratio. In small sample conditions, due to the large bias and variance of the estimator, the occurrence of extremely good CRs is common. We have found that overfitting peaks when the sample size in the training subset approaches the feature vector dimensionality minus one. In these conditions, the models are neither under- nor overdetermined with a unique solution. This effect is particularly intense for LOO and peaks higher in small sample conditions. Overoptimism is decreased beyond this point where the abundance of noisy produces a regularization effect leading to less complex models. In terms of overfitting, our study ranks CV methods as follows: Bootstrap produces the most accurate estimator of the CR, followed by bootstrapped Latin partitions, random subsampling, K-Fold, and finally, the very popular LOO provides the worst results. Simulation results are further confirmed in real datasets from mass spectrometry and microarrays.

Rodríguez-Pérez R, Miyao T, Jasial S, Vogt M, Bajorath J. Prediction of Compound Profiling Matrices Using Machine Learning. ACS Omega 2018;3:4713-4723. [PMID: 30023899 PMCID: PMC6045364 DOI: 10.1021/acsomega.8b00462] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Received: 03/12/2018] [Accepted: 04/20/2018] [Indexed: 05/25/2023]

Rodríguez-Pérez R, Cortés R, Guamán A, Pardo A, Torralba Y, Gómez F, Roca J, Barberà JA, Cascante M, Marco S. Instrumental drift removal in GC-MS data for breath analysis: the short-term and long-term temporal validation of putative biomarkers for COPD. J Breath Res 2018;12:036007. [PMID: 29292699 DOI: 10.1088/1752-7163/aaa492] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 11/12/2022]

Rodríguez-Pérez R, Vogt M, Bajorath J. Support Vector Machine Classification and Regression Prioritize Different Structural Features for Binary Compound Activity and Potency Value Prediction. ACS Omega 2017;2:6371-6379. [PMID: 30023518 PMCID: PMC6045367 DOI: 10.1021/acsomega.7b01079] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Received: 07/27/2017] [Accepted: 09/22/2017] [Indexed: 05/15/2023]

Rodríguez-Pérez R, Vogt M, Bajorath J. Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds. J Chem Inf Model 2017;57:710-716. [PMID: 28376613 PMCID: PMC5417594 DOI: 10.1021/acs.jcim.7b00088] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Indexed: 11/30/2022]

García-Urquijo A, Rodríguez-Rodríguez J, Rodríguez-Pérez R, Lorenzo-Manzanas, Hernández-González G. Staphylococcus aureus en quemaduras: estudio de incidencia, tendencia y pronóstico [Staphylococcus aureus in burns: a study of incidence, trend, and prognosis]. Cir plást iberolatinoam 2015. [DOI: 10.4321/s0376-78922015000200002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/11/2022] Open

Garcia Alonso M, Caballero ML, Umpierrez A, Lluch-Bernal M, Knaute T, Rodríguez-Pérez R. Relationships between T cell and IgE/IgG4 epitopes of the Anisakis simplex major allergen Ani s 1. Clin Exp Allergy 2015;45:994-1005. [DOI: 10.1111/cea.12474] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 08/18/2014] [Revised: 11/05/2014] [Accepted: 12/07/2014] [Indexed: 02/06/2023]

Mauriz E, Laliena A, Vallejo D, Tuñón MJ, Rodríguez-López JM, Rodríguez-Pérez R, García-Fernández MC. Effects of a low-fat diet with antioxidant supplementation on biochemical markers of multiple sclerosis long-term care residents. Nutr Hosp 2013;28:2229-35. [PMID: 24506405 DOI: 10.3305/nutr hosp.v28in06.6983] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 12/01/2022]

Abstract

INTRODUCTION

Treatment options for multiple sclerosis (MS) are largely limited to immunomodulatory therapies for non-progressive forms of the disease. Nutrition intervention studies suggest that diet may serve as a complementary treatment to control disease progression. Dietary intervention may therefore help improve wellness and ameliorate symptoms in MS patients.

OBJECTIVES

To assess the effect of a low-fat diet with antioxidant supplementation on biochemical markers of institutionalized patients with progressive forms of multiple sclerosis.

METHODS

This randomized, prospective, placebo-controlled study involved 9 participants: 5 were assigned to the intervention group (low-fat diet and antioxidant supplementation) and 4 to the placebo group (low-fat diet). The effect of the dietary intervention, involving diet modification and antioxidant supplementation, was examined over 42 days by measuring anthropometric and biochemical parameters and oxidative stress markers in blood at baseline (day 0), an intermediate point (day 15), and the end of treatment (day 42).

RESULTS

At the end of the study, C-reactive protein levels in the intervention group were significantly lower than those in the placebo group. Levels of the oxidative stress marker isoprostane 8-iso-PGF2α and the inflammatory marker interleukin-6 (IL-6) also decreased after the dietary intervention in the intervention group. Catalase activity increased significantly in the intervention group prior to antioxidant supplementation. No significant differences were observed in other oxidative stress markers.

CONCLUSIONS

The results suggest that diet and dietary supplements are involved in the modulation of cell metabolism and MS-related inflammatory processes. Consequently, low-fat diets and antioxidant supplements may be used as complementary therapies for the treatment of multiple sclerosis.

Iparraguirre A, Rodríguez-Pérez R, Juste S, Ledesma A, Moneo I, Caballero ML. Selective allergy to lobster in a case of primary sensitization to house dust mites. J Investig Allergol Clin Immunol 2009;19:409-413. [PMID: 19862942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/28/2023]

Bascones O, Rodríguez-Pérez R, Juste S, Moneo I, Caballero ML. Lettuce-induced anaphylaxis. Identification of the allergen involved. J Investig Allergol Clin Immunol 2009;19:154-7. [PMID: 19476020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/27/2023]

Vicente-Serrano J, Caballero ML, Rodríguez-Pérez R, Carretero P, Pérez R, Blanco JG, Juste S, Moneo I. Sensitization to serum albumins in children allergic to cow's milk and epithelia. Pediatr Allergy Immunol 2007;18:503-7. [PMID: 17680908 DOI: 10.1111/j.1399-3038.2007.00548.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Indexed: 11/30/2022]

Zurera-Cosano G, García-Gimeno R, Rodríguez-Pérez R, Hervás-Martínez C. Performance of response surface model for prediction of Leuconostoc mesenteroides growth parameters under different experimental conditions. Food Control 2006. [DOI: 10.1016/j.foodcont.2005.02.003] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Indexed: 11/30/2022]

García-Gimeno RM, Hervás-Martínez C, Rodríguez-Pérez R, Zurera-Cosano G. Modelling the growth of Leuconostoc mesenteroides by Artificial Neural Networks. Int J Food Microbiol 2005;105:317-32. [PMID: 16054719 DOI: 10.1016/j.ijfoodmicro.2005.04.013] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.6] [Received: 10/08/2004] [Accepted: 04/18/2005] [Indexed: 11/30/2022]

Pérez-Guillé G, Camacho-Vieyra A, Toledo-López A, Guillé-Pérez A, Flores-Pérez J, Rodríguez-Pérez R, Juárez-Olguín H, Lares-Asseff I. Patterns of drug consumption in relation with the pathologies of elderly Mexican subjects resident in nursing homes. J Pharm Pharm Sci 2001;4:159-66. [PMID: 11466173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/20/2023]