Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hong H, Tong W, Perkins R, Fang H, Xie Q, Shi L. Multiclass Decision Forest—A Novel Pattern Recognition Method for Multiclass Classification in Microarray Data Analysis. DNA Cell Biol 2004;23:685-94. [PMID: 15585126 DOI: 10.1089/dna.2004.23.685] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

For:	Hong H, Tong W, Perkins R, Fang H, Xie Q, Shi L. Multiclass Decision Forest—A Novel Pattern Recognition Method for Multiclass Classification in Microarray Data Analysis. DNA Cell Biol 2004;23:685-94. [PMID: 15585126 DOI: 10.1089/dna.2004.23.685] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

Number

Cited by Other Article(s)

Liu J, Guo W, Dong F, Aungst J, Fitzpatrick S, Patterson TA, Hong H. Machine learning models for rat multigeneration reproductive toxicity prediction. Front Pharmacol 2022;13:1018226. [PMID: 36238576 PMCID: PMC9552001 DOI: 10.3389/fphar.2022.1018226] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/09/2022] [Indexed: 11/13/2022] Open

Abstract Reproductive toxicity is one of the prominent endpoints in the risk assessment of environmental and industrial chemicals. Due to the complexity of the reproductive system, traditional reproductive toxicity testing in animals, especially guideline multigeneration reproductive toxicity studies, take a long time and are expensive. Therefore, machine learning, as a promising alternative approach, should be considered when evaluating the reproductive toxicity of chemicals. We curated rat multigeneration reproductive toxicity testing data of 275 chemicals from ToxRefDB (Toxicity Reference Database) and developed predictive models using seven machine learning algorithms (decision tree, decision forest, random forest, k-nearest neighbors, support vector machine, linear discriminant analysis, and logistic regression). A consensus model was built based on the seven individual models. An external validation set was curated from the COSMOS database and the literature. The performances of individual and consensus models were evaluated using 500 iterations of 5-fold cross-validations and the external validation data set. The balanced accuracy of the models ranged from 58% to 65% in the 5-fold cross-validations and 45%–61% in the external validations. Prediction confidence analysis was conducted to provide additional information for more appropriate applications of the developed models. The impact of our findings is in increasing confidence in machine learning models. We demonstrate the importance of using consensus models for harnessing the benefits of multiple machine learning models (i.e., using redundant systems to check validity of outcomes). While we continue to build upon the models to better characterize weak toxicants, there is current utility in saving resources by being able to screen out strong reproductive toxicants before investing in vivo testing. The modeling approach (machine learning models) is offered for assessing the rat multigeneration reproductive toxicity of chemicals. Our results suggest that machine learning may be a promising alternative approach to evaluate the potential reproductive toxicity of chemicals. Collapse

A Machine Learning Model to Predict Citation Counts of Scientific Papers in Otology Field. BIOMED RESEARCH INTERNATIONAL 2022;2022:2239152. [PMID: 35909490 PMCID: PMC9329008 DOI: 10.1155/2022/2239152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 06/26/2022] [Indexed: 12/04/2022]

Liu J, Guo W, Sakkiah S, Ji Z, Yavas G, Zou W, Chen M, Tong W, Patterson TA, Hong H. Machine Learning Models for Predicting Liver Toxicity. Methods Mol Biol 2022;2425:393-415. [PMID: 35188640 DOI: 10.1007/978-1-0716-1960-5_15] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Mansouri K, Karmaus AL, Fitzpatrick J, Patlewicz G, Pradeep P, Alberga D, Alepee N, Allen TE, Allen D, Alves VM, Andrade CH, Auernhammer TR, Ballabio D, Bell S, Benfenati E, Bhattacharya S, Bastos JV, Boyd S, Brown J, Capuzzi SJ, Chushak Y, Ciallella H, Clark AM, Consonni V, Daga PR, Ekins S, Farag S, Fedorov M, Fourches D, Gadaleta D, Gao F, Gearhart JM, Goh G, Goodman JM, Grisoni F, Grulke CM, Hartung T, Hirn M, Karpov P, Korotcov A, Lavado GJ, Lawless M, Li X, Luechtefeld T, Lunghini F, Mangiatordi GF, Marcou G, Marsh D, Martin T, Mauri A, Muratov EN, Myatt GJ, Nguyen DT, Nicolotti O, Note R, Pande P, Parks AK, Peryea T, Polash AH, Rallo R, Roncaglioni A, Rowlands C, Ruiz P, Russo DP, Sayed A, Sayre R, Sheils T, Siegel C, Silva AC, Simeonov A, Sosnin S, Southall N, Strickland J, Tang Y, Teppen B, Tetko IV, Thomas D, Tkachenko V, Todeschini R, Toma C, Tripodi I, Trisciuzzi D, Tropsha A, Varnek A, Vukovic K, Wang Z, Wang L, Waters KM, Wedlake AJ, Wijeyesakere SJ, Wilson D, Xiao Z, Yang H, Zahoranszky-Kohalmi G, Zakharov AV, Zhang FF, Zhang Z, Zhao T, Zhu H, Zorn KM, Casey W, Kleinstreuer NC. CATMoS: Collaborative Acute Toxicity Modeling Suite. ENVIRONMENTAL HEALTH PERSPECTIVES 2021;129:47013. [PMID: 33929906 PMCID: PMC8086800 DOI: 10.1289/ehp8495] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]

Abstract

BACKGROUND

Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory hazard classification, labeling, and risk management. However, it is cost- and time-prohibitive to evaluate all new and existing chemicals using traditional rodent acute toxicity tests. In silico models built using existing data facilitate rapid acute toxicity predictions without using animals.

OBJECTIVES

The U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM) Acute Toxicity Workgroup organized an international collaboration to develop in silico models for predicting acute oral toxicity based on five different end points: Lethal Dose 50 (LD50 value, U.S. Environmental Protection Agency hazard (four) categories, Globally Harmonized System for Classification and Labeling hazard (five) categories, very toxic chemicals [LD50 (LD50≤50mg/kg)], and nontoxic chemicals (LD50>2,000mg/kg).

METHODS

An acute oral toxicity data inventory for 11,992 chemicals was compiled, split into training and evaluation sets, and made available to 35 participating international research groups that submitted a total of 139 predictive models. Predictions that fell within the applicability domains of the submitted models were evaluated using external validation sets. These were then combined into consensus models to leverage strengths of individual approaches.

RESULTS

The resulting consensus predictions, which leverage the collective strengths of each individual model, form the Collaborative Acute Toxicity Modeling Suite (CATMoS). CATMoS demonstrated high performance in terms of accuracy and robustness when compared with in vivo results.

DISCUSSION

CATMoS is being evaluated by regulatory agencies for its utility and applicability as a potential replacement for in vivo rat acute oral toxicity studies. CATMoS predictions for more than 800,000 chemicals have been made available via the National Toxicology Program's Integrated Chemical Environment tools and data sets (ice.ntp.niehs.nih.gov). The models are also implemented in a free, standalone, open-source tool, OPERA, which allows predictions of new and untested chemicals to be made. https://doi.org/10.1289/EHP8495.

Collapse

Affiliation(s)

Kamel Mansouri Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
Agnes L. Karmaus Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Jeremy Fitzpatrick ScitoVation, Research Triangle Park, North Carolina, USA
Grace Patlewicz Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Prachi Pradeep Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Domenico Alberga Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Nathalie Alepee L’Oréal Research & Innovation, Aulnay-sous-Bois, France
Timothy E.H. Allen Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Dave Allen Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Vinicius M. Alves Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Carolina H. Andrade Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Tyler R. Auernhammer The Dow Chemical Company, Midland, Michigan, USA
Davide Ballabio Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Shannon Bell Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Emilio Benfenati Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Sudin Bhattacharya Institute for Quantitative Health Science and Engineering, Department of Biomedical Engineering, Michigan State University, East Lansing, Michigan, USA
Joyce V. Bastos Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Stephen Boyd Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
J.B. Brown Kyoto University Graduate School of Medicine, Kyoto, Japan
Stephen J. Capuzzi Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Yaroslav Chushak Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
Heather Ciallella Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Alex M. Clark Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Viviana Consonni Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Pankaj R. Daga Simulations Plus, Inc., Lancaster, California, USA
Sean Ekins Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Sherif Farag Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Maxim Fedorov Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
Denis Fourches Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Domenico Gadaleta Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Feng Gao Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
Jeffery M. Gearhart Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
Garett Goh Pacific Northwest National Laboratory, Richland, Washington, USA
Jonathan M. Goodman Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Francesca Grisoni Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Christopher M. Grulke Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Thomas Hartung Underwriters Laboratories, Northbrook, Illinois, USA
Matthew Hirn Department of Computational Mathematics, Science & Engineering, Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
Pavel Karpov Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany
Alexandru Korotcov Science Data Software, LLC, Rockville, Maryland, USA
Giovanna J. Lavado Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Michael Lawless Simulations Plus, Inc., Lancaster, California, USA
Xinhao Li Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA
Thomas Luechtefeld Underwriters Laboratories, Northbrook, Illinois, USA
Filippo Lunghini Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Giuseppe F. Mangiatordi Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Gilles Marcou Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Dan Marsh Underwriters Laboratories, Northbrook, Illinois, USA
Todd Martin Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Cincinnati, Ohio, USA
Andrea Mauri Alvascience Srl, Lecco, Italy
Eugene N. Muratov Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Glenn J. Myatt Leadscope Inc., Columbus, Ohio, USA
Dac-Trung Nguyen National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Orazio Nicolotti Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Reine Note L’Oréal Research & Innovation, Aulnay-sous-Bois, France
Paritosh Pande Pacific Northwest National Laboratory, Richland, Washington, USA
Amanda K. Parks The Dow Chemical Company, Midland, Michigan, USA
Tyler Peryea National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ahsan H. Polash Kyoto University Graduate School of Medicine, Kyoto, Japan
Robert Rallo Pacific Northwest National Laboratory, Richland, Washington, USA
Alessandra Roncaglioni Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Craig Rowlands Underwriters Laboratories, Northbrook, Illinois, USA
Patricia Ruiz Office of Innovation and Analytics, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Daniel P. Russo Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Ahmed Sayed Rosettastein Consulting UG, Freising, Germany
Risa Sayre Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Timothy Sheils National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Charles Siegel Pacific Northwest National Laboratory, Richland, Washington, USA
Arthur C. Silva Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Anton Simeonov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Sergey Sosnin Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
Noel Southall National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Judy Strickland Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Yun Tang Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Brian Teppen Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
Igor V. Tetko Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany BIGCHEM GmbH, Unterschleissheim, Germany
Dennis Thomas Pacific Northwest National Laboratory, Richland, Washington, USA
Valery Tkachenko Science Data Software, LLC, Rockville, Maryland, USA
Roberto Todeschini Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Cosimo Toma Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Ignacio Tripodi Computer Science/Interdisciplinary Quantitative Biology, University of Colorado, Boulder, Colorado, USA
Daniela Trisciuzzi Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Alexander Tropsha Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Alexandre Varnek Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Kristijan Vukovic Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Zhongyu Wang School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Liguo Wang School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Katrina M. Waters Pacific Northwest National Laboratory, Richland, Washington, USA
Andrew J. Wedlake Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Sanjeeva J. Wijeyesakere The Dow Chemical Company, Midland, Michigan, USA
Dan Wilson The Dow Chemical Company, Midland, Michigan, USA
Zijun Xiao School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Hongbin Yang Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Gergely Zahoranszky-Kohalmi National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Alexey V. Zakharov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Fagen F. Zhang The Dow Chemical Company, Midland, Michigan, USA
Zhen Zhang Dow Agrosciences, Indianapolis, Indiana, USA
Tongan Zhao National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Hao Zhu Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Kimberley M. Zorn Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Warren Casey National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
Nicole C. Kleinstreuer National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA

Collapse

Mansouri K, Kleinstreuer N, Abdelaziz AM, Alberga D, Alves VM, Andersson PL, Andrade CH, Bai F, Balabin I, Ballabio D, Benfenati E, Bhhatarai B, Boyer S, Chen J, Consonni V, Farag S, Fourches D, García-Sosa AT, Gramatica P, Grisoni F, Grulke CM, Hong H, Horvath D, Hu X, Huang R, Jeliazkova N, Li J, Li X, Liu H, Manganelli S, Mangiatordi GF, Maran U, Marcou G, Martin T, Muratov E, Nguyen DT, Nicolotti O, Nikolov NG, Norinder U, Papa E, Petitjean M, Piir G, Pogodin P, Poroikov V, Qiao X, Richard AM, Roncaglioni A, Ruiz P, Rupakheti C, Sakkiah S, Sangion A, Schramm KW, Selvaraj C, Shah I, Sild S, Sun L, Taboureau O, Tang Y, Tetko IV, Todeschini R, Tong W, Trisciuzzi D, Tropsha A, Van Den Driessche G, Varnek A, Wang Z, Wedebye EB, Williams AJ, Xie H, Zakharov AV, Zheng Z, Judson RS. CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity. ENVIRONMENTAL HEALTH PERSPECTIVES 2020;128:27002. [PMID: 32074470 DOI: 10.23645/epacomptox.5176876] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Abstract

BACKGROUND

Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones and alter synthesis, transport, or metabolic pathways. The prospect of EDCs causing adverse health effects in humans and wildlife has led to the development of scientific and regulatory approaches for evaluating bioactivity. This need is being addressed using high-throughput screening (HTS) in vitro approaches and computational modeling.

OBJECTIVES

In support of the Endocrine Disruptor Screening Program, the U.S. Environmental Protection Agency (EPA) led two worldwide consortiums to virtually screen chemicals for their potential estrogenic and androgenic activities. Here, we describe the Collaborative Modeling Project for Androgen Receptor Activity (CoMPARA) efforts, which follows the steps of the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP).

METHODS

The CoMPARA list of screened chemicals built on CERAPP's list of 32,464 chemicals to include additional chemicals of interest, as well as simulated ToxCast™ metabolites, totaling 55,450 chemical structures. Computational toxicology scientists from 25 international groups contributed 91 predictive models for binding, agonist, and antagonist activity predictions. Models were underpinned by a common training set of 1,746 chemicals compiled from a combined data set of 11 ToxCast™/Tox21 HTS in vitro assays.

RESULTS

The resulting models were evaluated using curated literature data extracted from different sources. To overcome the limitations of single-model approaches, CoMPARA predictions were combined into consensus models that provided averaged predictive accuracy of approximately 80% for the evaluation set.

DISCUSSION

The strengths and limitations of the consensus predictions were discussed with example chemicals; then, the models were implemented into the free and open-source OPERA application to enable screening of new chemicals with a defined applicability domain and accuracy assessment. This implementation was used to screen the entire EPA DSSTox database of ∼875,000 chemicals, and their predicted AR activities have been made available on the EPA CompTox Chemicals dashboard and National Toxicology Program's Integrated Chemical Environment. https://doi.org/10.1289/EHP5580.

Collapse

Affiliation(s)

Kamel Mansouri National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA ScitoVation LLC, Research Triangle Park, North Carolina, USA Integrated Laboratory Systems, Inc., Morrisville, North Carolina, USA
Nicole Kleinstreuer National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
Ahmed M Abdelaziz Technische Universität München, Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt, Department für Biowissenschaftliche Grundlagen, Weihenstephaner Steig 23, 85350 Freising, Germany
Domenico Alberga Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Vinicius M Alves Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Brazil Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Patrik L Andersson Chemistry Department, Umeå University, Umeå, Sweden
Carolina H Andrade Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Brazil
Fang Bai School of Pharmacy, Lanzhou University, China
Ilya Balabin Information Systems & Global Solutions (IS&GS), Lockheed Martin, USA
Davide Ballabio Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Emilio Benfenati Istituto di Ricerche Farmacologiche "Mario Negri", IRCCS, Milan, Italy
Barun Bhhatarai QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Scott Boyer Swedish Toxicology Sciences Research Center, Karolinska Institutet, Södertälje, Sweden
Jingwen Chen School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Viviana Consonni Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Sherif Farag Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Denis Fourches Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Alfonso T García-Sosa Institute of Chemistry, University of Tartu, Tartu, Estonia
Paola Gramatica QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Francesca Grisoni Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Chris M Grulke National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Huixiao Hong Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Dragos Horvath Laboratoire de Chémoinformatique-UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Xin Hu National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ruili Huang National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Nina Jeliazkova IdeaConsult, Ltd., Sofia, Bulgaria
Jiazhong Li School of Pharmacy, Lanzhou University, China
Xuehua Li School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Huanxiang Liu School of Pharmacy, Lanzhou University, China
Serena Manganelli Istituto di Ricerche Farmacologiche "Mario Negri", IRCCS, Milan, Italy
Giuseppe F Mangiatordi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Uko Maran Institute of Chemistry, University of Tartu, Tartu, Estonia
Gilles Marcou Laboratoire de Chémoinformatique-UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Todd Martin National Risk Management Research Laboratory, U.S. EPA, Cincinnati, Ohio, USA
Eugene Muratov Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Dac-Trung Nguyen National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Orazio Nicolotti Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Nikolai G Nikolov Division of Risk Assessment and Nutrition, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Ulf Norinder Swedish Toxicology Sciences Research Center, Karolinska Institutet, Södertälje, Sweden
Ester Papa QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Michel Petitjean Computational Modeling of Protein-Ligand Interactions (CMPLI)-INSERM UMR 8251, INSERM ERL U1133, Functional and Adaptative Biology (BFA), Universite de Paris, Paris, France
Geven Piir Institute of Chemistry, University of Tartu, Tartu, Estonia
Pavel Pogodin Institute of Biomedical Chemistry IBMC, 10 Building 8, Pogodinskaya st., Moscow 119121, Russia
Vladimir Poroikov Institute of Biomedical Chemistry IBMC, 10 Building 8, Pogodinskaya st., Moscow 119121, Russia
Xianliang Qiao School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Ann M Richard National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Alessandra Roncaglioni Istituto di Ricerche Farmacologiche "Mario Negri", IRCCS, Milan, Italy
Patricia Ruiz Computational Toxicology and Methods Development Laboratory, Division of Toxicology and Human Health Sciences, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Chetan Rupakheti National Risk Management Research Laboratory, U.S. EPA, Cincinnati, Ohio, USA Department of Biochemistry and Molecular Biophysics, University of Chicago, Chicago, Illinois, USA
Sugunadevi Sakkiah Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Alessandro Sangion QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Karl-Werner Schramm Technische Universität München, Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt, Department für Biowissenschaftliche Grundlagen, Weihenstephaner Steig 23, 85350 Freising, Germany
Chandrabose Selvaraj Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Imran Shah National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Sulev Sild Institute of Chemistry, University of Tartu, Tartu, Estonia
Lixia Sun Department of Pharmaceutical Sciences, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Olivier Taboureau Computational Modeling of Protein-Ligand Interactions (CMPLI)-INSERM UMR 8251, INSERM ERL U1133, Functional and Adaptative Biology (BFA), Universite de Paris, Paris, France
Yun Tang Department of Pharmaceutical Sciences, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Igor V Tetko BIGCHEM GmbH, Neuherberg, Germany Helmholtz Zentrum Muenchen - German Research Center for Environmental Health (GmbH), Neuherberg, Germany
Roberto Todeschini Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Weida Tong Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Daniela Trisciuzzi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Alexander Tropsha Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
George Van Den Driessche Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Alexandre Varnek Laboratoire de Chémoinformatique-UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Zhongyu Wang School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Eva B Wedebye Division of Risk Assessment and Nutrition, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Antony J Williams National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Hongbin Xie School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Alexey V Zakharov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ziye Zheng Chemistry Department, Umeå University, Umeå, Sweden
Richard S Judson National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA

Collapse

Mansouri K, Kleinstreuer N, Abdelaziz AM, Alberga D, Alves VM, Andersson PL, Andrade CH, Bai F, Balabin I, Ballabio D, Benfenati E, Bhhatarai B, Boyer S, Chen J, Consonni V, Farag S, Fourches D, García-Sosa AT, Gramatica P, Grisoni F, Grulke CM, Hong H, Horvath D, Hu X, Huang R, Jeliazkova N, Li J, Li X, Liu H, Manganelli S, Mangiatordi GF, Maran U, Marcou G, Martin T, Muratov E, Nguyen DT, Nicolotti O, Nikolov NG, Norinder U, Papa E, Petitjean M, Piir G, Pogodin P, Poroikov V, Qiao X, Richard AM, Roncaglioni A, Ruiz P, Rupakheti C, Sakkiah S, Sangion A, Schramm KW, Selvaraj C, Shah I, Sild S, Sun L, Taboureau O, Tang Y, Tetko IV, Todeschini R, Tong W, Trisciuzzi D, Tropsha A, Van Den Driessche G, Varnek A, Wang Z, Wedebye EB, Williams AJ, Xie H, Zakharov AV, Zheng Z, Judson RS. CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity. ENVIRONMENTAL HEALTH PERSPECTIVES 2020;128:27002. [PMID: 32074470 PMCID: PMC7064318 DOI: 10.1289/ehp5580] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 11/27/2019] [Accepted: 12/05/2019] [Indexed: 05/04/2023]

Abstract

BACKGROUND

OBJECTIVES

METHODS

RESULTS

DISCUSSION

The strengths and limitations of the consensus predictions were discussed with example chemicals; then, the models were implemented into the free and open-source OPERA application to enable screening of new chemicals with a defined applicability domain and accuracy assessment. This implementation was used to screen the entire EPA DSSTox database of ∼ 875,000 chemicals, and their predicted AR activities have been made available on the EPA CompTox Chemicals dashboard and National Toxicology Program's Integrated Chemical Environment. https://doi.org/10.1289/EHP5580.

Collapse

Affiliation(s)

Kamel Mansouri National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA ScitoVation LLC, Research Triangle Park, North Carolina, USA Integrated Laboratory Systems, Inc., Morrisville, North Carolina, USA
Nicole Kleinstreuer National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
Ahmed M. Abdelaziz Technische Universität München, Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt, Department für Biowissenschaftliche Grundlagen, Weihenstephaner Steig 23, 85350 Freising, Germany
Domenico Alberga Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Vinicius M. Alves Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Brazil Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Patrik L. Andersson Chemistry Department, Umeå University, Umeå, Sweden
Carolina H. Andrade Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Brazil
Fang Bai School of Pharmacy, Lanzhou University, China
Ilya Balabin Information Systems & Global Solutions (IS&GS), Lockheed Martin, USA
Davide Ballabio Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Emilio Benfenati Istituto di Ricerche Farmacologiche “Mario Negri”, IRCCS, Milan, Italy
Barun Bhhatarai QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Scott Boyer Swedish Toxicology Sciences Research Center, Karolinska Institutet, Södertälje, Sweden
Jingwen Chen School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Viviana Consonni Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Sherif Farag Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Denis Fourches Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Alfonso T. García-Sosa Institute of Chemistry, University of Tartu, Tartu, Estonia
Paola Gramatica QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Francesca Grisoni Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Chris M. Grulke National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Huixiao Hong Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Dragos Horvath Laboratoire de Chémoinformatique—UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Xin Hu National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ruili Huang National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Nina Jeliazkova IdeaConsult, Ltd., Sofia, Bulgaria
Jiazhong Li School of Pharmacy, Lanzhou University, China
Xuehua Li School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Huanxiang Liu School of Pharmacy, Lanzhou University, China
Serena Manganelli Istituto di Ricerche Farmacologiche “Mario Negri”, IRCCS, Milan, Italy
Giuseppe F. Mangiatordi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Uko Maran Institute of Chemistry, University of Tartu, Tartu, Estonia
Gilles Marcou Laboratoire de Chémoinformatique—UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Todd Martin National Risk Management Research Laboratory, U.S. EPA, Cincinnati, Ohio, USA
Eugene Muratov Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Dac-Trung Nguyen National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Orazio Nicolotti Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Nikolai G. Nikolov Division of Risk Assessment and Nutrition, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Ulf Norinder Swedish Toxicology Sciences Research Center, Karolinska Institutet, Södertälje, Sweden
Ester Papa QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Michel Petitjean Computational Modeling of Protein-Ligand Interactions (CMPLI)–INSERM UMR 8251, INSERM ERL U1133, Functional and Adaptative Biology (BFA), Universite de Paris, Paris, France
Geven Piir Institute of Chemistry, University of Tartu, Tartu, Estonia
Pavel Pogodin Institute of Biomedical Chemistry IBMC, 10 Building 8, Pogodinskaya st., Moscow 119121, Russia
Vladimir Poroikov Institute of Biomedical Chemistry IBMC, 10 Building 8, Pogodinskaya st., Moscow 119121, Russia
Xianliang Qiao School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Ann M. Richard National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Alessandra Roncaglioni Istituto di Ricerche Farmacologiche “Mario Negri”, IRCCS, Milan, Italy
Patricia Ruiz Computational Toxicology and Methods Development Laboratory, Division of Toxicology and Human Health Sciences, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Chetan Rupakheti National Risk Management Research Laboratory, U.S. EPA, Cincinnati, Ohio, USA Department of Biochemistry and Molecular Biophysics, University of Chicago, Chicago, Illinois, USA
Sugunadevi Sakkiah Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Alessandro Sangion QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
Karl-Werner Schramm Technische Universität München, Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt, Department für Biowissenschaftliche Grundlagen, Weihenstephaner Steig 23, 85350 Freising, Germany
Chandrabose Selvaraj Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Imran Shah National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Sulev Sild Institute of Chemistry, University of Tartu, Tartu, Estonia
Lixia Sun Department of Pharmaceutical Sciences, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Olivier Taboureau Computational Modeling of Protein-Ligand Interactions (CMPLI)–INSERM UMR 8251, INSERM ERL U1133, Functional and Adaptative Biology (BFA), Universite de Paris, Paris, France
Yun Tang Department of Pharmaceutical Sciences, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Igor V. Tetko BIGCHEM GmbH, Neuherberg, Germany Helmholtz Zentrum Muenchen – German Research Center for Environmental Health (GmbH), Neuherberg, Germany
Roberto Todeschini Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Weida Tong Division of Bioinformatics and Biostatistics, National Center for Toxicology Research, U.S. Food and Drug Administration, Jefferson, Arkansas, USA
Daniela Trisciuzzi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Alexander Tropsha Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
George Van Den Driessche Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Alexandre Varnek Laboratoire de Chémoinformatique—UMR7140, University of Strasbourg/CNRS, Strasbourg, France
Zhongyu Wang School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Eva B. Wedebye Division of Risk Assessment and Nutrition, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Antony J. Williams National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA
Hongbin Xie School of Environmental Science and Technology, Dalian University of Technology, Dalian, China
Alexey V. Zakharov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ziye Zheng Chemistry Department, Umeå University, Umeå, Sweden
Richard S. Judson National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency (U.S. EPA), Research Triangle Park, North Carolina, USA

Collapse

Applications of Molecular Dynamics Simulations in Computational Toxicology. ACTA ACUST UNITED AC 2019. [DOI: 10.1007/978-3-030-16443-0_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]

Competitive docking model for prediction of the human nicotinic acetylcholine receptor α7 binding of tobacco constituents. Oncotarget 2018;9:16899-16916. [PMID: 29682193 PMCID: PMC5908294 DOI: 10.18632/oncotarget.24458] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 02/01/2018] [Indexed: 12/21/2022] Open

Development of Decision Forest Models for Prediction of Drug-Induced Liver Injury in Humans Using A Large Set of FDA-approved Drugs. Sci Rep 2017;7:17311. [PMID: 29229971 PMCID: PMC5725422 DOI: 10.1038/s41598-017-17701-7] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 11/30/2017] [Indexed: 12/11/2022] Open

Sakkiah S, Selvaraj C, Gong P, Zhang C, Tong W, Hong H. Development of estrogen receptor beta binding prediction model using large sets of chemicals. Oncotarget 2017;8:92989-93000. [PMID: 29190972 PMCID: PMC5696238 DOI: 10.18632/oncotarget.21723] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 08/27/2017] [Indexed: 12/31/2022] Open

Selvaraj C, Sakkiah S, Tong W, Hong H. Molecular dynamics simulations and applications in computational toxicology and nanotoxicology. Food Chem Toxicol 2017;112:495-506. [PMID: 28843597 DOI: 10.1016/j.fct.2017.08.028] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Revised: 08/08/2017] [Accepted: 08/22/2017] [Indexed: 12/13/2022]

Hong H, Rua D, Sakkiah S, Selvaraj C, Ge W, Tong W. Consensus Modeling for Prediction of Estrogenic Activity of Ingredients Commonly Used in Sunscreen Products. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2016;13:ijerph13100958. [PMID: 27690075 PMCID: PMC5086697 DOI: 10.3390/ijerph13100958] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Revised: 09/16/2016] [Accepted: 09/20/2016] [Indexed: 11/16/2022]

sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides. Sci Rep 2016;6:32115. [PMID: 27558848 PMCID: PMC4997263 DOI: 10.1038/srep32115] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 08/02/2016] [Indexed: 12/19/2022] Open

Experimental Data Extraction and in Silico Prediction of the Estrogenic Activity of Renewable Replacements for Bisphenol A. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2016;13:ijerph13070705. [PMID: 27420082 PMCID: PMC4962246 DOI: 10.3390/ijerph13070705] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 07/01/2016] [Accepted: 07/05/2016] [Indexed: 01/23/2023]

Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. CERAPP: Collaborative Estrogen Receptor Activity Prediction Project. ENVIRONMENTAL HEALTH PERSPECTIVES 2016;124:1023-33. [PMID: 26908244 PMCID: PMC4937869 DOI: 10.1289/ehp.1510267] [Citation(s) in RCA: 222] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Revised: 10/05/2015] [Accepted: 02/08/2016] [Indexed: 05/18/2023]

Abstract

BACKGROUND

Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program.

OBJECTIVES

We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing.

METHODS

CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies.

RESULTS

Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing.

CONCLUSION

This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points.

CITATION

Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. 2016.

CERAPP

Collaborative Estrogen Receptor Activity Prediction Project. Environ Health Perspect 124:1023-1033; http://dx.doi.org/10.1289/ehp.1510267.

Collapse

Affiliation(s)

Kamel Mansouri National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, USA
Ahmed Abdelaziz Institute of Structural Biology, Helmholtz Zentrum Muenchen-German Research Center for Environmental Health (GmbH), Neuherberg, Germany
Aleksandra Rybacka Chemistry Department, Umeå University, Umeå, Sweden
Alessandra Roncaglioni Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Alexander Tropsha Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Alexandre Varnek Laboratoire de Chemoinformatique, University of Strasbourg, Strasbourg, France
Alexey Zakharov National Cancer Institute, National Institutes of Health (NIH), Department of Health and Human Services (DHHS), Bethesda, Maryland, USA
Andrew Worth Institute for Health and Consumer Protection (IHCP), Joint Research Centre of the European Commission in Ispra, Ispra, Italy
Ann M. Richard National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Christopher M. Grulke National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Daniela Trisciuzzi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Denis Fourches Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Dragos Horvath Laboratoire de Chemoinformatique, University of Strasbourg, Strasbourg, France
Emilio Benfenati Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Eugene Muratov Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Eva Bay Wedebye Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Francesca Grisoni Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Giuseppe F. Mangiatordi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Giuseppina M. Incisivo Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Huixiao Hong Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration (USDA), Jefferson, Arizona, USA
Hui W. Ng Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration (USDA), Jefferson, Arizona, USA
Igor V. Tetko Institute of Structural Biology, Helmholtz Zentrum Muenchen-German Research Center for Environmental Health (GmbH), Neuherberg, Germany BigChem GmbH, Neuherberg, Germany
Ilya Balabin High Performance Computing, Lockheed Martin, Research Triangle Park, North Carolina, USA
Jayaram Kancherla National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Jie Shen Research Institute for Fragrance Materials, Inc., Woodcliff Lake, New Jersey, USA
Julien Burton Institute for Health and Consumer Protection (IHCP), Joint Research Centre of the European Commission in Ispra, Ispra, Italy
Marc Nicklaus National Cancer Institute, National Institutes of Health (NIH), Department of Health and Human Services (DHHS), Bethesda, Maryland, USA
Matteo Cassotti Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Nikolai G. Nikolov Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Orazio Nicolotti Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Patrik L. Andersson Chemistry Department, Umeå University, Umeå, Sweden
Qingda Zang Integrated Laboratory Systems, Inc., Research Triangle Park, North Carolina, USA
Regina Politi Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Richard D. Beger Division of Systems Biology, National Center for Toxicological Research, USDA, Jefferson, Arizona, USA
Roberto Todeschini Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Ruili Huang National Center for Advancing Translational Sciences, NIH, DHHS, Bethesda, Maryland, USA
Sherif Farag Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Sine A. Rosenberg Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Svetoslav Slavov Integrated Laboratory Systems, Inc., Research Triangle Park, North Carolina, USA
Xin Hu National Center for Advancing Translational Sciences, NIH, DHHS, Bethesda, Maryland, USA
Richard S. Judson National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Address correspondence to R.S. Judson, U.S. EPA, National Center for Computational Toxicology, 109 T.W. Alexander Dr., Research Triangle Park, NC 27711 USA. Telephone: (919) 541-3085. E-mail:

Collapse

Hong H, Shen J, Ng HW, Sakkiah S, Ye H, Ge W, Gong P, Xiao W, Tong W. A Rat α-Fetoprotein Binding Activity Prediction Model to Facilitate Assessment of the Endocrine Disruption Potential of Environmental Chemicals. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2016;13:372. [PMID: 27023588 PMCID: PMC4847034 DOI: 10.3390/ijerph13040372] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Revised: 03/10/2016] [Accepted: 03/22/2016] [Indexed: 11/21/2022]

Hong H, Chen M, Ng HW, Tong W. QSAR Models at the US FDA/NCTR. Methods Mol Biol 2016;1425:431-59. [PMID: 27311476 DOI: 10.1007/978-1-4939-3609-0_18] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Ng HW, Doughty SW, Luo H, Ye H, Ge W, Tong W, Hong H. Development and Validation of Decision Forest Model for Estrogen Receptor Binding Prediction of Chemicals Using Large Data Sets. Chem Res Toxicol 2015;28:2343-51. [PMID: 26524122 DOI: 10.1021/acs.chemrestox.5b00358] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Tong W, Fang H, Xie Q, Hong H, Shi L, Perkins R, Scherf U, Goodsaid F, Frueh F. Gaining Confidence on Molecular Classification through Consensus Modeling and Validation. Toxicol Mech Methods 2012;16:59-68. [PMID: 20020998 DOI: 10.1080/15376520600558259] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

McPhail B, Tie Y, Hong H, Pearce BA, Schnackenberg LK, Ge W, Fuscoe JC, Tong W, Buzatu DA, Wilkes JG, Fowler BA, Demchuk E, Beger RD. Modeling chemical interaction profiles: I. Spectral data-activity relationship and structure-activity relationship models for inhibitors and non-inhibitors of cytochrome P450 CYP3A4 and CYP2D6 isozymes. Molecules 2012;17:3383-406. [PMID: 22421792 PMCID: PMC6268752 DOI: 10.3390/molecules17033383] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Revised: 02/27/2012] [Accepted: 02/28/2012] [Indexed: 02/07/2023] Open

Abstract

An interagency collaboration was established to model chemical interactions that may cause adverse health effects when an exposure to a mixture of chemicals occurs. Many of these chemicals—drugs, pesticides, and environmental pollutant—interact at the level of metabolic biotransformations mediated by cytochrome P450 (CYP) enzymes. In the present work, spectral data-activity relationship (SDAR) and structure-activity relationship (SAR) approaches were used to develop machine-learning classifiers of inhibitors and non-inhibitors of the CYP3A4 and CYP2D6 isozymes. The models were built upon 602 reference pharmaceutical compounds whose interactions have been deduced from clinical data, and 100 additional chemicals that were used to evaluate model performance in an external validation (EV) test. SDAR is an innovative modeling approach that relies on discriminant analysis applied to binned nuclear magnetic resonance (NMR) spectral descriptors. In the present work, both 1D ¹³C and 1D ¹⁵N-NMR spectra were used together in a novel implementation of the SDAR technique. It was found that increasing the binning size of 1D ¹³C-NMR and ¹⁵N-NMR spectra caused an increase in the tenfold cross-validation (CV) performance in terms of both the rate of correct classification and sensitivity. The results of SDAR modeling were verified using SAR. For SAR modeling, a decision forest approach involving from 6 to 17 Mold² descriptors in a tree was used. Average rates of correct classification of SDAR and SAR models in a hundred CV tests were 60% and 61% for CYP3A4, and 62% and 70% for CYP2D6, respectively. The rates of correct classification of SDAR and SAR models in the EV test were 73% and 86% for CYP3A4, and 76% and 90% for CYP2D6, respectively. Thus, both SDAR and SAR methods demonstrated a comparable performance in modeling a large set of structurally diverse data. Based on unique NMR structural descriptors, the new SDAR modeling method complements the existing SAR techniques, providing an independent estimator that can increase confidence in a structure-activity assessment. When modeling was applied to hazardous environmental chemicals, it was found that up to 20% of them may be substrates and up to 10% of them may be inhibitors of the CYP3A4 and CYP2D6 isoforms. The developed models provide a rare opportunity for the environmental health branch of the public health service to extrapolate to hazardous chemicals directly from human clinical data. Therefore, the pharmacological and environmental health branches are both expected to benefit from these reported models.

Collapse

Affiliation(s)

Brooks McPhail Division of Toxicology and Environmental Medicine, Agency for Toxic Substances and Disease Registry, Atlanta, GA 30333, USA; (B.M.); (Y.T.); (B.A.F.)
Yunfeng Tie Division of Toxicology and Environmental Medicine, Agency for Toxic Substances and Disease Registry, Atlanta, GA 30333, USA; (B.M.); (Y.T.); (B.A.F.)
Huixiao Hong Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Bruce A. Pearce Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Laura K. Schnackenberg Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Weigong Ge Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
James C. Fuscoe Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Weida Tong Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Dan A. Buzatu Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Jon G. Wilkes Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)
Bruce A. Fowler Division of Toxicology and Environmental Medicine, Agency for Toxic Substances and Disease Registry, Atlanta, GA 30333, USA; (B.M.); (Y.T.); (B.A.F.)
Eugene Demchuk Division of Toxicology and Environmental Medicine, Agency for Toxic Substances and Disease Registry, Atlanta, GA 30333, USA; (B.M.); (Y.T.); (B.A.F.) Department of Basic Pharmaceutical Sciences, West Virginia University, Morgantown, WV 26506-9530, USA Author to whom correspondence should be addressed; ; Tel.: +1-770-488-3327; Fax: +1-404-248-4142
Richard D. Beger Division of Systems Biology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (H.H.); (B.A.P.); (L.K.S.); (W.G.); (J.C.F.); (W.T.); (D.A.B.); (J.G.W.); (R.D.B.)

Collapse

Hong H, Goodsaid F, Shi L, Tong W. Molecular biomarkers: a US FDA effort. Biomark Med 2010;4:215-25. [PMID: 20406066 DOI: 10.2217/bmm.09.81] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Forest classification trees and forest support vector machines algorithms: Demonstration using microarray data. Comput Biol Med 2010;40:519-24. [DOI: 10.1016/j.compbiomed.2010.03.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2009] [Revised: 01/09/2010] [Accepted: 03/22/2010] [Indexed: 11/22/2022]

Tsai YS, Lin CT, Tseng GC, Chung IF, Pal NR. Discovery of dominant and dormant genes from expression data using a novel generalization of SNR for multi-class problems. BMC Bioinformatics 2008;9:425. [PMID: 18842155 PMCID: PMC2620271 DOI: 10.1186/1471-2105-9-425] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2008] [Accepted: 10/09/2008] [Indexed: 12/14/2022] Open

Abstract

Background

The Signal-to-Noise-Ratio (SNR) is often used for identification of biomarkers for two-class problems and no formal and useful generalization of SNR is available for multiclass problems. We propose innovative generalizations of SNR for multiclass cancer discrimination through introduction of two indices, Gene Dominant Index and Gene Dormant Index (GDIs). These two indices lead to the concepts of dominant and dormant genes with biological significance. We use these indices to develop methodologies for discovery of dominant and dormant biomarkers with interesting biological significance. The dominancy and dormancy of the identified biomarkers and their excellent discriminating power are also demonstrated pictorially using the scatterplot of individual gene and 2-D Sammon's projection of the selected set of genes. Using information from the literature we have shown that the GDI based method can identify dominant and dormant genes that play significant roles in cancer biology. These biomarkers are also used to design diagnostic prediction systems.

Results and discussion

To evaluate the effectiveness of the GDIs, we have used four multiclass cancer data sets (Small Round Blue Cell Tumors, Leukemia, Central Nervous System Tumors, and Lung Cancer). For each data set we demonstrate that the new indices can find biologically meaningful genes that can act as biomarkers. We then use six machine learning tools, Nearest Neighbor Classifier (NNC), Nearest Mean Classifier (NMC), Support Vector Machine (SVM) classifier with linear kernel, and SVM classifier with Gaussian kernel, where both SVMs are used in conjunction with one-vs-all (OVA) and one-vs-one (OVO) strategies. We found GDIs to be very effective in identifying biomarkers with strong class specific signatures. With all six tools and for all data sets we could achieve better or comparable prediction accuracies usually with fewer marker genes than results reported in the literature using the same computational protocols. The dominant genes are usually easy to find while good dormant genes may not always be available as dormant genes require stronger constraints to be satisfied; but when they are available, they can be used for authentication of diagnosis.

Conclusion

Since GDI based schemes can find a small set of dominant/dormant biomarkers that is adequate to design diagnostic prediction systems, it opens up the possibility of using real-time qPCR assays or antibody based methods such as ELISA for an easy and low cost diagnosis of diseases. The dominant and dormant genes found by GDIs can be used in different ways to design more reliable diagnostic prediction systems.

Collapse

Hong H, Xie Q, Ge W, Qian F, Fang H, Shi L, Su Z, Perkins R, Tong W. Mold(2), molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics. J Chem Inf Model 2008;48:1337-44. [PMID: 18564836 DOI: 10.1021/ci800038f] [Citation(s) in RCA: 182] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Rokach L. An evolutionary algorithm for constructing a decision forest: Combining the classification of disjoints decision trees. INT J INTELL SYST 2008. [DOI: 10.1002/int.20277] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Beisvag V, Lehre PK, Midelfart H, Aass H, Geiran O, Sandvik AK, Laegreid A, Komorowski J, Ellingsen O. Aetiology-specific patterns in end-stage heart failure patients identified by functional annotation and classification of microarray data. Eur J Heart Fail 2006;8:381-9. [PMID: 16753336 DOI: 10.1016/j.ejheart.2006.05.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2005] [Revised: 03/07/2006] [Accepted: 05/09/2006] [Indexed: 11/21/2022] Open

Xie Q, Ratnasinghe LD, Hong H, Perkins R, Tang ZZ, Hu N, Taylor PR, Tong W. Decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method. BMC Bioinformatics 2005;6 Suppl 2:S4. [PMID: 16026601 PMCID: PMC1637030 DOI: 10.1186/1471-2105-6-s2-s4] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Abstract

Background

Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world.

Methods

High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes.

We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data.

Results

The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk.

Conclusion

The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses.

Collapse