52
|
Shuryak I. Advantages of Synthetic Noise and Machine Learning for Analyzing Radioecological Data Sets. PLoS One 2017; 12:e0170007. [PMID: 28068401 PMCID: PMC5222373 DOI: 10.1371/journal.pone.0170007] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 12/26/2016] [Indexed: 11/25/2022] Open
Abstract
The ecological effects of accidental or malicious radioactive contamination are insufficiently understood because of the hazards and difficulties associated with conducting studies in radioactively-polluted areas. Data sets from severely contaminated locations can therefore be small. Moreover, many potentially important factors, such as soil concentrations of toxic chemicals, pH, and temperature, can be correlated with radiation levels and with each other. In such situations, commonly-used statistical techniques like generalized linear models (GLMs) may not be able to provide useful information about how radiation and/or these other variables affect the outcome (e.g. abundance of the studied organisms). Ensemble machine learning methods such as random forests offer powerful alternatives. We propose that analysis of small radioecological data sets by GLMs and/or machine learning can be made more informative by using the following techniques: (1) adding synthetic noise variables to provide benchmarks for distinguishing the performances of valuable predictors from irrelevant ones; (2) adding noise directly to the predictors and/or to the outcome to test the robustness of analysis results against random data fluctuations; (3) adding artificial effects to selected predictors to test the sensitivity of the analysis methods in detecting predictor effects; (4) running a selected machine learning method multiple times (with different random-number seeds) to test the robustness of the detected “signal”; (5) using several machine learning methods to test the “signal’s” sensitivity to differences in analysis techniques. Here, we applied these approaches to simulated data, and to two published examples of small radioecological data sets: (I) counts of fungal taxa in samples of soil contaminated by the Chernobyl nuclear power plan accident (Ukraine), and (II) bacterial abundance in soil samples under a ruptured nuclear waste storage tank (USA). We show that the proposed techniques were advantageous compared with the methodology used in the original publications where the data sets were presented. Specifically, our approach identified a negative effect of radioactive contamination in data set I, and suggested that in data set II stable chromium could have been a stronger limiting factor for bacterial abundance than the radionuclides 137Cs and 99Tc. This new information, which was extracted from these data sets using the proposed techniques, can potentially enhance the design of radioactive waste bioremediation.
Collapse
Affiliation(s)
- Igor Shuryak
- Center for Radiological Research, Columbia University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
53
|
Rodríguez J, Barrera-Animas AY, Trejo LA, Medina-Pérez MA, Monroy R. Ensemble of One-Class Classifiers for Personal Risk Detection Based on Wearable Sensor Data. SENSORS 2016; 16:s16101619. [PMID: 27690054 PMCID: PMC5087407 DOI: 10.3390/s16101619] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 09/24/2016] [Accepted: 09/24/2016] [Indexed: 11/17/2022]
Abstract
This study introduces the One-Class K-means with Randomly-projected features Algorithm (OCKRA). OCKRA is an ensemble of one-class classifiers built over multiple projections of a dataset according to random feature subsets. Algorithms found in the literature spread over a wide range of applications where ensembles of one-class classifiers have been satisfactorily applied; however, none is oriented to the area under our study: personal risk detection. OCKRA has been designed with the aim of improving the detection performance in the problem posed by the Personal RIsk DEtection(PRIDE) dataset. PRIDE was built based on 23 test subjects, where the data for each user were captured using a set of sensors embedded in a wearable band. The performance of OCKRA was compared against support vector machine and three versions of the Parzen window classifier. On average, experimental results show that OCKRA outperformed the other classifiers for at least 0.53% of the area under the curve (AUC). In addition, OCKRA achieved an AUC above 90% for more than 57% of the users.
Collapse
Affiliation(s)
- Jorge Rodríguez
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Carretera al Lago de Guadalupe Km. 3.5, Atizapán, Edo. de México C.P. 52926, Mexico.
| | - Ari Y Barrera-Animas
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Carretera al Lago de Guadalupe Km. 3.5, Atizapán, Edo. de México C.P. 52926, Mexico.
| | - Luis A Trejo
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Carretera al Lago de Guadalupe Km. 3.5, Atizapán, Edo. de México C.P. 52926, Mexico.
| | - Miguel Angel Medina-Pérez
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Carretera al Lago de Guadalupe Km. 3.5, Atizapán, Edo. de México C.P. 52926, Mexico.
| | - Raúl Monroy
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Carretera al Lago de Guadalupe Km. 3.5, Atizapán, Edo. de México C.P. 52926, Mexico.
| |
Collapse
|
54
|
Gurard-Levin ZA, Wilson LOW, Pancaldi V, Postel-Vinay S, Sousa FG, Reyes C, Marangoni E, Gentien D, Valencia A, Pommier Y, Cottu P, Almouzni G. Chromatin Regulators as a Guide for Cancer Treatment Choice. Mol Cancer Ther 2016; 15:1768-77. [PMID: 27196757 DOI: 10.1158/1535-7163.mct-15-1008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Accepted: 04/26/2016] [Indexed: 12/22/2022]
Abstract
The limited capacity to predict a patient's response to distinct chemotherapeutic agents is a major hurdle in cancer management. The efficiency of a large fraction of current cancer therapeutics (radio- and chemotherapies) is influenced by chromatin structure. Reciprocally, alterations in chromatin organization may affect resistance mechanisms. Here, we explore how the misexpression of chromatin regulators-factors involved in the establishment and maintenance of functional chromatin domains-can inform about the extent of docetaxel response. We exploit Affymetrix and NanoString gene expression data for a set of chromatin regulators generated from breast cancer patient-derived xenograft models and patient samples treated with docetaxel. Random Forest classification reveals specific panels of chromatin regulators, including key components of the SWI/SNF chromatin remodeler, which readily distinguish docetaxel high-responders and poor-responders. Further exploration of SWI/SNF components in the comprehensive NCI-60 dataset reveals that the expression inversely correlates with docetaxel sensitivity. Finally, we show that loss of the SWI/SNF subunit BRG1 (SMARCA4) in a model cell line leads to enhanced docetaxel sensitivity. Altogether, our findings point toward chromatin regulators as biomarkers for drug response as well as therapeutic targets to sensitize patients toward docetaxel and combat drug resistance. Mol Cancer Ther; 15(7); 1768-77. ©2016 AACR.
Collapse
Affiliation(s)
- Zachary A Gurard-Levin
- Institut Curie, PSL Research University, CNRS, UMR3664, Equipe Labellisée Ligue contre le Cancer, Paris, France. Sorbonne Universités, UPMC Universite Paris 06, CNRS, UMR3664, Paris, France.
| | - Laurence O W Wilson
- Institut Curie, PSL Research University, CNRS, UMR3664, Equipe Labellisée Ligue contre le Cancer, Paris, France. Sorbonne Universités, UPMC Universite Paris 06, CNRS, UMR3664, Paris, France
| | - Vera Pancaldi
- Spanish National Cancer Research Centre (CNIO), c/Melchor Fernandez, Almagro, Madrid, Spain
| | - Sophie Postel-Vinay
- DITEP (Département d'Innovations Thérapeutiques et Essais Précoces), Gustave Roussy, France. Inserm Unit U981, Gustave Roussy, Villejuif, France. Université Paris Saclay, Université Paris-Sud, Faculté de Médicine, Le Kremlin Bicêtre, France
| | - Fabricio G Sousa
- Developmental Therapeutics Branch and Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Cecile Reyes
- Institut Curie, PSL Research University, Translational Research Department, Genomics Platform, Paris, France
| | - Elisabetta Marangoni
- Institut Curie, PSL Research University, Translational Research Department, Genomics Platform, Paris, France
| | - David Gentien
- Institut Curie, PSL Research University, Translational Research Department, Genomics Platform, Paris, France
| | - Alfonso Valencia
- Spanish National Cancer Research Centre (CNIO), c/Melchor Fernandez, Almagro, Madrid, Spain
| | - Yves Pommier
- Developmental Therapeutics Branch and Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Paul Cottu
- Institut Curie, Medical Oncology, Paris, France
| | - Geneviève Almouzni
- Institut Curie, PSL Research University, CNRS, UMR3664, Equipe Labellisée Ligue contre le Cancer, Paris, France. Sorbonne Universités, UPMC Universite Paris 06, CNRS, UMR3664, Paris, France.
| |
Collapse
|
55
|
Franke SK, van Kesteren RE, Hofman S, Wubben JAM, Smit AB, Philippens IHCHM. Individual and Familial Susceptibility to MPTP in a Common Marmoset Model for Parkinson's Disease. NEURODEGENER DIS 2016; 16:293-303. [PMID: 26999593 DOI: 10.1159/000442574] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 11/11/2015] [Indexed: 11/19/2022] Open
Abstract
INTRODUCTION Insight into susceptibility mechanisms underlying Parkinson's disease (PD) would aid the understanding of disease etiology, enable target finding and benefit the development of more refined disease-modifying strategies. METHODS We used intermittent low-dose MPTP (0.5 mg/kg/week) injections in marmosets and measured multiple behavioral and neurochemical parameters. Genetically diverse monkeys from different breeding families were selected to investigate inter- and intrafamily differences in susceptibility to MPTP treatment. RESULTS We show that such differences exist in clinical signs, in particular nonmotor PD-related behaviors, and that they are accompanied by differences in neurotransmitter levels. In line with the contribution of a genetic component, different susceptibility phenotypes could be traced back through genealogy to individuals of the different families. CONCLUSION Our findings show that low-dose MPTP treatment in marmosets represents a clinically relevant PD model, with a window of opportunity to examine the onset of the disease, allowing the detection of individual variability in disease susceptibility, which may be of relevance for the diagnosis and treatment of PD in humans.
Collapse
Affiliation(s)
- Sigrid K Franke
- Department of Molecular and Cellular Neurobiology, Center for Neurogenomics and Cognitive Research, Neuroscience Campus Amsterdam, VU University, Amsterdam, The Netherlands
| | | | | | | | | | | |
Collapse
|
56
|
Shuryak I, Dadachova E. Quantitative Modeling of Microbial Population Responses to Chronic Irradiation Combined with Other Stressors. PLoS One 2016; 11:e0147696. [PMID: 26808049 PMCID: PMC4726741 DOI: 10.1371/journal.pone.0147696] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Accepted: 12/07/2015] [Indexed: 11/19/2022] Open
Abstract
Microbial population responses to combined effects of chronic irradiation and other stressors (chemical contaminants, other sub-optimal conditions) are important for ecosystem functioning and bioremediation in radionuclide-contaminated areas. Quantitative mathematical modeling can improve our understanding of these phenomena. To identify general patterns of microbial responses to multiple stressors in radioactive environments, we analyzed three data sets on: (1) bacteria isolated from soil contaminated by nuclear waste at the Hanford site (USA); (2) fungi isolated from the Chernobyl nuclear-power plant (Ukraine) buildings after the accident; (3) yeast subjected to continuous γ-irradiation in the laboratory, where radiation dose rate and cell removal rate were independently varied. We applied generalized linear mixed-effects models to describe the first two data sets, whereas the third data set was amenable to mechanistic modeling using differential equations. Machine learning and information-theoretic approaches were used to select the best-supported formalism(s) among biologically-plausible alternatives. Our analysis suggests the following: (1) Both radionuclides and co-occurring chemical contaminants (e.g. NO2) are important for explaining microbial responses to radioactive contamination. (2) Radionuclides may produce non-monotonic dose responses: stimulation of microbial growth at low concentrations vs. inhibition at higher ones. (3) The extinction-defining critical radiation dose rate is dramatically lowered by additional stressors. (4) Reproduction suppression by radiation can be more important for determining the critical dose rate, than radiation-induced cell mortality. In conclusion, the modeling approaches used here on three diverse data sets provide insight into explaining and predicting multi-stressor effects on microbial communities: (1) the most severe effects (e.g. extinction) on microbial populations may occur when unfavorable environmental conditions (e.g. fluctuations of temperature and/or nutrient levels) coincide with radioactive contamination; (2) an organism’s radioresistance and bioremediation efficiency in rich laboratory media may be insufficient to carry out radionuclide bioremediation in the field—robustness against multiple stressors is needed.
Collapse
Affiliation(s)
- Igor Shuryak
- Center for Radiological Research, Columbia University, New York, NY, United States of America
- * E-mail:
| | - Ekaterina Dadachova
- Department of Radiology, Albert Einstein College of Medicine, Bronx, New York, United States of America
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, New York, United States of America
| |
Collapse
|
57
|
Liao B, Ding S, Chen H, Li Z, Cai L. Identifying human microRNA–disease associations by a new diffusion-based method. J Bioinform Comput Biol 2015; 13:1550014. [DOI: 10.1142/s0219720015500146] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Identifying the microRNA–disease relationship is vital for investigating the pathogenesis of various diseases. However, experimental verification of disease-related microRNAs remains considerable challenge to many researchers, particularly for the fact that numerous new microRNAs are discovered every year. As such, development of computational methods for disease-related microRNA prediction has recently gained eminent attention. In this paper, first, we construct a miRNA functional network and a disease similarity network by integrating different information sources. Then, we further introduce a new diffusion-based method (NDBM) to explore global network similarity for miRNA–disease association inference. Even though known miRNA–disease associations in the database are rare, NDBM still achieves an area under the ROC curve (AUC) of 85.62% in the leave-one-out cross-validation in improving the prediction accuracy of previous methods significantly. Moreover, our method is applicable to diseases with no known related miRNAs as well as new miRNAs with unknown target diseases. Some associations who strongly predicted by our method are confirmed by public databases. These superior performances suggest that NDBM could be an effective and important tool for biomedical research.
Collapse
Affiliation(s)
- Bo Liao
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China
| | - Sumei Ding
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China
| | - Haowen Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China
| | - Zejun Li
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China
| | - Lijun Cai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China
| |
Collapse
|