1
|
Janoudi G, Uzun (Rada) M, Fell DB, Ray JG, Foster AM, Giffen R, Clifford T, Walker MC. Outlier analysis for accelerating clinical discovery: An augmented intelligence framework and a systematic review. PLOS DIGITAL HEALTH 2024; 3:e0000515. [PMID: 38776276 PMCID: PMC11111092 DOI: 10.1371/journal.pdig.0000515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 04/19/2024] [Indexed: 05/24/2024]
Abstract
Clinical discoveries largely depend on dedicated clinicians and scientists to identify and pursue unique and unusual clinical encounters with patients and communicate these through case reports and case series. This process has remained essentially unchanged throughout the history of modern medicine. However, these traditional methods are inefficient, especially considering the modern-day availability of health-related data and the sophistication of computer processing. Outlier analysis has been used in various fields to uncover unique observations, including fraud detection in finance and quality control in manufacturing. We propose that clinical discovery can be formulated as an outlier problem within an augmented intelligence framework to be implemented on any health-related data. Such an augmented intelligence approach would accelerate the identification and pursuit of clinical discoveries, advancing our medical knowledge and uncovering new therapies and management approaches. We define clinical discoveries as contextual outliers measured through an information-based approach and with a novelty-based root cause. Our augmented intelligence framework has five steps: define a patient population with a desired clinical outcome, build a predictive model, identify outliers through appropriate measures, investigate outliers through domain content experts, and generate scientific hypotheses. Recognizing that the field of obstetrics can particularly benefit from this approach, as it is traditionally neglected in commercial research, we conducted a systematic review to explore how outlier analysis is implemented in obstetric research. We identified two obstetrics-related studies that assessed outliers at an aggregate level for purposes outside of clinical discovery. Our findings indicate that using outlier analysis in clinical research in obstetrics and clinical research, in general, requires further development.
Collapse
Affiliation(s)
- Ghayath Janoudi
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
| | | | - Deshayne B. Fell
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
| | - Joel G. Ray
- Departments of Medicine, Health Policy Management and Evaluation, and Obstetrics and Gynecology, St Michael’s Hospital, University of Toronto, Toronto, Canada
| | - Angel M. Foster
- Faculty of Health Sciences, University of Ottawa, Ottawa, Canada
| | | | - Tammy Clifford
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
- Canadian Institute of Health Research, Government of Canada, Ottawa, Canada
| | - Mark C. Walker
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Canada
- International and Global Health Office, University of Ottawa, Ottawa, Canada
- Department of Obstetrics and Gynecology, University of Ottawa, Ottawa, Canada
- Department of Obstetrics, Gynecology & Newborn Care, The Ottawa Hospital, Ottawa, Canada
- BORN Ontario, Children’s Hospital of Eastern Ontario, Ottawa, Canada
| |
Collapse
|
2
|
Zhang X, Shi S, Sun H, Chen D, Wang G, Wu K. ACVAE: A novel self-adversarial variational auto-encoder combined with contrast learning for time series anomaly detection. Neural Netw 2024; 171:383-395. [PMID: 38141474 DOI: 10.1016/j.neunet.2023.12.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 10/23/2023] [Accepted: 12/13/2023] [Indexed: 12/25/2023]
Abstract
Deep generative models have advantages in modeling complex time series and are widely used in anomaly detection. Nevertheless, the existing deep generative approaches mainly concentrate on the investigation of models' reconstruction capability rather than customizing a model suitable for anomaly detection. Meanwhile, VAE-based models suffer from posterior collapse, which can lead to a series of undesirable consequences, such as high false positive rate etc. Based on these considerations, in this paper, we propose a novel self-adversarial variational auto-encoder combined with contrast learning, short for ACVAE, to address these challenges. ACVAE consist of three parts 〈T,E,G〉, wherein the transformation network T is employed to generate abnormal latent representations similar to those normal latent representations encoded by the encoder E, and the decoder G is used to distinguish the two representations. In the framework of this model, the normal reconstructions are considered as positive samples and abnormal reconstructions as negative samples, and the contrast learning is executed on the part E to measure the similarities between inputs and positive samples, dissimilarities between inputs and negative samples. Thus, an improved objective function is proposed by integrating two novel regularizers, one refers to adversarial mechanism and the other involves contrast learning, in which the encoder E and decoder G hold the capability to distinguish, and decoder G is constrained to mitigate the posterior collapse. We perform several experiments on five datasets, whose results show ACVAE outperforms state-of-the-art methods.
Collapse
Affiliation(s)
- Xiaoxia Zhang
- Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China; Key Laboratory of Big Data Intelligent Computing, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
| | - Shang Shi
- Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China; Key Laboratory of Big Data Intelligent Computing, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - HaiChao Sun
- Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China; Key Laboratory of Big Data Intelligent Computing, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
| | - Degang Chen
- Department of Mathematics and Physics, North China Electric Power University, Beijing, 102206, China
| | - Guoyin Wang
- Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China; Key Laboratory of Big Data Intelligent Computing, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
| | - Kesheng Wu
- Computational Research Division, Lawrance Berkeley National Laboratory, Berkeley, 94720, USA
| |
Collapse
|
3
|
Clermont G. The Learning Electronic Health Record. Crit Care Clin 2023; 39:689-700. [PMID: 37704334 DOI: 10.1016/j.ccc.2023.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2023]
Abstract
Electronic medical records (EMRs) constitute the electronic version of all medical information included in a patient's paper chart. The electronic health record (EHR) technology has witnessed massive expansion in developed countries and to a lesser extent in underresourced countries during the last 2 decades. We will review factors leading to this expansion, how the emergence of EHRs is affecting several health-care stakeholders; some of the growing pains associated with EHRs with a particular emphasis on the delivery of care to the critically ill; and ongoing developments on the path to improve the quality of research, health-care delivery, and stakeholder satisfaction.
Collapse
Affiliation(s)
- Gilles Clermont
- VA Pittsburgh Medical Center, 1054 Aliquippa Street, Pittsburgh, PA 15104, USA; Critical Care Medicine, University of Pittsburgh, 200 Lothrop Street, Pittsburgh, PA 15061, USA.
| |
Collapse
|
4
|
Lee JM, Hauskrecht M. Personalized event prediction for Electronic Health Records. Artif Intell Med 2023; 143:102620. [PMID: 37673563 PMCID: PMC10503594 DOI: 10.1016/j.artmed.2023.102620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 03/01/2023] [Accepted: 04/24/2023] [Indexed: 09/08/2023]
Abstract
Clinical event sequences consist of hundreds of clinical events that represent records of patient care in time. Developing accurate predictive models of such sequences is of a great importance for supporting a variety of models for interpreting/classifying the current patient condition, or predicting adverse clinical events and outcomes, all aimed to improve patient care. One important challenge of learning predictive models of clinical sequences is their patient-specific variability. Based on underlying clinical conditions, each patient's sequence may consist of different sets of clinical events (observations, lab results, medications, procedures). Hence, simple population-wide models learned from event sequences for many different patients may not accurately predict patient-specific dynamics of event sequences and their differences. To address the problem, we propose and investigate multiple new event sequence prediction models and methods that let us better adjust the prediction for individual patients and their specific conditions. The methods developed in this work pursue refinement of population-wide models to subpopulations, self-adaptation, and a meta-level model switching that is able to adaptively select the model with the best chance to support the immediate prediction. We analyze and test the performance of these models on clinical event sequences of patients in MIMIC-III database.
Collapse
Affiliation(s)
- Jeong Min Lee
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA.
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
5
|
She C, Zeng S. An efficient local outlier detection optimized by rough clustering. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-211433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Outlier detection is a hot issue in data mining, which has plenty of real-world applications. LOF (Local Outlier Factor) can capture the abnormal degree of objects in the dataset with different density levels, and many extended algorithms have been proposed in recent years. However, the LOF needs to search the nearest neighborhood of each object on the whole dataset, which greatly increases the time cost. Most of these extended algorithms only consider the distance between an object and its neighborhood, but ignore the local distribution of an object within its neighborhood, resulting in a high false-positive rate. To improve the running speed, a rough clustering based on triple fusion is proposed, which divides a dataset into several subsets and outlier detection is performed only on each subset. Then, considering the local distribution of an object within its neighborhood, a new local outlier factor is constructed to estimate the abnormal degree of each object. Finally, the experimental results indicate that the proposed algorithm has better performance and lower running time than the others.
Collapse
Affiliation(s)
- Chunyan She
- College of Computer and Information Science, Chongqing Normal University, Chongqing, China
- Chongqing Center of Engineering Technology Research on Digital Agricultural Service, Chongqing, China
| | - Shaohua Zeng
- College of Computer and Information Science, Chongqing Normal University, Chongqing, China
- Chongqing Center of Engineering Technology Research on Digital Agricultural Service, Chongqing, China
| |
Collapse
|
6
|
Lee JM, Hauskrecht M. Learning to Adapt Dynamic Clinical Event Sequences with Residual Mixture of Experts. Artif Intell Med 2022. [DOI: 10.1007/978-3-031-09342-5_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
7
|
Liu S, Hauskrecht M. Event Outlier Detection in Continuous Time. PROCEEDINGS OF MACHINE LEARNING RESEARCH 2021; 139:6793-6803. [PMID: 34712956 PMCID: PMC8549655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Continuous-time event sequences represent discrete events occurring in continuous time. Such sequences arise frequently in real-life. Usually we expect the sequences to follow some regular pattern over time. However, sometimes these patterns may be interrupted by unexpected absence or occurrences of events. Identification of these unexpected cases can be very important as they may point to abnormal situations that need human attention. In this work, we study and develop methods for detecting outliers in continuous-time event sequences, including unexpected absence and unexpected occurrences of events. Since the patterns that event sequences tend to follow may change in different contexts, we develop outlier detection methods based on point processes that can take context information into account. Our methods are based on Bayesian decision theory and hypothesis testing with theoretical guarantees. To test the performance of the methods, we conduct experiments on both synthetic data and real-world clinical data and show the effectiveness of the proposed methods.
Collapse
Affiliation(s)
- Siqi Liu
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA
- Borealis AI, Vancouver, BC, Canada
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
8
|
Barren M, Hauskrecht M. Improving Prediction of Low-Prior Clinical Events with Simultaneous General Patient-State Representation Learning. ARTIFICIAL INTELLIGENCE IN MEDICINE. CONFERENCE ON ARTIFICIAL INTELLIGENCE IN MEDICINE (2005- ) 2021; 12721:479-490. [PMID: 34308430 PMCID: PMC8301230 DOI: 10.1007/978-3-030-77211-6_57] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2023]
Abstract
Low-prior targets are common among many important clinical events, which introduces the challenge of having enough data to support learning of their predictive models. Many prior works have addressed this problem by first building a general patient-state representation model, and then adapting it to a new low-prior prediction target. In this schema, there is potential for the predictive performance to be hindered by the misalignment between the general patient-state model and the target task. To overcome this challenge, we propose a new method that simultaneously optimizes a shared model through multi-task learning of both the low-prior supervised target and general purpose patient-state representation (GPSR). More specifically, our method improves prediction performance of a low-prior task by jointly optimizing a shared model that combines the loss of the target event and a broad range of generic clinical events. We study the approach in the context of Recurrent Neural Networks (RNNs). Through extensive experiments on multiple clinical event targets using MIMIC-III [8] data, we show that the inclusion of general patient-state representation tasks during model training improves the prediction of individual low-prior targets.
Collapse
|
9
|
Li L, Yan J, Wang H, Jin Y. Anomaly Detection of Time Series With Smoothness-Inducing Sequential Variational Auto-Encoder. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:1177-1191. [PMID: 32287020 DOI: 10.1109/tnnls.2020.2980749] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Deep generative models have demonstrated their effectiveness in learning latent representation and modeling complex dependencies of time series. In this article, we present a smoothness-inducing sequential variational auto-encoder (VAE) (SISVAE) model for the robust estimation and anomaly detection of multidimensional time series. Our model is based on VAE, and its backbone is fulfilled by a recurrent neural network to capture latent temporal structures of time series for both the generative model and the inference model. Specifically, our model parameterizes mean and variance for each time-stamp with flexible neural networks, resulting in a nonstationary model that can work without the assumption of constant noise as commonly made by existing Markov models. However, such flexibility may cause the model fragile to anomalies. To achieve robust density estimation which can also benefit detection tasks, we propose a smoothness-inducing prior over possible estimations. The proposed prior works as a regularizer that places penalty at nonsmooth reconstructions. Our model is learned efficiently with a novel stochastic gradient variational Bayes estimator. In particular, we study two decision criteria for anomaly detection: reconstruction probability and reconstruction error. We show the effectiveness of our model on both synthetic data sets and public real-world benchmarks.
Collapse
|
10
|
Lee JM, Hauskrecht M. Modeling multivariate clinical event time-series with recurrent temporal mechanisms. Artif Intell Med 2021; 112:102021. [PMID: 33581828 PMCID: PMC7943294 DOI: 10.1016/j.artmed.2021.102021] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 12/26/2020] [Accepted: 01/10/2021] [Indexed: 12/18/2022]
Abstract
In this work, we propose a novel autoregressive event time-series model that can predict future occurrences of multivariate clinical events. Our model represents multivariate event time-series using different temporal mechanisms aimed to fit different temporal characteristics of the time-series. In particular, information about distant past is modeled through the hidden state space defined by an LSTM-based model, information on recently observed clinical events is modeled through discriminative projections, and information about periodic (repeated) events is modeled using a special recurrent mechanism based on probability distributions of inter-event gaps compiled from past data. We evaluate our proposed model on electronic health record (EHRs) data derived from MIMIC-III dataset. We show that our new model equipped with the above temporal mechanisms leads to improved prediction performance compared to multiple baselines.
Collapse
Affiliation(s)
- Jeong Min Lee
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| |
Collapse
|
11
|
Goryaeva AM, Lapointe C, Dai C, Dérès J, Maillet JB, Marinica MC. Reinforcing materials modelling by encoding the structures of defects in crystalline solids into distortion scores. Nat Commun 2020; 11:4691. [PMID: 32943615 PMCID: PMC7499431 DOI: 10.1038/s41467-020-18282-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Accepted: 08/13/2020] [Indexed: 11/09/2022] Open
Abstract
This work revises the concept of defects in crystalline solids and proposes a universal strategy for their characterization at the atomic scale using outlier detection based on statistical distances. The proposed strategy provides a generic measure that describes the distortion score of local atomic environments. This score facilitates automatic defect localization and enables a stratified description of defects, which allows to distinguish the zones with different levels of distortion within the structure. This work proposes applications for advanced materials modelling ranging from the surrogate concept for the energy per atom to the relevant information selection for evaluation of energy barriers from the mean force. Moreover, this concept can serve for design of robust interatomic machine learning potentials and high-throughput analysis of their databases. The proposed definition of defects opens up many perspectives for materials design and characterization, promoting thereby the development of novel techniques in materials science.
Collapse
Affiliation(s)
- Alexandra M Goryaeva
- Université Paris-Saclay, CEA, Service de Recherches de Métallurgie Physique, Gif-sur-Yvette, 91191, France.
| | - Clovis Lapointe
- Université Paris-Saclay, CEA, Service de Recherches de Métallurgie Physique, Gif-sur-Yvette, 91191, France
| | - Chendi Dai
- Université Paris-Saclay, CEA, Service de Recherches de Métallurgie Physique, Gif-sur-Yvette, 91191, France
| | - Julien Dérès
- Université Paris-Saclay, CEA, Service de Recherches de Métallurgie Physique, Gif-sur-Yvette, 91191, France
| | | | - Mihai-Cosmin Marinica
- Université Paris-Saclay, CEA, Service de Recherches de Métallurgie Physique, Gif-sur-Yvette, 91191, France.
| |
Collapse
|
12
|
Response score of deep learning for out-of-distribution sample detection of medical images. J Biomed Inform 2020; 107:103442. [PMID: 32450299 DOI: 10.1016/j.jbi.2020.103442] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 05/02/2020] [Accepted: 05/05/2020] [Indexed: 02/07/2023]
Abstract
Deep learning Convolutional Neural Networks have achieved remarkable performance in a variety of classification tasks. The data-driven nature of deep learning indicates that a model behaves in response to the data used to train the model, and the quality of datasets may lead to substantial influence on the model's performance, especially when dealing with complicated clinical images. In this paper, we propose a simple and novel method to investigate and quantify a deep learning model's response with respect to a given sample, allowing us to detect out-of-distribution samples based on a newly proposed metric, Response Score. The key idea is that samples belonging to different classes may have different degrees of influence on a model. We quantify the resulting consequence of a single sample to a trained-model and relate the quantitative measure of the consequence (by the Response Score) to detect the out-of-distribution samples. The proposed method can find multiple applications such as (1) recognizing abnormal samples, (2) detecting mixed-domain data, and (3) identifying mislabeled data. We present extensive experiments on the three different applications using four biomedical imaging datasets. Experimental results show that our method exhibits remarkable performance and outperforms the compared methods.
Collapse
|
13
|
Data-Driven Activities Involving Electronic Health Records: An Activity and Task Analysis Framework for Interactive Visualization Tools. MULTIMODAL TECHNOLOGIES AND INTERACTION 2020. [DOI: 10.3390/mti4010007] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Electronic health records (EHRs) can be used to make critical decisions, to study the effects of treatments, and to detect hidden patterns in patient histories. In this paper, we present a framework to identify and analyze EHR-data-driven tasks and activities in the context of interactive visualization tools (IVTs)—that is, all the activities, sub-activities, tasks, and sub-tasks that are and can be supported by EHR-based IVTs. A systematic literature survey was conducted to collect the research papers that describe the design, implementation, and/or evaluation of EHR-based IVTs that support clinical decision-making. Databases included PubMed, the ACM Digital Library, the IEEE Library, and Google Scholar. These sources were supplemented by gray literature searching and reference list reviews. Of the 946 initially identified articles, the survey analyzes 19 IVTs described in 24 articles that met the final selection criteria. The survey includes an overview of the goal of each IVT, a brief description of its visualization, and an analysis of how sub-activities, tasks, and sub-tasks blend and combine to accomplish the tool’s main higher-level activities of interpreting, predicting, and monitoring. Our proposed framework shows the gaps in support of higher-level activities supported by existing IVTs. It appears that almost all existing IVTs focus on the activity of interpreting, while only a few of them support predicting and monitoring—this despite the importance of these activities in assisting users in finding patients that are at high risk and tracking patients’ status after treatment.
Collapse
|
14
|
Kimura D, Chaudhury S, Narita M, Munawar A, Tachibana R. Adversarial Discriminative Attention for Robust Anomaly Detection. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) 2020. [DOI: 10.1109/wacv45572.2020.9093428] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
15
|
Lee JM, Hauskrecht M. Multi-scale Temporal Memory for Clinical Event Time-Series Prediction. Artif Intell Med 2020. [DOI: 10.1007/978-3-030-59137-3_28] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
16
|
Kathuria M, Gambhir S. Critical Condition Detection Using Lion Hunting Optimizer and SVM Classifier in a Healthcare WBAN. INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS 2020. [DOI: 10.4018/ijehmc.2020010104] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A timely critical condition detection and early notification are two essential requirements in a healthcare wireless body area network for the correct treatment of patients. However, most of the systems have limited capabilities and so could not detect the exact condition in a precise time interval. In addition to these it needs a reduction in the false alert rate, as issuing alerts for the deviation in each incoming packet increases the false alert rate and these false alerts consume more network resources. In order to fulfill the above-mentioned requirements, a dynamic alert system has been designed in this regard to make it more efficient, also, a new kind of hybridization approach is being introduced to it with the additive support of a nature-inspired optimization strategy named Lion Hunting and a machine-learning technique called support vector machine. The simulation is done using a network simulator NS-2.35, and the proposed alerting system outperforms others.
Collapse
|
17
|
Rozenblum R, Rodriguez-Monguio R, Volk LA, Forsythe KJ, Myers S, McGurrin M, Williams DH, Bates DW, Schiff G, Seoane-Vazquez E. Using a Machine Learning System to Identify and Prevent Medication Prescribing Errors: A Clinical and Cost Analysis Evaluation. Jt Comm J Qual Patient Saf 2019; 46:3-10. [PMID: 31786147 DOI: 10.1016/j.jcjq.2019.09.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 09/13/2019] [Accepted: 09/16/2019] [Indexed: 11/15/2022]
Abstract
BACKGROUND Clinical decision support (CDS) alerting tools can identify and reduce medication errors. However, they are typically rule-based and can identify only the errors previously programmed into their alerting logic. Machine learning holds promise for improving medication error detection and reducing costs associated with adverse events. This study evaluates the ability of a machine learning system (MedAware) to generate clinically valid alerts and estimates the cost savings associated with potentially prevented adverse events. METHODS Alerts were generated retrospectively by the MedAware system on outpatient data from two academic medical centers between 2009 and 2013. MedAware alerts were compared to alerts in an existing CDS system. A random sample of 300 alerts was selected for medical record review. Frequency and severity of potential outcomes of alerted medication errors of medium and high clinical value were estimated, along with associated health care costs of these potentially prevented adverse events. RESULTS A total of 10,668 alerts were generated. Overall, 68.2% of MedAware alerts would not have been generated by the existing CDS system. Ninety-two percent of a random sample of the chart-reviewed alerts were accurate based on structured data available in the record, and 79.7% were clinically valid. Estimated cost of adverse events potentially prevented in an outpatient setting was more than $60 per drug alert and $1.3 million when extrapolating study findings to the full patient population. CONCLUSION A machine learning system identified clinically valid medication error alerts that might otherwise be missed with existing CDS systems. Estimates show potential for cost savings associated with potentially prevented adverse events.
Collapse
|
18
|
Fast supervised novelty detection and its application in remote sensing. Soft comput 2019. [DOI: 10.1007/s00500-018-03740-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
19
|
Lin CH, Hsu KC, Johnson KR, Luby M, Fann YC. Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes. Int J Med Inform 2019; 132:103988. [PMID: 31590140 DOI: 10.1016/j.ijmedinf.2019.103988] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 08/20/2019] [Accepted: 10/01/2019] [Indexed: 12/13/2022]
Abstract
INTRODUCTION Clinicians commonly use the modified Rankin Scale (mRS) and the Barthel Index (BI) to measure clinical outcome after stroke. These are potential targets in machine learning models for stroke outcome prediction. Therefore, the quality of the measurements is crucial for training and validation of these models. The objective of this study was to apply and evaluate density-based outlier detection methods for identifying potentially incorrect measurements in multiple large stroke datasets to assess the measurement quality. METHOD We applied three density-based outlier detection methods including density-based spatial clustering of applications (DBSCAN), hierarchical DBSCAN (HDBSCAN) and local outlier factor (LOF) based on a large dataset obtained from a nationwide prospective stroke registry in Taiwan. The testing of each method was done by using four different NINDS funded stroke datasets. RESULT The DBSCAN achieved a high performance across all mRS values where the highest average accuracy was 99.2 ± 0.7 at mRS of 4 and the lowest average accuracy was 92.0 ± 4.6 at mRS of 3. The LOF also achieved similar performance, however, the HDBSCAN with default parameters setting required further tuning improvement. CONCLUSION The density-based outlier detection methods were proven to be promising for validation of stroke outcome measures. The outlier detection algorithm developed from a large prospective registry dataset was effectively applied in four different NINDS stroke datasets with high performance results. The tool developed from this detection algorithm can be further applied to real world datasets to increase the data quality in stroke outcome measures.
Collapse
Affiliation(s)
- Ching-Heng Lin
- Center for Information Technology, National Institutes of Health, Bethesda, MD, United States
| | - Kai-Cheng Hsu
- Bioinformatics Section, National Institute of Neurological Disorder and Stroke, National Institutes of Health, Bethesda, MD, United States; Department of Neurology, National Taiwan University Hospital, Taipei, Taiwan
| | - Kory R Johnson
- Bioinformatics Section, National Institute of Neurological Disorder and Stroke, National Institutes of Health, Bethesda, MD, United States
| | - Marie Luby
- Stroke Branch, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, United States
| | - Yang C Fann
- Bioinformatics Section, National Institute of Neurological Disorder and Stroke, National Institutes of Health, Bethesda, MD, United States.
| |
Collapse
|
20
|
A probabilistic framework for predicting disease dynamics: A case study of psychotic depression. J Biomed Inform 2019; 95:103232. [PMID: 31201965 DOI: 10.1016/j.jbi.2019.103232] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 04/30/2019] [Accepted: 06/11/2019] [Indexed: 11/23/2022]
Abstract
Unsupervised learning is often used to obtain insight into the underlying structure of medical data, but it is not always clear how to use such structure in an effective way. In this paper, we propose a probabilistic framework for predicting disease dynamics guided by latent states. The framework is based on hidden Markov models and aims to facilitate the selection of hypotheses that might yield insight into the dynamics. We demonstrate this by using clinical trial data for psychotic depression treatment as a case study. The discovered latent structure and proposed outcome are then validated using standard depression criteria, and are shown to provide new insight into the heterogeneity of psychotic depression in terms of predictive symptoms for different interventions.
Collapse
|
21
|
Lee JM, Hauskrecht M. Recent Context-aware LSTM for Clinical Event Time-series Prediction. ARTIFICIAL INTELLIGENCE IN MEDICINE. CONFERENCE ON ARTIFICIAL INTELLIGENCE IN MEDICINE (2005- ) 2019; 11526:13-23. [PMID: 31528857 DOI: 10.1007/978-3-030-21642-9_3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
In this work, we propose a novel clinical event time-series model based on the long short-term memory architecture (LSTM) that can predict future event occurrences for a large number of different clinical events. Our model relies on two sources of information to predict future events. One source is derived from the set of recently observed clinical events. The other one is based on the hidden state space defined by the LSTM that aims to abstract past, more distant, patient information that is predictive of future events. We evaluate our proposed model on electronic health record (EHRs) data derived from MIMIC-III dataset. We show that the combination of the two sources of information implemented in our method leads to improved prediction performance compared to the models based on individual sources.
Collapse
|
22
|
Huang CY, Nguyen PA, Yang HC, Islam MM, Liang CW, Lee FP, Jack Li YC. A probabilistic model for reducing medication errors: A sensitivity analysis using Electronic Health Records data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2019; 170:31-38. [PMID: 30712602 DOI: 10.1016/j.cmpb.2018.12.033] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 12/20/2018] [Accepted: 12/29/2018] [Indexed: 06/09/2023]
Abstract
OBJECTIVES Medication-related clinical decision support systems have already been considered as a sophisticated method to improve healthcare quality, however, its importance has not been fully recognized. This paper's aim was to validate an existing probabilistic model that can automatically identify medication errors by performing a sensitivity analysis from electronic medical record data. METHODS We first built a knowledge base that consisted of 2.22 million disease-medication (DM) and 0.78 million medication-medication (MM) associations using Taiwan Health and Welfare data science claims data between January 1st, 2009 and December 31st, 2011. Further, we collected 0.6 million outpatient visit prescriptions from six departments across five different medical centers/hospitals. Afterward, we employed the data to our AESOP model and validated it using a sensitivity analysis of 11 various thresholds (α = [0.5; 1.5]) that were used to identify positive DM and MM associations. We randomly selected 2400 randomly prescriptions and compared them to the gold standard of 18 physicians' manual review for appropriateness. RESULTS One hundred twenty-one results of 2400 prescriptions with various thresholds were tested by the AESOP model. Validation against the gold standard showed a high accuracy (over 80%), sensitivity (80-96%), and positive predictive value (over 85%). The negative predictive values ranged from 45 to 75% across three departments, cardiology, neurology, and ophthalmology. CONCLUSION We performed a sensitivity analysis and validated the AESOP model in different hospitals. Thus, picking the optimal threshold of the model depended on balancing false negatives with false positives and depending on the specialty and the purpose of the system.
Collapse
Affiliation(s)
- Chu-Ya Huang
- Graduate Institute of Biomedical Informatics, College of Medical Science & Technology, Taipei Medical University, Taipei, Taiwan; Taiwan College of Healthcare Executives, Taipei, Taiwan
| | - Phung-Anh Nguyen
- International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan
| | - Hsuan-Chia Yang
- International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan
| | - Md Mohaimenul Islam
- Graduate Institute of Biomedical Informatics, College of Medical Science & Technology, Taipei Medical University, Taipei, Taiwan
| | - Chia-Wei Liang
- International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan
| | - Fei-Peng Lee
- Department of Otolaryngology, Taipei Medical University Shuang-Ho Hospital, New Taipei City, Taiwan
| | - Yu-Chuan Jack Li
- Graduate Institute of Biomedical Informatics, College of Medical Science & Technology, Taipei Medical University, Taipei, Taiwan; International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan; Department of Dermatology, Taipei Medical University Wan-Fang Hospital, Taipei, Taiwan.
| |
Collapse
|
23
|
Luo Z, Hauskrecht M. Hierarchical Active Learning with Proportion Feedback on Regions. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES : EUROPEAN CONFERENCE, ECML PKDD ... : PROCEEDINGS. ECML PKDD (CONFERENCE) 2019; 11052:464-480. [PMID: 30740605 DOI: 10.1007/978-3-030-10928-8_28] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Learning of classification models in practice often relies on human annotation effort in which humans assign class labels to data instances. As this process can be very time-consuming and costly, finding effective ways to reduce the annotation cost becomes critical for building such models. To solve this problem, instead of soliciting instance-based annotation we explore region-based annotation as the feedback. A region is defined as a hyper-cubic subspace of the input feature space and it covers a subpopulation of data instances that fall into this region. Each region is labeled with a number in [0,1] (in binary classification setting), representing a human estimate of the positive (or negative) class proportion in the subpopulation. To learn a classifier from region-based feedback we develop an active learning framework that hierarchically divides the input space into smaller and smaller regions. In each iteration we split the region with the highest potential to improve the classification models. This iterative process allows us to gradually learn more refined classification models from more specific regions with more accurate proportions. Through experiments on numerous datasets we demonstrate that our approach offers a new and promising active learning direction that can outperform existing active learning approaches especially in situations when labeling budget is limited and small.
Collapse
Affiliation(s)
- Zhipeng Luo
- Department of Computer Science University of Pittsburgh, Pittsburgh PA 15260, USA
| | - Milos Hauskrecht
- Department of Computer Science University of Pittsburgh, Pittsburgh PA 15260, USA
| |
Collapse
|
24
|
Barda AJ, Ruiz VM, Gigliotti T, Tsui FR. An argument for reporting data standardization procedures in multi-site predictive modeling: case study on the impact of LOINC standardization on model performance. JAMIA Open 2019; 2:197-204. [PMID: 30944914 PMCID: PMC6435008 DOI: 10.1093/jamiaopen/ooy063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2018] [Revised: 11/22/2018] [Accepted: 12/20/2018] [Indexed: 11/13/2022] Open
Abstract
Objectives We aimed to gain a better understanding of how standardization of laboratory data can impact predictive model performance in multi-site datasets. We hypothesized that standardizing local laboratory codes to logical observation identifiers names and codes (LOINC) would produce predictive models that significantly outperform those learned utilizing local laboratory codes. Materials and Methods We predicted 30-day hospital readmission for a set of heart failure-specific visits to 13 hospitals from 2008 to 2012. Laboratory test results were extracted and then manually cleaned and mapped to LOINC. We extracted features to summarize laboratory data for each patient and used a training dataset (2008–2011) to learn models using a variety of feature selection techniques and classifiers. We evaluated our hypothesis by comparing model performance on an independent test dataset (2012). Results Models that utilized LOINC performed significantly better than models that utilized local laboratory test codes, regardless of the feature selection technique and classifier approach used. Discussion and Conclusion We quantitatively demonstrated the positive impact of standardizing multi-site laboratory data to LOINC prior to use in predictive models. We used our findings to argue for the need for detailed reporting of data standardization procedures in predictive modeling, especially in studies leveraging multi-site datasets extracted from electronic health records.
Collapse
Affiliation(s)
- Amie J Barda
- Tsui Laboratory, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Victor M Ruiz
- Tsui Laboratory, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Tony Gigliotti
- Information Services Division, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, USA
| | - Fuchiang Rich Tsui
- Tsui Laboratory, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.,Department of Anesthesiology and Critical Care Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.,School of Computing Information, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.,Department of Bioengineering, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
25
|
Xue Y, Hauskrecht M. Active Learning of Multi-class Classification Models from Ordered Class Sets. PROCEEDINGS OF THE ... AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE. AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE 2019; 33:5589-5596. [PMID: 31750011 PMCID: PMC6867686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In this paper, we study the problem of learning multi-class classification models from a limited set of labeled examples obtained from human annotator. We propose a new machine learning framework that learns multi-class classification models from ordered class sets the annotator may use to express not only her top class choice but also other competing classes still under consideration. Such ordered sets of competing classes are common, for example, in various diagnostic tasks. In this paper, we first develop strategies for learning multi-class classification models from examples associated with ordered class set information. After that we develop an active learning strategy that considers such a feedback. We evaluate the benefit of the framework on multiple datasets. We show that class-order feedback and active learning can reduce the annotation cost both individually and jointly.
Collapse
Affiliation(s)
- Yanbing Xue
- Department of Computer Science, University of Pittsburgh,
| | | |
Collapse
|
26
|
Unsupervised Novelty Detection Using Deep Autoencoders with Density Based Clustering. APPLIED SCIENCES-BASEL 2018. [DOI: 10.3390/app8091468] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Novelty detection is a classification problem to identify abnormal patterns; therefore, it is an important task for applications such as fraud detection, fault diagnosis and disease detection. However, when there is no label that indicates normal and abnormal data, it will need expensive domain and professional knowledge, so an unsupervised novelty detection approach will be used. On the other hand, nowadays, using novelty detection on high dimensional data is a big challenge and previous research suggests approaches based on principal component analysis (PCA) and an autoencoder in order to reduce dimensionality. In this paper, we propose deep autoencoders with density based clustering (DAE-DBC); this approach calculates compressed data and error threshold from deep autoencoder model, sending the results to a density based cluster. Points that are not involved in any groups are not considered a novelty; the grouping points will be defined as a novelty group depending on the ratio of the points exceeding the error threshold. We have conducted the experiment by substituting components to show that the components of the proposed method together are more effective. As a result of the experiment, the DAE-DBC approach is more efficient; its area under the curve (AUC) is shown to be 13.5 percent higher than state-of-the-art algorithms and other versions of the proposed method that we have demonstrated.
Collapse
|
27
|
Liu S, Wright A, Hauskrecht M. Change-point detection method for clinical decision support system rule monitoring. Artif Intell Med 2018; 91:49-56. [PMID: 30041919 DOI: 10.1016/j.artmed.2018.06.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 03/24/2018] [Accepted: 06/16/2018] [Indexed: 11/25/2022]
Abstract
A clinical decision support system (CDSS) helps clinicians to manage patients, but malfunctions of its components or other systems on which it depends may affect its intended functions. Monitoring the system and detecting changes in its behavior that may indicate the malfunction can help to avoid any potential costs associated with its improper operation. In this paper, we investigate the problem of detecting changes in the CDSS operation, in particular its monitoring and alerting subsystem, by monitoring its rule firing counts. We aim to screen and detect changes in real-time, that is whenever a new datum (rule firing count) arrives, we want to have a score indicating how likely there is a change in the system. We develop a new method based on Seasonal-Trend decomposition with locally weighted regression (Loess) and likelihood ratio statistics to detect the changes. Experiments on daily rule-firing-count data collected from a real CDSS and known change-points show that our method improves the detection performance when compared with existing change-point detection methods.
Collapse
Affiliation(s)
- Siqi Liu
- Department of Computer Science, University of Pittsburgh, USA.
| | - Adam Wright
- Brigham and Women's Hospital and Harvard Medical School, USA.
| | | |
Collapse
|
28
|
Santos HDPD, Ulbrich AHDPS, Woloszyn V, Vieira R. DDC-Outlier: Preventing Medication Errors Using Unsupervised Learning. IEEE J Biomed Health Inform 2018; 23:874-881. [PMID: 29993649 DOI: 10.1109/jbhi.2018.2828028] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Electronic health records have brought valuable improvements to hospital practices by integrating patient information. In fact, the understanding of these data can prevent mistakes that may put patients' lives at risk. Nonetheless, to the best of our knowledge, there are no previous studies addressing the automatic detection of outlier prescriptions, regarding dosage and frequency. In this paper, we propose an unsupervised method, called density-distance-centrality (DDC), to detect potential outlier prescriptions. A dataset with 563 thousand prescribed medications was used to assess our proposed approach against different state-of-the-art techniques for outlier detection. In the experiments, our approach achieves better results in the task of overdose and underdose detection in medical prescriptions, compared to other methods applied to this problem. Additionally, most of the false positive instances detected by our algorithm were potential prescriptions errors.
Collapse
|
29
|
Liu Z, Hauskrecht M. A Personalized Predictive Framework for Multivariate Clinical Time Series via Adaptive Model Selection. PROCEEDINGS OF THE ... ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT. ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT 2017; 2017:1169-1177. [PMID: 29296289 DOI: 10.1145/3132847.3132859] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Building of an accurate predictive model of clinical time series for a patient is critical for understanding of the patient condition, its dynamics, and optimal patient management. Unfortunately, this process is not straightforward. First, patient-specific variations are typically large and population-based models derived or learned from many different patients are often unable to support accurate predictions for each individual patient. Moreover, time series observed for one patient at any point in time may be too short and insufficient to learn a high-quality patient-specific model just from the patient's own data. To address these problems we propose, develop and experiment with a new adaptive forecasting framework for building multivariate clinical time series models for a patient and for supporting patient-specific predictions. The framework relies on the adaptive model switching approach that at any point in time selects the most promising time series model out of the pool of many possible models, and consequently, combines advantages of the population, patient-specific and short-term individualized predictive models. We demonstrate that the adaptive model switching framework is very promising approach to support personalized time series prediction, and that it is able to outperform predictions based on pure population and patient-specific models, as well as, other patient-specific model adaptation strategies.
Collapse
Affiliation(s)
- Zitao Liu
- Pinterest, 651 Brannan St, San Francisco, California 94107
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| |
Collapse
|
30
|
Xue Y, Hauskrecht M. Efficient Learning of Classification Models from Soft-label Information by Binning and Ranking. PROCEEDINGS OF THE ... INTERNATIONAL FLORIDA AI RESEARCH SOCIETY CONFERENCE. FLORIDA AI RESEARCH SYMPOSIUM 2017; 2017:164-169. [PMID: 28725883 PMCID: PMC5512716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Construction of classification models from data in practice often requires additional human effort to annotate (label) observed data instances. However, this annotation effort may often be too costly and only a limited number of data instances may be feasibly labeled. The challenge is to find methods that let us reduce the number of the labeled instances but at the same time preserve the quality of the learned models. In this paper we study the idea of learning classification from soft label information in which each instance is associated with a soft-label further refining its class label. One caveat of applying this idea is that soft-labels based on human assessment are often noisy. To address this problem, we develop and test a new classification model learning algorithm that relies on soft-label binning to limit the effect of soft-label noise. We show this approach is able to learn classification models more rapidly and with a smaller number of labeled instances than (1) existing soft label learning methods, as well as, (2) methods that learn from class-label information.
Collapse
Affiliation(s)
- Yanbing Xue
- Department of Computer Science, University of Pittsburgh,
| | | |
Collapse
|
31
|
Liu S, Wright A, Hauskrecht M. Online Conditional Outlier Detection in Nonstationary Time Series. PROCEEDINGS OF THE ... INTERNATIONAL FLORIDA AI RESEARCH SOCIETY CONFERENCE. FLORIDA AI RESEARCH SYMPOSIUM 2017; 2017:86-91. [PMID: 29644345 PMCID: PMC5891145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The objective of this work is to develop methods for detecting outliers in time series data. Such methods can become the key component of various monitoring and alerting systems, where an outlier may be equal to some adverse condition that needs human attention. However, real-world time series are often affected by various sources of variability present in the environment that may influence the quality of detection; they may (1) explain some of the changes in the signal that would otherwise lead to false positive detections, as well as, (2) reduce the sensitivity of the detection algorithm leading to increase in false negatives. To alleviate these problems, we propose a new two-layer outlier detection approach that first tries to model and account for the nonstationarity and periodic variation in the time series, and then tries to use other observable variables in the environment to explain any additional signal variation. Our experiments on several data sets in different domains show that our method provides more accurate modeling of the time series, and that it is able to significantly improve outlier detection performance.
Collapse
Affiliation(s)
- Siqi Liu
- Department of Computer Science, University of Pittsburgh
| | - Adam Wright
- Brigham and Women's Hospital and Harvard Medical School
| | | |
Collapse
|
32
|
Xue Y, Hauskrecht M. Active Learning of Classification Models with Likert-Scale Feedback. PROCEEDINGS OF THE ... SIAM INTERNATIONAL CONFERENCE ON DATA MINING. SIAM INTERNATIONAL CONFERENCE ON DATA MINING 2017; 2017:28-35. [PMID: 28979827 PMCID: PMC5624557 DOI: 10.1137/1.9781611974973.4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Annotation of classification data by humans can be a time-consuming and tedious process. Finding ways of reducing the annotation effort is critical for building the classification models in practice and for applying them to a variety of classification tasks. In this paper, we develop a new active learning framework that combines two strategies to reduce the annotation effort. First, it relies on label uncertainty information obtained from the human in terms of the Likert-scale feedback. Second, it uses active learning to annotate examples with the greatest expected change. We propose a Bayesian approach to calculate the expectation and an incremental SVM solver to reduce the time complexity of the solvers. We show the combination of our active learning strategy and the Likert-scale feedback can learn classification models more rapidly and with a smaller number of labeled instances than methods that rely on either Likert-scale labels or active learning alone.
Collapse
Affiliation(s)
- Yanbing Xue
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
33
|
Outlier-based detection of unusual patient-management actions: An ICU study. J Biomed Inform 2016; 64:211-221. [PMID: 27720983 DOI: 10.1016/j.jbi.2016.10.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Revised: 08/21/2016] [Accepted: 10/04/2016] [Indexed: 11/22/2022]
Abstract
Medical errors remain a significant problem in healthcare. This paper investigates a data-driven outlier-based monitoring and alerting framework that uses data in the Electronic Medical Records (EMRs) repositories of past patient cases to identify any unusual clinical actions in the EMR of a current patient. Our conjecture is that these unusual clinical actions correspond to medical errors often enough to justify their detection and alerting. Our approach works by using EMR repositories to learn statistical models that relate patient states to patient-management actions. We evaluated this approach on the EMR data for 24,658 intensive care unit (ICU) patient cases. A total of 16,500 cases were used to train statistical models for ordering medications and laboratory tests given the patient state summarizing the patient's clinical history. The models were applied to a separate test set of 8158 ICU patient cases and used to generate alerts. A subset of 240 alerts generated by the models were evaluated and assessed by eighteen ICU clinicians. The overall true positive rates for the alerts (TPARs) ranged from 0.44 to 0.71. The TPAR for medication order alerts specifically ranged from 0.31 to 0.61 and for laboratory order alerts from 0.44 to 0.75. These results support outlier-based alerting as a promising new approach to data-driven clinical alerting that is generated automatically based on past EMR data.
Collapse
|
34
|
Moskovitch R, Wang F, Shahar Y, Hripcsak G. Temporal data analytics. J Biomed Inform 2016. [DOI: 10.1016/j.jbi.2016.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
35
|
Johnson AEW, Ghassemi MM, Nemati S, Niehaus KE, Clifton DA, Clifford GD. Machine Learning and Decision Support in Critical Care. PROCEEDINGS OF THE IEEE. INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS 2016; 104:444-466. [PMID: 27765959 PMCID: PMC5066876 DOI: 10.1109/jproc.2015.2501978] [Citation(s) in RCA: 160] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Clinical data management systems typically provide caregiver teams with useful information, derived from large, sometimes highly heterogeneous, data sources that are often changing dynamically. Over the last decade there has been a significant surge in interest in using these data sources, from simply re-using the standard clinical databases for event prediction or decision support, to including dynamic and patient-specific information into clinical monitoring and prediction problems. However, in most cases, commercial clinical databases have been designed to document clinical activity for reporting, liability and billing reasons, rather than for developing new algorithms. With increasing excitement surrounding "secondary use of medical records" and "Big Data" analytics, it is important to understand the limitations of current databases and what needs to change in order to enter an era of "precision medicine." This review article covers many of the issues involved in the collection and preprocessing of critical care data. The three challenges in critical care are considered: compartmentalization, corruption, and complexity. A range of applications addressing these issues are covered, including the modernization of static acuity scoring; on-line patient tracking; personalized prediction and risk assessment; artifact detection; state estimation; and incorporation of multimodal data sources such as genomic and free text data.
Collapse
Affiliation(s)
- Alistair E. W. Johnson
- Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Boston, USA
| | - Mohammad M. Ghassemi
- Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Boston, USA
| | - Shamim Nemati
- Department of Biomedical Informatics, Emory University, Atlanta, USA
| | - Katherine E. Niehaus
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - David A. Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Gari D. Clifford
- Department of Biomedical Informatics, Emory University, Atlanta, USA; Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, USA
| |
Collapse
|
36
|
Liu Z, Hauskrecht M. Learning Adaptive Forecasting Models from Irregularly Sampled Multivariate Clinical Data. PROCEEDINGS OF THE ... AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE. AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE 2016; 2016:1273-1279. [PMID: 27525189 PMCID: PMC4980099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Building accurate predictive models of clinical multivariate time series is crucial for understanding of the patient condition, the dynamics of a disease, and clinical decision making. A challenging aspect of this process is that the model should be flexible and adaptive to reflect well patient-specific temporal behaviors and this also in the case when the available patient-specific data are sparse and short span. To address this problem we propose and develop an adaptive two-stage forecasting approach for modeling multivariate, irregularly sampled clinical time series of varying lengths. The proposed model (1) learns the population trend from a collection of time series for past patients; (2) captures individual-specific short-term multivariate variability; and (3) adapts by automatically adjusting its predictions based on new observations. The proposed forecasting model is evaluated on a real-world clinical time series dataset. The results demonstrate the benefits of our approach on the prediction tasks for multivariate, irregularly sampled clinical time series, and show that it can outperform both the population based and patient-specific time series prediction models in terms of prediction accuracy.
Collapse
Affiliation(s)
- Zitao Liu
- Computer Science Department, University of Pittsburgh, 210 South Bouquet St., Pittsburgh, PA, 15260 USA
| | - Milos Hauskrecht
- Computer Science Department, University of Pittsburgh, 210 South Bouquet St., Pittsburgh, PA, 15260 USA
| |
Collapse
|
37
|
Hong C, Hauskrecht M. Multivariate Conditional Outlier Detection and Its Clinical Application. PROCEEDINGS OF THE ... AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE. AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE 2016; 2016:4216-4217. [PMID: 27226927 PMCID: PMC4877029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
This paper overviews and discusses our recent work on a multivariate conditional outlier detection framework for clinical applications.
Collapse
|
38
|
Batal I, Cooper G, Fradkin D, Harrison J, Moerchen F, Hauskrecht M. An Efficient Pattern Mining Approach for Event Detection in Multivariate Temporal Data. Knowl Inf Syst 2016; 46:115-150. [PMID: 26752800 PMCID: PMC4704806 DOI: 10.1007/s10115-015-0819-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2013] [Revised: 08/31/2014] [Accepted: 12/06/2014] [Indexed: 11/27/2022]
Abstract
This work proposes a pattern mining approach to learn event detection models from complex multivariate temporal data, such as electronic health records. We present Recent Temporal Pattern mining, a novel approach for efficiently finding predictive patterns for event detection problems. This approach first converts the time series data into time-interval sequences of temporal abstractions. It then constructs more complex time-interval patterns backward in time using temporal operators. We also present the Minimal Predictive Recent Temporal Patterns framework for selecting a small set of predictive and non-spurious patterns. We apply our methods for predicting adverse medical events in real-world clinical data. The results demonstrate the benefits of our methods in learning accurate event detection models, which is a key step for developing intelligent patient monitoring and decision support systems.
Collapse
Affiliation(s)
| | - Gregory Cooper
- Department of Biomedical Informatics, University of Pittsburgh,
| | | | - James Harrison
- Department of Public Health Sciences, University of Virginia,
| | | | | |
Collapse
|
39
|
Hong C, Batal I, Hauskrecht M. A Generalized Mixture Framework for Multi-label Classification. PROCEEDINGS OF THE ... SIAM INTERNATIONAL CONFERENCE ON DATA MINING. SIAM INTERNATIONAL CONFERENCE ON DATA MINING 2015; 2015:712-720. [PMID: 26613069 PMCID: PMC4657574 DOI: 10.1137/1.9781611974010.80] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We develop a novel probabilistic ensemble framework for multi-label classification that is based on the mixtures-of-experts architecture. In this framework, we combine multi-label classification models in the classifier chains family that decompose the class posterior distribution P(Y1, …, Yd |X) using a product of posterior distributions over components of the output space. Our approach captures different input-output and output-output relations that tend to change across data. As a result, we can recover a rich set of dependency relations among inputs and outputs that a single multi-label classification model cannot capture due to its modeling simplifications. We develop and present algorithms for learning the mixtures-of-experts models from data and for performing multi-label predictions on unseen data instances. Experiments on multiple benchmark datasets demonstrate that our approach achieves highly competitive results and outperforms the existing state-of-the-art multi-label classification methods.
Collapse
Affiliation(s)
- Charmgil Hong
- Department of Computer Science, University of Pittsburgh
| | | | | |
Collapse
|
40
|
Heim E, Hauskrecht M. Sparse Multidimensional Patient Modeling using Auxiliary Confidence Labels. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2015; 2015:331-336. [PMID: 26949568 PMCID: PMC4774858 DOI: 10.1109/bibm.2015.7359703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In this work, we focus on the problem of learning a classification model that performs inference on patient Electronic Health Records (EHRs). Often, a large amount of costly expert supervision is required to learn such a model. To reduce this cost, we obtain confidence labels that indicate how sure an expert is in the class labels she provides. If meaningful confidence information can be incorporated into a learning method, fewer patient instances may need to be labeled to learn an accurate model. In addition, while accuracy of predictions is important for any inference model, a model of patients must be interpretable so that clinicians can understand how the model is making decisions. To these ends, we develop a novel metric learning method called Confidence bAsed MEtric Learning (CAMEL) that supports inclusion of confidence labels, but also emphasizes interpretability in three ways. First, our method induces sparsity, thus producing simple models that use only a few features from patient EHRs. Second, CAMEL naturally produces confidence scores that can be taken into consideration when clinicians make treatment decisions. Third, the metrics learned by CAMEL induce multidimensional spaces where each dimension represents a different "factor" that clinicians can use to assess patients. In our experimental evaluation, we show on a real-world clinical data set that our CAMEL methods are able to learn models that are as or more accurate as other methods that use the same supervision. Furthermore, we show that when CAMEL uses confidence scores it is able to learn models as or more accurate as others we tested while using only 10% of the training instances. Finally, we perform qualitative assessments on the metrics learned by CAMEL and show that they identify and clearly articulate important factors in how the model performs inference.
Collapse
|
41
|
Liu Z, Hauskrecht M. Clinical time series prediction: Toward a hierarchical dynamical system framework. Artif Intell Med 2015; 65:5-18. [PMID: 25534671 PMCID: PMC4422790 DOI: 10.1016/j.artmed.2014.10.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Revised: 10/08/2014] [Accepted: 10/09/2014] [Indexed: 11/16/2022]
Abstract
OBJECTIVE Developing machine learning and data mining algorithms for building temporal models of clinical time series is important for understanding of the patient condition, the dynamics of a disease, effect of various patient management interventions and clinical decision making. In this work, we propose and develop a novel hierarchical framework for modeling clinical time series data of varied length and with irregularly sampled observations. MATERIALS AND METHODS Our hierarchical dynamical system framework for modeling clinical time series combines advantages of the two temporal modeling approaches: the linear dynamical system and the Gaussian process. We model the irregularly sampled clinical time series by using multiple Gaussian process sequences in the lower level of our hierarchical framework and capture the transitions between Gaussian processes by utilizing the linear dynamical system. The experiments are conducted on the complete blood count (CBC) panel data of 1000 post-surgical cardiac patients during their hospitalization. Our framework is evaluated and compared to multiple baseline approaches in terms of the mean absolute prediction error and the absolute percentage error. RESULTS We tested our framework by first learning the time series model from data for the patients in the training set, and then using it to predict future time series values for the patients in the test set. We show that our model outperforms multiple existing models in terms of its predictive accuracy. Our method achieved a 3.13% average prediction accuracy improvement on ten CBC lab time series when it was compared against the best performing baseline. A 5.25% average accuracy improvement was observed when only short-term predictions were considered. CONCLUSION A new hierarchical dynamical system framework that lets us model irregularly sampled time series data is a promising new direction for modeling clinical time series and for improving their predictive performance.
Collapse
Affiliation(s)
- Zitao Liu
- Computer Science Department, University of Pittsburgh, 210 South Bouquet Street, Pittsburgh, PA 15260, USA.
| | - Milos Hauskrecht
- Computer Science Department, University of Pittsburgh, 210 South Bouquet Street, Pittsburgh, PA 15260, USA
| |
Collapse
|
42
|
Visweswaran S, Ferreira A, Ribeiro GA, Oliveira AC, Cooper GF. Personalized Modeling for Prediction with Decision-Path Models. PLoS One 2015; 10:e0131022. [PMID: 26098570 PMCID: PMC4476684 DOI: 10.1371/journal.pone.0131022] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 05/26/2015] [Indexed: 11/25/2022] Open
Abstract
Deriving predictive models in medicine typically relies on a population approach where a single model is developed from a dataset of individuals. In this paper we describe and evaluate a personalized approach in which we construct a new type of decision tree model called decision-path model that takes advantage of the particular features of a given person of interest. We introduce three personalized methods that derive personalized decision-path models. We compared the performance of these methods to that of Classification And Regression Tree (CART) that is a population decision tree to predict seven different outcomes in five medical datasets. Two of the three personalized methods performed statistically significantly better on area under the ROC curve (AUC) and Brier skill score compared to CART. The personalized approach of learning decision path models is a new approach for predictive modeling that can perform better than a population approach.
Collapse
Affiliation(s)
- Shyam Visweswaran
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America; The Intelligent Systems Program, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Antonio Ferreira
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Guilherme A Ribeiro
- Department of Informatics, Federal University of Maranhão-UFMA, Sao Luis, MA, Brazil
| | - Alexandre C Oliveira
- Department of Informatics, Federal University of Maranhão-UFMA, Sao Luis, MA, Brazil
| | - Gregory F Cooper
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America; The Intelligent Systems Program, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
43
|
Haque SA, Rahman M, Aziz SM. Sensor anomaly detection in wireless sensor networks for healthcare. SENSORS 2015; 15:8764-86. [PMID: 25884786 PMCID: PMC4431209 DOI: 10.3390/s150408764] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Revised: 03/25/2015] [Accepted: 04/01/2015] [Indexed: 11/28/2022]
Abstract
Wireless Sensor Networks (WSN) are vulnerable to various sensor faults and faulty measurements. This vulnerability hinders efficient and timely response in various WSN applications, such as healthcare. For example, faulty measurements can create false alarms which may require unnecessary intervention from healthcare personnel. Therefore, an approach to differentiate between real medical conditions and false alarms will improve remote patient monitoring systems and quality of healthcare service afforded by WSN. In this paper, a novel approach is proposed to detect sensor anomaly by analyzing collected physiological data from medical sensors. The objective of this method is to effectively distinguish false alarms from true alarms. It predicts a sensor value from historic values and compares it with the actual sensed value for a particular instance. The difference is compared against a threshold value, which is dynamically adjusted, to ascertain whether the sensor value is anomalous. The proposed approach has been applied to real healthcare datasets and compared with existing approaches. Experimental results demonstrate the effectiveness of the proposed system, providing high Detection Rate (DR) and low False Positive Rate (FPR).
Collapse
Affiliation(s)
- Shah Ahsanul Haque
- School of Engineering, University of South Australia, Mawson Lakes, SA 5095, Australia.
| | - Mustafizur Rahman
- Department of Defence, Defence Science and Technology Organization, SA 5111, Australia.
| | - Syed Mahfuzul Aziz
- School of Engineering, University of South Australia, Mawson Lakes, SA 5095, Australia.
| |
Collapse
|
44
|
Liu Z, Hauskrecht M. A Regularized Linear Dynamical System Framework for Multivariate Time Series Analysis. PROCEEDINGS OF THE ... AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE. AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE 2015; 2015:1798-1804. [PMID: 25905027 PMCID: PMC4402162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Linear Dynamical System (LDS) is an elegant mathematical framework for modeling and learning Multivariate Time Series (MTS). However, in general, it is difficult to set the dimension of an LDS's hidden state space. A small number of hidden states may not be able to model the complexities of a MTS, while a large number of hidden states can lead to overfitting. In this paper, we study learning methods that impose various regularization penalties on the transition matrix of the LDS model and propose a regularized LDS learning framework (rLDS) which aims to (1) automatically shut down LDSs' spurious and unnecessary dimensions, and consequently, address the problem of choosing the optimal number of hidden states; (2) prevent the overfitting problem given a small amount of MTS data; and (3) support accurate MTS forecasting. To learn the regularized LDS from data we incorporate a second order cone program and a generalized gradient descent method into the Maximum a Posteriori framework and use Expectation Maximization to obtain a low-rank transition matrix of the LDS model. We propose two priors for modeling the matrix which lead to two instances of our rLDS. We show that our rLDS is able to recover well the intrinsic dimensionality of the time series dynamics and it improves the predictive performance when compared to baselines on both synthetic and real-world MTS datasets.
Collapse
Affiliation(s)
- Zitao Liu
- Computer Science Department, University of Pittsburgh, 210 South Bouquet St., Pittsburgh, PA, 15260 USA
| | - Milos Hauskrecht
- Computer Science Department, University of Pittsburgh, 210 South Bouquet St., Pittsburgh, PA, 15260 USA
| |
Collapse
|
45
|
Nguyen Q, Valizadegan H, Hauskrecht M. Learning classification models with soft-label information. J Am Med Inform Assoc 2014; 21:501-8. [PMID: 24259520 PMCID: PMC3994863 DOI: 10.1136/amiajnl-2013-001964] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Revised: 10/24/2013] [Accepted: 11/01/2013] [Indexed: 11/04/2022] Open
Abstract
OBJECTIVE Learning of classification models in medicine often relies on data labeled by a human expert. Since labeling of clinical data may be time-consuming, finding ways of alleviating the labeling costs is critical for our ability to automatically learn such models. In this paper we propose a new machine learning approach that is able to learn improved binary classification models more efficiently by refining the binary class information in the training phase with soft labels that reflect how strongly the human expert feels about the original class labels. MATERIALS AND METHODS Two types of methods that can learn improved binary classification models from soft labels are proposed. The first relies on probabilistic/numeric labels, the other on ordinal categorical labels. We study and demonstrate the benefits of these methods for learning an alerting model for heparin induced thrombocytopenia. The experiments are conducted on the data of 377 patient instances labeled by three different human experts. The methods are compared using the area under the receiver operating characteristic curve (AUC) score. RESULTS Our AUC results show that the new approach is capable of learning classification models more efficiently compared to traditional learning methods. The improvement in AUC is most remarkable when the number of examples we learn from is small. CONCLUSIONS A new classification learning framework that lets us learn from auxiliary soft-label information provided by a human expert is a promising new direction for learning classification models from expert labels, reducing the time and cost needed to label data.
Collapse
Affiliation(s)
- Quang Nguyen
- Computer Science Department, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | | | | |
Collapse
|
46
|
Hong C, Batal I, Hauskrecht M. A Mixtures-of-Trees Framework for Multi-Label Classification. PROCEEDINGS OF THE ... ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT. ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT 2014; 2014:211-220. [PMID: 25927011 PMCID: PMC4410801 DOI: 10.1145/2661829.2661989] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
We propose a new probabilistic approach for multi-label classification that aims to represent the class posterior distribution P(Y|X). Our approach uses a mixture of tree-structured Bayesian networks, which can leverage the computational advantages of conditional tree-structured models and the abilities of mixtures to compensate for tree-structured restrictions. We develop algorithms for learning the model from data and for performing multi-label predictions using the learned model. Experiments on multiple datasets demonstrate that our approach outperforms several state-of-the-art multi-label classification methods.
Collapse
Affiliation(s)
- Charmgil Hong
- Computer Science Dept., University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Milos Hauskrecht
- Computer Science Dept., University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
47
|
Valizadegan H, Nguyen Q, Hauskrecht M. Learning classification models from multiple experts. J Biomed Inform 2013; 46:1125-35. [PMID: 24035760 PMCID: PMC3922063 DOI: 10.1016/j.jbi.2013.08.007] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2013] [Revised: 07/15/2013] [Accepted: 08/17/2013] [Indexed: 10/26/2022]
Abstract
Building classification models from clinical data using machine learning methods often relies on labeling of patient examples by human experts. Standard machine learning framework assumes the labels are assigned by a homogeneous process. However, in reality the labels may come from multiple experts and it may be difficult to obtain a set of class labels everybody agrees on; it is not uncommon that different experts have different subjective opinions on how a specific patient example should be classified. In this work we propose and study a new multi-expert learning framework that assumes the class labels are provided by multiple experts and that these experts may differ in their class label assessments. The framework explicitly models different sources of disagreements and lets us naturally combine labels from different human experts to obtain: (1) a consensus classification model representing the model the group of experts converge to, as well as, and (2) individual expert models. We test the proposed framework by building a model for the problem of detection of the Heparin Induced Thrombocytopenia (HIT) where examples are labeled by three experts. We show that our framework is superior to multiple baselines (including standard machine learning framework in which expert differences are ignored) and that our framework leads to both improved consensus and individual expert models.
Collapse
Affiliation(s)
- Hamed Valizadegan
- Department of Computer Science, University of Pittsburgh, United States.
| | | | | |
Collapse
|
48
|
|