51
|
Kwak H, Chang J, Choe B, Park S, Jung K. Interpretable disease prediction using heterogeneous patient records with self-attentive fusion encoder. J Am Med Inform Assoc 2021; 28:2155-2164. [PMID: 34198329 PMCID: PMC8449612 DOI: 10.1093/jamia/ocab109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 04/09/2021] [Accepted: 06/17/2021] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE We propose an interpretable disease prediction model that efficiently fuses multiple types of patient records using a self-attentive fusion encoder. We assessed the model performance in predicting cardiovascular disease events, given the records of a general patient population. MATERIALS AND METHODS We extracted 798111 ses and 67 623 controls from the sample cohort database and nationwide healthcare claims data of South Korea. Among the information provided, our model used the sequential records of medical codes and patient characteristics, such as demographic profiles and the most recent health examination results. These two types of patient records were combined in our self-attentive fusion module, whereas previously dominant methods aggregated them using a simple concatenation. The prediction performance was compared to state-of-the-art recurrent neural network-based approaches and other widely used machine learning approaches. RESULTS Our model outperformed all the other compared methods in predicting cardiovascular disease events. It achieved an area under the curve of 0.839, while the other compared methods achieved between 0.74111 d 0.830. Moreover, our model consistently outperformed the other methods in a more challenging setting in which we tested the model's ability to draw an inference from more nonobvious, diverse factors. DISCUSSION We also interpreted the attention weights provided by our model as the relative importance of each time step in the sequence. We showed that our model reveals the informative parts of the patients' history by measuring the attention weights. CONCLUSION We suggest an interpretable disease prediction model that efficiently fuses heterogeneous patient records and demonstrates superior disease prediction performance.
Collapse
Affiliation(s)
- Heeyoung Kwak
- Department of Electrical Engineering, Seoul National University, Seoul, Republic of Korea
| | - Jooyoung Chang
- Department of Biomedical Sciences, Seoul National University, Seoul, Republic of Korea
| | - Byeongjin Choe
- Department of Electrical Engineering, Seoul National University , Seoul, Republic of Korea
| | - Sangmin Park
- Department of Biomedical Sciences, Seoul National University, Seoul, Republic of Korea
- Department of Family Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Kyomin Jung
- Department of Electrical Engineering , Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
52
|
Yuan M, Ren J. Numerical Feature Transformation-based Sequence Generation Model for Multi-disease Diagnosis. INT J PATTERN RECOGN 2021. [DOI: 10.1142/s0218001421590345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The goal of computer-aided diagnosis is to predict patient’s diseases based on patient’s clinical data. The development of deep learning technology provides new help for clinical diagnosis. In this paper, we propose a new sequence generation model for multi-disease diagnosis prediction based on numerical feature transformation. Our model simultaneously uses patient’s laboratory test results and clinical text as input to diagnose and predict the disease that the patient may have. According to medical knowledge, our model can transform numerical features into descriptive text features, thereby enriching the semantic information of clinical texts. Besides, our model uses attention-based sequence generation methods to achieve the diagnosis of multiple diseases and better utilizes the correlation information between multiple diseases. We evaluate our model’s performance on a dataset of respiratory diseases from the real world, and experimental results show that our model’s accuracy reaches 42.75%, and the [Formula: see text] score reaches 65.65%, which is better than many other methods. It is suitable for the accurate diagnosis of multiple diseases.
Collapse
Affiliation(s)
- Ming Yuan
- School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-sen University Guangzhou, Guangdong, P. R. China
| | - Jiangtao Ren
- School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-sen University Guangzhou, Guangdong, P. R. China
| |
Collapse
|
53
|
Huang Y, Wang N, Zhang Z, Liu H, Fei X, Wei L, Chen H. Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study. JMIR Med Inform 2021; 9:e19905. [PMID: 34297000 PMCID: PMC8367145 DOI: 10.2196/19905] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 12/18/2020] [Accepted: 06/05/2021] [Indexed: 01/22/2023] Open
Abstract
Background The secondary use of structured electronic medical record (sEMR) data has become a challenge due to the diversity, sparsity, and high dimensionality of the data representation. Constructing an effective representation for sEMR data is becoming more and more crucial for subsequent data applications. Objective We aimed to apply the embedding technique used in the natural language processing domain for the sEMR data representation and to explore the feasibility and superiority of the embedding-based feature and patient representations in clinical application. Methods The entire training corpus consisted of records of 104,752 hospitalized patients with 13,757 medical concepts of disease diagnoses, physical examinations and procedures, laboratory tests, medications, etc. Each medical concept was embedded into a 200-dimensional real number vector using the Skip-gram algorithm with some adaptive changes from shuffling the medical concepts in a record 20 times. The average of vectors for all medical concepts in a patient record represented the patient. For embedding-based feature representation evaluation, we used the cosine similarities among the medical concept vectors to capture the latent clinical associations among the medical concepts. We further conducted a clustering analysis on stroke patients to evaluate and compare the embedding-based patient representations. The Hopkins statistic, Silhouette index (SI), and Davies-Bouldin index were used for the unsupervised evaluation, and the precision, recall, and F1 score were used for the supervised evaluation. Results The dimension of patient representation was reduced from 13,757 to 200 using the embedding-based representation. The average cosine similarity of the selected disease (subarachnoid hemorrhage) and its 15 clinically relevant medical concepts was 0.973. Stroke patients were clustered into two clusters with the highest SI (0.852). Clustering analyses conducted on patients with the embedding representations showed higher applicability (Hopkins statistic 0.931), higher aggregation (SI 0.862), and lower dispersion (Davies-Bouldin index 0.551) than those conducted on patients with reference representation methods. The clustering solutions for patients with the embedding-based representation achieved the highest F1 scores of 0.944 and 0.717 for two clusters. Conclusions The feature-level embedding-based representations can reflect the potential clinical associations among medical concepts effectively. The patient-level embedding-based representation is easy to use as continuous input to standard machine learning algorithms and can bring performance improvements. It is expected that the embedding-based representation will be helpful in a wide range of secondary uses of sEMR data.
Collapse
Affiliation(s)
- Yanqun Huang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Ni Wang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Zhiqiang Zhang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Honglei Liu
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Xiaolu Fei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Lan Wei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Hui Chen
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| |
Collapse
|
54
|
A Deep Recurrent Neural Network-Based Explainable Prediction Model for Progression from Atrophic Gastritis to Gastric Cancer. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11136194] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Gastric cancer is the fifth most common cancer type worldwide and one of the most frequently diagnosed cancers in South Korea. In this study, we propose DeepPrevention, which comprises a prediction module to predict the possibility of progression from atrophic gastritis to gastric cancer and an explanation module to identify risk factors for progression from atrophic gastritis to gastric cancer, to identify patients with atrophic gastritis who are at high risk of gastric cancer. The data set used in this study was South Korea National Health Insurance Service (NHIS) medical checkup data for atrophic gastritis patients from 2002 to 2013. Our experimental results showed that the most influential predictors of gastric cancer development were sex, smoking duration, and current smoking status. In addition, we found that the average age of gastric cancer diagnosis in a group of high-risk patients was 57, and income, BMI, regular exercise, and the number of endoscopic screenings did not show any significant difference between groups. At the individual level, we identified that there were relatively strong associations between gastric cancer and smoking duration and smoking status.
Collapse
|
55
|
Chen Z, Chen M, Sun X, Guo X, Li Q, Huang Y, Zhang Y, Wu L, Liu Y, Xu J, Fang Y, Lin X. Analysis of the Impact of Medical Features and Risk Prediction of Acute Kidney Injury for Critical Patients Using Temporal Electronic Health Record Data With Attention-Based Neural Network. Front Med (Lausanne) 2021; 8:658665. [PMID: 34150797 PMCID: PMC8212017 DOI: 10.3389/fmed.2021.658665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 05/03/2021] [Indexed: 12/02/2022] Open
Abstract
Acute kidney injury (AKI) is one of the most severe consequences of kidney injury, and it will also cause or aggravate the complications by the fast decline of kidney excretory function. Accurate AKI prediction, including the AKI case, AKI stage, and AKI onset time interval, can provide adequate support for effective interventions. Besides, discovering how the medical features affect the AKI result may also provide supporting information for disease treatment. An attention-based temporal neural network approach was employed in this study for AKI prediction and for the analysis of the impact of medical features from temporal electronic health record (EHR) data of patients before AKI diagnosis. We used the publicly available dataset provided by the Medical Information Mart for Intensive Care (MIMIC) for model training, validation, and testing, and then the model was applied in clinical practice. The improvement of AKI case prediction is around 5% AUC (area under the receiver operating characteristic curve), and the AUC value of AKI stage prediction on AKI stage 3 is over 82%. We also analyzed the data by two steps: the associations between the medical features and the AKI case (positive or inverse) and the extent of the impact of medical features on AKI prediction result. It shows that features, such as lactate, glucose, creatinine, blood urea nitrogen (BUN), prothrombin time (PT), and partial thromboplastin time (PTT), are positively associated with the AKI case, while there are inverse associations between the AKI case and features such as platelet, hemoglobin, hematocrit, urine, and international normalized ratio (INR). The laboratory test features such as urine, glucose, creatinine, sodium, and blood urea nitrogen and the medication features such as nonsteroidal anti-inflammatory drugs, agents acting on the renin–angiotensin system, and lipid-lowering medication were detected to have higher weights than other features in the proposed model, which may imply that these features have a great impact on the AKI case.
Collapse
Affiliation(s)
- Zhimeng Chen
- Information Department, Fujian Province Lianpu Network Technology Co., Ltd., Quanzhou, China
| | - Ming Chen
- Department of Nephrology, Jinjiang Municipal Hospital, Quanzhou, China
| | - Xuri Sun
- Department of Endocrinology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Xieli Guo
- Department of Nephrology, Jinjiang Municipal Hospital, Quanzhou, China
| | - Qiuna Li
- Department of Nephrology, Jinjiang Municipal Hospital, Quanzhou, China
| | - Yinqiong Huang
- Department of Endocrinology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Yuren Zhang
- Department of Nephrology, Jinjiang Municipal Hospital, Quanzhou, China
| | - Lianwei Wu
- School of Informatics, Xiamen University, Xiamen, China
| | - Yu Liu
- Network and Information Technology Center, Sun Yat-sen University, Guangzhou, China
| | - Jinting Xu
- Department of Nephrology, Jinjiang Municipal Hospital, Quanzhou, China
| | - Yuming Fang
- Department of Endocrinology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Xiahong Lin
- The Department of Endocrinology of the Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen, China
| |
Collapse
|
56
|
Shobana G, Shankar S. A Convolutional Neural Network-Based Triplet-Mining Metric Learning Schema for Measuring Patient Similarity Addressing Personalized Healthcare. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS 2021. [DOI: 10.1166/jmihi.2021.3471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Prediction of the development risk of some diseases is an important area of Health Care Research. When exploring the personalized care of the patients, precise identification and classification of similarity in patients from their past report is an important process. Electronically
stored health information EHRs that has been sampled unevenly as well as which has variable appointment durations, is considered to be unsuitable for measuring the similarity among patients directly, as there is no proper representation that are fitting. In addition, a technique is required
that is efficient to evaluate similarities in patient. We propose two new similarities learning environments using deep learning that learn simultaneously the representations of the patients as well as measurement of similarity in pairs. A Convolutional Neural Network (CNN) is used to understand
EHRs that contains crucial information which are local thereby providing scholastic illumination in the triplet loss otherwise entropy loss. When the training is completed, distances are calculated as well as similarities scores. Using this similarity information, disease predictions along
with patient grouping is performed. Experimentally the results gives an idea that CNN can represent the EHR sequences in a better way and the schema offered are more efficient than the modern metric distance learning.
Collapse
Affiliation(s)
- G. Shobana
- Department of Information Technology, Sri Krishna College of Engineering and Technology, Coimbatore 641008, Tamil Nadu, India
| | - S. Shankar
- Professor and Head, Department of Computer Science and Engineering, Hindusthan College of Engineering and Technology, Coimbatore 641050, Tamil Nadu, India
| |
Collapse
|
57
|
Lee JM, Hauskrecht M. Neural Clinical Event Sequence Prediction through Personalized Online Adaptive Learning. ARTIFICIAL INTELLIGENCE IN MEDICINE. CONFERENCE ON ARTIFICIAL INTELLIGENCE IN MEDICINE (2005- ) 2021; 12721:175-186. [PMID: 34179895 PMCID: PMC8232901 DOI: 10.1007/978-3-030-77211-6_20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Clinical event sequences consist of thousands of clinical events that represent records of patient care in time. Developing accurate prediction models for such sequences is of a great importance for defining representations of a patient state and for improving patient care. One important challenge of learning a good predictive model of clinical sequences is patient-specific variability. Based on underlying clinical complications, each patient's sequence may consist of different sets of clinical events. However, population-based models learned from such sequences may not accurately predict patient-specific dynamics of event sequences. To address the problem, we develop a new adaptive event sequence prediction framework that learns to adjust its prediction for individual patients through an online model update.
Collapse
Affiliation(s)
- Jeong Min Lee
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
58
|
Yang Y, Huo H, Jiang J, Sun X, Guan Y, Guo X, Wan X, Liu S. Clinical decision-making framework against over-testing based on modeling implicit evaluation criteria. J Biomed Inform 2021; 119:103823. [PMID: 34044155 DOI: 10.1016/j.jbi.2021.103823] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 05/20/2021] [Accepted: 05/21/2021] [Indexed: 12/25/2022]
Abstract
Different statistical methods include various subjective criteria that can prevent over-testing. However, no unified framework that defines generalized objective criteria for various diseases is available to determine the appropriateness of diagnostic tests recommended by doctors. We present the clinical decision-making framework against over-testing based on modeling the implicit evaluation criteria (CDFO-MIEC). The CDFO-MIEC quantifies the subjective evaluation process using statistics-based methods to identify over-testing. Furthermore, it determines the test's appropriateness with extracted entities obtained via named entity recognition and entity alignment. More specifically, implicit evaluation criteria are defined-namely, the correlation among the diagnostic tests, symptoms, and diseases, confirmation function, and exclusion function. Additionally, four evaluation strategies are implemented by applying statistical methods, including the multi-label k-nearest neighbor and the conditional probability algorithms, to model the implicit evaluation criteria. Finally, they are combined into a classification and regression tree to make the final decision. The CDFO-MIEC also provides interpretability by decision conditions for supporting each clinical decision of over-testing. We tested the CDFO-MIEC on 2,860 clinical texts obtained from a single respiratory medicine department in China with the appropriate confirmation by physicians. The dataset was supplemented with random inappropriate tests. The proposed framework excelled against the best competing text classification methods with a Mean_F1 of 0.9167. This determined whether the appropriate and inappropriate tests were properly classified. The four evaluation strategies captured the features effectively, and they were imperative. Therefore, the proposed CDFO-MIEC is feasible because it exhibits high performance and can prevent over-testing.
Collapse
Affiliation(s)
- Yang Yang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Hongxing Huo
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Jingchi Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Xuemei Sun
- Hospital of Harbin Institute of Technology, Harbin 150003, China
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
| | - Xitong Guo
- School of Management, Harbin Institute of Technology, Harbin 150001, China
| | - Xiang Wan
- Shenzhen Research Institute of Big Data, Shenzhen 518000, China
| | - Shengping Liu
- Unisound AI Technology Co., Ltd, Beijing 100083, China
| |
Collapse
|
59
|
Early Prediction of Organ Failures in Patients with Acute Pancreatitis Using Text Mining. SCIENTIFIC PROGRAMMING 2021. [DOI: 10.1155/2021/6683942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
It is of great significance to establish an assessment model for organ failures in the early stage of admission in acute pancreatitis (AP). And the clinical notes are underutilized. To predict organ failures for AP patients using early clinical notes in hospital, early text features obtained from the pretrained Chinese Bidirectional Encoder Representations from Transformers model and attention-based LSTM were combined with early structured features (laboratory tests, vital signs, and demographic characteristics) to predict organ failures (respiratory, cardiovascular, and renal) in 12,748 AP inpatients in West China Hospital, Sichuan University, from 2008 to 2018. The text plus structured features fusion model was used to predict organ failures, compared to the baseline model with only structured features. The performance of the model with text features added is superior to the model that only includes structured features.
Collapse
|
60
|
An Y, Huang N, Chen X, Wu F, Wang J. High-Risk Prediction of Cardiovascular Diseases via Attention-Based Deep Neural Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1093-1105. [PMID: 31425047 DOI: 10.1109/tcbb.2019.2935059] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
High-risk prediction of cardiovascular disease is of great significance and impendency in medical fields with the increasing phenomenon of sub-health these years. Most existing pathological methods for the prognosis prediction are either costly or prone to misjudgement. Therefore, plenty of automated models based on machine learning have been proposed to predict the onset of cardiovascular disease with the premorbid information of patients extracted from their historical Electronic Health Records (EHRs). However, it is a tough job to select proper features from longitudinal and heterogeneous EHRs, and also a great challenge to obtain accurate and robust representations for patients. In this paper, we propose an entirely end-to-end model called DeepRisk based on attention mechanism and deep neural networks, which can not only learn high-quality features automatically from EHRs, but also efficiently integrate heterogeneous and time-ordered medical data, and finally predict patients' risk of cardiovascular diseases. Experiments are carried out on a real medical dataset and results show that DeepRisk can significantly improve the high-risk prediction accuracy for cardiovascular disease compared with state-of-the-art approaches.
Collapse
|
61
|
Yang S, Zhu F, Ling X, Liu Q, Zhao P. Intelligent Health Care: Applications of Deep Learning in Computational Medicine. Front Genet 2021; 12:607471. [PMID: 33912213 PMCID: PMC8075004 DOI: 10.3389/fgene.2021.607471] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 03/05/2021] [Indexed: 12/24/2022] Open
Abstract
With the progress of medical technology, biomedical field ushered in the era of big data, based on which and driven by artificial intelligence technology, computational medicine has emerged. People need to extract the effective information contained in these big biomedical data to promote the development of precision medicine. Traditionally, the machine learning methods are used to dig out biomedical data to find the features from data, which generally rely on feature engineering and domain knowledge of experts, requiring tremendous time and human resources. Different from traditional approaches, deep learning, as a cutting-edge machine learning branch, can automatically learn complex and robust feature from raw data without the need for feature engineering. The applications of deep learning in medical image, electronic health record, genomics, and drug development are studied, where the suggestion is that deep learning has obvious advantage in making full use of biomedical data and improving medical health level. Deep learning plays an increasingly important role in the field of medical health and has a broad prospect of application. However, the problems and challenges of deep learning in computational medical health still exist, including insufficient data, interpretability, data privacy, and heterogeneity. Analysis and discussion on these problems provide a reference to improve the application of deep learning in medical health.
Collapse
Affiliation(s)
- Sijie Yang
- School of Computer Science and Technology, Soochow University, Suzhou, China
| | - Fei Zhu
- School of Computer Science and Technology, Soochow University, Suzhou, China
| | - Xinghong Ling
- School of Computer Science and Technology, Soochow University, Suzhou, China
- WenZheng College of Soochow University, Suzhou, China
| | - Quan Liu
- School of Computer Science and Technology, Soochow University, Suzhou, China
| | - Peiyao Zhao
- School of Computer Science and Technology, Soochow University, Suzhou, China
| |
Collapse
|
62
|
Dai D, Tang J, Yu Z, Wong HS, You J, Cao W, Hu Y, Chen CLP. An Inception Convolutional Autoencoder Model for Chinese Healthcare Question Clustering. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:2019-2031. [PMID: 31180903 DOI: 10.1109/tcyb.2019.2916580] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Healthcare question answering (HQA) system plays a vital role in encouraging patients to inquire for professional consultation. However, there are some challenging factors in learning and representing the question corpus of HQA datasets, such as high dimensionality, sparseness, noise, nonprofessional expression, etc. To address these issues, we propose an inception convolutional autoencoder model for Chinese healthcare question clustering (ICAHC). First, we select a set of kernels with different sizes using convolutional autoencoder networks to explore both the diversity and quality in the clustering ensemble. Thus, these kernels encourage to capture diverse representations. Second, we design four ensemble operators to merge representations based on whether they are independent, and input them into the encoder using different skip connections. Third, it maps features from the encoder into a lower-dimensional space, followed by clustering. We conduct comparative experiments against other clustering algorithms on a Chinese healthcare dataset. Experimental results show the effectiveness of ICAHC in discovering better clustering solutions. The results can be used in the prediction of patients' conditions and the development of an automatic HQA system.
Collapse
|
63
|
Teng X, Pei S, Lin YR. StoCast: Stochastic Disease Forecasting With Progression Uncertainty. IEEE J Biomed Health Inform 2021; 25:850-861. [PMID: 32750951 DOI: 10.1109/jbhi.2020.3006719] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Forecasting patients' disease progressions with rich longitudinal clinical data has drawn much attention in recent years due to its impactful application in healthcare and the medical field. Researchers have tackled this problem by leveraging traditional machine learning, statistical techniques and deep learning based models. However, existing methods suffer from either deterministic internal structures or over-simplified stochastic components, failing to deal with complex uncertain scenarios such as progression uncertainty (i.e., multiple possible trajectories) and data uncertainty (i.e., imprecise observations and misdiagnosis). To overcome these major uncertainty issues, we propose a novel deep generative model, Stochastic Disease Forecasting Model (StoCast), along with an associated neural network architecture StoCastNet, that can be trained efficiently via stochastic optimization techniques. Our StoCast model uses internal stochastic components to deal with departures of observed data from patients' true health states, and more importantly, is able to produce a comprehensive estimate of future disease progression trajectories. Based on two public datasets related to Alzheimer's disease and Parkinson's disease, we demonstrate how our StoCast model achieves robust and superior performance than deterministic baseline approaches, and conveys richer information that can potentially assist doctors to make decisions with greater confidence in a complex uncertain scenario.
Collapse
|
64
|
Yang S, Zheng X, Ji C, Chen X. Multi-layer Representation Learning and Its Application to Electronic Health Records. Neural Process Lett 2021; 53:1417-1433. [PMID: 33623481 PMCID: PMC7891814 DOI: 10.1007/s11063-021-10449-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/02/2021] [Indexed: 12/04/2022]
Abstract
Electronic Health Records (EHRs) are digital records associated with hospitalization, diagnosis, medications and so on. Secondary use of EHRs can promote the clinical informatics applications and the development of healthcare undertaking. EHRs have the unique characteristic where the patient visits are temporally ordered but the diagnosis codes within a visit are randomly ordered. The hierarchical structure requires a multi-layer network to explore the different relational information of EHRs. In this paper, we propose a Multi-Layer Representation Learning method (MLRL), which is capable of learning effective patient representation by hierarchically exploring the valuable information in both diagnosis codes and patient visits. Firstly, MLRL utilizes the multi-head attention mechanism to explore the potential connections in diagnosis codes, and a linear transformation is implemented to further map the code vectors to non-negative real-valued representations. The initial visit vectors are then obtained by summarizing all the code representations. Secondly, the proposed method combines Bidirectional Long Short-Term Memory with self-attention mechanism to learn the weighted visit vectors which are aggregated to form the patient representation. Finally, to evaluate the performance of MLRL, we apply it to patient's mortality prediction on real EHRs and the experimental results demonstrate that MLRL has a significant improvement in prediction performance. MLRL achieves around 0.915 in Area Under Curve which is superior to the results obtained by baseline methods. Furthermore, compared with raw data and other data representations, the learned representation with MLRL shows its outstanding results and availability on multiple different classifiers.
Collapse
Affiliation(s)
- Shan Yang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Xiangwei Zheng
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Cun Ji
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Xuanchi Chen
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| |
Collapse
|
65
|
Majnarić LT, Babič F, O’Sullivan S, Holzinger A. AI and Big Data in Healthcare: Towards a More Comprehensive Research Framework for Multimorbidity. J Clin Med 2021; 10:jcm10040766. [PMID: 33672914 PMCID: PMC7918668 DOI: 10.3390/jcm10040766] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 02/02/2021] [Accepted: 02/11/2021] [Indexed: 12/11/2022] Open
Abstract
Multimorbidity refers to the coexistence of two or more chronic diseases in one person. Therefore, patients with multimorbidity have multiple and special care needs. However, in practice it is difficult to meet these needs because the organizational processes of current healthcare systems tend to be tailored to a single disease. To improve clinical decision making and patient care in multimorbidity, a radical change in the problem-solving approach to medical research and treatment is needed. In addition to the traditional reductionist approach, we propose interactive research supported by artificial intelligence (AI) and advanced big data analytics. Such research approach, when applied to data routinely collected in healthcare settings, provides an integrated platform for research tasks related to multimorbidity. This may include, for example, prediction, correlation, and classification problems based on multiple interaction factors. However, to realize the idea of this paradigm shift in multimorbidity research, the optimization, standardization, and most importantly, the integration of electronic health data into a common national and international research infrastructure is needed. Ultimately, there is a need for the integration and implementation of efficient AI approaches, particularly deep learning, into clinical routine directly within the workflows of the medical professionals.
Collapse
Affiliation(s)
- Ljiljana Trtica Majnarić
- Department of Internal Medicine, Family Medicine and the History of Medicine, Faculty of Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia;
- Department of Public Health, Faculty of Dental Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia
| | - František Babič
- Department of Cybernetics and Artificial Intelligence, Faculty of Electrical Engineering and Informatics, Technical University of Košice, 066 01 Košice, Slovakia
- Correspondence: ; Tel.: +421-55-602-4220
| | - Shane O’Sullivan
- Department of Pathology, Faculdade de Medicina, Universidade de São Paulo, 05508-220 São Paulo, Brazil;
| | - Andreas Holzinger
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria;
| |
Collapse
|
66
|
Predictive Maintenance in Building Facilities: A Machine Learning-Based Approach. SENSORS 2021; 21:s21041044. [PMID: 33546418 PMCID: PMC7913483 DOI: 10.3390/s21041044] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 01/26/2021] [Accepted: 01/26/2021] [Indexed: 11/17/2022]
Abstract
The operation and maintenance of buildings has seen several advances in recent years. Multiple information and communication technology (ICT) solutions have been introduced to better manage building maintenance. However, maintenance practices in buildings remain less efficient and lead to significant energy waste. In this paper, a predictive maintenance framework based on machine learning techniques is proposed. This framework aims to provide guidelines to implement predictive maintenance for building installations. The framework is organised into five steps: data collection, data processing, model development, fault notification and model improvement. A sport facility was selected as a case study in this work to demonstrate the framework. Data were collected from different heating ventilation and air conditioning (HVAC) installations using Internet of Things (IoT) devices and a building automation system (BAS). Then, a deep learning model was used to predict failures. The case study showed the potential of this framework to predict failures. However, multiple obstacles and barriers were observed related to data availability and feedback collection. The overall results of this paper can help to provide guidelines for scientists and practitioners to implement predictive maintenance approaches in buildings.
Collapse
|
67
|
Lee JM, Hauskrecht M. Modeling multivariate clinical event time-series with recurrent temporal mechanisms. Artif Intell Med 2021; 112:102021. [PMID: 33581828 PMCID: PMC7943294 DOI: 10.1016/j.artmed.2021.102021] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 12/26/2020] [Accepted: 01/10/2021] [Indexed: 12/18/2022]
Abstract
In this work, we propose a novel autoregressive event time-series model that can predict future occurrences of multivariate clinical events. Our model represents multivariate event time-series using different temporal mechanisms aimed to fit different temporal characteristics of the time-series. In particular, information about distant past is modeled through the hidden state space defined by an LSTM-based model, information on recently observed clinical events is modeled through discriminative projections, and information about periodic (repeated) events is modeled using a special recurrent mechanism based on probability distributions of inter-event gaps compiled from past data. We evaluate our proposed model on electronic health record (EHRs) data derived from MIMIC-III dataset. We show that our new model equipped with the above temporal mechanisms leads to improved prediction performance compared to multiple baselines.
Collapse
Affiliation(s)
- Jeong Min Lee
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| | - Milos Hauskrecht
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| |
Collapse
|
68
|
Abstract
Clinical decision-making in healthcare is already being influenced by predictions or recommendations made by data-driven machines. Numerous machine learning applications have appeared in the latest clinical literature, especially for outcome prediction models, with outcomes ranging from mortality and cardiac arrest to acute kidney injury and arrhythmia. In this review article, we summarize the state-of-the-art in related works covering data processing, inference, and model evaluation, in the context of outcome prediction models developed using data extracted from electronic health records. We also discuss limitations of prominent modeling assumptions and highlight opportunities for future research.
Collapse
|
69
|
Steinberg E, Jung K, Fries JA, Corbin CK, Pfohl SR, Shah NH. Language models are an effective representation learning technique for electronic health record data. J Biomed Inform 2021; 113:103637. [PMID: 33290879 PMCID: PMC7863633 DOI: 10.1016/j.jbi.2020.103637] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 10/10/2020] [Accepted: 11/26/2020] [Indexed: 11/17/2022]
Abstract
Widespread adoption of electronic health records (EHRs) has fueled the development of using machine learning to build prediction models for various clinical outcomes. However, this process is often constrained by having a relatively small number of patient records for training the model. We demonstrate that using patient representation schemes inspired from techniques in natural language processing can increase the accuracy of clinical prediction models by transferring information learned from the entire patient population to the task of training a specific model, where only a subset of the population is relevant. Such patient representation schemes enable a 3.5% mean improvement in AUROC on five prediction tasks compared to standard baselines, with the average improvement rising to 19% when only a small number of patient records are available for training the clinical prediction model.
Collapse
Affiliation(s)
- Ethan Steinberg
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA.
| | - Ken Jung
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA
| | - Jason A Fries
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA
| | - Conor K Corbin
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA
| | - Stephen R Pfohl
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA
| | - Nigam H Shah
- Stanford University, 450 Serra Mall, Stanford, CA 94305, USA
| |
Collapse
|
70
|
Mahajan A, Mahajan SM. Deep Learning Methods and Their Application to Nursing Workflows: Technology and Perspectives. Comput Inform Nurs 2021; 39:1-6. [PMID: 33417313 DOI: 10.1097/cin.0000000000000702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Amey Mahajan
- Author Affiliations: C2OPS Inc, Cupertino, CA (Mr Mahajan); and VA Palo Alto Health Care System, Palo Alto, CA (Dr Mahajan)
| | | |
Collapse
|
71
|
Rashidian S, Abell-Hart K, Hajagos J, Moffitt R, Lingam V, Garcia V, Tsai CW, Wang F, Dong X, Sun S, Deng J, Gupta R, Miller J, Saltz J, Saltz M. Detecting Miscoded Diabetes Diagnosis Codes in Electronic Health Records for Quality Improvement: Temporal Deep Learning Approach. JMIR Med Inform 2020; 8:e22649. [PMID: 33331828 PMCID: PMC7775195 DOI: 10.2196/22649] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Revised: 09/24/2020] [Accepted: 09/27/2020] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Diabetes affects more than 30 million patients across the United States. With such a large disease burden, even a small error in classification can be significant. Currently billing codes, assigned at the time of a medical encounter, are the "gold standard" reflecting the actual diseases present in an individual, and thus in aggregate reflect disease prevalence in the population. These codes are generated by highly trained coders and by health care providers but are not always accurate. OBJECTIVE This work provides a scalable deep learning methodology to more accurately classify individuals with diabetes across multiple health care systems. METHODS We leveraged a long short-term memory-dense neural network (LSTM-DNN) model to identify patients with or without diabetes using data from 5 acute care facilities with 187,187 patients and 275,407 encounters, incorporating data elements including laboratory test results, diagnostic/procedure codes, medications, demographic data, and admission information. Furthermore, a blinded physician panel reviewed discordant cases, providing an estimate of the total impact on the population. RESULTS When predicting the documented diagnosis of diabetes, our model achieved an 84% F1 score, 96% area under the curve-receiver operating characteristic curve, and 91% average precision on a heterogeneous data set from 5 distinct health facilities. However, in 81% of cases where the model disagreed with the documented phenotype, a blinded physician panel agreed with the model. Taken together, this suggests that 4.3% of our studied population have either missing or improper diabetes diagnosis. CONCLUSIONS This study demonstrates that deep learning methods can improve clinical phenotyping even when patient data are noisy, sparse, and heterogeneous.
Collapse
Affiliation(s)
- Sina Rashidian
- Department of Computer Science, Stony Brook University, Stony Brook, NY, United States
| | - Kayley Abell-Hart
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Janos Hajagos
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Richard Moffitt
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Veena Lingam
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Victor Garcia
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Chao-Wei Tsai
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Fusheng Wang
- Department of Computer Science, Stony Brook University, Stony Brook, NY, United States
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Xinyu Dong
- Department of Computer Science, Stony Brook University, Stony Brook, NY, United States
| | - Siao Sun
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States
| | - Jianyuan Deng
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Rajarsi Gupta
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Joshua Miller
- Department of Medicine, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Joel Saltz
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| | - Mary Saltz
- Department of Biomedical Informatics, Renaissance School of Medicine at Stony Brook, Stony Brook, NY, United States
| |
Collapse
|
72
|
Shamshirband S, Fathi M, Dehzangi A, Chronopoulos AT, Alinejad-Rokny H. A review on deep learning approaches in healthcare systems: Taxonomies, challenges, and open issues. J Biomed Inform 2020; 113:103627. [PMID: 33259944 DOI: 10.1016/j.jbi.2020.103627] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2020] [Revised: 08/29/2020] [Accepted: 11/14/2020] [Indexed: 12/11/2022]
Abstract
In the last few years, the application of Machine Learning approaches like Deep Neural Network (DNN) models have become more attractive in the healthcare system given the rising complexity of the healthcare data. Machine Learning (ML) algorithms provide efficient and effective data analysis models to uncover hidden patterns and other meaningful information from the considerable amount of health data that conventional analytics are not able to discover in a reasonable time. In particular, Deep Learning (DL) techniques have been shown as promising methods in pattern recognition in the healthcare systems. Motivated by this consideration, the contribution of this paper is to investigate the deep learning approaches applied to healthcare systems by reviewing the cutting-edge network architectures, applications, and industrial trends. The goal is first to provide extensive insight into the application of deep learning models in healthcare solutions to bridge deep learning techniques and human healthcare interpretability. And then, to present the existing open challenges and future directions.
Collapse
Affiliation(s)
- Shahab Shamshirband
- Department of Computer Science, Norwegian University of Science and Technology, Trondheim, Norway; Future Technology Research Center, College of Future, National Yunlin University of Science and Technology, 123 University Road, Section 3, Douliou, Yunlin 64002, Taiwan, ROC.
| | - Mahdis Fathi
- Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran
| | - Abdollah Dehzangi
- Department of Computer Science, Rutgers University, Camden, NJ 08102, USA; Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA
| | - Anthony Theodore Chronopoulos
- Department of Computer Science, University of Texas at San Antonio, San Antonio, TX 78249, USA; (Visiting Faculty) Department of Computer Science, University of Patras, 26500 Rio, Greece
| | - Hamid Alinejad-Rokny
- Systems Biology and Health Data Analytics Lab, The Graduate School of Biomedical Engineering, UNSW Sydney, 2052 Sydney, Australia; School of Computer Science and Engineering, The University of New South Wales (UNSW Sydney), 2052 Sydney, Australia; Health Data Analytics Program Leader, AI-enabled Processes (AIP) Research Centre, Macquarie University, Sydney 2109, Australia
| |
Collapse
|
73
|
Darabi S, Kachuee M, Fazeli S, Sarrafzadeh M. TAPER: Time-Aware Patient EHR Representation. IEEE J Biomed Health Inform 2020; 24:3268-3275. [PMID: 32287023 DOI: 10.1109/jbhi.2020.2984931] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Effective representation learning of electronic health records is a challenging task and is becoming more important as the availability of such data is becoming pervasive. The data contained in these records are irregular and contain multiple modalities such as notes, and medical codes. They are preempted by medical conditions the patient may have, and are typically recorded by medical staff. Accompanying codes are notes containing valuable information about patients beyond the structured information contained in electronic health records. We use transformer networks and the recently proposed BERT language model to embed these data streams into a unified vector representation. The presented approach effectively encodes a patient's visit data into a single a distributed representation, which can be used for downstream tasks. Our model demonstrates superior performance and generalization on mortality, readmission and length of stay tasks using the publicly available MIMIC-III ICU dataset.
Collapse
|
74
|
Chen Z, Lai C, Ren J. Hospital readmission prediction based on long-term and short-term information fusion. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
75
|
Hassaine A, Canoy D, Solares JRA, Zhu Y, Rao S, Li Y, Zottoli M, Rahimi K, Salimi-Khorshidi G. Learning multimorbidity patterns from electronic health records using Non-negative Matrix Factorisation. J Biomed Inform 2020; 112:103606. [PMID: 33127447 DOI: 10.1016/j.jbi.2020.103606] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 08/01/2020] [Accepted: 10/16/2020] [Indexed: 11/29/2022]
Abstract
Multimorbidity, or the presence of several medical conditions in the same individual, has been increasing in the population - both in absolute and relative terms. Nevertheless, multimorbidity remains poorly understood, and the evidence from existing research to describe its burden, determinants and consequences has been limited. Previous studies attempting to understand multimorbidity patterns are often cross-sectional and do not explicitly account for multimorbidity patterns' evolution over time; some of them are based on small datasets and/or use arbitrary and narrow age ranges; and those that employed advanced models, usually lack appropriate benchmarking and validations. In this study, we (1) introduce a novel approach for using Non-negative Matrix Factorisation (NMF) for temporal phenotyping (i.e., simultaneously mining disease clusters and their trajectories); (2) provide quantitative metrics for the evaluation of these clusters and trajectories; and (3) demonstrate how the temporal characteristics of the disease clusters that result from our model can help mine multimorbidity networks and generate new hypotheses for the emergence of various multimorbidity patterns over time. We trained and evaluated our models on one of the world's largest electronic health records (EHR) datasets, containing more than 7 million patients, from which over 2 million where relevant to, and hence included in this study.
Collapse
Affiliation(s)
- Abdelaali Hassaine
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom
| | - Dexter Canoy
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom
| | - Jose Roberto Ayala Solares
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom
| | - Yajie Zhu
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom
| | - Shishir Rao
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom
| | - Yikuan Li
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom
| | - Mariagrazia Zottoli
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom
| | - Kazem Rahimi
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, United Kingdom; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom.
| | | |
Collapse
|
76
|
Abstract
In recent years, facility management (FM) has adopted many computer technology solutions for building maintenance, such as building information modelling (BIM) and computerized maintenance management systems (CMMS). However, maintenance requests management in buildings remains a manual and a time-consuming process that depends on human management. In this paper, a machine-learning algorithm based on natural language processing (NLP) is proposed to classify maintenance requests. This algorithm aims to assist the FM teams in managing day-to-day maintenance activities. A healthcare facility is addressed as a case study in this work. Ten-year maintenance records from the facility contributed to the design and development of the algorithm. Multiple NLP methods were used in this study, and the results reveal that the NLP model can classify work requests with an average accuracy of 78%. Furthermore, NLP methods have proven to be effective for managing unstructured text data.
Collapse
|
77
|
Rebane J, Samsten I, Papapetrou P. Exploiting complex medical data with interpretable deep learning for adverse drug event prediction. Artif Intell Med 2020; 109:101942. [PMID: 34756221 DOI: 10.1016/j.artmed.2020.101942] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 06/18/2020] [Accepted: 08/10/2020] [Indexed: 11/29/2022]
Abstract
A variety of deep learning architectures have been developed for the goal of predictive modelling and knowledge extraction from medical records. Several models have placed strong emphasis on temporal attention mechanisms and decay factors as a means to include highly temporally relevant information regarding the recency of medical event occurrence while facilitating medical code-level interpretability. In this study we utilise such models with a large Electronic Patient Record (EPR) data set consisting of diagnoses, medication, and clinical text data for the purpose of adverse drug event (ADE) prediction. The first contribution of this work is an empirical evaluation of two state-of-the-art medical-code based models in terms of objective performance metrics for ADE prediction on diagnosis and medication data. Secondly, as an extension of previous work, we augment an interpretable deep learning architecture to permit numerical risk and clinical text features and demonstrate how this approach yields improved predictive performance compared to the other baselines. Finally, we assess the importance of attention mechanisms in regards to their usefulness for medical code-level and text-level interpretability, which may facilitate novel insights pertaining to the nature of ADE occurrence within the health care domain.
Collapse
Affiliation(s)
- Jonathan Rebane
- Department of Computer and Systems Sciences, Stockholm University, Stockholm, Sweden.
| | - Isak Samsten
- Department of Computer and Systems Sciences, Stockholm University, Stockholm, Sweden
| | - Panagiotis Papapetrou
- Department of Computer and Systems Sciences, Stockholm University, Stockholm, Sweden
| |
Collapse
|
78
|
Kumaravel SK, Subramani RK, Jayaraj Sivakumar TK, Madurai Elavarasan R, Manavalanagar Vetrichelvan A, Annam A, Subramaniam U. Investigation on the impacts of COVID-19 quarantine on society and environment: Preventive measures and supportive technologies. 3 Biotech 2020; 10:393. [PMID: 32821645 PMCID: PMC7429420 DOI: 10.1007/s13205-020-02382-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 08/05/2020] [Indexed: 12/14/2022] Open
Abstract
The present outbreak of the novel coronavirus SARS-CoV-2, epicentered in China in December 2019, has spread to many other countries. The entire humanity has a vital responsibility to tackle this pandemic and the technologies are being helpful to them to a greater extent. The purpose of the work is to precisely bring scientific and general awareness to the people all around the world who are currently fighting the war against COVID-19. It's visible that the number of people infected is increasing day by day and the medical community is tirelessly working to maintain the situation under control. Other than the negative effects caused by COVID-19, it is also equally important for the public to understand some of the positive impacts it has directly or indirectly given to society. This work emphasizes the various impacts that are created on society as well as the environment. As a special additive, some important key areas are highlighted namely, how the modernized technologies are aiding the people during the period of social distancing. Some effective technological implications carried out by both information technology and educational institutions are highlighted. There are also several steps taken by the state government and central government in each country in adopting the complete lockdown rule. These steps are taken primarily to prevent the people from COVID-19 impact. Moreover, the teachings we need to learn from the quarantine situation created to prevent further spread of this global pandemic is discussed in brief and the importance of carrying them to the future. Finally, the paper also elucidates the general preventive measures that have to be taken to prevent this deadly coronavirus, and the role of technology in this pandemic situation has also been discussed.
Collapse
Affiliation(s)
- Santhosh Kumar Kumaravel
- Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Chennai, Tamil Nadu 602117 India
| | - Ranjith Kumar Subramani
- Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Chennai, Tamil Nadu 602117 India
| | - Tharun Kumar Jayaraj Sivakumar
- Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Chennai, Tamil Nadu 602117 India
| | - Rajvikram Madurai Elavarasan
- Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Chennai, Tamil Nadu 602117 India
- Electrical and Automotive Parts Manufacturing Unit, AA Industries, Chennai, Tamil Nadu 600123 India
| | | | - Annapurna Annam
- Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Chennai, Tamil Nadu 602117 India
| | - Umashankar Subramaniam
- Renewable Energy Laboratory, Faculty of Engineering, Prince Sultan University, Riyadh, 11586 Saudi Arabia
| |
Collapse
|
79
|
Kabeshova A, Yu Y, Lukacs B, Bacry E, Gaïffas S. ZiMM: A deep learning model for long term and blurry relapses with non-clinical claims data. J Biomed Inform 2020; 110:103531. [PMID: 32818667 DOI: 10.1016/j.jbi.2020.103531] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Revised: 07/25/2020] [Accepted: 08/09/2020] [Indexed: 11/28/2022]
Abstract
This paper considers the problems of modeling and predicting a long-term and "blurry" relapse that occurs after a medical act, such as a surgery. We do not consider a short-term complication related to the act itself, but a long-term relapse that clinicians cannot explain easily, since it depends on unknown sets or sequences of past events that occurred before the act. The relapse is observed only indirectly, in a "blurry" fashion, through longitudinal prescriptions of drugs over a long period of time after the medical act. We introduce a new model, called ZiMM (Zero-inflated Mixture of Multinomial distributions) in order to capture long-term and blurry relapses. On top of it, we build an end-to-end deep-learning architecture called ZiMM Encoder-Decoder (ZiMM ED) that can learn from the complex, irregular, highly heterogeneous and sparse patterns of health events that are observed through a claims-only database. ZiMM ED is applied on a "non-clinical" claims database, that contains only timestamped reimbursement codes for drug purchases, medical procedures and hospital diagnoses, the only available clinical feature being the age of the patient. This setting is more challenging than a setting where bedside clinical signals are available. Our motivation for using such a non-clinical claims database is its exhaustivity population-wise, compared to clinical electronic health records coming from a single or a small set of hospitals. Indeed, we consider a dataset containing the claims of almost all French citizens who had surgery for prostatic problems, with a history between 1.5 and 5 years. We consider a long-term (18 months) relapse (urination problems still occur despite surgery), which is blurry since it is observed only through the reimbursement of a specific set of drugs for urination problems. Our experiments show that ZiMM ED improves several baselines, including non-deep learning and deep-learning approaches, and that it allows working on such a dataset with minimal preprocessing work.
Collapse
Affiliation(s)
| | | | | | | | - Stéphane Gaïffas
- LPSM, Université de Paris, France; DMA, Ecole normale supérieure, Paris, France.
| |
Collapse
|
80
|
Le N, Wiley M, Loza A, Hristidis V, El-Kareh R. Prediction of Medical Concepts in Electronic Health Records: Similar Patient Analysis. JMIR Med Inform 2020; 8:e16008. [PMID: 32706678 PMCID: PMC7395257 DOI: 10.2196/16008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Revised: 03/01/2020] [Accepted: 03/28/2020] [Indexed: 11/13/2022] Open
Abstract
Background Medicine 2.0—the adoption of Web 2.0 technologies such as social networks in health care—creates the need for apps that can find other patients with similar experiences and health conditions based on a patient’s electronic health record (EHR). Concurrently, there is an increasing number of longitudinal EHR data sets with rich information, which are essential to fulfill this need. Objective This study aimed to evaluate the hypothesis that we can leverage similar EHRs to predict possible future medical concepts (eg, disorders) from a patient’s EHR. Methods We represented patients’ EHRs using time-based prefixes and suffixes, where each prefix or suffix is a set of medical concepts from a medical ontology. We compared the prefixes of other patients in the collection with the state of the current patient using various interpatient distance measures. The set of similar prefixes yields a set of suffixes, which we used to determine probable future concepts for the current patient’s EHR. Results We evaluated our methods on the Multiparameter Intelligent Monitoring in Intensive Care II data set of patients, where we achieved precision up to 56.1% and recall up to 69.5%. For a limited set of clinically interesting concepts, specifically a set of procedures, we found that 86.9% (353/406) of the true-positives are clinically useful, that is, these procedures were actually performed later on the patient, and only 4.7% (19/406) of true-positives were completely irrelevant. Conclusions These initial results indicate that predicting patients’ future medical concepts is feasible. Effectively predicting medical concepts can have several applications, such as managing resources in a hospital.
Collapse
Affiliation(s)
- Nhat Le
- Department of Computer Science & Engineering, University of California, Riverside, Riverside, CA, United States
| | - Matthew Wiley
- Department of Computer Science & Engineering, University of California, Riverside, Riverside, CA, United States
| | - Antonio Loza
- School of Medicine, University of California, Riverside, Riverside, CA, United States
| | - Vagelis Hristidis
- Department of Computer Science & Engineering, University of California, Riverside, Riverside, CA, United States
| | - Robert El-Kareh
- Department of Medicine, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|
81
|
Landi I, Glicksberg BS, Lee HC, Cherng S, Landi G, Danieletto M, Dudley JT, Furlanello C, Miotto R. Deep representation learning of electronic health records to unlock patient stratification at scale. NPJ Digit Med 2020; 3:96. [PMID: 32699826 PMCID: PMC7367859 DOI: 10.1038/s41746-020-0301-z] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 06/17/2020] [Indexed: 12/15/2022] Open
Abstract
Deriving disease subtypes from electronic health records (EHRs) can guide next-generation personalized medicine. However, challenges in summarizing and representing patient data prevent widespread practice of scalable EHR-based stratification analysis. Here we present an unsupervised framework based on deep learning to process heterogeneous EHRs and derive patient representations that can efficiently and effectively enable patient stratification at scale. We considered EHRs of 1,608,741 patients from a diverse hospital cohort comprising a total of 57,464 clinical concepts. We introduce a representation learning model based on word embeddings, convolutional neural networks, and autoencoders (i.e., ConvAE) to transform patient trajectories into low-dimensional latent vectors. We evaluated these representations as broadly enabling patient stratification by applying hierarchical clustering to different multi-disease and disease-specific patient cohorts. ConvAE significantly outperformed several baselines in a clustering task to identify patients with different complex conditions, with 2.61 entropy and 0.31 purity average scores. When applied to stratify patients within a certain condition, ConvAE led to various clinically relevant subtypes for different disorders, including type 2 diabetes, Parkinson's disease, and Alzheimer's disease, largely related to comorbidities, disease progression, and symptom severity. With these results, we demonstrate that ConvAE can generate patient representations that lead to clinically meaningful insights. This scalable framework can help better understand varying etiologies in heterogeneous sub-populations and unlock patterns for EHR-based research in the realm of personalized medicine.
Collapse
Affiliation(s)
- Isotta Landi
- Bruno Kessler Institute, Povo, TN Italy
- Department of Psychology and Cognitive Science, University of Trento, Rovereto, TN Italy
| | - Benjamin S. Glicksberg
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - Hao-Chih Lee
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - Sarah Cherng
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - Giulia Landi
- Department of Mental Health and Pathological Addiction, Azienda USL Centro “Santi”, Parma, Italy
| | - Matteo Danieletto
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - Joel T. Dudley
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | | | - Riccardo Miotto
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai, New York, NY USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| |
Collapse
|
82
|
Wang T, Xuan P, Liu Z, Zhang T. Assistant diagnosis with Chinese electronic medical records based on CNN and BiLSTM with phrase-level and word-level attentions. BMC Bioinformatics 2020; 21:230. [PMID: 32503424 PMCID: PMC7275511 DOI: 10.1186/s12859-020-03554-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 05/25/2020] [Indexed: 01/09/2023] Open
Abstract
Background Inferring diseases related to the patient’s electronic medical records (EMRs) is of great significance for assisting doctor diagnosis. Several recent prediction methods have shown that deep learning-based methods can learn the deep and complex information contained in EMRs. However, they do not consider the discriminative contributions of different phrases and words. Moreover, local information and context information of EMRs should be deeply integrated. Results A new method based on the fusion of a convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM) with attention mechanisms is proposed for predicting a disease related to a given EMR, and it is referred to as FCNBLA. FCNBLA deeply integrates local information, context information of the word sequence and more informative phrases and words. A novel framework based on deep learning is developed to learn the local representation, the context representation and the combination representation. The left side of the framework is constructed based on CNN to learn the local representation of adjacent words. The right side of the framework based on BiLSTM focuses on learning the context representation of the word sequence. Not all phrases and words contribute equally to the representation of an EMR meaning. Therefore, we establish the attention mechanisms at the phrase level and word level, and the middle module of the framework learns the combination representation of the enhanced phrases and words. The macro average f-score and accuracy of FCNBLA achieved 91.29 and 92.78%, respectively. Conclusion The experimental results indicate that FCNBLA yields superior performance compared with several state-of-the-art methods. The attention mechanisms and combination representations are also confirmed to be helpful for improving FCNBLA’s prediction performance. Our method is helpful for assisting doctors in diagnosing diseases in patients.
Collapse
Affiliation(s)
- Tong Wang
- School of Computer Science and Technology, Heilongjiang University, Harbin, 150080, China
| | - Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin, 150080, China.
| | - Zonglin Liu
- School of Computer Science and Technology, Heilongjiang University, Harbin, 150080, China
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin, 150080, China
| |
Collapse
|
83
|
Gupta P, Malhotra P, Narwariya J, Vig L, Shroff G. Transfer Learning for Clinical Time Series Analysis Using Deep Neural Networks. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2020; 4:112-137. [PMID: 35415440 PMCID: PMC8982827 DOI: 10.1007/s41666-019-00062-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Revised: 10/24/2019] [Accepted: 11/12/2019] [Indexed: 01/16/2023]
|
84
|
Lee D, Jiang X, Yu H. Harmonized representation learning on dynamic EHR graphs. J Biomed Inform 2020; 106:103426. [DOI: 10.1016/j.jbi.2020.103426] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 02/18/2020] [Accepted: 04/19/2020] [Indexed: 11/29/2022]
|
85
|
Li Y, Nair P, Lu XH, Wen Z, Wang Y, Dehaghi AAK, Miao Y, Liu W, Ordog T, Biernacka JM, Ryu E, Olson JE, Frye MA, Liu A, Guo L, Marelli A, Ahuja Y, Davila-Velderrain J, Kellis M. Inferring multimodal latent topics from electronic health records. Nat Commun 2020; 11:2536. [PMID: 32439869 PMCID: PMC7242436 DOI: 10.1038/s41467-020-16378-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Accepted: 04/23/2020] [Indexed: 11/10/2022] Open
Abstract
Electronic health records (EHR) are rich heterogeneous collections of patient health information, whose broad adoption provides clinicians and researchers unprecedented opportunities for health informatics, disease-risk prediction, actionable clinical recommendations, and precision medicine. However, EHRs present several modeling challenges, including highly sparse data matrices, noisy irregular clinical notes, arbitrary biases in billing code assignment, diagnosis-driven lab tests, and heterogeneous data types. To address these challenges, we present MixEHR, a multi-view Bayesian topic model. We demonstrate MixEHR on MIMIC-III, Mayo Clinic Bipolar Disorder, and Quebec Congenital Heart Disease EHR datasets. Qualitatively, MixEHR disease topics reveal meaningful combinations of clinical features across heterogeneous data types. Quantitatively, we observe superior prediction accuracy of diagnostic codes and lab test imputations compared to the state-of-art methods. We leverage the inferred patient topic mixtures to classify target diseases and predict mortality of patients in critical conditions. In all comparison, MixEHR confers competitive performance and reveals meaningful disease-related topics.
Collapse
Affiliation(s)
- Yue Li
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada.
| | - Pratheeksha Nair
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Xing Han Lu
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Zhi Wen
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Yuening Wang
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | | | - Yan Miao
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Weiqi Liu
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Tamas Ordog
- Department of Physiology and Biomedical Engineering and Division of Gastroenterology and Hepatology, Department of Medicine, and Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA
| | - Joanna M Biernacka
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Euijung Ryu
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Janet E Olson
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Mark A Frye
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Aihua Liu
- McGill Adult Unit for Congenital Heart Disease Excellence (MAUDE Unit), Montreal, QC H4A 3J1, Quebec, Canada
| | - Liming Guo
- McGill Adult Unit for Congenital Heart Disease Excellence (MAUDE Unit), Montreal, QC H4A 3J1, Quebec, Canada
| | - Ariane Marelli
- McGill Adult Unit for Congenital Heart Disease Excellence (MAUDE Unit), Montreal, QC H4A 3J1, Quebec, Canada
| | - Yuri Ahuja
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA, 02139, USA
| | - Jose Davila-Velderrain
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA, 02139, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA, 02139, USA.
- The Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, MA, 02142, USA.
| |
Collapse
|
86
|
Gao S, Alawad M, Schaefferkoetter N, Penberthy L, Wu XC, Durbin EB, Coyle L, Ramanathan A, Tourassi G. Using case-level context to classify cancer pathology reports. PLoS One 2020; 15:e0232840. [PMID: 32396579 PMCID: PMC7217446 DOI: 10.1371/journal.pone.0232840] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 04/22/2020] [Indexed: 11/18/2022] Open
Abstract
Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence-for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks-site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks.
Collapse
Affiliation(s)
- Shang Gao
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America
| | - Mohammed Alawad
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America
| | - Noah Schaefferkoetter
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America
| | - Lynne Penberthy
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, United States of America
| | - Xiao-Cheng Wu
- Louisiana Tumor Registry, Louisiana State University Health Sciences Center School of Public Health, New Orleans, LA, United States of America
| | - Eric B. Durbin
- Kentucky Cancer Registry, University of Kentucky, Lexington, KY, United States of America
| | - Linda Coyle
- Information Management Services Inc, Calverton, MD, United States of America
| | - Arvind Ramanathan
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, United States of America
| | - Georgia Tourassi
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States of America
| |
Collapse
|
87
|
Abstract
Today, despite decades of developments in medicine and the growing interest in precision healthcare, vast majority of diagnoses happen once patients begin to show noticeable signs of illness. Early indication and detection of diseases, however, can provide patients and carers with the chance of early intervention, better disease management, and efficient allocation of healthcare resources. The latest developments in machine learning (including deep learning) provides a great opportunity to address this unmet need. In this study, we introduce BEHRT: A deep neural sequence transduction model for electronic health records (EHR), capable of simultaneously predicting the likelihood of 301 conditions in one's future visits. When trained and evaluated on the data from nearly 1.6 million individuals, BEHRT shows a striking improvement of 8.0-13.2% (in terms of average precision scores for different tasks), over the existing state-of-the-art deep EHR models. In addition to its scalability and superior accuracy, BEHRT enables personalised interpretation of its predictions; its flexible architecture enables it to incorporate multiple heterogeneous concepts (e.g., diagnosis, medication, measurements, and more) to further improve the accuracy of its predictions; its (pre-)training results in disease and patient representations can be useful for future studies (i.e., transfer learning).
Collapse
|
88
|
SECNLP: A survey of embeddings in clinical natural language processing. J Biomed Inform 2020; 101:103323. [DOI: 10.1016/j.jbi.2019.103323] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2019] [Revised: 09/12/2019] [Accepted: 10/27/2019] [Indexed: 12/11/2022]
|
89
|
Ayala Solares JR, Diletta Raimondi FE, Zhu Y, Rahimian F, Canoy D, Tran J, Pinho Gomes AC, Payberah AH, Zottoli M, Nazarzadeh M, Conrad N, Rahimi K, Salimi-Khorshidi G. Deep learning for electronic health records: A comparative review of multiple deep neural architectures. J Biomed Inform 2020; 101:103337. [DOI: 10.1016/j.jbi.2019.103337] [Citation(s) in RCA: 80] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 09/25/2019] [Accepted: 11/04/2019] [Indexed: 12/24/2022]
|
90
|
Svenson P, Haralabopoulos G, Torres Torres M. Sepsis Deterioration Prediction Using Channelled Long Short-Term Memory Networks. Artif Intell Med 2020. [DOI: 10.1007/978-3-030-59137-3_32] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
91
|
Ma F, Wang Y, Xiao H, Yuan Y, Chitta R, Zhou J, Gao J. Incorporating medical code descriptions for diagnosis prediction in healthcare. BMC Med Inform Decis Mak 2019; 19:267. [PMID: 31856806 PMCID: PMC6921390 DOI: 10.1186/s12911-019-0961-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient. METHODS We propose a simple, but general diagnosis prediction framework, which includes two basic components: diagnosis code embedding and predictive model. To learn the interpretable code embeddings, we apply convolutional neural networks (CNN) to model medical descriptions of diagnosis codes extracted from online medical websites. The learned medical embedding matrix is used to embed the input visits into vector representations, which are fed into the predictive models. Any existing diagnosis prediction approach (referred to as the base model) can be cast into the proposed framework as the predictive model (called the enhanced model). RESULTS We conduct experiments on two real medical datasets: the MIMIC-III dataset and the Heart Failure claim dataset. Experimental results show that the enhanced diagnosis prediction approaches significantly improve the prediction performance. Moreover, we validate the effectiveness of the proposed framework with insufficient EHR data. Finally, we visualize the learned medical code embeddings to show the interpretability of the proposed framework. CONCLUSIONS Given the historical visit records of a patient, the proposed framework is able to predict the next visit information by incorporating medical code descriptions.
Collapse
Affiliation(s)
- Fenglong Ma
- Pennsylvania State University, State College, PA USA
| | | | | | - Ye Yuan
- Beijing University of Technology, Beijing, China
| | | | | | - Jing Gao
- University at Buffalo, Buffalo, NY USA
| |
Collapse
|
92
|
Ruan T, Lei L, Zhou Y, Zhai J, Zhang L, He P, Gao J. Representation learning for clinical time series prediction tasks in electronic health records. BMC Med Inform Decis Mak 2019; 19:259. [PMID: 31842854 PMCID: PMC6916209 DOI: 10.1186/s12911-019-0985-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Electronic health records (EHRs) provide possibilities to improve patient care and facilitate clinical research. However, there are many challenges faced by the applications of EHRs, such as temporality, high dimensionality, sparseness, noise, random error and systematic bias. In particular, temporal information is difficult to effectively use by traditional machine learning methods while the sequential information of EHRs is very useful. Method In this paper, we propose a general-purpose patient representation learning approach to summarize sequential EHRs. Specifically, a recurrent neural network based denoising autoencoder (RNN-DAE) is employed to encode inhospital records of each patient into a low dimensional dense vector. Results Based on EHR data collected from Shuguang Hospital affiliated to Shanghai University of Traditional Chinese Medicine, we experimentally evaluate our proposed RNN-DAE method on both mortality prediction task and comorbidity prediction task. Extensive experimental results show that our proposed RNN-DAE method outperforms existing methods. In addition, we apply the “Deep Feature” represented by our proposed RNN-DAE method to track similar patients with t-SNE, which also achieves some interesting observations. Conclusion We propose an effective unsupervised RNN-DAE method to summarize patient sequential information in EHR data. Our proposed RNN-DAE method is useful on both mortality prediction task and comorbidity prediction task.
Collapse
Affiliation(s)
- Tong Ruan
- School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Liqi Lei
- School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Yangming Zhou
- School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China.
| | - Jie Zhai
- School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Le Zhang
- School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Ping He
- Shanghai Hospital Development Center, 2 Kangding Road, Shanghai, 200000, China
| | - Ju Gao
- Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine, 528 Zhangheng Road, Shanghai, 201203, China
| |
Collapse
|
93
|
Wang R, Weng Y, Zhou Z, Chen L, Hao H, Wang J. Multi-objective ensemble deep learning using electronic health records to predict outcomes after lung cancer radiotherapy. Phys Med Biol 2019; 64:245005. [PMID: 31698346 DOI: 10.1088/1361-6560/ab555e] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Accurately predicting treatment outcome is crucial for creating personalized treatment plans and follow-up schedules. Electronic health records (EHRs) contain valuable patient-specific information that can be leveraged to improve outcome prediction. We propose a reliable multi-objective ensemble deep learning (MoEDL) method that uses features extracted from EHRs to predict high risk of treatment failure after radiotherapy in patients with lung cancer. The dataset used in this study contains EHRs of 814 patients who had not achieved disease-free status and 193 patients who were disease-free with at least one year follow-up time after lung cancer radiation therapy. The proposed MoEDL consists of three phases: (1) training with dynamic ensemble deep learning; (2) model selection with adaptive multi-objective optimization; and (3) testing with evidential reasoning (ER) fusion. Specifically, in the training phase, we employ deep perceptron networks as base learners to handle various issues with EHR data. The architecture and key hyper-parameters of each base learner are dynamically adjusted to increase the diversity of learners while reducing the time spent tuning hyper-parameters. Furthermore, we integrate the snapshot ensembles (SE) restarting strategy, multi-objective optimization, and ER fusion to improve the prediction robustness and accuracy of individual networks. The SE restarting strategy can yield multiple candidate models at no additional training cost in the training stage. The multi-objective model simultaneously considers sensitivity, specificity, and AUC as objective functions, overcoming the limitations of single-objective-based model selection. For the testing stage, we utilized an analytic ER rule to fuse the output scores from each optimal model to obtain reliable and robust predictive results. Our experimental results demonstrate that MoEDL can perform better than other conventional methods.
Collapse
Affiliation(s)
- Rongfang Wang
- School of Artificial Intelligence, Xidian University, Xi'an 710071, People's Republic of China. Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX 75235, United States of America. Medical Artificial Intelligence and Automation (MAIA) Lab, University of Texas Southwestern Medical Center, Dallas, TX 75390, United States of America
| | | | | | | | | | | |
Collapse
|
94
|
Talaei-Khoei A, Tavana M, Wilson JM. A predictive analytics framework for identifying patients at risk of developing multiple medical complications caused by chronic diseases. Artif Intell Med 2019; 101:101750. [PMID: 31813486 DOI: 10.1016/j.artmed.2019.101750] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Revised: 07/07/2019] [Accepted: 10/30/2019] [Indexed: 01/22/2023]
Abstract
Chronic diseases often cause several medical complications. This paper aims to predict multiple complications among patients with a chronic disease. The literature uses single-task learning algorithms to predict complications independently and assumes no correlation among complications of chronic diseases. We propose two methods (independent prediction of complications with single-task learning and concurrent prediction of complications with multi-task learning) and show that medical complications of chronic diseases can be correlated. We use a case study and compare the performance of these two methods by predicting complications of hypertrophic cardiomyopathy on 106 predictors in 1078 electronic medical records from April 2009-April 2017, inclusive. The methods are implemented using logistic regression, artificial neural networks, decision trees, and support vector machines. The results show multi-task learning with logistic regression improves the performance of predictions in terms of both discrimination and calibration.
Collapse
Affiliation(s)
- Amir Talaei-Khoei
- Department of Information Systems, University of Nevada, Reno, USA; School of Software, University of Technology Sydney, Australia.
| | - Madjid Tavana
- Business Systems and Analytics Department, Distinguished Chair of Business Analytics, La Salle University, Philadelphia, USA; Business Information Systems Department, Faculty of Business Administration and Economics, University of Paderborn, Paderborn, Germany.
| | - James M Wilson
- School of Community Health Sciences, University of Nevada, Reno, USA.
| |
Collapse
|
95
|
Dligach D, Afshar M, Miller T. Toward a clinical text encoder: pretraining for clinical natural language processing with applications to substance misuse. J Am Med Inform Assoc 2019; 26:1272-1278. [PMID: 31233140 PMCID: PMC6798566 DOI: 10.1093/jamia/ocz072] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 04/06/2019] [Accepted: 04/28/2019] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE Our objective is to develop algorithms for encoding clinical text into representations that can be used for a variety of phenotyping tasks. MATERIALS AND METHODS Obtaining large datasets to take advantage of highly expressive deep learning methods is difficult in clinical natural language processing (NLP). We address this difficulty by pretraining a clinical text encoder on billing code data, which is typically available in abundance. We explore several neural encoder architectures and deploy the text representations obtained from these encoders in the context of clinical text classification tasks. While our ultimate goal is learning a universal clinical text encoder, we also experiment with training a phenotype-specific encoder. A universal encoder would be more practical, but a phenotype-specific encoder could perform better for a specific task. RESULTS We successfully train several clinical text encoders, establish a new state-of-the-art on comorbidity data, and observe good performance gains on substance misuse data. DISCUSSION We find that pretraining using billing codes is a promising research direction. The representations generated by this type of pretraining have universal properties, as they are highly beneficial for many phenotyping tasks. Phenotype-specific pretraining is a viable route for trading the generality of the pretrained encoder for better performance on a specific phenotyping task. CONCLUSIONS We successfully applied our approach to many phenotyping tasks. We conclude by discussing potential limitations of our approach.
Collapse
Affiliation(s)
- Dmitriy Dligach
- Department of Computer Science, Loyola University Chicago, Chicago, Illinois, USA
- Department of Public Health Sciences, Stritch School of Medicine, Loyola University, Maywood, Illinois, USA
- Center for Health Outcomes and Informatics Research, Loyola University, Maywood, Illinois, USA
| | - Majid Afshar
- Department of Public Health Sciences, Stritch School of Medicine, Loyola University, Maywood, Illinois, USA
- Center for Health Outcomes and Informatics Research, Loyola University, Maywood, Illinois, USA
| | - Timothy Miller
- Computational Health Informatics Program (CHIP), Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
96
|
Wang R, Pan W, Jin L, Li Y, Geng Y, Gao C, Chen G, Wang H, Ma D, Liao S. Artificial intelligence in reproductive medicine. Reproduction 2019; 158:R139-R154. [PMID: 30970326 PMCID: PMC6733338 DOI: 10.1530/rep-18-0523] [Citation(s) in RCA: 92] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Accepted: 04/10/2019] [Indexed: 12/16/2022]
Abstract
Artificial intelligence (AI) has experienced rapid growth over the past few years, moving from the experimental to the implementation phase in various fields, including medicine. Advances in learning algorithms and theories, the availability of large datasets and improvements in computing power have contributed to breakthroughs in current AI applications. Machine learning (ML), a subset of AI, allows computers to detect patterns from large complex datasets automatically and uses these patterns to make predictions. AI is proving to be increasingly applicable to healthcare, and multiple machine learning techniques have been used to improve the performance of assisted reproductive technology (ART). Despite various challenges, the integration of AI and reproductive medicine is bound to give an essential direction to medical development in the future. In this review, we discuss the basic aspects of AI and machine learning, and we address the applications, potential limitations and challenges of AI. We also highlight the prospects and future directions in the context of reproductive medicine.
Collapse
Affiliation(s)
- Renjie Wang
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Wei Pan
- School of Economics and Management, Wuhan University, Wuhan, Hubei, People’s Republic of China
| | - Lei Jin
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Yuehan Li
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Yudi Geng
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Chun Gao
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Gang Chen
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Hui Wang
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Ding Ma
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
| | - Shujie Liao
- Department of Obstetrics and Gynecology, Cancer Biology Research Center, Tongji Hospital, Tongji Medical College of HUST, Wuhan, Hubei, People’s Republic of China
- Correspondence should be addressed to S Liao;
| |
Collapse
|
97
|
Development and Validation of a Deep Neural Network Model for Prediction of Postoperative In-hospital Mortality. Anesthesiology 2019; 129:649-662. [PMID: 29664888 DOI: 10.1097/aln.0000000000002186] [Citation(s) in RCA: 127] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
WHAT WE ALREADY KNOW ABOUT THIS TOPIC WHAT THIS ARTICLE TELLS US THAT IS NEW: BACKGROUND:: The authors tested the hypothesis that deep neural networks trained on intraoperative features can predict postoperative in-hospital mortality. METHODS The data used to train and validate the algorithm consists of 59,985 patients with 87 features extracted at the end of surgery. Feed-forward networks with a logistic output were trained using stochastic gradient descent with momentum. The deep neural networks were trained on 80% of the data, with 20% reserved for testing. The authors assessed improvement of the deep neural network by adding American Society of Anesthesiologists (ASA) Physical Status Classification and robustness of the deep neural network to a reduced feature set. The networks were then compared to ASA Physical Status, logistic regression, and other published clinical scores including the Surgical Apgar, Preoperative Score to Predict Postoperative Mortality, Risk Quantification Index, and the Risk Stratification Index. RESULTS In-hospital mortality in the training and test sets were 0.81% and 0.73%. The deep neural network with a reduced feature set and ASA Physical Status classification had the highest area under the receiver operating characteristics curve, 0.91 (95% CI, 0.88 to 0.93). The highest logistic regression area under the curve was found with a reduced feature set and ASA Physical Status (0.90, 95% CI, 0.87 to 0.93). The Risk Stratification Index had the highest area under the receiver operating characteristics curve, at 0.97 (95% CI, 0.94 to 0.99). CONCLUSIONS Deep neural networks can predict in-hospital mortality based on automatically extractable intraoperative data, but are not (yet) superior to existing methods.
Collapse
|
98
|
Tobore I, Li J, Yuhang L, Al-Handarish Y, Kandwal A, Nie Z, Wang L. Deep Learning Intervention for Health Care Challenges: Some Biomedical Domain Considerations. JMIR Mhealth Uhealth 2019; 7:e11966. [PMID: 31376272 PMCID: PMC6696854 DOI: 10.2196/11966] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 04/14/2019] [Accepted: 06/12/2019] [Indexed: 01/10/2023] Open
Abstract
The use of deep learning (DL) for the analysis and diagnosis of biomedical and health care problems has received unprecedented attention in the last decade. The technique has recorded a number of achievements for unearthing meaningful features and accomplishing tasks that were hitherto difficult to solve by other methods and human experts. Currently, biological and medical devices, treatment, and applications are capable of generating large volumes of data in the form of images, sounds, text, graphs, and signals creating the concept of big data. The innovation of DL is a developing trend in the wake of big data for data representation and analysis. DL is a type of machine learning algorithm that has deeper (or more) hidden layers of similar function cascaded into the network and has the capability to make meaning from medical big data. Current transformation drivers to achieve personalized health care delivery will be possible with the use of mobile health (mHealth). DL can provide the analysis for the deluge of data generated from mHealth apps. This paper reviews the fundamentals of DL methods and presents a general view of the trends in DL by capturing literature from PubMed and the Institute of Electrical and Electronics Engineers database publications that implement different variants of DL. We highlight the implementation of DL in health care, which we categorize into biological system, electronic health record, medical image, and physiological signals. In addition, we discuss some inherent challenges of DL affecting biomedical and health domain, as well as prospective research directions that focus on improving health management by promoting the application of physiological signals and modern internet technology.
Collapse
Affiliation(s)
- Igbe Tobore
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China.,Graduate University, Chinese Academy of Sciences, Beijing, China
| | - Jingzhen Li
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Liu Yuhang
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yousef Al-Handarish
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Abhishek Kandwal
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Zedong Nie
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Lei Wang
- Center for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advance Technology, Chinese Academy of Sciences, Shenzhen, China
| |
Collapse
|
99
|
Ashfaq A, Sant'Anna A, Lingman M, Nowaczyk S. Readmission prediction using deep learning on electronic health records. J Biomed Inform 2019; 97:103256. [PMID: 31351136 DOI: 10.1016/j.jbi.2019.103256] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 06/24/2019] [Accepted: 07/22/2019] [Indexed: 01/30/2023]
Abstract
Unscheduled 30-day readmissions are a hallmark of Congestive Heart Failure (CHF) patients that pose significant health risks and escalate care cost. In order to reduce readmissions and curb the cost of care, it is important to initiate targeted intervention programs for patients at risk of readmission. This requires identifying high-risk patients at the time of discharge from hospital. Here, using real data from over 7500 CHF patients hospitalized between 2012 and 2016 in Sweden, we built and tested a deep learning framework to predict 30-day unscheduled readmission. We present a cost-sensitive formulation of Long Short-Term Memory (LSTM) neural network using expert features and contextual embedding of clinical concepts. This study targets key elements of an Electronic Health Record (EHR) driven prediction model in a single framework: using both expert and machine derived features, incorporating sequential patterns and addressing the class imbalance problem. We evaluate the contribution of each element towards prediction performance (ROC-AUC, F1-measure) and cost-savings. We show that the model with all key elements achieves higher discrimination ability (AUC: 0.77; F1: 0.51; Cost: 22% of maximum possible savings) outperforming the reduced models in at least two evaluation metrics. Additionally, we present a simple financial analysis to estimate annual savings if targeted interventions are offered to high risk patients.
Collapse
Affiliation(s)
- Awais Ashfaq
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden; Halland Hospital, Region Halland, Sweden.
| | - Anita Sant'Anna
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden
| | - Markus Lingman
- Halland Hospital, Region Halland, Sweden; Institute of Medicine, Dept. of Molecular and Clinical Medicine/Cardiology, Sahlgrenska Academy, University of Gothenburg, Sweden
| | - Sławomir Nowaczyk
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden
| |
Collapse
|
100
|
Gensheimer MF, Henry AS, Wood DJ, Hastie TJ, Aggarwal S, Dudley SA, Pradhan P, Banerjee I, Cho E, Ramchandran K, Pollom E, Koong AC, Rubin DL, Chang DT. Automated Survival Prediction in Metastatic Cancer Patients Using High-Dimensional Electronic Medical Record Data. J Natl Cancer Inst 2019; 111:568-574. [PMID: 30346554 PMCID: PMC6579743 DOI: 10.1093/jnci/djy178] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Revised: 06/28/2018] [Accepted: 09/05/2018] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Oncologists use patients' life expectancy to guide decisions and may benefit from a tool that accurately predicts prognosis. Existing prognostic models generally use only a few predictor variables. We used an electronic medical record dataset to train a prognostic model for patients with metastatic cancer. METHODS The model was trained and tested using 12 588 patients treated for metastatic cancer in the Stanford Health Care system from 2008 to 2017. Data sources included provider note text, labs, vital signs, procedures, medication orders, and diagnosis codes. Patients were divided randomly into a training set used to fit the model coefficients and a test set used to evaluate model performance (80%/20% split). A regularized Cox model with 4126 predictor variables was used. A landmarking approach was used due to the multiple observations per patient, with t0 set to the time of metastatic cancer diagnosis. Performance was also evaluated using 399 palliative radiation courses in test set patients. RESULTS The C-index for overall survival was 0.786 in the test set (averaged across landmark times). For palliative radiation courses, the C-index was 0.745 (95% confidence interval [CI] = 0.715 to 0.775) compared with 0.635 (95% CI = 0.601 to 0.669) for a published model using performance status, primary tumor site, and treated site (two-sided P < .001). Our model's predictions were well-calibrated. CONCLUSIONS The model showed high predictive performance, which will need to be validated using external data. Because it is fully automated, the model can be used to examine providers' practice patterns and could be deployed in a decision support tool to help improve quality of care.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Eunpi Cho
- Stanford University, Stanford, CA; Genentech, South San Francisco, CA
| | | | | | - Albert C Koong
- Department of Radiation Oncology
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX
| | - Daniel L Rubin
- Department of Biomedical Data Science
- Department of Statistics
| | | |
Collapse
|