1
|
van der Haar D, Moustafa A, Warren SL, Alashwal H, van Zyl T. An Alzheimer's disease category progression sub-grouping analysis using manifold learning on ADNI. Sci Rep 2023; 13:10483. [PMID: 37380746 DOI: 10.1038/s41598-023-37569-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 06/23/2023] [Indexed: 06/30/2023] Open
Abstract
Many current statistical and machine learning methods have been used to explore Alzheimer's disease (AD) and its associated patterns that contribute to the disease. However, there has been limited success in understanding the relationship between cognitive tests, biomarker data, and patient AD category progressions. In this work, we perform exploratory data analysis of AD health record data by analyzing various learned lower dimensional manifolds to separate early-stage AD categories further. Specifically, we used Spectral embedding, Multidimensional scaling, Isomap, t-Distributed Stochastic Neighbour Embedding, Uniform Manifold Approximation and Projection, and sparse denoising autoencoder based manifolds on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. We then determine the clustering potential of the learned embeddings and then determine if category sub-groupings or sub-categories can be found. We then used a Kruskal-sWallis H test to determine the statistical significance of the discovered AD subcategories. Our results show that the existing AD categories do exhibit sub-groupings, especially in mild cognitive impairment transitions in many of the tested manifolds, showing there may be a need for further subcategories to describe AD progression.
Collapse
Affiliation(s)
- Dustin van der Haar
- Academy of Computer Science and Software Engineering, University of Johannesburg, Gauteng, South Africa.
| | - Ahmed Moustafa
- Department of Human Anatomy and Physiology, University of Johannesburg, Gauteng, South Africa
- School of Psychology, Faculty of Society and Design, Bond University, Gold Coast, QLD, Australia
| | - Samuel L Warren
- School of Psychology, Faculty of Society and Design, Bond University, Gold Coast, QLD, Australia
| | - Hany Alashwal
- College of Information Technology, United Arab Emirates University, Al-Ain, United Arab Emirates
| | - Terence van Zyl
- Institute for Intelligent Systems, University of Johannesburg, Gauteng, South Africa
| |
Collapse
|
2
|
Yun WJ, Shin M, Jung S, Ko J, Lee HC, Kim J. Deep reinforcement learning-based propofol infusion control for anesthesia: A feasibility study with a 3000-subject dataset. Comput Biol Med 2023; 156:106739. [PMID: 36889025 DOI: 10.1016/j.compbiomed.2023.106739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 11/26/2022] [Accepted: 03/01/2023] [Indexed: 03/06/2023]
Abstract
In this work, we present a deep reinforcement learning-based approach as a baseline system for autonomous propofol infusion control. Specifically, design an environment for simulating the possible conditions of a target patient based on input demographic data and design our reinforcement learning model-based system so that it effectively makes predictions on the proper level of propofol infusion to maintain stable anesthesia even under dynamic conditions that can affect the decision-making process, such as the manual control of remifentanil by anesthesiologists and the varying patient conditions under anesthesia. Through an extensive set of evaluations using patient data from 3000 subjects, we show that the proposed method results in stabilization in the anesthesia state, by managing the bispectral index (BIS) and effect-site concentration for a patient showing varying conditions.
Collapse
Affiliation(s)
- Won Joon Yun
- School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea.
| | | | - Soyi Jung
- Department of Electrical and Computer Engineering, Ajou University, Suwon 16499, Republic of Korea.
| | - JeongGil Ko
- School of Integrated Technology, Yonsei University, Seoul 03722, Republic of Korea.
| | - Hyung-Chul Lee
- Seoul National University College of Medicine & Seoul National University Hospital, Seoul 03080, Republic of Korea.
| | - Joongheon Kim
- School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea.
| |
Collapse
|
3
|
Le TD, Noumeir R, Rambaud J, Sans G, Jouvet P. Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023; 11:469-478. [PMID: 37817825 PMCID: PMC10561736 DOI: 10.1109/jtehm.2023.3241635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 01/08/2023] [Accepted: 01/30/2023] [Indexed: 10/12/2023]
Abstract
When dealing with clinical text classification on a small dataset, recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. GOAL Our aim is, therefore, to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively. METHODS This study proposed an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation. The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers was then evaluated in the trained and compressed feature space. RESULTS The proposed approach provided overall performance gains of up to 3% for each test set evaluation. Finally, the classifier achieved 92% accuracy, 91% recall, 91% precision, and 91% f1-score in detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process were demonstrated by applying the theoretic information bottleneck framework. Clinical and Translational Impact Statement- An autoencoder learning algorithm effectively tackles the problem of sparsity in the representation feature space from a small clinical narrative dataset. Significantly, it can learn the best representation of the training data because of its lossless compression capacity compared to other approaches. Consequently, its downstream classification ability can be significantly improved, which cannot be done using deep learning models.
Collapse
Affiliation(s)
- Thanh-Dung Le
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Rita Noumeir
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
| | - Jerome Rambaud
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Guillaume Sans
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Philippe Jouvet
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| |
Collapse
|
4
|
Wang M, Wei Z, Jia M, Chen L, Ji H. Deep learning model for multi-classification of infectious diseases from unstructured electronic medical records. BMC Med Inform Decis Mak 2022; 22:41. [PMID: 35168624 PMCID: PMC8848865 DOI: 10.1186/s12911-022-01776-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 01/28/2022] [Indexed: 01/21/2023] Open
Abstract
Purpose Predictively diagnosing infectious diseases helps in providing better treatment and enhances the prevention and control of such diseases. This study uses actual data from a hospital. A multiple infectious disease diagnostic model (MIDDM) is designed for conducting multi-classification of infectious diseases so as to assist in clinical infectious-disease decision-making. Methods Based on actual hospital medical records of infectious diseases from December 2012 to December 2020, a deep learning model for multi-classification research on infectious diseases is constructed. The data includes 20,620 cases covering seven types of infectious diseases, including outpatients and inpatients, of which training data accounted for 80%, i.e., 16,496 cases, and test data accounted for 20%, i.e., 4124 cases. Through the auto-encoder, data normalization and sparse data densification processing are carried out to improve the model training effect. A residual network and attention mechanism are introduced into the MIDDM model to improve the performance of the model. Result MIDDM achieved improved prediction results in diagnosing seven kinds of infectious diseases. In the case of similar disease diagnosis characteristics and similar interference factors, the prediction accuracy of disease classification with more sample data is significantly higher than the prediction accuracy of disease classification with fewer sample data. For instance, the training data for viral hepatitis, influenza, and hand foot and mouth disease were 2954, 3924, and 3015 respectively and the corresponding test accuracy rates were 99.86%, 98.47%, and 97.31%. There is less training data for syphilis, infectious diarrhea, and measles, i.e., 1208, 575, and 190 respectively and the corresponding test accuracy rates were noticeably lower, i.e., 83.03%, 87.30%, and42.11%. We also compared the MIDDM model with the models used in other studies. Using the same input data, taking viral hepatitis as an example, the accuracy of MIDDM is 99.44%, which is significantly higher than that of XGBoost (96.19%), Decision tree (90.13%), Bayesian method (85.19%), and logistic regression (91.26%). Other diseases were also significantly better predicted by MIDDM than by these three models. Conclusion The application of the MIDDM model to multi-class diagnosis and prediction of infectious diseases can improve the accuracy of infectious-disease diagnosis. However, these results need to be further confirmed via clinical randomized controlled trials.
Collapse
Affiliation(s)
- Mengying Wang
- Information Management and Big Data Center, Peking University Third Hospital, Beijing, China
| | - Zhenhao Wei
- Goodwill Hessian Health Technology Co. Ltd, Beijing, China
| | - Mo Jia
- Information Management and Big Data Center, Peking University Third Hospital, Beijing, China
| | - Lianzhong Chen
- Goodwill Hessian Health Technology Co. Ltd, Beijing, China
| | - Hong Ji
- Information Management and Big Data Center, Peking University Third Hospital, Beijing, China.
| |
Collapse
|
5
|
Si Y, Du J, Li Z, Jiang X, Miller T, Wang F, Jim Zheng W, Roberts K. Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review. J Biomed Inform 2021; 115:103671. [PMID: 33387683 PMCID: PMC11290708 DOI: 10.1016/j.jbi.2020.103671] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Revised: 10/23/2020] [Accepted: 12/23/2020] [Indexed: 12/22/2022]
Abstract
OBJECTIVES Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective. METHODS We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection. RESULTS Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies. DISCUSSION & CONCLUSION The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to leveraging the richness and potential of available EHR data. Reproducibility and transparency of reported results will hopefully improve. Knowledge distillation and advanced learning techniques will be exploited to assist the capability of learning patient representation further.
Collapse
Affiliation(s)
- Yuqi Si
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Jingcheng Du
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Zhao Li
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Xiaoqian Jiang
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Timothy Miller
- Computational Health Informatics Program (CHIP), Boston Children's Hospital and Harvard Medical School, MA, USA
| | - Fei Wang
- Department of Population Health Sciences. Weill Cornell Medicine, Cornell University, NY, USA
| | - W Jim Zheng
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Kirk Roberts
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA.
| |
Collapse
|
6
|
Zhang H, Deng K, Li H, Albin RL, Guan Y. Deep Learning Identifies Digital Biomarkers for Self-Reported Parkinson's Disease. PATTERNS 2020; 1. [PMID: 32699844 PMCID: PMC7375444 DOI: 10.1016/j.patter.2020.100042] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Large-scale population screening and in-home monitoring for patients with Parkinson's disease (PD) has so far been mainly carried out by traditional healthcare methods and systems. Development of mobile health may provide an independent, future method to detect PD. Current PD detection algorithms will benefit from better generalizability with data collected in real-world situations. In this paper, we report the top-performing smartphone-based method in the recent DREAM Parkinson's Disease Digital Biomarker Challenge for digital diagnosis of PD. Utilizing real-world accelerometer records, this approach differentiated PD from control subjects with an area under the receiver-operating characteristic curve of 0.87 by 3D augmentation of accelerometer records, a significant improvement over other state-of-the-art methods. This study paves the way for future at-home screening of PD and other neurodegenerative conditions affecting movement.
Collapse
Affiliation(s)
- Hanrui Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Kaiwen Deng
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Hongyang Li
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Roger L Albin
- Department of Neurology, University of Michigan Medical School, Ann Arbor, MI, USA.,Neurology Service & GRECC, VAAAHS, Ann Arbor, MI 48109, USA
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.,Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109, USA.,Lead Contact
| |
Collapse
|
7
|
Kashef A, Khatibi T, Mehrvar A. Treatment outcome classification of pediatric Acute Lymphoblastic Leukemia patients with clinical and medical data using machine learning: A case study at MAHAK hospital. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100399] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|