1
|
Woodman RJ, Mangoni AA. A comprehensive review of machine learning algorithms and their application in geriatric medicine: present and future. Aging Clin Exp Res 2023; 35:2363-2397. [PMID: 37682491 PMCID: PMC10627901 DOI: 10.1007/s40520-023-02552-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 08/24/2023] [Indexed: 09/09/2023]
Abstract
The increasing access to health data worldwide is driving a resurgence in machine learning research, including data-hungry deep learning algorithms. More computationally efficient algorithms now offer unique opportunities to enhance diagnosis, risk stratification, and individualised approaches to patient management. Such opportunities are particularly relevant for the management of older patients, a group that is characterised by complex multimorbidity patterns and significant interindividual variability in homeostatic capacity, organ function, and response to treatment. Clinical tools that utilise machine learning algorithms to determine the optimal choice of treatment are slowly gaining the necessary approval from governing bodies and being implemented into healthcare, with significant implications for virtually all medical disciplines during the next phase of digital medicine. Beyond obtaining regulatory approval, a crucial element in implementing these tools is the trust and support of the people that use them. In this context, an increased understanding by clinicians of artificial intelligence and machine learning algorithms provides an appreciation of the possible benefits, risks, and uncertainties, and improves the chances for successful adoption. This review provides a broad taxonomy of machine learning algorithms, followed by a more detailed description of each algorithm class, their purpose and capabilities, and examples of their applications, particularly in geriatric medicine. Additional focus is given on the clinical implications and challenges involved in relying on devices with reduced interpretability and the progress made in counteracting the latter via the development of explainable machine learning.
Collapse
Affiliation(s)
- Richard J Woodman
- Centre of Epidemiology and Biostatistics, College of Medicine and Public Health, Flinders University, GPO Box 2100, Adelaide, SA, 5001, Australia.
| | - Arduino A Mangoni
- Discipline of Clinical Pharmacology, College of Medicine and Public Health, Flinders University, Adelaide, SA, Australia
- Department of Clinical Pharmacology, Flinders Medical Centre, Southern Adelaide Local Health Network, Adelaide, SA, Australia
| |
Collapse
|
2
|
Chen K, Abtahi F, Carrero JJ, Fernandez-Llatas C, Seoane F. Process mining and data mining applications in the domain of chronic diseases: A systematic review. Artif Intell Med 2023; 144:102645. [PMID: 37783545 DOI: 10.1016/j.artmed.2023.102645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 08/24/2023] [Accepted: 08/28/2023] [Indexed: 10/04/2023]
Abstract
The widespread use of information technology in healthcare leads to extensive data collection, which can be utilised to enhance patient care and manage chronic illnesses. Our objective is to summarise previous studies that have used data mining or process mining methods in the context of chronic diseases in order to identify research trends and future opportunities. The review covers articles that pertain to the application of data mining or process mining methods on chronic diseases that were published between 2000 and 2022. Articles were sourced from PubMed, Web of Science, EMBASE, and Google Scholar based on predetermined inclusion and exclusion criteria. A total of 71 articles met the inclusion criteria and were included in the review. Based on the literature review results, we detected a growing trend in the application of data mining methods in diabetes research. Additionally, a distinct increase in the use of process mining methods to model clinical pathways in cancer research was observed. Frequently, this takes the form of a collaborative integration of process mining, data mining, and traditional statistical methods. In light of this collaborative approach, the meticulous selection of statistical methods based on their underlying assumptions is essential when integrating these traditional methods with process mining and data mining methods. Another notable challenge is the lack of standardised guidelines for reporting process mining studies in the medical field. Furthermore, there is a pressing need to enhance the clinical interpretation of data mining and process mining results.
Collapse
Affiliation(s)
- Kaile Chen
- Department of Clinical Science, Intervention and Technology, Karolinska Institutet, 17177 Stockholm, Sweden; School of Engineering Sciences in Chemistry, Biotechnology and Health, Department of Biomedical Engineering and Health Systems, Division of Ergonomics, KTH Royal Institute of Technology, 14157 Stockholm, Sweden.
| | - Farhad Abtahi
- Department of Clinical Science, Intervention and Technology, Karolinska Institutet, 17177 Stockholm, Sweden; School of Engineering Sciences in Chemistry, Biotechnology and Health, Department of Biomedical Engineering and Health Systems, Division of Ergonomics, KTH Royal Institute of Technology, 14157 Stockholm, Sweden; Department of Clinical Physiology, Karolinska University Hospital, 17176 Stockholm, Sweden
| | - Juan-Jesus Carrero
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 17177 Stockholm, Sweden
| | - Carlos Fernandez-Llatas
- Department of Clinical Science, Intervention and Technology, Karolinska Institutet, 17177 Stockholm, Sweden; SABIEN, ITACA, Universitat Politècnica de València, Spain
| | - Fernando Seoane
- Department of Clinical Science, Intervention and Technology, Karolinska Institutet, 17177 Stockholm, Sweden; Department of Clinical Physiology, Karolinska University Hospital, 17176 Stockholm, Sweden; Department of Medical Technology, Karolinska University Hospital, 17176 Stockholm, Sweden; Department of Textile Technology, University of Borås, 50190 Borås, Sweden
| |
Collapse
|
3
|
Alsaleh MM, Allery F, Choi JW, Hama T, McQuillin A, Wu H, Thygesen JH. Prediction of disease comorbidity using explainable artificial intelligence and machine learning techniques: A systematic review. Int J Med Inform 2023; 175:105088. [PMID: 37156169 DOI: 10.1016/j.ijmedinf.2023.105088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 03/23/2023] [Accepted: 05/01/2023] [Indexed: 05/10/2023]
Abstract
OBJECTIVE Disease comorbidity is a major challenge in healthcare affecting the patient's quality of life and costs. AI-based prediction of comorbidities can overcome this issue by improving precision medicine and providing holistic care. The objective of this systematic literature review was to identify and summarise existing machine learning (ML) methods for comorbidity prediction and evaluate the interpretability and explainability of the models. MATERIALS AND METHODS The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) framework was used to identify articles in three databases: Ovid Medline, Web of Science and PubMed. The literature search covered a broad range of terms for the prediction of disease comorbidity and ML, including traditional predictive modelling. RESULTS Of 829 unique articles, 58 full-text papers were assessed for eligibility. A final set of 22 articles with 61 ML models was included in this review. Of the identified ML models, 33 models achieved relatively high accuracy (80-95%) and AUC (0.80-0.89). Overall, 72% of studies had high or unclear concerns regarding the risk of bias. DISCUSSION This systematic review is the first to examine the use of ML and explainable artificial intelligence (XAI) methods for comorbidity prediction. The chosen studies focused on a limited scope of comorbidities ranging from 1 to 34 (mean = 6), and no novel comorbidities were found due to limited phenotypic and genetic data. The lack of standard evaluation for XAI hinders fair comparisons. CONCLUSION A broad range of ML methods has been used to predict the comorbidities of various disorders. With further development of explainable ML capacity in the field of comorbidity prediction, there is a significant possibility of identifying unmet health needs by highlighting comorbidities in patient groups that were not previously recognised to be at risk for particular comorbidities.
Collapse
Affiliation(s)
- Mohanad M Alsaleh
- Institute of Health Informatics, University College London, London, UK; Department of Health Informatics, College of Public Health and Health Informatics, Qassim University, Al Bukayriyah, Saudi Arabia.
| | - Freya Allery
- Institute of Health Informatics, University College London, London, UK
| | - Jung Won Choi
- Institute of Health Informatics, University College London, London, UK
| | - Tuankasfee Hama
- Institute of Health Informatics, University College London, London, UK
| | | | - Honghan Wu
- Institute of Health Informatics, University College London, London, UK
| | - Johan H Thygesen
- Institute of Health Informatics, University College London, London, UK
| |
Collapse
|
4
|
Oflaz Z, Yozgatligil C, Selcuk-Kestel AS. Modeling comorbidity of chronic diseases using coupled hidden Markov model with bivariate discrete copula. Stat Methods Med Res 2023; 32:829-849. [PMID: 36775994 DOI: 10.1177/09622802231155100] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]
Abstract
A range of chronic diseases have a significant influence on each other and share common risk factors. Comorbidity, which shows the existence of two or more diseases interacting or triggering each other, is an important measure for actuarial valuations. The main proposal of the study is to model parallel interacting processes describing two or more chronic diseases by a combination of hidden Markov theory and copula function. This study introduces a coupled hidden Markov model with the bivariate discrete copula function in the hidden process. To estimate the parameters of the model and deal with the numerical intractability of the log-likelihood, we use a variational expectation maximization algorithm. To perform the variational expectation maximization algorithm, a lower bound of the model's log-likelihood is defined, and estimators of the parameters are computed in the M-part. A possible numerical underflow occurring in the computation of forward-backward probabilities is solved. The simulation study is conducted for two different levels of association to assess the performance of the proposed model, resulting in satisfactory findings. The proposed model was applied to hospital appointment data from a private hospital. The model defines the dependency structure of unobserved disease data and its dynamics. The application results demonstrate that the model is useful for investigating disease comorbidity when only population dynamics over time and no clinical data are available.
Collapse
Affiliation(s)
- Zarina Oflaz
- Department of Industrial Engineering, 218507KTO Karatay University, Konya, Turkey
| | - Ceylan Yozgatligil
- Department of Statistics, 52984Middle East Technical University, Ankara, Turkey
| | | |
Collapse
|
5
|
Kim Y, Zhang K, Savitz SI, Chen L, Schulz PE, Jiang X. Counterfactual analysis of differential comorbidity risk factors in Alzheimer's disease and related dementias. PLOS DIGITAL HEALTH 2022; 1:e0000018. [PMID: 36812506 PMCID: PMC9931358 DOI: 10.1371/journal.pdig.0000018] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 01/21/2022] [Indexed: 11/19/2022]
Abstract
Alzheimer's disease and related dementias (ADRD) is a multifactorial disease that involves several different etiologic mechanisms with various comorbidities. There is also significant heterogeneity in the prevalence of ADRD across diverse demographics groups. Association studies on such heterogeneous comorbidity risk factors are limited in their ability to determine causation. We aim to compare counterfactual treatment effects of various comorbidity in ADRD in different racial groups (African Americans and Caucasians). We used 138,026 ADRD and 1:1 matched older adults without ADRD from nationwide electronic health records, which extensively cover a large population's long medical history in breadth. We matched African Americans and Caucasians based on age, sex, and high-risk comorbidities (hypertension, diabetes, obesity, vascular disease, heart disease, and head injury) to build two comparable cohorts. We derived a Bayesian network of 100 comorbidities and selected comorbidities with potential causal effect to ADRD. We estimated the average treatment effect (ATE) of the selected comorbidities on ADRD using inverse probability of treatment weighting. Late effects of cerebrovascular disease significantly predisposed older African Americans (ATE = 0.2715) to ADRD, but not in the Caucasian counterparts; depression significantly predisposed older Caucasian counterparts (ATE = 0.1560) to ADRD, but not in the African Americans. Our extensive counterfactual analysis using a nationwide EHR discovered different comorbidities that predispose older African Americans to ADRD compared to Caucasian counterparts. Despite the noisy and incomplete nature of the real-world data, the counterfactual analysis on the comorbidity risk factors can be a valuable tool to support the risk factor exposure studies.
Collapse
Affiliation(s)
- Yejin Kim
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
- Institute for Stroke and Cerebrovascular Disease, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Kai Zhang
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Sean I. Savitz
- Institute for Stroke and Cerebrovascular Disease, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Luyao Chen
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Paul E. Schulz
- Department of Neurology, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| | - Xiaoqian Jiang
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
- Institute for Stroke and Cerebrovascular Disease, University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| |
Collapse
|
6
|
Faruqui SHA, Alaeddini A, Wang J, Fisher-Hoch SP, McCormick JB. Dynamic Functional Continuous Time Bayesian Networks for Prediction and Monitoring of the Impact of Patients' Modifiable Lifestyle Behaviors on the Emergence of Multiple Chronic Conditions. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2021; 9:169092-169106. [PMID: 35601689 PMCID: PMC9121781 DOI: 10.1109/access.2021.3136618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
More than a quarter of all Americans are estimated to have multiple chronic conditions (MCC). It is known that shared modifiable lifestyle behaviors account for many common MCC. What is not precisely known is the dynamic effect of changes in lifestyle behaviors on the trajectories of MCC emergence. This paper proposes dynamic functional continuous time Bayesian networks to effectively formulate the dynamic effect of patients' modifiable lifestyle behaviors and their interaction with non-modifiable demographics and preexisting conditions on the emergence of MCC. The proposed method considers the parameters of the conditional dependencies of MCC as a nonlinear state-space model and develops an extended Kalman filter to capture the dynamics of the modifiable risk factors on the MCC evolution. It also develops a tensor-based control chart based on the integration of multilinear principal component analysis and multivariate exponentially weighted moving average chart to monitor the effect of changes in the modifiable risk factors on the risk of new MCC. We validate the proposed method based on a combination of simulation and a real dataset of 385 patients from the Cameron County Hispanic Cohort. The dataset examines the emergence of 5 chronic conditions (Diabetes, Obesity, Cognitive Impairment, Hyperlipidemia, Hypertension) based on 4 modifiable lifestyle behaviors representing (Diet, Exercise, Smoking Habits, Drinking Habits) and 3 non-modifiable demographic risk factors (Age, Gender, Education). For the simulated study, the proposed algorithm shows a run-length of 4 samples (4 months) to identify behavioral changes with significant impacts on the risk of new MCC. For the real data study, the proposed algorithm shows a run-length of one sample (one year) to identify behavioral changes with significant impacts on the risk of new MCC. The results demonstrate the sensitivity of the proposed methodology for dynamic prediction and monitoring of the risk of MCC emergence in individual patients.
Collapse
Affiliation(s)
- Syed Hasib Akhter Faruqui
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Adel Alaeddini
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Jing Wang
- College of Nursing, Florida State University, Tallahassee, FL 32306, USA
| | - Susan P Fisher-Hoch
- School of Public Health, The University of Texas Health Houston, Brownsville, TX 78520, USA
| | - Joseph B McCormick
- School of Public Health, The University of Texas Health Houston, Brownsville, TX 78520, USA
| |
Collapse
|
7
|
Cezard G, McHale CT, Sullivan F, Bowles JKF, Keenan K. Studying trajectories of multimorbidity: a systematic scoping review of longitudinal approaches and evidence. BMJ Open 2021; 11:e048485. [PMID: 34810182 PMCID: PMC8609933 DOI: 10.1136/bmjopen-2020-048485] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 10/20/2021] [Indexed: 11/05/2022] Open
Abstract
OBJECTIVES Multimorbidity-the co-occurrence of at least two chronic diseases in an individual-is an important public health challenge in ageing societies. The vast majority of multimorbidity research takes a cross-sectional approach, but longitudinal approaches to understanding multimorbidity are an emerging research area, being encouraged by multiple funders. To support development in this research area, the aim of this study is to scope the methodological approaches and substantive findings of studies that have investigated longitudinal multimorbidity trajectories. DESIGN We conducted a systematic search for relevant studies in four online databases (Medline, Scopus, Web of Science and Embase) in May 2020 using predefined search terms and inclusion and exclusion criteria. The search was complemented by searching reference lists of relevant papers. From the selected studies, we systematically extracted data on study methodology and findings and summarised them in a narrative synthesis. RESULTS We identified 35 studies investigating multimorbidity longitudinally, all published in the last decade, and predominantly in high-income countries from the Global North. Longitudinal approaches employed included constructing change variables, multilevel regression analysis (eg, growth curve modelling), longitudinal group-based methodologies (eg, latent class modelling), analysing disease transitions and visualisation techniques. Commonly identified risk factors for multimorbidity onset and progression were older age, higher socioeconomic and area-level deprivation, overweight and poorer health behaviours. CONCLUSION The nascent research area employs a diverse range of longitudinal approaches that characterise accumulation and disease combinations and to a lesser extent disease sequencing and progression. Gaps include understanding the long-term, life course determinants of different multimorbidity trajectories, and doing so across diverse populations, including those from low-income and middle-income countries. This can provide a detailed picture of morbidity development, with important implications from a clinical and intervention perspective.
Collapse
Affiliation(s)
- Genevieve Cezard
- School of Geography and Sustainable Development, University of St Andrews, St Andrews, UK
| | | | - Frank Sullivan
- School of Medicine, University of St Andrews, St Andrews, UK
| | | | - Katherine Keenan
- School of Geography and Sustainable Development, University of St Andrews, St Andrews, UK
| |
Collapse
|
8
|
Rezaeiahari M, Brown CC, Ali MM, Datta J, Tilford JM. Understanding racial disparities in severe maternal morbidity using Bayesian network analysis. PLoS One 2021; 16:e0259258. [PMID: 34705872 PMCID: PMC8550416 DOI: 10.1371/journal.pone.0259258] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 10/15/2021] [Indexed: 01/22/2023] Open
Abstract
Previous studies have evaluated the marginal effect of various factors on the risk of severe maternal morbidity (SMM) using regression approaches. We add to this literature by utilizing a Bayesian network (BN) approach to understand the joint effects of clinical, demographic, and area-level factors. We conducted a retrospective observational study using linked birth certificate and insurance claims data from the Arkansas All-Payer Claims Database (APCD), for the years 2013 through 2017. We used various learning algorithms and measures of arc strength to choose the most robust network structure. We then performed various conditional probabilistic queries using Monte Carlo simulation to understand disparities in SMM. We found that anemia and hypertensive disorder of pregnancy may be important clinical comorbidities to target in order to reduce SMM overall as well as racial disparities in SMM.
Collapse
Affiliation(s)
- Mandana Rezaeiahari
- Department of Health Policy and Management, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
| | - Clare C. Brown
- Department of Health Policy and Management, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
| | - Mir M. Ali
- University of Arkansas for Medical Sciences, Institute for Digital Health & Innovation, Little Rock, Arkansas, United States of America
| | - Jyotishka Datta
- Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
| | - J. Mick Tilford
- Department of Health Policy and Management, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
| |
Collapse
|
9
|
Faruqui SHA, Alaeddini A, Wang J, Jaramillo CA, Pugh MJ. A Functional Model for Structure Learning and Parameter Estimation in Continuous Time Bayesian Network: An Application in Identifying Patterns of Multiple Chronic Conditions. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2021; 9:148076-148089. [PMID: 35371895 PMCID: PMC8975131 DOI: 10.1109/access.2021.3122912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Bayesian networks are powerful statistical models to study the probabilistic relationships among sets of random variables with significant applications in disease modeling and prediction. Here, we propose a continuous time Bayesian network with conditional dependencies represented as regularized Poisson regressions to model the impact of exogenous variables on the conditional intensities of the network. We also propose an adaptive group regularization method with an intuitive early stopping feature based on Gaussian mixture model clustering for efficient learning of the structure and parameters of the proposed network. Using a dataset of patients with multiple chronic conditions extracted from electronic health records of the Department of Veterans Affairs, we compare the performance of the proposed network with some of the existing methods in the literature for both short-term (one-year ahead) and long-term (multi-year ahead) predictions. The proposed model provides a sparse intuitive representation of the complex functional relationships between multiple chronic conditions. It also provides the capability of analyzing multiple disease trajectories over time, given any combination of preexisting conditions.
Collapse
Affiliation(s)
- Syed Hasib Akhter Faruqui
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Adel Alaeddini
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Jing Wang
- College of Nursing, Florida State University, Tallahassee, FL 32306, USA
| | | | - Mary Jo Pugh
- VA Salt Lake City Health Care System, Salt Lake City, UT 84148, USA
| |
Collapse
|
10
|
Faruqui SHA, Alaeddini A, Chang MC, Shirinkam S, Jaramillo C, NajafiRad P, Wang J, Pugh MJ. Summarizing Complex Graphical Models of Multiple Chronic Conditions Using the Second Eigenvalue of Graph Laplacian: Algorithm Development and Validation. JMIR Med Inform 2020; 8:e16372. [PMID: 32554376 PMCID: PMC7330739 DOI: 10.2196/16372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 01/06/2020] [Accepted: 03/22/2020] [Indexed: 01/16/2023] Open
Abstract
Background It is important but challenging to understand the interactions of multiple chronic conditions (MCC) and how they develop over time in patients and populations. Clinical data on MCC can now be represented using graphical models to study their interaction and identify the path toward the development of MCC. However, the current graphical models representing MCC are often complex and difficult to analyze. Therefore, it is necessary to develop improved methods for generating these models. Objective This study aimed to summarize the complex graphical models of MCC interactions to improve comprehension and aid analysis. Methods We examined the emergence of 5 chronic medical conditions (ie, traumatic brain injury [TBI], posttraumatic stress disorder [PTSD], depression [Depr], substance abuse [SuAb], and back pain [BaPa]) over 5 years among 257,633 veteran patients. We developed 3 algorithms that utilize the second eigenvalue of the graph Laplacian to summarize the complex graphical models of MCC by removing less significant edges. The first algorithm learns a sparse probabilistic graphical model of MCC interactions directly from the data. The second algorithm summarizes an existing probabilistic graphical model of MCC interactions when a supporting data set is available. The third algorithm, which is a variation of the second algorithm, summarizes the existing graphical model of MCC interactions with no supporting data. Finally, we examined the coappearance of the 100 most common terms in the literature of MCC to validate the performance of the proposed model. Results The proposed summarization algorithms demonstrate considerable performance in extracting major connections among MCC without reducing the predictive accuracy of the resulting graphical models. For the model learned directly from the data, the area under the curve (AUC) performance for predicting TBI, PTSD, BaPa, SuAb, and Depr, respectively, during the next 4 years is as follows—year 2: 79.91%, 84.04%, 78.83%, 82.50%, and 81.47%; year 3: 76.23%, 80.61%, 73.51%, 79.84%, and 77.13%; year 4: 72.38%, 78.22%, 72.96%, 77.92%, and 72.65%; and year 5: 69.51%, 76.15%, 73.04%, 76.72%, and 69.99%, respectively. This demonstrates an overall 12.07% increase in the cumulative sum of AUC in comparison with the classic multilevel temporal Bayesian network. Conclusions Using graph summarization can improve the interpretability and the predictive power of the complex graphical models of MCC.
Collapse
Affiliation(s)
- Syed Hasib Akhter Faruqui
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Adel Alaeddini
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Mike C Chang
- Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Sara Shirinkam
- Department of Mathematics and Statistics, University of the Incarnate Word, San Antonio, TX, United States
| | - Carlos Jaramillo
- South Texas Veterans Health Care System, San Antonio, TX, United States
| | - Peyman NajafiRad
- Department of Information Systems and Cyber Security, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Jing Wang
- School of Nursing, UT Health San Antonio, San Antonio, TX, United States
| | - Mary Jo Pugh
- VA Salt Lake City Health Care System, Salt Lake City, UT, United States
| |
Collapse
|
11
|
Hossain ME, Uddin S, Khan A, Moni MA. A Framework to Understand the Progression of Cardiovascular Disease for Type 2 Diabetes Mellitus Patients Using a Network Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E596. [PMID: 31963383 PMCID: PMC7013570 DOI: 10.3390/ijerph17020596] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 01/14/2020] [Indexed: 12/13/2022]
Abstract
The prevalence of chronic disease comorbidity has increased worldwide. Comorbidity-i.e., the presence of multiple chronic diseases-is associated with adverse health outcomes in terms of mobility and quality of life as well as financial burden. Understanding the progression of comorbidities can provide valuable insights towards the prevention and better management of chronic diseases. Administrative data can be used in this regard as they contain semantic information on patients' health conditions. Most studies in this field are focused on understanding the progression of one chronic disease rather than multiple diseases. This study aims to understand the progression of two chronic diseases in the Australian health context. It specifically focuses on the comorbidity progression of cardiovascular disease (CVD) in patients with type 2 diabetes mellitus (T2DM), as the prevalence of these chronic diseases in Australians is high. A research framework is proposed to understand and represent the progression of CVD in patients with T2DM using graph theory and social network analysis techniques. Two study cohorts (i.e., patients with both T2DM and CVD and patients with only T2DM) were selected from an administrative dataset obtained from an Australian health insurance company. Two baseline disease networks were constructed from these two selected cohorts. A final disease network from two baseline disease networks was then generated by weight adjustments in a normalized way. The prevalence of renal failure, fluid and electrolyte disorders, hypertension and obesity was significantly higher in patients with both CVD and T2DM than patients with only T2DM. This showed that these chronic diseases occurred frequently during the progression of CVD in patients with T2DM. The proposed network-based model may potentially help the healthcare provider to understand high-risk diseases and the progression patterns between the recurrence of T2DM and CVD. Also, the framework could be useful for stakeholders including governments and private health insurers to adopt appropriate preventive health management programs for patients at a high risk of developing multiple chronic diseases.
Collapse
Affiliation(s)
- Md Ekramul Hossain
- Complex Systems Research Group, Faculty of Engineering, The University of Sydney, Darlington, NSW 2008, Australia;
| | - Shahadat Uddin
- Complex Systems Research Group, Faculty of Engineering, The University of Sydney, Darlington, NSW 2008, Australia;
| | - Arif Khan
- Complex Systems Research Group, Faculty of Engineering, The University of Sydney, Darlington, NSW 2008, Australia;
| | - Mohammad Ali Moni
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia;
| |
Collapse
|