1
|
Elnakib S, Vecino-Ortiz AI, Gibson DG, Agarwal S, Trujillo AJ, Zhu Y, Labrique A. A novel score for mobile health applications to predict and prevent mortality: Further validation and adaptation to US population using the US NHANES dataset. J Med Internet Res 2022; 24:e36787. [PMID: 35483022 PMCID: PMC9240932 DOI: 10.2196/36787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 04/14/2022] [Accepted: 04/28/2022] [Indexed: 11/28/2022] Open
Abstract
Background The C-Score, which is an individual health score, is based on a predictive model validated in the UK and US populations. It was designed to serve as an individualized point-in-time health assessment tool that could be integrated into clinical counseling or consumer-facing digital health tools to encourage lifestyle modifications that reduce the risk of premature death. Objective Our study aimed to conduct an external validation of the C-Score in the US population and expand the original score to improve its predictive capabilities in the US population. The C-Score is intended for mobile health apps on wearable devices. Methods We conducted a literature review to identify relevant variables that were missing in the original C-Score. Subsequently, we used data from the 2005 to 2014 US National Health and Nutrition Examination Survey (NHANES; N=21,015) to test the capacity of the model to predict all-cause mortality. We used NHANES III data from 1988 to 1994 (N=1440) to conduct an external validation of the test. Only participants with complete data were included in this study. Discrimination and calibration tests were conducted to assess the operational characteristics of the adapted C-Score from receiver operating curves and a design-based goodness-of-fit test. Results Higher C-Scores were associated with reduced odds of all-cause mortality (odds ratio 0.96, P<.001). We found a good fit of the C-Score for all-cause mortality with an area under the curve (AUC) of 0.72. Among participants aged between 40 and 69 years, C-Score models had a good fit for all-cause mortality and an AUC >0.72. A sensitivity analysis using NHANES III data (1988-1994) was performed, yielding similar results. The inclusion of sociodemographic and clinical variables in the basic C-Score increased the AUCs from 0.72 (95% CI 0.71-0.73) to 0.87 (95% CI 0.85-0.88). Conclusions Our study shows that this digital biomarker, the C-Score, has good capabilities to predict all-cause mortality in the general US population. An expanded health score can predict 87% of the mortality in the US population. This model can be used as an instrument to assess individual mortality risk and as a counseling tool to motivate behavior changes and lifestyle modifications.
Collapse
Affiliation(s)
- Shatha Elnakib
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Andres I Vecino-Ortiz
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Dustin G Gibson
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Smisha Agarwal
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Antonio J Trujillo
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Yifan Zhu
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| | - Alain Labrique
- Department of International Health., Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street.E8620, Baltimore, US
| |
Collapse
|
2
|
Li J, Xu Z, Xu T, Lin S. Predicting Diabetes in Patients with Metabolic Syndrome Using Machine-Learning Model Based on Multiple Years' Data. Diabetes Metab Syndr Obes 2022; 15:2951-2961. [PMID: 36186938 PMCID: PMC9525025 DOI: 10.2147/dmso.s381146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 09/16/2022] [Indexed: 11/23/2022] Open
Abstract
PURPOSE To evaluate the performance of machine-learning models based on multiple years of continuous data to predict incident diabetes among patients with metabolic syndrome. PATIENTS AND METHODS The dataset comprises the health records from 2008 to 2020 including 4510 nondiabetic participants with metabolic syndrome (MetS) at baseline and with at least 6 years of records. MetS was defined according to the International Diabetes Federation (IDF) criteria. Overall, 332 patients developed incident diabetes during the 7±1.4 years of follow-up. Three popular classification algorithms were evaluated on the dataset: logistic regression, random forest, and Xgboost. Five models including single-year models (year 1, year 2, and year 3) and multiple-year models (year 1-2 and year 1-3) were developed for each algorithm. RESULTS The model performances improved with the increasing longitudinal dataset as the area under the receiver operating characteristic curve (AUROC) was boosted for both random forest (year 1-3: AUROC=0.893; year 3: AUROC=0.862; year 1-2: AUROC=0.847; year 2: AUROC=0.838) and Xgboost (year 1-3: AUROC=0.897; year 3: AUROC=0.833; year 1-2: AUROC=0.856; year 2: AUROC=0.823) model. In the multiple-year models, the highest fasting plasma glucose, followed by the mean or lowest level of HbA1c and BMI had the most important predictive value for the onset of diabetes. In the "1-3" year model, "delta weight" which reflects the fluctuations of yearly change of weight was the fourth-most important feature. CONCLUSION This study demonstrated improved performance with the accumulation of longitudinal data when using machine learning for diabetes prediction in MetS patients. For individuals with similar clinical parameters, the variation trends of these parameters could change the risk of future diabetes. This result indicated that models based on longitudinal multiple years' data may provide more personalized assessment tools for risk evaluation.
Collapse
Affiliation(s)
- Jing Li
- Department of Health Management, Peking Union Medical College Hospital, Beijing, People’s Republic of China
| | - Zheng Xu
- Department of AI Research, Digital Health China Technologies Co. Ltd, Beijing, People’s Republic of China
| | - Tengda Xu
- Department of Health Management, Peking Union Medical College Hospital, Beijing, People’s Republic of China
| | - Songbai Lin
- Department of Health Management, Peking Union Medical College Hospital, Beijing, People’s Republic of China
- Correspondence: Songbai Lin, Department of Health Management, Peking Union Medical College Hospital, 1# Shuaifuyuan, Dongcheng District, Beijing, 100730, People’s Republic of China, Tel +86 10 6915 9901, Fax +86 10 6915 9901, Email
| |
Collapse
|
3
|
Lenoir KM, Wagenknecht LE, Divers J, Casanova R, Dabelea D, Saydah S, Pihoker C, Liese AD, Standiford D, Hamman R, Wells BJ. Determining diagnosis date of diabetes using structured electronic health record (EHR) data: the SEARCH for diabetes in youth study. BMC Med Res Methodol 2021; 21:210. [PMID: 34629073 PMCID: PMC8502379 DOI: 10.1186/s12874-021-01394-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 09/07/2021] [Indexed: 12/01/2022] Open
Abstract
BACKGROUND Disease surveillance of diabetes among youth has relied mainly upon manual chart review. However, increasingly available structured electronic health record (EHR) data have been shown to yield accurate determinations of diabetes status and type. Validated algorithms to determine date of diabetes diagnosis are lacking. The objective of this work is to validate two EHR-based algorithms to determine date of diagnosis of diabetes. METHODS A rule-based ICD-10 algorithm identified youth with diabetes from structured EHR data over the period of 2009 through 2017 within three children's hospitals that participate in the SEARCH for Diabetes in Youth Study: Cincinnati Children's Hospital, Cincinnati, OH, Seattle Children's Hospital, Seattle, WA, and Children's Hospital Colorado, Denver, CO. Previous research and a multidisciplinary team informed the creation of two algorithms based upon structured EHR data to determine date of diagnosis among diabetes cases. An ICD-code algorithm was defined by the year of occurrence of a second ICD-9 or ICD-10 diabetes code. A multiple-criteria algorithm consisted of the year of first occurrence of any of the following: diabetes-related ICD code, elevated glucose, elevated HbA1c, or diabetes medication. We assessed algorithm performance by percent agreement with a gold standard date of diagnosis determined by chart review. RESULTS Among 3777 cases, both algorithms demonstrated high agreement with true diagnosis year and differed in classification (p = 0.006): 86.5% agreement for the ICD code algorithm and 85.9% agreement for the multiple-criteria algorithm. Agreement was high for both type 1 and type 2 cases for the ICD code algorithm. Performance improved over time. CONCLUSIONS Year of occurrence of the second ICD diabetes-related code in the EHR yields an accurate diagnosis date within these pediatric hospital systems. This may lead to increased efficiency and sustainability of surveillance methods for incidence of diabetes among youth.
Collapse
Affiliation(s)
- Kristin M Lenoir
- Department of Biostatistics and Data Science, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA.
- Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA.
| | - Lynne E Wagenknecht
- Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jasmin Divers
- Division of Health Services Research, NYU Winthrop Research Institute, NYU Long Island School of Medicine, Mineola, NY, USA
| | - Ramon Casanova
- Department of Biostatistics and Data Science, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Dana Dabelea
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Denver, Aurora, CO, USA
| | - Sharon Saydah
- Division of Diabetes Translation, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Catherine Pihoker
- Department of Pediatrics, University of Washington, Seattle, WA, USA
| | - Angela D Liese
- Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina, Columbia, SC, USA
| | - Debra Standiford
- Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Richard Hamman
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Denver, Aurora, CO, USA
| | - Brian J Wells
- Department of Biostatistics and Data Science, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
| |
Collapse
|
4
|
Garmany A, Yamada S, Terzic A. Longevity leap: mind the healthspan gap. NPJ Regen Med 2021; 6:57. [PMID: 34556664 PMCID: PMC8460831 DOI: 10.1038/s41536-021-00169-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 09/02/2021] [Indexed: 02/08/2023] Open
Abstract
Life expectancy has increased by three decades since the mid-twentieth century. Parallel healthspan expansion has however not followed, largely impeded by the pandemic of chronic diseases afflicting a growing older population. The lag in quality of life is a recognized challenge that calls for prioritization of disease-free longevity. Contemporary communal, clinical and research trends aspiring to extend the health horizon are here outlined in the context of an evolving epidemiology. A shared action integrating public and societal endeavors with emerging interventions that target age-related multimorbidity and frailty is needed. A multidimensional buildout of a curative perspective, boosted by modern anti-senescent and regenerative technology with augmented decision making, would require dedicated resources and cost-effective validation to responsibly bridge the healthspan-lifespan gap for a future of equitable global wellbeing.
Collapse
Affiliation(s)
- Armin Garmany
- Center for Regenerative Medicine, Marriott Family Comprehensive Cardiac Regenerative Medicine, Marriott Heart Disease Research Program, Van Cleve Cardiac Regenerative Medicine Program, Mayo Clinic, Rochester, MN, USA
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
- Mayo Clinic Alix School of Medicine, Regenerative Sciences Track, Mayo Clinic Graduate School of Biomedical Sciences, Mayo Clinic, Rochester, MN, USA
| | - Satsuki Yamada
- Center for Regenerative Medicine, Marriott Family Comprehensive Cardiac Regenerative Medicine, Marriott Heart Disease Research Program, Van Cleve Cardiac Regenerative Medicine Program, Mayo Clinic, Rochester, MN, USA
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
- Division of Geriatric Medicine and Gerontology, Department of Medicine, Mayo Clinic, Rochester, MN, USA
| | - Andre Terzic
- Center for Regenerative Medicine, Marriott Family Comprehensive Cardiac Regenerative Medicine, Marriott Heart Disease Research Program, Van Cleve Cardiac Regenerative Medicine Program, Mayo Clinic, Rochester, MN, USA.
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA.
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA.
- Department of Clinical Genomics, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
5
|
R A, R N. Diabetes Mellitus Prediction and Severity Level Estimation Using OWDANN Algorithm. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:5573179. [PMID: 34462631 PMCID: PMC8403056 DOI: 10.1155/2021/5573179] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 08/10/2021] [Indexed: 11/19/2022]
Abstract
Today, diabetes is one of the most prevalent, chronic, and deadly diseases in the world owing to some complications. If accurate early diagnosis is feasible, the risk factor and incidence of diabetes may be greatly decreased. Diabetes prediction is stable and reliable, since there are only minimal labelling evidence and outliers found in the datasets of diabetes. Numerous works coped with diabetes disease prediction and provided the solution. But the existing methods proffered low accuracy detection and consumed more training time. So, this paper proposed an OWDANN algorithm for diabetes mellitus disease prediction and severity level estimation. The proposed system mainly consists of two phases, namely, disease prediction and severity level estimation phase. In the disease prediction phase, the preprocessing is performed for the Pima dataset. Then, the features are extracted from the preprocessed data, and finally, the classification step is performed by using OWDANN. In the severity level estimation phase, the diabetes positive dataset is preprocessed first. Then, the features are extracted, and lastly, the severity level is predicted using GDHC. The extensive experimental results showed that the proposed system outperforms with 98.97% accuracy, 94.98% sensitivity, 95.62% specificity, 97.02% precision, 93.84% recall, 9404% f-measure, 0.094% FDR, and 0.023% FPR compared with the state-of-the-art methods.
Collapse
Affiliation(s)
- Annamalai R
- Department of Information Technology, Jeppiaar Institute of Technology, Kanchipuram 631604, India
| | - Nedunchelian R
- Department of Computer Science and Engineering, Excel Engineering College, Namakkal 637303, India
| |
Collapse
|
6
|
Oh W, Steinbach MS, Castro MR, Peterson KA, Kumar V, Caraballo PJ, Simon GJ. A Computational Method for Learning Disease Trajectories From Partially Observable EHR Data. IEEE J Biomed Health Inform 2021; 25:2476-2486. [PMID: 34129510 PMCID: PMC8388183 DOI: 10.1109/jbhi.2021.3089441] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Diseases can show different courses of progression even when patients share the same risk factors. Recent studies have revealed that the use of trajectories, the order in which diseases manifest throughout life, can be predictive of the course of progression. In this study, we propose a novel computational method for learning disease trajectories from EHR data. The proposed method consists of three parts: first, we propose an algorithm for extracting trajectories from EHR data; second, three criteria for filtering trajectories; and third, a likelihood function for assessing the risk of developing a set of outcomes given a trajectory set. We applied our methods to extract a set of disease trajectories from Mayo Clinic EHR data and evaluated it internally based on log-likelihood, which can be interpreted as the trajectories' ability to explain the observed (partial) disease progressions. We then externally evaluated the trajectories on EHR data from an independent health system, M Health Fairview. The proposed algorithm extracted a comprehensive set of disease trajectories that can explain the observed outcomes substantially better than competing methods and the proposed filtering criteria selected a small subset of disease trajectories that are highly interpretable and suffered only a minimal (relative 5%) loss of the ability to explain disease progression in both the internal and external validation.
Collapse
|
7
|
Ramalho A, Castro P, Lobo M, Souza J, Santos P, Freitas A. Integrated quality assessment for diabetes care in Portuguese primary health care using prevention quality indicators. Prim Care Diabetes 2021; 15:507-512. [PMID: 33441264 DOI: 10.1016/j.pcd.2021.01.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 01/02/2021] [Accepted: 01/04/2021] [Indexed: 01/25/2023]
Abstract
AIMS This study evaluates the prevention quality indicators (PQI) for Diabetes Mellitus (DM) in Portugal using contemporary data and explores their variability according to Primary Health Care (PHC) quality indicators. METHODS We conducted a retrospective observational analysis of secondary data comprising Portuguese PHC indicators by health centres group (ACES) and the National Hospital Morbidity Database. We calculated and analysed age-sex-adjusted rates for each PQI. Worse-performing ACES were identified using the 2017 median PQI values as an assessment cut-off. A multivariate logistic analysis was carried to find variables associated with the likelihood of being a worse-performing ACES for the biennium. RESULTS The median values of the indicator PQI93 - Prevention Quality Diabetes Composite were 79 and 65.2 hospitalizations per 100 000 pop, in 2016 and 2017 respectively. Diabetes long term complications (PQI 03) accounted for most of the hospitalizations. The quality indicator in PHC with greater influence on PQI93 was the proportion of DM patients with <65 years with test results for HbA1c < = 6.5%. CONCLUSIONS This study shows that some PHC quality indicators are closely related to DM care, and so their monitoring is of high importance. Diabetes long term complications (PQI 03) demand greater attention from PHC professionals.
Collapse
Affiliation(s)
- A Ramalho
- MEDCIDS - Department of Community Medicine, Information and Health Decision Sciences, Faculty of Medicine, University of Porto, Porto, Portugal; CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal; ACHE - American College of Healthcare Executives, Chicago, IL, USA.
| | - P Castro
- CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal; USF Camélias, ACeS Gaia - Grande Porto VII (ARS Norte) - Vila Nova de Gaia, Portugal
| | - M Lobo
- MEDCIDS - Department of Community Medicine, Information and Health Decision Sciences, Faculty of Medicine, University of Porto, Porto, Portugal; CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal
| | - J Souza
- CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal
| | - P Santos
- MEDCIDS - Department of Community Medicine, Information and Health Decision Sciences, Faculty of Medicine, University of Porto, Porto, Portugal; CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal
| | - A Freitas
- MEDCIDS - Department of Community Medicine, Information and Health Decision Sciences, Faculty of Medicine, University of Porto, Porto, Portugal; CINTESIS - Centre for Health Technology and Services Research, Porto, Portugal
| |
Collapse
|