1
|
Tanaka M, Akiyama Y, Mori K, Hosaka I, Endo K, Ogawa T, Sato T, Suzuki T, Yano T, Ohnishi H, Hanawa N, Furuhashi M. Machine learning-based analyses of contributing factors for the development of hypertension: a comparative study. Clin Exp Hypertens 2025; 47:2449613. [PMID: 39773295 DOI: 10.1080/10641963.2025.2449613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 11/25/2024] [Accepted: 12/30/2024] [Indexed: 01/11/2025]
Abstract
OBJECTIVES Sufficient attention has not been given to machine learning (ML) models using longitudinal data for investigating important predictors of new onset of hypertension. We investigated the predictive ability of several ML models for the development of hypertension. METHODS A total of 15 965 Japanese participants (men/women: 9,466/6,499, mean age: 45 years) who received annual health examinations were randomly divided into a training group (70%, n = 11,175) and a test group (30%, n = 4,790). The predictive abilities of 58 candidates including fatty liver index (FLI), which is calculated by using body mass index, waist circumference and levels of γ-glutamyl transferase and triglycerides, were investigated by statistics analogous to the area under the curve (AUC) in receiver operating characteristic curve analyses using ML models including logistic regression, random forest, naïve Bayes, extreme gradient boosting and artificial neural network. RESULTS During a 10-year period (mean period: 6.1 years), 2,132 subjects (19.1%) in the training group and 917 subjects (19.1%) in the test group had new onset of hypertension. Among the 58 parameters, systolic blood pressure, age and FLI were identified as important candidates by random forest feature selection with 10-fold cross-validation. The AUCs of ML models were 0.765-0.825, and discriminatory capacity was significantly improved in the artificial neural network model compared to that in the logistic regression model. CONCLUSIONS The development of hypertension can be simply and accurately predicted by each ML model using systolic blood pressure, age and FLI as selected features. By building multiple ML models, more practical prediction might be possible.
Collapse
Affiliation(s)
- Marenao Tanaka
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
- Tanaka Medical Clinic, Yoichi, Japan
| | - Yukinori Akiyama
- Department of Neurosurgery, Sapporo Medical University, Sapporo, Japan
| | - Kazuma Mori
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
- Department of Immunology and Microbiology, National Defense Medical College, Tokorozawa, Japan
| | - Itaru Hosaka
- Department of Cardiovascular Surgery, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Keisuke Endo
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Toshifumi Ogawa
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
- Department of Cellular Physiology and Signal Transduction, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Tatsuya Sato
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
- Department of Cellular Physiology and Signal Transduction, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Toru Suzuki
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
- Natori Toru Internal Medicine and Diabetes Clinic, Natori, Japan
| | - Toshiyuki Yano
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Hirofumi Ohnishi
- Department of Public Health, Sapporo Medical University School of Medicine, Sapporo, Japan
| | - Nagisa Hanawa
- Department of Health Checkup and Promotion, Keijinkai Maruyama Clinic, Sapporo, Japan
| | - Masato Furuhashi
- Department of Cardiovascular, Renal and Metabolic Medicine, Sapporo Medical University School of Medicine, Sapporo, Japan
| |
Collapse
|
2
|
Rezaianzadeh A, Johari MG, Baeradeh N, Seif M, Hosseini SV. Sex differences in hypertension incidence and risk factors: a population-based cohort study in Southern Iran. BMC Public Health 2024; 24:3575. [PMID: 39716231 DOI: 10.1186/s12889-024-21082-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 12/12/2024] [Indexed: 12/25/2024] Open
Abstract
BACKGROUND Hypertension (HTN) is a major global public health concern. This study aims to identify gender differences to inform more effective prevention strategies and targeted management approaches. METHODS This prospective cohort study included 7,710 participants aged 40 to 70 years, with a mean follow-up duration of 5.2 years. HTN was defined using European hypertension management guidelines. A Cox regression model was employed to determine factors associated with HTN, adjusting for confounding variables effects. RESULTS During the mean follow-up period of 5.2 years, the incidence rate of hypertension was 21.54 per 1,000 person-years, with females exhibiting a higher incidence than males. Several significant predictors of HTN were identified. In men, key risk factors included age (60-70 years, 2.83-fold increase, 95% CI 2.05-3.92), high waist-to-height ratio (5.63-fold increase, 95% CI 2.42-13.07), smoking (2.68-fold increase, 95% CI 1.04-6.91), and opium use (1.93-fold increase, 95% CI 1.06-3.49). In women, significant predictors included age (60-70 years, 3.65-fold increase, 95% CI 2.59-5.15), contraceptive drug use (1.24-fold increase, 95% CI 1.01-1.52), high waist-to-height ratio (1.87-fold increase, 95% CI 1.19-2.92), pre-HTN (3.64-fold increase, 95% CI 3.01-4.40), and kidney stones (1.32-fold increase, 95% CI 1.06-1.65). CONCLUSION This study identified key predictors of hypertension (HTN) with notable gender differences. For men, significant risk factors included age, high waist-to-height ratio, smoking, and opium use; for women, the prominent predictors were age, contraceptive use, pre-HTN, and kidney stones. These findings highlight the need for gender-specific strategies in HTN prevention and management, focusing on modifiable risk factors by gender.
Collapse
Affiliation(s)
- Abbas Rezaianzadeh
- Department of Community Medicine School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Masoumeh Ghoddusi Johari
- Assistant Professor of Community Medicine, Breast Diseases Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Najibullah Baeradeh
- Department of Public Health, Ferdows Faculty of Medical Sciences, Birjand University of Medical Sciences, Birjand, Iran.
| | - Mozhgan Seif
- Department of Epidemiology, School of Health, Non-Communicable Diseases Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Seyed Vahid Hosseini
- Department of Community Medicine School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
3
|
Guo S, Ge JX, Liu SN, Zhou JY, Li C, Chen HJ, Chen L, Shen YQ, Zhou QL. Development of a convenient and effective hypertension risk prediction model and exploration of the relationship between Serum Ferritin and Hypertension Risk: a study based on NHANES 2017-March 2020. Front Cardiovasc Med 2023; 10:1224795. [PMID: 37736023 PMCID: PMC10510409 DOI: 10.3389/fcvm.2023.1224795] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 07/28/2023] [Indexed: 09/23/2023] Open
Abstract
Background Hypertension is a major public health problem, and its resulting other cardiovascular diseases are the leading cause of death worldwide. In this study, we constructed a convenient and high-performance hypertension risk prediction model to assist in clinical diagnosis and explore other important influencing factors. Methods We included 8,073 people from NHANES (2017-March 2020), using their 120 features to form the original dataset. After data pre-processing, we removed several redundant features through LASSO regression and correlation analysis. Thirteen commonly used machine learning methods were used to construct prediction models, and then, the methods with better performance were coupled with recursive feature elimination to determine the optimal feature subset. After data balancing through SMOTE, we integrated these better-performing learners to construct a fusion model based for predicting hypertension risk on stacking strategy. In addition, to explore the relationship between serum ferritin and the risk of hypertension, we performed a univariate analysis and divided it into four level groups (Q1 to Q4) by quartiles, with the lowest level group (Q1) as the reference, and performed multiple logistic regression analysis and trend analysis. Results The optimal feature subsets were: age, BMI, waist, SBP, DBP, Cre, UACR, serum ferritin, HbA1C, and doctors recommend reducing salt intake. Compared to other machine learning models, the constructed fusion model showed better predictive performance with precision, accuracy, recall, F1 value and AUC of 0.871, 0.873, 0.871, 0.869 and 0.966, respectively. For the analysis of the relationship between serum ferritin and hypertension, after controlling for all co-variates, OR and 95% CI from Q2 to Q4, compared to Q1, were 1.396 (1.176-1.658), 1.499 (1.254-1.791), and 1.645 (1.360-1.989), respectively, with P < 0.01 and P for trend <0.001. Conclusion The hypertension risk prediction model developed in this study is efficient in predicting hypertension with only 10 low-cost and easily accessible features, which is cost-effective in assisting clinical diagnosis. We also found a trend correlation between serum ferritin levels and the risk of hypertension.
Collapse
Affiliation(s)
- Shuang Guo
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Jiu-Xin Ge
- Department of Cardiology, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Shan-Na Liu
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Jia-Yu Zhou
- Xinjiang Second Medical College, Karamay, China
| | - Chang Li
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Han-Jie Chen
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Li Chen
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Yu-Qiang Shen
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| | - Qing-Li Zhou
- Information Center, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, China
| |
Collapse
|
4
|
Venkatachala Appa Swamy M, Periyasamy J, Thangavel M, Khan SB, Almusharraf A, Santhanam P, Ramaraj V, Elsisi M. Design and Development of IoT and Deep Ensemble Learning Based Model for Disease Monitoring and Prediction. Diagnostics (Basel) 2023; 13:diagnostics13111942. [PMID: 37296794 DOI: 10.3390/diagnostics13111942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 05/04/2023] [Accepted: 05/11/2023] [Indexed: 06/12/2023] Open
Abstract
With the rapidly increasing reliance on advances in IoT, we persist towards pushing technology to new heights. From ordering food online to gene editing-based personalized healthcare, disruptive technologies like ML and AI continue to grow beyond our wildest dreams. Early detection and treatment through AI-assisted diagnostic models have outperformed human intelligence. In many cases, these tools can act upon the structured data containing probable symptoms, offer medication schedules based on the appropriate code related to diagnosis conventions, and predict adverse drug effects, if any, in accordance with medications. Utilizing AI and IoT in healthcare has facilitated innumerable benefits like minimizing cost, reducing hospital-obtained infections, decreasing mortality and morbidity etc. DL algorithms have opened up several frontiers by contributing towards healthcare opportunities through their ability to understand and learn from different levels of demonstration and generalization, which is significant in data analysis and interpretation. In contrast to ML which relies more on structured, labeled data and domain expertise to facilitate feature extractions, DL employs human-like cognitive abilities to extract hidden relationships and patterns from uncategorized data. Through the efficient application of DL techniques on the medical dataset, precise prediction, and classification of infectious/rare diseases, avoiding surgeries that can be preventable, minimization of over-dosage of harmful contrast agents for scans and biopsies can be reduced to a greater extent in future. Our study is focused on deploying ensemble deep learning algorithms and IoT devices to design and develop a diagnostic model that can effectively analyze medical Big Data and diagnose diseases by identifying abnormalities in early stages through medical images provided as input. This AI-assisted diagnostic model based on Ensemble Deep learning aims to be a valuable tool for healthcare systems and patients through its ability to diagnose diseases in the initial stages and present valuable insights to facilitate personalized treatment by aggregating the prediction of each base model and generating a final prediction.
Collapse
Affiliation(s)
| | - Jayalakshmi Periyasamy
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Muthamilselvan Thangavel
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Surbhi B Khan
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos 13-5053, Lebanon
- Department of Data Science, School of Science, Engineering and Environment, University of Sanford, Manchester M5 4WT, UK
| | - Ahlam Almusharraf
- Department of Business Administration, College of Business and Administration, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Prasanna Santhanam
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Vijayan Ramaraj
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Mahmoud Elsisi
- Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung City 807618, Taiwan
- Department of Electrical Engineering, Faculty of Engineering (Shoubra), Benha University, 108 Shoubra St., Cairo P.O. Box 11241, Egypt
| |
Collapse
|
5
|
Kim G, Lim H, Kim Y, Kwon O, Choi JH. Intra-person multi-task learning method for chronic-disease prediction. Sci Rep 2023; 13:1069. [PMID: 36658206 PMCID: PMC9851106 DOI: 10.1038/s41598-023-28383-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 01/18/2023] [Indexed: 01/20/2023] Open
Abstract
In the medical field, various clinical information has been accumulated to help clinicians provide personalized medicine and make better diagnoses. As chronic diseases share similar characteristics, it is possible to predict multiple chronic diseases using the accumulated data of each patient. Thus, we propose an intra-person multi-task learning framework that jointly predicts the status of correlated chronic diseases and improves the model performance. Because chronic diseases occur over a long period and are affected by various factors, we considered features related to each chronic disease and the temporal relationship of the time-series data for accurate prediction. The study was carried out in three stages: (1) data preprocessing and feature selection using bidirectional recurrent imputation for time series (BRITS) and the least absolute shrinkage and selection operator (LASSO); (2) a convolutional neural network and long short-term memory (CNN-LSTM) for single-task models; and (3) a novel intra-person multi-task learning CNN-LSTM framework developed to predict multiple chronic diseases simultaneously. Our multi-task learning method between correlated chronic diseases produced a more stable and accurate system than single-task models and other baseline recurrent networks. Furthermore, the proposed model was tested using different time steps to illustrate its flexibility and generalization across multiple time steps.
Collapse
Affiliation(s)
- Gihyeon Kim
- Department of Computational Medicine, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Korea
| | - Heeryung Lim
- Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Korea
| | - Yunsoo Kim
- Department of Nutritional Science and Food Management, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Korea
| | - Oran Kwon
- Department of Nutritional Science and Food Management, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Korea
| | - Jang-Hwan Choi
- Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Korea.
| |
Collapse
|
6
|
Silva GFS, Fagundes TP, Teixeira BC, Chiavegatto Filho ADP. Machine Learning for Hypertension Prediction: a Systematic Review. Curr Hypertens Rep 2022; 24:523-533. [PMID: 35731335 DOI: 10.1007/s11906-022-01212-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2022] [Indexed: 01/31/2023]
Abstract
PURPOSE OF REVIEW To provide an overview of the literature regarding the use of machine learning algorithms to predict hypertension. A systematic review was performed to select recent articles on the subject. RECENT FINDINGS The screening of the articles was conducted using a machine learning algorithm (ASReview). A total of 21 articles published between January 2018 and May 2021 were identified and compared according to variable selection, train-test split, data balancing, outcome definition, final algorithm, and performance metrics. Overall, the articles achieved an area under the ROC curve (AUROC) between 0.766 and 1.00. The algorithms most frequently identified as having the best performance were support vector machines (SVM), extreme gradient boosting (XGBoost), and random forest. Machine learning algorithms are a promising tool to improve preventive clinical decisions and targeted public health policies for hypertension. However, technical factors such as outcome definition, availability of the final code, predictive performance, explainability, and data leakage need to be consistently and critically evaluated.
Collapse
Affiliation(s)
- Gabriel F S Silva
- Department of Epidemiology, School of Public Health, University of São Paulo, São Paulo, SP, Brazil
| | - Thales P Fagundes
- Laboratory of Big Data and Predictive Analysis in Healthcare, School of Public Health, University of São Paulo, São Paulo, SP, Brazil
| | - Bruno C Teixeira
- Laboratory of Big Data and Predictive Analysis in Healthcare, School of Public Health, University of São Paulo, São Paulo, SP, Brazil
| | | |
Collapse
|
7
|
Islam SMS, Talukder A, Awal MA, Siddiqui MMU, Ahamad MM, Ahammed B, Rawal LB, Alizadehsani R, Abawajy J, Laranjo L, Chow CK, Maddison R. Machine Learning Approaches for Predicting Hypertension and Its Associated Factors Using Population-Level Data From Three South Asian Countries. Front Cardiovasc Med 2022; 9:839379. [PMID: 35433854 PMCID: PMC9008259 DOI: 10.3389/fcvm.2022.839379] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 03/11/2022] [Indexed: 12/13/2022] Open
Abstract
BackgroundHypertension is the most common modifiable risk factor for cardiovascular diseases in South Asia. Machine learning (ML) models have been shown to outperform clinical risk predictions compared to statistical methods, but studies using ML to predict hypertension at the population level are lacking. This study used ML approaches in a dataset of three South Asian countries to predict hypertension and its associated factors and compared the model's performances.MethodsWe conducted a retrospective study using ML analyses to detect hypertension using population-based surveys. We created a single dataset by harmonizing individual-level data from the most recent nationally representative Demographic and Health Survey in Bangladesh, Nepal, and India. The variables included blood pressure (BP), sociodemographic and economic factors, height, weight, hemoglobin, and random blood glucose. Hypertension was defined based on JNC-7 criteria. We applied six common ML-based classifiers: decision tree (DT), random forest (RF), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), logistic regression (LR), and linear discriminant analysis (LDA) to predict hypertension and its risk factors.ResultsOf the 8,18,603 participants, 82,748 (10.11%) had hypertension. ML models showed that significant factors for hypertension were age and BMI. Ever measured BP, education, taking medicine to lower BP, and doctor's perception of high BP was also significant but comparatively lower than age and BMI. XGBoost, GBM, LR, and LDA showed the highest accuracy score of 90%, RF and DT achieved 89 and 83%, respectively, to predict hypertension. DT achieved the precision value of 91%, and the rest performed with 90%. XGBoost, GBM, LR, and LDA achieved a recall value of 100%, RF scored 99%, and DT scored 90%. In F1-score, XGBoost, GBM, LR, and LDA scored 95%, while RF scored 94%, and DT scored 90%. All the algorithms performed with good and small log loss values <6%.ConclusionML models performed well to predict hypertension and its associated factors in South Asians. When employed on an open-source platform, these models are scalable to millions of people and might help individuals self-screen for hypertension at an early stage. Future studies incorporating biochemical markers are needed to improve the ML algorithms and evaluate them in real life.
Collapse
Affiliation(s)
- Sheikh Mohammed Shariful Islam
- Institute for Physical Activity and Nutrition, Faculty of Health, Deakin University, Melbourne, VIC, Australia
- *Correspondence: Sheikh Mohammed Shariful Islam
| | - Ashis Talukder
- Statistics Discipline, Khulna University, Khulna, Bangladesh
| | - Md. Abdul Awal
- Electronics and Communication Engineering Discipline, Khulna University, Khulna, Bangladesh
| | | | - Md. Martuza Ahamad
- Department of Computer Science and Engineering, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj, Bangladesh
| | - Benojir Ahammed
- Statistics Discipline, Khulna University, Khulna, Bangladesh
| | - Lal B. Rawal
- School of Health Medical and Applied Sciences, Central Queensland University, Sydney, NSW, Australia
| | - Roohallah Alizadehsani
- Institute for Intelligent Systems Research and Innovation, Deakin University, Geelong, VIC, Australia
| | - Jemal Abawajy
- School of Information Technology, Deakin University, Geelong, VIC, Australia
| | - Liliana Laranjo
- Faculty of Medicine and Health, Westmead Applied Research Centre, The University of Sydney, Sydney, NSW, Australia
| | - Clara K. Chow
- Faculty of Medicine and Health, Westmead Applied Research Centre, The University of Sydney, Sydney, NSW, Australia
| | - Ralph Maddison
- Institute for Physical Activity and Nutrition, Faculty of Health, Deakin University, Melbourne, VIC, Australia
| |
Collapse
|
8
|
Determinants of Longitudinal Change of Lung Function in Different Gender in a Large Taiwanese Population Follow-Up Study Categories: Original Investigation. J Pers Med 2021; 11:jpm11101033. [PMID: 34683172 PMCID: PMC8537043 DOI: 10.3390/jpm11101033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 10/09/2021] [Accepted: 10/12/2021] [Indexed: 01/21/2023] Open
Abstract
Chronic lung disease is associated with tremendous social and economic burden worldwide. The aim of this study was to investigate the sex-specific risk factors for changes in lung function in a large longitudinal study. We included 9059 participants from the Taiwan Biobank. None of the participants had a history of smoking, asthma, emphysema or bronchitis. Lung function was assessed using spirometry measurements of forced vital capacity (FVC) and forced expiratory volume in 1 s (FEV1). Change in the FEV1/FVC (ΔFEV1/FVC) was calculated as a follow-up FEV1/FVC minus baseline FEV1/FVC. Linear regression analysis was used to identify associations between variables and ΔFEV1/FVC in the male and female participants. After multivariable adjustments, the male participants (vs. females; p = 0.021) were significantly associated with a low ΔFEV1/FVC. In addition, the male participants with low aspartate aminotransferase (AST) (p = 0.003), high alanine aminotransferase (ALT) (p = 0.006) and a low estimated glomerular filtration rate (eGFR) (p = 0.003) were significantly associated with a low ΔFEV1/FVC. For the female participants, low systolic blood pressure (p = 0.005), low diastolic blood pressure (p = 0.031), low AST (p < 0.001), high ALT (p < 0.001) and a low eGFR (p = 0.001) were significantly associated with a low ΔFEV1/FVC. In this large follow-up study, we found that the male participants had a faster decrease in the FEV1/FVC than the female participants. In addition, liver and renal functions were correlated with changes in lung function in both the male and female participants. Our findings provide useful information on sex-specific changes in lung function.
Collapse
|
9
|
Lin C, Li C, Liu C, Lin C, Wang M, Yang S, Li T. A risk scoring system to predict the risk of new-onset hypertension among patients with type 2 diabetes. J Clin Hypertens (Greenwich) 2021; 23:1570-1580. [PMID: 34251744 PMCID: PMC8678759 DOI: 10.1111/jch.14322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/28/2021] [Accepted: 06/30/2021] [Indexed: 12/01/2022]
Abstract
Hypertension (HTN), which frequently co-exists with diabetes mellitus, is the leading major cause of cardiovascular disease and death globally. This study aimed to develop and validate a risk scoring system considering the effects of glycemic and blood pressure (BP) variabilities to predict HTN incidence in patients with type 2 diabetes. This research is a retrospective cohort study that included 3416 patients with type 2 diabetes without HTN and who were enrolled in a managed care program in 2001-2015. The patients were followed up until April 2016, new-onset HTN event, or death. HTN was defined as diastolic BP (DBP) ≥ 90 mm Hg, systolic BP (SBP) ≥ 140 mm Hg, or the initiation of antihypertensive medication. Cox proportional hazard regression model was used to develop the risk scoring system for HTN. Of the patients, 1738 experienced new-onset HTN during an average follow-up period of 3.40 years. Age, sex, physical activity, body mass index, type of DM treatment, family history of HTN, baseline SBP and DBP, variabilities of fasting plasma glucose, SBP, and DBP and macroalbuminuria were significant variables for the prediction of new-onset HTN. Using these predictors, the prediction models for 1-, 3-, and 5-year periods demonstrated good discrimination, with AUC values of 0.70-0.76. Our HTN scoring system for patients with type 2 DM, which involves innovative predictors of glycemic and BP variabilities, has good classification accuracy and identifies risk factors available in clinical settings for prevention of the progression to new-onset HTN.
Collapse
Affiliation(s)
- Cheng‐Chieh Lin
- School of MedicineCollege of MedicineChina Medical UniversityTaichungTaiwan
- Department of Family MedicineChina Medical University HospitalTaichungTaiwan
- Department of Medical ResearchChina Medical University HospitalTaichungTaiwan
| | - Chia‐Ing Li
- School of MedicineCollege of MedicineChina Medical UniversityTaichungTaiwan
- Department of Medical ResearchChina Medical University HospitalTaichungTaiwan
| | - Chiu‐Shong Liu
- School of MedicineCollege of MedicineChina Medical UniversityTaichungTaiwan
- Department of Family MedicineChina Medical University HospitalTaichungTaiwan
| | - Chih‐Hsueh Lin
- School of MedicineCollege of MedicineChina Medical UniversityTaichungTaiwan
- Department of Family MedicineChina Medical University HospitalTaichungTaiwan
| | - Mu‐Cyun Wang
- School of MedicineCollege of MedicineChina Medical UniversityTaichungTaiwan
- Department of Family MedicineChina Medical University HospitalTaichungTaiwan
| | - Shing‐Yu Yang
- Department of Public HealthCollege of Public HealthChina Medical UniversityTaichungTaiwan
| | - Tsai‐Chung Li
- Department of Public HealthCollege of Public HealthChina Medical UniversityTaichungTaiwan
- Department of Healthcare AdministrationCollege of Medical and Health ScienceAsia UniversityTaichungTaiwan
| |
Collapse
|
10
|
Diao X, Huo Y, Yan Z, Wang H, Yuan J, Wang Y, Cai J, Zhao W. An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records. JMIR Med Inform 2021; 9:e19739. [PMID: 33492233 PMCID: PMC7870351 DOI: 10.2196/19739] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 09/16/2020] [Accepted: 10/28/2020] [Indexed: 12/12/2022] Open
Abstract
Background Secondary hypertension is a kind of hypertension with a definite etiology and may be cured. Patients with suspected secondary hypertension can benefit from timely detection and treatment and, conversely, will have a higher risk of morbidity and mortality than those with primary hypertension. Objective The aim of this study was to develop and validate machine learning (ML) prediction models of common etiologies in patients with suspected secondary hypertension. Methods The analyzed data set was retrospectively extracted from electronic medical records of patients discharged from Fuwai Hospital between January 1, 2016, and June 30, 2019. A total of 7532 unique patients were included and divided into 2 data sets by time: 6302 patients in 2016-2018 as the training data set for model building and 1230 patients in 2019 as the validation data set for further evaluation. Extreme Gradient Boosting (XGBoost) was adopted to develop 5 models to predict 4 etiologies of secondary hypertension and occurrence of any of them (named as composite outcome), including renovascular hypertension (RVH), primary aldosteronism (PA), thyroid dysfunction, and aortic stenosis. Both univariate logistic analysis and Gini Impurity were used for feature selection. Grid search and 10-fold cross-validation were used to select the optimal hyperparameters for each model. Results Validation of the composite outcome prediction model showed good performance with an area under the receiver-operating characteristic curve (AUC) of 0.924 in the validation data set, while the 4 prediction models of RVH, PA, thyroid dysfunction, and aortic stenosis achieved AUC of 0.938, 0.965, 0.959, and 0.946, respectively, in the validation data set. A total of 79 clinical indicators were identified in all and finally used in our prediction models. The result of subgroup analysis on the composite outcome prediction model demonstrated high discrimination with AUCs all higher than 0.890 among all age groups of adults. Conclusions The ML prediction models in this study showed good performance in detecting 4 etiologies of patients with suspected secondary hypertension; thus, they may potentially facilitate clinical diagnosis decision making of secondary hypertension in an intelligent way.
Collapse
Affiliation(s)
- Xiaolin Diao
- Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yanni Huo
- Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Zhanzheng Yan
- Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Haibin Wang
- Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jing Yuan
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yuxin Wang
- Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jun Cai
- Hypertension Center, State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Wei Zhao
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
11
|
A Comparative Analysis of Machine Learning Methods for Class Imbalance in a Smoking Cessation Intervention. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10093307] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Smoking is one of the major public health issues, which has a significant impact on premature death. In recent years, numerous decision support systems have been developed to deal with smoking cessation based on machine learning methods. However, the inevitable class imbalance is considered a major challenge in deploying such systems. In this paper, we study an empirical comparison of machine learning techniques to deal with the class imbalance problem in the prediction of smoking cessation intervention among the Korean population. For the class imbalance problem, the objective of this paper is to improve the prediction performance based on the utilization of synthetic oversampling techniques, which we called the synthetic minority over-sampling technique (SMOTE) and an adaptive synthetic (ADASYN). This has been achieved by the experimental design, which comprises three components. First, the selection of the best representative features is performed in two phases: the lasso method and multicollinearity analysis. Second, generate the newly balanced data utilizing SMOTE and ADASYN technique. Third, machine learning classifiers are applied to construct the prediction models among all subjects and each gender. In order to justify the effectiveness of the prediction models, the f-score, type I error, type II error, balanced accuracy and geometric mean indices are used. Comprehensive analysis demonstrates that Gradient Boosting Trees (GBT), Random Forest (RF) and multilayer perceptron neural network (MLP) classifiers achieved the best performances in all subjects and each gender when SMOTE and ADASYN were utilized. The SMOTE with GBT and RF models also provide feature importance scores that enhance the interpretability of the decision-support system. In addition, it is proven that the presented synthetic oversampling techniques with machine learning models outperformed baseline models in smoking cessation prediction.
Collapse
|
12
|
Amarbayasgalan T, Park KH, Lee JY, Ryu KH. Reconstruction error based deep neural networks for coronary heart disease risk prediction. PLoS One 2019; 14:e0225991. [PMID: 31805166 PMCID: PMC6894870 DOI: 10.1371/journal.pone.0225991] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 11/18/2019] [Indexed: 11/18/2022] Open
Abstract
Coronary heart disease (CHD) is one of the leading causes of death worldwide; if suffering from CHD and being in its end-stage, the most advanced treatments are required, such as heart surgery and heart transplant. Moreover, it is not easy to diagnose CHD at the earlier stage; hospitals diagnose it based on various types of medical tests. Thus, by predicting high-risk people who are to suffer from CHD, it is significant to reduce the risks of developing CHD. In recent years, some research works have been done using data mining to predict the risk of developing diseases based on medical tests. In this study, we have proposed a reconstruction error (RE) based deep neural networks (DNNs); this approach uses a deep autoencoder (AE) model for estimating RE. Initially, a training dataset is divided into two groups by their RE divergence on the deep AE model that learned from the whole training dataset. Next, two DNN classifiers are trained on each group of datasets separately by combining a RE based new feature with other risk factors to predict the risk of developing CHD. For creating the new feature, we use deep AE model that trained on the only high-risk dataset. We have performed an experiment to prove how the components of our proposed method work together more efficiently. As a result of our experiment, the performance measurements include accuracy, precision, recall, F-measure, and AUC score reached 86.3371%, 91.3716%, 82.9024%, 86.9148%, and 86.6568%, respectively. These results show that the proposed AE-DNNs outperformed regular machine learning-based classifiers for CHD risk prediction.
Collapse
Affiliation(s)
- Tsatsral Amarbayasgalan
- Database and Bioinformatics Laboratory, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju, Korea
| | - Kwang Ho Park
- Database and Bioinformatics Laboratory, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju, Korea
| | - Jong Yun Lee
- Database and Bioinformatics Laboratory, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju, Korea
| | - Keun Ho Ryu
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
- College of Electrical and Computer Engineering, Chungbuk National University, Cheongju, Korea
| |
Collapse
|