1
|
Dholariya S, Dutta S, Sonagra A, Kaliya M, Singh R, Parchwani D, Motiani A. Unveiling the utility of artificial intelligence for prediction, diagnosis, and progression of diabetic kidney disease: an evidence-based systematic review and meta-analysis. Curr Med Res Opin 2024; 40:2025-2055. [PMID: 39474800 DOI: 10.1080/03007995.2024.2423737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/14/2024]
Abstract
OBJECTIVE The purpose of this study was to conduct a systematic investigation of the potential of artificial intelligence (AI) models in the prediction, detection of diagnostic biomarkers, and progression of diabetic kidney disease (DKD). In addition, we compared the performance of non-logistic regression (LR) machine learning (ML) models to conventional LR prediction models. METHODS Until January 30, 2024, a comprehensive literature review was conducted by investigating databases such as Medline (via PubMed) and Cochrane. Research that is inclusive of AI or ML models for the prediction, diagnosis, and progression of DKD was incorporated. The area under the Receiver Operating Characteristic Curve (AUROC) served as the principal outcome metric for assessing model performance. A meta-analysis was performed utilizing MedCalc statistical software to calculate pooled AUROC and assess the performance differences between LR and non-LR models. RESULTS A total of 57 studies were included in the meta-analysis. The pooled AUROC of AI or ML model was 0.84 (95% CI = 0.81-0.86, p < 0.0001) for analyzing prediction of DKD, 0.88 (95%CI = 0.84-0.92, p < 0.0001) for detecting diagnostic biomarkers, and 0.80 (95% CI = 0.77-0.82, p < 0.0001) for analyzing progression of DKD. The pooled AUROC of LR and non-LR ML models exhibited no significant differences across all categories (p > 0.05), except for the random forest (RF) model, which displayed a statistically significant increase in predictive accuracy compared to LR for DKD occurrence (p < 0.04). CONCLUSION ML models showed solid DKD prediction effectiveness, with pooled AUROC values over 0.8, suggesting good performance. These data demonstrated that non-LR and LR models perform similarly in overall CKD management, but the RF model outperforms the LR model, particularly in predicting the occurrence of DKD. These findings highlight the promise of AI technologies for better DKD management. To improve model reliability, future study should include extended follow-up periods as well as external validation.
Collapse
Affiliation(s)
- Sagar Dholariya
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Siddhartha Dutta
- Department of Pharmacology, All India Institute of Medical Sciences, Rajkot, India
| | - Amit Sonagra
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Mehul Kaliya
- General Medicine, Department of General Medicine, All India Institute of Medical Sciences, Rajkot, India
| | - Ragini Singh
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Deepak Parchwani
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Anita Motiani
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| |
Collapse
|
2
|
Xiang X, Chen G, Ma Y, Wang H. A non-linear association between low-density lipoprotein cholesterol and the risk of diabetic kidney disease in patients with type 2 diabetes in China. Prev Med Rep 2024; 45:102840. [PMID: 39175593 PMCID: PMC11339054 DOI: 10.1016/j.pmedr.2024.102840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 07/25/2024] [Accepted: 07/26/2024] [Indexed: 08/24/2024] Open
Abstract
Objective To explore the intrinsic relationship between low-density lipoprotein cholesterol (LDL-C) and diabetic kidney disease (DKD) in patients with type 2 diabetes (T2D) in China. Methods This cross-sectional study included 1,313 patients with type 2 diabetes treated at the Affiliated Jinhua Hospital of Zhejiang University School of Medicine, located in Jinhua, China. The data were combined from two periods, 2017 and 2020-2021. Participants were categorized into groups with and without DKD. The relationship between LDL-C levels and DKD was evaluated employing logistics regression analysis and restricted cubic spline (RCS) curves. Results Generally, there was no statistical difference in LDL-C levels between DKD and non-DKD groups, however, a significantly non-linear relationship (Pnon-linear = 0.011) was observed between LDL-C levels and DKD prevalence after adjusting for confounding factors according to the RCS analysis. Two optimal cut-points of 2.97 and 3.61 mmol/L were selected out using random forest algorithm. With the middle LDL-C concentration (2.97-3.61 mmol/L) as the reference, the odds ratios for low (<2.97 mmol/L) and high (>3.61 mmol/L) concentrations were 1.45 (1.08-1.96) and 1.47 (1.01-2.15) respectively, after adjusting for confounding factors in the multivariate analyses. Notably, this association was more pronounced among female participants in the subgroup analyses. Conclusion A non-linear association was observed between LDL-C levels and the risk of DKD in patients with T2D in China. LDL-C levels below 2.97 mmol/L may elevate the risk of DKD, particularly in female patients with T2D.
Collapse
Affiliation(s)
- Xi Xiang
- Department of Clinical Laboratory, Affiliated Jinhua Hospital, Zhejiang University School of Medicine, Jinhua City, Zhejiang Province 321000, China
| | - Guangming Chen
- Department of General Practice, Affiliated Jinhua Hospital, Zhejiang University School of Medicine, Jinhua, China
| | - Yongjun Ma
- Department of Clinical Laboratory, Affiliated Jinhua Hospital, Zhejiang University School of Medicine, Jinhua City, Zhejiang Province 321000, China
| | - Huabin Wang
- Department of Clinical Laboratory, Affiliated Jinhua Hospital, Zhejiang University School of Medicine, Jinhua City, Zhejiang Province 321000, China
| |
Collapse
|
3
|
Basuli D, Kavcar A, Roy S. From bytes to nephrons: AI's journey in diabetic kidney disease. J Nephrol 2024:10.1007/s40620-024-02050-2. [PMID: 39133462 DOI: 10.1007/s40620-024-02050-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 07/21/2024] [Indexed: 08/13/2024]
Abstract
Diabetic kidney disease (DKD) is a significant complication of type 2 diabetes, posing a global health risk. Detecting and predicting diabetic kidney disease at an early stage is crucial for timely interventions and improved patient outcomes. Artificial intelligence (AI) has demonstrated promise in healthcare, and several tools have recently been developed that utilize Machine Learning with clinical data to detect and predict DKD. This review aims to explore the current landscape of AI and machine learning applications in DKD, specifically examining existing literature on risk scores and machine learning approaches for predicting DKD development. A literature search was conducted using Medline (PubMed), Google Scholar, and Scopus databases until July 2023. Relevant keywords were used to extract studies that described the role of AI in DKD. The review revealed that AI and machine learning have been successfully used to predict DKD progression, outperforming traditional risk score models. Artificial intelligence-driven research for DKD extends beyond prediction models, offering opportunities for integrating genetic and epigenetic data, advancing understanding of the disease's molecular basis, personalizing treatment strategies, and fostering the development of novel drugs. However, challenges remain, including the requirement for large datasets and the lack of standardization in AI-driven tools for DKD. Artificial intelligence and machine learning have the potential to revolutionize the management and care of DKD patients, surpassing the limitations of traditional methods reliant on existing knowledge. Future research should address the challenges associated with AI and machine learning in DKD and focus on developing AI-driven tools for clinical practice.
Collapse
Affiliation(s)
- Debargha Basuli
- Department of Nephrology & Hypertension, Brody School of Medicine, East Carolina University, 2355 W Arlington Blvd, Greenville, NC, 27834, USA.
| | - Akil Kavcar
- Department of Internal Medicine, Brody School of Medicine, East Carolina University, Greenville, NC, USA
| | - Sasmit Roy
- Department of Nephrology, University of Virginia, Lynchburg, VA, USA
| |
Collapse
|
4
|
Zhu Y, Zhang Y, Yang M, Tang N, Liu L, Wu J, Yang Y. Machine Learning-Based Predictive Modeling of Diabetic Nephropathy in Type 2 Diabetes Using Integrated Biomarkers: A Single-Center Retrospective Study. Diabetes Metab Syndr Obes 2024; 17:1987-1997. [PMID: 38746045 PMCID: PMC11093114 DOI: 10.2147/dmso.s458263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 04/16/2024] [Indexed: 05/16/2024] Open
Abstract
Purpose Diabetic nephropathy (DN), a major complication of diabetes mellitus, significantly impacts global health. Identifying individuals at risk of developing DN is crucial for early intervention and improving patient outcomes. This study aims to develop and validate a machine learning-based predictive model using integrated biomarkers. Methods A cross-sectional analysis was conducted on a baseline dataset involving 2184 participants without DN, categorized based on their development of DN over a follow-up period of 36 months: DN (n=1270) and Non-DN (n=914). Various demographic and clinical parameters were analyzed. The findings were validated using an independent dataset comprising 468 participants, with 273 developing DN and 195 remaining as Non-DN over the follow-up period. Machine learning algorithms, alongside traditional descriptive statistics and logistic regression were used for statistical analyses. Results Elevated levels of serum creatinine, urea, and reduced eGFR, alongside an increased prevalence of retinopathy and peripheral neuropathy, were prominently observed in those who developed DN. Validation on the independent dataset further confirmed the model's robustness and consistency. The SVM model demonstrated superior performance in the training set (AUC=0.79, F1-score=0.74) and testing set (AUC=0.83, F1-score=0.82), outperforming other models. Significant predictors of DN included serum creatinine, eGFR, presence of diabetic retinopathy, and peripheral neuropathy. Conclusion Integrating machine learning algorithms with clinical and biomarker data at baseline offers a promising avenue for identifying individuals at risk of developing diabetic nephropathy in type 2 diabetes patients over a 36-month period.
Collapse
Affiliation(s)
- Ying Zhu
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Yiyi Zhang
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Miao Yang
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Nie Tang
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Limei Liu
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Jichuan Wu
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| | - Yan Yang
- Department of Endocrinology, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, People’s Republic of China
| |
Collapse
|
5
|
Pasokh Z, Seif M, Ghaem H, Rezaianzadeh A, Ghoddusi Johari M. Age at natural menopause and its determinants in female population of Kharameh cohort study: Comparison of regression, conditional tree and forests. PLoS One 2024; 19:e0300448. [PMID: 38625988 PMCID: PMC11020934 DOI: 10.1371/journal.pone.0300448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 02/28/2024] [Indexed: 04/18/2024] Open
Abstract
BACKGROUND Natural menopause is defined as the permanent cessation of menstruation that occurs after 12 consecutive months of amenorrhea without any obvious pathological or physiological cause. The age of this phenomenon has been reported to be associated with several health outcomes. OBJECTIVES This study aimed to estimate the Age at Natural Menopause (ANM) and to identify reproductive and demographic factors affecting ANM. METHODS This cross-sectional, population-based study was conducted on 2517 post-menopausal women aged 40-70 years participating in the first phase of the PERSIAN cohort study of Kharameh, Iran, during 2014-2017. To more accurately detect the determinants of ANM, we applied multiple linear regression beside some machine learning algorithms including conditional tree, conditional forest, and random forest. Then, the fitness of these methods was compared using Mean Squared Error (MSE) and Pearson correlation coefficient. RESULTS The mean±SD of ANM was 48.95±6.13. Both applied forests provided more accurate results and identified more predictors. However, according to the final comparison, the conditional forest was the most accurate method which recognized that more pregnancies, longer breastfeeding, Fars ethnicity, and urbanization have the greatest impact on later ANM. CONCLUSIONS This study found a wide range of reproductive and demographic factors affecting ANM. Considering our findings in decision-making can reduce the complications related to this phenomenon and, consequently, improve the quality of life of post-menopausal women.
Collapse
Affiliation(s)
- Zahra Pasokh
- Student Research Committee, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mozhgan Seif
- Non-Communicable Diseases Research Center, Department of Epidemiology, School of Health, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Haleh Ghaem
- Non-Communicable Diseases Research Center, Department of Epidemiology, School of Health, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Abbas Rezaianzadeh
- Colorectal Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | | |
Collapse
|
6
|
Shi S, Gao L, Zhang J, Zhang B, Xiao J, Xu W, Tian Y, Ni L, Wu X. The automatic detection of diabetic kidney disease from retinal vascular parameters combined with clinical variables using artificial intelligence in type-2 diabetes patients. BMC Med Inform Decis Mak 2023; 23:241. [PMID: 37904184 PMCID: PMC10617171 DOI: 10.1186/s12911-023-02343-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 10/16/2023] [Indexed: 11/01/2023] Open
Abstract
BACKGROUND Diabetic kidney disease (DKD) has become the largest cause of end-stage kidney disease. Early and accurate detection of DKD is beneficial for patients. The present detection depends on the measurement of albuminuria or the estimated glomerular filtration rate, which is invasive and not optimal; therefore, new detection tools are urgently needed. Meanwhile, a close relationship between diabetic retinopathy and DKD has been reported; thus, we aimed to develop a novel detection algorithm for DKD using artificial intelligence technology based on retinal vascular parameters combined with several easily available clinical parameters in patients with type-2 diabetes. METHODS A total of 515 consecutive patients with type-2 diabetes mellitus from Xiangyang Central Hospital were included. Patients were stratified by DKD diagnosis and split randomly into either the training set (70%, N = 360) or the testing set (30%, N = 155) (random seed = 1). Data from the training set were used to develop the machine learning algorithm (MLA), while those from the testing set were used to validate the MLA. Model performances were evaluated. RESULTS The MLA using the random forest classifier presented optimal performance compared with other classifiers. When validated, the accuracy, sensitivity, specificity, F1 score, and AUC for the optimal model were 84.5%(95% CI 83.3-85.7), 84.5%(82.3-86.7), 84.5%(82.7-86.3), 0.845(0.831-0.859), and 0.914(0.903-0.925), respectively. CONCLUSIONS A new machine learning algorithm for DKD diagnosis based on fundus images and 8 easily available clinical parameters was developed, which indicated that retinal vascular changes can assist in DKD screening and detection.
Collapse
Affiliation(s)
- Shaomin Shi
- Department of Nephrology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, 430071, Hubei, China
- Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei, 441000, China
| | - Ling Gao
- Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei, 441000, China
| | - Juan Zhang
- Department of Nephrology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, 430071, Hubei, China
| | - Baifang Zhang
- Department of Biochemistry, Wuhan University TaiKang Medical School (School of Basic Medical Sciences), Wuhan, 430071, Hubei, China
| | - Jing Xiao
- Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei, 441000, China
| | - Wan Xu
- Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei, 441000, China
| | - Yuan Tian
- Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei, 441000, China.
| | - Lihua Ni
- Department of Nephrology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, 430071, Hubei, China.
| | - Xiaoyan Wu
- Department of Nephrology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, 430071, Hubei, China.
- Department of General Practice, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, 430071, Hubei, China.
| |
Collapse
|
7
|
Ordoñez-Guillen NE, Gonzalez-Compean JL, Lopez-Arevalo I, Contreras-Murillo M, Aldana-Bobadilla E. Machine learning based study for the classification of Type 2 diabetes mellitus subtypes. BioData Min 2023; 16:24. [PMID: 37608329 PMCID: PMC10463725 DOI: 10.1186/s13040-023-00340-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 08/07/2023] [Indexed: 08/24/2023] Open
Abstract
PURPOSE Data-driven diabetes research has increased its interest in exploring the heterogeneity of the disease, aiming to support in the development of more specific prognoses and treatments within the so-called precision medicine. Recently, one of these studies found five diabetes subgroups with varying risks of complications and treatment responses. Here, we tackle the development and assessment of different models for classifying Type 2 Diabetes (T2DM) subtypes through machine learning approaches, with the aim of providing a performance comparison and new insights on the matter. METHODS We developed a three-stage methodology starting with the preprocessing of public databases NHANES (USA) and ENSANUT (Mexico) to construct a dataset with N = 10,077 adult diabetes patient records. We used N = 2,768 records for training/validation of models and left the remaining (N = 7,309) for testing. In the second stage, groups of observations -each one representing a T2DM subtype- were identified. We tested different clustering techniques and strategies and validated them by using internal and external clustering indices; obtaining two annotated datasets Dset A and Dset B. In the third stage, we developed different classification models assaying four algorithms, seven input-data schemes, and two validation settings on each annotated dataset. We also tested the obtained models using a majority-vote approach for classifying unseen patient records in the hold-out dataset. RESULTS From the independently obtained bootstrap validation for Dset A and Dset B, mean accuracies across all seven data schemes were [Formula: see text] ([Formula: see text]) and [Formula: see text] ([Formula: see text]), respectively. Best accuracies were [Formula: see text] and [Formula: see text]. Both validation setting results were consistent. For the hold-out dataset, results were consonant with most of those obtained in the literature in terms of class proportions. CONCLUSION The development of machine learning systems for the classification of diabetes subtypes constitutes an important task to support physicians for fast and timely decision-making. We expect to deploy this methodology in a data analysis platform to conduct studies for identifying T2DM subtypes in patient records from hospitals.
Collapse
Affiliation(s)
- Nelson E Ordoñez-Guillen
- Cinvestav Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, 87130, Tamaulipas, Mexico
| | | | - Ivan Lopez-Arevalo
- Cinvestav Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, 87130, Tamaulipas, Mexico
| | - Miguel Contreras-Murillo
- Cinvestav Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, 87130, Tamaulipas, Mexico
| | - Edwin Aldana-Bobadilla
- CONAHCYT-Centro de Investigación y de Estudios Avanzados del IPN, Unidad Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, Tamaulipas, 87130, Mexico
| |
Collapse
|
8
|
Qezelbash-Chamak J, Badamchizadeh S, Eshghi K, Asadi Y. A survey of machine learning in kidney disease diagnosis. MACHINE LEARNING WITH APPLICATIONS 2022. [DOI: 10.1016/j.mlwa.2022.100418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|
9
|
Schena FP, Magistroni R, Narducci F, Abbrescia DI, Anelli VW, Di Noia T. Artificial intelligence in glomerular diseases. Pediatr Nephrol 2022; 37:2533-2545. [PMID: 35266037 DOI: 10.1007/s00467-021-05419-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 12/19/2021] [Accepted: 12/21/2021] [Indexed: 11/30/2022]
Abstract
In this narrative review, we focus on the application of artificial intelligence in the clinical history of patients with glomerular disease, digital pathology in kidney biopsy, renal ultrasonography imaging, and prediction of chronic kidney disease (CKD). With the development of natural language processing, the clinical history of a patient can be used to identify a computable phenotype. In kidney pathology, digital imaging has adopted innovative deep learning algorithms (DLAs) that can improve the predictive capability of the examined lesions. However, at this time, these applications can only be used in research because there is no recognized validation to replace the conventional diagnostic applications. Kidney ultrasonography, used in the clinical examination of patients, provides information about the progression of kidney damage. Machine learning algorithms (MLAs) with promising results for the early detection of CKD have been proposed, but, still, they are not solid enough to be incorporated into the clinical practice. A few tools for glomerulonephritis, based on MLAs, are available in clinical practice. They can be downloaded on computers and cellular phones but can only be applied to uniracial cohorts of patients. To improve their performance, it is necessary to organize large consortia with multiracial cohorts. Finally, in many studies MLA development has been carried out using retrospective cohorts. The performance of the models might differ in retrospective cohorts compared to real-world data. Therefore, the models should be validated in prospective external large cohorts.
Collapse
Affiliation(s)
- Francesco P Schena
- Department of Emergency and Organ Transplantation, University of Bari, Bari, Italy.
| | | | - Fedelucio Narducci
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy
| | | | - Vito W Anelli
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy
| | - Tommaso Di Noia
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy
| |
Collapse
|
10
|
Olusanya MO, Ogunsakin RE, Ghai M, Adeleke MA. Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph192114280. [PMID: 36361161 PMCID: PMC9655196 DOI: 10.3390/ijerph192114280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/22/2022] [Accepted: 10/25/2022] [Indexed: 05/13/2023]
Abstract
Soft-computing and statistical learning models have gained substantial momentum in predicting type 2 diabetes mellitus (T2DM) disease. This paper reviews recent soft-computing and statistical learning models in T2DM using a meta-analysis approach. We searched for papers using soft-computing and statistical learning models focused on T2DM published between 2010 and 2021 on three different search engines. Of 1215 studies identified, 34 with 136952 patients met our inclusion criteria. The pooled algorithm's performance was able to predict T2DM with an overall accuracy of 0.86 (95% confidence interval [CI] of [0.82, 0.89]). The classification of diabetes prediction was significantly greater in models with a screening and diagnosis (pooled proportion [95% CI] = 0.91 [0.74, 0.97]) when compared to models with nephropathy (pooled proportion = 0.48 [0.76, 0.89] to 0.88 [0.83, 0.91]). For the prediction of T2DM, the decision trees (DT) models had a pooled accuracy of 0.88 [95% CI: 0.82, 0.92], and the neural network (NN) models had a pooled accuracy of 0.85 [95% CI: 0.79, 0.89]. Meta-regression did not provide any statistically significant findings for the heterogeneous accuracy in studies with different diabetes predictions, sample sizes, and impact factors. Additionally, ML models showed high accuracy for the prediction of T2DM. The predictive accuracy of ML algorithms in T2DM is promising, mainly through DT and NN models. However, there is heterogeneity among ML models. We compared the results and models and concluded that this evidence might help clinicians interpret data and implement optimum models for their dataset for T2DM prediction.
Collapse
Affiliation(s)
- Micheal O. Olusanya
- Department of Computer Science and Information Technology, Sol Plaatje University, Kimberley 8300, South Africa
- Correspondence:
| | - Ropo Ebenezer Ogunsakin
- Biostatistics Unit, Discipline of Public Health Medicine, School of Nursing & Public Health, College of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Meenu Ghai
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Matthew Adekunle Adeleke
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| |
Collapse
|
11
|
Magherini R, Mussi E, Volpe Y, Furferi R, Buonamici F, Servi M. Machine Learning for Renal Pathologies: An Updated Survey. SENSORS 2022; 22:s22134989. [PMID: 35808481 PMCID: PMC9269842 DOI: 10.3390/s22134989] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 06/22/2022] [Accepted: 06/29/2022] [Indexed: 12/04/2022]
Abstract
Within the literature concerning modern machine learning techniques applied to the medical field, there is a growing interest in the application of these technologies to the nephrological area, especially regarding the study of renal pathologies, because they are very common and widespread in our society, afflicting a high percentage of the population and leading to various complications, up to death in some cases. For these reasons, the authors have considered it appropriate to collect, using one of the major bibliographic databases available, and analyze the studies carried out until February 2022 on the use of machine learning techniques in the nephrological field, grouping them according to the addressed pathologies: renal masses, acute kidney injury, chronic kidney disease, kidney stone, glomerular disease, kidney transplant, and others less widespread. Of a total of 224 studies, 59 were analyzed according to inclusion and exclusion criteria in this review, considering the method used and the type of data available. Based on the study conducted, it is possible to see a growing trend and interest in the use of machine learning applications in nephrology, becoming an additional tool for physicians, which can enable them to make more accurate and faster diagnoses, although there remains a major limitation given the difficulty in creating public databases that can be used by the scientific community to corroborate and eventually make a positive contribution in this area.
Collapse
|
12
|
Shao B, Qu Y, Zhang W, Zhan H, Li Z, Han X, Ma M, Du Z. Machine Learning-Based Prediction Method for Tremors Induced by Tacrolimus in the Treatment of Nephrotic Syndrome. Front Pharmacol 2022; 13:708610. [PMID: 35571087 PMCID: PMC9091175 DOI: 10.3389/fphar.2022.708610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 03/25/2022] [Indexed: 11/15/2022] Open
Abstract
Tremors have been reported even with a low dose of tacrolimus in patients with nephrotic syndrome and are responsible for hampering the day-to-day work of young active patients with nephrotic syndrome. This study proposes a neural network model based on seven variables to predict the development of tremors following tacrolimus. The sensitivity and specificity of this algorithm are high. A total of 252 patients were included in this study, out of which 39 (15.5%) experienced tremors, 181 patients (including 32 patients who experienced tremors) were randomly assigned to a training dataset, and the remaining were assigned to an external validation set. We used a recursive feature elimination algorithm to train the training dataset, in turn, through 10-fold cross-validation. The classification performance of the classifer was then used as the evaluation criterion for these subsets to find the subset of optimal features. A neural network was used as a classification algorithm to accurately predict tremors using the subset of optimal features. This model was subsequently tested in the validation dataset. The subset of optimal features contained seven variables (creatinine, D-dimer, total protein, calcium ion, platelet distribution width, serum kalium, and fibrinogen), and the highest accuracy obtained was 0.8288. The neural network model based on these seven variables obtained an area under the curve (AUC) value of 0.9726, an accuracy of 0.9345, a sensitivity of 0.9712, and a specificity of 0.7586 in the training set. Meanwhile, the external validation achieved an accuracy of 0.8214, a sensitivity of 0.8378, and a specificity of 0.7000 in the validation dataset. This model was capable of predicting tremors caused by tacrolimus with an excellent degree of accuracy, which can be beneficial in the treatment of nephrotic syndrome patients.
Collapse
Affiliation(s)
- Bing Shao
- Department of Pharmacy, The Second Affiliated Hospital, Harbin Medical University (Key Laboratory of Medications Research, College of Heilongjiang Province), Harbin, China.,School of Pharmacy, Harbin Medical University, Harbin, China
| | - Youyang Qu
- Neurology Department, The Second Affiliated Hospital, Harbin Medical University, Harbin, China
| | - Wei Zhang
- Nephrology Department, The Second Affiliated Hospital, Harbin Medical University, Harbin, China
| | - Haihe Zhan
- Department of Pharmacy, The Second Affiliated Hospital, Harbin Medical University (Key Laboratory of Medications Research, College of Heilongjiang Province), Harbin, China
| | - Zerong Li
- Department of Pharmacy, The Second Affiliated Hospital, Harbin Medical University (Key Laboratory of Medications Research, College of Heilongjiang Province), Harbin, China
| | - Xingyu Han
- Department of Pharmacy, The Second Affiliated Hospital, Harbin Medical University (Key Laboratory of Medications Research, College of Heilongjiang Province), Harbin, China.,School of Pharmacy, Harbin Medical University, Harbin, China
| | - Mengchao Ma
- School of Pharmacy, Harbin Medical University, Harbin, China
| | - Zhimin Du
- Department of Pharmacy, The Second Affiliated Hospital, Harbin Medical University (Key Laboratory of Medications Research, College of Heilongjiang Province), Harbin, China.,School of Pharmacy, Harbin Medical University, Harbin, China.,State Key Laboratory of Quality Research in Chinese Medicines, Macau University of Science and Technology, Macau, China
| |
Collapse
|
13
|
Sheh A, Artim SC, Burns MA, Molina-Mora JA, Lee MA, Dzink-Fox J, Muthupalani S, Fox JG. Alterations in common marmoset gut microbiome associated with duodenal strictures. Sci Rep 2022; 12:5277. [PMID: 35347206 PMCID: PMC8960757 DOI: 10.1038/s41598-022-09268-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Accepted: 03/21/2022] [Indexed: 12/13/2022] Open
Abstract
Chronic gastrointestinal (GI) diseases are the most common diseases in captive common marmosets (Callithrix jacchus). Despite standardized housing, diet and husbandry, a recently described gastrointestinal syndrome characterized by duodenal ulcers and strictures was observed in a subset of marmosets sourced from the New England Primate Research Center. As changes in the gut microbiome have been associated with GI diseases, the gut microbiome of 52 healthy, non-stricture marmosets (153 samples) were compared to the gut microbiome of 21 captive marmosets diagnosed with a duodenal ulcer/stricture (57 samples). No significant changes were observed using alpha diversity metrics, and while the community structure was significantly different when comparing beta diversity between healthy and stricture cases, the results were inconclusive due to differences observed in the dispersion of both datasets. Differences in the abundance of individual taxa using ANCOM, as stricture-associated dysbiosis was characterized by Anaerobiospirillum loss and Clostridium perfringens increases. To identify microbial and serum biomarkers that could help classify stricture cases, we developed models using machine learning algorithms (random forest, classification and regression trees, support vector machines and k-nearest neighbors) to classify microbiome, serum chemistry or complete blood count (CBC) data. Random forest (RF) models were the most accurate models and correctly classified strictures using either 9 ASVs (amplicon sequence variants), 4 serum chemistry tests or 6 CBC tests. Based on the RF model and ANCOM results, C. perfringens was identified as a potential causative agent associated with the development of strictures. Clostridium perfringens was also isolated by microbiological culture in 4 of 9 duodenum samples from marmosets with histologically confirmed strictures. Due to the enrichment of C. perfringens in situ, we analyzed frozen duodenal tissues using both 16S microbiome profiling and RNAseq. Microbiome analysis of the duodenal tissues of 29 marmosets from the MIT colony confirmed an increased abundance of Clostridium in stricture cases. Comparison of the duodenal gene expression from stricture and non-stricture marmosets found enrichment of genes associated with intestinal absorption, and lipid metabolism, localization, and transport in stricture cases. Using machine learning, we identified increased abundance of C. perfringens, as a potential causative agent of GI disease and intestinal strictures in marmosets.
Collapse
Affiliation(s)
- Alexander Sheh
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Stephen C Artim
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- Merck Research Laboratories, Merck, South San Francisco, CA, USA
| | - Monika A Burns
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jose Arturo Molina-Mora
- Centro de Investigación en Enfermedades Tropicales (CIET), Universidad de Costa Rica, San José, Costa Rica
| | - Mary Anne Lee
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
| | - JoAnn Dzink-Fox
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - James G Fox
- Division of Comparative Medicine, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
14
|
Liu XQ, Jiang TT, Wang MY, Liu WT, Huang Y, Huang YL, Jin FY, Zhao Q, Wang GH, Ruan XZ, Liu BC, Ma KL. Using Machine Learning to Evaluate the Role of Microinflammation in Cardiovascular Events in Patients With Chronic Kidney Disease. Front Immunol 2022; 12:796383. [PMID: 35082785 PMCID: PMC8784809 DOI: 10.3389/fimmu.2021.796383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 12/16/2021] [Indexed: 11/19/2022] Open
Abstract
Background Lipid metabolism disorder, as one major complication in patients with chronic kidney disease (CKD), is tied to an increased risk for cardiovascular disease (CVD). Traditional lipid-lowering statins have been found to have limited benefit for the final CVD outcome of CKD patients. Therefore, the purpose of this study was to investigate the effect of microinflammation on CVD in statin-treated CKD patients. Methods We retrospectively analysed statin-treated CKD patients from January 2013 to September 2020. Machine learning algorithms were employed to develop models of low-density lipoprotein (LDL) levels and CVD indices. A fivefold cross-validation method was employed against the problem of overfitting. The accuracy and area under the receiver operating characteristic (ROC) curve (AUC) were acquired for evaluation. The Gini impurity index of the predictors for the random forest (RF) model was ranked to perform an analysis of importance. Results The RF algorithm performed best for both the LDL and CVD models, with accuracies of 82.27% and 74.15%, respectively, and is therefore the most suitable method for clinical data processing. The Gini impurity ranking of the LDL model revealed that hypersensitive C-reactive protein (hs-CRP) was highly relevant, whereas statin use and sex had the least important effects on the outcomes of both the LDL and CVD models. hs-CRP was the strongest predictor of CVD events. Conclusion Microinflammation is closely associated with potential CVD events in CKD patients, suggesting that therapeutic strategies against microinflammation should be implemented to prevent CVD events in CKD patients treated by statin.
Collapse
Affiliation(s)
- Xiao Qi Liu
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Ting Ting Jiang
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Meng Ying Wang
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Wen Tao Liu
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Yang Huang
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Yu Lin Huang
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Feng Yong Jin
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Qing Zhao
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Gui Hua Wang
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Xiong Zhong Ruan
- John Moorhead Research Laboratory, Department of Renal Medicine, University College London (UCL) Medical School, London, United Kingdom
| | - Bi Cheng Liu
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| | - Kun Ling Ma
- Institute of Nephrology, Zhongda Hospital, School of Medicine, Southeast University, Nanjing, China
| |
Collapse
|
15
|
Allen A, Iqbal Z, Green-Saxena A, Hurtado M, Hoffman J, Mao Q, Das R. Prediction of diabetic kidney disease with machine learning algorithms, upon the initial diagnosis of type 2 diabetes mellitus. BMJ Open Diabetes Res Care 2022; 10:10/1/e002560. [PMID: 35046014 PMCID: PMC8772425 DOI: 10.1136/bmjdrc-2021-002560] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 12/27/2021] [Indexed: 02/06/2023] Open
Abstract
INTRODUCTION Diabetic kidney disease (DKD) accounts for the majority of increased risk of mortality for patients with diabetes, and eventually manifests in approximately half of those patients diagnosed with type 2 diabetes mellitus (T2DM). Although increased screening frequency can avoid delayed diagnoses, this is not uniformly implemented. The purpose of this study was to develop and retrospectively validate a machine learning algorithm (MLA) that predicts stages of DKD within 5 years upon diagnosis of T2DM. RESEARCH DESIGN AND METHODS Two MLAs were trained to predict stages of DKD severity, and compared with the Centers for Disease Control and Prevention (CDC) risk score to evaluate performance. The models were validated on a hold-out test set as well as an external dataset sourced from separate facilities. RESULTS The MLAs outperformed the CDC risk score in both the hold-out test and external datasets. Our algorithms achieved an area under the receiver operating characteristic curve (AUROC) of 0.75 on the hold-out set for prediction of any-stage DKD and an AUROC of over 0.82 for more severe endpoints, compared with the CDC risk score with an AUROC <0.70 on all test sets and endpoints. CONCLUSION This retrospective study shows that an MLA can provide timely predictions of DKD among patients with recently diagnosed T2DM.
Collapse
Affiliation(s)
- Angier Allen
- Research and Development, Dascena, Houston, Texas, USA
| | - Zohora Iqbal
- Research and Development, Dascena, Houston, Texas, USA
| | | | - Myrna Hurtado
- Research and Development, Dascena, Houston, Texas, USA
| | - Jana Hoffman
- Research and Development, Dascena, Houston, Texas, USA
| | - Qingqing Mao
- Research and Development, Dascena, Houston, Texas, USA
| | - Ritankar Das
- Research and Development, Dascena, Houston, Texas, USA
| |
Collapse
|
16
|
Zhang W, Liu X, Dong Z, Wang Q, Pei Z, Chen Y, Zheng Y, Wang Y, Chen P, Feng Z, Sun X, Cai G, Chen X. New Diagnostic Model for the Differentiation of Diabetic Nephropathy From Non-Diabetic Nephropathy in Chinese Patients. Front Endocrinol (Lausanne) 2022; 13:913021. [PMID: 35846333 PMCID: PMC9279696 DOI: 10.3389/fendo.2022.913021] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 05/20/2022] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The disease pathology for diabetes mellitus patients with chronic kidney disease (CKD) may be diabetic nephropathy (DN), non-diabetic renal disease (NDRD), or DN combined with NDRD. Considering that the prognosis and treatment of DN and NDRD differ, their differential diagnosis is of significance. Renal pathological biopsy is the gold standard for diagnosing DN and NDRD. However, it is invasive and cannot be implemented in many patients due to contraindications. This article constructed a new noninvasive evaluation model for differentiating DN and NDRD. METHODS We retrospectively screened 1,030 patients with type 2 diabetes who has undergone kidney biopsy from January 2005 to March 2017 in a single center. Variables were ranked according to importance, and the machine learning methods (random forest, RF, and support vector machine, SVM) were then used to construct the model. The final model was validated with an external group (338 patients, April 2017-April 2019). RESULTS In total, 929 patients were assigned. Ten variables were selected for model development. The areas under the receiver operating characteristic curves (AUCROCs) for the RF and SVM methods were 0.953 and 0.947, respectively. Additionally, 329 patients were analyzed for external validation. The AUCROCs for the external validation of the RF and SVM methods were 0.920 and 0.911, respectively. CONCLUSION We successfully constructed a predictive model for DN and NDRD using machine learning methods, which were better than our regression methods. CLINICAL TRIAL REGISTRATION ClinicalTrial.gov, NCT03865914.
Collapse
Affiliation(s)
- WeiGuang Zhang
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - XiaoMin Liu
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - ZheYi Dong
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - Qian Wang
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - ZhiYong Pei
- Beijing Computing Center, Beike Industry, Yongfeng Industrial Base, Beijing, China
| | - YiZhi Chen
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
- Department of Nephrology, Hainan Hospital of Chinese PLA General Hospital, the Hainan Academician Team Innovation Center, Sanya, China
| | - Ying Zheng
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - Yong Wang
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - Pu Chen
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - Zhe Feng
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - XueFeng Sun
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
| | - Guangyan Cai
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
- *Correspondence: XiangMei Chen, ; Guangyan Cai,
| | - XiangMei Chen
- Department of Nephrology, The First Medical Center, Chinese PLA General Hospital, Chinese PLA Institute of Nephrology, State Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing, China
- *Correspondence: XiangMei Chen, ; Guangyan Cai,
| |
Collapse
|
17
|
Robson B, Boray S, Weisman J. Mining real-world high dimensional structured data in medicine and its use in decision support. Some different perspectives on unknowns, interdependency, and distinguishability. Comput Biol Med 2021; 141:105118. [PMID: 34971979 DOI: 10.1016/j.compbiomed.2021.105118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Revised: 11/18/2021] [Accepted: 12/02/2021] [Indexed: 11/03/2022]
Abstract
There are many difficulties in extracting and using knowledge for medical analytic and predictive purposes from Real-World Data, even when the data is already well structured in the manner of a large spreadsheet. Preparative curation and standardization or "normalization" of such data involves a variety of chores but underlying them is an interrelated set of fundamental problems that can in part be dealt with automatically during the datamining and inference processes. These fundamental problems are reviewed here and illustrated and investigated with examples. They concern the treatment of unknowns, the need to avoid independency assumptions, and the appearance of entries that may not be fully distinguished from each other. Unknowns include errors detected as implausible (e.g., out of range) values that are subsequently converted to unknowns. These problems are further impacted by high dimensionality and problems of sparse data that inevitably arise from high-dimensional datamining even if the data is extensive. All these considerations are different aspects of incomplete information, though they also relate to problems that arise if care is not taken to avoid or ameliorate consequences of including the same information twice or more, or if misleading or inconsistent information is combined. This paper addresses these aspects from a slightly different perspective using the Q-UEL language and inference methods based on it by borrowing some ideas from the mathematics of quantum mechanics and information theory. It takes the view that detection and correction of probabilistic elements of knowledge subsequently used in inference need only involve testing and correction so that they satisfy certain extended notions of coherence between probabilities. This is by no means the only possible view, and it is explored here and later compared with a related notion of consistency.
Collapse
Affiliation(s)
- Barry Robson
- Ingine Inc, Ohio, USA; The Dirac Foundation, Oxfordshire, UK.
| | | | - J Weisman
- The Dirac Foundation, Oxfordshire, UK.
| |
Collapse
|
18
|
Nesaragi N, Patidar S, Thangaraj V. A correlation matrix-based tensor decomposition method for early prediction of sepsis from clinical data. Biocybern Biomed Eng 2021. [DOI: 10.1016/j.bbe.2021.06.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
19
|
Wei Y, Jiang Z. The evolution and future of diabetic kidney disease research: a bibliometric analysis. BMC Nephrol 2021; 22:158. [PMID: 33926393 PMCID: PMC8084262 DOI: 10.1186/s12882-021-02369-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 04/20/2021] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Diabetic kidney disease (DKD) is one of the most important complications of diabetic mellitus. It is essential for nephrologists to understand the evolution and development trends of DKD. METHODS Based on the total cited numbers in the Web of Science Core Collection, which was searched through September 28th, 2020, we performed a bibliometric analysis of the top 100 most cited full-length original articles on the subject of DKD. The timespans, authors, contributions, subcategories, and topics of those 100 articles were analysed. In addition, the evolution of topics in DKD research was investigated. RESULTS There were 23,968 items under the subject of DKD in the Web of Science Core Collection. The top 100 cited articles, published from 1999 to 2017, were cited 38,855 times in total. Researchers from the USA contributed the most publications. The number of articles included in 'Experimental studies (EG)', 'Clinical studies (CS)', 'Epidemiological studies (ES)', and 'Pathological and pathophysiological studies (PP)' were 65, 26, 7, and 2, respectively. Among the 15 topics, the most popular topic is the renin-angiotensin-aldosterone system (RAAS), occurring in 26 articles, including 6 of the top 10 most cited articles. The evolution of topics reveals that the role of RAAS inhibitor is a continuous hotspot, and sodium-glucose cotransporter 2 (SGLT-2) inhibitor and glucagon-like peptide 1 (GLP-1) agonist are two renoprotective agents which represent novel therapeutic methods in DKD. In addition, the 26 clinical studies among the top 100 most cited articles were highlighted, as they help guide clinical practice to better serve patients. CONCLUSIONS This bibliometric analysis of the top 100 most cited articles revealed important studies, popular topics, and trends in DKD research to assist researchers in further understanding the subject.
Collapse
Affiliation(s)
- Yi Wei
- Department of Nephrology, The Sixth Affiliated Hospital, Sun Yat-sen University, 26 Yuancun Er Heng Road, Guangzhou, 510655, China
| | - Zongpei Jiang
- Department of Nephrology, The Sixth Affiliated Hospital, Sun Yat-sen University, 26 Yuancun Er Heng Road, Guangzhou, 510655, China.
| |
Collapse
|
20
|
Molina Mora JA, Montero-Manso P, García-Batán R, Campos-Sánchez R, Vilar-Fernández J, García F. A first perturbome of Pseudomonas aeruginosa: Identification of core genes related to multiple perturbations by a machine learning approach. Biosystems 2021; 205:104411. [PMID: 33757842 DOI: 10.1016/j.biosystems.2021.104411] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Revised: 03/11/2021] [Accepted: 03/12/2021] [Indexed: 01/27/2023]
Abstract
Tolerance to stress conditions is vital for organismal survival, including bacteria under specific environmental conditions, antibiotics, and other perturbations. Some studies have described common modulation and shared genes during stress response to different types of disturbances (termed as perturbome), leading to the idea of central control at the molecular level. We implemented a robust machine learning approach to identify and describe genes associated with multiple perturbations or perturbome in a Pseudomonas aeruginosa PAO1 model. Using microarray datasets from the Gene Expression Omnibus (GEO), we evaluated six approaches to rank and select genes: using two methodologies, data single partition (SP method) or multiple partitions (MP method) for training and testing datasets, we evaluated three classification algorithms (SVM Support Vector Machine, KNN K-Nearest neighbor and RF Random Forest). Gene expression patterns and topological features at the systems level were included to describe the perturbome elements. We were able to select and describe 46 core response genes associated with multiple perturbations in P. aeruginosa PAO1 and it can be considered a first report of the P. aeruginosa perturbome. Molecular annotations, patterns in expression levels, and topological features in molecular networks revealed biological functions of biosynthesis, binding, and metabolism, many of them related to DNA damage repair and aerobic respiration in the context of tolerance to stress. We also discuss different issues related to implemented and assessed algorithms, including data partitioning, classification approaches, and metrics. Altogether, this work offers a different and robust framework to select genes using a machine learning approach.
Collapse
Affiliation(s)
- Jose Arturo Molina Mora
- Centro de Investigacion en Enfermedades Tropicales (CIET) and Facultad de Microbiología, Universidad de Costa Rica, San Jose, Costa Rica.
| | | | - Raquel García-Batán
- Centro de Investigacion en Enfermedades Tropicales (CIET) and Facultad de Microbiología, Universidad de Costa Rica, San Jose, Costa Rica.
| | - Rebeca Campos-Sánchez
- Centro de Investigación en Biología Celular y Molecular (CIBCM), Universidad de Costa Rica, San José, Costa Rica.
| | | | - Fernando García
- Centro de Investigacion en Enfermedades Tropicales (CIET) and Facultad de Microbiología, Universidad de Costa Rica, San Jose, Costa Rica.
| |
Collapse
|
21
|
Abstract
Machine learning shows enormous potential in facilitating decision-making regarding kidney diseases. With the development of data preservation and processing, as well as the advancement of machine learning algorithms, machine learning is expected to make remarkable breakthroughs in nephrology. Machine learning models have yielded many preliminaries to moderate and several excellent achievements in the fields, including analysis of renal pathological images, diagnosis and prognosis of chronic kidney diseases and acute kidney injury, as well as management of dialysis treatments. However, it is just scratching the surface of the field; at the same time, machine learning and its applications in renal diseases are facing a number of challenges. In this review, we discuss the application status, challenges and future prospects of machine learning in nephrology to help people further understand and improve the capacity for prediction, detection, and care quality in kidney diseases.
Collapse
|
22
|
Abstract
Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease all over the world and it is a major cause of renal failure. IgAN prediction in children with machine learning algorithms has been rarely studied. We retrospectively analyzed the electronic medical records from the Nanjing Eastern War Zone Hospital, chose eXtreme Gradient Boosting (XGBoost), random forest (RF), CatBoost, support vector machines (SVM), k-nearest neighbor (KNN), and extreme learning machine (ELM) models in order to predict the probability that the patient would not reach or reach end-stage renal disease (ESRD) within five years, used the chi-square test to select the most relevant 16 features as the input of the model, and designed a decision-making system (DMS) of IgAN prediction in children that is based on XGBoost and Django framework. The receiver operating characteristic (ROC) curve was used in order to evaluate the performance of the models and XGBoost had the best performance by comparison. The AUC value, accuracy, precision, recall, and f1-score of XGBoost were 85.11%, 78.60%, 75.96%, 76.70%, and 76.33%, respectively. The XGBoost model is useful for physicians and pediatric patients in providing predictions regarding IgAN. As an advantage, a DMS can be designed based on the XGBoost model to assist a physician to effectively treat IgAN in children for preventing deterioration.
Collapse
|
23
|
Liu S, Zhang R, Shang X, Li W. Analysis for warning factors of type 2 diabetes mellitus complications with Markov blanket based on a Bayesian network model. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2020; 188:105302. [PMID: 31923820 DOI: 10.1016/j.cmpb.2019.105302] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Revised: 12/05/2019] [Accepted: 12/24/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND AND OBJECTIVE Type 2 diabetes mellitus (T2DM) complications seriously affect the quality of life and could not be cured completely. Actions should be taken for prevention and self-management. Analysis of warning factors is beneficial for patients, on which some previous studies focused. They generally used the professional medical test factors or complete factors to predict and prevent, but it was inconvenient and impractical for patients to self-manage. With this in mind, this study built a Bayesian network (BN) model, from the perspective of diabetic patients' self-management and prevention, to predict six complications of T2DM using the selected warning factors which patients could have access from medical examination. Furthermore, the model was analyzed to explore the relationships between physiological variables and T2DM complications, as well as the complications themselves. The model aims to help patients with T2DM self-manage and prevent themselves from complications. METHODS The dataset was collected from a well-known data center called the National Health Clinical Center between 1st January 2009 and 31st December 2009. After preprocess and impute the data, a BN model merging expert knowledge was built with Bootstrap and Tabu search algorithm. Markov Blanket (MB) was used to select the warning factors and predict T2DM complications. Moreover, a Bayesian network without prior information (BN-wopi) model learned using 10-fold cross-validation both in structure and in parameters was added to compare with other classifiers learned using 10-fold cross-validation fairly. The warning factors were selected according the structure learned in each fold and were used to predict. Finally, the performance of two BN models using warning features were compared with Naïve Bayes model, Random Forest model, and C5.0 Decision Tree model, which used all features to predict. Besides, the validation parameters of the proposed model were also compared with those in existing studies using some other variables in clinical data or biomedical data to predict T2DM complications. RESULTS Experimental results indicated that the BN models using warning factors performed statistically better than their counterparts using all other variables in predicting T2DM complications. In addition, the proposed BN model were effective and significant in predicting diabetic nephropathy (DN) (AUC: 0.831), diabetic foot (DF) (AUC: 0.905), diabetic macrovascular complications (DMV) (AUC: 0.753) and diabetic ketoacidosis (DK) (AUC: 0.877) with the selected warning factors compared with other experiments. CONCLUSIONS The warning factors of DN, DF, DMV, and DK selected by MB in this research might be able to help predict certain T2DM complications effectively, and the proposed BN model might be used as a general tool for prevention, monitoring, and self-management.
Collapse
Affiliation(s)
- Siying Liu
- School of Economics and Management, Beijing Jiaotong University, Beijing 100044, PR China
| | - Runtong Zhang
- School of Economics and Management, Beijing Jiaotong University, Beijing 100044, PR China.
| | - Xiaopu Shang
- School of Economics and Management, Beijing Jiaotong University, Beijing 100044, PR China
| | - Weizi Li
- Informatics Research Center, University of Reading, Berkshire RG6 6AH, United Kingdom
| |
Collapse
|
24
|
Yuan Q, Zhang H, Deng T, Tang S, Yuan X, Tang W, Xie Y, Ge H, Wang X, Zhou Q, Xiao X. Role of Artificial Intelligence in Kidney Disease. Int J Med Sci 2020; 17:970-984. [PMID: 32308551 PMCID: PMC7163364 DOI: 10.7150/ijms.42078] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 03/17/2020] [Indexed: 12/17/2022] Open
Abstract
Artificial intelligence (AI), as an advanced science technology, has been widely used in medical fields to promote medical development, mainly applied to early detections, disease diagnoses, and management. Owing to the huge number of patients, kidney disease remains a global health problem. Challenges remain in its diagnosis and treatment. AI could take individual conditions into account, produce suitable decisions and promise to make great strides in kidney disease management. Here, we review the current studies of AI applications in kidney disease in alerting systems, diagnostic assistance, guiding treatment and evaluating prognosis. Although the number of studies related to AI applications in kidney disease is small, the potential of AI in the management of kidney disease is well recognized by clinicians; AI will greatly enhance clinicians' capacity in their clinical practice in the future.
Collapse
Affiliation(s)
- Qiongjing Yuan
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Haixia Zhang
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China.,Department of Nephrology, Second Affiliated Hospital of Soochow University, 1055 Sanxiang Road, Suzhou, Jiangsu 215000, China
| | - Tianci Deng
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Shumei Tang
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Xiangning Yuan
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Wenbin Tang
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Yanyun Xie
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Huipeng Ge
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Xiufen Wang
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Qiaoling Zhou
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| | - Xiangcheng Xiao
- Department of Nephrology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, China
| |
Collapse
|
25
|
Abhari S, Niakan Kalhori SR, Ebrahimi M, Hasannejadasl H, Garavand A. Artificial Intelligence Applications in Type 2 Diabetes Mellitus Care: Focus on Machine Learning Methods. Healthc Inform Res 2019; 25:248-261. [PMID: 31777668 PMCID: PMC6859270 DOI: 10.4258/hir.2019.25.4.248] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Revised: 10/06/2019] [Accepted: 10/09/2019] [Indexed: 12/18/2022] Open
Abstract
Objectives The incidence of type 2 diabetes mellitus has increased significantly in recent years. With the development of artificial intelligence applications in healthcare, they are used for diagnosis, therapeutic decision making, and outcome prediction, especially in type 2 diabetes mellitus. This study aimed to identify the artificial intelligence (AI) applications for type 2 diabetes mellitus care. Methods This is a review conducted in 2018. We searched the PubMed, Web of Science, and Embase scientific databases, based on a combination of related mesh terms. The article selection process was based on Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). Finally, 31 articles were selected after inclusion and exclusion criteria were applied. Data gathering was done by using a data extraction form. Data were summarized and reported based on the study objectives. Results The main applications of AI for type 2 diabetes mellitus care were screening and diagnosis in different stages. Among all of the reviewed AI methods, machine learning methods with 71% (n = 22) were the most commonly applied techniques. Many applications were in multi method forms (23%). Among the machine learning algorithms applications, support vector machine (21%) and naive Bayesian (19%) were the most commonly used methods. The most important variables that were used in the selected studies were body mass index, fasting blood sugar, blood pressure, HbA1c, triglycerides, low-density lipoprotein, high-density lipoprotein, and demographic variables. Conclusions It is recommended to select optimal algorithms by testing various techniques. Support vector machine and naive Bayesian might achieve better performance than other applications due to the type of variables and targets in diabetes-related outcomes classification.
Collapse
Affiliation(s)
- Shahabeddin Abhari
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Sharareh R Niakan Kalhori
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Mehdi Ebrahimi
- Department of Internal Medicine, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.,Endocrinology and Metabolism Research Center, Endocrinology and Metabolism Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Hajar Hasannejadasl
- Department of Health Information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Ali Garavand
- Department of Health Information Management and Technology, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
26
|
Santhanam P, Ahima RS. Machine learning and blood pressure. J Clin Hypertens (Greenwich) 2019; 21:1735-1737. [PMID: 31536164 DOI: 10.1111/jch.13700] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 08/26/2019] [Indexed: 12/25/2022]
Abstract
Machine learning (ML) is a type of artificial intelligence (AI) based on pattern recognition. There are different forms of supervised and unsupervised learning algorithms that are being used to identify and predict blood pressure (BP) and other measures of cardiovascular risk. Since 1999, starting with neural network methods, ML has been used to gauge the relationship between BP and pulse wave forms. Since then, the scope of the research has expanded to using different cardiometabolic risk factors like BMI, waist circumference, waist-to-hip ratio in concert with BP and its various pharmaceutical agents to estimate biochemical measures (like HDL cholesterol, LDL and total cholesterol, fibrinogen, and uric acid) as well as effectiveness of anti-hypertensive regimens. Data from large clinical trials like the SPRINT are being re-analyzed by ML methods to unearth new findings and identify unique relationships between predictors and outcomes. In summary, AI and ML methods are gaining immense attention in the management of chronic disease. Elevated BP is a very important early metric for the risk of development of cardiovascular and renal injury; therefore, advances in AI and ML will aid in early disease prediction and intervention.
Collapse
Affiliation(s)
- Prasanna Santhanam
- Division of Endocrinology, Diabetes, & Metabolism, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Rexford S Ahima
- Division of Endocrinology, Diabetes, & Metabolism, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| |
Collapse
|
27
|
Robson B. Bidirectional General Graphs for inference. Principles and implications for medicine. Comput Biol Med 2019; 108:382-399. [PMID: 31075569 DOI: 10.1016/j.compbiomed.2019.04.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 04/03/2019] [Accepted: 04/04/2019] [Indexed: 12/17/2022]
Abstract
Probabilistic inference methods require a more general and realistic description of the world as a Bidirectional General Graph (BGG). While in its original form the Bayes Net (BN) has been promoted as a predictive tool, it is more immediately a way of testing a hypothesis or model about interactions in a system usually considered on a causal basis. Once established, the model can be used in a predictive way, but the problem here is that for a traditional BN the hypotheses or models that can be formed are limited to the Directed Acyclic Graph (DAG) by definition. Three interrelated features are highlighted that represent deficiencies of the DAG which are corrected by conversion to a method based on a BGG: (i) lack of intrinsic representation of coherence by Bayes' rule, (ii) relatedly the need to consider interdependence in parent nodes, and (iii) the need for management of a property called recurrence. These deficiencies can represent large errors in absolute estimates of probabilities, and while relative and renormalized probabilities ameliorate that, they can often make much of a net superfluous through cancelations by division. The Hyperbolic Dirac Net (HDN) based on Dirac's quantum mechanics is a solution that led naturally to avoiding these deficiencies. It encodes bidirectional probabilities in an h-complex value rediscovered by Dirac, i.e. with the imaginary number h such that hh = +1. Properties of the HDN described previously are reviewed (though emphasis is on descriptions in familiar probability terms), the issue of recurrence is introduced, methods of construction are simplified, and the severity of the quantitative differences between BNs and analogous HDNs are exemplified. There is also discussion of how results compare with other approaches in practice.
Collapse
Affiliation(s)
- Barry Robson
- Ingine Inc. Viginia, USA; The Dirac Foundation, OxfordShire, UK.
| |
Collapse
|
28
|
Identification of Traditional Chinese Medicine Constitutions and Physiological Indexes Risk Factors in Metabolic Syndrome: A Data Mining Approach. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2019; 2019:1686205. [PMID: 30854002 PMCID: PMC6378021 DOI: 10.1155/2019/1686205] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Accepted: 01/13/2019] [Indexed: 12/20/2022]
Abstract
Objective In order to find the predictive indexes for metabolic syndrome (MS), a data mining method was used to identify significant physiological indexes and traditional Chinese medicine (TCM) constitutions. Methods The annual health check-up data including physical examination data; biochemical tests and Constitution in Chinese Medicine Questionnaire (CCMQ) measurement data from 2014 to 2016 were screened according to the inclusion and exclusion criteria. A predictive matrix was established by the longitudinal data of three consecutive years. TreeNet machine learning algorithm was applied to build prediction model to uncover the dependence relationship between physiological indexes, TCM constitutions, and MS. Results By model testing, the overall accuracy rate for prediction model by TreeNet was 73.23%. Top 12.31% individuals in test group (n=325) that have higher probability of having MS covered 23.68% MS patients, showing 0.92 times more risk of having MS than the general population. Importance of ranked top 15 was listed in descending order . The top 5 variables of great importance in MS prediction were TBIL difference between 2014 and 2015 (D_TBIL), TBIL in 2014 (TBIL 2014), LDL-C difference between 2014 and 2015 (D_LDL-C), CCMQ scores for balanced constitution in 2015 (balanced constitution 2015), and TCH in 2015 (TCH 2015). When D_TBIL was between 0 and 2, TBIL 2014 was between 10 and 15, D_LDL-C was above 19, balanced constitution 2015 was below 60, or TCH 2015 was above 5.7, the incidence of MS was higher. Furthermore, there were interactions between balanced constitution 2015 score and TBIL 2014 or D_LDL-C in MS prediction. Conclusion Balanced constitution, TBIL, LDL-C, and TCH level can act as predictors for MS. The combination of TCM constitution and physiological indexes can give early warning to MS.
Collapse
|
29
|
Thippakorn C, Schaduangrat N, Nantasenamat C. Proteomic and bioinformatic discovery of biomarkers for diabetic nephropathy. EXCLI JOURNAL 2018; 17:312-330. [PMID: 29805343 PMCID: PMC5962897 DOI: 10.17179/excli2018-1150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 03/12/2018] [Indexed: 11/13/2022]
Abstract
Diabetes is associated with numerous metabolic and vascular risk factors that contribute to a high rate of micro-vascular and macro-vascular disorders leading to mortality and morbidity from diabetic complications. In this case, the major cause of death in overall diabetic patients results from diabetic nephropathy (DN) or renal failure. The risk factors and mechanisms that correspond to the development of DN are not fully understood and so far, no specific and sufficient diagnostic biomarkers are currently available other than micro- or macroalbuminuria. Therefore, this review describes current and novel protein biomarkers in the context of DN as well as probable proteins biomarkers associated with pathological processes for the early stage of DN via proteomics data together with bioinformatics. In addition, the mechanisms involved in early development of diabetic vascular disorders and complications resulting from glucose induced oxidative stress will also be explored.
Collapse
Affiliation(s)
- Chadinee Thippakorn
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
30
|
Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine Learning and Data Mining Methods in Diabetes Research. Comput Struct Biotechnol J 2017; 15:104-116. [PMID: 28138367 PMCID: PMC5257026 DOI: 10.1016/j.csbj.2016.12.005] [Citation(s) in RCA: 358] [Impact Index Per Article: 51.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Revised: 12/20/2016] [Accepted: 12/27/2016] [Indexed: 12/14/2022] Open
Abstract
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.
Collapse
Affiliation(s)
- Ioannis Kavakiotis
- Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
| | - Olga Tsave
- Laboratory of Inorganic Chemistry, Department of Chemical Engineering, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Athanasios Salifoglou
- Laboratory of Inorganic Chemistry, Department of Chemical Engineering, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Nicos Maglaveras
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
- Lab of Computing and Medical Informatics, Medical School, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Ioannis Vlahavas
- Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Ioanna Chouvarda
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
- Lab of Computing and Medical Informatics, Medical School, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| |
Collapse
|
31
|
Langarizadeh M, Moghbeli F. Applying Naive Bayesian Networks to Disease Prediction: a Systematic Review. Acta Inform Med 2016; 24:364-369. [PMID: 28077895 PMCID: PMC5203736 DOI: 10.5455/aim.2016.24.364-369] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2016] [Accepted: 10/11/2016] [Indexed: 12/15/2022] Open
Abstract
INTRODUCTION Naive Bayesian networks (NBNs) are one of the most effective and simplest Bayesian networks for prediction. OBJECTIVE This paper aims to review published evidence about the application of NBNs in predicting disease and it tries to show NBNs as the fundamental algorithm for the best performance in comparison with other algorithms. METHODS PubMed was electronically checked for articles published between 2005 and 2015. For characterizing eligible articles, a comprehensive electronic searching method was conducted. Inclusion criteria were determined based on NBN and its effects on disease prediction. A total of 99 articles were found. After excluding the duplicates (n= 5), the titles and abstracts of 94 articles were skimmed according to the inclusion criteria. Finally, 38 articles remained. They were reviewed in full text and 15 articles were excluded. Eventually, 23 articles were selected which met our eligibility criteria and were included in this study. RESULT In this article, the use of NBN in predicting diseases was described. Finally, the results were reported in terms of Accuracy, Sensitivity, Specificity and Area under ROC curve (AUC). The last column in Table 2 shows the differences between NBNs and other algorithms. DISCUSSION This systematic review (23 studies, 53,725 patients) indicates that predicting diseases based on a NBN had the best performance in most diseases in comparison with the other algorithms. Finally in most cases NBN works better than other algorithms based on the reported accuracy. CONCLUSION The method, termed NBNs is proposed and can efficiently construct a prediction model for disease.
Collapse
Affiliation(s)
- Mostafa Langarizadeh
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Fateme Moghbeli
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
32
|
Rodriguez M, Salmeron MD, Martin-Malo A, Barbieri C, Mari F, Molina RI, Costa P, Aljama P. A New Data Analysis System to Quantify Associations between Biochemical Parameters of Chronic Kidney Disease-Mineral Bone Disease. PLoS One 2016; 11:e0146801. [PMID: 26808154 PMCID: PMC4726537 DOI: 10.1371/journal.pone.0146801] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 12/21/2015] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND In hemodialysis patients, deviations from KDIGO recommended values of individual parameters, phosphate, calcium or parathyroid hormone (PTH), are associated with increased mortality. However, it is widely accepted that these parameters are not regulated independently of each other and that therapy aimed to correct one parameter often modifies the others. The aim of the present study is to quantify the degree of association between parameters of chronic kidney disease and mineral bone disease (CKD-MBD). METHODS Data was extracted from a cohort of 1758 adult HD patients between January 2000 and June 2013 obtaining a total of 46.141 records (10 year follow-up). We used an advanced data analysis system called Random Forest (RF) which is based on self-learning procedure with similar axioms to those utilized for the development of artificial intelligence. This new approach is particularly useful when the variables analyzed are closely dependent to each other. RESULTS The analysis revealed a strong association between PTH and phosphate that was superior to that of PTH and Calcium. The classical linear regression analysis between PTH and phosphate shows a correlation coefficient is 0.27, p<0.001, the possibility to predict PTH changes from phosphate modification is marginal. Alternatively, RF assumes that changes in phosphate will cause modifications in other associated variables (calcium and others) that may also affect PTH values. Using RF the correlation coefficient between changes in serum PTH and phosphate is 0.77, p<0.001; thus, the power of prediction is markedly increased. The effect of therapy on biochemical variables was also analyzed using this RF. CONCLUSION Our results suggest that the analysis of the complex interactions between mineral metabolism parameters in CKD-MBD may demand a more advanced data analysis system such as RF.
Collapse
Affiliation(s)
- Mariano Rodriguez
- Nephrology Service, Hospital Reina Sofia, IMIBIC, University of Cordoba, Cordoba, Spain
| | - M. Dolores Salmeron
- Nephrology Service, Hospital Reina Sofia, IMIBIC, University of Cordoba, Cordoba, Spain
| | - Alejandro Martin-Malo
- Nephrology Service, Hospital Reina Sofia, IMIBIC, University of Cordoba, Cordoba, Spain
- * E-mail:
| | | | - Flavio Mari
- Fresenius Medical Care, Bad Homburg, Germany
| | - Rafael I. Molina
- Nephrology Service, Hospital Reina Sofia, IMIBIC, University of Cordoba, Cordoba, Spain
| | | | - Pedro Aljama
- Nephrology Service, Hospital Reina Sofia, IMIBIC, University of Cordoba, Cordoba, Spain
| |
Collapse
|
33
|
Huang GM, Huang KY, Lee TY, Weng J. An interpretable rule-based diagnostic classification of diabetic nephropathy among type 2 diabetes patients. BMC Bioinformatics 2015; 16 Suppl 1:S5. [PMID: 25707942 PMCID: PMC4331704 DOI: 10.1186/1471-2105-16-s1-s5] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Background The prevalence of type 2 diabetes is increasing at an alarming rate. Various complications are associated with type 2 diabetes, with diabetic nephropathy being the leading cause of renal failure among diabetics. Often, when patients are diagnosed with diabetic nephropathy, their renal functions have already been significantly damaged. Therefore, a risk prediction tool may be beneficial for the implementation of early treatment and prevention. Results In the present study, we developed a decision tree-based model integrating genetic and clinical features in a gender-specific classification for the identification of diabetic nephropathy among type 2 diabetic patients. Clinical and genotyping data were obtained from a previous genetic association study involving 345 type 2 diabetic patients (185 with diabetic nephropathy and 160 without diabetic nephropathy). Using a five-fold cross-validation approach, the performance of using clinical or genetic features alone in various classifiers (decision tree, random forest, Naïve Bayes, and support vector machine) was compared with that of utilizing a combination of attributes. The inclusion of genetic features and the implementation of an additional gender-based rule yielded better classification results. Conclusions The current model supports the notion that genes and gender are contributing factors of diabetic nephropathy. Further refinement of the proposed approach has the potential to facilitate the early identification of diabetic nephropathy and the development of more efficient treatment in a clinical setting.
Collapse
|
34
|
Machine Learning for Predictive Modelling based on Small Data in Biomedical Engineering. ACTA ACUST UNITED AC 2015. [DOI: 10.1016/j.ifacol.2015.10.185] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
|
35
|
Attenuation of hyperlipidemia- and diabetes-induced early-stage apoptosis and late-stage renal dysfunction via administration of fibroblast growth factor-21 is associated with suppression of renal inflammation. PLoS One 2013; 8:e82275. [PMID: 24349242 PMCID: PMC3857822 DOI: 10.1371/journal.pone.0082275] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 11/01/2013] [Indexed: 12/21/2022] Open
Abstract
Background Lipotoxicity is a key feature of the pathogenesis of diabetic kidney disease, and is attributed to excessive lipid accumulation (hyperlipidemia). Increasing evidence suggests that fibroblast growth factor (FGF)21 has a crucial role in lipid metabolism under diabetic conditions. Objective The present study investigated whether FGF21 can prevent hyperlipidemia- or diabetes-induced renal damage, and if so, the possible mechanism. Methods Mice were injected with free fatty acids (FFAs, 10 mg/10 g body weight) or streptozotocin (150 mg/kg) to establish a lipotoxic model or type 1 diabetic model, respectively. Simultaneously the mice were treated with FGF21 (100 µg/kg) for 10 or 80 days. The kidney weight-to-tibia length ratio and renal function were assessed. Systematic and renal lipid levels were detected by ELISA and Oil Red O staining. Renal apoptosis was examined by TUNEL assay. Inflammation, oxidative stress, and fibrosis were assessed by Western blot. Results Acute FFA administration and chronic diabetes were associated with lower kidney-to-tibia length ratio, higher lipid levels, severe renal apoptosis and renal dysfunction. Obvious inflammation, oxidative stress and fibrosis also observed in the kidney of both mice models. Deletion of the fgf21 gene further enhanced the above pathological changes, which were significantly prevented by administration of exogenous FGF21. Conclusion These results suggest that FFA administration and diabetes induced renal damage, which was further enhanced in FGF21 knock-out mice. Administration of FGF21 significantly prevented both FFA- and diabetes-induced renal damage partially by decreasing renal lipid accumulation and suppressing inflammation, oxidative stress, and fibrosis.
Collapse
|