1
|
Yu S, Han S, Shi M, Harada M, Ge J, Li X, Cai X, Heier M, Karstenmüller G, Suhre K, Gieger C, Koenig W, Rathmann W, Peters A, Wang-Sattler R. Prediction of Myocardial Infarction Using a Combined Generative Adversarial Network Model and Feature-Enhanced Loss Function. Metabolites 2024; 14:258. [PMID: 38786735 PMCID: PMC11122941 DOI: 10.3390/metabo14050258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 04/23/2024] [Accepted: 04/24/2024] [Indexed: 05/25/2024] Open
Abstract
Accurate risk prediction for myocardial infarction (MI) is crucial for preventive strategies, given its significant impact on global mortality and morbidity. Here, we propose a novel deep-learning approach to enhance the prediction of incident MI cases by incorporating metabolomics alongside clinical risk factors. We utilized data from the KORA cohort, including the baseline S4 and follow-up F4 studies, consisting of 1454 participants without prior history of MI. The dataset comprised 19 clinical variables and 363 metabolites. Due to the imbalanced nature of the dataset (78 observed MI cases and 1376 non-MI individuals), we employed a generative adversarial network (GAN) model to generate new incident cases, augmenting the dataset and improving feature representation. To predict MI, we further utilized multi-layer perceptron (MLP) models in conjunction with the synthetic minority oversampling technique (SMOTE) and edited nearest neighbor (ENN) methods to address overfitting and underfitting issues, particularly when dealing with imbalanced datasets. To enhance prediction accuracy, we propose a novel GAN for feature-enhanced (GFE) loss function. The GFE loss function resulted in an approximate 2% improvement in prediction accuracy, yielding a final accuracy of 70%. Furthermore, we evaluated the contribution of each clinical variable and metabolite to the predictive model and identified the 10 most significant variables, including glucose tolerance, sex, and physical activity. This is the first study to construct a deep-learning approach for producing 7-year MI predictions using the newly proposed loss function. Our findings demonstrate the promising potential of our technique in identifying novel biomarkers for MI prediction.
Collapse
Affiliation(s)
- Shixiang Yu
- TUM School of Medicine and Health, Technical University of Munich, 81675 München, Germany; (S.Y.); (S.H.); (M.S.); (J.G.)
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Siyu Han
- TUM School of Medicine and Health, Technical University of Munich, 81675 München, Germany; (S.Y.); (S.H.); (M.S.); (J.G.)
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Mengya Shi
- TUM School of Medicine and Health, Technical University of Munich, 81675 München, Germany; (S.Y.); (S.H.); (M.S.); (J.G.)
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Makoto Harada
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| | - Jianhong Ge
- TUM School of Medicine and Health, Technical University of Munich, 81675 München, Germany; (S.Y.); (S.H.); (M.S.); (J.G.)
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Xuening Li
- Biocomputing R&D Department, Beijing Huanyang Bole Consulting Co., Ltd., Beijing 100010, China;
| | - Xiang Cai
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541214, China;
| | - Margit Heier
- KORA Study Centre, University Hospital of Augsburg, 86153 Augsburg, Germany;
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Gabi Karstenmüller
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Karsten Suhre
- Department of Physiology and Biophysics, Weill Cornell Medicine and Director of the Bioinformatics Core, Doha 24144, Qatar;
| | - Christian Gieger
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
| | - Wolfgang Koenig
- Deutsches Herzzentrum München, Technische Universität München, 80636 München, Germany;
| | - Wolfgang Rathmann
- Institute for Biometrics and Epidemiology, German Diabetes Center, Leibniz Center for Diabetes Research, Heinrich Heine University, 40225 Düsseldorf, Germany;
| | - Annette Peters
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
- Institute for Medical Information Processing, Biometry, and Epidemiology (IBE), Pettenkofer School of Public Health, Faculty of Medicine, Ludwig-Maximilians-Universität München, 81377 München, Germany
| | - Rui Wang-Sattler
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany;
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| |
Collapse
|
2
|
Lim NK, Park JH. The use of machine learning for investigating the role of plastic surgeons in anatomical injuries: A retrospective observational study. Medicine (Baltimore) 2022; 101:e30943. [PMID: 36221333 PMCID: PMC9542809 DOI: 10.1097/md.0000000000030943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
While plastic surgeons have been historically indispensable in the reconstruction of posttraumatic defects, their role in trauma centers worldwide has not been clearly defined. Therefore, we aimed to investigate the contribution of plastic surgeons in trauma care using machine learning from an anatomic injury viewpoint. We conducted a retrospective study reviewing the data for all trauma patients of our hospital from March 2019 to February 2021. In total, 4809 patients were classified in duplicate according to the 17 trauma-related departments while conducting the initial treatment. We evaluated several covariates, including age, sex, cause of trauma, treatment outcomes, surgical data, and severity indices, such as the Injury Severity Score and Abbreviated Injury Scale (AIS). A random forest algorithm was used to rank the relevance of 17 trauma-related departments in each category for the AIS and outcomes. Additionally, t test and chi-square test were performed to compare two groups, which were based on whether the patients had received initial treatment in the trauma bay from the plastic surgery department (PS group) or not (non-PS group), in each AIS category. The department of PS was ranked first in the face and external categories after analyzing the relevance of the 17 trauma-related departments in six categories of AIS, through the random forest algorithm. Of the 1108 patients in the face category of AIS, the PS group was not correlated with all outcomes, except for the rate of discharge to home (P < .0001). Upon re-verifying the results using random forest, we found that PS did not affect the outcomes. In the external category in AIS, there were 30 patients in the PS group and 56 patients in the non-PS group, and there was no statistically significant difference between the two groups when comparing the outcomes. PS has contributed considerably to the face and external regions among the six AIS categories; however, there was no correlation between plastic surgical treatment and the outcome of trauma patients. We investigated the plastic surgeons' role based on anatomical injury, using machine learning for the first time in the field of trauma care.
Collapse
Affiliation(s)
- Nam Kyu Lim
- Department of Plastic and Reconstructive Surgery, Dankook University College of Medicine, Cheonan, Republic of Korea
- Department of Plastic and Reconstructive Surgery, Dankook University Hospital, Cheonan, Republic of Korea
- *Correspondence: Nam Kyu Lim, Department of Plastic and Reconstructive surgery, Dankook University College of Medicine, 119 Dandae-ro, Dongnam-gu, Cheonan, Chungnam 31116, Republic of Korea (e-mail: )
| | - Jong Hyun Park
- Department of Plastic and Reconstructive Surgery, Dankook University Hospital, Cheonan, Republic of Korea
| |
Collapse
|
3
|
Comprehensive Analysis of LINC01615 in Head and Neck Squamous Cell Carcinoma: A Hub Biomarker Identified by Machine Learning and Experimental Validation. JOURNAL OF ONCOLOGY 2022; 2022:5039962. [PMID: 35794984 PMCID: PMC9252709 DOI: 10.1155/2022/5039962] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 05/27/2022] [Accepted: 05/28/2022] [Indexed: 11/17/2022]
Abstract
Background Head and neck squamous cell carcinoma (HNSCC) is one of the most common cancers, but in clinical practice, the lack of precise biomarkers often results in an advanced diagnosis. Hence, it is crucial to explore novel biomarkers to improve the clinical outcome of HNSCC patients. Methods We downloaded RNA-seq data consisting of 502 HNSCC tissues and 44 normal tissues from the TCGA database, and lncRNA genomic sequence information was downloaded from the GENECODE database for annotating lncRNA expression profiles. We used Cox regression analysis to screen prognostic lncRNAs, the threshold as HR >1 and p value <0.05. Subsequently, three survival outcomes (overall survival, progress-free interval, and disease-specific survival)-related lncRNAs overlapped to get the common lncRNAs. The hub biomarker was identified using LASSO and random forest models. Subsequently, we used a variety of statistical methods to validate the prognostic ability of the hub marker. In addition, Spearman correlation analysis between the hub marker expression and genomic heterogeneity was conducted, such as instability (MSI), homologous recombination deficiency (HRD), and tumor mutational burden (TMB). Finally, we used enrichment analysis, ssGSEA, and ESTIMATE algorithms to explore the changes in the underlying immune-related pathway and function. Finally, the MTT assay and transwell assay were performed to determine the effect of LINC01615 silencing on tumor cell proliferation, invasion, and migration. Results Cox regression analysis revealed 133 lncRNAs with multiple prognostic significance. The machine learning algorithm screened out the hub lncRNA with the highest importance in the RF model: LINC01615. Clinical correlation analysis revealed that the LINC01615 increased with increasing the T stage, N stage, pathology grade, and clinical stage. LINC01615 could be used as a predictor of HNSCC prognosis validating by a variety of statistical methods. Subsequently, when clinical indicators were combined with the LINC01615 expression, the visualization model (nomogram) was more applicable to clinical practice. Finally, immune algorithms indicated that LINC01615 may be involved in the regulation of lymphocyte recruitment and immunological infiltration in HNSCC, and the LINC01615 expression represented genomic heterogeneity in pan-cancer. Functionally, silencing of LINC01615 suppresses cell proliferation, invasion, and migration in HEP-2 and TU212 cells. Conclusion LINC01615 may play an important role in the prostromal cell enrichment and immunosuppressive state and serve as a prognostic biomarker in HNSCC.
Collapse
|
4
|
Dong H, Wang X. Identification of Signature Genes and Construction of an Artificial Neural Network Model of Prostate Cancer. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:1562511. [PMID: 35432828 PMCID: PMC9010146 DOI: 10.1155/2022/1562511] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 03/21/2022] [Accepted: 03/23/2022] [Indexed: 11/22/2022]
Abstract
This study aimed to establish an artificial neural network (ANN) model based on prostate cancer signature genes (PCaSGs) to predict the patients with prostate cancer (PCa). In the present study, 270 differentially expressed genes (DEGs) were identified between PCa and normal prostate (NP) groups by differential gene expression analysis. Next, we performed Metascape gene annotation, pathway and process enrichment analysis, and PPI enrichment analysis on all 270 DEGs. Then, we identified and screened out 30 PCaSGs based on the random forest analysis and constructed an ANN model based on the gene score matrix consisting of 30 PCaSGs. Lastly, analysis of microarray dataset GSE46602 showed that the accuracy of this model for predicating PCa and NP samples was 88.9 and 78.6%, respectively. Our results suggested that the ANN model based on PCaSGs can be used for effectively predicting the patients with PCa and will be helpful for early PCa diagnosis and treatment.
Collapse
Affiliation(s)
- Hongye Dong
- Department of Kidney Disease and Blood Purifification Center, The Second Hospital of Tianjin Medical University, Tianjin 300211, China
| | - Xu Wang
- Department of Urology, Tianjin Institute of Urology, The Second Hospital of Tianjin Medical University, Tianjin 300211, China
| |
Collapse
|
5
|
Dai P, Chang W, Xin Z, Cheng H, Ouyang W, Luo A. Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression. Front Public Health 2021; 9:678276. [PMID: 34211956 PMCID: PMC8239170 DOI: 10.3389/fpubh.2021.678276] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 05/17/2021] [Indexed: 11/17/2022] Open
Abstract
Aim: With the improvement in people's living standards, the incidence of chronic renal failure (CRF) is increasing annually. The increase in the number of patients with CRF has significantly increased pressure on China's medical budget. Predicting hospitalization expenses for CRF can provide guidance for effective allocation and control of medical costs. The purpose of this study was to use the random forest (RF) method and least absolute shrinkage and selection operator (LASSO) regression to predict personal hospitalization expenses of hospitalized patients with CRF and to evaluate related influencing factors. Methods: The data set was collected from the first page of data of the medical records of three tertiary first-class hospitals for the whole year of 2016. Factors influencing hospitalization expenses for CRF were analyzed. Random forest and least absolute shrinkage and selection operator regression models were used to establish a prediction model for the hospitalization expenses of patients with CRF, and comparisons and evaluations were carried out. Results: For CRF inpatients, statistically significant differences in hospitalization expenses were found for major procedures, medical payment method, hospitalization frequency, length of stay, number of other diagnoses, and number of procedures. The R2 of LASSO regression model and RF regression model are 0.6992 and 0.7946, respectively. The mean absolute error (MAE) and root mean square error (RMSE) of the LASSO regression model were 0.0268 and 0.043, respectively, and the MAE and RMSE of the RF prediction model were 0.0171 and 0.0355, respectively. In the RF model, and the weight of length of stay was the highest (0.730). Conclusions: The hospitalization expenses of patients with CRF are most affected by length of stay. The RF prediction model is superior to the LASSO regression model and can be used to predict the hospitalization expenses of patients with CRF. Health administration departments may consider formulating accurate individualized hospitalization expense reimbursement mechanisms accordingly.
Collapse
Affiliation(s)
- Pingping Dai
- Key Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, China.,Department of Medical Information, School of Life Science, Central South University, Changsha, China
| | - Weifu Chang
- Key Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, China
| | - Zirui Xin
- Key Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, China.,Department of Medical Information, School of Life Science, Central South University, Changsha, China
| | - Haiwei Cheng
- Department of Sociology, Central South University, Changsha, China
| | - Wei Ouyang
- Key Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, China.,Department of Medical Information, School of Life Science, Central South University, Changsha, China
| | - Aijing Luo
- Second Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|