Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li Q, Yang H, Wang P, Liu X, Lv K, Ye M. XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer. J Transl Med 2022;20:177. [PMID: 35436939 PMCID: PMC9014628 DOI: 10.1186/s12967-022-03369-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/26/2022] [Indexed: 12/23/2022] Open

For:	Li Q, Yang H, Wang P, Liu X, Lv K, Ye M. XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer. J Transl Med 2022;20:177. [PMID: 35436939 PMCID: PMC9014628 DOI: 10.1186/s12967-022-03369-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/26/2022] [Indexed: 12/23/2022] Open

Number

Cited by Other Article(s)

Rawshani A, Hessulf F, Deminger J, Sultanian P, Gupta V, Lundgren P, Mohammed M, Abu Al Chay M, Siöland T, Gryska E, Piasecki A. Prediction of neurologic outcome after out-of-hospital cardiac arrest: an interpretable approach with machine learning. Resuscitation 2024:110359. [PMID: 39142467 DOI: 10.1016/j.resuscitation.2024.110359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 08/02/2024] [Accepted: 08/08/2024] [Indexed: 08/16/2024]

Abstract

Out-of-hospital cardiac arrest (OHCA) is a critical condition with low survival rates. In patients with a return of spontaneous circulation, brain injury is a leading cause of death. In this study, we propose an interpretable machine learning approach for predicting neurologic outcome after OHCA, using information available at the time of hospital admission.

METHODS

The study population were 55 615 OHCA cases registered in the Swedish Cardiopulmonary Resuscitation Registry between 2010 and 2020. The dataset was split to training and validation sets (for model development) and test set (for evaluation of the final model). We used an XGBoost algorithm with stratified, repeated 10-fold cross-validation along with Optuna framework for hyperparameters tuning. The final model was trained on 10 features selected based on the importance scores and evaluated on the test set in terms of discrimination, calibration and bias-variance tradeoff. We used SHapley Additive exPlanations to address the 'black-box' model and align with eXplainable artificial intelligence.

RESULTS

The final model achieved: area under the receiver operating characteristic value 0.964 (95% confidence interval (CI) [0.960-0.968]), sensitivity 0.606 (95% CI [0.573-0.634]), specificity 0.975 (95% CI [0.972-0.978]), positive predictive value (PPV) 0.664 (95% CI [0.625-0.696]), negative predictive value (NPV) 0.969 (95% CI [0.966-0.972]), macro F1 0.803 (95% CI [0.788-0.816]), and showed a very good calibration. SHAP features with the highest impact on the model's output were: 'ROSC on arrival to hospital', 'Initial rhythm asystole' and 'Conscious on arrival to hospital'.

CONCLUSIONS

The XGBoost machine learning model with 10 features available at the time of hospital admission showed good performance for predicting neurologic outcome after OHCA, with no apparent signs of overfitting.

Collapse

Affiliation(s)

Araz Rawshani Department of Molecular and Clinical Medicine, Institute of Medicine, University of Gothenburg, Wallenberg Laboratory, Blå stråket 5, Sahlgrenska University Hospital, 413 45 Gothenburg, Sweden; Department of Cardiology, Sahlgrenska University Hospital, Blå stråket 5, 413 45 Gothenburg, Sweden; The Swedish Registry for Cardiopulmonary Resuscitation, Medicinaregatan 18G, 413 90 Gothenburg, Sweden
Fredrik Hessulf Department of Anesthesiology and Intensive Care, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Blå stråket 5, 413 45 Gothenburg, Sweden; Department of Anesthesiology and Intensive Care, Sahlgrenska University Hospital, Göteborgsvägen 31, 431 30 Mölndal, Sweden
John Deminger Department of Medicine and Emergency Care, Sahlgrenska University Hospital, Göteborgsvägen 33, 431 30 Mölndal, Sweden
Pedram Sultanian Department of Molecular and Clinical Medicine, Institute of Medicine, University of Gothenburg, Wallenberg Laboratory, Blå stråket 5, Sahlgrenska University Hospital, 413 45 Gothenburg, Sweden
Vibha Gupta Department of Molecular and Clinical Medicine, Institute of Medicine, University of Gothenburg, Wallenberg Laboratory, Blå stråket 5, Sahlgrenska University Hospital, 413 45 Gothenburg, Sweden
Peter Lundgren Department of Molecular and Clinical Medicine, Institute of Medicine, University of Gothenburg, Wallenberg Laboratory, Blå stråket 5, Sahlgrenska University Hospital, 413 45 Gothenburg, Sweden; Department of Cardiology, Sahlgrenska University Hospital, Blå stråket 5, 413 45 Gothenburg, Sweden
Mohammed Mohammed Department of Cardiology, Sahlgrenska University Hospital, Blå stråket 5, 413 45 Gothenburg, Sweden
Moner Abu Al Chay Department of Cardiology, Sahlgrenska University Hospital, Blå stråket 5, 413 45 Gothenburg, Sweden
Tobias Siöland Department of Anesthesiology and Intensive Care, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Blå stråket 5, 413 45 Gothenburg, Sweden; Department of Anesthesiology and Intensive Care, Sahlgrenska University Hospital, Göteborgsvägen 31, 431 30 Mölndal, Sweden
Emilia Gryska Department of Hand Surgery, Sahlgrenska University Hospital, Göteborgsvägen 31, 431 30 Mölndal, Sweden; Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Adam Piasecki Department of Anesthesiology and Intensive Care, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Blå stråket 5, 413 45 Gothenburg, Sweden; Department of Anesthesiology and Intensive Care, Sahlgrenska University Hospital, Göteborgsvägen 31, 431 30 Mölndal, Sweden.

Collapse

Zhang Z, Lan H, Zhao S. Analysis of the Value of Quantitative Features in Multimodal MRI Images to Construct a Radio-Omics Model for Breast Cancer Diagnosis. BREAST CANCER (DOVE MEDICAL PRESS) 2024;16:305-318. [PMID: 38895649 PMCID: PMC11182731 DOI: 10.2147/bctt.s458036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 05/24/2024] [Indexed: 06/21/2024]

Abstract

Objective

To analyze the diagnostic value of quantitative features in multimodal magnetic resonance imaging (MRI) images to construct a radio-omics model for breast cancer.

Methods

Ninety-five patients with breast-related diseases from January 2020 to January 2021 were grouped into the benign group (n=57) and malignant group (n=38) according to the pathological findings. All cases were randomized as the training group (n=66) and validation group (n=29) in a 7:3 ratio based on the examination time. All subjects were examined by T1-weighted imaging (T1WI), T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), dynamic contrast enhancement (DCE), and apparent diffusion coefficient (ADC) multimodality MRI. The MRI findings were analyzed against pathological findings. A diagnostic breast cancer radiomics model was constructed. The diagnostic efficacy of the model in the validation group was analyzed, and the diagnostic efficacy was analyzed via the ROC curve.

Results

Fibroadenoma accounted for 49.12% of benign breast diseases, and invasive ductal carcinoma accounted for 73.68% of malignant breast diseases. The sensitivity of T1WI, T2WI, DWI, ADC, and DCE in diagnosing breast cancer was 61.14%, 66.67%, 73.30%, 78.95%, and 85.96%, using the four-fold table method. The area under the curves (AUCs) of T1WI, T2WI, DWI, ADC, and DCE for diagnosing breast cancer were 0.715, 0.769, 0.785, 0.835, and 0.792, respectively. The AUCs of plain scan, diffuse, enhanced, plain scan + diffuse, plain scan + enhanced, enhanced + diffuse, and plain scan + enhanced + diffuse for diagnosing breast cancer were 0.746, 0.798, 0.816, 0.839, 0.890, 0.906, and 0.927, respectively.

Conclusion

The construction of a radio-omics model by quantitative features in multimodal MRI images was valuable in the diagnosis of breast cancer. The value of radio-omics models such as plain scan + enhanced + diffuse was higher than the other models in diagnosing breast cancer and could be widely applied in clinical practice.

Collapse

Patil AR, Schug J, Liu C, Lahori D, Descamps HC, Naji A, Kaestner KH, Faryabi RB, Vahedi G. Modeling type 1 diabetes progression using machine learning and single-cell transcriptomic measurements in human islets. Cell Rep Med 2024;5:101535. [PMID: 38677282 PMCID: PMC11148720 DOI: 10.1016/j.xcrm.2024.101535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 01/22/2024] [Accepted: 04/07/2024] [Indexed: 04/29/2024]

Affiliation(s)

Abhijeet R Patil Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Jonathan Schug Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Chengyang Liu Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Deeksha Lahori Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Hélène C Descamps Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Ali Naji Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Klaus H Kaestner Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Robert B Faryabi Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Abramson Family Cancer Research Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
Golnaz Vahedi Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Immunology and Immune Health, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Diabetes, Obesity and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Abramson Family Cancer Research Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA.

Collapse

Su L, Hounye AH, Pan Q, Miao K, Wang J, Hou M, Xiong L. Explainable cancer factors discovery: Shapley additive explanation for machine learning models demonstrates the best practices in the case of pancreatic cancer. Pancreatology 2024;24:404-423. [PMID: 38342661 DOI: 10.1016/j.pan.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/07/2024] [Accepted: 02/05/2024] [Indexed: 02/13/2024]

Klauschen F, Dippel J, Keyl P, Jurmeister P, Bockmayr M, Mock A, Buchstab O, Alber M, Ruff L, Montavon G, Müller KR. Toward Explainable Artificial Intelligence for Precision Pathology. ANNUAL REVIEW OF PATHOLOGY 2024;19:541-570. [PMID: 37871132 DOI: 10.1146/annurev-pathmechdis-051222-113147] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Affiliation(s)

Frederick Klauschen Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Jonas Dippel Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany;
Philipp Keyl Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
Philipp Jurmeister Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Michael Bockmayr Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Department of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany Research Institute Children's Cancer Center Hamburg, Hamburg, Germany
Andreas Mock Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany; German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
Oliver Buchstab Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
Maximilian Alber Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany Aignostics, Berlin, Germany
Lukas Ruff Aignostics, Berlin, Germany
Grégoire Montavon Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
Klaus-Robert Müller Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany; Department of Artificial Intelligence, Korea University, Seoul, Korea Max Planck Institute for Informatics, Saarbrücken, Germany

Collapse

Teng X, Wang Z. Online COVID-19 diagnosis prediction using complete blood count: an innovative tool for public health. BMC Public Health 2023;23:2536. [PMID: 38114942 PMCID: PMC10729447 DOI: 10.1186/s12889-023-17477-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 12/13/2023] [Indexed: 12/21/2023] Open

Abstract

BACKGROUND

COVID-19, caused by SARS-CoV-2, presents distinct diagnostic challenges due to its wide range of clinical manifestations and the overlapping symptoms with other common respiratory diseases. This study focuses on addressing these difficulties by employing machine learning (ML) methodologies, particularly the XGBoost algorithm, to utilize Complete Blood Count (CBC) parameters for predictive analysis.

METHODS

We performed a retrospective study involving 2114 COVID-19 patients treated between December 2022 and January 2023 at our healthcare facility. These patients were classified into fever (1057 patients) and pneumonia groups (1057 patients), based on their clinical symptoms. The CBC data were utilized to create predictive models, with model performance evaluated through metrics like Area Under the Receiver Operating Characteristics Curve (AUC), accuracy, sensitivity, specificity, and precision. We selected the top 10 predictive variables based on their significance in disease prediction. The data were then split into a training set (70% of patients) and a validation set (30% of patients) for model validation.

RESULTS

We identified 31 indicators with significant disparities. The XGBoost model outperformed others, with an AUC of 0.920 and high precision, sensitivity, specificity, and accuracy. The top 10 features (Age, Monocyte%, Mean Platelet Volume, Lymphocyte%, SIRI, Eosinophil count, Platelet count, Hemoglobin, Platelet Distribution Width, and Neutrophil count.) were crucial in constructing a more precise predictive model. The model demonstrated strong performance on both training (AUC = 0.977) and validation (AUC = 0.912) datasets, validated by decision curve analysis and calibration curve.

CONCLUSION

ML models that incorporate CBC parameters offer an innovative and effective tool for data analysis in COVID-19. They potentially enhance diagnostic accuracy and the efficacy of therapeutic interventions, ultimately contributing to a reduction in the mortality rate of this infectious disease.

Collapse

Umar H, Aliyu MR, Usman AG, Ghali UM, Abba SI, Ozsahin DU. Prediction of cell migration potential on human breast cancer cells treated with Albizia lebbeck ethanolic extract using extreme machine learning. Sci Rep 2023;13:22242. [PMID: 38097683 PMCID: PMC10721884 DOI: 10.1038/s41598-023-49363-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023] Open

Abstract

Cancer is one of the major causes of death in the modern world, and the incidence varies considerably based on race, ethnicity, and region. Novel cancer treatments, such as surgery and immunotherapy, are ineffective and expensive. In this situation, ion channels responsible for cell migration have appeared to be the most promising targets for cancer treatment. This research presents findings on the organic compounds present in Albizia lebbeck ethanolic extracts (ALEE), as well as their impact on the anti-migratory, anti-proliferative and cytotoxic potentials on MDA-MB 231 and MCF-7 human breast cancer cell lines. In addition, artificial intelligence (AI) based models, multilayer perceptron (MLP), extreme gradient boosting (XGB), and extreme learning machine (ELM) were performed to predict in vitro cancer cell migration on both cell lines, based on our experimental data. The organic compounds composition of the ALEE was studied using gas chromatography-mass spectrometry (GC-MS) analysis. Cytotoxicity, anti-proliferations, and anti-migratory activity of the extract using Tryphan Blue, MTT, and Wound Heal assay, respectively. Among the various concentrations (2.5-200 μg/mL) of the ALEE that were used in our study, 2.5-10 μg/mL revealed anti-migratory potential with increased concentrations, and they did not show any effect on the proliferation of the cells (P < 0.05; n ≥ 3). Furthermore, the three data-driven models, Multi-layer perceptron (MLP), Extreme gradient boosting (XGB), and Extreme learning machine (ELM), predict the potential migration ability of the extract on the treated cells based on our experimental data. Overall, the concentrations of the plant extract that do not affect the proliferation of the type cells used demonstrated promising effects in reducing cell migration. XGB outperformed the MLP and ELM models and increased their performance efficiency by up to 3% and 1% for MCF and 1% and 2% for MDA-MB231, respectively, in the testing phase.

Collapse

Wang Y, Wei B, Zhao T, Shen H, Liu X, Wang J, Wang Q, Shen R, Feng D. Machine learning-based prediction models for parathyroid carcinoma using pre-surgery cognitive function and clinical features. Sci Rep 2023;13:19007. [PMID: 37923800 PMCID: PMC10624903 DOI: 10.1038/s41598-023-46294-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 10/30/2023] [Indexed: 11/06/2023] Open

Chen G, Dai X, Zhang M, Tian Z, Jin X, Mei K, Huang H, Wu Z. Machine learning-based prediction model and visual interpretation for prostate cancer. BMC Urol 2023;23:164. [PMID: 37838656 PMCID: PMC10576344 DOI: 10.1186/s12894-023-01316-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 09/03/2023] [Indexed: 10/16/2023] Open

Zhao T, Wu H, Wang X, Zhao Y, Wang L, Pan J, Mei H, Han J, Wang S, Lu K, Li M, Gao M, Cao Z, Zhang H, Wan K, Li J, Fang L, Zhang T, Guan X. Integration of eQTL and machine learning to dissect causal genes with pleiotropic effects in genetic regulation networks of seed cotton yield. Cell Rep 2023;42:113111. [PMID: 37676770 DOI: 10.1016/j.celrep.2023.113111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/19/2023] [Accepted: 08/24/2023] [Indexed: 09/09/2023] Open

Affiliation(s)

Ting Zhao Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Hongyu Wu Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Xutong Wang Hubei Hongshan Laboratory, Wuhan 430070, China
Yongyan Zhao Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Luyao Wang Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Jiaying Pan Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Huan Mei Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Jin Han Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Siyuan Wang Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Kening Lu State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R & D Engineering Center (the Ministry of Education), College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
Menglin Li State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R & D Engineering Center (the Ministry of Education), College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
Mengtao Gao State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R & D Engineering Center (the Ministry of Education), College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
Zeyi Cao Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Hailin Zhang Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China
Ke Wan State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R & D Engineering Center (the Ministry of Education), College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
Jie Li State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R & D Engineering Center (the Ministry of Education), College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China
Lei Fang Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Tianzhen Zhang Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
Xueying Guan Zhejiang Provincial Key Laboratory of Crop Genetic Resources, The Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 300058, China; Hainan Institute of Zhejiang University, Building 11, Yonyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China.

Collapse

Guan SW, Lin Q, Wu XD, Yu HB. Weighted gene coexpression network analysis and machine learning reveal oncogenome associated microbiome plays an important role in tumor immunity and prognosis in pan-cancer. J Transl Med 2023;21:537. [PMID: 37573394 PMCID: PMC10422781 DOI: 10.1186/s12967-023-04411-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Accepted: 08/02/2023] [Indexed: 08/14/2023] Open

Abstract

BACKGROUND

For many years, the role of the microbiome in tumor progression, particularly the tumor microbiome, was largely overlooked. The connection between the tumor microbiome and the tumor genome still requires further investigation.

METHODS

The TCGA microbiome and genome data were obtained from Haziza et al.'s article and UCSC Xena database, respectively. Separate WGCNA networks were constructed for the tumor microbiome and genomic data after filtering the datasets. Correlation analysis between the microbial and mRNA modules was conducted to identify oncogenome associated microbiome module (OAM) modules, with three microbial modules selected for each tumor type. Reactome analysis was used to enrich biological processes. Machine learning techniques were implemented to explore the tumor type-specific enrichment and prognostic value of OAM, as well as the ability of the tumor microbiome to differentiate TP53 mutations.

RESULTS

We constructed a total of 182 tumor microbiome and 570 mRNA WGCNA modules. Our results show that there is a correlation between tumor microbiome and tumor genome. Gene enrichment analysis results suggest that the genes in the mRNA module with the highest correlation with the tumor microbiome group are mainly enriched in infection, transcriptional regulation by TP53 and antigen presentation. The correlation analysis of OAM with CD8+ T cells or TAM1 cells suggests the existence of many microbiota that may be involved in tumor immune suppression or promotion, such as Williamsia in breast cancer, Biostraticola in stomach cancer, Megasphaera in cervical cancer and Lottiidibacillus in ovarian cancer. In addition, the results show that the microbiome-genome prognostic model has good predictive value for short-term prognosis. The analysis of tumor TP53 mutations shows that tumor microbiota has a certain ability to distinguish TP53 mutations, with an AUROC value of 0.755. The tumor microbiota with high importance scores are Corallococcus, Bacillus and Saezia. Finally, we identified a potential anti-cancer microbiota, Tissierella, which has been shown to be associated with improved prognosis in tumors including breast cancer, lung adenocarcinoma and gastric cancer.

CONCLUSION

There is an association between the tumor microbiome and the tumor genome, and the existence of this association is not accidental and could change the landscape of tumor research.

Collapse

Mirza Z, Ansari MS, Iqbal MS, Ahmad N, Alganmi N, Banjar H, Al-Qahtani MH, Karim S. Identification of Novel Diagnostic and Prognostic Gene Signature Biomarkers for Breast Cancer Using Artificial Intelligence and Machine Learning Assisted Transcriptomics Analysis. Cancers (Basel) 2023;15:3237. [PMID: 37370847 DOI: 10.3390/cancers15123237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/10/2023] [Accepted: 06/13/2023] [Indexed: 06/29/2023] Open

Abstract

BACKGROUND

Breast cancer (BC) is one of the most common female cancers. Clinical and histopathological information is collectively used for diagnosis, but is often not precise. We applied machine learning (ML) methods to identify the valuable gene signature model based on differentially expressed genes (DEGs) for BC diagnosis and prognosis.

METHODS

A cohort of 701 samples from 11 GEO BC microarray datasets was used for the identification of significant DEGs. Seven ML methods, including RFECV-LR, RFECV-SVM, LR-L1, SVC-L1, RF, and Extra-Trees were applied for gene reduction and the construction of a diagnostic model for cancer classification. Kaplan-Meier survival analysis was performed for prognostic signature construction. The potential biomarkers were confirmed via qRT-PCR and validated by another set of ML methods including GBDT, XGBoost, AdaBoost, KNN, and MLP.

RESULTS

We identified 355 DEGs and predicted BC-associated pathways, including kinetochore metaphase signaling, PTEN, senescence, and phagosome-formation pathways. A hub of 28 DEGs and a novel diagnostic nine-gene signature (COL10A, S100P, ADAMTS5, WISP1, COMP, CXCL10, LYVE1, COL11A1, and INHBA) were identified using stringent filter conditions. Similarly, a novel prognostic model consisting of eight-gene signatures (CCNE2, NUSAP1, TPX2, S100P, ITM2A, LIFR, TNXA, and ZBTB16) was also identified using disease-free survival and overall survival analysis. Gene signatures were validated by another set of ML methods. Finally, qRT-PCR results confirmed the expression of the identified gene signatures in BC.

CONCLUSION

The ML approach helped construct novel diagnostic and prognostic models based on the expression profiling of BC. The identified nine-gene signature and eight-gene signatures showed excellent potential in BC diagnosis and prognosis, respectively.

Collapse

Guan X, Du Y, Ma R, Teng N, Ou S, Zhao H, Li X. Construction of the XGBoost model for early lung cancer prediction based on metabolic indices. BMC Med Inform Decis Mak 2023;23:107. [PMID: 37312179 DOI: 10.1186/s12911-023-02171-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 04/05/2023] [Indexed: 06/15/2023] Open

Fonseca-Montaño MA, Vázquez-Santillán KI, Hidalgo-Miranda A. The current advances of lncRNAs in breast cancer immunobiology research. Front Immunol 2023;14:1194300. [PMID: 37342324 PMCID: PMC10277570 DOI: 10.3389/fimmu.2023.1194300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Accepted: 05/24/2023] [Indexed: 06/22/2023] Open

Applying Explainable Machine Learning Models for Detection of Breast Cancer Lymph Node Metastasis in Patients Eligible for Neoadjuvant Treatment. Cancers (Basel) 2023;15:cancers15030634. [PMID: 36765592 PMCID: PMC9913601 DOI: 10.3390/cancers15030634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/16/2023] [Accepted: 01/17/2023] [Indexed: 01/22/2023] Open

Yuan L, Ji M, Wang S, Wen X, Huang P, Shen L, Xu J. Machine learning model identifies aggressive acute pancreatitis within 48 h of admission: a large retrospective study. BMC Med Inform Decis Mak 2022;22:312. [PMID: 36447180 PMCID: PMC9707001 DOI: 10.1186/s12911-022-02066-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 11/23/2022] [Indexed: 12/05/2022] Open

Abstract

BACKGROUND

Acute pancreatitis (AP) with critical illness is linked to increased morbidity and mortality. Current risk scores to identify high-risk AP patients have certain limitations.

OBJECTIVE

To develop and validate a machine learning tool within 48 h after admission for predicting which patients with AP will develop critical illness based on ubiquitously available clinical, laboratory, and radiologic variables.

METHODS

5460 AP patients were enrolled. Clinical, laboratory, and imaging variables were collected within 48 h after hospital admission. Least Absolute Shrinkage Selection Operator with bootstrap method was employed to select the most informative variables. Five different machine learning models were constructed to predictive likelihood of critical illness, and the optimal model (APCU) was selected. External cohort was used to validate APCU. APCU and other risk scores were compared using multivariate analysis. Models were evaluated by area under the curve (AUC). The decision curve analysis was employed to evaluate the standardized net benefit.

RESULTS

Xgboost was constructed and selected as APCU, involving age, comorbid disease, mental status, pulmonary infiltrates, procalcitonin (PCT), neutrophil percentage (Neu%), ALT/AST, ratio of albumin and globulin, cholinesterase, Urea, Glu, AST and serum total cholesterol. The APCU performed excellently in discriminating AP risk in internal cohort (AUC = 0.95) and external cohort (AUC = 0.873). The APCU was significant for biliogenic AP (OR = 4.25 [2.08-8.72], P < 0.001), alcoholic AP (OR = 3.60 [1.67-7.72], P = 0.001), hyperlipidemic AP (OR = 2.63 [1.28-5.37], P = 0.008) and tumor AP (OR = 4.57 [2.14-9.72], P < 0.001). APCU yielded the highest clinical net benefit, comparatively.

CONCLUSION

Machine learning tool based on ubiquitously available clinical variables accurately predicts the development of AP, optimizing the management of AP.

Collapse

Li Q, Wang P, Yuan J, Zhou Y, Mei Y, Ye M. A two-stage hybrid gene selection algorithm combined with machine learning models to predict the rupture status in intracranial aneurysms. Front Neurosci 2022;16:1034971. [PMID: 36340761 PMCID: PMC9631203 DOI: 10.3389/fnins.2022.1034971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 09/30/2022] [Indexed: 07/31/2023] Open

Abstract

An IA is an abnormal swelling of cerebral vessels, and a subset of these IAs can rupture causing aneurysmal subarachnoid hemorrhage (aSAH), often resulting in death or severe disability. Few studies have used an appropriate method of feature selection combined with machine learning by analyzing transcriptomic sequencing data to identify new molecular biomarkers. Following gene ontology (GO) and enrichment analysis, we found that the distinct status of IAs could lead to differential innate immune responses using all 913 differentially expressed genes, and considering that there are numerous irrelevant and redundant genes, we propose a mixed filter- and wrapper-based feature selection. First, we used the Fast Correlation-Based Filter (FCBF) algorithm to filter a large number of irrelevant and redundant genes in the raw dataset, and then used the wrapper feature selection method based on the he Multi-layer Perceptron (MLP) neural network and the Particle Swarm Optimization (PSO), accuracy (ACC) and mean square error (MSE) were then used as the evaluation criteria. Finally, we constructed a novel 10-gene signature (YIPF1, RAB32, WDR62, ANPEP, LRRCC1, AADAC, GZMK, WBP2NL, PBX1, and TOR1B) by the proposed two-stage hybrid algorithm FCBF-MLP-PSO and used different machine learning models to predict the rupture status in IAs. The highest ACC value increased from 0.817 to 0.919 (12.5% increase), the highest area under ROC curve (AUC) value increased from 0.87 to 0.94 (8.0% increase), and all evaluation metrics improved by approximately 10% after being processed by our proposed gene selection algorithm. Therefore, these 10 informative genes used to predict rupture status of IAs can be used as complements to imaging examinations in the clinic, meanwhile, this selected gene signature also provides new targets and approaches for the treatment of ruptured IAs.

Collapse

Sun CK, Tang YX, Liu TC, Lu CJ. An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:ijerph19159756. [PMID: 35955112 PMCID: PMC9368335 DOI: 10.3390/ijerph19159756] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/02/2022] [Accepted: 08/06/2022] [Indexed: 05/09/2023]

Mammographic Classification of Breast Cancer Microcalcifications through Extreme Gradient Boosting. ELECTRONICS 2022. [DOI: 10.3390/electronics11152435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]