1
|
Chen M, Li Y, Zhou S, Zou L, Yu L, Deng T, Rong X, Shao S, Wu J. Establishment of a risk prediction model for olfactory disorders in patients with transnasal pituitary tumors by machine learning. Sci Rep 2024; 14:12514. [PMID: 38822064 PMCID: PMC11143333 DOI: 10.1038/s41598-024-62963-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 05/23/2024] [Indexed: 06/02/2024] Open
Abstract
To construct a prediction model of olfactory dysfunction after transnasal sellar pituitary tumor resection based on machine learning algorithms. A cross-sectional study was conducted. From January to December 2022, 158 patients underwent transnasal sellar pituitary tumor resection in three tertiary hospitals in Sichuan Province were selected as the research objects. The olfactory status was evaluated one week after surgery. They were randomly divided into a training set and a test set according to the ratio of 8:2. The training set was used to construct the prediction model, and the test set was used to evaluate the effect of the model. Based on different machine learning algorithms, BP neural network, logistic regression, decision tree, support vector machine, random forest, LightGBM, XGBoost, and AdaBoost were established to construct olfactory dysfunction risk prediction models. The accuracy, precision, recall, F1 score, and area under the ROC curve (AUC) were used to evaluate the model's prediction performance, the optimal prediction model algorithm was selected, and the model was verified in the test set of patients. Of the 158 patients, 116 (73.42%) had postoperative olfactory dysfunction. After missing value processing and feature screening, an essential order of influencing factors of olfactory dysfunction was obtained. Among them, the duration of operation, gender, type of pituitary tumor, pituitary tumor apoplexy, nasal adhesion, age, cerebrospinal fluid leakage, blood scar formation, and smoking history became the risk factors of olfactory dysfunction, which were the key indicators of the construction of the model. Among them, the random forest model had the highest AUC of 0.846, and the accuracy, precision, recall, and F1 score were 0.750, 0.870, 0.947, and 0.833, respectively. Compared with the BP neural network, logistic regression, decision tree, support vector machine, LightGBM, XGBoost, and AdaBoost, the random forest model has more advantages in predicting olfactory dysfunction in patients after transnasal sellar pituitary tumor resection, which is helpful for early identification and intervention of high-risk clinical population, and has good clinical application prospects.
Collapse
Affiliation(s)
- Min Chen
- Department of Neurosurgery, Deyang People's Hospital, Deyang, 618000, China
| | - Yuxin Li
- School of Nursing, North Sichuan Medical College, Nanchong, 637000, China
- Department of Nursing, Deyang People's Hospital, Deyang, 618000, Sichuan, China
| | - Sumei Zhou
- Department of Neurosurgery, Deyang People's Hospital, Deyang, 618000, China
| | - Linbo Zou
- Department of Neurosurgery, Deyang People's Hospital, Deyang, 618000, China
| | - Lei Yu
- Institute of Complex Systems, Shanxi University, Taiyuan, 030001, China
| | - Tianfang Deng
- Department of Neurosurgery, Deyang People's Hospital, Deyang, 618000, China
| | - Xian Rong
- Sichuan Nursing Vocational College, Chengdu, 610110, China.
| | - Shirong Shao
- Department of Neurosurgery, Deyang People's Hospital, Deyang, 618000, China.
| | - Jijun Wu
- Department of Nursing, Deyang People's Hospital, Deyang, 618000, Sichuan, China.
| |
Collapse
|
2
|
Zhou G, Wang S, Lin L, Lu K, Lin Z, Zhang Z, Zhang Y, Cheng D, Szeto K, Peng R, Luo C. Screening for immune-related biomarkers associated with myasthenia gravis and dilated cardiomyopathy based on bioinformatics analysis and machine learning. Heliyon 2024; 10:e28446. [PMID: 38571624 PMCID: PMC10988011 DOI: 10.1016/j.heliyon.2024.e28446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 03/18/2024] [Accepted: 03/19/2024] [Indexed: 04/05/2024] Open
Abstract
Background We aim to investigate genes associated with myasthenia gravis (MG), specifically those potentially implicated in the pathogenesis of dilated cardiomyopathy (DCM). Additionally, we seek to identify potential biomarkers for diagnosing myasthenia gravis co-occurring with DCM. Methods We obtained two expression profiling datasets related to DCM and MG from the Gene Expression Omnibus (GEO). Subsequently, we conducted differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) on these datasets. The genes exhibiting differential expression common to both DCM and MG were employed for protein-protein interaction (PPI), Gene Ontology (GO) enrichment analysis, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Additionally, machine learning techniques were employed to identify potential biomarkers and develop a diagnostic nomogram for predicting MG-associated DCM. Subsequently, the machine learning results underwent validation using an external dataset. Finally, gene set enrichment analysis (GSEA) and machine algorithm analysis were conducted on pivotal model genes to further elucidate their potential mechanisms in MG-associated DCM. Results In our analysis of both DCM and MG datasets, we identified 2641 critical module genes and 11 differentially expressed genes shared between the two conditions. Enrichment analysis disclosed that these 11 genes primarily pertain to inflammation and immune regulation. Connectivity map (CMAP) analysis pinpointed SB-216763 as a potential drug for DCM treatment. The results from machine learning indicated the substantial diagnostic value of midline 1 interacting protein1 (MID1IP1) and PI3K-interacting protein 1 (PIK3IP1) in MG-associated DCM. These two hub genes were chosen as candidate biomarkers and employed to formulate a diagnostic nomogram with optimal diagnostic performance through machine learning. Simultaneously, single-gene GSEA results and immune cell infiltration analysis unveiled immune dysregulation in both DCM and MG, with MID1IP1 and PIK3IP1 showing significant associations with invasive immune cells. Conclusion We have elucidated the inflammatory and immune pathways associated with MG-related DCM and formulated a diagnostic nomogram for DCM utilizing MID1IP1/PIK3IP1. This contribution offers novel insights for prospective diagnostic approaches and therapeutic interventions in the context of MG coexisting with DCM.
Collapse
Affiliation(s)
- Guiting Zhou
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Shushu Wang
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Liwen Lin
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Kachun Lu
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Zhichao Lin
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Ziyan Zhang
- Zhongshan Traditional Chinese Medicine Hospital, Zhongshan, China
| | - Yuling Zhang
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Danling Cheng
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - KaMan Szeto
- The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Rui Peng
- Shunde Hospital of Guangzhou University of Chinese Medicine, Foshan, China
| | - Chuanjin Luo
- Cardiology Center, The First Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China
- Guangdong Clinical Research Academy of Chinese Medicine, Guangzhou, China
| |
Collapse
|
3
|
Lu J, Ji X, Liu X, Jiang Y, Li G, Fang P, Li W, Zuo A, Guo Z, Yang S, Ji Y, Lu D. Machine learning-based radiomics strategy for prediction of acquired EGFR T790M mutation following treatment with EGFR-TKI in NSCLC. Sci Rep 2024; 14:446. [PMID: 38172228 PMCID: PMC10764785 DOI: 10.1038/s41598-023-50984-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 12/28/2023] [Indexed: 01/05/2024] Open
Abstract
The epidermal growth factor receptor (EGFR) Thr790 Met (T790M) mutation is responsible for approximately half of the acquired resistance to EGFR-tyrosine kinase inhibitor (TKI) in non-small-cell lung cancer (NSCLC) patients. Identifying patients at diagnosis who are likely to develop this mutation after first- or second-generation EGFR-TKI treatment is crucial for better treatment outcomes. This study aims to develop and validate a radiomics-based machine learning (ML) approach to predict the T790M mutation in NSCLC patients at diagnosis. We collected retrospective data from 210 positive EGFR mutation NSCLC patients, extracting 1316 radiomics features from CT images. Using the LASSO algorithm, we selected 10 radiomics features and 2 clinical features most relevant to the mutations. We built models with 7 ML approaches and assessed their performance through the receiver operating characteristic (ROC) curve. The radiomics model and combined model, which integrated radiomics features and relevant clinical factors, achieved an area under the curve (AUC) of 0.80 (95% confidence interval [CI] 0.79-0.81) and 0.86 (0.87-0.88), respectively, in predicting the T790M mutation. Our study presents a convenient and noninvasive radiomics-based ML model for predicting this mutation at the time of diagnosis, aiding in targeted treatment planning for NSCLC patients with EGFR mutations.
Collapse
Affiliation(s)
- Jiameng Lu
- Department of Respiratory, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Institute of Respiratory Diseases, Shandong Institute of Anesthesia and Respiratory Critical Medicine, 16766 Jingshilu, Lixia, Jinan, 250014, Shandong, People's Republic of China
- School of Microelectronics, Shandong University, Jinan, 250100, Shandong, People's Republic of China
| | - Xiaoqing Ji
- Department of Nursing, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, 250014, Shandong, People's Republic of China
| | - Xinyi Liu
- Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
| | - Yunxiu Jiang
- Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
| | - Gang Li
- Department of Radiology, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Medicine and Health Key Laboratory of Abdominal Medicine Imaging, Shandong Lung Cancer Institute, Shandong Institute of Neuroimmunology, Jinan, 250000, Shandong, China
| | - Ping Fang
- Department of Blood Transfusion, The First Affiliated Hospital of Shandong First Medical University and Shandong Province Qianfoshan Hospital, Jinan, 250014, Shandong, China
| | - Wei Li
- Department of Radiology, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Medicine and Health Key Laboratory of Abdominal Medicine Imaging, Shandong Lung Cancer Institute, Shandong Institute of Neuroimmunology, Jinan, 250000, Shandong, China
| | - Anli Zuo
- Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
| | - Zihan Guo
- Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
| | - Shuran Yang
- Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
| | - Yanbo Ji
- Department of Nursing, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, 250014, Shandong, People's Republic of China
| | - Degan Lu
- Department of Respiratory, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Institute of Respiratory Diseases, Shandong Institute of Anesthesia and Respiratory Critical Medicine, 16766 Jingshilu, Lixia, Jinan, 250014, Shandong, People's Republic of China.
| |
Collapse
|
4
|
Li G, Li C, Liu J, Peng H, Lu S, Wei D, Guo J, Wang M, Yang N. Prediction of lymph node metastasis of lung squamous cell carcinoma by machine learning algorithm classifiers. J Cancer Res Ther 2023; 19:1533-1543. [PMID: 38156919 DOI: 10.4103/jcrt.jcrt_2352_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 07/31/2023] [Indexed: 01/03/2024]
Abstract
BACKGROUND Lymph node metastasis (LNM) is an essential factor affecting the prognosis of patients with lung squamous cell carcinoma (LUSC), as well as a critical consideration for the choice of treatment strategy. Exploring effective methods for predicting LNM in LUSC may benefit clinical decision making. MATERIALS AND METHODS We used data collected from the Surveillance, Epidemiology, and End Results (SEER) database to develop machine learning algorithm classifiers, including boosted trees (BTs), based on the primary clinical parameters of patients to predict LNM in LUSC. Training on a large-sample training cohort (n = 8,063) allowed for the construction of several concise classifiers for LNM prediction in LUSC, which were then validated using test and in-house cohorts (n = 2,017 and 57, respectively). RESULTS The six classifiers established in this research enabled distinction between patients with and without LNM. Among these classifiers, the BT classifier was the top performer, with accuracy, F1 scores, precision, recall, sensitivity, and specificity values of 0.654, 0.621, 0.654, 0.592, 0.592, and 0.711, respectively. The precision recall (PR) and receiver operating characteristic (ROC) (with area under the curve = 0.714) curves also supported this result, which was validated by the in-house cohort. Notably, the tumor stage was a critical factor in determining LNM in patients with LUSC. CONCLUSIONS The use of classifiers, especially the BT classifier, may serve as a useful tool for improving clinical precision and individualized treatment of patients with LUSC.
Collapse
Affiliation(s)
- Guosheng Li
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Changqian Li
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jun Liu
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Huajian Peng
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Shuyu Lu
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Donglin Wei
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jianji Guo
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Meijing Wang
- Department of Cardiothoracic Surgery, Guilin People's Hospital, Guilin, China
| | - Nuo Yang
- Department of Cardiothoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| |
Collapse
|
5
|
Cui C, Chen FL, Li LQ. [Recent research on machine learning in the diagnosis and treatment of necrotizing enterocolitis in neonates]. ZHONGGUO DANG DAI ER KE ZA ZHI = CHINESE JOURNAL OF CONTEMPORARY PEDIATRICS 2023; 25:767-773. [PMID: 37529961 PMCID: PMC10414163 DOI: 10.7499/j.issn.1008-8830.2302165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/08/2023] [Indexed: 08/03/2023]
Abstract
Necrotizing enterocolitis (NEC), with the main manifestations of bloody stool, abdominal distension, and vomiting, is one of the leading causes of death in neonates, and early identification and diagnosis are crucial for the prognosis of NEC. The emergence and development of machine learning has provided the potential for early, rapid, and accurate identification of this disease. This article summarizes the algorithms of machine learning recently used in NEC, analyzes the high-risk predictive factors revealed by these algorithms, evaluates the ability and characteristics of machine learning in the etiology, definition, and diagnosis of NEC, and discusses the challenges and prospects for the future application of machine learning in NEC.
Collapse
Affiliation(s)
- Cheng Cui
- Department of Neonatology, Children's Hospital of Chongqing Medical University/National Clinical Research Center for Child Health and Disorders/Ministry of Education Key Laboratory of Child Development and Disorders/Key Laboratory of Pediatrics in Chongqing, Chongqing 400014, China
| | - Fei-Long Chen
- Department of Neonatology, Children's Hospital of Chongqing Medical University/National Clinical Research Center for Child Health and Disorders/Ministry of Education Key Laboratory of Child Development and Disorders/Key Laboratory of Pediatrics in Chongqing, Chongqing 400014, China
| | - Lu-Quan Li
- Department of Neonatology, Children's Hospital of Chongqing Medical University/National Clinical Research Center for Child Health and Disorders/Ministry of Education Key Laboratory of Child Development and Disorders/Key Laboratory of Pediatrics in Chongqing, Chongqing 400014, China
| |
Collapse
|
6
|
Ma S, Liu J, Li W, Liu Y, Hui X, Qu P, Jiang Z, Li J, Wang J. Machine learning in TCM with natural products and molecules: current status and future perspectives. Chin Med 2023; 18:43. [PMID: 37076902 PMCID: PMC10116715 DOI: 10.1186/s13020-023-00741-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 03/28/2023] [Indexed: 04/21/2023] Open
Abstract
Traditional Chinese medicine (TCM) has been practiced for thousands of years with clinical efficacy. Natural products and their effective agents such as artemisinin and paclitaxel have saved millions of lives worldwide. Artificial intelligence is being increasingly deployed in TCM. By summarizing the principles and processes of deep learning and traditional machine learning algorithms, analyzing the application of machine learning in TCM, reviewing the results of previous studies, this study proposed a promising future perspective based on the combination of machine learning, TCM theory, chemical compositions of natural products, and computational simulations based on molecules and chemical compositions. In the first place, machine learning will be utilized in the effective chemical components of natural products to target the pathological molecules of the disease which could achieve the purpose of screening the natural products on the basis of the pathological mechanisms they target. In this approach, computational simulations will be used for processing the data for effective chemical components, generating datasets for analyzing features. In the next step, machine learning will be used to analyze the datasets on the basis of TCM theories such as the superposition of syndrome elements. Finally, interdisciplinary natural product-syndrome research will be established by unifying the results of the two steps outlined above, potentially realizing an intelligent artificial intelligence diagnosis and treatment model based on the effective chemical components of natural products under the guidance of TCM theory. This perspective outlines an innovative application of machine learning in the clinical practice of TCM based on the investigation of chemical molecules under the guidance of TCM theory.
Collapse
Affiliation(s)
- Suya Ma
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Jinlei Liu
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Wenhua Li
- Tianjin University of Traditional Chinese Medicine, Tianjin, 301617, China
| | - Yongmei Liu
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Xiaoshan Hui
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Peirong Qu
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Zhilin Jiang
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China
| | - Jun Li
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China.
| | - Jie Wang
- Guang'anmen Hospital, China Academy of Chinese Medicine Sciences, Beijing, 100053, China.
| |
Collapse
|
7
|
Li S, Que Y, Yang R, He P, Xu S, Hu Y. Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network. J Pers Med 2023; 13:jpm13030447. [PMID: 36983630 PMCID: PMC10056981 DOI: 10.3390/jpm13030447] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/19/2023] [Accepted: 02/27/2023] [Indexed: 03/05/2023] Open
Abstract
Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and clinical symptoms are often already present before diagnosis. This study aimed to develop novel and efficient predictive models for the diagnosis of osteosarcoma and to identify potential targets for exploring osteosarcoma markers. First, osteosarcoma and normal tissue expression microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Then we screened the differentially expressed genes (DEGs) in the osteosarcoma and normal groups in the training group. Next, in order to explore the biologically relevant role of DEGs, Metascape and enrichment analyses were also performed on DEGs. The “randomForest” and “neuralnet” packages in R software were used to select representative genes and construct diagnostic models for osteosarcoma. The next step is to validate the model of the artificial neural network. Then, we performed an immune infiltration analysis by using the training set data. Finally, we constructed a prognostic model using representative genes for prognostic analysis. The copy number of osteosarcoma was also analyzed. A random forest classifier identified nine representative genes (ANK1, TGFBR3, RSF21, HSPB8, ITGA7, RHD, AASS, GREM2, NFASC). HSPB8, RHD, AASS, and NFASC were genes we identified that have not been previously reported to be associated with osteosarcoma. The osteosarcoma diagnostic model we constructed has good performance with areas under the curves (AUCs) of 1 and 0.987 in the training and validation groups, respectively. This study opens new horizons for the early diagnosis of osteosarcoma and provides representative markers for the future treatment of osteosarcoma. This is the first study to pioneer the establishment of a genetic diagnosis model for osteosarcoma and advance the development of osteosarcoma diagnosis and treatment.
Collapse
|
8
|
Jin J, Zhou H, Sun S, Tian Z, Ren H, Feng J, Jiang X. Machine learning based gray-level co-occurrence matrix early warning system enables accurate detection of colorectal cancer pelvic bone metastases on MRI. Front Oncol 2023; 13:1121594. [PMID: 37035167 PMCID: PMC10073745 DOI: 10.3389/fonc.2023.1121594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 03/02/2023] [Indexed: 04/11/2023] Open
Abstract
Objective The mortality of colorectal cancer patients with pelvic bone metastasis is imminent, and timely diagnosis and intervention to improve the prognosis is particularly important. Therefore, this study aimed to build a bone metastasis prediction model based on Gray level Co-occurrence Matrix (GLCM) - based Score to guide clinical diagnosis and treatment. Methods We retrospectively included 614 patients with colorectal cancer who underwent pelvic multiparameter magnetic resonance image(MRI) from January 2015 to January 2022 in the gastrointestinal surgery department of Gezhouba Central Hospital of Sinopharm. GLCM-based Score and Machine learning algorithm, that is,artificial neural net7work model(ANNM), random forest model(RFM), decision tree model(DTM) and support vector machine model(SVMM) were used to build prediction model of bone metastasis in colorectal cancer patients. The effectiveness evaluation of each model mainly included decision curve analysis(DCA), area under the receiver operating characteristic (AUROC) curve and clinical influence curve(CIC). Results We captured fourteen categories of radiomics data based on GLCM for variable screening of bone metastasis prediction models. Among them, Haralick_90, IV_0, IG_90, Haralick_30, CSV, Entropy and Haralick_45 were significantly related to the risk of bone metastasis, and were listed as candidate variables of machine learning prediction models. Among them, the prediction efficiency of RFM in combination with Haralick_90, Haralick_all, IV_0, IG_90, IG_0, Haralick_30, CSV, Entropy and Haralick_45 in training set and internal verification set was [AUC: 0.926,95% CI: 0.873-0.979] and [AUC: 0.919,95% CI: 0.868-0.970] respectively. The prediction efficiency of the other four types of prediction models was between [AUC: 0.716,95% CI: 0.663-0.769] and [AUC: 0.912,95% CI: 0.859-0.965]. Conclusion The automatic segmentation model based on diffusion-weighted imaging(DWI) using depth learning method can accurately segment the pelvic bone structure, and the subsequently established radiomics model can effectively detect bone metastases within the pelvic scope, especially the RFM algorithm, which can provide a new method for automatically evaluating the pelvic bone turnover of colorectal cancer patients.
Collapse
|
9
|
Deep Learning-Based Classification of Spoken English Digits. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:3364141. [PMID: 36211015 PMCID: PMC9534619 DOI: 10.1155/2022/3364141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 08/20/2022] [Indexed: 11/17/2022]
Abstract
Classification of isolated digits is the basic challenge for many speech classification systems. While a lot of work has been carried out on spoken languages, only limited research work on spoken English digit data has been reported in the literature. The paper proposes an intelligent-based system based on deep feedforward neural network (DFNN) with hyperparameter optimization techniques, an ensemble method; random forest (RF), and a regression method; gradient boosting (GB) for the classification of spoken digit data. The paper investigates different machine learning (ML) algorithms to determine the best method for the classification of spoken English digit data. The DFNN classifier outperformed the RF and GB classifiers on the public benchmark spoken English digit data and achieved 99.65% validation accuracy. The outcome of the proposed model performs better compared to existing models with only traditional classifiers.
Collapse
|
10
|
Yang Y, Xu L, Qiao Y, Wang T, Zheng Q. Construction of a neural network diagnostic model and investigation of immune infiltration characteristics for Crohn’s disease. Front Genet 2022; 13:976578. [PMID: 36186439 PMCID: PMC9520627 DOI: 10.3389/fgene.2022.976578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 08/16/2022] [Indexed: 11/13/2022] Open
Abstract
Objective: Crohn’s disease (CD), a chronic recurrent illness, is a type of inflammatory bowel disease whose incidence and prevalence rates are gradually increasing. However, there is no universally accepted criterion for CD diagnosis. The aim of this study was to create a diagnostic prediction model for CD and identify immune cell infiltration features in CD. Methods: In this study, gene expression microarray datasets were obtained from the Gene Expression Omnibus (GEO) database. Then, we identified differentially expressed genes (DEGs) between 178 CD and 38 control cases. Enrichment analysis of DEGs was also performed to explore the biological role of DEGs. Moreover, the “randomForest” package was applied to select core genes that were used to create a neural network model. Finally, in the training cohort, we used CIBERSORT to evaluate the immune landscape between the CD and normal groups. Results: The results of enrichment analysis revealed that these DEGs may be involved in biological processes associated with immunity and inflammatory responses. Moreover, the top 3 hub genes in the protein-protein interaction network were IL-1β, CCL2, and CXCR2. The diagnostic model allowed significant discrimination with an area under the ROC curve of 0.984 [95% confidence interval: 0.971–0.993]. A validation cohort (GSE36807) was utilized to ensure the reliability and applicability of the model. In addition, the immune infiltration analysis indicated nine different immune cell types were significantly different between the CD and healthy control groups. Conclusion: In summary, this study offers a novel insight into the diagnosis of CD and provides potential biomarkers for the precise treatment of CD.
Collapse
|
11
|
Analytical Comparison of Risk Prediction Models for the Onset of Macrosomia Based on Three Statistical Methods. DISEASE MARKERS 2022; 2022:9073043. [PMID: 36124028 PMCID: PMC9482546 DOI: 10.1155/2022/9073043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 11/18/2022]
Abstract
Background and Purpose. Fetal overgrowth can pose a serious threat to the safety of a mother and child. Early identification of high-risk pregnant women and timely pregnancy intervention and guidance are of great value in preventing the development of giant babies and improving adverse maternal and infant outcomes. The current clinical methods for predicting macrosomia mainly rely on obstetric examination and imaging, but their accuracy is controversial. And there is no accepted method for accurately predicting macrosomia. We investigated the risk factors influencing the occurrence of macrosomia and established a prediction model for the occurrence of macrosomia to provide a reference basis for interventions to prevent macrosomia. Method. A retrospective selection of 93 women who were hospitalized in our hospital from March 2019 to May 2022 with a singleton pregnancy and delivered at term with macrosomia were the study group. And 356 women who delivered a normal size baby during the same period were the control group. The variables that were associated with the onset of macrosomia were screened from maternal medical records. Logistic regression models, random forest, and CART decision tree models were developed using the screened variables as input variables and whether they were macrosomia as outcome variables, respectively. The performance of the three models was evaluated by accuracy, precision, recall, F1 score, and receiver operating characteristic curve (ROC). Result. The risk prediction models for the onset of macrosomia, logistic regression model, random forest model, and decision tree, were successfully developed, with accuracies of 0.904, 1.000, and 0.901 in the training set and 0.926, 0.582, and 0.852 in the validation set, respectively. The AUC in the training set were 0.898, 1.000, and 0.789, and in the validation set were 0.906, 0.913, and 0.731, respectively. In general, the logistic regression model has the highest diagnostic efficiency, followed by the random forest model. Conclusion. Logistic regression models have high application value in the assessment of predicting the risk of macrosomia, and it is suggested that the advantages of logistic regression models and random forest models should be combined in future studies and applications to make them work better in the prediction of the risk of macrosomia.
Collapse
|
12
|
Zhou P, Gao S, Hu B. Exploration of Potential Biomarkers and Immune Landscape for Hepatoblastoma: Evidence from Machine Learning Algorithm. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2022; 2022:2417134. [PMID: 35958911 PMCID: PMC9357682 DOI: 10.1155/2022/2417134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 06/02/2022] [Indexed: 11/17/2022]
Abstract
This study aimed to investigate the immune landscape in hepatoblastoma (HB) based on deconvolution methods and identify a biomarkers panel for diagnosis based on a machine learning algorithm. Firstly, we identified 277 differentially expressed genes (DEGs) and differentiated and functionally identified the modules in DEGs. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and GO (gene ontology) were used to annotate these DEGs, and the results suggested that the occurrence of HB was related to DNA adducts, bile secretion, and metabolism of xenobiotics by cytochrome P450. We selected the top 10 genes for our final diagnostic panel based on the random forest tree method. Interestingly, TNFRSF19 and TOP2A were significantly down-regulated in normal samples, while other genes (TRIB1, MAT1A, SAA2-SAA4, NAT2, HABP2, CYP2CB, APOF, and CFHR3) were significantly down-regulated in HB samples. Finally, we constructed a neural network model based on the above hub genes for diagnosis. After cross-validation, the area under the ROC curve was close to 1 (AUC = 0.972), and the AUC of the validation set was 0.870. In addition, the results of single-sample gene-set enrichment analysis (ssGSEA) and deconvolution methods revealed a more active immune responses in the HB tissue. In conclusion, we have developed a robust biomarkers panel for HB patients.
Collapse
Affiliation(s)
- Peng Zhou
- Department of Pediatric, Maternal and Child Health Hospital, Zibo, China
| | - Shanshan Gao
- Department of Ultrasound, Zibo Forth People's Hospital, Zibo, China
| | - Bin Hu
- Department of Pediatric, Maternal and Child Health Hospital, Zibo, China
| |
Collapse
|
13
|
Yang S, Zeng L, Jin X, Lin H, Song J. Feature Genes in Neuroblastoma Distinguishing High-Risk and Non-high-Risk Neuroblastoma Patients: Development and Validation Combining Random Forest With Artificial Neural Network. Front Med (Lausanne) 2022; 9:882348. [PMID: 35911385 PMCID: PMC9336509 DOI: 10.3389/fmed.2022.882348] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 06/13/2022] [Indexed: 11/13/2022] Open
Abstract
There is a significant difference in prognosis among different risk groups. Therefore, it is of great significance to correctly identify the risk grouping of children. Using the genomic data of neuroblastoma samples in public databases, we used GSE49710 as the training set data to calculate the feature genes of the high-risk group and non-high-risk group samples based on the random forest (RF) algorithm and artificial neural network (ANN) algorithm. The screening results of RF showed that EPS8L1, PLCD4, CHD5, NTRK1, and SLC22A4 were the feature differentially expressed genes (DEGs) of high-risk neuroblastoma. The prediction model based on gene expression data in this study showed high overall accuracy and precision in both the training set and the test set (AUC = 0.998 in GSE49710 and AUC = 0.858 in GSE73517). Kaplan–Meier plotter showed that the overall survival and progression-free survival of patients in the low-risk subgroup were significantly better than those in the high-risk subgroup [HR: 3.86 (95% CI: 2.44–6.10) and HR: 3.03 (95% CI: 2.03–4.52), respectively]. Our ANN-based model has better classification performance than the SVM-based model and XGboost-based model. Nevertheless, more convincing data sets and machine learning algorithms will be needed to build diagnostic models for individual organization types in the future.
Collapse
Affiliation(s)
- Sha Yang
- Department of Surgery, Children’s Hospital of Chongqing Medical University, Chongqing, China
- Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing, China
- National Clinical Research Center for Child Health and Disorders, Chongqing, China
- China International Science and Technology Cooperation Base of Child Development and Critical Disorders, Chongqing, China
- Chongqing Key Laboratory of Pediatrics, Chongqing, China
- Chongqing Engineering Research Center of Stem Cell Therapy, Chongqing, China
- Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Lingfeng Zeng
- Department of Nephrology, The Second Xiangya Hospital of Central South University, Changsha, China
| | - Xin Jin
- Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing, China
- National Clinical Research Center for Child Health and Disorders, Chongqing, China
- China International Science and Technology Cooperation Base of Child Development and Critical Disorders, Chongqing, China
- Chongqing Key Laboratory of Pediatrics, Chongqing, China
- Chongqing Engineering Research Center of Stem Cell Therapy, Chongqing, China
- Children’s Hospital of Chongqing Medical University, Chongqing, China
- Department of Cardiacthoracic, Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Huapeng Lin
- Department of Intensive Care Unit, Affiliated Hangzhou First People’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Jianning Song
- Department of General Surgery, Guiqian International General Hospital, Guiyang, China
- *Correspondence: Jianning Song, ,
| |
Collapse
|
14
|
Feng JW, Ye J, Qi GF, Hong LZ, Wang F, Liu SY, Jiang Y. LASSO-based machine learning models for the prediction of central lymph node metastasis in clinically negative patients with papillary thyroid carcinoma. Front Endocrinol (Lausanne) 2022; 13:1030045. [PMID: 36506061 PMCID: PMC9727241 DOI: 10.3389/fendo.2022.1030045] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Accepted: 11/07/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND The presence of central lymph node metastasis (CLNM) is crucial for surgical decision-making in clinical N0 (cN0) papillary thyroid carcinoma (PTC) patients. We aimed to develop and validate machine learning (ML) algorithms-based models for predicting the risk of CLNM in cN0 patients. METHODS A total of 1099 PTC patients with cN0 central neck from July 2019 to March 2022 at our institution were retrospectively analyzed. All patients were randomly split into the training dataset (70%) and the validation dataset (30%). Eight ML algorithms, including the Logistic Regression, Gradient Boosting Machine, Extreme Gradient Boosting (XGB), Random Forest (RF), Decision Tree, Neural Network, Support Vector Machine and Bayesian Network were used to evaluate the risk of CLNM. The performance of ML models was evaluated by the area under curve (AUC), sensitivity, specificity, and decision curve analysis (DCA). RESULTS We firstly used the LASSO Logistic regression method to select the most relevant factors for predicting CLNM. The AUC of XGB was slightly higher than RF (0.907 and 0.902, respectively). According to DCA, RF model significantly outperformed XGB model at most threshold points and was therefore used to develop the predictive model. The diagnostic performance of RF algorithm was dependent on the following nine top-rank variables: size, margin, extrathyroidal extension, sex, echogenic foci, shape, number, lateral lymph node metastasis and chronic lymphocytic thyroiditis. CONCLUSION By incorporating clinicopathological and sonographic characteristics, we developed ML-based models, suggesting that this non-invasive method can be applied to facilitate individualized prediction of occult CLNM in cN0 central neck PTC patients.
Collapse
|
15
|
El-Deeb OM, Elbadawy W, Elzanfaly DS. The Effect of Imbalanced Classes on Students' Academic Performance Prediction. INTERNATIONAL JOURNAL OF E-COLLABORATION 2022. [DOI: 10.4018/ijec.304373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Imbalanced classes in data mining have more challenges in the educational data mining field. This is because most of the datasets collected from educational records are imbalanced by nature. Some classes dominate others and cause bias predictions. This paper studies the effects of the imbalanced classes on the performance of seven different classifiers, which are J48, Random Forest, k-Nearest Neighbors, Naïve Bayes, Random Tree, SVM, and Linear Regression. Moreover, the effectiveness of the SMOTE technique for handling imbalanced data is evaluated against these classifiers. This will be done through the proposal of an early predictive model that predicts student’s academic performance and recommends their appropriate department in a multi-disciplinary institute. According to our results, the Random Forest technique is the best and has the highest level of accuracy is 94.585%.
Collapse
|
16
|
A Method for Detecting Incipient Faults in Satellites Based on Dynamic Linear Discriminant Analysis. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:1303936. [PMID: 34691165 PMCID: PMC8531832 DOI: 10.1155/2021/1303936] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/16/2021] [Accepted: 09/25/2021] [Indexed: 11/17/2022]
Abstract
Timely detection and treatment of possible incipient faults in satellites will effectively reduce the damage and harm they could cause. Although much work has been done concerning fault detection problems, the related questions about satellite incipient faults are little addressed. In this paper, a new satellite incipient fault detection method was proposed by combining the ideas of deviation in unsupervised fault detection methods and classification in supervised fault detection methods. First, the proposed method uses dynamic linear discriminant analysis (LDA) to find an optimal projection vector that separates the in-orbit data from the normal historical data as much as possible. Second, under the assumption that the parameters obey a multidimensional Gaussian distribution, it applies the normal historical data and the optimal projection vector to build a normal model. Finally, it employs the noncentral F-distribution to test whether a fault has occurred. The proposed method was validated using a numerical simulation case and a real satellite fault case. The results show that the method proposed in this paper is more effective at detecting incipient faults than traditional methods.
Collapse
|