1
|
Chai H, Huang Y, Xu L, Song X, He M, Wang Q. A decentralized federated learning-based cancer survival prediction method with privacy protection. Heliyon 2024; 10:e31873. [PMID: 38845954 PMCID: PMC11153246 DOI: 10.1016/j.heliyon.2024.e31873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 05/18/2024] [Accepted: 05/23/2024] [Indexed: 06/09/2024] Open
Abstract
Background Survival prediction is one of the crucial goals in precision medicine, as accurate survival assessment can aid physicians in selecting appropriate treatment for individual patients. To achieve this aim, extensive data must be utilized to train the prediction model and prevent overfitting. However, the collection of patient data for disease prediction is challenging due to potential variations in data sources across institutions and concerns regarding privacy and ownership issues in data sharing. To facilitate the integration of cancer data from different institutions without violating privacy laws, we developed a federated learning-based data integration framework called AdFed, which can be used to evaluate patients' survival while considering the privacy protection problem by utilizing the decentralized federated learning technology and regularization method. Results AdFed was tested on different cancer datasets that contain the patients' information from different institutions. The experimental results show that AdFed using distributed data can achieve better performance in cancer survival prediction (AUC = 0.605) than the compared federated-learning-based methods (average AUC = 0.554). Additionally, to assess the biological interpretability of our method, in the case study we list 10 identified genes related to liver cancer selected by AdFed, among which 5 genes have been proved by literature review. Conclusions The results indicate that AdFed outperforms better than other federated-learning-based methods, and the interpretable algorithm can select biologically significant genes and pathways while ensuring the confidentiality and integrity of data.
Collapse
Affiliation(s)
- Hua Chai
- School of Mathematics and Big Data, Foshan University, Foshan, 528000, China
| | - Yiqian Huang
- School of Mathematics and Big Data, Foshan University, Foshan, 528000, China
| | - Lekai Xu
- School of Mathematics and Big Data, Foshan University, Foshan, 528000, China
| | - Xinpeng Song
- School of Mathematics and Big Data, Foshan University, Foshan, 528000, China
| | - Minfan He
- School of Mathematics and Big Data, Foshan University, Foshan, 528000, China
| | - Qingyong Wang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
- Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Hefei, 230036, China
| |
Collapse
|
2
|
Hammad A, Elshaer M, Tang X. Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:8997-9015. [PMID: 34814332 DOI: 10.3934/mbe.2021443] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Colorectal cancer (CRC) is one of the most common malignancies worldwide. Biomarker discovery is critical to improve CRC diagnosis, however, machine learning offers a new platform to study the etiology of CRC for this purpose. Therefore, the current study aimed to perform an integrated bioinformatics and machine learning analyses to explore novel biomarkers for CRC prognosis. In this study, we acquired gene expression microarray data from Gene Expression Omnibus (GEO) database. The microarray expressions GSE103512 dataset was downloaded and integrated. Subsequently, differentially expressed genes (DEGs) were identified and functionally analyzed via Gene Ontology (GO) and Kyoto Enrichment of Genes and Genomes (KEGG). Furthermore, protein protein interaction (PPI) network analysis was conducted using the STRING database and Cytoscape software to identify hub genes; however, the hub genes were subjected to Support Vector Machine (SVM), Receiver operating characteristic curve (ROC) and survival analyses to explore their diagnostic values. Meanwhile, TCGA transcriptomics data in Gene Expression Profiling Interactive Analysis (GEPIA) database and the pathology data presented by in the human protein atlas (HPA) database were used to verify our transcriptomic analyses. A total of 105 DEGs were identified in this study. Functional enrichment analysis showed that these genes were significantly enriched in biological processes related to cancer progression. Thereafter, PPI network explored a total of 10 significant hub genes. The ROC curve was used to predict the potential application of biomarkers in CRC diagnosis, with an area under ROC curve (AUC) of these genes exceeding 0.92 suggesting that this risk classifier can discriminate between CRC patients and normal controls. Moreover, the prognostic values of these hub genes were confirmed by survival analyses using different CRC patient cohorts. Our results demonstrated that these 10 differentially expressed hub genes could be used as potential biomarkers for CRC diagnosis.
Collapse
Affiliation(s)
- Ahmed Hammad
- Department of Biochemistry and Department of Thoracic Surgery of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China
- Radiation Biology Department, National Center for Radiation Research and Technology, Egyptian Atomic Energy Authority, Cairo 13759, Egypt
| | - Mohamed Elshaer
- Department of Biochemistry and Department of Thoracic Surgery of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China
- Labeled Compounds Department, Hot Labs Center, Egyptian Atomic Energy Authority, Cairo 13759, Egypt
| | - Xiuwen Tang
- Department of Biochemistry and Department of Thoracic Surgery of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China
| |
Collapse
|
3
|
Liang GC, Duan WG, Chen SY, Fang JK. Analysis of the Composition and Anti-Rheumatoid Arthritis Mechanism of Qintengtongbi Decoction Based on Network Pharmacology. Nat Prod Commun 2021. [DOI: 10.1177/1934578x211041421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Qintengtongbi Decoction (QTTBD) is a traditional prescription for rheumatoid arthritis (RA) treatment in Ruikang Hospital Affiliated to Guangxi University of Chinese Medicine, Nanning, southern China's Guangxi Zhuang Autonomous Region. However, there is not yet any analysis on its active compounds or action mechanism for treating RA. Moreover, the prescription has not been investigated from the perspective of network pharmacology. Therefore, this study aimed to analyze the compounds QTTBD and their potential pharmacological effects and the mechanism by which they treat RA via an integrated network pharmacology approach. With the aid of the relevant database tools and research indices, 188 compounds and 272 related drug targets genes/proteins were collected from QTTBD through the compound-target network, and 175 common gene targets between the QTTBD and RA were obtained by Venn 2.1. Finally, the top 10 gene targets and pathways were identified through the protein–protein interaction network, gene ontology, and KEGG pathway analysis: the gene targets include AKT1, IL6, TP53, VEGFA, MAPK3, TNF, CASP3, JUN, EGF, and EGFR; the pathways include oxytocin signaling pathway, amphetamine addiction, graft-versus-host disease, ovarian steroidogenesis, cGMP-PKG signaling pathway, Rap1 signaling pathway, allograft rejection, cytokine–cytokine receptor interaction, regulation of lipolysis in adipocytes and inflammatory mediator regulation of transient receptor potential channels. Therefore, it is concluded that a network pharmacology-based approach can help reveal and clarify the anti-RA role of QTTBD, and provide a scientific basis for further research into the mechanism.
Collapse
Affiliation(s)
- Guo-Cheng Liang
- School of Chemistry and Chemical Engineering, Guangxi University, Nanning, China
- Ruikang Hospital Affiliated to Guangxi University of Chinese Medicine, Nanning, China
| | - Wen-Gui Duan
- School of Chemistry and Chemical Engineering, Guangxi University, Nanning, China
| | - Shu-Yin Chen
- Ruikang Hospital Affiliated to Guangxi University of Chinese Medicine, Nanning, China
| | - Jian-Kang Fang
- Ruikang Hospital Affiliated to Guangxi University of Chinese Medicine, Nanning, China
| |
Collapse
|
4
|
Pérez-Fidalgo JA, Gambardella V, Pineda B, Burgues O, Piñero O, Cervantes A. Aurora kinases in ovarian cancer. ESMO Open 2021; 5:e000718. [PMID: 33087400 PMCID: PMC7580081 DOI: 10.1136/esmoopen-2020-000718] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 07/06/2020] [Accepted: 07/11/2020] [Indexed: 01/18/2023] Open
Abstract
Aurora kinases (AURK) are key regulators of the mitotic spindle formation. AURK is frequently overexpressed in ovarian cancer and this overexpression has been frequently associated with prognosis in these tumours. Interestingly, AURK have been shown to interact with DNA repair mechanisms and other cell cycle regulators. These functions have brought light to Aurora family as a potential target for anticancer therapy. In the last years, two clinical trials with different AURK inhibitors have shown activity in epithelial and clear-cell ovarian cancer. Although there is a lack of predictive factors of AURK inhibition activity, recent trials have identified some candidates. This review will focus in the functions of the AURK family, its role as prognostic factor in epithelial ovarian cancer and potential clinical implications.
Collapse
Affiliation(s)
- J Alejandro Pérez-Fidalgo
- Department of Medical Oncology, Hospital Clinico Universitario of Valencia, Biomedical Research Institute, INCLIVA, CIBERONC and University of Valencia, Valencia, Spain.
| | - Valentina Gambardella
- Department of Medical Oncology, Hospital Clinico Universitario of Valencia, Biomedical Research Institute, INCLIVA, CIBERONC and University of Valencia, Valencia, Spain
| | - Begoña Pineda
- Department of Physiology, Hospital Clinico Universitario of Valencia, Biomedical Research Institute INCLIVA, CIBERONC and University of Valencia, Valencia, Spain
| | - Octavio Burgues
- Department of Pathology, Hospital Clinico Universitario Valencai, Valencia, Spain
| | - Oscar Piñero
- Department of Gynaecology, Hospital Clinico Universitario of Valencia, Valencia, Spain
| | - Andrés Cervantes
- Department of Medical Oncology, Hospital Clinico Universitario of Valencia, Biomedical Research Institute INCLIVA, CIBERONC and University of Valencia, Valencia, Spain
| |
Collapse
|
5
|
Liñares-Blanco J, Pazos A, Fernandez-Lozano C. Machine learning analysis of TCGA cancer data. PeerJ Comput Sci 2021; 7:e584. [PMID: 34322589 PMCID: PMC8293929 DOI: 10.7717/peerj-cs.584] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 05/17/2021] [Indexed: 06/13/2023]
Abstract
In recent years, machine learning (ML) researchers have changed their focus towards biological problems that are difficult to analyse with standard approaches. Large initiatives such as The Cancer Genome Atlas (TCGA) have allowed the use of omic data for the training of these algorithms. In order to study the state of the art, this review is provided to cover the main works that have used ML with TCGA data. Firstly, the principal discoveries made by the TCGA consortium are presented. Once these bases have been established, we begin with the main objective of this study, the identification and discussion of those works that have used the TCGA data for the training of different ML approaches. After a review of more than 100 different papers, it has been possible to make a classification according to following three pillars: the type of tumour, the type of algorithm and the predicted biological problem. One of the conclusions drawn in this work shows a high density of studies based on two major algorithms: Random Forest and Support Vector Machines. We also observe the rise in the use of deep artificial neural networks. It is worth emphasizing, the increase of integrative models of multi-omic data analysis. The different biological conditions are a consequence of molecular homeostasis, driven by both protein coding regions, regulatory elements and the surrounding environment. It is notable that a large number of works make use of genetic expression data, which has been found to be the preferred method by researchers when training the different models. The biological problems addressed have been classified into five types: prognosis prediction, tumour subtypes, microsatellite instability (MSI), immunological aspects and certain pathways of interest. A clear trend was detected in the prediction of these conditions according to the type of tumour. That is the reason for which a greater number of works have focused on the BRCA cohort, while specific works for survival, for example, were centred on the GBM cohort, due to its large number of events. Throughout this review, it will be possible to go in depth into the works and the methodologies used to study TCGA cancer data. Finally, it is intended that this work will serve as a basis for future research in this field of study.
Collapse
Affiliation(s)
- Jose Liñares-Blanco
- CITIC-Research Center of Information and Communication Technologies, University of A Coruna, A Coruña, Spain
- Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruna, A Coruña, Spain
| | - Alejandro Pazos
- CITIC-Research Center of Information and Communication Technologies, University of A Coruna, A Coruña, Spain
- Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruna, A Coruña, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR). Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| | - Carlos Fernandez-Lozano
- CITIC-Research Center of Information and Communication Technologies, University of A Coruna, A Coruña, Spain
- Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruna, A Coruña, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR). Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| |
Collapse
|
6
|
Zhao S, Bao Z, Zhao X, Xu M, Li MD, Yang Z. Identification of Diagnostic Markers for Major Depressive Disorder Using Machine Learning Methods. Front Neurosci 2021; 15:645998. [PMID: 34220416 PMCID: PMC8249859 DOI: 10.3389/fnins.2021.645998] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2020] [Accepted: 05/25/2021] [Indexed: 12/12/2022] Open
Abstract
Background Major depressive disorder (MDD) is a global health challenge that impacts the quality of patients’ lives severely. The disorder can manifest in many forms with different combinations of symptoms, which makes its clinical diagnosis difficult. Robust biomarkers are greatly needed to improve diagnosis and to understand the etiology of the disease. The main purpose of this study was to create a predictive model for MDD diagnosis based on peripheral blood transcriptomes. Materials and Methods We collected nine RNA expression datasets for MDD patients and healthy samples from the Gene Expression Omnibus database. After a series of quality control and heterogeneity tests, 302 samples from six studies were deemed suitable for the study. R package “MetaOmics” was applied for systematic meta-analysis of genome-wide expression data. Receiver operating characteristic (ROC) curve analysis was used to evaluate the diagnostic effectiveness of individual genes. To obtain a better diagnostic model, we also adopted the support vector machine (SVM), random forest (RF), k-nearest neighbors (kNN), and naive Bayesian (NB) tools for modeling, with the RF method being used for feature selection. Results Our analysis revealed six differentially expressed genes (AKR1C3, ARG1, KLRB1, MAFG, TPST1, and WWC3) with a false discovery rate (FDR) < 0.05 between MDD patients and control subjects. We then evaluated the diagnostic ability of these genes individually. With single gene prediction, we achieved a corresponding area under the curve (AUC) value of 0.63 ± 0.04, 0.67 ± 0.07, 0.70 ± 0.11, 0.64 ± 0.08, 0.68 ± 0.07, and 0.62 ± 0.09, respectively, for these genes. Next, we constructed the classifiers of SVM, RF, kNN, and NB with an AUC of 0.84 ± 0.09, 0.81 ± 0.10, 0.73 ± 0.11, and 0.83 ± 0.09, respectively, in validation datasets, suggesting that the SVM classifier might be superior for constructing an MDD diagnostic model. The final SVM classifier including 70 feature genes was capable of distinguishing MDD samples from healthy controls and yielded an AUC of 0.78 in an independent dataset. Conclusion This study provides new insights into potential biomarkers through meta-analysis of GEO data. Constructing different machine learning models based on these biomarkers could be a valuable approach for diagnosing MDD in clinical practice.
Collapse
Affiliation(s)
- Shu Zhao
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Zhiwei Bao
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Xinyi Zhao
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Mengxiang Xu
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Ming D Li
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.,Research Center for Air Pollution and Health, Zhejiang University, Hangzhou, China
| | - Zhongli Yang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| |
Collapse
|
7
|
Zhou J, Zeng ZY, Li L. Progress of Artificial Intelligence in Gynecological Malignant Tumors. Cancer Manag Res 2020; 12:12823-12840. [PMID: 33364831 PMCID: PMC7751777 DOI: 10.2147/cmar.s279990] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 10/22/2020] [Indexed: 12/12/2022] Open
Abstract
Artificial intelligence (AI) is a sort of new technical science which can simulate, extend and expand human intelligence by developing theories, methods and application systems. In the last five years, the application of AI in medical research has become a hot topic in modern science and technology. Gynecological malignant tumors involves a wide range of knowledge, and AI can play an important part in these aspects, such as medical image recognition, auxiliary diagnosis, drug research and development, treatment scheme formulation and other fields. The purpose of this paper is to describe the progress of AI in gynecological malignant tumors and discuss some problems in its application. It is believed that AI improves the efficiency of diagnosis, reduces the burden of clinicians, and improves the effect of treatment and prognosis. AI will play an irreplaceable role in the field of gynecological malignant oncology and will promote the development of medicine and further promote the transformation from traditional medicine to precision medicine and preventive medicine. However, there are also some problems in the application of AI in gynecologic malignant tumors. For example, AI, inseparable from human participation, still needs to be more “humanized”, and needs to further protect patients’ privacy and health, improve legal and insurance protection, and further improve according to local ethnic conditions and national conditions. However, it is believed that with the continuous development of AI, especially ensemble classifier, and deep learning will have a profound influence on the future of medical technology, which is a powerful driving force for future medical innovation and reform.
Collapse
Affiliation(s)
- Jie Zhou
- Department of Gynecologic Oncology, Guangxi Medical University Cancer Hospital, Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor, Ministry of Education, Nanning 530021, Guangxi, People's Republic of China.,Department of Gynecology, The Second Affiliated Hospital, University of South China, Hengyang 421001, Hunan, People's Republic of China
| | - Zhi Ying Zeng
- Department of Anesthesiology, The Second Affiliated Hospital, University of South China, Hengyang 421001, Hunan, People's Republic of China
| | - Li Li
- Department of Gynecologic Oncology, Guangxi Medical University Cancer Hospital, Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor, Ministry of Education, Nanning 530021, Guangxi, People's Republic of China
| |
Collapse
|
8
|
Recurrence-Associated Multi-RNA Signature to Predict Disease-Free Survival for Ovarian Cancer Patients. BIOMED RESEARCH INTERNATIONAL 2020; 2020:1618527. [PMID: 32149080 PMCID: PMC7044477 DOI: 10.1155/2020/1618527] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 12/12/2019] [Accepted: 12/23/2019] [Indexed: 02/07/2023]
Abstract
Ovarian cancer (OvCa) is an intractable gynecological malignancy due to the high recurrence rate. Several molecular biomarkers have been previously screened for early identifying patients with a high recurrence risk and poor prognosis. However, all the known studies focused on a single type of RNAs, not integrating various types. This study was to construct a new multi-RNA-based model to predict the recurrence and prognosis for OvCa patients by using the messenger RNA (mRNA, including long noncoding RNA (lncRNA)) and microRNA (miRNA) sequencing data of The Cancer Genome Atlas database. After univariate Cox regression and least absolute shrinkage and selection operator analyses, a multi-RNA-based signature (2 miRNAs: hsa-miR-508, hsa-miR-506; 1 lncRNA: TM4SF1-AS1; 11 mRNAs: MAGI3, SLAMF7, GLI2, PDK1, ARID3A, PLEKHG4B, TNFAIP8L3, C1QTNF3, NDUFAF1, CH25H, TMEM129) was generated and used to establish a risk score model. The high- and low-risk patients classified by the median risk score exhibited significantly different recurrence risks (89% versus 61%, p < 0.001) and survival time (the area under the receiver operating characteristic curve (AUC) = 0.901 for 5-year disease-free survival (DFS)). This risk model was independent of other clinical features and superior to pathologic staging for DFS prediction (AUC, 0.906 versus 0.524; C-index, 0.633 versus 0.510). Furthermore, some new interaction axes were revealed to explain the possible functions of these RNAs (competing endogenous RNA: TM4SF1-AS1-miR-186-STEAP2, LINC00536-miR-508-STEAP2, LINC00475-miR-506-TMEM129; coexpression: LINC00598-PLEKHG4B). In conclusion, this multi-RNA-based risk model may be clinically useful to stratify OvCa patients with different recurrence risks and survival outcomes and included RNAs may be potential therapeutic targets.
Collapse
|
9
|
DA-Based Parameter Optimization of Combined Kernel Support Vector Machine for Cancer Diagnosis. Processes (Basel) 2019. [DOI: 10.3390/pr7050263] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
As is well known, the correct diagnosis for cancer is critical to save patients’ lives. Support vector machine (SVM) has already made an important contribution to the field of cancer classification. However, different kernel function configurations and their parameters will significantly affect the performance of SVM classifier. To improve the classification accuracy of SVM classifier for cancer diagnosis, this paper proposed a novel cancer classification algorithm based on the dragonfly algorithm and SVM with a combined kernel function (DA-CKSVM) which was constructed from a radial basis function (RBF) kernel and a polynomial kernel. Experiments were performed on six cancer data sets from University of California, Irvine (UCI) machine learning repository and two cancer data sets from Cancer Program Legacy Publication Resources to evaluate the validity of the proposed algorithm. Compared with four well-known algorithms: dragonfly algorithm-SVM (DA-SVM), particle swarm optimization-SVM (PSO-SVM), bat algorithm-SVM (BA-SVM), and genetic algorithm-SVM (GA-SVM), the proposed algorithm was able to find the optimal parameters of SVM classifier and achieved better classification accuracy on cancer datasets.
Collapse
|