1
|
Abd Elaziz M, Dahou A, Aseeri AO, Ewees AA, Al-Qaness MAA, Ibrahim RA. Cross vision transformer with enhanced Growth Optimizer for breast cancer detection in IoMT environment. Comput Biol Chem 2024; 111:108110. [PMID: 38815500 DOI: 10.1016/j.compbiolchem.2024.108110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 04/19/2024] [Accepted: 05/19/2024] [Indexed: 06/01/2024]
Abstract
The recent advances in artificial intelligence modern approaches can play vital roles in the Internet of Medical Things (IoMT). Automatic diagnosis is one of the most important topics in the IoMT, including cancer diagnosis. Breast cancer is one of the top causes of death among women. Accurate diagnosis and early detection of breast cancer can improve the survival rate of patients. Deep learning models have demonstrated outstanding potential in accurately detecting and diagnosing breast cancer. This paper proposes a novel technology for breast cancer detection using CrossViT as the deep learning model and an enhanced version of the Growth Optimizer algorithm (MGO) as the feature selection method. CrossVit is a hybrid deep learning model that combines the strengths of both convolutional neural networks (CNNs) and transformers. The MGO is a meta-heuristic algorithm that selects the most relevant features from a large pool of features to enhance the performance of the model. The developed approach was evaluated on three publicly available breast cancer datasets and achieved competitive performance compared to other state-of-the-art methods. The results show that the combination of CrossViT and the MGO can effectively identify the most informative features for breast cancer detection, potentially assisting clinicians in making accurate diagnoses and improving patient outcomes. The MGO algorithm improves accuracy by approximately 1.59% on INbreast, 5.00% on MIAS, and 0.79% on MiniDDSM compared to other methods on each respective dataset. The developed approach can also be utilized to improve the Quality of Service (QoS) in the healthcare system as a deployable IoT-based intelligent solution or a decision-making assistance service, enhancing the efficiency and precision of the diagnosis.
Collapse
Affiliation(s)
- Mohamed Abd Elaziz
- Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt; Faculty of Computer Science and Engineering, Galala University, Suze 435611, Egypt; Artificial Intelligence Research Center (AIRC), Ajman University, Ajman 346, United Arab Emirates; MEU Research Unit, Middle East University, Amman 11831, Jordan.
| | - Abdelghani Dahou
- Mathematics and Computer Science Department, University of Ahmed DRAIA, 01000, Adrar, Algeria; LDDI Laboratory, Faculty of Science and Technology, University of Ahmed DRAIA, 01000, Adrar, Algeria.
| | - Ahmad O Aseeri
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia.
| | - Ahmed A Ewees
- Department of Computer, Damietta University, Damietta 34517, Egypt.
| | - Mohammed A A Al-Qaness
- College of Physics and Electronic Information Engineering, Zhejiang Normal University, Jinhua 321004, China; Zhejiang Optoelectronics Research Institute, Jinhua 321004, China; College of Engineering and Information Technology, Emirates International University, Sana'a 16881, Yemen.
| | - Rehab Ali Ibrahim
- Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt.
| |
Collapse
|
2
|
Li X, Chen X, Rezaeipanah A. Automatic breast cancer diagnosis based on hybrid dimensionality reduction technique and ensemble classification. J Cancer Res Clin Oncol 2023; 149:7609-7627. [PMID: 36995408 DOI: 10.1007/s00432-023-04699-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 03/17/2023] [Indexed: 03/31/2023]
Abstract
INTRODUCTION Feature selection in the face of high-dimensional data can reduce overfitting and learning time, and at the same time improve the accuracy and efficiency of the system. Since there are many irrelevant and redundant features in breast cancer diagnosis, removing such features leads to more accurate prediction and reduced decision time when dealing with large-scale data. Meanwhile, ensemble classifiers are powerful techniques to improve the prediction performance of classification models, where several individual classifier models are combined to achieve higher accuracy. METHODS In this paper, an ensemble classifier algorithm based on multilayer perceptron neural network is proposed for the classification task, in which the parameters (e.g., number of hidden layers, number of neurons in each hidden layer, and weights of links) are adjusted based on an evolutionary approach. Meanwhile, this paper uses a hybrid dimensionality reduction technique based on principal component analysis and information gain to address this problem. RESULTS The effectiveness of the proposed algorithm was evaluated based on the Wisconsin breast cancer database. In particular, the proposed algorithm provides an average of 17% better accuracy compared to the best results obtained from the existing state-of-the-art methods. CONCLUSION Experimental results show that the proposed algorithm can be used as an intelligent medical assistant system for breast cancer diagnosis.
Collapse
Affiliation(s)
- Xingyuan Li
- Depiecement of Oncology, The PLA Navy Anqing Hospital, Anqing, 246000, Anhui, China
| | - Xi Chen
- Department of Thyroid and Breast Surgery, Anqing Municipal Hospital, Anqing, 246000, Anhui, China.
| | - Amin Rezaeipanah
- Department of Computer Engineering, Persian Gulf University, Bushehr, Iran.
| |
Collapse
|
3
|
An ensemble algorithm using quantum evolutionary optimization of weighted type-II fuzzy system and staged Pegasos Quantum Support Vector Classifier with multi-criteria decision making system for diagnosis and grading of breast cancer. Soft comput 2023. [DOI: 10.1007/s00500-023-07939-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2023]
|
4
|
Pramanik P, Mukhopadhyay S, Mirjalili S, Sarkar R. Deep feature selection using local search embedded social ski-driver optimization algorithm for breast cancer detection in mammograms. Neural Comput Appl 2023; 35:5479-5499. [PMID: 36373132 PMCID: PMC9638217 DOI: 10.1007/s00521-022-07895-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 09/25/2022] [Indexed: 11/06/2022]
Abstract
Breast cancer has become a common malignancy in women. However, early detection and identification of this disease can save many lives. As computer-aided detection helps radiologists in detecting abnormalities efficiently, researchers across the world are striving to develop reliable models to deal with. One of the common approaches to identifying breast cancer is through breast mammograms. However, the identification of malignant breasts from mass lesions is a challenging research problem. In the current work, we propose a method for the classification of breast mass using mammograms which consists of two main stages. At first, we extract deep features from the input mammograms using the well-known VGG16 model while incorporating an attention mechanism into this model. Next, we apply a meta-heuristic called Social Ski-Driver (SSD) algorithm embedded with Adaptive Beta Hill Climbing based local search to obtain an optimal features subset. The optimal features subset is fed to the K-nearest neighbors (KNN) classifier for the classification. The proposed model is demonstrated to be very useful for identifying and differentiating malignant and healthy breasts successfully. For experimentation, we evaluate our model on the digital database for screening mammography (DDSM) database and achieve 96.07% accuracy using only 25% of features extracted by the attention-aided VGG16 model. The Python code of our research work is publicly available at: https://github.com/Ppayel/BreastLocalSearchSSD.
Collapse
Affiliation(s)
- Payel Pramanik
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | | | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimisation, Torrens University Australia, Brisbane, QLD 4006 Australia ,Yonsei Frontier Lab, Yonsei University, Seoul, South Korea ,University Research and Innovation Center, Óbuda University, Budapest, 1034 Hungary
| | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
5
|
Improved swarm-optimization-based filter-wrapper gene selection from microarray data for gene expression tumor classification. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01117-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
6
|
An ensemble algorithm integrating consensus-clustering with feature weighting based ranking and probabilistic fuzzy logic-multilayer perceptron classifier for diagnosis and staging of breast cancer using heterogeneous datasets. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04157-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
7
|
PSOWNNs-CNN: A Computational Radiology for Breast Cancer Diagnosis Improvement Based on Image Processing Using Machine Learning Methods. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5667264. [PMID: 35602611 PMCID: PMC9117073 DOI: 10.1155/2022/5667264] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 03/29/2022] [Indexed: 02/06/2023]
Abstract
Early diagnosis of breast cancer is an important component of breast cancer therapy. A variety of diagnostic platforms can provide valuable information regarding breast cancer patients, including image-based diagnostic techniques. However, breast abnormalities are not always easy to identify. Mammography, ultrasound, and thermography are some of the technologies developed to detect breast cancer. Using image processing and artificial intelligence techniques, the computer enables radiologists to identify chest problems more accurately. The purpose of this article was to review various approaches to detecting breast cancer using artificial intelligence and image processing. The authors present an innovative approach for identifying breast cancer using machine learning methods. Compared to current approaches, such as CNN, our particle swarm optimized wavelet neural network (PSOWNN) method appears to be relatively superior. The use of machine learning methods is clearly beneficial in terms of improved performance, efficiency, and quality of images, which are crucial to the most innovative medical applications. According to a comparison of the process's 905 images to those of other illnesses, 98.6% of the disorders are correctly identified. In summary, PSOWNNs, therefore, have a specificity of 98.8%. Furthermore, PSOWNNs have a precision of 98.6%, which means that, despite the high number of women diagnosed with breast cancer, only 830 (95.2%) are diagnosed. In other words, 95.2% of images are correctly classified. PSOWNNs are more accurate than other machine learning algorithms, SVM, KNN, and CNN.
Collapse
|
8
|
Punitha S, Stephan T, Gandomi AH. A Novel Breast Cancer Diagnosis Scheme With Intelligent Feature and Parameter Selections. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 214:106432. [PMID: 34844767 DOI: 10.1016/j.cmpb.2021.106432] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 09/15/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Breast cancer is the most commonly occurring cancer among women, which contributes to the global death rate. The key to increasing the survival rate of affected patients is early diagnosis along with appropriate treatments. Manual methods for breast cancer diagnosis fail due to human errors, inaccurate diagnoses, and are time-consuming when demands are high. Intelligent systems based on Artificial Neural Network (ANN) for automated breast cancer diagnosis are powerful due to their strong decision-making capabilities in complicated cases. Artificial Bee Colony, Artificial Immune System, and Bacterial Foraging Optimization are swarm intelligence algorithms that solve combinatorial optimization problems. This paper proposes two novel hybrid Artificial Bee Colony (ABC) optimization algorithms that overcome the demerits of standard ABC algorithms. First, this paper proposes a hybrid ABC approach called HABC, in which the standard ABC optimization is hybridized with a modified clonal selection algorithm of the Artificial Immune System that eliminates the poor exploration capabilities of standard ABC optimization. Further, this paper proposes a novel hybrid Artificial Bee Colony (Hybrid ABC) optimization where the strong explorative capabilities of the chemotaxis phase of the bacterial foraging optimization are integrated with a spiral model-based exploitative phase of the ABC by which the proposed Hybrid ABC overcomes the demerits of poor exploration and exploitation of the standard ABC algorithm. METHODS In this work, the two proposed hybrid approaches were used in concurrent feature selection and parameter optimization of an ANN model. The proposed algorithm is implemented using various back-propagation algorithms, including resilient back-propagation (HABC-RP and Hybrid ABC-RP), Levenberg Marquart (HABC-LM and Hybrid ABC-LM), and momentum-based gradient descent (HABC-MGD and Hybrid ABC-GD) for parameter tuning of ANN. The Wisconsin breast cancer dataset was used to evaluate the performance of the proposed algorithms in terms of accuracy, complexity, and computational time. RESULTS The mean accuracy of the proposed HABC-RP was 99.14% and 99.54% for Hybrid ABC which is better than the results found in the existing literature. HABC-RP attained a sensitivity of 98.32%, a specificity of 99.63%, and a precision of 99.38% whereas Hybrid ABC attained sensitivity of 99.08% and Specificity of 99.81%. CONCLUSIONS HABC-RP and Hybrid ABC-RP yielded high accuracy with a low complexity ANN structure compared to other variants. After evaluation, interestingly it is found that the Hybrid ABC-RP has achieved the highest mean accuracy of 99.54% with low complexity of 10.25 mean connections when compared to other variants proposed in this paper. It can be concluded that the concurrent selection of input features and tuning of parameters of ANN plays a vital role in increasing the accuracy of a breast cancer diagnosis. The proposed HABC-RP and Hybrid ABC-RP showed better results when compared to the existing breast cancer diagnosis systems taken for comparison. In the future, the proposed two-hybrid approaches can be used to generate optimal thresholds for the segmentation of tumors in abnormal images. HABC and Hybrid ABC can be used for tuning the parameters of various classifiers.
Collapse
Affiliation(s)
- S Punitha
- Department of Computer Science Engineering, Karunya Insitute of Technology and Sciences, Tamilnadu, India
| | - Thompson Stephan
- Department of Computer Science Engineering, Faculty of Engineering and Technology, M. S. Ramaiah University of Applied Sciences, Bengaluru, India
| | - Amir H Gandomi
- Faculty of Engineering and Information Technology, University of Technology Sydney, Australia.
| |
Collapse
|
9
|
A Review of the Modification Strategies of the Nature Inspired Algorithms for Feature Selection Problem. MATHEMATICS 2022. [DOI: 10.3390/math10030464] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
This survey is an effort to provide a research repository and a useful reference for researchers to guide them when planning to develop new Nature-inspired Algorithms tailored to solve Feature Selection problems (NIAs-FS). We identified and performed a thorough literature review in three main streams of research lines: Feature selection problem, optimization algorithms, particularly, meta-heuristic algorithms, and modifications applied to NIAs to tackle the FS problem. We provide a detailed overview of 156 different articles about NIAs modifications for tackling FS. We support our discussions by analytical views, visualized statistics, applied examples, open-source software systems, and discuss open issues related to FS and NIAs. Finally, the survey summarizes the main foundations of NIAs-FS with approximately 34 different operators investigated. The most popular operator is chaotic maps. Hybridization is the most widely used modification technique. There are three types of hybridization: Integrating NIA with another NIA, integrating NIA with a classifier, and integrating NIA with a classifier. The most widely used hybridization is the one that integrates a classifier with the NIA. Microarray and medical applications are the dominated applications where most of the NIA-FS are modified and used. Despite the popularity of the NIAs-FS, there are still many areas that need further investigation.
Collapse
|
10
|
Li H, Zhang L. A Bilevel Learning Model and Algorithm for Self-Organizing Feed-Forward Neural Networks for Pattern Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:4901-4915. [PMID: 33017295 DOI: 10.1109/tnnls.2020.3026114] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Conventional artificial neural network (ANN) learning algorithms for classification tasks, either derivative-based optimization algorithms or derivative-free optimization algorithms work by training ANN first (or training and validating ANN) and then testing ANN, which are a two-stage and one-pass learning mechanism. Thus, this learning mechanism may not guarantee the generalization ability of a trained ANN. In this article, a novel bilevel learning model is constructed for self-organizing feed-forward neural network (FFNN), in which the training and testing processes are integrated into a unified framework. In this bilevel model, the upper level optimization problem is built for testing error on testing data set and network architecture based on network complexity, whereas the lower level optimization problem is constructed for network weights based on training error on training data set. For the bilevel framework, an interactive learning algorithm is proposed to optimize the architecture and weights of an FFNN with consideration of both training error and testing error. In this interactive learning algorithm, a hybrid binary particle swarm optimization (BPSO) taken as an upper level optimizer is used to self-organize network architecture, whereas the Levenberg-Marquardt (LM) algorithm as a lower level optimizer is utilized to optimize the connection weights of an FFNN. The bilevel learning model and algorithm have been tested on 20 benchmark classification problems. Experimental results demonstrate that the bilevel learning algorithm can significantly produce more compact FFNNs with more excellent generalization ability when compared with conventional learning algorithms.
Collapse
|
11
|
Chowdhury S, Lahiri SK, Hens A, Katiyar S. Performance enhancement of commercial ethylene oxide reactor by artificial intelligence approach. INTERNATIONAL JOURNAL OF CHEMICAL REACTOR ENGINEERING 2021. [DOI: 10.1515/ijcre-2020-0230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
The present work emphasizes the development of a generic methodology that addresses the core issue of any running chemical plant, i.e., how to maintain a delicate balance between profit and environmental impact. Here, an ethylene oxide (EO) production plant has been taken as a case study. The production of EO takes place in a multiphase catalytic reactor, the reliable first principle-based model of which is still not available in the literature. Artificial neural network (ANN) was therefore applied to develop a data-driven model of the complex reactor with the help of actual industrial data. The model successfully built up a correlation between the catalyst selectivity and temperature with other operational parameters. A hybrid multi-objective metaheuristic optimization technique, namely ANN-multi-objective genetic algorithm (MOGA) algorithm was used to develop a Pareto diagram of selectivity versus reactor temperature. The Pareto diagram will help the plant engineers to make a strategy on what operating conditions to be maintained to make a delicate balance between profit and environmental impact. It was also found that by applying this hybrid ANN-MOGA modeling and optimization technique, for a 720 KTA ethylene glycol plant, approximately 32,345 ton/year of carbon-di-oxide emission into the atmosphere can be reduced. Along with the reduction of environmental impact, this hybrid approach enables the plant to reduce raw material cost of nine million USD per annum simultaneously.
Collapse
Affiliation(s)
- Somnath Chowdhury
- Chemical Engineering Department , National Institute of Technology Durgapur , Durgapur 713209 , India
| | - Sandip Kumar Lahiri
- Chemical Engineering Department , National Institute of Technology Durgapur , Durgapur 713209 , India
| | - Abhiram Hens
- Chemical Engineering Department , National Institute of Technology Durgapur , Durgapur 713209 , India
| | - Samarth Katiyar
- Chemical Engineering Department , National Institute Of Technology Karnataka , Surathkal 575025 , India
| |
Collapse
|
12
|
Shaikh TA, Ali R. An automated machine learning tool for breast cancer diagnosis for healthcare professionals. Health Syst (Basingstoke) 2021; 11:303-333. [DOI: 10.1080/20476965.2021.1966324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Affiliation(s)
- Tawseef Ayoub Shaikh
- Department Of Computer Science & Engineering, Baba Ghulam Shah Badshah University Rajouri, Rajouri, J&K, India
| | - Rashid Ali
- Department Of Computer Engineering, Aligarh Muslim University, Aligarh, Uttar Pradesh, India
| |
Collapse
|
13
|
Talatian Azad S, Ahmadi G, Rezaeipanah A. An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis. J EXP THEOR ARTIF IN 2021. [DOI: 10.1080/0952813x.2021.1938698] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
| | - Gholamreza Ahmadi
- Department of Computer Engineering, Persian Gulf University, Bushehr, Iran
| | - Amin Rezaeipanah
- Department of Computer Engineering, Persian Gulf University, Bushehr, Iran
| |
Collapse
|
14
|
Stephan P, Stephan T, Kannan R, Abraham A. A hybrid artificial bee colony with whale optimization algorithm for improved breast cancer diagnosis. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-05997-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
15
|
A Simultaneous Moth Flame Optimizer Feature Selection Approach Based on Levy Flight and Selection Operators for Medical Diagnosis. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2021. [DOI: 10.1007/s13369-021-05478-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
16
|
Training Feedforward Neural Network Using Enhanced Black Hole Algorithm: A Case Study on COVID-19 Related ACE2 Gene Expression Classification. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2021; 46:3807-3828. [PMID: 33520590 PMCID: PMC7823180 DOI: 10.1007/s13369-020-05217-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Accepted: 12/07/2020] [Indexed: 11/05/2022]
Abstract
The aim of this paper is twofold. First, black hole algorithm (BHA) is proposed as a new training algorithm for feedforward neural networks (FNNs), since most traditional and metaheuristic algorithms for training FNNs suffer from the problem of slow coverage and getting stuck at local optima. BHA provides a reliable alternative to address these drawbacks. Second, complementary learning components and Levy flight random walk are introduced into BHA to result in a novel optimization algorithm (BHACRW) for the purpose of improving the FNNs’ accuracy by finding optimal weights and biases. Four benchmark functions are first used to evaluate BHACRW’s performance in numerical optimization problems. Later, the classification performance of the suggested models, using BHA and BHACRW for training FNN, is evaluated against seven various benchmark datasets: iris, wine, blood, liver disorders, seeds, Statlog (Heart), balance scale. Experimental result demonstrates that the BHACRW performs better in terms of mean square error (MSE) and accuracy of training FNN, compared to standard BHA and eight well-known metaheuristic algorithms: whale optimization algorithm (WOA), biogeography-based optimizer (BBO), gravitational search algorithm (GSA), genetic algorithm (GA), cuckoo search (CS), multiverse optimizer (MVO), symbiotic organisms search (SOS), and particle swarm optimization (PSO). Moreover, we examined the classification performance of the suggested approach on the angiotensin-converting enzyme 2 (ACE2) gene expression as a coronavirus receptor, which has been overexpressed in human rhinovirus-infected nasal tissue. Results demonstrate that BHACRW-FNN achieves the highest accuracy on the dataset compared to other classifiers.
Collapse
|
17
|
Abu Khurmaa R, Aljarah I, Sharieh A. An intelligent feature selection approach based on moth flame optimization for medical diagnosis. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05483-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
18
|
Khandezamin Z, Naderan M, Rashti MJ. Detection and classification of breast cancer using logistic regression feature selection and GMDH classifier. J Biomed Inform 2020; 111:103591. [PMID: 33039588 DOI: 10.1016/j.jbi.2020.103591] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 10/01/2020] [Accepted: 10/04/2020] [Indexed: 11/17/2022]
Abstract
Breast cancer is the most common cancer among women such that the existence of a precise and reliable system for the diagnosis of benign or malignant tumors is critical. Nowadays, using the results of Fine Needle Aspiration (FNA) cytology and machine learning techniques, detection and early diagnosis of this cancer can be done with greater accuracy. In this paper, we propose a method consisting of two steps: in the first step, to eliminate the less important features, logistic regression has been used. In the second step, the Group Method Data Handling (GMDH) neural network is used for the diagnosis of benign and malignant samples. To evaluate the performance of the proposed method, three datasets WBCD, WDBC and WPBC are investigated with metrics: precision, the Area Under the ROC (AUC), true positive rate, false positive rate, accuracy and F-criteria. Simulation results show that the proposed method reaches a precision of 99.4% for WBCD, 99.6% for WDBC and a precision of 96.9% for WPBC dataset.
Collapse
Affiliation(s)
- Ziba Khandezamin
- M.Sc. of Artificial Intelligence, Department of Computer Engineering, Faculty of Engineering, Shahid Chamran University of Ahvaz, Ahwvz, Iran.
| | - Marjan Naderan
- Associate Professor, Department of Computer Engineering, Faculty of Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
| | - Mohammad Javad Rashti
- Assistant Professor, Department of Computer Engineering, Faculty of Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
| |
Collapse
|
19
|
A fast parallel genetic programming framework with adaptively weighted primitives for symbolic regression. Soft comput 2020. [DOI: 10.1007/s00500-019-04379-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
20
|
Shaikh TA, Ali R. An intelligent healthcare system for optimized breast cancer diagnosis using harmony search and simulated annealing (HS-SA) algorithm. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100408] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
21
|
|
22
|
A Wrapper Feature Subset Selection Method Based on Randomized Search and Multilayer Structure. BIOMED RESEARCH INTERNATIONAL 2019; 2019:9864213. [PMID: 31828154 PMCID: PMC6885241 DOI: 10.1155/2019/9864213] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 08/10/2019] [Accepted: 08/27/2019] [Indexed: 12/11/2022]
Abstract
The identification of discriminative features from information-rich data with the goal of clinical diagnosis is crucial in the field of biomedical science. In this context, many machine-learning techniques have been widely applied and achieved remarkable results. However, disease, especially cancer, is often caused by a group of features with complex interactions. Unlike traditional feature selection methods, which only focused on finding single discriminative features, a multilayer feature subset selection method (MLFSSM), which employs randomized search and multilayer structure to select a discriminative subset, is proposed herein. In each level of this method, many feature subsets are generated to assure the diversity of the combinations, and the weights of features are evaluated on the performances of the subsets. The weight of a feature would increase if the feature is selected into more subsets with better performances compared with other features on the current layer. In this manner, the values of feature weights are revised layer-by-layer; the precision of feature weights is constantly improved; and better subsets are repeatedly constructed by the features with higher weights. Finally, the topmost feature subset of the last layer is returned. The experimental results based on five public gene datasets showed that the subsets selected by MLFSSM were more discriminative than the results by traditional feature methods including LVW (a feature subset method used the Las Vegas method for randomized search strategy), GAANN (a feature subset selection method based genetic algorithm (GA)), and support vector machine recursive feature elimination (SVM-RFE). Furthermore, MLFSSM showed higher classification performance than some state-of-the-art methods which selected feature pairs or groups, including top scoring pair (TSP), k-top scoring pairs (K-TSP), and relative simplicity-based direct classifier (RS-DC).
Collapse
|
23
|
Zhang L, Li H, Kong XG. Evolving feedforward artificial neural networks using a two-stage approach. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.03.097] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
24
|
Enhanced Monarchy Butterfly Optimization Technique for effective breast cancer diagnosis. J Med Syst 2019; 43:206. [PMID: 31144128 DOI: 10.1007/s10916-019-1348-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 05/20/2019] [Indexed: 10/26/2022]
Abstract
Breast cancer is the biggest curse for the women society in the world since the survival factor of the infected patients is ensured only when it is detected at the early localized stage. The majority of the intelligent schemes proposed for detecting the breast cancer relies on the human skill that helps in trustworthy determination of essential pattern that confirms the existence of the infected cancer cells for deciding upon the course of treatment. Further, most of the research works contributed in the literature for detecting breast cancer necessitates huge time and laborinvolved that increases the time of diagnosis. This Intelligent Artificial Bee Colony and Enhanced Monarchy Butterfly Optimization Technique (IABC-EMBOT) is proposed for effective breast cancer diagnosis. The core idea behind the formulation of IABC-EMBOT relies on two significant ameliorations that, i) focuses on the modification of Monarchy Butterfly Optimization that enhances the exploration degree based on the rate of exploitation of the searching space and ii) concentrates on the elimination in the limitations of the ABC scheme by enhancing the possibility of search diversification process through phenomenal update facilitated through the dynamic and adaptive butterfly operator that improves the search globally. The proposed IABC-EMBOT scheme investigated using the Wisconsin data set is proven to facilitate an improved average classification accuracy of 97.53%.
Collapse
|
25
|
Ibrahim AO, Shamsuddin SM, Abraham A, Qasem SN. Adaptive memetic method of multi-objective genetic evolutionary algorithm for backpropagation neural network. Neural Comput Appl 2019. [DOI: 10.1007/s00521-018-03990-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
26
|
A reliable method for colorectal cancer prediction based on feature selection and support vector machine. Med Biol Eng Comput 2018; 57:901-912. [PMID: 30478811 DOI: 10.1007/s11517-018-1930-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2017] [Accepted: 11/17/2018] [Indexed: 02/07/2023]
Abstract
Colorectal cancer (CRC) is a common cancer responsible for approximately 600,000 deaths per year worldwide. Thus, it is very important to find the related factors and detect the cancer accurately. However, timely and accurate prediction of the disease is challenging. In this study, we build an integrated model based on logistic regression (LR) and support vector machine (SVM) to classify the CRC into cancer and normal samples. From various factors, human location, age, gender, BMI, and cancer tumor type, tumor grade, and DNA, of the cancer, we select the most significant factors (p < 0.05) using logistic regression as main features, and with these features, a grid-search SVM model is designed using different kernel types (Linear, radial basis function (RBF), Sigmoid, and Polynomial). The result of the logistic regression indicates that the Firmicutes (AUC 0.918), Bacteroidetes (AUC 0.856), body mass index (BMI) (AUC 0.777), and age (AUC 0.710) and their combined factors (AUC 0.942) are effective for CRC detection. And the best kernel type is RBF, which achieves an accuracy of 90.1% when k = 5, and 91.2% when k = 10. This study provides a new method for colorectal cancer prediction based on independent risky factors. Graphical abstract Flow chart depicting the method adopted in the study. LR (logistic regression) and ROC curve are used to select independent features as input of SVM. SVM kernel selection aims to find the best kernel function for classification by comparing Linear, RBF, Sigmoid, and Polynomial kernel types of SVM, and the result shows the best kernel is RBF. Classification performance of LR + RF, LR + NB, LR + KNN, and LR + ANNs models are compared with LR + SVM. After these steps, the cancer and healthy individuals can be classified, and the best model is selected.
Collapse
|
27
|
Arunkumar C, Ramakrishnan S. Attribute selection using fuzzy roughset based customized similarity measure for lung cancer microarray gene expression data. ACTA ACUST UNITED AC 2018. [DOI: 10.1016/j.fcij.2018.02.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
28
|
Khan S, Zafar N, Khan SS, Setua S, Behrman SW, Stiles ZE, Yallapu MM, Sahay P, Ghimire H, Ise T, Nagata S, Wang L, Wan JY, Pradhan P, Jaggi M, Chauhan SC. Clinical significance of MUC13 in pancreatic ductal adenocarcinoma. HPB (Oxford) 2018; 20:563-572. [PMID: 29352660 PMCID: PMC5995635 DOI: 10.1016/j.hpb.2017.12.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 11/22/2017] [Accepted: 12/19/2017] [Indexed: 02/07/2023]
Abstract
BACKGROUND Poor prognosis of pancreatic cancer (PanCa) is associated with lack of an effective early diagnostic biomarker. This study elucidates significance of MUC13, as a diagnostic/prognostic marker of PanCa. METHODS MUC13 was assessed in tissues using our in-house generated anti-MUC13 mouse monoclonal antibody and analyzed for clinical correlation by immunohistochemistry, immunoblotting, RT-PCR, computational and submicron scale mass-density fluctuation analyses, ROC and Kaplan Meir curve analyses. RESULTS MUC13 expression was detected in 100% pancreatic intraepithelial neoplasia (PanIN) lesions (Mean composite score: MCS = 5.8; AUC >0.8, P < 0.0001), 94.6% of pancreatic ductal adenocarcinoma (PDAC) samples (MCS = 9.7, P < 0.0001) as compared to low expression in tumor adjacent tissues (MCS = 4, P < 0.001) along with faint or no expression in normal pancreatic tissues (MCS = 0.8; AUC >0.8; P < 0.0001). Nuclear MUC13 expression positively correlated with nodal metastasis (P < 0.05), invasion of cancer to peripheral tissues (P < 0.5) and poor patient survival (P < 0.05; prognostic AUC = 0.9). Submicron scale mass density and artificial intelligence based algorithm analyses also elucidated association of MUC13 with greater morphological disorder (P < 0.001) and nuclear MUC13 as strong predictor for cancer aggressiveness and poor patient survival. CONCLUSION This study provides significant information regarding MUC13 expression/subcellular localization in PanCa samples and supporting the use anti-MUC13 MAb for the development of PanCa diagnostic/prognostic test.
Collapse
Affiliation(s)
- Sheema Khan
- Department of Pharmaceutical Sciences and Center for Cancer Research, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Nadeem Zafar
- Department of Pathology, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Shabia S. Khan
- Department of Computer Science, University of Kashmir, Srinagar, India
| | - Saini Setua
- Department of Pharmaceutical Sciences and Center for Cancer Research, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Stephen W. Behrman
- Department of Surgery, Baptist Memorial Hospital and the University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Zachary E Stiles
- Department of Surgery, Baptist Memorial Hospital and the University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Murali M. Yallapu
- Department of Pharmaceutical Sciences and Center for Cancer Research, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Peeyush Sahay
- Department of Physics, University of Memphis, Memphis, Tennessee, USA
| | - Hemendra Ghimire
- Department of Physics, University of Memphis, Memphis, Tennessee, USA
| | - Tomoko Ise
- Center for Drug Design Research, National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki-City, Osaka, Japan
| | - Satoshi Nagata
- Center for Drug Design Research, National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki-City, Osaka, Japan
| | - Lei Wang
- Department of Biostatistics & Epidemiology, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Jim Y. Wan
- Department of Biostatistics & Epidemiology, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Prabhakar Pradhan
- Department of Physics, University of Memphis, Memphis, Tennessee, USA
| | - Meena Jaggi
- Department of Pharmaceutical Sciences and Center for Cancer Research, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Subhash C. Chauhan
- Department of Pharmaceutical Sciences and Center for Cancer Research, University of Tennessee Health Science Center, Memphis, Tennessee, USA,Corresponding Authors: Subhash C. Chauhan, Ph.D., Professor, Department of Pharmaceutical Sciences, University of Tennessee Health Science Center, 19 South Manassas, Cancer Research Building, Memphis, TN, 38163. Phone: (901) 448-2175. Fax: (901) 448-1051.
| |
Collapse
|
29
|
|
30
|
Qin CJ, Guan Q, Wang XP. APPLICATION OF ENSEMBLE ALGORITHM INTEGRATING MULTIPLE CRITERIA FEATURE SELECTION IN CORONARY HEART DISEASE DETECTION. BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS 2017. [DOI: 10.4015/s1016237217500430] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Conventional coronary heart disease (CHD) detection methods are expensive, rely much on doctors’ subjective experience, and some of them have side effects. In order to obtain rapid, high-precision, low-cost, non-invasive detection results, several methods in machine learning were attempted for CHD detection in this paper. The paper adopted multiple evaluation criteria to measure features, combined with heuristic search strategy and seven common classification algorithms to verify the validity and the importance of feature selection (FS) in the Z-Alizadeh Sani CHD dataset. On this basis, a novelty algorithm integrating multiple FS methods into the ensemble algorithm (ensemble algorithm based on multiple feature selection, EA-MFS) was further proposed. The algorithm adopted Bagging approach to increase data diversity, used the aforementioned MFS methods for functional perturbation, employed major voting method to carry out the decision results, and performed selective integration in terms of the difference of base classifiers in the ensemble process. Compared with the single FS method, the EA-MFS algorithm could comprehensively describe the relationship of features, enhance the classification effect, and displayed better robustness. That meant the EA-MFS algorithm could reduce the dependence on dataset and strengthen the stability of the algorithm, all of which were of great significance for the clinical application of machine learning algorithm in coronary heart disease detection.
Collapse
Affiliation(s)
- Cai-Jie Qin
- Institute of Information Engineering, SanMing University, SanMing 365004, P. R. China
| | - Qiang Guan
- Institute of Information Engineering, SanMing University, SanMing 365004, P. R. China
| | - Xin-Pei Wang
- School of Control Science and Engineering, ShanDong University, JiNan 250000, P. R. China
| |
Collapse
|
31
|
Feature Selection for Partial Discharge Severity Assessment in Gas-Insulated Switchgear Based on Minimum Redundancy and Maximum Relevance. ENERGIES 2017. [DOI: 10.3390/en10101516] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
32
|
Azami H, Escudero J. A comparative study of breast cancer diagnosis based on neural network ensemble via improved training algorithms. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2016; 2015:2836-9. [PMID: 26736882 DOI: 10.1109/embc.2015.7318982] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Breast cancer is one of the most common types of cancer in women all over the world. Early diagnosis of this kind of cancer can significantly increase the chances of long-term survival. Since diagnosis of breast cancer is a complex problem, neural network (NN) approaches have been used as a promising solution. Considering the low speed of the back-propagation (BP) algorithm to train a feed-forward NN, we consider a number of improved NN trainings for the Wisconsin breast cancer dataset: BP with momentum, BP with adaptive learning rate, BP with adaptive learning rate and momentum, Polak-Ribikre conjugate gradient algorithm (CGA), Fletcher-Reeves CGA, Powell-Beale CGA, scaled CGA, resilient BP (RBP), one-step secant and quasi-Newton methods. An NN ensemble, which is a learning paradigm to combine a number of NN outputs, is used to improve the accuracy of the classification task. Results demonstrate that NN ensemble-based classification methods have better performance than NN-based algorithms. The highest overall average accuracy is 97.68% obtained by NN ensemble trained by RBP for 50%-50% training-test evaluation method.
Collapse
|