1
|
Wang J, Wang LP, Yuan SS, Li F, Liu JX, Shang JL. NLRRC: A Novel Clustering Method of Jointing Non-Negative LRR and Random Walk Graph Regularized NMF for Single-Cell Type Identification. IEEE J Biomed Health Inform 2023; 27:5199-5209. [PMID: 37506010 DOI: 10.1109/jbhi.2023.3299748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2023]
Abstract
The development of single-cell RNA sequencing (scRNA-seq) technology has opened up a new perspective for us to study disease mechanisms at the single cell level. Cell clustering reveals the natural grouping of cells, which is a vital step in scRNA-seq data analysis. However, the high noise and dropout of single-cell data pose numerous challenges to cell clustering. In this study, we propose a novel matrix factorization method named NLRRC for single-cell type identification. NLRRC joins non-negative low-rank representation (LRR) and random walk graph regularized NMF (RWNMFC) to accurately reveal the natural grouping of cells. Specifically, we find the lowest rank representation of single-cell samples by non-negative LRR to reduce the difficulty of analyzing high-dimensional samples and capture the global information of the samples. Meanwhile, by using random walk graph regularization (RWGR) and NMF, RWNMFC captures manifold structure and cluster information before generating a cluster allocation matrix. The cluster assignment matrix contains cluster labels, which can be used directly to get the clustering results. The performance of NLRRC is validated on simulated and real single-cell datasets. The results of the experiments illustrate that NLRRC has a significant advantage in single-cell type identification.
Collapse
|
2
|
Zhang Y, Hu Y, Han N, Yang A, Liu X, Cai H. A survey of drug-target interaction and affinity prediction methods via graph neural networks. Comput Biol Med 2023; 163:107136. [PMID: 37329615 DOI: 10.1016/j.compbiomed.2023.107136] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 05/29/2023] [Accepted: 06/04/2023] [Indexed: 06/19/2023]
Abstract
The tasks of drug-target interaction (DTI) and drug-target affinity (DTA) prediction play important roles in the field of drug discovery. However, biological experiment-based methods are time-consuming and expensive. Recently, computational-based approaches have accelerated the process of drug-target relationship prediction. Drug and target features are represented in structure-based, sequence-based, and graph-based ways. Although some achievements have been made regarding structure-based representations and sequence-based representations, the acquired feature information is not sufficiently rich. Molecular graph-based representations are some of the more popular approaches, and they have also generated a great deal of interest. In this article, we provide an overview of the DTI prediction and DTA prediction tasks based on graph neural networks (GNNs). We briefly discuss the molecular graphs of drugs, the primary sequences of target proteins, and the graph reSLBpresentations of target proteins. Meanwhile, we conducted experiments on various fundamental datasets to substantiate the plausibility of DTI and DTA utilizing graph neural networks.
Collapse
Affiliation(s)
- Yue Zhang
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China.
| | - Yuqing Hu
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China
| | - Na Han
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China
| | - Aqing Yang
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China
| | - Xiaoyong Liu
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China
| | - Hongmin Cai
- School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China
| |
Collapse
|
3
|
Liu S, Li K, Wang Y, Zhu T, Li J, Chen Z. Knowledge graph embedding by fusing multimodal content via cross-modal learning. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:14180-14200. [PMID: 37679131 DOI: 10.3934/mbe.2023634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
Knowledge graph embedding aims to learn representation vectors for the entities and relations. Most of the existing approaches learn the representation from the structural information in the triples, which neglects the content related to the entity and relation. Though there are some approaches proposed to exploit the related multimodal content to improve knowledge graph embedding, such as the text description and images associated with the entities, they are not effective to address the heterogeneity and cross-modal correlation constraint of different types of content and network structure. In this paper, we propose a multi-modal content fusion model (MMCF) for knowledge graph embedding. To effectively fuse the heterogenous data for knowledge graph embedding, such as text description, related images and structural information, a cross-modal correlation learning component is proposed. It first learns the intra-modal and inter-modal correlation to fuse the multimodal content of each entity, and then they are fused with the structure features by a gating network. Meanwhile, to enhance the features of relation, the features of the associated head entity and tail entity are fused to learn relation embedding. To effectively evaluate the proposed model, we compare it with other baselines in three datasets, i.e., FB-IMG, WN18RR and FB15k-237. Experiment result of link prediction demonstrates that our model outperforms the state-of-the-art in most of the metrics significantly, implying the superiority of the proposed method.
Collapse
Affiliation(s)
- Shi Liu
- Big Data Center of State Grid Corporation, Beijing 100052, China
| | - Kaiyang Li
- Big Data Center of State Grid Corporation, Beijing 100052, China
| | - Yaoying Wang
- Big Data Center of State Grid Corporation, Beijing 100052, China
| | - Tianyou Zhu
- Big Data Center of State Grid Corporation, Beijing 100052, China
| | - Jiwei Li
- Big Data Center of State Grid Corporation, Beijing 100052, China
| | - Zhenyu Chen
- Big Data Center of State Grid Corporation, Beijing 100052, China
| |
Collapse
|
4
|
Yu J, Pan B, Yu S, Leung MF. Robust capped norm dual hyper-graph regularized non-negative matrix tri-factorization. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:12486-12509. [PMID: 37501452 DOI: 10.3934/mbe.2023556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Non-negative matrix factorization (NMF) has been widely used in machine learning and data mining fields. As an extension of NMF, non-negative matrix tri-factorization (NMTF) provides more degrees of freedom than NMF. However, standard NMTF algorithm utilizes Frobenius norm to calculate residual error, which can be dramatically affected by noise and outliers. Moreover, the hidden geometric information in feature manifold and sample manifold is rarely learned. Hence, a novel robust capped norm dual hyper-graph regularized non-negative matrix tri-factorization (RCHNMTF) is proposed. First, a robust capped norm is adopted to handle extreme outliers. Second, dual hyper-graph regularization is considered to exploit intrinsic geometric information in feature manifold and sample manifold. Third, orthogonality constraints are added to learn unique data presentation and improve clustering performance. The experiments on seven datasets testify the robustness and superiority of RCHNMTF.
Collapse
Affiliation(s)
- Jiyang Yu
- College of Electronic and Information Engineering, Southwest University, Chongqing 400715, China
| | - Baicheng Pan
- Chongqing Key Laboratory of Nonlinear Circuits and Intelligent Information Processing, College of Electronic and Information Engineering, Southwest University, Chongqing 400715, China
| | - Shanshan Yu
- Training and Basic Education Management Office, Southwest University, Chongqing 400715, China
| | - Man-Fai Leung
- School of Computing and Information Science, Faculty of Science and Engineering, Anglia Ruskin University, Cambridge, United Kingdom
| |
Collapse
|
5
|
Feng X, Zhang H, Lin H, Long H. Single-cell RNA-seq data analysis based on directed graph neural network. Methods 2023; 211:48-60. [PMID: 36804214 DOI: 10.1016/j.ymeth.2023.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 12/09/2022] [Accepted: 02/13/2023] [Indexed: 02/17/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) data scale surges with high-throughput sequencing technology development. However, although single-cell data analysis is a powerful tool, various issues have been reported, such as sequencing sparsity and complex differential patterns in gene expression. Statistical or traditional machine learning methods are inefficient, and the accuracy needs to be improved. The methods based on deep learning can not directly process non-Euclidean spatial data, such as cell diagrams. In this study, we have developed graph autoencoders and graph attention network for scRNA-seq analysis based on a directed graph neural network named scDGAE. Directed graph neural networks cannot only retain the connection properties of the directed graph but also expand the receptive field of the convolution operation. Cosine similarity, median L1 distance, and root-mean-squared error are used to measure the gene imputation performance of different methods with scDGAE. Furthermore, adjusted mutual information, normalized mutual information, completeness score, and Silhouette coefficient score are used to measure the cell clustering performance of different methods with scDGAE. Experiment results show that the scDGAE model achieves promising performance in gene imputation and cell clustering prediction on four scRNA-seq data sets with gold-standard cell labels. Furthermore, it is a robust framework that can be applied to general scRNA-Seq analyses.
Collapse
Affiliation(s)
- Xiang Feng
- College of Information Science Technology, Hainan Normal University, Haikou, Hainan 571158, China
| | - Hongqi Zhang
- College of Information Science Technology, Hainan Normal University, Haikou, Hainan 571158, China
| | - Hao Lin
- School of Mathematics and Statistics, Hainan Normal University, Haikou, Hainan 571158, China; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.
| | - Haixia Long
- College of Information Science Technology, Hainan Normal University, Haikou, Hainan 571158, China.
| |
Collapse
|
6
|
Zhao S, Wang P, Heidari AA, Zhao X, Chen H. Boosted crow search algorithm for handling multi-threshold image problems with application to X-ray images of COVID-19. EXPERT SYSTEMS WITH APPLICATIONS 2023; 213:119095. [PMID: 36313263 PMCID: PMC9595503 DOI: 10.1016/j.eswa.2022.119095] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 10/11/2022] [Accepted: 10/18/2022] [Indexed: 06/16/2023]
Abstract
COVID-19 is pervasive and threatens the safety of people around the world. Therefore, now, a method is needed to diagnose COVID-19 accurately. The identification of COVID-19 by X-ray images is a common method. The target area is extracted from the X-ray images by image segmentation to improve classification efficiency and help doctors make a diagnosis. In this paper, we propose an improved crow search algorithm (CSA) based on variable neighborhood descent (VND) and information exchange mutation (IEM) strategies, called VMCSA. The original CSA quickly falls into the local optimum, and the possibility of finding the best solution is significantly reduced. Therefore, to help the algorithm avoid falling into local optimality and improve the global search capability of the algorithm, we introduce VND and IEM into CSA. Comparative experiments are conducted at CEC2014 and CEC'21 to demonstrate the better performance of the proposed algorithm in optimization. We also apply the proposed algorithm to multi-level thresholding image segmentation using Renyi's entropy as the objective function to find the optimal threshold, where we construct 2-D histograms with grayscale images and non-local mean images and maximize the Renyi's entropy on top of the 2-D histogram. The proposed segmentation method is evaluated on X-ray images of COVID-19 and compared with some algorithms. VMCSA has a significant advantage in segmentation results and obtains better robustness than other algorithms. The available extra info can be found at https://github.com/1234zsw/VMCSA.
Collapse
Affiliation(s)
- Songwei Zhao
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
| | - Pengjun Wang
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou 325035, China
| | - Ali Asghar Heidari
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Xuehua Zhao
- School of Digital Media, Shenzhen Institute of Information Technology, Shenzhen 518172, China
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, Zhejiang 325035, China
| |
Collapse
|
7
|
Li Y, Zhao D, Xu Z, Heidari AA, Chen H, Jiang X, Liu Z, Wang M, Zhou Q, Xu S. bSRWPSO-FKNN: A boosted PSO with fuzzy K-nearest neighbor classifier for predicting atopic dermatitis disease. Front Neuroinform 2023; 16:1063048. [PMID: 36726405 PMCID: PMC9884708 DOI: 10.3389/fninf.2022.1063048] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/05/2022] [Indexed: 01/18/2023] Open
Abstract
Introduction Atopic dermatitis (AD) is an allergic disease with extreme itching that bothers patients. However, diagnosing AD depends on clinicians' subjective judgment, which may be missed or misdiagnosed sometimes. Methods This paper establishes a medical prediction model for the first time on the basis of the enhanced particle swarm optimization (SRWPSO) algorithm and the fuzzy K-nearest neighbor (FKNN), called bSRWPSO-FKNN, which is practiced on a dataset related to patients with AD. In SRWPSO, the Sobol sequence is introduced into particle swarm optimization (PSO) to make the particle distribution of the initial population more uniform, thus improving the population's diversity and traversal. At the same time, this study also adds a random replacement strategy and adaptive weight strategy to the population updating process of PSO to overcome the shortcomings of poor convergence accuracy and easily fall into the local optimum of PSO. In bSRWPSO-FKNN, the core of which is to optimize the classification performance of FKNN through binary SRWPSO. Results To prove that the study has scientific significance, this paper first successfully demonstrates the core advantages of SRWPSO in well-known algorithms through benchmark function validation experiments. Secondly, this article demonstrates that the bSRWPSO-FKNN has practical medical significance and effectiveness through nine public and medical datasets. Discussion The 10 times 10-fold cross-validation experiments demonstrate that bSRWPSO-FKNN can pick up the key features of AD, including the content of lymphocytes (LY), Cat dander, Milk, Dermatophagoides Pteronyssinus/Farinae, Ragweed, Cod, and Total IgE. Therefore, the established bSRWPSO-FKNN method practically aids in the diagnosis of AD.
Collapse
Affiliation(s)
- Yupeng Li
- College of Computer Science and Technology, Changchun Normal University, Changchun, Jilin, China
| | - Dong Zhao
- College of Computer Science and Technology, Changchun Normal University, Changchun, Jilin, China,*Correspondence: Dong Zhao,
| | - Zhangze Xu
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China,Huiling Chen,
| | - Xinyu Jiang
- Department of Dermatology, The Affiliated Hospital of Medical School, Ningbo University, Ningbo, China,School of Medicine, Ningbo University, Ningbo, Zhejiang, China
| | - Zhifang Liu
- Department of Dermatology, The Affiliated Hospital of Medical School, Ningbo University, Ningbo, China,School of Medicine, Ningbo University, Ningbo, Zhejiang, China
| | - Mengmeng Wang
- Department of Dermatology, The Affiliated Hospital of Medical School, Ningbo University, Ningbo, China,School of Medicine, Ningbo University, Ningbo, Zhejiang, China
| | - Qiongyan Zhou
- Department of Dermatology, The Affiliated Hospital of Medical School, Ningbo University, Ningbo, China
| | - Suling Xu
- Department of Dermatology, The Affiliated Hospital of Medical School, Ningbo University, Ningbo, China,Suling Xu,
| |
Collapse
|
8
|
Han Y, Chen W, Heidari AA, Chen H. Multi-verse Optimizer with Rosenbrock and Diffusion Mechanisms for Multilevel Threshold Image Segmentation from COVID-19 Chest X-Ray Images. JOURNAL OF BIONIC ENGINEERING 2023; 20:1198-1262. [PMID: 36619872 PMCID: PMC9811903 DOI: 10.1007/s42235-022-00295-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 10/17/2022] [Accepted: 10/19/2022] [Indexed: 06/17/2023]
Abstract
Coronavirus Disease 2019 (COVID-19) is the most severe epidemic that is prevalent all over the world. How quickly and accurately identifying COVID-19 is of great significance to controlling the spread speed of the epidemic. Moreover, it is essential to accurately and rapidly identify COVID-19 lesions by analyzing Chest X-ray images. As we all know, image segmentation is a critical stage in image processing and analysis. To achieve better image segmentation results, this paper proposes to improve the multi-verse optimizer algorithm using the Rosenbrock method and diffusion mechanism named RDMVO. Then utilizes RDMVO to calculate the maximum Kapur's entropy for multilevel threshold image segmentation. This image segmentation scheme is called RDMVO-MIS. We ran two sets of experiments to test the performance of RDMVO and RDMVO-MIS. First, RDMVO was compared with other excellent peers on IEEE CEC2017 to test the performance of RDMVO on benchmark functions. Second, the image segmentation experiment was carried out using RDMVO-MIS, and some meta-heuristic algorithms were selected as comparisons. The test image dataset includes Berkeley images and COVID-19 Chest X-ray images. The experimental results verify that RDMVO is highly competitive in benchmark functions and image segmentation experiments compared with other meta-heuristic algorithms.
Collapse
Affiliation(s)
- Yan Han
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035 China
| | - Weibin Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035 China
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035 China
| |
Collapse
|
9
|
Yang X, Ye X, Zhao D, Heidari AA, Xu Z, Chen H, Li Y. Multi-threshold image segmentation for melanoma based on Kapur’s entropy using enhanced ant colony optimization. Front Neuroinform 2022; 16:1041799. [PMID: 36387585 PMCID: PMC9663822 DOI: 10.3389/fninf.2022.1041799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 10/10/2022] [Indexed: 11/07/2022] Open
Abstract
Melanoma is a malignant tumor formed by the cancerous transformation of melanocytes, and its medical images contain much information. However, the percentage of the critical information in the image is small, and the noise is non-uniformly distributed. We propose a new multi-threshold image segmentation model based on the two-dimensional histogram approach to the above problem. We present an enhanced ant colony optimization for continuous domains (EACOR) in the proposed model based on the soft besiege and chase strategies. Further, EACOR is combined with two-dimensional Kapur’s entropy to search for the optimal thresholds. An experiment on the IEEE CEC2014 benchmark function was conducted to measure the reliable global search capability of the EACOR algorithm in the proposed model. Moreover, we have also conducted several sets of experiments to test the validity of the image segmentation model proposed in this paper. The experimental results show that the segmented images from the proposed model outperform the comparison method in several evaluation metrics. Ultimately, the model proposed in this paper can provide high-quality samples for subsequent analysis of melanoma pathology images.
Collapse
Affiliation(s)
- Xiao Yang
- School of Computer Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Xiaojia Ye
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai, China
- *Correspondence: Xiaojia Ye,
| | - Dong Zhao
- College of Computer Science and Technology, Changchun Normal University, Changchun, China
- Dong Zhao,
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Zhangze Xu
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Yangyang Li
- Department of Pathology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- Yangyang Li,
| |
Collapse
|
10
|
Li Y, Zhao D, Liu G, Liu Y, Bano Y, Ibrohimov A, Chen H, Wu C, Chen X. Intradialytic hypotension prediction using covariance matrix-driven whale optimizer with orthogonal structure-assisted extreme learning machine. Front Neuroinform 2022; 16:956423. [PMID: 36387587 PMCID: PMC9659657 DOI: 10.3389/fninf.2022.956423] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 09/28/2022] [Indexed: 09/19/2023] Open
Abstract
Intradialytic hypotension (IDH) is an adverse event occurred during hemodialysis (HD) sessions with high morbidity and mortality. The key to preventing IDH is predicting its pre-dialysis and administering a proper ultrafiltration prescription. For this purpose, this paper builds a prediction model (bCOWOA-KELM) to predict IDH using indices of blood routine tests. In the study, the orthogonal learning mechanism is applied to the first half of the WOA to improve the search speed and accuracy. The covariance matrix is applied to the second half of the WOA to enhance the ability to get out of local optimum and convergence accuracy. Combining the above two improvement methods, this paper proposes a novel improvement variant (COWOA) for the first time. More, the core of bCOWOA-KELM is that the binary COWOA is utilized to improve the performance of the KELM. In order to verify the comprehensive performance of the study, the paper sets four types of comparison experiments for COWOA based on 30 benchmark functions and a series of prediction experiments for bCOWOA-KELM based on six public datasets and the HD dataset. Finally, the results of the experiments are analyzed separately in this paper. The results of the comparison experiments prove fully that the COWOA is superior to other famous methods. More importantly, the bCOWOA performs better than its peers in feature selection and its accuracy is 92.41%. In addition, bCOWOA improves the accuracy by 0.32% over the second-ranked bSCA and by 3.63% over the worst-ranked bGWO. Therefore, the proposed model can be used for IDH prediction with future applications.
Collapse
Affiliation(s)
- Yupeng Li
- College of Computer Science and Technology, Changchun Normal University, Changchun, China
| | - Dong Zhao
- College of Computer Science and Technology, Changchun Normal University, Changchun, China
| | - Guangjie Liu
- College of Computer Science and Technology, Changchun Normal University, Changchun, China
| | - Yi Liu
- Department of Nephrology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Yasmeen Bano
- Department of Nephrology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Alisherjon Ibrohimov
- Department of Nephrology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Chengwen Wu
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Xumin Chen
- Department of Nephrology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou University, Wenzhou, China
| |
Collapse
|
11
|
Qiu F, Zheng P, Heidari AA, Liang G, Chen H, Karim FK, Elmannai H, Lin H. Mutational Slime Mould Algorithm for Gene Selection. Biomedicines 2022; 10:biomedicines10082052. [PMID: 36009599 PMCID: PMC9406076 DOI: 10.3390/biomedicines10082052] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/14/2022] [Accepted: 08/16/2022] [Indexed: 02/02/2023] Open
Abstract
A large volume of high-dimensional genetic data has been produced in modern medicine and biology fields. Data-driven decision-making is particularly crucial to clinical practice and relevant procedures. However, high-dimensional data in these fields increase the processing complexity and scale. Identifying representative genes and reducing the data’s dimensions is often challenging. The purpose of gene selection is to eliminate irrelevant or redundant features to reduce the computational cost and improve classification accuracy. The wrapper gene selection model is based on a feature set, which can reduce the number of features and improve classification accuracy. This paper proposes a wrapper gene selection method based on the slime mould algorithm (SMA) to solve this problem. SMA is a new algorithm with a lot of application space in the feature selection field. This paper improves the original SMA by combining the Cauchy mutation mechanism with the crossover mutation strategy based on differential evolution (DE). Then, the transfer function converts the continuous optimizer into a binary version to solve the gene selection problem. Firstly, the continuous version of the method, ISMA, is tested on 33 classical continuous optimization problems. Then, the effect of the discrete version, or BISMA, was thoroughly studied by comparing it with other gene selection methods on 14 gene expression datasets. Experimental results show that the continuous version of the algorithm achieves an optimal balance between local exploitation and global search capabilities, and the discrete version of the algorithm has the highest accuracy when selecting the least number of genes.
Collapse
Affiliation(s)
- Feng Qiu
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China
| | - Pan Zheng
- Information Systems, University of Canterbury, Christchurch 8014, New Zealand
| | - Ali Asghar Heidari
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China
| | - Guoxi Liang
- Department of Information Technology, Wenzhou Polytechnic, Wenzhou 325035, China
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China
- Correspondence:
| | - Faten Khalid Karim
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Hela Elmannai
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Haiping Lin
- Department of Information Engineering, Hangzhou Vocational & Technical College, Hangzhou 310018, China
| |
Collapse
|
12
|
Chen Y, Sun X, Yang J. Prediction of Gastric Cancer-Related Genes Based on the Graph Transformer Network. Front Oncol 2022; 12:902616. [PMID: 35847949 PMCID: PMC9281472 DOI: 10.3389/fonc.2022.902616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 04/26/2022] [Indexed: 02/01/2023] Open
Abstract
Gastric cancer is a complex multifactorial and multistage process that involves a large number of tumor-related gene structural changes and abnormal expression. Therefore, knowing the related genes of gastric cancer can further understand the pathogenesis of gastric cancer and provide guidance for the development of targeted drugs. Traditional methods to discover gastric cancer-related genes based on biological experiments are time-consuming and expensive. In recent years, a large number of computational methods have been developed to identify gastric cancer-related genes. In addition, a large number of experiments show that establishing a biological network to identify disease-related genes has higher accuracy than ordinary methods. However, most of the current computing methods focus on the processing of homogeneous networks, and do not have the ability to encode heterogeneous networks. In this paper, we built a heterogeneous network using a disease similarity network and a gene interaction network. We implemented the graph transformer network (GTN) to encode this heterogeneous network. Meanwhile, the deep belief network (DBN) was applied to reduce the dimension of features. We call this method “DBN-GTN”, and it performed best among four traditional methods and five similar methods.
Collapse
|