1
|
Multi-Objective Artificial Bee Colony Algorithm Based on Scale-Free Network for Epistasis Detection. Genes (Basel) 2022; 13:genes13050871. [PMID: 35627256 PMCID: PMC9140669 DOI: 10.3390/genes13050871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 04/30/2022] [Accepted: 05/10/2022] [Indexed: 12/04/2022] Open
Abstract
In genome-wide association studies, epistasis detection is of great significance for the occurrence and diagnosis of complex human diseases, but it also faces challenges such as high dimensionality and a small data sample size. In order to cope with these challenges, several swarm intelligence methods have been introduced to identify epistasis in recent years. However, the existing methods still have some limitations, such as high-consumption and premature convergence. In this study, we proposed a multi-objective artificial bee colony (ABC) algorithm based on the scale-free network (SFMOABC). The SFMOABC incorporates the scale-free network into the ABC algorithm to guide the update and selection of solutions. In addition, the SFMOABC uses mutual information and the K2-Score of the Bayesian network as objective functions, and the opposition-based learning strategy is used to improve the search ability. Experiments were performed on both simulation datasets and a real dataset of age-related macular degeneration (AMD). The results of the simulation experiments showed that the SFMOABC has better detection power and efficiency than seven other epistasis detection methods. In the real AMD data experiment, most of the single nucleotide polymorphism combinations detected by the SFMOABC have been shown to be associated with AMD disease. Therefore, SFMOABC is a promising method for epistasis detection.
Collapse
|
2
|
A Secure High-Order Gene Interaction Detecting Method for Infectious Diseases. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4471736. [PMID: 35495886 PMCID: PMC9050263 DOI: 10.1155/2022/4471736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 03/01/2022] [Indexed: 12/04/2022]
Abstract
Infectious diseases pose a serious threat to human life, the Genome Wide Association Studies (GWAS) can analyze susceptibility genes of infectious diseases from the genetic level and carry out targeted prevention and treatment. The susceptibility genes for infectious diseases often act in combination with multiple susceptibility sites; therefore, high-order epistasis detection has become an important means. However, due to intensive computational burden and diversity of disease models, existing methods have drawbacks on low detection power, high computation cost, and preference for some types of disease models. Furthermore, these methods are exposed to repeated query and model inversion attacks in the process of iterative optimization, which may disclose Single Nucleotide Polymorphism (SNP) information associated with individual privacy. Therefore, in order to solve these problems, this paper proposed a safe harmony search algorithm for high-order gene interaction detection, termed as HS-DP. Firstly, the linear weighting method was used to integrate 5 objective functions to screen out high-order SNP sets with high correlation, including K2-Score, JS divergence, logistic regression, mutual information, and Gini. Then, based on the Differential Privacy (DP) theory, the function disturbance mechanism was introduced to protect the security of individual privacy information associated with the objective function, and we proved the rationality of the disturbance mechanism theoretically. Finally, the practicability and superiority of the algorithm were verified by experiments. Experimental results showed that the algorithm proposed in this paper could improve the detection accuracy to the greatest extent while guaranteeing privacy.
Collapse
|
3
|
Guan B, Zhao Y, Yin Y, Li Y. Detecting Disease-Associated SNP-SNP Interactions Using Progressive Screening Memetic Algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:878-887. [PMID: 32857698 DOI: 10.1109/tcbb.2020.3019256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Hundreds of thousands of single nucleotide polymorphisms (SNPs)are currently available for genome-wide association study (GWAS). Detecting disease-associated SNP-SNP interactions is considered an important way to capture the underlying genetic causes of complex diseases. In the combinatorially explosive search space, evolutionary algorithms are promising in solving this difficult problem because of their controllable time complexity. However, in existing evolutionary algorithms, some possible SNP-SNP interactions are evaluated multiple times by the fitness function. Such reevaluations not only waste computing resources but also make these algorithms easy to fall into local optima. To tackle this drawback, a progressive screening memetic algorithm (PSMA)is proposed in the paper. PSMA first represents all possible SNP-SNP interactions in a constructed graph. Then, the proposed algorithm uses the progressive screening strategy to guarantee that every possible SNP-SNP interaction can only be evaluated once by reducing the constructed graph. Furthermore, two types of local search algorithms are introduced to enhance the detecting power of PSMA. For detecting disease-associated SNP-SNP interactions, experimental results show that our proposed method outperforms other existing state-of-the-art methods in terms of accuracy and time.
Collapse
|
4
|
Li X, Zhang S, Wong KC. Evolving Transcriptomic Profiles From Single-Cell RNA-Seq Data Using Nature-Inspired Multiobjective Optimization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2445-2458. [PMID: 32031947 DOI: 10.1109/tcbb.2020.2971993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Transcriptomic profiling plays an important role in post-genomic analysis. Especially, the single-cell RNA-seq technology has advanced our understanding of gene expression from cell population level into individual cell level. Many computational methods have been proposed to decipher transcriptomic profiles from those RNA-seq data. However, most of the related algorithms suffer from realistic restrictions such as high dimensionality and premature convergence. In this paper, we propose and formulate an evolutionary multiobjective blind compressed sensing (EMOBCS) to address those problems for evolving transcriptomic profiles from single-cell RNA-seq data. In the proposed framework, to characterize various gene expression profile models, two objective functions including chi-squared kernel score and euclidean distance of different gene expression profiles are formulated. After that, multiobjective blind compressed sensing based on artificial bee colony is designed to optimize the two objective functions on single-cell RNA-seq data by proposing a rank probability model and two new search strategies into the cooperative convolution framework in an unbiased manner. To demonstrate its effectiveness, extensive experiments have been conducted, comparing the proposed algorithm with 14 algorithms including eight state-of-the-art algorithms and six different EMOBCS algorithms under different search strategies on 10 single-cell RNA-seq datasets and one case study. The experimental results reveal that the proposed algorithm is better than or comparable with those compared algorithms. Furthermore, we also conduct the time complexity analysis, convergence analysis, and parameter analysis to demonstrate various properties of EMOBCS.
Collapse
|
5
|
Wang L, Wang Y, Fu Y, Gao Y, Du J, Yang C, Liu J. AFSBN: A Method of Artificial Fish Swarm Optimizing Bayesian Network for Epistasis Detection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1369-1383. [PMID: 31670676 DOI: 10.1109/tcbb.2019.2949780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
How to mine the interaction between SNPs (namely epistasis) efficiently and accurately must be considered when to tackle the complexity of underlying biological mechanisms. In order to overcome the defect of low learning efficiency and local optimal, this work proposes an epistasis mining method using artificial fish swarm optimizing Bayesian network (AFSBN). This method uses the characteristics of global optimization, good robustness and fast convergence about the artificial fish swarm algorithm, and uses the algorithm into the heuristic search strategy of Bayesian network. The initial network structure can be evolved through the manipulations of foraging behavior, clustering behavior, tail-chasing behavior and random behavior. This algorithm chooses different behaviors to modify the network state according to the changing of surrounding environment and the states of partners. It realizes the interaction between each artificial fish and its neighboring environment, and finally finds the optimal network in the population. We compared AFSBN with other existing algorithms on both simulated and real datasets. The experimental results demonstrate that our method outperforms others in epistasis detection accuracy in the case of not affecting the efficiency basically for different datasets.
Collapse
|
6
|
Manavalan R, Priya S. Genetic interactions effects for cancer disease identification using computational models: a review. Med Biol Eng Comput 2021; 59:733-758. [PMID: 33839998 DOI: 10.1007/s11517-021-02343-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 03/10/2021] [Indexed: 11/29/2022]
Abstract
Genome-wide association studies (GWAS) provide clear insight into understanding genetic variations and environmental influences responsible for various human diseases. Cancer identification through genetic interactions (epistasis) is one of the significant ongoing researches in GWAS. The growth of the cancer cell emerges from multi-locus as well as complex genetic interaction. It is impractical for the physician to detect cancer via manual examination of SNPs interaction. Due to its importance, several computational approaches have been modeled to infer epistasis effects. This article includes a comprehensive and multifaceted review of all relevant genetic studies published between 2001 and 2020. In this contemporary review, various computational methods are as follows: multifactor dimensionality reduction-based approaches, statistical strategies, machine learning, and optimization-based techniques are carefully reviewed and presented with their evaluation results. Moreover, these computational approaches' strengths and limitations are described. The issues behind the computational methods for identifying the cancer disease through genetic interactions and the various evaluation parameters used by researchers have been analyzed. This review is highly beneficial for researchers and medical professionals to learn techniques adapted to discover the epistasis and aids to design novel automatic epistasis detection systems with strong robustness and maximum efficiency to address the different research problems in finding practical solutions effectively.
Collapse
Affiliation(s)
- R Manavalan
- Department of Computer Science, Arignar Anna Government Arts College, Villupuram, Tamil Nadu, 605602, India.
| | - S Priya
- Computer Science, Arignar Anna Government Arts College, Villupuram, Tamil Nadu, India
| |
Collapse
|
7
|
Zhang S, Guo J, Wang Z. Combing K-means Clustering and Local Weighted Maximum Discriminant Projections for Weed Species Recognition. FRONTIERS IN COMPUTER SCIENCE 2019. [DOI: 10.3389/fcomp.2019.00004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
8
|
Ciulla C, Agyapong G. Intensity-curvature functional based digital high pass filter of the bivariate cubic B-spline model polynomial function. Vis Comput Ind Biomed Art 2019; 2:9. [PMID: 32240391 PMCID: PMC7099544 DOI: 10.1186/s42492-019-0017-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 06/26/2019] [Indexed: 11/10/2022] Open
Abstract
This research addresses the design of intensity-curvature functional (ICF) based digital high pass filter (HPF). ICF is calculated from bivariate cubic B-spline model polynomial function and is called ICF-based HPF. In order to calculate ICF, the model function needs to be second order differentiable and to have non-null classic-curvature calculated at the origin (0, 0) of the pixel coordinate system. The theoretical basis of this research is called intensity-curvature concept. The concept envisions to replace signal intensity with the product between signal intensity and sum of second order partial derivatives of the model function. Extrapolation of the concept in two-dimensions (2D) makes it possible to calculate the ICF of an image. Theoretical treatise is presented to demonstrate the hypothesis that ICF is HPF signal. Empirical evidence then validates the assumption and also extends the comparison between ICF-based HPF and ten different HPFs among which is traditional HPF and particle swarm optimization (PSO) based HPF. Through comparison of image space and k-space magnitude, results indicate that HPFs behave differently. Traditional HPF filtering and ICF-based filtering are superior to PSO-based filtering. Images filtered with traditional HPF are sharper than images filtered with ICF-based filter. The contribution of this research can be summarized as follows: (1) Math description of the constraints that ICF need to obey to in order to function as HPF; (2) Math of ICF-based HPF of bivariate cubic B-spline; (3) Image space comparisons between HPFs; (4) K-space magnitude comparisons between HPFs. This research provides confirmation on the math procedure to use in order to design 2D HPF from a model bivariate polynomial function.
Collapse
Affiliation(s)
- Carlo Ciulla
- Faculty of Information Systems, Visualization, Multimedia, and Animation, University of Information Science and Technology, St. Paul the Apostle, Partizanska B.B., Ohrid, 6000, Republic of North Macedonia.
| | - Grace Agyapong
- Faculty of Communication Networks and Security, University of Information Science and Technology, St. Paul the Apostle, Partizanska B.B., Ohrid, 6000, Republic of North Macedonia
| |
Collapse
|
9
|
Hesami M, Naderi R, Tohidfar M. Modeling and Optimizing in vitro Sterilization of Chrysanthemum via Multilayer Perceptron-Non-dominated Sorting Genetic Algorithm-II (MLP-NSGAII). FRONTIERS IN PLANT SCIENCE 2019; 10:282. [PMID: 30923529 PMCID: PMC6426794 DOI: 10.3389/fpls.2019.00282] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2018] [Accepted: 02/20/2019] [Indexed: 05/26/2023]
Abstract
In vitro sterilization is a primary step of plant tissue culture which the ultimate results of in vitro culture are directly depended on the efficiency of the sterilization. Artificial intelligence models in a combination of optimization algorithms could be beneficial computational approaches for modeling and optimizing in vitro culture. The aim of this study was modeling and optimizing in vitro sterilization of chrysanthemum, as a case study, through Multilayer Perceptron- Non-dominated Sorting Genetic Algorithm-II (MLP-NSGAII). MLP was used for modeling two outputs including contamination frequency (CF), and explant viability (EV) based on seven variables including HgCl2, Ca(ClO)2, Nano-silver, H2O2, NaOCl, AgNO3, and immersion times. Subsequently, models were linked to NSGAII for optimizing the process, and the importance of each input was evaluated by sensitivity analysis. Results showed all of the R2 of training and testing data were over 94%. According to MLP-NSGAII, optimal CF (0%), and EV (99.98%) can be obtained from 1.62% NaOCl at 13.96 min immersion time. The results of sensitivity analysis showed that CF and EV were more sensitive to immersion time and less sensitive to AgNO3. Subsequently, the performance of predicted and optimized sterilants × immersion times combination were tested, and results indicated that the differences between the MLP predicted and validation data were negligible. Generally, MLP-NSGAII as a powerful methodology may pave the way for establishing new computational strategies in plant tissue culture.
Collapse
Affiliation(s)
- Mohsen Hesami
- Department of Horticultural Science, Faculty of Agriculture, University of Tehran, Karaj, Iran
| | - Roohangiz Naderi
- Department of Horticultural Science, Faculty of Agriculture, University of Tehran, Karaj, Iran
| | - Masoud Tohidfar
- Department of Plant Biotechnology, Faculty of Life Science and Biotechnology, Shahid Beheshti University, Tehran, Iran
| |
Collapse
|