1
|
Soleimani M, Harooni A, Erfani N, Khan AR, Saba T, Bahaj SA. Classification of cancer types based on microRNA expression using a hybrid radial basis function and particle swarm optimization algorithm. Microsc Res Tech 2024; 87:1052-1062. [PMID: 38230557 DOI: 10.1002/jemt.24492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 11/27/2023] [Accepted: 12/25/2023] [Indexed: 01/18/2024]
Abstract
The diagnosis and treatment of cancer is one of the most challenging aspects of the medical profession, despite advances in disease diagnosis. MicroRNAs are small noncoding RNA molecules involved in regulating gene expression and are associated with several cancer types. Therefore, the analysis of microRNA data has become one of the most important areas of cancer research in recent years. This paper presents an improved method for cancer-type classification based on microRNA expression data using a hybrid radial basis function (RBF) and particle swarm optimization (PSO) algorithm. Two datasets containing microRNA information were used, and preprocessing and normalization operations were performed on the raw data. Feature selection was carried out by using the PSO algorithm, which can identify the most relevant and informative features in the data along with helping to prioritize them. Using a PSO algorithm for feature selection is an effective approach to microRNA analysis. This enhances the accuracy and reliability of cancer-type classifications based on microRNA expression data. In the proposed method, we, respectively, achieved an accuracy of 0.95% and 0.91% on both datasets, with an average of 0.93%, using an improved RBF neural network classifier. These results demonstrate that the proposed method outperforms previous works. RESEARCH HIGHLIGHTS: To enhance the accuracy of cancer-type classifications based on microRNA expression data. We present a minimal feature selection method using particle swarm optimization to reduce computational load & radial basis function to improve accuracy.
Collapse
Affiliation(s)
- Masoumeh Soleimani
- Department of Mathematics and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| | - Aryan Harooni
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| | - Nasim Erfani
- Department of Computer Engineering, Dolatabad Branch, Islamic Azad University, Isfahan, Iran
| | - Amjad Rehman Khan
- Artificial Intelligence & Data Analytics Lab CCIS Prince Sultan University, Riyadh, Saudi Arabia
| | - Tanzila Saba
- Artificial Intelligence & Data Analytics Lab CCIS Prince Sultan University, Riyadh, Saudi Arabia
| | - Saeed Ali Bahaj
- MIS Department College of Business Administration, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
| |
Collapse
|
2
|
Mármol-Sánchez E, Cirera S, Quintanilla R, Pla A, Amills M. Discovery and annotation of novel microRNAs in the porcine genome by using a semi-supervised transductive learning approach. Genomics 2019; 112:2107-2118. [PMID: 31816430 DOI: 10.1016/j.ygeno.2019.12.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Revised: 11/13/2019] [Accepted: 12/05/2019] [Indexed: 12/15/2022]
Abstract
Despite the broad variety of available microRNA (miRNA) prediction tools, their application to the discovery and annotation of novel miRNA genes in domestic species is still limited. In this study we designed a comprehensive pipeline (eMIRNA) for miRNA identification in the yet poorly annotated porcine genome and demonstrated the usefulness of implementing a motif search positional refinement strategy for the accurate determination of precursor miRNA boundaries. The small RNA fraction from gluteus medius skeletal muscle of 48 Duroc gilts was sequenced and used for the prediction of novel miRNA loci. Additionally, we selected the human miRNA annotation for a homology-based search of porcine miRNAs with orthologous genes in the human genome. A total of 20 novel expressed miRNAs were identified in the porcine muscle transcriptome and 27 additional novel porcine miRNAs were also detected by homology-based search using the human miRNA annotation. The existence of three selected novel miRNAs (ssc-miR-483, ssc-miR484 and ssc-miR-200a) was further confirmed by reverse transcription quantitative real-time PCR analyses in the muscle and liver tissues of Göttingen minipigs. In summary, the eMIRNA pipeline presented in the current work allowed us to expand the catalogue of porcine miRNAs and showed better performance than other commonly used miRNA prediction approaches. More importantly, the flexibility of our pipeline makes possible its application in other yet poorly annotated non-model species.
Collapse
Affiliation(s)
- Emilio Mármol-Sánchez
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain.
| | - Susanna Cirera
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Grønnegårdsvej 3, 2nd Floor, 1870 Frederiksberg C, Denmark
| | - Raquel Quintanilla
- Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
| | - Albert Pla
- Department of Medical Genetics, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Marcel Amills
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain; Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| |
Collapse
|
3
|
An Approach to Identify Individual Functional Single Nucleotide Polymorphisms and Isoform MicroRNAs. BIOMED RESEARCH INTERNATIONAL 2019; 2019:6193673. [PMID: 31467902 PMCID: PMC6699389 DOI: 10.1155/2019/6193673] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 05/28/2019] [Accepted: 06/24/2019] [Indexed: 01/08/2023]
Abstract
MicroRNAs (miRNAs) and single nucleotide polymorphisms (SNPs) play important roles in disease risk and development, especially cancer. Importantly, when SNPs are located in pre-miRNAs, they affect their splicing mechanism and change the function of miRNAs. To improve disease risk assessment, we propose an approach and developed a software tool, IsomiR_Find, to identify disease/phenotype-related SNPs and isomiRs in individuals. Our approach is based on the individual's samples, with SNP information extracted from the 1000 Genomes Project. SNPs were mapped to pre-miRNAs based on whole-genome coordinates and then SNP-pre-miRNA sequences were constructed. Moreover, we developed matpred2, a software tool to identify the four splicing sites of mature miRNAs. Using matpred2, we identified isomiRs and then verified them by searching within individual miRNA sequencing data. Our approach yielded biomarkers for biological experiments, mined functions of miRNAs and SNPs, improved disease risk assessment, and provided a way to achieve individualized precision medicine.
Collapse
|
4
|
Delsin LEA, Salomao KB, Pezuk JA, Brassesco MS. Expression profiles and prognostic value of miRNAs in retinoblastoma. J Cancer Res Clin Oncol 2018; 145:1-10. [PMID: 30350021 DOI: 10.1007/s00432-018-2773-7] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 10/16/2018] [Indexed: 01/01/2023]
Abstract
Current cure rates for retinoblastoma (RB) are very high in developed countries. Nonetheless, in less privileged places worldwide, delayed diagnosis and refusal to adhere to treatment still endure an obstacle to improve overall patient survival. Thus, the access to consistent biomarkers for diagnosis at an earlier stage may facilitate treatment and improve outcomes. Over recent years, much attention has been focused on miRNAs, key post-transcriptional regulators that when altered, largely contribute to carcinogenesis and tumor progression. Many of the ~ 2500 microRNAs described in humans have shown differential expression profiles in tumors. In this review, we summarize current data about the roles of miRNAs in RB along with their value as diagnostic/prognostic factors using electronic databases such as PubMed. We reviewed the importance of miRNA in RB biology and discussed their implications in clinic intervention. Several miRNAs have pointed out reliable diagnostic and prognostic molecular biomarkers. The emergence of targeted therapies has significantly improved cancer treatment. In the near future, the modulation of miRNAs will represent a good treatment strategy.
Collapse
Affiliation(s)
| | | | - Julia Alejandra Pezuk
- Anhanguera University of Sao Paulo, UNIAN, Av. Raimundo Pereira de Magalhaes 3305, Sao Paulo, SP, CEP 05145-200, Brazil.
| | - Maria Sol Brassesco
- Faculty of Philosophy, Sciences and Letters at Ribeirao Preto, University of Sao Paulo, Sao Paulo, Brazil
| |
Collapse
|
5
|
Yu CY, Li XX, Yang H, Li YH, Xue WW, Chen YZ, Tao L, Zhu F. Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate. Int J Mol Sci 2018; 19:E183. [PMID: 29316706 PMCID: PMC5796132 DOI: 10.3390/ijms19010183] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Revised: 12/09/2017] [Accepted: 01/04/2018] [Indexed: 12/27/2022] Open
Abstract
The function of a protein is of great interest in the cutting-edge research of biological mechanisms, disease development and drug/target discovery. Besides experimental explorations, a variety of computational methods have been designed to predict protein function. Among these in silico methods, the prediction of BLAST is based on protein sequence similarity, while that of machine learning is also based on the sequence, but without the consideration of their similarity. This unique characteristic of machine learning makes it a good complement to BLAST and many other approaches in predicting the function of remotely relevant proteins and the homologous proteins of distinct function. However, the identification accuracies of these in silico methods and their false discovery rate have not yet been assessed so far, which greatly limits the usage of these algorithms. Herein, a comprehensive comparison of the performances among four popular prediction algorithms (BLAST, SVM, PNN and KNN) was conducted. In particular, the performance of these methods was systematically assessed by four standard statistical indexes based on the independent test datasets of 93 functional protein families defined by UniProtKB keywords. Moreover, the false discovery rates of these algorithms were evaluated by scanning the genomes of four representative model organisms (Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Mycobacterium tuberculosis). As a result, the substantially higher sensitivity of SVM and BLAST was observed compared with that of PNN and KNN. However, the machine learning algorithms (PNN, KNN and SVM) were found capable of substantially reducing the false discovery rate (SVM < PNN < KNN). In sum, this study comprehensively assessed the performance of four popular algorithms applied to protein function prediction, which could facilitate the selection of the most appropriate method in the related biomedical research.
Collapse
Affiliation(s)
- Chun Yan Yu
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Xiao Xu Li
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Hong Yang
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Ying Hong Li
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Wei Wei Xue
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
| | - Yu Zong Chen
- Bioinformatics and Drug Design Group, Department of Pharmacy, and Center for Computational Science and Engineering, National University of Singapore, Singapore 117543, Singapore.
| | - Lin Tao
- School of Medicine, Hangzhou Normal University, Hangzhou 310012, China.
| | - Feng Zhu
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|