1
Maltseva D, Kirillov I, Zhiyanov A, Averinskaya D, Suvorov R, Gubani D, Kudriaeva A, Belogurov A, Tonevitsky A. Incautious design of shRNAs for stable overexpression of miRNAs could result in generation of undesired isomiRs. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2024; 1867:195046. [PMID: 38876159 DOI: 10.1016/j.bbagrm.2024.195046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2023] [Revised: 06/03/2024] [Accepted: 06/06/2024] [Indexed: 06/16/2024]
Abstract
The shRNA-mediated strategy of miRNA overexpression based on RNA Polymerase III (Pol III) expression cassettes is widely used for miRNA functional studies. For some miRNAs, e.g., those encoded in the genome as part of a polycistronic miRNA cluster, it is most likely the only way to achieve individual stable overexpression. Here we show that expression of miRNAs longer than 19 nt (e.g., the 23-nt hsa-miR-93-5p) using such an approach can be accompanied by undesired predominant generation of 5'-end miRNA isoforms (5'-isomiRs). Extra U residues (up to five) added by Pol III at the 3' end of the transcribed shRNA during transcription termination could cause a shift in the Dicer cleavage position of the shRNA. This results in the formation of 5'-isomiRs, which have a significantly altered seed region compared to the initially encoded canonical hsa-miR-93-5p. We demonstrated that the commonly used qPCR method is insensitive to the formation of 5'-isomiRs and cannot be used to confirm miRNA overexpression. However, the predominant expression of 5'-isomiRs lacking the first three or four nucleotides, instead of the canonical isoform, could be detected by miRNA-Seq analysis. Moreover, mRNA sequencing data showed that the 5'-isomiRs of hsa-miR-93-5p presumably regulate their own mRNA targets. Thus, omitting miRNA-Seq analysis may lead to erroneous conclusions regarding the identified mRNA targets and the possible molecular mechanisms in which the studied miRNA is involved. Overall, the presented results show that the structures of shRNAs for stable overexpression of miRNAs require careful design to avoid generation of undesired 5'-isomiRs.
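As a rough illustration of why losing the first three or four nucleotides matters, the sketch below recomputes the seed region after 5' trimming. It assumes the seed is nucleotides 2-8 of the mature miRNA and uses a 23-nt sequence for hsa-miR-93-5p that should be checked against miRBase before reuse; the helper names `seed` and `trim_5p` are hypothetical and not taken from the paper.

```python
# Assumed 23-nt sequence of hsa-miR-93-5p; verify against miRBase before reuse.
CANONICAL = "CAAAGUGCUGUUCGUGCAGGUAG"


def seed(mirna: str) -> str:
    """Return the seed region, taken here as nucleotides 2-8 (1-based)."""
    return mirna[1:8]


def trim_5p(mirna: str, n: int) -> str:
    """Model a 5'-isomiR that lacks the first n nucleotides."""
    return mirna[n:]


# Seeds of the canonical form and of the isoforms lacking 3 or 4 nucleotides.
seeds = {shift: seed(trim_5p(CANONICAL, shift)) for shift in (0, 3, 4)}

for shift, s in seeds.items():
    print(f"5' trim of {shift} nt -> seed {s}")
```

Under these assumptions the three seeds are all different, which is consistent with the abstract's statement that the 5'-isomiRs presumably regulate their own targets.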
Affiliation(s)
- Diana Maltseva
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Ivan Kirillov
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Anton Zhiyanov
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Daria Averinskaya
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Roman Suvorov
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Daria Gubani
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia
- Anna Kudriaeva
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
- Alexey Belogurov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
- Alexander Tonevitsky
- Faculty of Biology and Biotechnology, National Research University Higher School of Economics, Moscow 101000, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia; Art Photonics GmbH, Berlin 12489, Germany
2
Wong LL, Fadzil AB, Chen Q, Rademaker MT, Charles CJ, Richards AM, Wang P. Interrogating the Role of miR-125b and Its 3'isomiRs in Protection against Hypoxia. Int J Mol Sci 2023; 24:16015. [PMID: 37958999 PMCID: PMC10650460 DOI: 10.3390/ijms242116015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/13/2023] [Revised: 11/01/2023] [Accepted: 11/03/2023] [Indexed: 11/15/2023] Open
Abstract
MiR-125b has therapeutic potential in the amelioration of myocardial ischemic injury. MicroRNA isomiRs, with either 5' or 3' addition or deletion of nucleotide(s), have been reported from next-generation sequencing (NGS) data. However, due to technical challenges, validation and functional studies of isomiRs are few. In this study, using NGS, we discovered four 3'isomiRs of miR-125b in rat and sheep heart, i.e., addition of A (adenosine), along with deletions of A, AG (guanosine) and AGU (uridine). These findings were validated using RT-qPCR. Comprehensive functional studies were carried out in the H9C2 hypoxia model. After transfection with mimics of miR-125b or of its Plus A, Trim A, Trim AG and Trim AGU isomiRs, the H9C2 cells were subjected to hypoxic challenge. As assessed using cell viability, apoptosis, CCK-8 and LDH release, miR-125b and isomiRs were all protective against hypoxia. However, Plus A and Trim A were more effective than miR-125b, whilst Trim AG and Trim AGU had far weaker effects than miR-125b. Interestingly, both the gene regulation profile and apoptotic gene validation indicated a major overlap among miR-125b, Plus A and Trim A, whilst Trim AG and Trim AGU revealed a different profile compared to miR-125b. Conclusions: miR-125b and its 3' isomiRs are expressed stably in the heart. miR-125b and isomiRs with addition or deletion of A might function concurrently and concordantly under specific physiological and pathophysiological conditions. In-depth understanding of isomiRs' metabolism and function will contribute to better miRNA therapeutic drug design.
Affiliation(s)
- Lee Lee Wong
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117599, Singapore
- Azizah Binti Fadzil
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117599, Singapore
- Qiying Chen
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117599, Singapore
- Miriam T. Rademaker
- Christchurch Heart Institute, Department of Medicine, University of Otago-Christchurch, Christchurch P.O. Box 4345, New Zealand
- Christopher J. Charles
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Christchurch Heart Institute, Department of Medicine, University of Otago-Christchurch, Christchurch P.O. Box 4345, New Zealand
- Arthur Mark Richards
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117599, Singapore
- Christchurch Heart Institute, Department of Medicine, University of Otago-Christchurch, Christchurch P.O. Box 4345, New Zealand
- Peipei Wang
- Cardiovascular Research Institute, National University Health System, Singapore 117599, Singapore
- Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117599, Singapore
3
Lausten MA, Boman BM. A Review of IsomiRs in Colorectal Cancer. Noncoding RNA 2023; 9:34. [PMID: 37368334 DOI: 10.3390/ncrna9030034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2023] [Revised: 05/26/2023] [Accepted: 06/02/2023] [Indexed: 06/28/2023] Open
Abstract
As sequencing technology continues to advance rapidly, a new classification of microRNAs has emerged with the discovery of isomiRs, which are relatively common microRNAs with sequence variations compared to their established template microRNAs. This review article seeks to compile all known information about isomiRs in colorectal cancer (CRC), which has not, to our knowledge, been gathered previously to any great extent. A brief overview is given of the history of microRNAs, their implications in colon cancer, the canonical pathway of biogenesis and isomiR classification. This is followed by a comprehensive review of the available literature on microRNA isoforms in CRC. The information presented herein shows that isomiRs hold great promise for translation into new diagnostics and therapeutics in clinical medicine.
Affiliation(s)
- Molly A Lausten
- Cawley Center for Translational Cancer Research, Helen F. Graham Cancer Center & Research Institute, Newark, DE 19713, USA
- Department of Biological Sciences, University of Delaware, Newark, DE 19713, USA
- Bruce M Boman
- Cawley Center for Translational Cancer Research, Helen F. Graham Cancer Center & Research Institute, Newark, DE 19713, USA
- Department of Biological Sciences, University of Delaware, Newark, DE 19713, USA
- Department of Pharmacology & Experimental Therapeutics, Thomas Jefferson University, Philadelphia, PA 19107, USA
4
Zhiyanov A, Engibaryan N, Nersisyan S, Shkurnikov M, Tonevitsky A. Differential co-expression network analysis with DCoNA reveals isomiR targeting aberrations in prostate cancer. Bioinformatics 2023; 39:6998206. [PMID: 36688696 PMCID: PMC9901399 DOI: 10.1093/bioinformatics/btad051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Revised: 01/10/2023] [Accepted: 01/22/2023] [Indexed: 01/24/2023] Open
Abstract
MOTIVATION One of the standard methods of high-throughput RNA sequencing analysis is differential expression. However, it does not detect changes in molecular regulation. In contrast to standard differential expression analysis, differential co-expression analysis aims to detect pairs or clusters whose mutual expression changes between two conditions. RESULTS We developed Differential Co-expression Network Analysis (DCoNA), an open-source statistical tool that allows one to identify pairwise interactions whose correlation changes significantly between two conditions. Comparing DCoNA with the state-of-the-art analog, we showed that DCoNA is a faster, more accurate and less memory-consuming tool. We applied DCoNA to prostate mRNA/miRNA-seq data collected from The Cancer Genome Atlas (TCGA) and compared predicted regulatory interactions of miRNA isoforms (isomiRs) and their target mRNAs between normal and cancer samples. As a result, almost all highly expressed isomiRs lost negative correlation with their targets in prostate cancer samples compared to samples without the pathology. One exception to this trend was the canonical isomiR of hsa-miR-93-5p, which acquired cancer-specific targets. Further analysis showed that cancer aggressiveness increased with the expression level of this isomiR both in TCGA primary tumor samples and in 153 blood plasma samples from the P. Hertsen Moscow Oncology Research Institute patient cohort analyzed by miRNA microarrays. AVAILABILITY AND IMPLEMENTATION Source code and documentation of DCoNA are available at https://github.com/zhiyanov/DCoNA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
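The idea of testing whether a correlation changes between two conditions can be sketched with a textbook Fisher z-transformation test. This is a generic illustration only and is not DCoNA's implementation or API (the abstract does not specify its statistic); the function names and the example numbers are assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def fisher_z_diff(r1, n1, r2, n2):
    """Two-sided test for a difference between two independent correlations.

    Returns the z statistic and its p-value under a normal approximation.
    Requires |r| < 1 and sample sizes above 3.
    """
    z1, z2 = math.atanh(r1), math.atanh(r2)
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Hypothetical isomiR-target pair: strong negative correlation in normal
# tissue (r = -0.6, n = 50) that weakens in tumors (r = -0.1, n = 50).
z, p = fisher_z_diff(-0.6, 50, -0.1, 50)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A negative z with a small p-value here would indicate that the pair was more strongly anti-correlated in the first condition, which is the kind of "lost negative correlation" pattern the abstract describes.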
Affiliation(s)
- Anton Zhiyanov
- Faculty of Biology and Biotechnology, HSE University, Moscow 101000, Russia
- Narek Engibaryan
- Faculty of Biology and Biotechnology, HSE University, Moscow 101000, Russia
- Stepan Nersisyan
- Institute of Molecular Biology, The National Academy of Sciences of the Republic of Armenia, Yerevan 0014, Armenia; Armenian Bioinformatics Institute (ABI), Yerevan, Armenia
- Maxim Shkurnikov
- Faculty of Biology and Biotechnology, HSE University, Moscow 101000, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia; P. Hertsen Moscow Oncology Research Institute, National Center of Medical Radiological Research, Moscow 125284, Russia
- Alexander Tonevitsky
- Faculty of Biology and Biotechnology, HSE University, Moscow 101000, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia; Art Photonics GmbH, Berlin 12489, Germany
5
Nersisyan S, Gorbonos A, Makhonin A, Zhiyanov A, Shkurnikov M, Tonevitsky A. isomiRTar: a comprehensive portal of pan-cancer 5'-isomiR targeting. PeerJ 2022; 10:e14205. [PMID: 36275459 PMCID: PMC9583861 DOI: 10.7717/peerj.14205] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/28/2022] [Accepted: 09/19/2022] [Indexed: 01/24/2023] Open
Abstract
Inaccurate cleavage of pri- and pre-miRNA hairpins by Drosha and Dicer results in the generation of miRNA isoforms known as isomiRs. isomiRs with 5'-end variations (5'-isomiRs) create a new dimension in miRNA research since they have different seed regions and distinct targetomes. We developed isomiRTar (https://isomirtar.hse.ru), a comprehensive portal that allows one to analyze expression profiles and targeting activity of 5'-isomiRs in cancer. Using The Cancer Genome Atlas sequencing data, we compiled a list of 1022 5'-isomiRs expressed in 9282 tumor samples across 31 cancer types. Sequences of these isomiRs were used to predict target genes with miRDB and TargetScan. The putative interactions were then subjected to co-expression analysis in each cancer type to identify isomiR-target pairs supported by significant negative correlations. Downstream analysis of the data deposited in isomiRTar revealed both cancer-specific and cancer-conserved 5'-isomiR expression landscapes. Pairs of isomiRs differing by a one-nucleotide shift at the 5'-end had poorly overlapping targetomes, with a median Jaccard index of 0.06. The analysis of colorectal cancer 5'-isomiR-mediated regulatory networks revealed promising candidate tumor suppressor isomiRs: hsa-miR-203a-3p-+1, hsa-miR-192-5p-+1 and hsa-miR-148a-3p-0. In summary, we believe that isomiRTar will help researchers find novel mechanisms of isomiR-mediated gene silencing in different types of cancer.
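For readers unfamiliar with the overlap measure quoted in this abstract, the Jaccard index of two target sets is the size of their intersection divided by the size of their union. A minimal sketch follows; the gene names are placeholders invented for illustration, not data from isomiRTar.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; defined here as 0.0 for two empty sets."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Placeholder targetomes of a canonical miRNA and its +1 5'-isomiR.
targets_canonical = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
targets_shifted = {"GENE_D", "GENE_E", "GENE_F"}

overlap = jaccard(targets_canonical, targets_shifted)
print(f"Jaccard index: {overlap:.3f}")
```

A value near 0, such as the reported median of 0.06, means the two isoforms share only a small fraction of their combined predicted targets.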
Affiliation(s)
- Stepan Nersisyan
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia; Institute of Molecular Biology, The National Academy of Sciences of the Republic of Armenia, Yerevan, Armenia; Armenian Bioinformatics Institute (ABI), Yerevan, Armenia; Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Alexey Makhonin
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Anton Zhiyanov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia; Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Maxim Shkurnikov
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Alexander Tonevitsky
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia; Faculty of Biology and Biotechnology, HSE University, Moscow, Russia; Art Photonics GmbH, Berlin, Germany
6
Askari H, Raeis-Abdollahi E, Abazari MF, Akrami H, Vakili S, Savardashtaki A, Tajbakhsh A, Sanadgol N, Azarnezhad A, Rahmati L, Abdullahi PR, Zare Karizi S, Safarpour AR. Recent findings on the role of microRNAs in genetic kidney diseases. Mol Biol Rep 2022; 49:7039-7056. [PMID: 35717474 DOI: 10.1007/s11033-022-07620-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 01/31/2022] [Accepted: 05/19/2022] [Indexed: 10/18/2022]
Abstract
BACKGROUND MicroRNAs (miRNAs) are non-coding, endogenous, single-stranded, small (21-25 nucleotides) RNAs. Various target genes at the post-transcriptional stage are modulated by miRNAs that are involved in the regulation of a variety of biological processes such as embryonic development, differentiation, proliferation, apoptosis, inflammation, and metabolic homeostasis. Abnormal miRNA expression is strongly associated with the pathogenesis of multiple common human diseases including cardiovascular diseases, cancer, hepatitis, and metabolic diseases. METHODS AND RESULTS Various signaling pathways including transforming growth factor-β, apoptosis, and Wnt signaling pathways have also been characterized to play an essential role in kidney diseases. Most importantly, miRNA-targeted pharmaceutical manipulation has represented a promising new therapeutic approach against kidney diseases. Furthermore, miRNAs such as miR-30e-5p, miR-98-5p, miR-30d-5p, miR-30a-5p, miR-194-5p, and miR-192-5p may be potentially employed as biomarkers for various human kidney diseases. CONCLUSIONS A significant correlation has also been found between some miRNAs and the clinical markers of renal function like baseline estimated glomerular filtration rate (eGFR). Classification of miRNAs in different genetic renal disorders may promote discoveries in developing innovative therapeutic interventions and treatment tools. Herein, the recent advances in miRNAs associated with renal pathogenesis, emphasizing genetic kidney diseases and development, have been summarized.
Affiliation(s)
- Hassan Askari
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Ehsan Raeis-Abdollahi
- Applied Physiology Research Center, Qom Medical Sciences, Islamic Azad University, Qom, Iran; Department of Basic Medical Sciences, Faculty of Medicine, Qom Medical Sciences, Islamic Azad University, Qom, Iran
- Mohammad Foad Abazari
- Research Center for Clinical Virology, Tehran University of Medical Sciences, Tehran, Iran
- Hassan Akrami
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Sina Vakili
- Infertility Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Amir Savardashtaki
- Infertility Research Center, Shiraz University of Medical Sciences, Shiraz, Iran; Department of Medical Biotechnology, School of Advanced Medical Sciences and Technologies, Shiraz University of Medical Sciences, Shiraz, Iran
- Amir Tajbakhsh
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Nima Sanadgol
- Institute of Neuroanatomy, RWTH University Hospital Aachen, 52074, Aachen, Germany
- Asaad Azarnezhad
- Liver and Digestive Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
- Leila Rahmati
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Payman Raise Abdullahi
- Neuroscience Research Center, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Shohreh Zare Karizi
- Department of Biology, Varamin Pishva Branch, Islamic Azad University, Pishva, Varamin, Iran
- Ali Reza Safarpour
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
7
Nersisyan SA. Isoforms of miR-148a and miR-203a are putative suppressors of colorectal cancer. BULLETIN OF RUSSIAN STATE MEDICAL UNIVERSITY 2022. [DOI: 10.24075/brsmu.2022.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]
Abstract
MicroRNAs are short non-coding molecules which regulate translation in a gene-specific manner. MicroRNA isoforms that differ by a few extra or missing nucleotides at the 5'-terminus (5'-isomiRs) show strikingly different target specificity. This study aimed to identify functional roles of 5'-isomiRs in colorectal cancers. Transcriptomic targets of microRNA isoforms were predicted using the bioinformatics tools miRDB and TargetScan. The sets of putative targets identified for 5'-isomiRs were integrated with mRNA and microRNA sequencing data for primary colorectal tumors retrieved from The Cancer Genome Atlas Colon Adenocarcinoma (TCGA-COAD) database. The network of interactions among miRNAs, their targets and transcription factors was built using the miRGTF-net algorithm. The results indicate that microRNA isoforms highly expressed in colorectal cancer and differing by a single nucleotide position at the 5'-terminus share ≤ 30% of their targets. The regulatory network of interactions enables identification of the most engaged microRNA isoforms. Anti-correlated expression levels of the canonical microRNA hsa-miR-148a-3p and its putative targets, including the proliferation regulators CSF1, ETS1, FLT1, ITGA5, MEIS1, MITF and RUNX2, suggest an anti-tumor role for this molecule. The canonical microRNA hsa-miR-203a-3p|0 and its 5'-isoform bind different sets of anti-correlated putative targets, although both of them interact with genes involved in the epithelial-mesenchymal transition: SNAI2 and TNC.
Affiliation(s)
- SA Nersisyan
- National Research University Higher School of Economics (HSE), Moscow, Russia
8
Raigorodskaya MP, Zhiyanov AP, Averinskaya DA, Tonevitsky EA. Changes in the Expression of miRNA Isoforms and Their Targets in HT-29 Cells after Hypoxic Exposure. Bull Exp Biol Med 2022; 173:123-127. [PMID: 35624351 DOI: 10.1007/s10517-022-05506-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2021] [Indexed: 11/28/2022]
Abstract
Tumor hypoxia is one of the main causes of progression and metastasis of colorectal cancer. Changes in the expression of miRNAs, which are responsible for post-transcriptional regulation of gene expression, are an important molecular mechanism of the cell response to hypoxia. We performed sequencing of miRNA and mRNA of human colorectal adenocarcinoma HT-29 cells treated with two chemical agents mimicking hypoxia: cobalt (II) chloride and oxyquinoline. Bioinformatics analysis revealed differentially expressed miRNA isoforms (hsa-miR-210-3p|0, hsa-miR-22-3p|0, hsa-let-7a-3p|0, hsa-miR-615-3p|0, and hsa-miR-4521|0) and their targets that changed their expression in both models of hypoxia. Thus, we identified new regulatory mechanisms of the cell response to hypoxia.
Affiliation(s)
- M P Raigorodskaya
- Faculty of Biology and Biotechnologies, Higher School of Economics (HSE University), Moscow, Russia
- A P Zhiyanov
- Faculty of Biology and Biotechnologies, Higher School of Economics (HSE University), Moscow, Russia
- D A Averinskaya
- Faculty of Biology and Biotechnologies, Higher School of Economics (HSE University), Moscow, Russia
- E A Tonevitsky
- Faculty of Biology and Biotechnologies, Higher School of Economics (HSE University), Moscow, Russia
9
Nersisyan S, Novosad V, Galatenko A, Sokolov A, Bokov G, Konovalov A, Alekseev D, Tonevitsky A. ExhauFS: exhaustive search-based feature selection for classification and survival regression. PeerJ 2022; 10:e13200. [PMID: 35378930 PMCID: PMC8976470 DOI: 10.7717/peerj.13200] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/20/2022] [Accepted: 03/09/2022] [Indexed: 01/12/2023] Open
Abstract
Feature selection is one of the main techniques used to prevent overfitting in machine learning applications. The most straightforward approach to feature selection is an exhaustive search: one can go over all possible feature combinations and pick the model with the highest accuracy. This method, together with its optimizations, has been actively used in biomedical research; however, a publicly available implementation is missing. We present ExhauFS, a user-friendly command-line implementation of the exhaustive search approach for classification and survival regression. Aside from the tool description, we included three application examples in the manuscript to comprehensively review the implemented functionality. First, we executed ExhauFS on a toy cervical cancer dataset to illustrate basic concepts. Then, multi-cohort microarray breast cancer datasets were used to construct gene signatures for 5-year recurrence classification. The vast majority of signatures constructed by ExhauFS passed the 0.65 threshold of sensitivity and specificity on all datasets, including the validation one. Moreover, a number of gene signatures demonstrated reliable performance on an independent RNA-seq dataset without any coefficient re-tuning, i.e., turned out to be cross-platform. Finally, Cox survival regression models were used to fit isomiR signatures for overall survival prediction for patients with colorectal cancer. Similarly to the previous example, the major part of the models passed the pre-defined concordance index threshold of 0.65 on all datasets. In both real-world scenarios (breast and colorectal cancer datasets), ExhauFS was benchmarked against state-of-the-art feature selection models, including L1-regularized sparse models. In the case of breast cancer, we were unable to construct reliable cross-platform classifiers using alternative feature selection approaches. In the case of colorectal cancer, not a single model passed the same 0.65 threshold.
Source code and documentation of ExhauFS are available on GitHub: https://github.com/s-a-nersisyan/ExhauFS.
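The exhaustive search idea described in this abstract, trying every feature combination of a given size and keeping the best-scoring one, can be sketched in a few lines. This is a toy illustration and not ExhauFS's code or command-line interface: the nearest-centroid classifier, the leave-one-out scoring, the function names and the data are all assumptions chosen to keep the example self-contained.

```python
from itertools import combinations


def centroid(rows):
    """Column-wise mean of a list of equal-length numeric rows."""
    return [sum(col) / len(col) for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a nearest-centroid classifier on the chosen features."""
    correct = 0
    for i in range(len(X)):
        cents = {}
        for label in set(y):
            rows = [[X[j][f] for f in features]
                    for j in range(len(X)) if j != i and y[j] == label]
            cents[label] = centroid(rows)
        point = [X[i][f] for f in features]
        pred = min(cents, key=lambda lab: sq_dist(point, cents[lab]))
        correct += pred == y[i]
    return correct / len(X)


def exhaustive_search(X, y, k):
    """Score every k-feature subset and return the best (first one on ties)."""
    n_features = len(X[0])
    return max(combinations(range(n_features), k),
               key=lambda fs: loo_accuracy(X, y, fs))


# Toy data: features 0 and 2 separate the classes, feature 1 is noise.
X = [[1.0, 5.0, 0.2], [1.2, 1.0, 0.1], [0.9, 3.0, 0.3],
     [3.0, 4.5, 0.9], [3.1, 1.5, 1.1], [2.9, 3.2, 1.0]]
y = [0, 0, 0, 1, 1, 1]

best_k1 = exhaustive_search(X, y, 1)
best_k2 = exhaustive_search(X, y, 2)
print(best_k1, best_k2)
```

The number of subsets grows combinatorially with the number of features, so in practice such a search is only feasible after pre-filtering to a small candidate pool.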
Affiliation(s)
- Stepan Nersisyan
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Victor Novosad
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia
- Alexei Galatenko
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia; Moscow Center for Fundamental and Applied Mathematics, Moscow, Russia
- Andrey Sokolov
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia; Moscow Center for Fundamental and Applied Mathematics, Moscow, Russia
- Grigoriy Bokov
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia; Moscow Center for Fundamental and Applied Mathematics, Moscow, Russia
- Alexander Konovalov
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia; Moscow Center for Fundamental and Applied Mathematics, Moscow, Russia
- Dmitry Alekseev
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia; Moscow Center for Fundamental and Applied Mathematics, Moscow, Russia
- Alexander Tonevitsky
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia; Institute of Nanotechnologies of Microelectronics RAS, Moscow, Russia
10
Nersisyan S, Novosad V, Engibaryan N, Ushkaryov Y, Nikulin S, Tonevitsky A. ECM-Receptor Regulatory Network and Its Prognostic Role in Colorectal Cancer. Front Genet 2021; 12:782699. [PMID: 34938324 PMCID: PMC8685507 DOI: 10.3389/fgene.2021.782699] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 09/24/2021] [Accepted: 11/05/2021] [Indexed: 12/12/2022] Open
Abstract
Interactions of the extracellular matrix (ECM) and cellular receptors constitute one of the crucial pathways involved in colorectal cancer progression and metastasis. With the use of bioinformatics analysis, we comprehensively evaluated the prognostic information concentrated in the genes from this pathway. First, we constructed an ECM-receptor regulatory network by integrating the transcription factor (TF) and 5'-isomiR interaction databases with mRNA/miRNA-seq data from The Cancer Genome Atlas Colon Adenocarcinoma (TCGA-COAD). Notably, one-third of the interactions mediated by 5'-isomiRs were represented by noncanonical isomiRs (isomiRs whose 5'-end sequence did not match the canonical miRBase version). Then, exhaustive search-based feature selection was used to fit prognostic signatures composed of nodes from the network for overall survival prediction. Two reliable prognostic signatures were identified and validated on the independent The Cancer Genome Atlas Rectum Adenocarcinoma (TCGA-READ) cohort. The first signature was made up of six genes directly involved in ECM-receptor interaction: AGRN, DAG1, FN1, ITGA5, THBS3, and TNC (concordance index 0.61, logrank test p = 0.0164, 3-year ROC AUC = 0.68). The second, hybrid signature was composed of three regulators: hsa-miR-32-5p, NR1H2, and SNAI1 (concordance index 0.64, logrank test p = 0.0229, 3-year ROC AUC = 0.71). While hsa-miR-32-5p exclusively regulated ECM-related genes (COL1A2 and ITGA5), NR1H2 and SNAI1 also targeted other pathways (adhesion, cell cycle, and cell division). Concordant distributions of the respective risk scores across the four stages of colorectal cancer and adjacent normal mucosa additionally confirmed the reliability of the models.
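The concordance index quoted for both signatures measures how often a model ranks patients correctly by risk. A minimal sketch of Harrell's version follows, assuming right-censored data, no tied event times, and a risk score where higher means worse prognosis; the function name and the four-patient example are hypothetical and unrelated to the TCGA cohorts.

```python
def concordance_index(times, events, risk):
    """Harrell's C-index for right-censored survival data.

    A pair (i, j) is comparable when patient i had an observed event
    strictly before patient j's last follow-up time. It is concordant when
    the patient with the earlier event also has the higher risk score;
    tied risk scores count as half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical cohort: follow-up time, event flag (1 = death observed), risk score.
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risk = [0.9, 0.4, 0.5, 0.1]

c_index = concordance_index(times, events, risk)
print(f"C-index: {c_index:.2f}")
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives context for the reported values of 0.61 and 0.64.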
Affiliation(s)
- Stepan Nersisyan
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Victor Novosad
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
- Narek Engibaryan
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Yuri Ushkaryov
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
- Medway School of Pharmacy, University of Kent, Chatham, United Kingdom
- Sergey Nikulin
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- P. Hertsen Moscow Oncology Research Institute - Branch, National Medical Research Radiological Centre, Ministry of Health of Russian Federation, Moscow, Russia
- School of Biomedicine, Far Eastern Federal University, Vladivostok, Russia
- Alexander Tonevitsky
- Faculty of Biology and Biotechnology, HSE University, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
- SRC Bioclinicum, Moscow, Russia