1
|
Yin R, Zhai X, Han H, Tong X, Li Y, Deng K. Characterizing the landscape of cervical squamous cell carcinoma immune microenvironment by integrating the single-cell transcriptomics and RNA-Seq. Immun Inflamm Dis 2022; 10:e608. [PMID: 35634956 PMCID: PMC9091987 DOI: 10.1002/iid3.608] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 03/01/2022] [Accepted: 03/03/2022] [Indexed: 12/21/2022] Open
Abstract
Background Cervical squamous cell carcinoma (CSCC), caused by the infection of high‐risk human papillomavirus, is one of the most common malignancies in women worldwide. Methods RNA expression data, including those from the Cancer Genome Atlas, Gene Expression Omnibus, and Genotype‐Tissue Expression databases, were used to identify the expression of RNAs in normal and tumor tissue. Correlation analysis was performed to identify the immune‐related long noncoding RNAs (IRLs) and hypoxia‐related genes (IRHs) that can influence the activity of the immune system. Prognosis models of immune‐related RNAs (IRRs) were used to construct a coexpression network of the immune system. We identified the role of IRRs in immunotherapy by correlation analysis with immune checkpoint genes (ICGs). We then validated the expression data by integrating two single‐cell sequencing data sets of CSCC to identify the key immune features. Results In total, six immune‐related gene (IRG), four IRL, and five IRH signatures that can significantly influence the characteristics of the tumor immune microenvironment (TIME) were selected using machine learning methods. The expression level of ICGs was significantly upregulated in GZMB+CD8+ T‐cells and tumor‐associated macrophages (TAMs) in tumor tissues. TGFBI+ TAMs are a kind of blood‐derived monocyte‐derived M0‐like TAM linked to hypoxia and a poor prognosis. IFI30+ M1‐like TAMs participate in the process of immune‐regulation and showed a role in the promotion of CD8+ T‐cells and Type 1 T helper (Th1)/Th2 cells in the coexpression network, together with several IRLs, IRGs, and ICGs. Conclusions CD16+ monocyte‐derived IFI30+ TAMs participated in our coexpression network to regulate the TIME, showing the potential to be a novel immunotherapy target. The enrichment of M0‐like TAMs was associated with a worse prognosis in the high‐risk score group with IRH signatures. Remarkably, M0‐like TAMs in tumor tissues overexpressed TGFBI and were associated with several well‐known tumor‐proliferation pathways.
Collapse
Affiliation(s)
- Ruiling Yin
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Xiuming Zhai
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Hongyan Han
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Xuedong Tong
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Yan Li
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Kun Deng
- Department of Laboratory Medicine, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| |
Collapse
|
2
|
Wang Y, Zhang M, Hu X, Qin W, Wu H, Wei M. Colon cancer-specific diagnostic and prognostic biomarkers based on genome-wide abnormal DNA methylation. Aging (Albany NY) 2020; 12:22626-22655. [PMID: 33202377 PMCID: PMC7746390 DOI: 10.18632/aging.103874] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 07/25/2020] [Indexed: 12/11/2022]
Abstract
Abnormal DNA methylation is a major early contributor to colon cancer (COAD) development. We conducted a cohort-based systematic investigation of genome-wide DNA methylation using 299 COAD and 38 normal tissue samples from TCGA. Through conditional screening and machine learning with a training cohort, we identified one hypomethylated and nine hypermethylated differentially methylated CpG sites as potential diagnostic biomarkers, and used them to construct a COAD-specific diagnostic model. Unlike previous models, our model precisely distinguished COAD from nine other cancer types (e.g., breast cancer and liver cancer; error rate ≤ 0.05) and from normal tissues in the training cohort (AUC = 1). The diagnostic model was verified using a validation cohort from The Cancer Genome Atlas (AUC = 1) and five independent cohorts from the Gene Expression Omnibus (AUC ≥ 0.951). Using Cox regression analyses, we established a prognostic model based on six CpG sites in the training cohort, and verified the model in the validation cohort. The prognostic model sensitively predicted patients’ survival (p ≤ 0.00011, AUC ≥ 0.792) independently of important clinicopathological characteristics of COAD (e.g., gender and age). Thus, our DNA methylation analysis provided precise biomarkers and models for the early diagnosis and prognostic evaluation of COAD.
Collapse
Affiliation(s)
- Yilin Wang
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| | - Ming Zhang
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| | - Xiaoyun Hu
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| | - Wenyan Qin
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| | - Huizhe Wu
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| | - Minjie Wei
- Department of Pharmacology, School of Pharmacy, China Medical University, Shenyang 110122, Liaoning Province, P. R. China.,Liaoning Key Laboratory of Molecular Targeted Anti-Tumor Drug Development and Evaluation, Liaoning Cancer Immune Peptide Drug Engineering Technology Research Center, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors, Ministry of Education, China Medical University, Shenyang 110122, Liaoning Province, P. R. China
| |
Collapse
|
3
|
Valeri JA, Collins KM, Ramesh P, Alcantar MA, Lepe BA, Lu TK, Camacho DM. Sequence-to-function deep learning frameworks for engineered riboregulators. Nat Commun 2020; 11:5058. [PMID: 33028819 PMCID: PMC7541510 DOI: 10.1038/s41467-020-18676-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 09/02/2020] [Indexed: 12/26/2022] Open
Abstract
While synthetic biology has revolutionized our approaches to medicine, agriculture, and energy, the design of completely novel biological circuit components beyond naturally-derived templates remains challenging due to poorly understood design rules. Toehold switches, which are programmable nucleic acid sensors, face an analogous design bottleneck; our limited understanding of how sequence impacts functionality often necessitates expensive, time-consuming screens to identify effective switches. Here, we introduce Sequence-based Toehold Optimization and Redesign Model (STORM) and Nucleic-Acid Speech (NuSpeak), two orthogonal and synergistic deep learning architectures to characterize and optimize toeholds. Applying techniques from computer vision and natural language processing, we 'un-box' our models using convolutional filters, attention maps, and in silico mutagenesis. Through transfer-learning, we redesign sub-optimal toehold sensors, even with sparse training data, experimentally validating their improved performance. This work provides sequence-to-function deep learning frameworks for toehold selection and design, augmenting our ability to construct potent biological circuit components and precision diagnostics.
Collapse
Affiliation(s)
- Jacqueline A Valeri
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Katherine M Collins
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Pradeep Ramesh
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
| | - Miguel A Alcantar
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Bianca A Lepe
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Timothy K Lu
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
- Synthetic Biology Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| | - Diogo M Camacho
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA.
| |
Collapse
|
4
|
Premkumar KAR, Bharanikumar R, Palaniappan A. Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy. Front Bioeng Biotechnol 2020; 8:808. [PMID: 32760712 PMCID: PMC7371854 DOI: 10.3389/fbioe.2020.00808] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 06/23/2020] [Indexed: 01/05/2023] Open
Abstract
Riboswitches are cis-regulatory genetic elements that use an aptamer to control gene expression. Specificity to cognate ligand and diversity of such ligands have expanded the functional repertoire of riboswitches to mediate mounting apt responses to sudden metabolic demands and signal changes in environmental conditions. Given their critical role in microbial life, riboswitch characterisation remains a challenging computational problem. Here we have addressed the issue with advanced deep learning frameworks, namely convolutional neural networks (CNN), and bidirectional recurrent neural networks (RNN) with Long Short-Term Memory (LSTM). Using a comprehensive dataset of 32 ligand classes and a stratified train-validate-test approach, we demonstrated the accurate performance of both the deep learning models (CNN and RNN) relative to conventional hyperparameter-optimized machine learning classifiers on all key performance metrics, including the ROC curve analysis. In particular, the bidirectional LSTM RNN emerged as the best-performing learning method for identifying the ligand-specificity of riboswitches with an accuracy >0.99 and macro-averaged F-score of 0.96. An additional attraction is that the deep learning models do not require prior feature engineering. A dynamic update functionality is built into the models to factor for the constant discovery of new riboswitches, and extend the predictive modeling to new classes. Our work would enable the design of genetic circuits with custom-tuned riboswitch aptamers that would effect precise translational control in synthetic biology. The associated software is available as an open-source Python package and standalone resource for use in genome annotation, synthetic biology, and biotechnology workflows.
Collapse
Affiliation(s)
- Keshav Aditya R. Premkumar
- MS Program in Computer Science, Department of Computer Science, College of Engineering and Applied Sciences, Stony Brook University, Stony Brook, NY, United States
| | - Ramit Bharanikumar
- MS in Bioinformatics, Georgia Institute of Technology, Atlanta, GA, United States
| | - Ashok Palaniappan
- Department of Bioinformatics, School of Chemical and Biotechnology, SASTRA Deemed University, Thanjavur, India
| |
Collapse
|
5
|
Beyene SS, Ling T, Ristevski B, Chen M. A novel riboswitch classification based on imbalanced sequences achieved by machine learning. PLoS Comput Biol 2020; 16:e1007760. [PMID: 32687488 PMCID: PMC7392346 DOI: 10.1371/journal.pcbi.1007760] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 07/30/2020] [Accepted: 05/13/2020] [Indexed: 11/24/2022] Open
Abstract
Riboswitch, a part of regulatory mRNA (50-250nt in length), has two main classes: aptamer and expression platform. One of the main challenges raised during the classification of riboswitch is imbalanced data. That is a circumstance in which the records of a sequences of one group are very small compared to the others. Such circumstances lead classifier to ignore minority group and emphasize on majority ones, which results in a skewed classification. We considered sixteen riboswitch families, to be in accord with recent riboswitch classification work, that contain imbalanced sequences. The sequences were split into training and test set using a newly developed pipeline. From 5460 k-mers (k value 1 to 6) produced, 156 features were calculated based on CfsSubsetEval and BestFirst function found in WEKA 3.8. Statistically tested result was significantly difference between balanced and imbalanced sequences (p < 0.05). Besides, each algorithm also showed a significant difference in sensitivity, specificity, accuracy, and macro F-score when used in both groups (p < 0.05). Several k-mers clustered from heat map were discovered to have biological functions and motifs at the different positions like interior loops, terminal loops and helices. They were validated to have a biological function and some are riboswitch motifs. The analysis has discovered the importance of solving the challenges of majority bias analysis and overfitting. Presented results were generalized evaluation of both balanced and imbalanced models, which implies their ability of classifying, to classify novel riboswitches. The Python source code is available at https://github.com/Seasonsling/riboswitch.
Collapse
Affiliation(s)
- Solomon Shiferaw Beyene
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Tianyi Ling
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
- School of Medicine, Zhejiang University, Hangzhou, Zhejiang, China
| | - Blagoj Ristevski
- Faculty of Information and Communication Technologies, Bitola, St. Kliment Ohridski University Bitola, ul. Partizanska Bitola, Republic of North Macedonia
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
6
|
Golabi F, Mehdizadeh Aghdam E, Shamsi M, Sedaaghi MH, Barzegar A, Hejazi MS. Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method. ACTA ACUST UNITED AC 2020; 11:101-109. [PMID: 33842280 PMCID: PMC8022236 DOI: 10.34172/bi.2021.17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Revised: 03/12/2020] [Accepted: 03/20/2020] [Indexed: 11/09/2022]
Abstract
Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes' mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.
Collapse
Affiliation(s)
- Faegheh Golabi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran.,Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Elnaz Mehdizadeh Aghdam
- Molecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Pharmaceutical Biotechnology, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Mousa Shamsi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran
| | | | - Abolfazl Barzegar
- Research Institute of Bioscience and Biotechnology, University of Tabriz, Tabriz, Iran
| | - Mohammad Saeid Hejazi
- Molecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Pharmaceutical Biotechnology, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
7
|
Golabi F, Shamsi M, Sedaaghi MH, Barzegar A, Hejazi MS. Development of a new oligonucleotide block location-based feature extraction (BLBFE) method for the classification of riboswitches. Mol Genet Genomics 2020; 295:525-534. [PMID: 31901978 DOI: 10.1007/s00438-019-01642-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 12/18/2019] [Indexed: 12/20/2022]
Abstract
As knowledge of genetics and genome elements increases, the demand for the development of bioinformatics tools for analyzing these data is raised. Riboswitches are genetic components, usually located in the untranslated regions of mRNAs, that regulate gene expression. Additionally, their interaction with antibiotics has been recently suggested, implying a role in antibiotic effects and resistance. Following a previously published sequential block finding algorithm, herein, we report the development of a new block location-based feature extraction strategy (BLBFE). This procedure utilizes the locations of family-specific sequential blocks on riboswitch sequences as features. Furthermore, the performance of other feature extraction strategies, including mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods, was investigated. KNN, LDA, naïve Bayes, PNN and decision tree classifiers accompanied by V-fold cross-validation were applied for all methods of feature extraction, and their performances based on the defined feature extraction strategies were compared. Performance measures of accuracy, sensitivity, specificity and F-score for each method of feature extraction were studied. The proposed feature extraction strategy resulted in classification of riboswitches with an average correct classification rate (CCR) of 90.8%. Furthermore, the obtained data confirmed the performance of the developed feature extraction method with an average accuracy of 96.1%, an average sensitivity of 90.8%, an average specificity of 97.52% and an average F-score of 90.69%. Our results implied that the proposed feature extraction (BLBFE) method can classify and discriminate riboswitch families with high CCR, accuracy, sensitivity, specificity and F-score values.
Collapse
Affiliation(s)
- F Golabi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran.,School of Advanced Biomedical Sciences (SABS), Tabriz University of Medical Sciences, Tabriz, Iran
| | - Mousa Shamsi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran.
| | - M H Sedaaghi
- Faculty of Electrical Engineering, Sahand University of Technology, Tabriz, Iran
| | - A Barzegar
- School of Advanced Biomedical Sciences (SABS), Tabriz University of Medical Sciences, Tabriz, Iran.,Research Institute for Fundamental Sciences (RIFS), University of Tabriz, Tabriz, Iran
| | - Mohammad Saeid Hejazi
- Molecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, Iran. .,Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
8
|
Golabi F, Shamsi M, Sedaaghi MH, Barzegar A, Hejazi MS. Classification of Riboswitch Families Using Block Location-Based Feature Extraction (BLBFE) Method. Adv Pharm Bull 2020; 10:97-105. [PMID: 32002367 PMCID: PMC6983983 DOI: 10.15171/apb.2020.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2019] [Revised: 09/04/2019] [Accepted: 09/30/2019] [Indexed: 12/18/2022] Open
Abstract
Purpose: Riboswitches are special non-coding sequences usually located in mRNAs' un-translated regions and regulate gene expression and consequently cellular function. Furthermore, their interaction with antibiotics has been recently implicated. This raises more interest in development of bioinformatics tools for riboswitch studies. Herein, we describe the development and employment of novel block location-based feature extraction (BLBFE) method for classification of riboswitches. Methods: We have already developed and reported a sequential block finding (SBF) algorithm which, without operating alignment methods, identifies family specific sequential blocks for riboswitch families. Herein, we employed this algorithm for 7 riboswitch families including lysine, cobalamin, glycine, SAM-alpha, SAM-IV, cyclic-di-GMP-I and SAH. Then the study was extended toward implementation of BLBFE method for feature extraction. The outcome features were applied in various classifiers including linear discriminant analysis (LDA), probabilistic neural network (PNN), decision tree and k-nearest neighbors (KNN) classifiers for classification of the riboswitch families. The performance of the classifiers was investigated according to performance measures such as correct classification rate (CCR), accuracy, sensitivity, specificity and f-score. Results: As a result, average CCR for classification of riboswitches was 87.87%. Furthermore, application of BLBFE method in 4 classifiers displayed average accuracies of 93.98% to 96.1%, average sensitivities of 76.76% to 83.61%, average specificities of 96.53% to 97.69% and average f-scores of 74.9% to 81.91%. Conclusion: Our results approved that the proposed method of feature extraction; i.e. BLBFE method; can be successfully used for classification and discrimination of the riboswitch families with high CCR, accuracy, sensitivity, specificity and f-score values.
Collapse
Affiliation(s)
- Faegheh Golabi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran
- School of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Mousa Shamsi
- Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, Iran
| | | | - Abolfazl Barzegar
- School of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
- Research Institute for Fundamental Sciences (RIFS), University of Tabriz, Tabriz, Iran
| | - Mohammad Saeid Hejazi
- Molecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, Iran
- Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
9
|
Groher AC, Jager S, Schneider C, Groher F, Hamacher K, Suess B. Tuning the Performance of Synthetic Riboswitches using Machine Learning. ACS Synth Biol 2019; 8:34-44. [PMID: 30513199 DOI: 10.1021/acssynbio.8b00207] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Riboswitch development for clinical, technological, and synthetic biology applications constantly seeks to optimize regulatory behavior. Here, we present a machine learning approach to improve the regulation of a tetracycline (tc)-dependent riboswitch device composed of two individual tc aptamers. We developed a bioinformatics model that combines random forest analysis with a convolutional neural network to predict the switching behavior of such tandem riboswitches. We found that both biophysical parameters and the hydrogen bond pattern influence regulation. Our new design pipeline led to significant improvement of the tc riboswitch device with a dynamic range extension from 8.5 to 40-fold. We are confident that our novel method not only results in an excellent tc-dependent riboswitch device but further holds great promise and potential for the optimization of other riboswitches.
Collapse
|