1
|
Long R, Guo Z, Han D, Liu B, Yuan X, Chen G, Heng PA, Zhang L. siRNADiscovery: a graph neural network for siRNA efficacy prediction via deep RNA sequence analysis. Brief Bioinform 2024; 25:bbae563. [PMID: 39503523 PMCID: PMC11539000 DOI: 10.1093/bib/bbae563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 09/28/2024] [Accepted: 10/21/2024] [Indexed: 11/08/2024] Open
Abstract
The clinical adoption of small interfering RNAs (siRNAs) has prompted the development of various computational strategies for siRNA design, from traditional data analysis to advanced machine learning techniques. However, previous studies have inadequately considered the full complexity of the siRNA silencing mechanism, neglecting critical elements such as siRNA positioning on mRNA, RNA base-pairing probabilities, and RNA-AGO2 interactions, thereby limiting the insight and accuracy of existing models. Here, we introduce siRNADiscovery, a Graph Neural Network (GNN) framework that leverages both non-empirical and empirical rule-based features of siRNA and mRNA to effectively capture the complex dynamics of gene silencing. On multiple internal datasets, siRNADiscovery achieves state-of-the-art performance. Significantly, siRNADiscovery also outperforms existing methodologies in in vitro studies and on an externally validated dataset. Additionally, we develop a new data-splitting methodology that addresses the data leakage issue, a frequently overlooked problem in previous studies, ensuring the robustness and stability of our model under various experimental settings. Through rigorous testing, siRNADiscovery has demonstrated remarkable predictive accuracy and robustness, making significant contributions to the field of gene silencing. Furthermore, our approach to redefining data-splitting standards aims to set new benchmarks for future research in the domain of predictive biological modeling for siRNA.
Collapse
Affiliation(s)
- Rongzhuo Long
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, 211198, Nanjing, China
| | - Ziyu Guo
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong SAR, China
| | - Da Han
- Institute of Molecular Medicine (IMM) and Department of Laboratory Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200240, Shanghai, China
- Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, 310022, Hangzhou, Zhejiang, China
| | - Boxiang Liu
- Department of Pharmacy, Faculty of Science, National University of Singapore, Singapore, 117543, Singapore
| | - Xudong Yuan
- ACON Pharmaceuticals, 2557 Route 130 S, Ste 3, Cranbury, NJ 08512, USA
| | | | - Pheng-Ann Heng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong SAR, China
| | - Liang Zhang
- Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, 310022, Hangzhou, Zhejiang, China
| |
Collapse
|
2
|
Fopase R, Panda C, Rajendran AP, Uludag H, Pandey LM. Potential of siRNA in COVID-19 therapy: Emphasis on in silico design and nanoparticles based delivery. Front Bioeng Biotechnol 2023; 11:1112755. [PMID: 36814718 PMCID: PMC9939533 DOI: 10.3389/fbioe.2023.1112755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023] Open
Abstract
Small interfering RNA (siRNA)-mediated mRNA degradation approach have imparted its eminence against several difficult-to-treat genetic disorders and other allied diseases. Viral outbreaks and resulting pandemics have repeatedly threatened public health and questioned human preparedness at the forefront of drug design and biomedical readiness. During the recent pandemic caused by the SARS-CoV-2, mRNA-based vaccination strategies have paved the way for a new era of RNA therapeutics. RNA Interference (RNAi) based approach using small interfering RNA may complement clinical management of the COVID-19. RNA Interference approach will primarily work by restricting the synthesis of the proteins required for viral replication, thereby hampering viral cellular entry and trafficking by targeting host as well as protein factors. Despite promising benefits, the stability of small interfering RNA in the physiological environment is of grave concern as well as site-directed targeted delivery and evasion of the immune system require immediate attention. In this regard, nanotechnology offers viable solutions for these challenges. The review highlights the potential of small interfering RNAs targeted toward specific regions of the viral genome and the features of nanoformulations necessary for the entrapment and delivery of small interfering RNAs. In silico design of small interfering RNA for different variants of SARS-CoV-2 has been discussed. Various nanoparticles as promising carriers of small interfering RNAs along with their salient properties, including surface functionalization, are summarized. This review will help tackle the real-world challenges encountered by the in vivo delivery of small interfering RNAs, ensuring a safe, stable, and readily available drug candidate for efficient management of SARS-CoV-2 in the future.
Collapse
Affiliation(s)
- Rushikesh Fopase
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Chinmaya Panda
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Amarnath P. Rajendran
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
| | - Hasan Uludag
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, Canada
- Department of Biomedical Engineering, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada
| | - Lalit M. Pandey
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| |
Collapse
|
3
|
Roy T, Sharma K, Dhall A, Patiyal S, Raghava GPS. In silico method for predicting infectious strains of influenza A virus from its genome and protein sequences. J Gen Virol 2022; 103. [PMID: 36318663 DOI: 10.1099/jgv.0.001802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023] Open
Abstract
Influenza A is a contagious viral disease responsible for four pandemics in the past and a major public health concern. Being zoonotic in nature, the virus can cross the species barrier and transmit from wild aquatic bird reservoirs to humans via intermediate hosts. In this study, we have developed a computational method for the prediction of human-associated and non-human-associated influenza A virus sequences. The models were trained and validated on proteins and genome sequences of influenza A virus. Firstly, we have developed prediction models for 15 types of influenza A proteins using composition-based and one-hot-encoding features. We have achieved a highest AUC of 0.98 for HA protein on a validation dataset using dipeptide composition-based features. Of note, we obtained a maximum AUC of 0.99 using one-hot-encoding features for protein-based models on a validation dataset. Secondly, we built models using whole genome sequences which achieved an AUC of 0.98 on a validation dataset. In addition, we showed that our method outperforms a similarity-based approach (i.e., blast) on the same validation dataset. Finally, we integrated our best models into a user-friendly web server 'FluSPred' (https://webs.iiitd.edu.in/raghava/fluspred/index.html) and a standalone version (https://github.com/raghavagps/FluSPred) for the prediction of human-associated/non-human-associated influenza A virus strains.
Collapse
Affiliation(s)
- Trinita Roy
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Khushal Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Sumeet Patiyal
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Gajendra Pal Singh Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| |
Collapse
|
4
|
Jia X, Han Q, Lu Z. Constructing the boundary between potent and ineffective siRNAs by MG-algorithm with C-features. BMC Bioinformatics 2022; 23:337. [PMID: 35963993 PMCID: PMC9375269 DOI: 10.1186/s12859-022-04867-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Accepted: 07/29/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In siRNA based antiviral therapeutics, selection of potent siRNAs is an indispensable step, but these commonly used features are unable to construct the boundary between potent and ineffective siRNAs. RESULTS Here, we select potent siRNAs by removing ineffective ones, where these conditions for removals are constructed by C-features of siRNAs, C-features are generated by MG-algorithm, Icc-cluster and the different combinations of some commonly used features, MG-algorithm and Icc-cluster are two different algorithms to search the nearest siRNA neighbors. For the ineffective siRNAs in test data, they are removed from test data by I-iteration, where I-iteration continually updates training data by adding these successively removed siRNAs. Furthermore, the efficacy of siRNAs of test data is predicted by their nearest neighbors of training data. CONCLUSIONS By siRNAs of Hencken dataset, results show that our algorithm removes almost ineffective siRNAs from test data, gives the clear boundary between potent and ineffective siRNAs, and accurately predicts the efficacy of siRNAs also. We suggest that our algorithm can provide new insights for selecting the potent siRNAs.
Collapse
Affiliation(s)
- Xingang Jia
- School of Mathematics, Southeast University, Nanjing, 210096, People's Republic of China.
| | - Qiuhong Han
- Department of Mathematics, Nanjing Forestry University, Nanjing, 210037, People's Republic of China
| | - Zuhong Lu
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, People's Republic of China
| |
Collapse
|
5
|
Chen MX, Zhu XD, Zhang H, Liu Z, Liu YN. SMRI: A New Method for siRNA Design for COVID-19 Therapy. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 2022; 37:991-1002. [PMID: 35992496 PMCID: PMC9374573 DOI: 10.1007/s11390-021-0826-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 08/31/2021] [Indexed: 06/15/2023]
Abstract
UNLABELLED First discovered in Wuhan, China, SARS-CoV-2 is a highly pathogenic novel coronavirus, which rapidly spread globally and became a pandemic with no vaccine and limited distinctive clinical drugs available till March 13th, 2020. Ribonucleic Acid interference (RNAi) technology, a gene-silencing technology that targets mRNA, can cause damage to RNA viruses effectively. Here, we report a new efficient small interfering RNA (siRNA) design method named Simple Multiple Rules Intelligent Method (SMRI) to propose a new solution of the treatment of COVID-19. To be specific, this study proposes a new model named Base Preference and Thermodynamic Characteristic model (BPTC model) indicating the siRNA silencing efficiency and a new index named siRNA Extended Rules index (SER index) based on the BPTC model to screen high-efficiency siRNAs and filter out the siRNAs that are difficult to take effect or synthesize as a part of the SMRI method, which is more robust and efficient than the traditional statistical indicators under the same circumstances. Besides, to silence the spike protein of SARS-CoV-2 to invade cells, this study further puts forward the SMRI method to search candidate high-efficiency siRNAs on SARS-CoV-2's S gene. This study is one of the early studies applying RNAi therapy to the COVID-19 treatment. According to the analysis, the average value of predicted interference efficiency of the candidate siRNAs designed by the SMRI method is comparable to that of the mainstream siRNA design algorithms. Moreover, the SMRI method ensures that the designed siRNAs have more than three base mismatches with human genes, thus avoiding silencing normal human genes. This is not considered by other mainstream methods, thereby the five candidate high-efficiency siRNAs which are easy to take effect or synthesize and much safer for human body are obtained by our SMRI method, which provide a new safer, small dosage and long efficacy solution for the treatment of COVID-19. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s11390-021-0826-x.
Collapse
Affiliation(s)
- Meng-Xin Chen
- College of Software, Jilin University, Changchun, 130012 China
| | - Xiao-Dong Zhu
- Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Jilin University, Changchun, 130012 China
| | - Hao Zhang
- College of Computer Science and Technology, Jilin University, Changchun, 130012 China
| | - Zhen Liu
- College of Computer Science and Technology, Jilin University, Changchun, 130012 China
- Graduate School of Engineering, Nagasaki Institute of Applied Science, Nagasaki, 851-0193 Japan
| | - Yuan-Ning Liu
- College of Computer Science and Technology, Jilin University, Changchun, 130012 China
| |
Collapse
|
6
|
Randhawa V, Pathania S, Kumar M. Computational Identification of Potential Multitarget Inhibitors of Nipah Virus by Molecular Docking and Molecular Dynamics. Microorganisms 2022; 10:microorganisms10061181. [PMID: 35744699 PMCID: PMC9227315 DOI: 10.3390/microorganisms10061181] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 04/28/2022] [Accepted: 05/11/2022] [Indexed: 02/04/2023] Open
Abstract
Nipah virus (NiV) is a recently emerged paramyxovirus that causes severe encephalitis and respiratory diseases in humans. Despite the severe pathogenicity of this virus and its pandemic potential, not even a single type of molecular therapeutics has been approved for human use. Considering the role of NiV attachment glycoprotein G (NiV-G), fusion glycoprotein (NiV-F), and nucleoprotein (NiV-N) in virus replication and spread, these are the most attractive targets for anti-NiV drug discovery. Therefore, to prospect for potential multitarget chemical/phytochemical inhibitor(s) against NiV, a sequential molecular docking and molecular-dynamics-based approach was implemented by simultaneously targeting NiV-G, NiV-F, and NiV-N. Information on potential NiV inhibitors was compiled from the literature, and their 3D structures were drawn manually, while the information and 3D structures of phytochemicals were retrieved from the established structural databases. Molecules were docked against NiV-G (PDB ID:2VSM), NiV-F (PDB ID:5EVM), and NiV-N (PDB ID:4CO6) and then prioritized based on (1) strong protein-binding affinity, (2) interactions with critically important binding-site residues, (3) ADME and pharmacokinetic properties, and (4) structural stability within the binding site. The molecules that bind to all the three viral proteins (NiV-G ∩ NiV-F ∩ NiV-N) were considered multitarget inhibitors. This study identified phytochemical molecules RASE0125 (17-O-Acetyl-nortetraphyllicine) and CARS0358 (NA) as distinct multitarget inhibitors of all three viral proteins, and chemical molecule ND_nw_193 (RSV604) as an inhibitor of NiV-G and NiV-N. We expect the identified compounds to be potential candidates for in vitro and in vivo antiviral studies, followed by clinical treatment of NiV.
Collapse
Affiliation(s)
- Vinay Randhawa
- Virology Discovery Unit and Bioinformatics Centre, CSIR-Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh 160036, India; (V.R.); (S.P.)
| | - Shivalika Pathania
- Virology Discovery Unit and Bioinformatics Centre, CSIR-Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh 160036, India; (V.R.); (S.P.)
| | - Manoj Kumar
- Virology Discovery Unit and Bioinformatics Centre, CSIR-Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh 160036, India; (V.R.); (S.P.)
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Correspondence: ; Tel.: +91-172-6665453
| |
Collapse
|
7
|
Zhang D, Tian J, Wang Y, Lu J. Evitar: designing anti-viral RNA therapies against future RNA viruses. Bioinformatics 2022; 38:2437-2443. [PMID: 35294970 PMCID: PMC9048652 DOI: 10.1093/bioinformatics/btac144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 02/07/2022] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The coronavirus disease 2019 (COVID-19) pandemic has highlighted the threat of emerging respiratory viruses and has exposed the lack of availability of off-the-shelf therapeutics against new RNA viruses. Previous research has established the potential that siRNAs and RNA-targeting CRISPR have in combating known RNA viruses. However, the feasibility and tools for designing anti-viral RNA therapeutics against future RNA viruses have not yet been established. RESULTS We develop the Emerging-Virus-Targeting RNA (Evitar) pipeline for designing anti-viral siRNAs and CRISPR Cas13a guide RNA (gRNA) sequences. Within Evitar, we develop Greedy Algorithm with Redundancy and Similarity-weighted Greedy Algorithm with Redundancy to enhance the performance. Time simulations using known coronavirus genomes deposited as early as 10 years prior to the COVID-19 outbreak show that at least three SARS-CoV-2-targeting siRNAs are among the top 30 pre-designed siRNAs. In addition, among the top 19 pre-designed gRNAs, there are three SARS-CoV-2-targeting Cas13a gRNAs that could be predicted using information from 2011. Before-the-outbreak design is also possible against the MERS-CoV virus and the 2009-H1N1 swine flu virus. Designed siRNAs are further shown to suppress SARS-CoV-2 viral sequences using in vitro reporter assays. Our results support the utility of Evitar to pre-design anti-viral siRNAs/gRNAs against future viruses. Therefore, we propose the development of a collection consisting of roughly 30 pre-designed, safety-tested and off-the-shelf siRNA/CRISPR therapeutics that could accelerate responses to future RNA virus outbreaks. AVAILABILITY AND IMPLEMENTATION Codes are available at GitHub (https://github.com/dingyaozhang/Evitar). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dingyao Zhang
- Yale Stem Cell Center, Yale University, New Haven, CT 06520, USA.,Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Jingru Tian
- Yale Stem Cell Center, Yale University, New Haven, CT 06520, USA.,Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Yadong Wang
- Yale Stem Cell Center, Yale University, New Haven, CT 06520, USA.,Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA.,Yale Center for RNA Science and Medicine, Yale Cancer Center, Yale University, New Haven, CT 06520, USA
| | - Jun Lu
- Yale Stem Cell Center, Yale University, New Haven, CT 06520, USA.,Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA.,Yale Center for RNA Science and Medicine, Yale Cancer Center, Yale University, New Haven, CT 06520, USA.,Yale Cooperative Center of Excellence in Hematology, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
8
|
Khan T, Khan A, Wei DQ. MMV-db: vaccinomics and RNA-based therapeutics database for infectious hemorrhagic fever-causing mammarenaviruses. Database (Oxford) 2021; 2021:baab063. [PMID: 34679165 PMCID: PMC8533362 DOI: 10.1093/database/baab063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 08/24/2021] [Accepted: 10/18/2021] [Indexed: 12/24/2022]
Abstract
The recent viral outbreaks and the current pandemic situation urges us to timely address any emerging viral infections by designing therapeutic strategies. Multi-omics and therapeutic data are of great interest to develop early remedial interventions. This work provides a therapeutic data platform (Mammarenavirus (MMV)-db) for pathogenic mammarenaviruses with potential catastrophic effects on human health around the world. The database integrates vaccinomics and RNA-based therapeutics data for seven human pathogenic MMVs associated with severe viral hemorrhagic fever and lethality in humans. Protein-specific cytotoxic T lymphocytes, B lymphocytes, helper T-cell and interferon-inducing epitopes were mapped using a cluster of immune-omics-based algorithms and tools for the seven human pathogenic viral species. Furthermore, the physiochemical and antigenic properties were also explored to guide protein-specific multi-epitope subunit vaccine for each species. Moreover, highly efficacious RNAs (small Interfering RNA (siRNA), microRNA and single guide RNA (sgRNA)) after extensive genome-based analysis with therapeutic relevance were explored. All the therapeutic RNAs were further classified and listed on the basis of predicted higher efficacy. The online platform (http://www.mmvdb.dqweilab-sjtu.com/index.php) contains easily accessible data sets and vaccine designs with potential utility in further computational and experimental work. Conclusively, the current study provides a baseline data platform to secure better future therapeutic interventions against the hemorrhagic fever causing mammarenaviruses. Database URL: http://www.mmvdb.dqweilab-sjtu.com/index.php.
Collapse
Affiliation(s)
- Taimoor Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, P.R. China
| | - Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, P.R. China
| | - Dong-Qing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, P.R. China
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
- Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nashan District, Shenzhen, Guangdong 518055, P.R China
| |
Collapse
|
9
|
Khan A, Khan S, Ahmad S, Anwar Z, Hussain Z, Safdar M, Rizwan M, Waseem M, Hussain A, Akhlaq M, Khan T, Ali SS, Wei DQ. HantavirusesDB: Vaccinomics and RNA-based therapeutics database for the potentially emerging human respiratory pandemic agents. Microb Pathog 2021; 160:105161. [PMID: 34461244 DOI: 10.1016/j.micpath.2021.105161] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 08/25/2021] [Accepted: 08/26/2021] [Indexed: 12/29/2022]
Abstract
Hantaviruses are etiological agents of several severe respiratory illnesses in humans and their human-to-human transmission has been reported. To cope with any potential pandemic, this group of viruses needs further research and a data platform. Therefore, herein we developed a database "HantavirusesDB (HVdb)", where genomics, proteomics, immune resource, RNAi based therapeutics and information on the 3D structures of druggable targets of the Orthohantaviruses are provided on a single platform. The database allows the researchers to effectively map the therapeutic strategies by designing multi-epitopes subunit vaccine and RNA based therapeutics. Moreover, the ease of the web interface allow the users to retrieve specific information from the database. Because of the high quality and excellent functionality of the HVdb, therapeutic research of Hantaviruses can be accelerated, and data analysis might be a foundation to design better treatment strategies targeting the hantaviruses. The database is accessible at http://hvdb.dqweilab-sjtu.com/index.php.
Collapse
Affiliation(s)
- Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China.
| | - Shahzeb Khan
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Sajjad Ahmad
- Department of Health and Biological Sciences, Abasyn University, Peshawar, 25000, Pakistan
| | - Zeeshan Anwar
- Department of Pharmacy, Abdul Wali Khan University, Mardan, Khyber Pakhtunkhwa, Pakistan
| | - Zahid Hussain
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Muhammad Safdar
- Faculty of Pharmacy, Gomal University, DI Khan, Khyber Pakhtunkhwa, Pakistan
| | - Muhammad Rizwan
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Muhammad Waseem
- Faculty of Rehabilitation and Allied Health Science, Riphah International University, Islamabad, Pakistan
| | - Abid Hussain
- Department of Pharmacy, University of Poonch, Rawalakot, Azad Jammu and Kashmir, Pakistan
| | - Muhammad Akhlaq
- Faculty of Pharmacy, Gomal University, DI Khan, Khyber Pakhtunkhwa, Pakistan
| | - Taimoor Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Syed Shujait Ali
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Dong-Qing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China; State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, PR China; Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nashan District, Shenzhen, Guangdong, 518055, PR China.
| |
Collapse
|
10
|
Dhall A, Patiyal S, Sharma N, Devi NL, Raghava GPS. Computer-aided prediction of inhibitors against STAT3 for managing COVID-19 associated cytokine storm. Comput Biol Med 2021; 137:104780. [PMID: 34450382 PMCID: PMC8378993 DOI: 10.1016/j.compbiomed.2021.104780] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Revised: 08/11/2021] [Accepted: 08/18/2021] [Indexed: 12/27/2022]
Abstract
Background Proinflammatory cytokines are correlated with the severity of disease in patients with COVID-19. IL6-mediated activation of STAT3 proliferates proinflammatory responses that lead to cytokine storm promotion. Thus, STAT3 inhibitors may play a crucial role in managing the COVID-19 pathogenesis. The present study discusses a method for predicting inhibitors against the STAT3 signaling pathway. Method The main dataset comprises 1565 STAT3 inhibitors and 1671 non-inhibitors used for training, testing, and evaluation of models. A number of machine learning classifiers have been implemented to develop the models. Results The outcomes of the data analysis show that rings and aromatic groups are significantly abundant in STAT3 inhibitors compared to non-inhibitors. First, we developed models using 2-D and 3-D chemical descriptors and achieved a maximum AUC of 0.84 and 0.73, respectively. Second, fingerprints are used to build predictive models and achieved 0.86 AUC with an accuracy of 78.70% on the validation dataset. Finally, models were developed using hybrid descriptors, which achieved a maximum of 0.87 AUC with 78.55% accuracy on the validation dataset. Conclusion We used the best model to identify STAT3 inhibitors in FDA-approved drugs and found few drugs (e.g., Tamoxifen and Perindopril) to manage the cytokine storm in COVID-19 patients. A webserver “STAT3In” (https://webs.iiitd.edu.in/raghava/stat3in/) has been developed to predict and design STAT3 inhibitors.
Collapse
Affiliation(s)
- Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Sumeet Patiyal
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Neelam Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Naorem Leimarembi Devi
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| |
Collapse
|
11
|
Rajput A, Kumar M. Anti-Ebola: an initiative to predict Ebola virus inhibitors through machine learning. Mol Divers 2021; 26:1635-1644. [PMID: 34357513 PMCID: PMC8343361 DOI: 10.1007/s11030-021-10291-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 07/28/2021] [Indexed: 01/17/2023]
Abstract
Ebola virus is a deadly pathogen responsible for a frequent series of outbreaks since 1976. Despite various efforts from researchers worldwide, its mortality and fatality are quite high. For antiviral drug discovery, the computational efforts are considered highly useful. Therefore, we have developed an 'anti-Ebola' web server, through quantitative structure-activity relationship information of available molecules with experimental anti-Ebola activities. Three hundred and five unique anti-Ebola compounds with their respective IC50 values were extracted from the 'DrugRepV' database. Later, the compounds were used to extract the molecular descriptors, which were subjected to regression-based model development. The robust machine learning techniques, namely support vector machine, random forest and artificial neural network, were employed using tenfold cross-validation. After a randomization approach, the best predictive model showed Pearson's correlation coefficient ranges from 0.83 to 0.98 on training/testing (T274) dataset. The robustness of the developed models was cross-evaluated using William's plot. The highly robust computational models are integrated into the web server. The 'anti-Ebola' web server is freely available at https://bioinfo.imtech.res.in/manojk/antiebola . We anticipate this will serve the scientific community for developing effective inhibitors against the Ebola virus.
Collapse
Affiliation(s)
- Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, 160036, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, 160036, India. .,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
12
|
Currá A, Cacciabue M, Gravisaco MJ, Asurmendi S, Taboga O, Gismondi MI. Antiviral efficacy of short-hairpin RNAs and artificial microRNAs targeting foot-and-mouth disease virus. PeerJ 2021; 9:e11227. [PMID: 34178434 PMCID: PMC8197037 DOI: 10.7717/peerj.11227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 03/16/2021] [Indexed: 11/20/2022] Open
Abstract
RNA interference (RNAi) is a well-conserved mechanism in eukaryotic cells that directs post-transcriptional gene silencing through small RNA molecules. RNAi has been proposed as an alternative approach for rapid and specific control of viruses including foot-and-mouth disease virus (FMDV), the causative agent of a devastating animal disease with high economic impact. The aim of this work was to assess the antiviral activity of different small RNA shuttles targeting the FMDV RNA-dependent RNA polymerase coding sequence (3D). Three target sequences were predicted within 3D considering RNA accessibility as a major criterion. The silencing efficacy of short-hairpin RNAs (shRNAs) and artificial microRNAs (amiRNAs) targeting the selected sequences was confirmed in fluorescent reporter assays. Furthermore, BHK-21 cells transiently expressing shRNAs or amiRNAs proved 70 to >95% inhibition of FMDV growth. Interestingly, dual expression of amiRNAs did not improve FMDV silencing. Lastly, stable cell lines constitutively expressing amiRNAs were established and characterized in terms of antiviral activity against FMDV. As expected, viral replication in these cell lines was delayed. These results show that the target RNA-accessibility-guided approach for RNAi design rendered efficient amiRNAs that constrain FMDV replication. The application of amiRNAs to complement FMDV vaccination in specific epidemiological scenarios shall be explored further.
Collapse
Affiliation(s)
- Anabella Currá
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| | - Marco Cacciabue
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| | - María José Gravisaco
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| | - Sebastián Asurmendi
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| | - Oscar Taboga
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| | - María I. Gismondi
- Instituto de Agrobiotecnología y Biología Molecular (IABiMo), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Hurlingham, Buenos Aires, Argentina
| |
Collapse
|
13
|
Khan T, Khan A, Nasir SN, Ahmad S, Ali SS, Wei DQ. CytomegaloVirusDb: Multi-omics knowledge database for cytomegaloviruses. Comput Biol Med 2021; 135:104563. [PMID: 34256256 DOI: 10.1016/j.compbiomed.2021.104563] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/06/2021] [Accepted: 06/06/2021] [Indexed: 11/16/2022]
Abstract
Cytomegalovirus infection is a significant health concern and need further exploration in immunologic response mechanisms during primary and reactivated CMV infection. In this work, we evaluated the whole genomes and proteomes of different CMV species and developed an integrated open-access platform, CytomegaloVirusDb, a multi-Omics knowledge database for Cytomegaloviruses. The resource is categorized into the main sections "Genomics," "Proteomics," "Immune response," and "Therapeutics,". The database is annotated with the list of all CMV species included in the study, and available information is freely accessible at http://www.cmvdb.dqweilab-sjtu.com/index.php. Various parameters used in the analysis for each section were primarily based on the whole genome or proteome of each specie. The platform provided datasets are open to access for researchers to obtain CMV species-specific information. This will help further to explore the dynamics of CMV-specific immune response and therapeutics. This platform is a useful resource to aid in advancing research against Cytomegaloviruses.
Collapse
Affiliation(s)
- Taimoor Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Syed Nouman Nasir
- National Center for Bioinformatics, Quaid-i-Azam University, Islamabad, Pakistan
| | - Sajjad Ahmad
- Department of Health and Biological Sciences, Abasyn University, Peshawar, 25000, Pakistan
| | - Syed Shujait Ali
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Dong-Qing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China; State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, PR China; Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nashan District, Shenzhen, Guangdong, 518055, PR China.
| |
Collapse
|
14
|
Tonk M, Růžek D, Vilcinskas A. Compelling Evidence for the Activity of Antiviral Peptides against SARS-CoV-2. Viruses 2021; 13:v13050912. [PMID: 34069206 PMCID: PMC8156787 DOI: 10.3390/v13050912] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 05/09/2021] [Accepted: 05/12/2021] [Indexed: 12/20/2022] Open
Abstract
Multiple outbreaks of epidemic and pandemic viral diseases have occurred in the last 20 years, including those caused by Ebola virus, Zika virus, and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The emergence or re-emergence of such diseases has revealed the deficiency in our pipeline for the discovery and development of antiviral drugs. One promising solution is the extensive library of antimicrobial peptides (AMPs) produced by all eukaryotic organisms. AMPs are widely known for their activity against bacteria, but many possess additional antifungal, antiparasitic, insecticidal, anticancer, or antiviral activities. AMPs could therefore be suitable as leads for the development of new peptide-based antiviral drugs. Sixty therapeutic peptides had been approved by the end of 2018, with at least another 150 in preclinical or clinical development. Peptides undergoing clinical trials include analogs, mimetics, and natural AMPs. The advantages of AMPs include novel mechanisms of action that hinder the evolution of resistance, low molecular weight, low toxicity toward human cells but high specificity and efficacy, the latter enhanced by the optimization of AMP sequences. In this opinion article, we summarize the evidence supporting the efficacy of antiviral AMPs and discuss their potential to treat emerging viral diseases including COVID-19.
Collapse
Affiliation(s)
- Miray Tonk
- Institute for Insect Biotechnology, Justus Liebig University of Giessen, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany;
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325 Frankfurt, Germany
| | - Daniel Růžek
- Department of Virology, Veterinary Research Institute, Hudcova 70, CZ-62100 Brno, Czech Republic;
- Biology Centre of the Czech Academy of Sciences, Institute of Parasitology, Branisovska 31, 37005 Ceske Budejovice, Czech Republic
| | - Andreas Vilcinskas
- Institute for Insect Biotechnology, Justus Liebig University of Giessen, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany;
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325 Frankfurt, Germany
- Department of Bioresources, Fraunhofer Institute for Molecular Biology and Applied Ecology, Ohlebergsweg 12, 35392 Giessen, Germany
- Correspondence:
| |
Collapse
|
15
|
Dhall A, Patiyal S, Sharma N, Usmani SS, Raghava GPS. Computer-aided prediction and design of IL-6 inducing peptides: IL-6 plays a crucial role in COVID-19. Brief Bioinform 2021; 22:936-945. [PMID: 33034338 PMCID: PMC7665369 DOI: 10.1093/bib/bbaa259] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Revised: 08/28/2020] [Accepted: 09/13/2020] [Indexed: 12/16/2022] Open
Abstract
Interleukin 6 (IL-6) is a pro-inflammatory cytokine that stimulates acute phase responses, hematopoiesis and specific immune reactions. Recently, it was found that the IL-6 plays a vital role in the progression of COVID-19, which is responsible for the high mortality rate. In order to facilitate the scientific community to fight against COVID-19, we have developed a method for predicting IL-6 inducing peptides/epitopes. The models were trained and tested on experimentally validated 365 IL-6 inducing and 2991 non-inducing peptides extracted from the immune epitope database. Initially, 9149 features of each peptide were computed using Pfeature, which were reduced to 186 features using the SVC-L1 technique. These features were ranked based on their classification ability, and the top 10 features were used for developing prediction models. A wide range of machine learning techniques has been deployed to develop models. Random Forest-based model achieves a maximum AUROC of 0.84 and 0.83 on training and independent validation dataset, respectively. We have also identified IL-6 inducing peptides in different proteins of SARS-CoV-2, using our best models to design vaccine against COVID-19. A web server named as IL-6Pred and a standalone package has been developed for predicting, designing and screening of IL-6 inducing peptides (https://webs.iiitd.edu.in/raghava/il6pred/).
Collapse
Affiliation(s)
- Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Sumeet Patiyal
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Neelam Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Salman Sadullah Usmani
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| |
Collapse
|
16
|
Gupta AK, Kumar M. HPVomics: An integrated resource for the human papillomavirus epitome and therapeutics. Genomics 2020; 112:4853-4862. [PMID: 32871223 DOI: 10.1016/j.ygeno.2020.08.025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 08/07/2020] [Accepted: 08/19/2020] [Indexed: 12/01/2022]
Abstract
Human papillomaviruses (HPVs) belongs to the Papillomaviridae family, which is divided into high-risk (HR), and low-risk (LR) HPVs based on their disease-causing competence. HR-HPVs 16 and 18 are known to cause distinct carcinomas like cervical and head and neck, whereas LR-HPVs are commonly associated with the genital warts. We have developed an integrative platform; HPVomics dedicated to the potential therapeutic regimens targeting all HPV genes including oncoproteins E6, E7 and E5. We primarily focused on eighteen HR-HPVs and eleven LR-HPVs. It mainly deals with therapeutically imperative elements, i.e., vaccine epitopes, siRNAs, sgRNAs, and anti-viral peptides. Simultaneously, it also comprises of genome browser, whole-genome sequences and annotation of HPVs with searching and filtering capabilities. Moreover, we have also developed an integrated support vector machine (SVM) based computational algorithm "HPVepi" for the prediction of HPV epitome. We hope that HPVomics (http://bioinfo.imtech.res.in/manojk/hpvomics/) will assist the scientific community engaged in HPV research.
Collapse
Affiliation(s)
- Amit Kumar Gupta
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India.
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
| |
Collapse
|
17
|
Perez-Mendez M, Zárate-Segura P, Salas-Benito J, Bastida-González F. siRNA Design to Silence the 3'UTR Region of Zika Virus. BIOMED RESEARCH INTERNATIONAL 2020; 2020:6759346. [PMID: 32802864 PMCID: PMC7421096 DOI: 10.1155/2020/6759346] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 05/16/2020] [Accepted: 07/02/2020] [Indexed: 12/25/2022]
Abstract
The disease caused by the Zika virus (ZIKV) has positioned itself as one of the main public health problems in Mexico. One of the main reasons is it causes microcephaly and other birth defects. The transmission of ZIKV is through Aedes aegypti and Ae. albopictus mosquitoes, which are found in a larger space of the national territory. In addition, it can also be transmitted via blood transfusion, sexual relations, and maternal-fetal route. So far, there are no vaccines or specific treatments to deal with this infection. Currently, some new therapeutics such as small interfering RNAs (siRNAs) are able to regulate or suppress transcription in viruses. Therefore, in this project, an in silico siRNA was designed for the 3'UTR region of ZIKV via bioinformatics tools. The designed siRNA was synthesized and transfected into the C6/36 cell line, previously infected with ZIKV in order to assess the ability of the siRNA to inhibit viral replication. The designed siRNA was able to inhibit significantly (p < 0.05) ZIKV replication; this siRNA could be considered a potential therapeutic towards the disease that causes ZIKV and the medical problems generated.
Collapse
Affiliation(s)
- María Perez-Mendez
- Laboratorio de Medicina Traslacional, Escuela Superior de Medicina, Instituto Politécnico Nacional, St. Salvador Díaz Mirón Esquina Plan de San Luis, Santo Tomas, Miguel Hidalgo, CDMX 11340, Mexico
| | - Paola Zárate-Segura
- Laboratorio de Medicina Traslacional, Escuela Superior de Medicina, Instituto Politécnico Nacional, St. Salvador Díaz Mirón Esquina Plan de San Luis, Santo Tomas, Miguel Hidalgo, CDMX 11340, Mexico
| | - Juan Salas-Benito
- Laboratorio de Biomedicina Molecular 3, Escuela Nacional de Medicina y Homeopatía, Instituto Politécnico Nacional, Guillermo Massieu Helguera 239, La Escalera-Ticomán, Gustavo A. Madero, CDMX 07329, Mexico
| | - Fernando Bastida-González
- Laboratorio de Biología Molecular, Laboratorio Estatal de Salud Pública del Estado de México, Paseo Tollocan s/n, La Moderna de la Cruz, EDOMEX, Toluca, 50180, Mexico
| |
Collapse
|
18
|
Gupta AK, Khan MS, Choudhury S, Mukhopadhyay A, Sakshi, Rastogi A, Thakur A, Kumari P, Kaur M, Shalu, Saini C, Sapehia V, Barkha, Patel PK, Bhamare KT, Kumar M. CoronaVR: A Computational Resource and Analysis of Epitopes and Therapeutics for Severe Acute Respiratory Syndrome Coronavirus-2. Front Microbiol 2020; 11:1858. [PMID: 32849449 PMCID: PMC7412965 DOI: 10.3389/fmicb.2020.01858] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Accepted: 07/15/2020] [Indexed: 12/21/2022] Open
Abstract
In December 2019, the Chinese city of Wuhan was the center of origin of a pneumonia-like disease outbreak with an unknown causative pathogen. The CDC, China, managed to track the source of infection to a novel coronavirus (2019-nCoV; SARS-CoV-2) that shares approximately 79.6% of its genome with SARS-CoV. The World Health Organization (WHO) initially declared COVID-19 as a Public Health Emergency of International Concern (PHEIC) and later characterized it as a global pandemic on March 11, 2020. Due to the novel nature of this virus, there is an urgent need for vaccines and therapeutics to control the spread of SARS-CoV-2 and its associated disease, COVID-19. Global efforts are underway to circumvent its further spread and treat COVID-19 patients through experimental vaccine formulations and therapeutic interventions, respectively. In the absence of any effective therapeutics, we have devised h bioinformatics-based approaches to accelerate global efforts in the fight against SARS-CoV-2 and to assist researchers in the initial phase of vaccine and therapeutics development. In this study, we have performed comprehensive meta-analyses and developed an integrative resource, “CoronaVR” (http://bioinfo.imtech.res.in/manojk/coronavr/). Predominantly, we identified potential epitope-based vaccine candidates, siRNA-based therapeutic regimens, and diagnostic primers. The resource is categorized into the main sections “Genomes,” “Epitopes,” “Therapeutics,” and Primers.” The genome section harbors different components, viz, genomes, a genome browser, phylogenetic analysis, codon usage, glycosylation sites, and structural analysis. Under the umbrella of epitopes, sub-divisions, namely cross-protective epitopes, B-cell (linear/discontinuous), T-cell (CD4+/CD8+), CTL, and MHC binders, are presented. The therapeutics section has different sub-sections like siRNA, miRNAs, and sgRNAs. Further, experimentally confirmed and designed diagnostic primers are earmarked in the primers section. Our study provided a set of shortlisted B-cell and T-cell (CD4+ and CD8+) epitopes that can be experimentally tested for their incorporation in vaccine formulations. The list of selected primers can be used in testing kits to identify SARS-CoV-2, while the recommended siRNAs, sgRNAs, and miRNAs can be used in therapeutic regimens. We foresee that this resource will help in advancing the research against coronaviruses.
Collapse
Affiliation(s)
- Amit Kumar Gupta
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Md Shoaib Khan
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Shubham Choudhury
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Adhip Mukhopadhyay
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Sakshi
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Amber Rastogi
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Pallawi Kumari
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Manmeet Kaur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Shalu
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Chanchal Saini
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Vandna Sapehia
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Barkha
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Pradeep Kumar Patel
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Kailash T Bhamare
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
19
|
Bhalla S, Kaur H, Kaur R, Sharma S, Raghava GPS. Expression based biomarkers and models to classify early and late-stage samples of Papillary Thyroid Carcinoma. PLoS One 2020; 15:e0231629. [PMID: 32324757 PMCID: PMC7179925 DOI: 10.1371/journal.pone.0231629] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Accepted: 03/27/2020] [Indexed: 12/19/2022] Open
Abstract
INTRODUCTION Recently, the rise in the incidences of thyroid cancer worldwide renders it to be the sixth most common cancer among women. Commonly, Fine Needle Aspiration biopsy predominantly facilitates the diagnosis of the nature of thyroid nodules. However, it is inconsiderable in determining the tumor's state, i.e., benign or malignant. This study aims to identify the key RNA transcripts that can segregate the early and late-stage samples of Thyroid Carcinoma (THCA) using RNA expression profiles. MATERIALS AND METHODS In this study, we used the THCA RNA-Seq dataset of The Cancer Genome Atlas, consisting of 500 cancer and 58 normal (adjacent non-tumorous) samples obtained from the Genomics Data Commons (GDC) data portal. This dataset was dissected to identify key RNA expression features using various feature selection techniques. Subsequently, samples were classified based on selected features employing different machine learning algorithms. RESULTS Single gene ranking based on the Area Under the Receiver Operating Characteristics (AUROC) curve identified the DCN transcript that can classify the early-stage samples from late-stage samples with 0.66 AUROC. To further improve the performance, we identified a panel of 36 RNA transcripts that achieved F1 score of 0.75 with 0.73 AUROC (95% CI: 0.62-0.84) on the validation dataset. Moreover, prediction models based on 18-features from this panel correctly predicted 75% of the samples of the external validation dataset. In addition, the multiclass model classified normal, early, and late-stage samples with AUROC of 0.95 (95% CI: 0.84-1), 0.76 (95% CI: 0.66-0.85) and 0.72 (95% CI: 0.61-0.83) on the validation dataset. Besides, a five protein-coding transcripts panel was also recognized, which segregated cancer and normal samples in the validation dataset with F1 score of 0.97 and 0.99 AUROC (95% CI: 0.91-1). CONCLUSION We identified 36 important RNA transcripts whose expression segregated early and late-stage samples with reasonable accuracy. The models and dataset used in this study are available from the webserver CancerTSP (http://webs.iiitd.edu.in/raghava/cancertsp/).
Collapse
Affiliation(s)
- Sherry Bhalla
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Centre for Systems Biology and Bioinformatics, Panjab University, Chandigarh, India
| | - Harpreet Kaur
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Rishemjit Kaur
- CSIR-Central Scientific Instruments Organization, Chandigarh, India
| | - Suresh Sharma
- Centre for Systems Biology and Bioinformatics, Panjab University, Chandigarh, India
| | - Gajendra P. S. Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- * E-mail:
| |
Collapse
|
20
|
Gupta AK, Kumar A, Rajput A, Kaur K, Dar SA, Thakur A, Megha K, Kumar M. NipahVR: a resource of multi-targeted putative therapeutics and epitopes for the Nipah virus. Database (Oxford) 2020; 2020:baz159. [PMID: 32090261 PMCID: PMC7036594 DOI: 10.1093/database/baz159] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 12/20/2019] [Accepted: 12/23/2020] [Indexed: 12/20/2022]
Abstract
Nipah virus (NiV) is an emerging and priority pathogen from the Paramyxoviridae family with a high fatality rate. It causes various diseases such as respiratory ailments and encephalitis and poses a great threat to humans and livestock. Despite various efforts, there is no approved antiviral treatment available. Therefore, to expedite and assist the research, we have developed an integrative resource NipahVR (http://bioinfo.imtech.res.in/manojk/nipahvr/) for the multi-targeted putative therapeutics and epitopes for NiV. It is structured into different sections, i.e. genomes, codon usage, phylogenomics, molecular diagnostic primers, therapeutics (siRNAs, sgRNAs, miRNAs) and vaccine epitopes (B-cell, CTL, MHC-I and -II binders). Most decisively, potentially efficient therapeutic regimens targeting different NiV proteins and genes were anticipated and projected. We hope this computational resource would be helpful in developing combating strategies against this deadly pathogen. Database URL: http://bioinfo.imtech.res.in/manojk/nipahvr/.
Collapse
Affiliation(s)
- Amit Kumar Gupta
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Archit Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Karambir Kaur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Showkat Ahmed Dar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Kirti Megha
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| |
Collapse
|
21
|
Monga I, Banerjee I. Computational Identification of piRNAs Using Features Based on RNA Sequence, Structure, Thermodynamic and Physicochemical Properties. Curr Genomics 2020; 20:508-518. [PMID: 32655289 PMCID: PMC7327968 DOI: 10.2174/1389202920666191129112705] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2019] [Revised: 11/08/2019] [Accepted: 11/22/2019] [Indexed: 01/09/2023] Open
Abstract
Rationale PIWI-interacting RNAs (piRNAs) are a recently-discovered class of small non-coding RNAs (ncRNAs) with a length of 21-35 nucleotides. They play a role in gene expression regulation, transposon silencing, and viral infection inhibition. Once considered as "dark matter" of ncRNAs, piRNAs emerged as important players in multiple cellular functions in different organisms. However, our knowledge of piRNAs is still very limited as many piRNAs have not been yet identified due to lack of robust computational predictive tools. Methods To identify novel piRNAs, we developed piRNAPred, an integrated framework for piRNA prediction employing hybrid features like k-mer nucleotide composition, secondary structure, thermodynamic and physicochemical properties. A non-redundant dataset (D3349 or D1684p+1665n) comprising 1684 experimentally verified piRNAs and 1665 non-piRNA sequences was obtained from piRBase and NONCODE, respectively. These sequences were subjected to the computation of various sequence-structure based features in binary format and trained using different machine learning techniques, of which support vector machine (SVM) performed the best. Results During the ten-fold cross-validation approach (10-CV), piRNAPred achieved an overall accuracy of 98.60% with Mathews correlation coefficient (MCC) of 0.97 and receiver operating characteristic (ROC) of 0.99. Furthermore, we achieved a dimensionality reduction of feature space using an attribute selected classifier. Conclusion We obtained the highest performance in accurately predicting piRNAs as compared to the current state-of-the-art piRNA predictors. In conclusion, piRNAPred would be helpful to expand the piRNA repertoire, and provide new insights on piRNA functions.
Collapse
Affiliation(s)
- Isha Monga
- Cellular Virology Laboratory, Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali (IISER Mohali) Sector 81, S.A.S. Nagar, Mohali-140306, India
| | - Indranil Banerjee
- Cellular Virology Laboratory, Department of Biological Sciences, Indian Institute of Science Education and Research, Mohali (IISER Mohali) Sector 81, S.A.S. Nagar, Mohali-140306, India
| |
Collapse
|
22
|
Pathania S, Randhawa V, Kumar M. Identifying potential entry inhibitors for emerging Nipah virus by molecular docking and chemical-protein interaction network. J Biomol Struct Dyn 2019; 38:5108-5125. [PMID: 31771426 DOI: 10.1080/07391102.2019.1696705] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Shivalika Pathania
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific & Industrial Research, Chandigarh, India
| | - Vinay Randhawa
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific & Industrial Research, Chandigarh, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific & Industrial Research, Chandigarh, India
| |
Collapse
|
23
|
Bhalla S, Kaur H, Dhall A, Raghava GPS. Prediction and Analysis of Skin Cancer Progression using Genomics Profiles of Patients. Sci Rep 2019; 9:15790. [PMID: 31673075 PMCID: PMC6823463 DOI: 10.1038/s41598-019-52134-4] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 10/07/2019] [Indexed: 02/07/2023] Open
Abstract
The metastatic Skin Cutaneous Melanoma (SKCM) has been associated with diminished survival rates and high mortality rates worldwide. Thus, segregating metastatic melanoma from the primary tumors is crucial to employ an optimal therapeutic strategy for the prolonged survival of patients. The SKCM mRNA, miRNA and methylation data of TCGA is comprehensively analysed to recognize key genomic features that can segregate metastatic and primary tumors. Further, machine learning models have been developed using selected features to distinguish the same. The Support Vector Classification with Weight (SVC-W) model developed using the expression of 17 mRNAs achieved Area under the Receiver Operating Characteristic (AUROC) curve of 0.95 and an accuracy of 89.47% on an independent validation dataset. This study reveals the genes C7, MMP3, KRT14, LOC642587, CASP7, S100A7 and miRNAs hsa-mir-205 and hsa-mir-203b as the key genomic features that may substantially contribute to the oncogenesis of melanoma. Our study also proposes genes ESM1, NFATC3, C7orf4, CDK14, ZNF827, and ZSWIM7 as novel putative markers for cutaneous melanoma metastasis. The major prediction models and analysis modules to predict metastatic and primary tumor samples of SKCM are available from a webserver, CancerSPP ( http://webs.iiitd.edu.in/raghava/cancerspp/ ).
Collapse
Affiliation(s)
- Sherry Bhalla
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Centre for Systems Biology and Bioinformatics, Panjab University, Chandigarh, India
| | - Harpreet Kaur
- CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Anjali Dhall
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Gajendra P S Raghava
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
| |
Collapse
|
24
|
Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles. PLoS One 2019; 14:e0221476. [PMID: 31490960 PMCID: PMC6730898 DOI: 10.1371/journal.pone.0221476] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 08/07/2019] [Indexed: 02/07/2023] Open
Abstract
Background Liver Hepatocellular Carcinoma (LIHC) is one of the major cancers worldwide, responsible for millions of premature deaths every year. Prediction of clinical staging is vital to implement optimal therapeutic strategy and prognostic prediction in cancer patients. However, to date, no method has been developed for predicting the stage of LIHC from the genomic profile of samples. Methods The Cancer Genome Atlas (TCGA) dataset of 173 early stage (stage-I), 177 late stage (stage-II, Stage-III and stage-IV) and 50 adjacent normal tissue samples for 60,483 RNA transcripts and 485,577 methylation CpG sites, was extensively analyzed to identify the key transcriptomic expression and methylation-based features using different feature selection techniques. Further, different classification models were developed based on selected key features to categorize different classes of samples implementing different machine learning algorithms. Results In the current study, in silico models have been developed for classifying LIHC patients in the early vs. late stage and cancerous vs. normal samples using RNA expression and DNA methylation data. TCGA datasets were extensively analyzed to identify differentially expressed RNA transcripts and methylated CpG sites that can discriminate early vs. late stages and cancer vs. normal samples of LIHC with high precision. Naive Bayes model developed using 51 features that combine 21 CpG methylation sites and 30 RNA transcripts achieved maximum MCC (Matthew’s correlation coefficient) 0.58 with an accuracy of 78.87% on the validation dataset in discrimination of early and late stage. Additionally, the prediction models developed based on 5 RNA transcripts and 5 CpG sites classify LIHC and normal samples with an accuracy of 96–98% and AUC (Area Under the Receiver Operating Characteristic curve) 0.99. Besides, multiclass models also developed for classifying samples in the normal, early and late stage of cancer and achieved an accuracy of 76.54% and AUC of 0.86. Conclusion Our study reveals stage prediction of LIHC samples with high accuracy based on the genomics and epigenomics profiling is a challenging task in comparison to the classification of cancerous and normal samples. Comprehensive analysis, differentially expressed RNA transcripts, methylated CpG sites in LIHC samples and prediction models are available from CancerLSP (http://webs.iiitd.edu.in/raghava/cancerlsp/).
Collapse
|
25
|
Shmushkovich T, Monopoli KR, Homsy D, Leyfer D, Betancur-Boissel M, Khvorova A, Wolfson AD. Functional features defining the efficacy of cholesterol-conjugated, self-deliverable, chemically modified siRNAs. Nucleic Acids Res 2019; 46:10905-10916. [PMID: 30169779 PMCID: PMC6237813 DOI: 10.1093/nar/gky745] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 08/24/2018] [Indexed: 12/14/2022] Open
Abstract
Progress in oligonucleotide chemistry has produced a shift in the nature of siRNA used, from formulated, minimally modified siRNAs, to unformulated, heavily modified siRNA conjugates. The introduction of extensive chemical modifications is essential for conjugate-mediated delivery. Modifications have a significant impact on siRNA efficacy through interference with recognition and processing by RNAi enzymatic machinery, severely restricting the sequence space available for siRNA design. Many algorithms available publicly can successfully predict the activity of non-modified siRNAs, but the efficiency of the algorithms for designing heavily modified siRNAs has never been systematically evaluated experimentally. Here we screened 356 cholesterol-conjugated siRNAs with extensive modifications and developed a linear regression-based algorithm that effectively predicts siRNA activity using two independent datasets. We further demonstrate that predictive determinants for modified and non-modified siRNAs differ substantially. The algorithm developed from the non-modified siRNAs dataset has no predictive power for modified siRNAs and vice versa. In the context of heavily modified siRNAs, the introduction of chemical asymmetry fully eliminates the requirement for thermodynamic bias, the major determinant for non-modified siRNA efficacy. Finally, we demonstrate that in addition to the sequence of the target site, the accessibility of the neighboring 3′ region significantly contributes to siRNA efficacy.
Collapse
Affiliation(s)
| | | | - Diana Homsy
- Advirna, 60 Prescott Street, Worcester, MA 01605, USA
| | - Dmitriy Leyfer
- Advirna, 60 Prescott Street, Worcester, MA 01605, USA.,Boston University, 44 Cummington Mall, Boston, MA 02215, USA
| | | | - Anastasia Khvorova
- University of Massachusetts Medical School, 368 Plantation Street. Worcester, MA 01655, USA
| | | |
Collapse
|
26
|
Narang P, Dangi M, Sharma D, Khichi A, Chhillar AK. An Integrated Chikungunya Virus Database to Facilitate Therapeutic Analysis: ChkVDb. Curr Bioinform 2019. [DOI: 10.2174/1574893613666181029124848] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Background:
Chikungunya infection flare-ups have manifested in nations of Africa,
Asia, and Europe including Indian and Pacific seas. It causes fever and different side effects
include muscle torment, migraine, sickness, exhaustion and rash. It has turned into another,
startling general medical issue in numerous tropical African and Asian countries and is presently
being viewed as a genuine risk. No antiviral treatment or vaccine is yet available for this ailment.
The current treatment is centered just on mitigating its side effects.
Objective:
The objective was to encourage the study on this viral pathogen, by the development of
a database dedicated to Chikungunya Virus, that annotates and unifies the related data from
various resources.
associations while known disease-lncRNA associations are required only.
Method:
It undertook a consolidated approach for Chikungunya Virus genomic, proteomic,
phylogenetics and therapeutic learning, involving the entire genome sequences and their
annotation utilizing different in silico tools. Annotation included the information for CpG Island,
usage bias, codon context and phylogenetic analysis at both the genome and proteome levels.
Results:
This database incorporates information of 41 strains of virus causing Chikungunya
infection that can be accessed conveniently as well as downloaded effortlessly. Therapeutics
section of this database contains data about B and T cell Epitopes, siRNAs and miRNAs that can
be used as potential therapeutic targets.
Conclusion:
This database can be explored by specialists and established researchers around the
world to assist their research on this non-treatable virus. It is a public database available from
“www.chkv.in”.</P>
Collapse
Affiliation(s)
- Priya Narang
- Centre for Bioinformatics, M.D. University, Rohtak, India
| | - Mehak Dangi
- Centre for Bioinformatics, M.D. University, Rohtak, India
| | - Deepak Sharma
- Centre for Bioinformatics, M.D. University, Rohtak, India
| | - Alka Khichi
- Centre for Bioinformatics, M.D. University, Rohtak, India
| | | |
Collapse
|
27
|
To accelerate the Zika beat: Candidate design for RNA interference-based therapy. Virus Res 2018; 255:133-140. [DOI: 10.1016/j.virusres.2018.07.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 06/11/2018] [Accepted: 07/17/2018] [Indexed: 12/12/2022]
|
28
|
Qureshi A, Tantray VG, Kirmani AR, Ahangar AG. A review on current status of antiviral siRNA. Rev Med Virol 2018; 28:e1976. [PMID: 29656441 PMCID: PMC7169094 DOI: 10.1002/rmv.1976] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Revised: 01/18/2018] [Accepted: 02/12/2018] [Indexed: 01/12/2023]
Abstract
Viral diseases like influenza, AIDS, hepatitis, and Ebola cause severe epidemics worldwide. Along with their resistant strains, new pathogenic viruses continue to be discovered so creating an ongoing need for new antiviral treatments. RNA interference is a cellular gene‐silencing phenomenon in which sequence‐specific degradation of target mRNA is achieved by means of complementary short interfering RNA (siRNA) molecules. Short interfering RNA technology affords a potential tractable strategy to combat viral pathogenesis because siRNAs are specific, easy to design, and can be directed against multiple strains of a virus by targeting their conserved gene regions. In this review, we briefly summarize the current status of siRNA therapy for representative examples from different virus families. In addition, other aspects like their design, delivery, medical significance, bioinformatics resources, and limitations are also discussed.
Collapse
Affiliation(s)
- Abid Qureshi
- Biomedical Informatics Center, Sher-i-Kashmir Institute of Medical Sciences (SKIMS), Srinagar, India
| | - Vaqar Gani Tantray
- Biomedical Informatics Center, Sher-i-Kashmir Institute of Medical Sciences (SKIMS), Srinagar, India
| | - Altaf Rehman Kirmani
- Biomedical Informatics Center, Sher-i-Kashmir Institute of Medical Sciences (SKIMS), Srinagar, India
| | - Abdul Ghani Ahangar
- Biomedical Informatics Center, Sher-i-Kashmir Institute of Medical Sciences (SKIMS), Srinagar, India
| |
Collapse
|
29
|
Thakur A, Qureshi A, Kumar M. vhfRNAi: a web-platform for analysis of host genes involved in viral infections discovered by genome wide RNAi screens. MOLECULAR BIOSYSTEMS 2018; 13:1377-1387. [PMID: 28561835 DOI: 10.1039/c6mb00841k] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Knockdown of host genes using high-throughput genome-wide RNA interference screens has identified numerous host factors that affect viral infections, which would be helpful in understanding host-virus interactions. We have developed a vhfRNAi web resource based on genome-wide RNAi experiments for viruses. It contains experimental details of 12 249 entries (host factors + restriction factors) for 18 viruses. Simultaneously, this resource encompasses analysis of overlapping genes, genome wide association studies, gene ontology (GO), pathogen interacting proteins, interaction networks and pathway enrichment. Using overlap analysis, it was found that Influenza A virus shared overlapping host genes with the majority of viruses including Hepatitis C virus and Dengue virus 2. In the genome wide association studies analysis, 429 diseases/traits were mapped, of which obesity-related traits were the most common. GO analysis revealed that the major categories belonged to metabolic processes, molecule transport, signal transduction, proteolysis, etc. In the pathogen interacting protein analysis, protein interaction data from different resources can be explored for further understanding of host-virus biology. By pathway enrichment analysis, a total of 8955 genes were mapped on 303 pathways with most of the hits coming from metabolic pathways. We have found 491 genes that are not essential for the host but essential for the virus and can be targeted to inhibit the virus. These may be explored as potential candidates for drug targets. The resource is freely accessible at and will be useful in understanding host-virus biology as well as identification of targets for the development of antiviral therapeutics.
Collapse
Affiliation(s)
- Anamika Thakur
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39-A, Chandigarh-160036, India.
| | | | | |
Collapse
|
30
|
Abstract
A number of anti-retroviral drugs are being used for treating Human Immunodeficiency Virus (HIV) infection. Due to emergence of drug resistant strains, there is a constant quest to discover more effective anti-HIV compounds. In this endeavor, computational tools have proven useful in accelerating drug discovery. Although methods were published to design a class of compounds against a specific HIV protein, but an integrated web server for the same is lacking. Therefore, we have developed support vector machine based regression models using experimentally validated data from ChEMBL repository. Quantitative structure activity relationship based features were selected for predicting inhibition activity of a compound against HIV proteins namely protease (PR), reverse transcriptase (RT) and integrase (IN). The models presented a maximum Pearson correlation coefficient of 0.78, 0.76, 0.74 and 0.76, 0.68, 0.72 during tenfold cross-validation on IC50 and percent inhibition datasets of PR, RT, IN respectively.
These models performed equally well on the independent datasets. Chemical space mapping, applicability domain analyses and other statistical tests further support robustness of the predictive models. Currently, we have identified a number of chemical descriptors that are imperative in predicting the compound inhibition potential. HIVprotI platform (http://bioinfo.imtech.res.in/manojk/hivproti) would be useful in virtual screening of inhibitors as well as designing of new molecules against the important HIV proteins for therapeutics development.![]()
Collapse
|
31
|
Villegas PM, Ortega E, Villa-Tanaca L, Barrón BL, Torres-Flores J. Inhibition of dengue virus infection by small interfering RNAs that target highly conserved sequences in the NS4B or NS5 coding regions. Arch Virol 2018; 163:1331-1335. [PMID: 29392497 DOI: 10.1007/s00705-018-3757-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 01/12/2018] [Indexed: 01/19/2023]
Abstract
Dengue fever is one of the most common viral infections in the world. Although a vaccine against dengue virus (DENV) has been approved in several countries, this disease is still considered a public health priority worldwide. The ability of three small interfering RNAs (FG-siRNAs) targeting conserved sequences in the NS4B and NS5 regions of the DENV genome to inhibit DENV replication was tested in vitro in both Vero and C6/36 cells. The FG-siRNAs were effective against DENV-1, -3, and -4, but not DENV-2. A fourth siRNA specifically targeting the NS5 region of the DENV-2 genome (SG-siRNA) was designed and tested against two different DENV-2 strains, showing high levels of inhibition in both mammalian and insect cells.
Collapse
Affiliation(s)
- Paula M Villegas
- Laboratorio de Virología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Carpio y Plan de Ayala S/N, 11340, Mexico City, Mexico
| | - Elizabeth Ortega
- Laboratorio de Virología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Carpio y Plan de Ayala S/N, 11340, Mexico City, Mexico
| | - Lourdes Villa-Tanaca
- Laboratorio de Genética Microbiana, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Carpio y Plan de Ayala S/N, 11340, Mexico City, Mexico
| | - Blanca L Barrón
- Laboratorio de Virología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Carpio y Plan de Ayala S/N, 11340, Mexico City, Mexico
| | - Jesus Torres-Flores
- Laboratorio de Virología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Carpio y Plan de Ayala S/N, 11340, Mexico City, Mexico.
| |
Collapse
|
32
|
Time-series oligonucleotide count to assign antiviral siRNAs with long utility fit in the big data era. Gene Ther 2017; 24:668-673. [PMID: 28905886 PMCID: PMC5658673 DOI: 10.1038/gt.2017.76] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 06/18/2017] [Accepted: 07/31/2017] [Indexed: 12/25/2022]
Abstract
Oligonucleotides are key elements of nucleic acid therapeutics such as small interfering RNAs (siRNAs). Influenza and Ebolaviruses are zoonotic RNA viruses mutating very rapidly, and their sequence changes must be characterized intensively to design therapeutic oligonucleotides with long utility. Focusing on a total of 182 experimentally validated siRNAs for influenza A, B and Ebolaviruses compiled by the siRNA database, we conducted time-series analyses of occurrences of siRNA targets in these viral genomes. Reflecting their high mutation rates, occurrences of target oligonucleotides evidently fluctuate in viral populations and often disappear. Time-series analysis of the one-base changed sequences derived from each original target identified the oligonucleotide that shows a compensatory increase and will potentially become the ‘awaiting-type oligonucleotide’; the combined use of this oligonucleotide with the original can provide therapeutics with long utility. This strategy is also useful for assigning diagnostic reverse transcription-PCR primers with long utility.
Collapse
|
33
|
ASPsiRNA: A Resource of ASP-siRNAs Having Therapeutic Potential for Human Genetic Disorders and Algorithm for Prediction of Their Inhibitory Efficacy. G3-GENES GENOMES GENETICS 2017; 7:2931-2943. [PMID: 28696921 PMCID: PMC5592921 DOI: 10.1534/g3.117.044024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Allele-specific siRNAs (ASP-siRNAs) have emerged as promising therapeutic molecules owing to their selectivity to inhibit the mutant allele or associated single-nucleotide polymorphisms (SNPs) sparing the expression of the wild-type counterpart. Thus, a dedicated bioinformatics platform encompassing updated ASP-siRNAs and an algorithm for the prediction of their inhibitory efficacy will be helpful in tackling currently intractable genetic disorders. In the present study, we have developed the ASPsiRNA resource (http://crdd.osdd.net/servers/aspsirna/) covering three components viz (i) ASPsiDb, (ii) ASPsiPred, and (iii) analysis tools like ASP-siOffTar. ASPsiDb is a manually curated database harboring 4543 (including 422 chemically modified) ASP-siRNAs targeting 78 unique genes involved in 51 different diseases. It furnishes comprehensive information from experimental studies on ASP-siRNAs along with multidimensional genetic and clinical information for numerous mutations. ASPsiPred is a two-layered algorithm to predict efficacy of ASP-siRNAs for fully complementary mutant (Effmut) and wild-type allele (Effwild) with one mismatch by ASPsiPredSVM and ASPsiPredmatrix, respectively. In ASPsiPredSVM, 922 unique ASP-siRNAs with experimentally validated quantitative Effmut were used. During 10-fold cross-validation (10nCV) employing various sequence features on the training/testing dataset (T737), the best predictive model achieved a maximum Pearson’s correlation coefficient (PCC) of 0.71. Further, the accuracy of the classifier to predict Effmut against novel genes was assessed by leave one target out cross-validation approach (LOTOCV). ASPsiPredmatrix was constructed from rule-based studies describing the effect of single siRNA:mRNA mismatches on the efficacy at 19 different locations of siRNA. Thus, ASPsiRNA encompasses the first database, prediction algorithm, and off-target analysis tool that is expected to accelerate research in the field of RNAi-based therapeutics for human genetic diseases.
Collapse
|
34
|
Qureshi A, Kaur G, Kumar M. AVCpred: an integrated web server for prediction and design of antiviral compounds. Chem Biol Drug Des 2017; 89:74-83. [PMID: 27490990 PMCID: PMC7162012 DOI: 10.1111/cbdd.12834] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 07/21/2016] [Accepted: 07/25/2016] [Indexed: 12/11/2022]
Abstract
Viral infections constantly jeopardize the global public health due to lack of effective antiviral therapeutics. Therefore, there is an imperative need to speed up the drug discovery process to identify novel and efficient drug candidates. In this study, we have developed quantitative structure-activity relationship (QSAR)-based models for predicting antiviral compounds (AVCs) against deadly viruses like human immunodeficiency virus (HIV), hepatitis C virus (HCV), hepatitis B virus (HBV), human herpesvirus (HHV) and 26 others using publicly available experimental data from the ChEMBL bioactivity database. Support vector machine (SVM) models achieved a maximum Pearson correlation coefficient of 0.72, 0.74, 0.66, 0.68, and 0.71 in regression mode and a maximum Matthew's correlation coefficient 0.91, 0.93, 0.70, 0.89, and 0.71, respectively, in classification mode during 10-fold cross-validation. Furthermore, similar performance was observed on the independent validation sets. We have integrated these models in the AVCpred web server, freely available at http://crdd.osdd.net/servers/avcpred. In addition, the datasets are provided in a searchable format. We hope this web server will assist researchers in the identification of potential antiviral agents. It would also save time and cost by prioritizing new drugs against viruses before their synthesis and experimental testing.
Collapse
Affiliation(s)
- Abid Qureshi
- Bioinformatics CentreInstitute of Microbial TechnologyCouncil of Scientific and Industrial ResearchChandigarhIndia
| | - Gazaldeep Kaur
- Bioinformatics CentreInstitute of Microbial TechnologyCouncil of Scientific and Industrial ResearchChandigarhIndia
| | - Manoj Kumar
- Bioinformatics CentreInstitute of Microbial TechnologyCouncil of Scientific and Industrial ResearchChandigarhIndia
| |
Collapse
|
35
|
ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis. Sci Rep 2016; 6:32713. [PMID: 27633273 PMCID: PMC5025660 DOI: 10.1038/srep32713] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Accepted: 08/11/2016] [Indexed: 12/22/2022] Open
Abstract
Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates.
Collapse
|
36
|
Dar SA, Gupta AK, Thakur A, Kumar M. SMEpred workbench: A web server for predicting efficacy of chemicallymodified siRNAs. RNA Biol 2016; 13:1144-1151. [PMID: 27603513 DOI: 10.1080/15476286.2016.1229733] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Chemical modifications have been extensively exploited to circumvent shortcomings in therapeutic applications of small interfering RNAs (siRNAs). However, experimental designing and testing of these siRNAs or chemically modified siRNAs (cm-siRNAs) involves enormous resources. Therefore, in-silico intervention in designing cm-siRNAs would be of utmost importance. We developed SMEpred workbench to predict the efficacy of normal siRNAs as well as cm-siRNAs using 3031 heterogeneous cm-siRNA sequences from siRNAmod database. These include 30 frequently used chemical modifications on different positions of either siRNA strand. Support Vector Machine (SVM) was employed to develop predictive models utilizing various sequence features namely mono-, di-nucleotide composition, binary pattern and their hybrids. We achieved highest Pearson Correlation Coefficient (PCC) of 0.80 during 10-fold cross validation and similar PCC value in independent validation. We have provided the algorithm in the 'SMEpred' pipeline to predict the normal siRNAs from the gene or mRNA sequence. For multiple modifications, we have assembled 'MultiModGen' module to design multiple modifications and further process them to evaluate their predicted efficacies. SMEpred webserver will be useful to scientific community engaged in use of RNAi-based technology as well as for therapeutic development. Web server is available for public use at following URL address: http://bioinfo.imtech.res.in/manojk/smepred .
Collapse
Affiliation(s)
- Showkat Ahmad Dar
- a Bioinformatics Center, Institute of Microbial Technology, Council of Scientific and Industrial Research , Chandigarh , India
| | - Amit Kumar Gupta
- a Bioinformatics Center, Institute of Microbial Technology, Council of Scientific and Industrial Research , Chandigarh , India
| | - Anamika Thakur
- a Bioinformatics Center, Institute of Microbial Technology, Council of Scientific and Industrial Research , Chandigarh , India
| | - Manoj Kumar
- a Bioinformatics Center, Institute of Microbial Technology, Council of Scientific and Industrial Research , Chandigarh , India
| |
Collapse
|
37
|
Kaur K, Gupta AK, Rajput A, Kumar M. ge-CRISPR - An integrated pipeline for the prediction and analysis of sgRNAs genome editing efficiency for CRISPR/Cas system. Sci Rep 2016; 6:30870. [PMID: 27581337 PMCID: PMC5007494 DOI: 10.1038/srep30870] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 07/08/2016] [Indexed: 12/28/2022] Open
Abstract
Genome editing by sgRNA a component of CRISPR/Cas system emerged as a preferred technology for genome editing in recent years. However, activity and stability of sgRNA in genome targeting is greatly influenced by its sequence features. In this endeavor, a few prediction tools have been developed to design effective sgRNAs but these methods have their own limitations. Therefore, we have developed "ge-CRISPR" using high throughput data for the prediction and analysis of sgRNAs genome editing efficiency. Predictive models were employed using SVM for developing pipeline-1 (classification) and pipeline-2 (regression) using 2090 and 4139 experimentally verified sgRNAs respectively from Homo sapiens, Mus musculus, Danio rerio and Xenopus tropicalis. During 10-fold cross validation we have achieved accuracy and Matthew's correlation coefficient of 87.70% and 0.75 for pipeline-1 on training dataset (T(1840)) while it performed equally well on independent dataset (V(250)). In pipeline-2 we attained Pearson correlation coefficient of 0.68 and 0.69 using best models on training (T(3169)) and independent dataset (V(520)) correspondingly. ge-CRISPR (http://bioinfo.imtech.res.in/manojk/gecrispr/) for a given genomic region will identify potent sgRNAs, their qualitative as well as quantitative efficiencies along with potential off-targets. It will be useful to scientific community engaged in CRISPR research and therapeutics development.
Collapse
Affiliation(s)
- Karambir Kaur
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh-160036, India
| | - Amit Kumar Gupta
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh-160036, India
| | - Akanksha Rajput
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh-160036, India
| | - Manoj Kumar
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh-160036, India
| |
Collapse
|
38
|
Qureshi A, Tandon H, Kumar M. AVP-IC50 Pred: Multiple machine learning techniques-based prediction of peptide antiviral activity in terms of half maximal inhibitory concentration (IC50). Biopolymers 2015; 104:753-63. [PMID: 26213387 PMCID: PMC7161829 DOI: 10.1002/bip.22703] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Revised: 06/16/2015] [Accepted: 07/21/2015] [Indexed: 01/29/2023]
Abstract
Peptide-based antiviral therapeutics has gradually paved their way into mainstream drug discovery research. Experimental determination of peptides' antiviral activity as expressed by their IC50 values involves a lot of effort. Therefore, we have developed "AVP-IC50 Pred," a regression-based algorithm to predict the antiviral activity in terms of IC50 values (μM). A total of 759 non-redundant peptides from AVPdb and HIPdb were divided into a training/test set having 683 peptides (T(683)) and a validation set with 76 independent peptides (V(76)) for evaluation. We utilized important peptide sequence features like amino-acid compositions, binary profile of N8-C8 residues, physicochemical properties and their hybrids. Four different machine learning techniques (MLTs) namely Support vector machine, Random Forest, Instance-based classifier, and K-Star were employed. During 10-fold cross validation, we achieved maximum Pearson correlation coefficients (PCCs) of 0.66, 0.64, 0.56, 0.55, respectively, for the above MLTs using the best combination of feature sets. All the predictive models also performed well on the independent validation dataset and achieved maximum PCCs of 0.74, 0.68, 0.59, 0.57, respectively, on the best combination of feature sets. The AVP-IC50 Pred web server is anticipated to assist the researchers working on antiviral therapeutics by enabling them to computationally screen many compounds and focus experimental validation on the most promising set of peptides, thus reducing cost and time efforts. The server is available at http://crdd.osdd.net/servers/ic50avp.
Collapse
Affiliation(s)
- Abid Qureshi
- Bioinformatics Centre, Institute of Microbial TechnologyCouncil of Scientific and Industrial ResearchSector 39‐AChandigarh160036India
| | - Himani Tandon
- Bioinformatics Centre, Institute of Microbial TechnologyCouncil of Scientific and Industrial ResearchSector 39‐AChandigarh160036India
| | - Manoj Kumar
- Bioinformatics Centre, Institute of Microbial TechnologyCouncil of Scientific and Industrial ResearchSector 39‐AChandigarh160036India
| |
Collapse
|
39
|
Unraveling the web of viroinformatics: computational tools and databases in virus research. J Virol 2014; 89:1489-501. [PMID: 25428870 DOI: 10.1128/jvi.02027-14] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The beginning of the second century of research in the field of virology (the first virus was discovered in 1898) was marked by its amalgamation with bioinformatics, resulting in the birth of a new domain--viroinformatics. The availability of more than 100 Web servers and databases embracing all or specific viruses (for example, dengue virus, influenza virus, hepatitis virus, human immunodeficiency virus [HIV], hemorrhagic fever virus [HFV], human papillomavirus [HPV], West Nile virus, etc.) as well as distinct applications (comparative/diversity analysis, viral recombination, small interfering RNA [siRNA]/short hairpin RNA [shRNA]/microRNA [miRNA] studies, RNA folding, protein-protein interaction, structural analysis, and phylotyping and genotyping) will definitely aid the development of effective drugs and vaccines. However, information about their access and utility is not available at any single source or on any single platform. Therefore, a compendium of various computational tools and resources dedicated specifically to virology is presented in this article.
Collapse
|