1
|
Agu PC, Obulose CN. Piquing artificial intelligence towards drug discovery: Tools, techniques, and applications. Drug Dev Res 2024; 85:e22159. [PMID: 38375772 DOI: 10.1002/ddr.22159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 01/12/2024] [Accepted: 01/29/2024] [Indexed: 02/21/2024]
Abstract
The purpose of this study was to discuss how artificial intelligence (AI) methods have affected the field of drug development. It looks at how AI models and data resources are reshaping the drug development process by offering more affordable and expedient options to conventional approaches. The paper opens with an overview of well-known information sources for drug development. The discussion then moves on to molecular representation techniques that make it possible to convert data into representations that computers can understand. The paper also gives a general overview of the algorithms used in the creation of drug discovery models based on AI. In particular, the paper looks at how AI algorithms might be used to forecast drug toxicity, drug bioactivity, and drug physicochemical properties. De novo drug design, binding affinity prediction, and other AI-based models for drug-target interaction were covered in deeper detail. Modern applications of AI in nanomedicine design and pharmacological synergism/antagonism prediction were also covered. The potential advantages of AI in drug development are highlighted as the evaluation comes to a close. It underlines how AI may greatly speed up and improve the efficiency of drug discovery, resulting in the creation of new and better medicines. To fully realize the promise of AI in drug discovery, the review acknowledges the difficulties that come with its uses in this field and advocates for more study and development.
Collapse
Affiliation(s)
- Peter Chinedu Agu
- Department of Biochemistry, College of Science, Evangel University, Akaeze, Ebonyi State, Nigeria
| | - Chidiebere Nwiboko Obulose
- Department of Computer Sciences, Our Savior Institute of Science, Agriculture, and Technology (OSISATECH Polytechnic), Enugu, Nigeria
| |
Collapse
|
2
|
Liu T, Huang J, Luo D, Ren L, Ning L, Huang J, Lin H, Zhang Y. Cm-siRPred: Predicting chemically modified siRNA efficiency based on multi-view learning strategy. Int J Biol Macromol 2024; 264:130638. [PMID: 38460652 DOI: 10.1016/j.ijbiomac.2024.130638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 12/27/2023] [Accepted: 03/03/2024] [Indexed: 03/11/2024]
Abstract
The rational modification of siRNA molecules is crucial for ensuring their drug-like properties. Machine learning-based prediction of chemically modified siRNA (cm-siRNA) efficiency can significantly optimize the design process of siRNA chemical modifications, saving time and cost in siRNA drug development. However, existing in-silico methods suffer from limitations such as small datasets, inadequate data representation capabilities, and lack of interpretability. Therefore, in this study, we developed the Cm-siRPred algorithm based on a multi-view learning strategy. The algorithm employs a multi-view strategy to represent the double-strand sequences, chemical modifications, and physicochemical properties of cm-siRNA. It incorporates a cross-attention model to globally correlate different representation vectors and a two-layer CNN module to learn local correlation features. The algorithm demonstrates exceptional performance in cross-validation experiments, independent dataset, and case studies on approved siRNA drugs, and showcasing its robustness and generalization ability. In addition, we developed a user-friendly webserver that enables efficient prediction of cm-siRNA efficiency and assists in the design of siRNA drug chemical modifications. In summary, Cm-siRPred is a practical tool that offers valuable technical support for siRNA chemical modification and drug efficiency research, while effectively assisting in the development of novel small nucleic acid drugs. Cm-siRPred is freely available at https://cellknowledge.com.cn/sirnapredictor/.
Collapse
Affiliation(s)
- Tianyuan Liu
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Junyang Huang
- School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Delun Luo
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China; Chengdu Jingrunze Gene Technology Company Limited, Chengdu 611138, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.
| |
Collapse
|
3
|
Arora P, Behera M, Saraf SA, Shukla R. Leveraging Artificial Intelligence for Synergies in Drug Discovery: From Computers to Clinics. Curr Pharm Des 2024; 30:2187-2205. [PMID: 38874046 DOI: 10.2174/0113816128308066240529121148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/27/2024] [Accepted: 04/03/2024] [Indexed: 06/15/2024]
Abstract
Over the period of the preceding decade, artificial intelligence (AI) has proved an outstanding performance in entire dimensions of science including pharmaceutical sciences. AI uses the concept of machine learning (ML), deep learning (DL), and neural networks (NNs) approaches for novel algorithm and hypothesis development by training the machines in multiple ways. AI-based drug development from molecule identification to clinical approval tremendously reduces the cost of development and the time over conventional methods. The COVID-19 vaccine development and approval by regulatory agencies within 1-2 years is the finest example of drug development. Hence, AI is fast becoming a boon for scientific researchers to streamline their advanced discoveries. AI-based FDA-approved nanomedicines perform well as target selective, synergistic therapies, recolonize the theragnostic pharmaceutical stream, and significantly improve drug research outcomes. This comprehensive review delves into the fundamental aspects of AI along with its applications in the realm of pharmaceutical life sciences. It explores AI's role in crucial areas such as drug designing, drug discovery and development, traditional Chinese medicine, integration of multi-omics data, as well as investigations into drug repurposing and polypharmacology studies.
Collapse
Affiliation(s)
- Priyanka Arora
- Department of Pharmaceutics, National Institute of Pharmaceutical Education and Research (NIPER)-Raebareli, Near CRPF Base Camp, Bijnor-Sisendi Road, Sarojini Nagar, Lucknow (UP)-226002, India
| | - Manaswini Behera
- Department of Pharmaceutics, National Institute of Pharmaceutical Education and Research (NIPER)-Raebareli, Near CRPF Base Camp, Bijnor-Sisendi Road, Sarojini Nagar, Lucknow (UP)-226002, India
| | - Shubhini A Saraf
- Department of Pharmaceutics, National Institute of Pharmaceutical Education and Research (NIPER)-Raebareli, Near CRPF Base Camp, Bijnor-Sisendi Road, Sarojini Nagar, Lucknow (UP)-226002, India
| | - Rahul Shukla
- Department of Pharmaceutics, National Institute of Pharmaceutical Education and Research (NIPER)-Raebareli, Near CRPF Base Camp, Bijnor-Sisendi Road, Sarojini Nagar, Lucknow (UP)-226002, India
| |
Collapse
|
4
|
Chen W, Liu X, Zhang S, Chen S. Artificial intelligence for drug discovery: Resources, methods, and applications. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 31:691-702. [PMID: 36923950 PMCID: PMC10009646 DOI: 10.1016/j.omtn.2023.02.019] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
Conventional wet laboratory testing, validations, and synthetic procedures are costly and time-consuming for drug discovery. Advancements in artificial intelligence (AI) techniques have revolutionized their applications to drug discovery. Combined with accessible data resources, AI techniques are changing the landscape of drug discovery. In the past decades, a series of AI-based models have been developed for various steps of drug discovery. These models have been used as complements of conventional experiments and have accelerated the drug discovery process. In this review, we first introduced the widely used data resources in drug discovery, such as ChEMBL and DrugBank, followed by the molecular representation schemes that convert data into computer-readable formats. Meanwhile, we summarized the algorithms used to develop AI-based models for drug discovery. Subsequently, we discussed the applications of AI techniques in pharmaceutical analysis including predicting drug toxicity, drug bioactivity, and drug physicochemical property. Furthermore, we introduced the AI-based models for de novo drug design, drug-target structure prediction, drug-target interaction, and binding affinity prediction. Moreover, we also highlighted the advanced applications of AI in drug synergism/antagonism prediction and nanomedicine design. Finally, we discussed the challenges and future perspectives on the applications of AI to drug discovery.
Collapse
Affiliation(s)
- Wei Chen
- State Key Laboratory of Southwestern Chinese Medicine Resources, Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.,Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Xuesong Liu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Sanyin Zhang
- State Key Laboratory of Southwestern Chinese Medicine Resources, Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.,Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Shilin Chen
- State Key Laboratory of Southwestern Chinese Medicine Resources, Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.,Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| |
Collapse
|
5
|
Fopase R, Panda C, Rajendran AP, Uludag H, Pandey LM. Potential of siRNA in COVID-19 therapy: Emphasis on in silico design and nanoparticles based delivery. Front Bioeng Biotechnol 2023; 11:1112755. [PMID: 36814718 PMCID: PMC9939533 DOI: 10.3389/fbioe.2023.1112755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023] Open
Abstract
Small interfering RNA (siRNA)-mediated mRNA degradation approach have imparted its eminence against several difficult-to-treat genetic disorders and other allied diseases. Viral outbreaks and resulting pandemics have repeatedly threatened public health and questioned human preparedness at the forefront of drug design and biomedical readiness. During the recent pandemic caused by the SARS-CoV-2, mRNA-based vaccination strategies have paved the way for a new era of RNA therapeutics. RNA Interference (RNAi) based approach using small interfering RNA may complement clinical management of the COVID-19. RNA Interference approach will primarily work by restricting the synthesis of the proteins required for viral replication, thereby hampering viral cellular entry and trafficking by targeting host as well as protein factors. Despite promising benefits, the stability of small interfering RNA in the physiological environment is of grave concern as well as site-directed targeted delivery and evasion of the immune system require immediate attention. In this regard, nanotechnology offers viable solutions for these challenges. The review highlights the potential of small interfering RNAs targeted toward specific regions of the viral genome and the features of nanoformulations necessary for the entrapment and delivery of small interfering RNAs. In silico design of small interfering RNA for different variants of SARS-CoV-2 has been discussed. Various nanoparticles as promising carriers of small interfering RNAs along with their salient properties, including surface functionalization, are summarized. This review will help tackle the real-world challenges encountered by the in vivo delivery of small interfering RNAs, ensuring a safe, stable, and readily available drug candidate for efficient management of SARS-CoV-2 in the future.
Collapse
Affiliation(s)
- Rushikesh Fopase
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Chinmaya Panda
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| | - Amarnath P. Rajendran
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
| | - Hasan Uludag
- Department of Chemical & Materials Engineering, Faculty of Engineering, University of Alberta, Edmonton, AB, Canada
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, Canada
- Department of Biomedical Engineering, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada
| | - Lalit M. Pandey
- Bio-Interface & Environmental Engineering Lab, Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Assam, India
| |
Collapse
|
6
|
Dong X, Zheng W. Cheminformatics Modeling of Gene Silencing for Both Natural and Chemically Modified siRNAs. Molecules 2022; 27:6412. [PMID: 36234948 PMCID: PMC9570765 DOI: 10.3390/molecules27196412] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 09/23/2022] [Accepted: 09/25/2022] [Indexed: 11/17/2022] Open
Abstract
In designing effective siRNAs for a specific mRNA target, it is critically important to have predictive models for the potency of siRNAs. None of the published methods characterized the chemical structures of individual nucleotides constituting a siRNA molecule; therefore, they cannot predict the potency of gene silencing by chemically modified siRNAs (cm-siRNA). We propose a new approach that can predict the potency of gene silencing by cm-siRNAs, which characterizes each nucleotide (NT) using 12 BCUT cheminformatics descriptors describing its charge distribution, hydrophobic and polar properties. Thus, a 21-NT siRNA molecule is described by 252 descriptors resulting from concatenating all the BCUT values of its composing nucleotides. Partial Least Square is employed to develop statistical models. The Huesken data (2431 natural siRNA molecules) were used to perform model building and evaluation for natural siRNAs. Our results were comparable with or superior to those from Huesken's algorithm. The Bramsen dataset (48 cm-siRNAs) was used to build and test the models for cm-siRNAs. The predictive r2 of the resulting models reached 0.65 (or Pearson r values of 0.82). Thus, this new method can be used to successfully model gene silencing potency by both natural and chemically modified siRNA molecules.
Collapse
Affiliation(s)
| | - Weifan Zheng
- BRITE Institute and Department of Pharmaceutical Sciences, College of Health and Sciences (CHAS), North Carolina Central University, Durham, NC 27707, USA
| |
Collapse
|
7
|
Design of siRNA molecules for silencing of membrane glycoprotein, nucleocapsid phosphoprotein, and surface glycoprotein genes of SARS-CoV2. J Genet Eng Biotechnol 2022; 20:65. [PMID: 35482116 PMCID: PMC9047631 DOI: 10.1186/s43141-022-00346-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 04/18/2022] [Indexed: 12/24/2022]
Abstract
The global COVID-19 pandemic caused by SARS-CoV2 infected millions of people and resulted in more than 4 million deaths worldwide. Apart from vaccines and drugs, RNA silencing is a novel approach for treating COVID-19. In the present study, siRNAs were designed for the conserved regions targeting three structural genes, M, N, and S, from forty whole-genome sequences of SARS-CoV2 using four different software, RNAxs, siDirect, i-Score Designer, and OligoWalk. Only siRNAs which were predicted in common by all the four servers were considered for further shortlisting. A multistep filtering approach has been adopted in the present study for the final selection of siRNAs by the usage of different online tools, viz., siRNA scales, MaxExpect, DuplexFold, and SMEpred. All these web-based tools consider several important parameters for designing functional siRNAs, e.g., target-site accessibility, duplex stability, position-specific nucleotide preference, inhibitory score, thermodynamic parameters, GC content, and efficacy in cleaving the target. In addition, a few parameters like GC content and dG value of the entire siRNA were also considered for shortlisting of the siRNAs. Antisense strands were subjected to check for any off-target similarities using BLAST. Molecular docking was carried out to study the interactions of guide strands with AGO2 protein. A total of six functional siRNAs (two for each gene) have been finally selected for targeting M, N, and S genes of SARS-CoV2. The siRNAs have not shown any off-target effects, interacted with the domain(s) of AGO2 protein, and were efficacious in cleaving the target mRNA. However, the siRNAs designed in the present study need to be tested in vitro and in vivo in the future.
Collapse
|
8
|
Abstract
Systematics is described for annotation of variations in RNA molecules. The conceptual framework is part of Variation Ontology (VariO) and facilitates depiction of types of variations, their functional and structural effects and other consequences in any RNA molecule in any organism. There are more than 150 RNA related VariO terms in seven levels, which can be further combined to generate even more complicated and detailed annotations. The terms are described together with examples, usually for variations and effects in human and in diseases. RNA variation type has two subcategories: variation classification and origin with subterms. Altogether six terms are available for function description. Several terms are available for affected RNA properties. The ontology contains also terms for structural description for affected RNA type, post-transcriptional RNA modifications, secondary and tertiary structure effects and RNA sugar variations. Together with the DNA and protein concepts and annotations, RNA terms allow comprehensive description of variations of genetic and non-genetic origin at all possible levels. The VariO annotations are readable both for humans and computer programs for advanced data integration and mining.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
9
|
Gupta AK, Kumar M. HPVomics: An integrated resource for the human papillomavirus epitome and therapeutics. Genomics 2020; 112:4853-4862. [PMID: 32871223 DOI: 10.1016/j.ygeno.2020.08.025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 08/07/2020] [Accepted: 08/19/2020] [Indexed: 12/01/2022]
Abstract
Human papillomaviruses (HPVs) belongs to the Papillomaviridae family, which is divided into high-risk (HR), and low-risk (LR) HPVs based on their disease-causing competence. HR-HPVs 16 and 18 are known to cause distinct carcinomas like cervical and head and neck, whereas LR-HPVs are commonly associated with the genital warts. We have developed an integrative platform; HPVomics dedicated to the potential therapeutic regimens targeting all HPV genes including oncoproteins E6, E7 and E5. We primarily focused on eighteen HR-HPVs and eleven LR-HPVs. It mainly deals with therapeutically imperative elements, i.e., vaccine epitopes, siRNAs, sgRNAs, and anti-viral peptides. Simultaneously, it also comprises of genome browser, whole-genome sequences and annotation of HPVs with searching and filtering capabilities. Moreover, we have also developed an integrated support vector machine (SVM) based computational algorithm "HPVepi" for the prediction of HPV epitome. We hope that HPVomics (http://bioinfo.imtech.res.in/manojk/hpvomics/) will assist the scientific community engaged in HPV research.
Collapse
Affiliation(s)
- Amit Kumar Gupta
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India.
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
| |
Collapse
|
10
|
Shmushkovich T, Monopoli KR, Homsy D, Leyfer D, Betancur-Boissel M, Khvorova A, Wolfson AD. Functional features defining the efficacy of cholesterol-conjugated, self-deliverable, chemically modified siRNAs. Nucleic Acids Res 2019; 46:10905-10916. [PMID: 30169779 PMCID: PMC6237813 DOI: 10.1093/nar/gky745] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 08/24/2018] [Indexed: 12/14/2022] Open
Abstract
Progress in oligonucleotide chemistry has produced a shift in the nature of siRNA used, from formulated, minimally modified siRNAs, to unformulated, heavily modified siRNA conjugates. The introduction of extensive chemical modifications is essential for conjugate-mediated delivery. Modifications have a significant impact on siRNA efficacy through interference with recognition and processing by RNAi enzymatic machinery, severely restricting the sequence space available for siRNA design. Many algorithms available publicly can successfully predict the activity of non-modified siRNAs, but the efficiency of the algorithms for designing heavily modified siRNAs has never been systematically evaluated experimentally. Here we screened 356 cholesterol-conjugated siRNAs with extensive modifications and developed a linear regression-based algorithm that effectively predicts siRNA activity using two independent datasets. We further demonstrate that predictive determinants for modified and non-modified siRNAs differ substantially. The algorithm developed from the non-modified siRNAs dataset has no predictive power for modified siRNAs and vice versa. In the context of heavily modified siRNAs, the introduction of chemical asymmetry fully eliminates the requirement for thermodynamic bias, the major determinant for non-modified siRNA efficacy. Finally, we demonstrate that in addition to the sequence of the target site, the accessibility of the neighboring 3′ region significantly contributes to siRNA efficacy.
Collapse
Affiliation(s)
| | | | - Diana Homsy
- Advirna, 60 Prescott Street, Worcester, MA 01605, USA
| | - Dmitriy Leyfer
- Advirna, 60 Prescott Street, Worcester, MA 01605, USA.,Boston University, 44 Cummington Mall, Boston, MA 02215, USA
| | | | - Anastasia Khvorova
- University of Massachusetts Medical School, 368 Plantation Street. Worcester, MA 01655, USA
| | | |
Collapse
|
11
|
Dar SA, Kumar M. saRNAdb: Resource of Small Activating RNAs for Up-regulating the Gene Expression. J Mol Biol 2018; 430:2212-2218. [PMID: 29625201 DOI: 10.1016/j.jmb.2018.03.023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Revised: 02/19/2018] [Accepted: 03/22/2018] [Indexed: 12/19/2022]
Abstract
RNA activation (RNAa) is the process of enhancing selective gene expression at transcriptional level using double-stranded RNAs, targeting gene promoter. These RNA molecules are usually 21 nucleotides long and termed as small activating RNAs (saRNAs). They are involved in gene regulation, epigenetics, gain-of-function studies and have potential therapeutic applications for various diseases especially cancer. RNAa is opposite to RNA interference in functionality; however, both processes share some protein machinery. There are many RNA interference centered online resources but no one for saRNAs; therefore, we developed "saRNAdb" database (http://bioinfo.imtech.res.in/manojk/sarna/). It contains 2150 manually curated saRNA entries with detailed information about their nucleotide sequences, activities, corresponding target gene, promoter and other experimental data. Besides, saRNA-promoter binding location, predicted saRNA features, tools (off-target, map) and RNAa-related proteins with their interacting partners are provided. saRNAdb is expected to assist in RNA research especially for nucleic acid-based therapeutics development.
Collapse
Affiliation(s)
- Showkat Ahmad Dar
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh 160036, India
| | - Manoj Kumar
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Sector 39A, Chandigarh 160036, India.
| |
Collapse
|
12
|
ASPsiRNA: A Resource of ASP-siRNAs Having Therapeutic Potential for Human Genetic Disorders and Algorithm for Prediction of Their Inhibitory Efficacy. G3-GENES GENOMES GENETICS 2017; 7:2931-2943. [PMID: 28696921 PMCID: PMC5592921 DOI: 10.1534/g3.117.044024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Allele-specific siRNAs (ASP-siRNAs) have emerged as promising therapeutic molecules owing to their selectivity to inhibit the mutant allele or associated single-nucleotide polymorphisms (SNPs) sparing the expression of the wild-type counterpart. Thus, a dedicated bioinformatics platform encompassing updated ASP-siRNAs and an algorithm for the prediction of their inhibitory efficacy will be helpful in tackling currently intractable genetic disorders. In the present study, we have developed the ASPsiRNA resource (http://crdd.osdd.net/servers/aspsirna/) covering three components viz (i) ASPsiDb, (ii) ASPsiPred, and (iii) analysis tools like ASP-siOffTar. ASPsiDb is a manually curated database harboring 4543 (including 422 chemically modified) ASP-siRNAs targeting 78 unique genes involved in 51 different diseases. It furnishes comprehensive information from experimental studies on ASP-siRNAs along with multidimensional genetic and clinical information for numerous mutations. ASPsiPred is a two-layered algorithm to predict efficacy of ASP-siRNAs for fully complementary mutant (Effmut) and wild-type allele (Effwild) with one mismatch by ASPsiPredSVM and ASPsiPredmatrix, respectively. In ASPsiPredSVM, 922 unique ASP-siRNAs with experimentally validated quantitative Effmut were used. During 10-fold cross-validation (10nCV) employing various sequence features on the training/testing dataset (T737), the best predictive model achieved a maximum Pearson’s correlation coefficient (PCC) of 0.71. Further, the accuracy of the classifier to predict Effmut against novel genes was assessed by leave one target out cross-validation approach (LOTOCV). ASPsiPredmatrix was constructed from rule-based studies describing the effect of single siRNA:mRNA mismatches on the efficacy at 19 different locations of siRNA. Thus, ASPsiRNA encompasses the first database, prediction algorithm, and off-target analysis tool that is expected to accelerate research in the field of RNAi-based therapeutics for human genetic diseases.
Collapse
|
13
|
Barman RK, Mukhopadhyay A, Das S. An improved method for identification of small non-coding RNAs in bacteria using support vector machine. Sci Rep 2017; 7:46070. [PMID: 28383059 PMCID: PMC5382675 DOI: 10.1038/srep46070] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Accepted: 03/08/2017] [Indexed: 12/25/2022] Open
Abstract
Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) was used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We validated the SVM model also on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model had robustly attained an accuracy of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively.
Collapse
Affiliation(s)
- Ranjan Kumar Barman
- Biomedical Informatics Centre, National Institute Of Cholera and Enteric Diseases, Kolkata, West Bengal, India
| | - Anirban Mukhopadhyay
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India
| | - Santasabuj Das
- Biomedical Informatics Centre, National Institute Of Cholera and Enteric Diseases, Kolkata, West Bengal, India.,Division of Clinical Medicine, National Institute of Cholera and Enteric Diseases, Kolkata, West Bengal, India
| |
Collapse
|