1
|
Sasse A, Ray D, Laverty KU, Tam CL, Albu M, Zheng H, Lyudovyk O, Dalal T, Nie K, Magis C, Notredame C, Weirauch MT, Hughes TR, Morris Q. Reconstructing the sequence specificities of RNA-binding proteins across eukaryotes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.15.618476. [PMID: 39464061 PMCID: PMC11507768 DOI: 10.1101/2024.10.15.618476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
RNA-binding proteins (RBPs) are key regulators of gene expression. Here, we introduce EuPRI (Eukaryotic Protein-RNA Interactions) - a freely available resource of RNA motifs for 34,736 RBPs from 690 eukaryotes. EuPRI includes in vitro binding data for 504 RBPs, including newly collected RNAcompete data for 174 RBPs, along with thousands of reconstructed motifs. We reconstruct these motifs with a new computational platform - Joint Protein-Ligand Embedding (JPLE) - which can detect distant homology relationships and map specificity-determining peptides. EuPRI quadruples the number of known RBP motifs, expanding the motif repertoire across all major eukaryotic clades, and assigning motifs to the majority of human RBPs. EuPRI drastically improves knowledge of RBP motifs in flowering plants. For example, it increases the number of Arabidopsis thaliana RBP motifs 7-fold, from 14 to 105. EuPRI also has broad utility for inferring post-transcriptional function and evolutionary relationships. We demonstrate this by predicting a role for 12 Arabidopsis thaliana RBPs in RNA stability and identifying rapid and recent evolution of post-transcriptional regulatory networks in worms and plants. In contrast, the vertebrate RNA motif set has remained relatively stable after its drastic expansion between the metazoan and vertebrate ancestors. EuPRI represents a powerful resource for the study of gene regulation across eukaryotes.
Collapse
Affiliation(s)
- Alexander Sasse
- Department of Molecular Genetics, University of Toronto, Toronto, ON Canada
- Donnelly Centre, University of Toronto, Toronto, ON Canada
- Department of Computer Science, University of Washington, Seattle, WA, USA
- Vector Institute, Toronto, ON Canada
| | - Debashish Ray
- Donnelly Centre, University of Toronto, Toronto, ON Canada
| | - Kaitlin U Laverty
- Department of Molecular Genetics, University of Toronto, Toronto, ON Canada
- Donnelly Centre, University of Toronto, Toronto, ON Canada
- Vector Institute, Toronto, ON Canada
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Cyrus L Tam
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Graduate Program in Computational Biology and Medicine, Weill-Cornell Graduate School, New York, NY, USA
| | - Mihai Albu
- Donnelly Centre, University of Toronto, Toronto, ON Canada
| | - Hong Zheng
- Donnelly Centre, University of Toronto, Toronto, ON Canada
| | - Olga Lyudovyk
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Graduate Program in Computational Biology and Medicine, Weill-Cornell Graduate School, New York, NY, USA
| | - Taykhoom Dalal
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Graduate Program in Computational Biology and Medicine, Weill-Cornell Graduate School, New York, NY, USA
| | - Kate Nie
- Department of Molecular Genetics, University of Toronto, Toronto, ON Canada
- Donnelly Centre, University of Toronto, Toronto, ON Canada
- Vector Institute, Toronto, ON Canada
| | - Cedrik Magis
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Cedric Notredame
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Matthew T Weirauch
- Center for Autoimmune Genomics and Etiology, Divisions of Allergy & Immunology, Human Genetics, Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Timothy R Hughes
- Department of Molecular Genetics, University of Toronto, Toronto, ON Canada
- Donnelly Centre, University of Toronto, Toronto, ON Canada
| | - Quaid Morris
- Department of Molecular Genetics, University of Toronto, Toronto, ON Canada
- Donnelly Centre, University of Toronto, Toronto, ON Canada
- Vector Institute, Toronto, ON Canada
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Graduate Program in Computational Biology and Medicine, Weill-Cornell Graduate School, New York, NY, USA
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| |
Collapse
|
2
|
Villamayor L, Rivero V, López-García D, Topham DJ, Martínez-Sobrido L, Nogales A, DeDiego ML. Interferon alpha inducible protein 6 is a negative regulator of innate immune responses by modulating RIG-I activation. Front Immunol 2023; 14:1105309. [PMID: 36793726 PMCID: PMC9923010 DOI: 10.3389/fimmu.2023.1105309] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 01/05/2023] [Indexed: 01/31/2023] Open
Abstract
Interferons (IFNs), IFN-stimulated genes (ISGs), and inflammatory cytokines mediate innate immune responses, and are essential to establish an antiviral response. Within the innate immune responses, retinoic acid-inducible gene I (RIG-I) is a key sensor of virus infections, mediating the transcriptional induction of IFNs and inflammatory proteins. Nevertheless, since excessive responses could be detrimental to the host, these responses need to be tightly regulated. In this work, we describe, for the first time, how knocking-down or knocking-out the expression of IFN alpha-inducible protein 6 (IFI6) increases IFN, ISG, and pro-inflammatory cytokine expression after the infections with Influenza A Virus (IAV), Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), and Sendai Virus (SeV), or poly(I:C) transfection. We also show how overexpression of IFI6 produces the opposite effect, in vitro and in vivo, indicating that IFI6 negatively modulates the induction of innate immune responses. Knocking-out or knocking-down the expression of IFI6 diminishes the production of infectious IAV and SARS-CoV-2, most likely because of its effect on antiviral responses. Importantly, we report a novel interaction of IFI6 with RIG-I, most likely mediated through binding to RNA, that affects RIG-I activation, providing a molecular mechanism for the effect of IFI6 on negatively regulating innate immunity. Remarkably, these new functions of IFI6 could be targeted to treat diseases associated with an exacerbated induction of innate immune responses and to combat viral infections, such as IAV and SARS-CoV-2.
Collapse
Affiliation(s)
- Laura Villamayor
- Department of Molecular and Cell Biology. Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | - Vanessa Rivero
- Department of Molecular and Cell Biology. Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | - Darío López-García
- Department of Molecular and Cell Biology. Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | - David J. Topham
- David H. Smith Center for Vaccine Biology and Immunology, Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, NY, United States
| | - Luis Martínez-Sobrido
- Disease Intervention and Prevention and Population Health Programs, Texas Biomedical Research Institute, San Antonio, TX, United States
| | - Aitor Nogales
- Center for Animal Health Research, CISA-INIA-CSIC, Valdeolmos, Madrid, Spain
| | - Marta L. DeDiego
- Department of Molecular and Cell Biology. Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain,*Correspondence: Marta L. DeDiego,
| |
Collapse
|
3
|
Villamayor L, López-García D, Rivero V, Martínez-Sobrido L, Nogales A, DeDiego ML. The IFN-stimulated gene IFI27 counteracts innate immune responses after viral infections by interfering with RIG-I signaling. Front Microbiol 2023; 14:1176177. [PMID: 37187533 PMCID: PMC10175689 DOI: 10.3389/fmicb.2023.1176177] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 04/13/2023] [Indexed: 05/17/2023] Open
Abstract
The recognition of viral nucleic acids by host pattern recognition receptors (PRRs) is critical for initiating innate immune responses against viral infections. These innate immune responses are mediated by the induction of interferons (IFNs), IFN-stimulated genes (ISGs) and pro-inflammatory cytokines. However, regulatory mechanisms are critical to avoid excessive or long-lasting innate immune responses that may cause detrimental hyperinflammation. Here, we identified a novel regulatory function of the ISG, IFN alpha inducible protein 27 (IFI27) in counteracting the innate immune responses triggered by cytoplasmic RNA recognition and binding. Our model systems included three unrelated viral infections caused by Influenza A virus (IAV), Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2), and Sendai virus (SeV), and transfection with an analog of double-stranded (ds) RNA. Furthermore, we found that IFI27 has a positive effect on IAV and SARS-CoV-2 replication, most likely due to its ability to counteract host-induced antiviral responses, including in vivo. We also show that IFI27 interacts with nucleic acids and PRR retinoic acid-inducible gene I (RIG-I), being the interaction of IFI27 with RIG-I most likely mediated through RNA binding. Interestingly, our results indicate that interaction of IFI27 with RIG-I impairs RIG-I activation, providing a molecular mechanism for the effect of IFI27 on modulating innate immune responses. Our study identifies a molecular mechanism that may explain the effect of IFI27 in counterbalancing innate immune responses to RNA viral infections and preventing excessive innate immune responses. Therefore, this study will have important implications in drug design to control viral infections and viral-induced pathology.
Collapse
Affiliation(s)
- Laura Villamayor
- Department of Molecular and Cell Biology, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | - Darío López-García
- Department of Molecular and Cell Biology, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | - Vanessa Rivero
- Department of Molecular and Cell Biology, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
| | | | - Aitor Nogales
- Center for Animal Health Research, CISA-INIA-CSIC, Madrid, Spain
| | - Marta L. DeDiego
- Department of Molecular and Cell Biology, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
- *Correspondence: Marta L. DeDiego,
| |
Collapse
|
4
|
AI and Immunoinformatics. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
5
|
Yang S, Liu X, Ng RT. ProbeRating: a recommender system to infer binding profiles for nucleic acid-binding proteins. Bioinformatics 2021; 36:4797-4804. [PMID: 32573679 PMCID: PMC7750938 DOI: 10.1093/bioinformatics/btaa580] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 05/18/2020] [Accepted: 06/18/2020] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION The interaction between proteins and nucleic acids plays a crucial role in gene regulation and cell function. Determining the binding preferences of nucleic acid-binding proteins (NBPs), namely RNA-binding proteins (RBPs) and transcription factors (TFs), is the key to decipher the protein-nucleic acids interaction code. Today, available NBP binding data from in vivo or in vitro experiments are still limited, which leaves a large portion of NBPs uncovered. Unfortunately, existing computational methods that model the NBP binding preferences are mostly protein specific: they need the experimental data for a specific protein in interest, and thus only focus on experimentally characterized NBPs. The binding preferences of experimentally unexplored NBPs remain largely unknown. RESULTS Here, we introduce ProbeRating, a nucleic acid recommender system that utilizes techniques from deep learning and word embeddings of natural language processing. ProbeRating is developed to predict binding profiles for unexplored or poorly studied NBPs by exploiting their homologs NBPs which currently have available binding data. Requiring only sequence information as input, ProbeRating adapts FastText from Facebook AI Research to extract biological features. It then builds a neural network-based recommender system. We evaluate the performance of ProbeRating on two different tasks: one for RBP and one for TF. As a result, ProbeRating outperforms previous methods on both tasks. The results show that ProbeRating can be a useful tool to study the binding mechanism for the many NBPs that lack direct experimental evidence. and implementation. AVAILABILITY AND IMPLEMENTATION The source code is freely available at <https://github.com/syang11/ProbeRating>. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shu Yang
- Department of Computer Science, University of British Columbia, Vancouver, BC V6T1Z4, Canada
| | - Xiaoxi Liu
- RIKEN Center for Integrative Medical Sciences (IMS), Yokohama 230-0045, Japan
| | - Raymond T Ng
- Department of Computer Science, University of British Columbia, Vancouver, BC V6T1Z4, Canada
| |
Collapse
|
6
|
Liu Y, Gong W, Yang Z, Li C. SNB-PSSM: A spatial neighbor-based PSSM used for protein-RNA binding site prediction. J Mol Recognit 2021; 34:e2887. [PMID: 33442949 DOI: 10.1002/jmr.2887] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Revised: 12/22/2020] [Accepted: 12/23/2020] [Indexed: 01/09/2023]
Abstract
Protein-RNA interactions play essential roles in a wide variety of biological processes. Recognition of RNA-binding residues on proteins has been a challenging problem. Most of methods utilize the position-specific scoring matrix (PSSM). It has been found that considering the evolutionary information of sequence neighboring residues can improve the prediction. In this work, we introduce a novel method SNB-PSSM (spatial neighbor-based PSSM) combined with the structure window scheme where the evolutionary information of spatially neighboring residues is considered. The results show our method consistently outperforms the standard and smoothed PSSM methods. Tested on multiple datasets, this approach shows an encouraging performance compared with RNABindRPlus, BindN+, PPRInt, xypan, Predict_RBP, SpaPF, PRNA, and KYG, although is inferior to RNAProSite, RBscore, and aaRNA. In addition, since our method is not sensitive to protein structure changes, it can be applied well on binding site predictions of modeled structures. Thus, the result also suggests the evolution of binding sites is spatially cooperative. The proposed method as an effective tool of considering evolutionary information can be widely used for the nucleic acid-/protein-binding site prediction and functional motif finding.
Collapse
Affiliation(s)
- Yang Liu
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing, China
| | - Weikang Gong
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing, China
| | - Zhen Yang
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing, China
| | - Chunhua Li
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing, China
| |
Collapse
|
7
|
Gao J, Miao Z, Zhang Z, Wei H, Kurgan L. Prediction of Ion Channels and their Types from Protein Sequences: Comprehensive Review and Comparative Assessment. Curr Drug Targets 2020; 20:579-592. [PMID: 30360734 DOI: 10.2174/1389450119666181022153942] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2018] [Revised: 10/03/2018] [Accepted: 10/04/2018] [Indexed: 12/20/2022]
Abstract
BACKGROUND Ion channels are a large and growing protein family. Many of them are associated with diseases, and consequently, they are targets for over 700 drugs. Discovery of new ion channels is facilitated with computational methods that predict ion channels and their types from protein sequences. However, these methods were never comprehensively compared and evaluated. OBJECTIVE We offer first-of-its-kind comprehensive survey of the sequence-based predictors of ion channels. We describe eight predictors that include five methods that predict ion channels, their types, and four classes of the voltage-gated channels. We also develop and use a new benchmark dataset to perform comparative empirical analysis of the three currently available predictors. RESULTS While several methods that rely on different designs were published, only a few of them are currently available and offer a broad scope of predictions. Support and availability after publication should be required when new methods are considered for publication. Empirical analysis shows strong performance for the prediction of ion channels and modest performance for the prediction of ion channel types and voltage-gated channel classes. We identify a substantial weakness of current methods that cannot accurately predict ion channels that are categorized into multiple classes/types. CONCLUSION Several predictors of ion channels are available to the end users. They offer practical levels of predictive quality. Methods that rely on a larger and more diverse set of predictive inputs (such as PSIONplus) are more accurate. New tools that address multi-label prediction of ion channels should be developed.
Collapse
Affiliation(s)
- Jianzhao Gao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China
| | - Zhen Miao
- College of Life Sciences, Nankai University, Tianjin, China
| | - Zhaopeng Zhang
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China
| | - Hong Wei
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, United States
| |
Collapse
|
8
|
PSIONplus m Server for Accurate Multi-Label Prediction of Ion Channels and Their Types. Biomolecules 2020; 10:biom10060876. [PMID: 32517331 PMCID: PMC7355608 DOI: 10.3390/biom10060876] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 05/28/2020] [Accepted: 06/04/2020] [Indexed: 11/26/2022] Open
Abstract
Computational prediction of ion channels facilitates the identification of putative ion channels from protein sequences. Several predictors of ion channels and their types were developed in the last quindecennial. While they offer reasonably accurate predictions, they also suffer a few shortcomings including lack of availability, parallel prediction mode, single-label prediction (inability to predict multiple channel subtypes), and incomplete scope (inability to predict subtypes of the voltage-gated channels). We developed a first-of-its-kind PSIONplusm method that performs sequential multi-label prediction of ion channels and their subtypes for both voltage-gated and ligand-gated channels. PSIONplusm sequentially combines the outputs produced by three support vector machine-based models from the PSIONplus predictor and is available as a webserver. Empirical tests show that PSIONplusm outperforms current methods for the multi-label prediction of the ion channel subtypes. This includes the existing single-label methods that are available to the users, a naïve multi-label predictor that combines results produced by multiple single-label methods, and methods that make predictions based on sequence alignment and domain annotations. We also found that the current methods (including PSIONplusm) fail to accurately predict a few of the least frequently occurring ion channel subtypes. Thus, new predictors should be developed when a larger quantity of annotated ion channels will be available to train predictive models.
Collapse
|