1
|
Zhou M, Wang B, Li H, Han J, Li A, Lu W. RNA-binding protein SAMD4A inhibits breast tumor angiogenesis by modulating the balance of angiogenesis program. Cancer Sci 2021; 112:3835-3845. [PMID: 34219323 PMCID: PMC8409301 DOI: 10.1111/cas.15053] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 06/22/2021] [Accepted: 06/29/2021] [Indexed: 12/13/2022] Open
Abstract
Tumor-induced angiogenesis is important for further progression of solid tumors. The initiation of tumor angiogenesis is dictated by a shift in the balance between proangiogenic and antiangiogenic gene expression programs. However, the potential mechanism controlling the expression of angiogenesis-related genes in the tumor cells, especially the process mediated by RNA-binding protein (RBP) remains unclear. SAMD4A is a conserved RBP across fly to mammals, and is believed to play an important role in controlling gene translation and stability. In this study, we identified the potential role of SAMD4A in modulating angiogenesis-related gene expression and tumor progression in breast cancer. SAMD4A expression was repressed in breast cancer tissues and cells and low SAMD4A expression in human breast tumor samples was strongly associated with poor survival of patients. Overexpression of SAMD4A inhibited breast tumor angiogenesis and caner progression, whereas knockdown of SAMD4A demonstrated a reversed effect. Mechanistically, SAMD4A was found to specifically destabilize the proangiogenic gene transcripts, including C-X-C motif chemokine ligand 5 (CXCL5), endoglin (ENG), interleukin 1β (IL1β), and angiopoietin 1 (ANGPT1), by directly interacting with the stem-loop structure in the 3' untranslated region (3'UTR) of these mRNAs through its sterile alpha motif (SAM) domain, resulting in the imbalance of angiogenic genes expression. Collectively, our results suggest that SAMD4A is a novel breast tumor suppressor that inhibits tumor angiogenesis by specifically downregulating the expression of proangiogenic genes, which might be a potential antiangiogenic target for breast cancer therapy.
Collapse
Affiliation(s)
- Meicen Zhou
- Department of Endocrinology, Beijing Jishuitan Hospital, The 4th Clinical Medical College of Peking University, Beijing, China
| | - Bing Wang
- Institute of Microcirculation, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Hongwei Li
- Institute of Microcirculation, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Jianqun Han
- Institute of Microcirculation, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Ailing Li
- Institute of Microcirculation, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Wenbao Lu
- Institute of Microcirculation, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| |
Collapse
|
2
|
RNA-centric approaches to study RNA-protein interactions in vitro and in silico. Methods 2020; 178:11-18. [DOI: 10.1016/j.ymeth.2019.09.011] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 09/10/2019] [Accepted: 09/10/2019] [Indexed: 01/17/2023] Open
|
3
|
Munteanu A, Mukherjee N, Ohler U. SSMART: sequence-structure motif identification for RNA-binding proteins. Bioinformatics 2019; 34:3990-3998. [PMID: 29893814 DOI: 10.1093/bioinformatics/bty404] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Accepted: 06/07/2018] [Indexed: 01/12/2023] Open
Abstract
Motivation RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. Results We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability and implementation SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alina Munteanu
- Berlin Institute for Medical Systems Biology, Max Delbruck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.,Department of Computer Science, Humboldt University, Berlin, Germany
| | - Neelanjan Mukherjee
- Berlin Institute for Medical Systems Biology, Max Delbruck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Uwe Ohler
- Berlin Institute for Medical Systems Biology, Max Delbruck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.,Department of Computer Science, Humboldt University, Berlin, Germany
| |
Collapse
|
4
|
Hamm DC, Harrison MM. Regulatory principles governing the maternal-to-zygotic transition: insights from Drosophila melanogaster. Open Biol 2018; 8:180183. [PMID: 30977698 PMCID: PMC6303782 DOI: 10.1098/rsob.180183] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 11/09/2018] [Indexed: 12/19/2022] Open
Abstract
The onset of metazoan development requires that two terminally differentiated germ cells, a sperm and an oocyte, become reprogrammed to the totipotent embryo, which can subsequently give rise to all the cell types of the adult organism. In nearly all animals, maternal gene products regulate the initial events of embryogenesis while the zygotic genome remains transcriptionally silent. Developmental control is then passed from mother to zygote through a process known as the maternal-to-zygotic transition (MZT). The MZT comprises an intimately connected set of molecular events that mediate degradation of maternally deposited mRNAs and transcriptional activation of the zygotic genome. This essential developmental transition is conserved among metazoans but is perhaps best understood in the fruit fly, Drosophila melanogaster. In this article, we will review our understanding of the events that drive the MZT in Drosophila embryos and highlight parallel mechanisms driving this transition in other animals.
Collapse
Affiliation(s)
| | - Melissa M. Harrison
- Department of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health, Madison, WI 53706, USA
| |
Collapse
|
5
|
Abstract
Until the zygotic genome is activated, early development relies on the products deposited by the mother. Once the zygotic genome starts to be transcribed, most maternal products are not needed anymore by the developing embryo. This emancipation from the maternal genome occurs during the Zygotic Genome Activation (ZGA). Although the process by which the maternal content is replaced with zygotic products differs from species to species, there is a common theme to all of them: maternal transcripts are actively degraded. Here, a review of how the degradation of maternal RNAs is regulated during early development and discussions on some computational tools that may be of use in this research area are outlined.
Collapse
Affiliation(s)
- Antonio Marco
- School of Biological Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK.
| |
Collapse
|
6
|
Affinity regression predicts the recognition code of nucleic acid-binding proteins. Nat Biotechnol 2015; 33:1242-1249. [PMID: 26571099 PMCID: PMC4871164 DOI: 10.1038/nbt.3343] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 07/30/2015] [Indexed: 12/26/2022]
Abstract
Predicting the affinity profiles of nucleic acid-binding proteins directly from the protein sequence is a major unsolved problem. We present a statistical approach for learning the recognition code of a family of transcription factors (TFs) or RNA-binding proteins (RBPs) from high-throughput binding assays. Our method, called affinity regression, trains on protein binding microarray (PBM) or RNA compete experiments to learn an interaction model between proteins and nucleic acids, using only protein domain and probe sequences as inputs. By training on mouse homeodomain PBM profiles, our model correctly identifies residues that confer DNA-binding specificity and accurately predicts binding motifs for an independent set of divergent homeodomains. Similarly, learning from RNA compete profiles for diverse RBPs, our model can predict the binding affinities of held-out proteins and identify key RNA-binding residues. More broadly, we envision applying our method to model and predict biological interactions in any setting where there is a high-throughput ‘affinity’ readout.
Collapse
|
7
|
Gazestani VH, Salavati R. Deciphering RNA Regulatory Elements Involved in the Developmental and Environmental Gene Regulation of Trypanosoma brucei. PLoS One 2015; 10:e0142342. [PMID: 26529602 PMCID: PMC4631447 DOI: 10.1371/journal.pone.0142342] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 10/20/2015] [Indexed: 11/17/2022] Open
Abstract
Trypanosoma brucei is a vector-borne parasite with intricate life cycle that can cause serious diseases in humans and animals. This pathogen relies on fine regulation of gene expression to respond and adapt to variable environments, with implications in transmission and infectivity. However, the involved regulatory elements and their mechanisms of actions are largely unknown. Here, benefiting from a new graph-based approach for finding functional regulatory elements in RNA (GRAFFER), we have predicted 88 new RNA regulatory elements that are potentially involved in the gene regulatory network of T. brucei. We show that many of these newly predicted elements are responsive to both transcriptomic and proteomic changes during the life cycle of the parasite. Moreover, we found that 11 of predicted elements strikingly resemble previously identified regulatory elements for the parasite. Additionally, comparison with previously predicted motifs on T. brucei suggested the superior performance of our approach based on the current limited knowledge of regulatory elements in T. brucei.
Collapse
Affiliation(s)
- Vahid H Gazestani
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec, Canada
| | - Reza Salavati
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec, Canada; McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec, Canada; Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec, Canada
| |
Collapse
|
8
|
Kim HH, Lee SJ, Gardiner AS, Perrone-Bizzozero NI, Yoo S. Different motif requirements for the localization zipcode element of β-actin mRNA binding by HuD and ZBP1. Nucleic Acids Res 2015; 43:7432-46. [PMID: 26152301 PMCID: PMC4551932 DOI: 10.1093/nar/gkv699] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Accepted: 06/29/2015] [Indexed: 11/13/2022] Open
Abstract
Interactions of RNA-binding proteins (RBPs) with their target transcripts are essential for regulating gene expression at the posttranscriptional level including mRNA export/localization, stability, and translation. ZBP1 and HuD are RBPs that play pivotal roles in mRNA transport and local translational control in neuronal processes. While HuD possesses three RNA recognition motifs (RRMs), ZBP1 contains two RRMs and four K homology (KH) domains that either increase target specificity or provide a multi-target binding capability. Here we used isolated cis-element sequences of the target mRNA to examine directly protein-RNA interactions in cell-free systems. We found that both ZBP1 and HuD bind the zipcode element in rat β-actin mRNA's 3' UTR. Differences between HuD and ZBP1 were observed in their binding preference to the element. HuD showed a binding preference for U-rich sequence. In contrast, ZBP1 binding to the zipcode RNA depended more on the structural level, as it required the proper spatial organization of a stem-loop that is mainly determined by the U-rich element juxtaposed to the 3' end of a 5'-ACACCC-3' motif. On the basis of this work, we propose that ZBP1 and HuD bind to overlapping sites in the β-actin zipcode, but they recognize different features of this target sequence.
Collapse
Affiliation(s)
- Hak Hee Kim
- Nemours Biomedical Research, Alfred I. duPont Hosp. for Children, Wilmington, DE 19803, USA
| | - Seung Joon Lee
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Amy S Gardiner
- Department of Neuroscience, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA
| | - Nora I Perrone-Bizzozero
- Department of Neuroscience, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA
| | - Soonmoon Yoo
- Nemours Biomedical Research, Alfred I. duPont Hosp. for Children, Wilmington, DE 19803, USA
| |
Collapse
|
9
|
Abstract
Long non-coding RNAs (lncRNAs) are associated to a plethora of cellular functions, most of which require the interaction with one or more RNA-binding proteins (RBPs); similarly, RBPs are often able to bind a large number of different RNAs. The currently available knowledge is already drawing an intricate network of interactions, whose deregulation is frequently associated to pathological states. Several different techniques were developed in the past years to obtain protein–RNA binding data in a high-throughput fashion. In parallel, in silico inference methods were developed for the accurate computational prediction of the interaction of RBP–lncRNA pairs. The field is growing rapidly, and it is foreseeable that in the near future, the protein–lncRNA interaction network will rise, offering essential clues for a better understanding of lncRNA cellular mechanisms and their disease-associated perturbations.
Collapse
|
10
|
Abstract
RNA-binding proteins (RBPs) are important regulators of eukaryotic gene expression. Genomes typically encode dozens to hundreds of proteins containing RNA-binding domains, which collectively recognize diverse RNA sequences and structures. Recent advances in high-throughput methods for assaying the targets of RBPs in vitro and in vivo allow large-scale derivation of RNA-binding motifs as well as determination of RNA–protein interactions in living cells. In parallel, many computational methods have been developed to analyze and interpret these data. The interplay between RNA secondary structure and RBP binding has also been a growing theme. Integrating RNA–protein interaction data with observations of post-transcriptional regulation will enhance our understanding of the roles of these important proteins.
Collapse
|
11
|
Goodarzi H, Tavazoie SF, Tavazoie S. TARBP2 binding structured RNA elements drives metastasis. Cell Cycle 2014; 13:2799-800. [PMID: 25486461 DOI: 10.4161/15384101.2014.954453] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Hani Goodarzi
- a Laboratory of Systems Cancer Biology ; Rockefeller University ; New York , NY USA
| | | | | |
Collapse
|
12
|
Goodarzi H, Zhang S, Buss CG, Fish L, Tavazoie S, Tavazoie SF. Metastasis-suppressor transcript destabilization through TARBP2 binding of mRNA hairpins. Nature 2014; 513:256-60. [PMID: 25043050 DOI: 10.1038/nature13466] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2013] [Accepted: 05/12/2014] [Indexed: 01/30/2023]
Abstract
Aberrant regulation of RNA stability has an important role in many disease states. Deregulated post-transcriptional modulation, such as that governed by microRNAs targeting linear sequence elements in messenger RNAs, has been implicated in the progression of many cancer types. A defining feature of RNA is its ability to fold into structures. However, the roles of structural mRNA elements in cancer progression remain unexplored. Here we performed an unbiased search for post-transcriptional modulators of mRNA stability in breast cancer by conducting whole-genome transcript stability measurements in poorly and highly metastatic isogenic human breast cancer lines. Using a computational framework that searches RNA sequence and structure space, we discovered a family of GC-rich structural cis-regulatory RNA elements, termed sRSEs for structural RNA stability elements, which are significantly overrepresented in transcripts displaying reduced stability in highly metastatic cells. By integrating computational and biochemical approaches, we identified TARBP2, a double-stranded RNA-binding protein implicated in microRNA processing, as the trans factor that binds the sRSE family and similar structural elements--collectively termed TARBP2-binding structural elements (TBSEs)--in transcripts. TARBP2 is overexpressed in metastatic cells and metastatic human breast tumours and destabilizes transcripts containing TBSEs. Endogenous TARBP2 promotes metastatic cell invasion and colonization by destabilizing amyloid precursor protein (APP) and ZNF395 transcripts, two genes previously associated with Alzheimer's and Huntington's disease, respectively. We reveal these genes to be novel metastasis suppressor genes in breast cancer. The cleavage product of APP, extracellular amyloid-α peptide, directly suppresses invasion while ZNF395 transcriptionally represses a pro-metastatic gene expression program. The expression levels of TARBP2, APP and ZNF395 in human breast carcinomas support their experimentally uncovered roles in metastasis. Our findings establish a non-canonical and direct role for TARBP2 in mammalian gene expression regulation and reveal that regulated RNA destabilization through protein-mediated binding of mRNA structural elements can govern cancer progression.
Collapse
Affiliation(s)
- Hani Goodarzi
- Laboratory of Systems Cancer Biology, Rockefeller University, 1230 York Avenue, New York, New York 10065, USA
| | - Steven Zhang
- Laboratory of Systems Cancer Biology, Rockefeller University, 1230 York Avenue, New York, New York 10065, USA
| | - Colin G Buss
- Laboratory of Systems Cancer Biology, Rockefeller University, 1230 York Avenue, New York, New York 10065, USA
| | - Lisa Fish
- Laboratory of Systems Cancer Biology, Rockefeller University, 1230 York Avenue, New York, New York 10065, USA
| | - Saeed Tavazoie
- Department of Biochemistry and Molecular Biophysics, and Department of Systems Biology, Columbia University, New York, New York 10032, USA
| | - Sohail F Tavazoie
- Laboratory of Systems Cancer Biology, Rockefeller University, 1230 York Avenue, New York, New York 10065, USA
| |
Collapse
|
13
|
Systematic identification of regulatory elements in conserved 3' UTRs of human transcripts. Cell Rep 2014; 7:281-92. [PMID: 24656821 DOI: 10.1016/j.celrep.2014.03.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2013] [Revised: 02/03/2014] [Accepted: 03/03/2014] [Indexed: 11/21/2022] Open
Abstract
Posttranscriptional regulatory programs governing diverse aspects of RNA biology remain largely uncharacterized. Understanding the functional roles of RNA cis-regulatory elements is essential for decoding complex programs that underlie the dynamic regulation of transcript stability, splicing, localization, and translation. Here, we describe a combined experimental/computational technology to reveal a catalog of functional regulatory elements embedded in 3' UTRs of human transcripts. We used a bidirectional reporter system coupled with flow cytometry and high-throughput sequencing to measure the effect of short, noncoding, vertebrate-conserved RNA sequences on transcript stability and translation. Information-theoretic motif analysis of the resulting sequence-to-gene-expression mapping revealed linear and structural RNA cis-regulatory elements that positively and negatively modulate the posttranscriptional fates of human transcripts. This combined experimental/computational strategy can be used to systematically characterize the vast landscape of posttranscriptional regulatory elements controlling physiological and pathological cellular state transitions.
Collapse
|
14
|
Gazestani VH, Lu Z, Salavati R. Deciphering RNA regulatory elements in trypanosomatids: one piece at a time or genome-wide? Trends Parasitol 2014; 30:234-40. [PMID: 24642036 DOI: 10.1016/j.pt.2014.02.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2013] [Revised: 02/17/2014] [Accepted: 02/26/2014] [Indexed: 02/02/2023]
Abstract
Morphological and metabolic changes in the life cycle of Trypanosoma brucei are accomplished by precise regulation of hundreds of genes. In the absence of transcriptional control, RNA-binding proteins (RBPs) shape the structure of gene regulatory maps in this organism, but our knowledge about their target RNAs, binding sites, and mechanisms of action is far from complete. Although recent technological advances have revolutionized the RBP-based approaches, the main framework for the RNA regulatory element (RRE)-based approaches has not changed over the last two decades in T. brucei. In this Opinion, after highlighting the current challenges in RRE inference, we explain some genome-wide solutions that can significantly boost our current understanding about gene regulatory networks in T. brucei.
Collapse
Affiliation(s)
- Vahid H Gazestani
- Institute of Parasitology, McGill University, 21,111 Lakeshore Road, Sainte Anne de Bellevue, Montreal, Quebec H9X3V9, Canada
| | - Zhiquan Lu
- Institute of Parasitology, McGill University, 21,111 Lakeshore Road, Sainte Anne de Bellevue, Montreal, Quebec H9X3V9, Canada
| | - Reza Salavati
- Institute of Parasitology, McGill University, 21,111 Lakeshore Road, Sainte Anne de Bellevue, Montreal, Quebec H9X3V9, Canada; McGill Centre for Bioinformatics, McGill University, Duff Medical Building, 3775 University Street, Montreal, Quebec H3A2B4, Canada; Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G1Y6, Canada.
| |
Collapse
|
15
|
Zhang X, Lin H, Zhao H, Hao Y, Mort M, Cooper DN, Zhou Y, Liu Y. Impact of human pathogenic micro-insertions and micro-deletions on post-transcriptional regulation. Hum Mol Genet 2014; 23:3024-34. [PMID: 24436305 DOI: 10.1093/hmg/ddu019] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Small insertions/deletions (INDELs) of ≤21 bp comprise 18% of all recorded mutations causing human inherited disease and are evident in 24% of documented Mendelian diseases. INDELs affect gene function in multiple ways: for example, by introducing premature stop codons that either lead to the production of truncated proteins or affect transcriptional efficiency. However, the means by which they impact post-transcriptional regulation, including alternative splicing, have not been fully evaluated. In this study, we collate disease-causing INDELs from the Human Gene Mutation Database (HGMD) and neutral INDELs from the 1000 Genomes Project. The potential of these two types of INDELs to affect binding-site affinity of RNA-binding proteins (RBPs) was then evaluated. We identified several sequence features that can distinguish disease-causing INDELs from neutral INDELs. Moreover, we built a machine-learning predictor called PinPor (predicting pathogenic small insertions and deletions affecting post-transcriptional regulation, http://watson.compbio.iupui.edu/pinpor/) to ascertain which newly observed INDELs are likely to be pathogenic. Our results show that disease-causing INDELs are more likely to ablate RBP-binding sites and tend to affect more RBP-binding sites than neutral INDELs. Additionally, disease-causing INDELs give rise to greater deviations in binding affinity than neutral INDELs. We also demonstrated that disease-causing INDELs may be distinguished from neutral INDELs by several sequence features, such as their proximity to splice sites and their potential effects on RNA secondary structure. This predictor showed satisfactory performance in identifying numerous pathogenic INDELs, with a Matthews correlation coefficient (MCC) value of 0.51 and an accuracy of 0.75.
Collapse
Affiliation(s)
- Xinjun Zhang
- School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Chen L, Dumelie JG, Li X, Cheng MH, Yang Z, Laver JD, Siddiqui NU, Westwood JT, Morris Q, Lipshitz HD, Smibert CA. Global regulation of mRNA translation and stability in the early Drosophila embryo by the Smaug RNA-binding protein. Genome Biol 2014; 15:R4. [PMID: 24393533 PMCID: PMC4053848 DOI: 10.1186/gb-2014-15-1-r4] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Accepted: 01/07/2014] [Indexed: 12/12/2022] Open
Abstract
Background Smaug is an RNA-binding protein that induces the degradation and represses the translation of mRNAs in the early Drosophila embryo. Smaug has two identified direct target mRNAs that it differentially regulates: nanos and Hsp83. Smaug represses the translation of nanos mRNA but has only a modest effect on its stability, whereas it destabilizes Hsp83 mRNA but has no detectable effect on Hsp83 translation. Smaug is required to destabilize more than one thousand mRNAs in the early embryo, but whether these transcripts represent direct targets of Smaug is unclear and the extent of Smaug-mediated translational repression is unknown. Results To gain a panoramic view of Smaug function in the early embryo, we identified mRNAs that are bound to Smaug using RNA co-immunoprecipitation followed by hybridization to DNA microarrays. We also identified mRNAs that are translationally repressed by Smaug using polysome gradients and microarrays. Comparison of the bound mRNAs to those that are translationally repressed by Smaug and those that require Smaug for their degradation suggests that a large fraction of Smaug’s target mRNAs are both translationally repressed and degraded by Smaug. Smaug directly regulates components of the TRiC/CCT chaperonin, the proteasome regulatory particle and lipid droplets, as well as many metabolic enzymes, including several glycolytic enzymes. Conclusions Smaug plays a direct and global role in regulating the translation and stability of a large fraction of the mRNAs in the early Drosophila embryo, and has unanticipated functions in control of protein folding and degradation, lipid droplet function and metabolism.
Collapse
|
17
|
Re A, Joshi T, Kulberkyte E, Morris Q, Workman CT. RNA-protein interactions: an overview. Methods Mol Biol 2014; 1097:491-521. [PMID: 24639174 DOI: 10.1007/978-1-62703-709-9_23] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
RNA binding proteins (RBPs) are key players in the regulation of gene expression. In this chapter we discuss the main protein-RNA recognition modes used by RBPs in order to regulate multiple steps of RNA processing. We discuss traditional and state-of-the-art technologies that can be used to study RNAs bound by individual RBPs, or vice versa, for both in vitro and in vivo methodologies. To help highlight the biological significance of RBP mediated regulation, online resources on experimentally verified protein-RNA interactions are briefly presented. Finally, we present the major tools to computationally infer RNA binding sites according to the modeling features and to the unsupervised or supervised frameworks that are adopted. Since some RNA binding site search algorithms are derived from DNA binding site search algorithms, we discuss the commonalities and novelties introduced to handle both sequence and structural features uniquely characterizing protein-RNA interactions.
Collapse
Affiliation(s)
- Angela Re
- University of Trento, Mattarello, Italy
| | | | | | | | | |
Collapse
|
18
|
Li X, Kazan H, Lipshitz HD, Morris QD. Finding the target sites of RNA-binding proteins. WILEY INTERDISCIPLINARY REVIEWS-RNA 2013; 5:111-30. [PMID: 24217996 PMCID: PMC4253089 DOI: 10.1002/wrna.1201] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Revised: 09/27/2013] [Accepted: 10/01/2013] [Indexed: 12/15/2022]
Abstract
RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201
Collapse
Affiliation(s)
- Xiao Li
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | | | | | | |
Collapse
|
19
|
Najafabadi HS, Lu Z, MacPherson C, Mehta V, Adoue V, Pastinen T, Salavati R. Global identification of conserved post-transcriptional regulatory programs in trypanosomatids. Nucleic Acids Res 2013; 41:8591-600. [PMID: 23877242 PMCID: PMC3794602 DOI: 10.1093/nar/gkt647] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Revised: 06/25/2013] [Accepted: 06/29/2013] [Indexed: 12/30/2022] Open
Abstract
While regulatory programs are extensively studied at the level of transcription, elements that are involved in regulation of post-transcriptional processes are largely unknown, and methods for systematic identification of these elements are in early stages. Here, using a novel computational framework, we have integrated sequence information with several functional genomics data sets to characterize conserved regulatory programs of trypanosomatids, a group of eukaryotes that almost entirely rely on post-transcriptional processes for regulation of mRNA abundance. This analysis revealed a complex network of linear and structural RNA elements that potentially govern mRNA abundance across different life stages and environmental conditions. Furthermore, we show that the conserved regulatory network that we have identified is responsive to chemical perturbation of several biological functions in trypanosomatids. We have further characterized one of the most abundant regulatory RNA elements that we discovered, an AU-rich element (ARE) that can be found in 3' untranslated region of many trypanosomatid genes. Using bioinformatics approaches as well as in vitro and in vivo experiments, we have identified three ELAV-like homologs, including the developmentally critical protein TbRBP6, which regulate abundance of a large number of trypanosomatid ARE-containing transcripts. Together, these studies lay out a roadmap for characterization of mechanisms that modulate development and metabolic pathways in trypanosomatids.
Collapse
Affiliation(s)
- Hamed S. Najafabadi
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Zhiquan Lu
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Chad MacPherson
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Vaibhav Mehta
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Véronique Adoue
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Tomi Pastinen
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| | - Reza Salavati
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Ste. Anne de Bellevue, Montreal, Quebec H9X 3V9, Canada, McGill Centre for Bioinformatics, McGill University, 3649 Promenade Sir William Osler, Montreal, Quebec H3G 0B1, Canada, Department of Human Genetics, McGill University Health Centre, Montréal, Québec, Canada, McGill University and Genome Québec Innovation Centre, Montréal, Québec H3A 1A4, Canada, Department of Medical Genetics, McGill University Health Centre, Montréal, Québec, Canada and Department of Biochemistry, McGill University, McIntyre Medical Building, 3655 Promenade Sir William Osler, Montreal, Quebec H3G 1Y6, Canada
| |
Collapse
|
20
|
Kazan H, Morris Q. RBPmotif: a web server for the discovery of sequence and structure preferences of RNA-binding proteins. Nucleic Acids Res 2013; 41:W180-6. [PMID: 23754853 PMCID: PMC3692078 DOI: 10.1093/nar/gkt463] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
RBPmotif web server (http://www.rnamotif.org) implements tools to identify binding preferences of RNA-binding proteins (RBPs). Given a set of sequences that are known to be bound and unbound by the RBP of interest, RBPmotif provides two types of analysis: (i) de novo motif finding when there is no a priori knowledge on RBP’s binding preferences and (ii) analysis of structure preferences when there is a previously identified sequence motif for the RBP. De novo motif finding is performed with the previously published RNAcontext algorithm that learns discriminative motif models to identify both sequence and structure preferences. The results of this analysis include the inferred binding preferences of the RBP and the added predictive value of incorporating structure preferences. Second type of analysis investigates whether the instances of the previously identified sequence motif are enriched in a particular structure context in bound sequences, relative to its instances in unbound sequences. On completion, the results page shows the comparison of structure contexts of the motif instances between bound and unbound sequences and an assessment of statistical significance of detected preferences. In summary, RBPmotif web server enables the concurrent analysis of sequence and structure preferences of RBPs through a user-friendly interface.
Collapse
Affiliation(s)
- Hilal Kazan
- Department of Computer Engineering, Antalya International University, Antalya, 07190, Turkey.
| | | |
Collapse
|
21
|
Abstract
The specificity of protein-DNA interactions is most commonly modeled using position weight matrices (PWMs). First introduced in 1982, they have been adapted to many new types of data and many different approaches have been developed to determine the parameters of the PWM. New high-throughput technologies provide a large amount of data rapidly and offer an unprecedented opportunity to determine accurately the specificities of many transcription factors (TFs). But taking full advantage of the new data requires advanced algorithms that take into account the biophysical processes involved in generating the data. The new large datasets can also aid in determining when the PWM model is inadequate and must be extended to provide accurate predictions of binding sites. This article provides a general mathematical description of a PWM and how it is used to score potential binding sites, a brief history of the approaches that have been developed and the types of data that are used with an emphasis on algorithms that we have developed for analyzing high-throughput datasets from several new technologies. It also describes extensions that can be added when the simple PWM model is inadequate and further enhancements that may be necessary. It briefly describes some applications of PWMs in the discovery and modeling of in vivo regulatory networks.
Collapse
|
22
|
Larsson O, Tian B, Sonenberg N. Toward a genome-wide landscape of translational control. Cold Spring Harb Perspect Biol 2013; 5:a012302. [PMID: 23209130 PMCID: PMC3579401 DOI: 10.1101/cshperspect.a012302] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Genome-wide analysis of translational control has taken strides in recent years owing to the advent of high-throughput technologies, including DNA microarrays and deep sequencing. Global studies have unraveled a principal role, among posttranscriptional mechanisms, for mRNA translation in determining protein levels in the cell. The impact of translational control in dynamic regulation of the proteome under different conditions is increasingly appreciated. Here we review genome-wide studies that use high-throughput techniques and bioinformatics to assess the role of mRNA translation in the regulation of protein levels; we also discuss how genome-wide data on mRNA translation can be obtained, analyzed, and used to identify mechanisms of translational control.
Collapse
Affiliation(s)
- Ola Larsson
- Department of Oncology-Pathology, Karolinska Institute, Stockholm SE-171 76, Sweden.
| | | | | |
Collapse
|
23
|
Janga SC. From specific to global analysis of posttranscriptional regulation in eukaryotes: posttranscriptional regulatory networks. Brief Funct Genomics 2012; 11:505-21. [PMID: 23124862 DOI: 10.1093/bfgp/els046] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Regulation of gene expression occurs at several levels in eukaryotic organisms and is a highly controlled process. Although RNAs have been traditionally viewed as passive molecules in the pathway from transcription to translation, there is mounting evidence that their metabolism is controlled by a class of proteins called RNA-binding proteins (RBPs), as well as a number of small RNAs. In this review, I provide an overview of the recent developments in our understanding of the repertoire of RBPs across diverse model systems, and discuss the computational and experimental approaches currently available for the construction of posttranscriptional networks governed by them. I also present an overview of the different roles played by RBPs in the cellular context, based on their cis-regulatory modules identified in the literature and discuss how their interplay can result in the dynamic, spatial and tissue-specific expression maps of RNAs. I finally present the concept of posttranscriptional network of RBPs and their cognate RNA targets and discuss their cross-talk with other important posttranscriptional regulatory molecules such as microRNAs s, resulting in diverse functional network motifs. I argue that with rapid developments in the genome-wide elucidation of posttranscriptional networks it would not only be possible to gain a deeper understanding of regulation at a level that has been under-appreciated in the past, but would also allow us to use the newly developed high-throughput approaches to interrogate the prevalence of these phenomena in different states, and thereby study their relevance to physiology and disease across organisms.
Collapse
Affiliation(s)
- Sarath Chandra Janga
- School of Informatics, Indiana University Purdue University, Indianapolis, Indiana, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 719 Indiana Ave Ste 319, Walker Plaza Building, IN 46202, USA.
| |
Collapse
|
24
|
Baez MV, Luchelli L, Maschi D, Habif M, Pascual M, Thomas MG, Boccaccio GL. Smaug1 mRNA-silencing foci respond to NMDA and modulate synapse formation. ACTA ACUST UNITED AC 2012; 195:1141-57. [PMID: 22201125 PMCID: PMC3246892 DOI: 10.1083/jcb.201108159] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
S-foci, the first reported mRNA-silencing foci specific to neurons, may control local mRNA translation in response to NMDA receptor stimulation and synaptic plasticity. Mammalian Smaug1/Samd4A is a translational repressor. Here we show that Smaug1 forms mRNA-silencing foci located at postsynapses of hippocampal neurons. These structures, which we have named S-foci, are distinct from P-bodies, stress granules, or other neuronal RNA granules hitherto described, and are the first described mRNA-silencing foci specific to neurons. RNA binding was not required for aggregation, which indicates that S-foci formation is not a consequence of mRNA silencing. N-methyl-d-aspartic acid (NMDA) receptor stimulation provoked a rapid and reversible disassembly of S-foci, transiently releasing transcripts (the CaMKIIα mRNA among others) to allow their translation. Simultaneously, NMDA triggered global translational silencing, which suggests the specific activation of Smaug1-repressed transcripts. Smaug1 is expressed during synaptogenesis, and Smaug1 knockdown affected the number and size of synapses, and also provoked an impaired response to repetitive depolarizing stimuli, as indicated by a reduced induction of Arc/Arg3.1. Our results suggest that S-foci control local translation, specifically responding to NMDA receptor stimulation and affecting synaptic plasticity.
Collapse
|
25
|
Walser CB, Lipshitz HD. Transcript clearance during the maternal-to-zygotic transition. Curr Opin Genet Dev 2011; 21:431-43. [PMID: 21497081 DOI: 10.1016/j.gde.2011.03.003] [Citation(s) in RCA: 115] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2011] [Accepted: 03/21/2011] [Indexed: 02/04/2023]
Abstract
In all animals, a key event in the transition from maternal control of development to control by products of the zygotic genome is the elimination of a significant fraction of the mRNAs loaded into the egg by the mother. Clearance of these maternal mRNAs is accomplished by two activities: the first is maternally encoded while the second requires zygotic transcription. Recent advances include identification of RNA-binding proteins that function as specificity factors to direct the maternal degradation machinery to its target mRNAs; small RNAs-most notably microRNAs-that function as components of the zygotically encoded activity; signaling pathways that trigger production and/or activation of the clearance mechanism in early embryos; and mechanisms for spatial control of transcript clearance.
Collapse
Affiliation(s)
- Claudia B Walser
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, Ontario, Canada M5S 1A8
| | | |
Collapse
|
26
|
Identification of differential translation in genome wide studies. Proc Natl Acad Sci U S A 2010; 107:21487-92. [PMID: 21115840 DOI: 10.1073/pnas.1006821107] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.
Collapse
|
27
|
Riordan DP, Herschlag D, Brown PO. Identification of RNA recognition elements in the Saccharomyces cerevisiae transcriptome. Nucleic Acids Res 2010; 39:1501-9. [PMID: 20959291 PMCID: PMC3045596 DOI: 10.1093/nar/gkq920] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Post-transcriptional regulation of gene expression, including mRNA localization, translation and decay, is ubiquitous yet still largely unexplored. How is the post-transcriptional regulatory program of each mRNA encoded in its sequence? Hundreds of specific RNA-binding proteins (RBPs) appear to play roles in mediating the post-transcriptional regulatory program, akin to the roles of specific DNA-binding proteins in transcription. As a step toward decoding the regulatory programs encoded in each mRNA, we focused on specific mRNA–protein interactions. We computationally analyzed the sequences of Saccharomyces cerevisiae mRNAs bound in vivo by 29 specific RBPs, identifying eight novel candidate motifs and confirming or extending six earlier reported recognition elements. Biochemical selections for RNA sequences selectively recognized by 12 yeast RBPs yielded novel motifs bound by Pin4, Nsr1, Hrb1, Gbp2, Sgn1 and Mrn1, and recovered the known recognition elements for Puf3, She2, Vts1 and Whi3. Most of the RNA elements we uncovered were associated with coherent mRNA expression changes and were significantly conserved in related yeasts, supporting their functional importance and suggesting that the corresponding RNA–protein interactions are evolutionarily conserved.
Collapse
Affiliation(s)
- Daniel P Riordan
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, USA.
| | | | | |
Collapse
|
28
|
Kazan H, Ray D, Chan ET, Hughes TR, Morris Q. RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins. PLoS Comput Biol 2010; 6:e1000832. [PMID: 20617199 PMCID: PMC2895634 DOI: 10.1371/journal.pcbi.1000832] [Citation(s) in RCA: 196] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2009] [Accepted: 05/25/2010] [Indexed: 12/31/2022] Open
Abstract
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.
Collapse
Affiliation(s)
- Hilal Kazan
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | | | | | | | | |
Collapse
|
29
|
Li X, Quon G, Lipshitz HD, Morris Q. Predicting in vivo binding sites of RNA-binding proteins using mRNA secondary structure. RNA (NEW YORK, N.Y.) 2010; 16:1096-107. [PMID: 20418358 PMCID: PMC2874161 DOI: 10.1261/rna.2017210] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
While many RNA-binding proteins (RBPs) bind RNA in a sequence-specific manner, their sequence preferences alone do not distinguish known target RNAs from other potential targets that are coexpressed and contain the same sequence motifs. Recently, the mRNA targets of dozens of RNA-binding proteins have been identified, facilitating a systematic study of the features of target transcripts. Using these data, we demonstrate that calculating the predicted structural accessibility of a putative RBP binding site allows one to significantly improve the accuracy of predicting in vivo binding for the majority of sequence-specific RBPs. In our new in silico approach, accessibility is predicted based solely on the mRNA sequence without consideration of the locations of bound trans-factors; as such, our results suggest a greater than previously anticipated role for intrinsic mRNA secondary structure in determining RBP binding target preference. Target site accessibility aids in predicting target transcripts and the binding sites for RBPs with a range of RNA-binding domains and subcellular functions. Based on this work, we introduce a new motif-finding algorithm that identifies accessible sequence-specific RBP motifs from in vivo binding data.
Collapse
Affiliation(s)
- Xiao Li
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1E3, Canada
| | | | | | | |
Collapse
|
30
|
News in brief. Nat Methods 2009. [DOI: 10.1038/nmeth0709-483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|