1
|
Li X, Wang GA, Wei Z, Wang H, Zhu X. Protein-DNA interface hotspots prediction based on fusion features of embeddings of protein language model and handcrafted features. Comput Biol Chem 2023; 107:107970. [PMID: 37866116 DOI: 10.1016/j.compbiolchem.2023.107970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 10/06/2023] [Accepted: 10/07/2023] [Indexed: 10/24/2023]
Abstract
The identification of hotspot residues at the protein-DNA binding interfaces plays a crucial role in various aspects such as drug discovery and disease treatment. Although experimental methods such as alanine scanning mutagenesis have been developed to determine the hotspot residues on protein-DNA interfaces, they are both inefficient and costly. Therefore, it is highly necessary to develop efficient and accurate computational methods for predicting hotspot residues. Several computational methods have been developed, however, they are mainly based on hand-crafted features which may not be able to represent all the information of proteins. In this regard, we propose a model called PDH-EH, which utilizes fused features of embeddings extracted from a protein language model (PLM) and handcrafted features. After we extracted the total 1141 dimensional features, we used mRMR to select the optimal feature subset. Based on the optimal feature subset, several different learning algorithms such as Random Forest, Support Vector Machine, and XGBoost were used to build the models. The cross-validation results on the training dataset show that the model built by using Random Forest achieves the highest AUROC. Further evaluation on the independent test set shows that our model outperforms the existing state-of-the-art models. Moreover, the effectiveness and interpretability of embeddings extracted from PLM were demonstrated in our analysis. The codes and datasets used in this study are available at: https://github.com/lixiangli01/PDH-EH.
Collapse
Affiliation(s)
- Xiang Li
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Gang-Ao Wang
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Zhuoyu Wei
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Hong Wang
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Xiaolei Zhu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China.
| |
Collapse
|
2
|
Pachota M, Grzywa R, Iwanejko J, Synowiec A, Iwan D, Kamińska K, Skoreński M, Bielecka E, Szczubialka K, Nowakowska M, Mackereth CD, Wojaczyńska E, Sieńczyk M, Pyrć K. Novel inhibitors of HSV-1 protease effective in vitro and in vivo. Antiviral Res 2023; 213:105604. [PMID: 37054954 DOI: 10.1016/j.antiviral.2023.105604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/06/2023] [Accepted: 04/10/2023] [Indexed: 04/15/2023]
Abstract
Herpes simplex virus type 1 (HSV-1) is a widespread human pathogen known to cause infections of diverse severity, ranging from mild ulceration of mucosal and dermal tissues to life-threatening viral encephalitis. In most cases, standard treatment with acyclovir is sufficient to manage the disease progression. However, the emergence of ACV-resistant strains drives the need for new therapeutics and molecular targets. HSV-1 VP24 is a protease indispensable for the assembly of mature virions and, as such, constitutes an interesting target for the therapy. In this study, we present novel compounds, KI207M and EWDI/39/55BF, that block the activity of VP24 protease and consequently inhibit HSV-1 infection in vitro and in vivo. The inhibitors were shown to prevent the egress of viral capsids from the cell nucleus and suppress the cell-to-cell spread of the infection. They were also proven effective against ACV-resistant HSV-1 strains. Considering their low toxicity and high antiviral potency, the novel VP24 inhibitors could provide an alternative for treating ACV-resistant infections or a drug to be used in combined, highly effective therapy.
Collapse
Affiliation(s)
- Magdalena Pachota
- Virogenetics Laboratory of Virology, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7a, 30-387, Kraków, Poland; Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Gronostajowa 7, 30-387, Kraków, Poland
| | - Renata Grzywa
- Department of Organic and Medicinal Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland
| | - Jakub Iwanejko
- Department of Physical and Quantum Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland
| | - Aleksandra Synowiec
- Virogenetics Laboratory of Virology, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7a, 30-387, Kraków, Poland
| | - Dominika Iwan
- Department of Physical and Quantum Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland
| | - Karolina Kamińska
- Department of Physical and Quantum Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland
| | - Marcin Skoreński
- Department of Organic and Medicinal Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland
| | - Ewa Bielecka
- Laboratory of Proteolysis and Post-translational Modification of Proteins, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7a, 30-387, Kraków, Poland
| | - Krzysztof Szczubialka
- Faculty of Chemistry, Jagiellonian University, Gronostajowa 2, 30-387, Kraków, Poland
| | - Maria Nowakowska
- Faculty of Chemistry, Jagiellonian University, Gronostajowa 2, 30-387, Kraków, Poland
| | - Cameron D Mackereth
- Univ. Bordeaux, Inserm U1212, CNRS UMR 5320, ARNA Laboratory, IECB, 33706, Pessac, France
| | - Elżbieta Wojaczyńska
- Department of Physical and Quantum Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland.
| | - Marcin Sieńczyk
- Department of Organic and Medicinal Chemistry, Wrocław University of Science and Technology, Wybrzeże Wyspianskiego 27, 50-370, Wrocław, Poland.
| | - Krzysztof Pyrć
- Virogenetics Laboratory of Virology, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7a, 30-387, Kraków, Poland.
| |
Collapse
|
3
|
Guo JT, Malik F. Single-Stranded DNA Binding Proteins and Their Identification Using Machine Learning-Based Approaches. Biomolecules 2022; 12:biom12091187. [PMID: 36139026 PMCID: PMC9496475 DOI: 10.3390/biom12091187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 08/11/2022] [Accepted: 08/24/2022] [Indexed: 11/25/2022] Open
Abstract
Single-stranded DNA (ssDNA) binding proteins (SSBs) are critical in maintaining genome stability by protecting the transient existence of ssDNA from damage during essential biological processes, such as DNA replication and gene transcription. The single-stranded region of telomeres also requires protection by ssDNA binding proteins from being attacked in case it is wrongly recognized as an anomaly. In addition to their critical roles in genome stability and integrity, it has been demonstrated that ssDNA and SSB-ssDNA interactions play critical roles in transcriptional regulation in all three domains of life and viruses. In this review, we present our current knowledge of the structure and function of SSBs and the structural features for SSB binding specificity. We then discuss the machine learning-based approaches that have been developed for the prediction of SSBs from double-stranded DNA (dsDNA) binding proteins (DSBs).
Collapse
|
4
|
Aguion PI, Marchanka A, Carlomagno T. Nucleic acid-protein interfaces studied by MAS solid-state NMR spectroscopy. J Struct Biol X 2022; 6:100072. [PMID: 36090770 PMCID: PMC9449856 DOI: 10.1016/j.yjsbx.2022.100072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 08/11/2022] [Accepted: 08/15/2022] [Indexed: 11/20/2022] Open
Abstract
Solid-state NMR (ssNMR) has become a well-established technique to study large and insoluble protein assemblies. However, its application to nucleic acid-protein complexes has remained scarce, mainly due to the challenges presented by overlapping nucleic acid signals. In the past decade, several efforts have led to the first structure determination of an RNA molecule by ssNMR. With the establishment of these tools, it has become possible to address the problem of structure determination of nucleic acid-protein complexes by ssNMR. Here we review first and more recent ssNMR methodologies that study nucleic acid-protein interfaces by means of chemical shift and peak intensity perturbations, direct distance measurements and paramagnetic effects. At the end, we review the first structure of an RNA-protein complex that has been determined from ssNMR-derived intermolecular restraints.
Collapse
Affiliation(s)
- Philipp Innig Aguion
- Institute for Organic Chemistry and Centre of Biomolecular Drug Research (BMWZ), Leibniz University Hannover, Schneiderberg 38, 30167 Hannover, Germany
| | - Alexander Marchanka
- Institute for Organic Chemistry and Centre of Biomolecular Drug Research (BMWZ), Leibniz University Hannover, Schneiderberg 38, 30167 Hannover, Germany
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Teresa Carlomagno
- School of Biosciences/College of Life and Enviromental Sciences, Institute of Cancer and Genomic Sciences/College of Medical and Dental Sciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK
| |
Collapse
|
5
|
RBM24 in the Post-Transcriptional Regulation of Cancer Progression: Anti-Tumor or Pro-Tumor Activity? Cancers (Basel) 2022; 14:cancers14071843. [PMID: 35406615 PMCID: PMC8997389 DOI: 10.3390/cancers14071843] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 03/30/2022] [Accepted: 04/01/2022] [Indexed: 12/11/2022] Open
Abstract
Simple Summary RBM24 is a highly conserved RNA-binding protein that plays critical roles in the post-transcriptional regulation of gene expression for initiating cell differentiation during embryonic development and for maintaining tissue homeostasis in adult life. Evidence is now accumulating that it is frequently dysregulated across human cancers. Importantly, RBM24 may act as a tumor suppressor or as an oncogene in a context- or background-dependent manner. Its activity can be regulated by protein–protein interactions and post-translational modifications, making it a potential therapeutic target for cancer treatment. However, molecular mechanisms underlying its function in tumor growth and metastasis remain elusive. Further investigation will be necessary to better understand how its post-transcriptional regulatory activity is controlled and how it is implicated in tumor progression. This review provides a comprehensive analysis of recent findings on the implication of RBM24 in cancer and proposes future research directions to delve more deeply into the mechanisms underlying its tumor-suppressive function or oncogenic activity. Abstract RNA-binding proteins are critical post-transcriptional regulators of gene expression. They are implicated in a wide range of physiological and pathological processes by modulating nearly every aspect of RNA metabolisms. Alterations in their expression and function disrupt tissue homeostasis and lead to the occurrence of various cancers. RBM24 is a highly conserved protein that binds to a large spectrum of target mRNAs and regulates many post-transcriptional events ranging from pre-mRNA splicing to mRNA stability, polyadenylation and translation. Studies using different animal models indicate that it plays an essential role in promoting cellular differentiation during organogenesis and tissue regeneration. Evidence is also accumulating that its dysregulation frequently occurs across human cancers. In several tissues, RBM24 clearly functions as a tumor suppressor, which is consistent with its inhibitory potential on cell proliferation. However, upregulation of RBM24 in other cancers appears to promote tumor growth. There is a possibility that RBM24 displays both anti-tumor and pro-tumor activities, which may be regulated in part through differential interactions with its protein partners and by its post-translational modifications. This makes it a potential biomarker for diagnosis and prognosis, as well as a therapeutic target for cancer treatment. The challenge remains to determine the post-transcriptional mechanisms by which RBM24 modulates gene expression and tumor progression in a context- or background-dependent manner. This review discusses recent findings on the potential function of RBM24 in tumorigenesis and provides future directions for better understanding its regulatory role in cancer cells.
Collapse
|
6
|
Calarco JA, Pilaka-Akella PP. Two-Color Fluorescent Reporters for Analysis of Alternative Splicing. Methods Mol Biol 2022; 2537:211-229. [PMID: 35895267 DOI: 10.1007/978-1-0716-2521-7_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Alternative splicing is a key layer of gene regulation that is frequently modulated in a spatiotemporal manner. As such, it is a major goal to understand the mechanisms controlling alternative splicing in specific cellular contexts. Reporters that recapitulate alternative splicing patterns of endogenous transcripts have served as excellent tools for dissecting regulatory mechanisms of splicing. In this chapter, we describe a two-color fluorescent reporter system that enables the visualization of alternative splicing patterns by microscopy at single-cell resolution in live animals. We present this reporter system in the context of the model nematode C. elegans.
Collapse
Affiliation(s)
- John A Calarco
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada.
| | | |
Collapse
|
7
|
Pan Y, Zhou S, Guan J. Computationally identifying hot spots in protein-DNA binding interfaces using an ensemble approach. BMC Bioinformatics 2020; 21:384. [PMID: 32938375 PMCID: PMC7495898 DOI: 10.1186/s12859-020-03675-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Protein-DNA interaction governs a large number of cellular processes, and it can be altered by a small fraction of interface residues, i.e., the so-called hot spots, which account for most of the interface binding free energy. Accurate prediction of hot spots is critical to understand the principle of protein-DNA interactions. There are already some computational methods that can accurately and efficiently predict a large number of hot residues. However, the insufficiency of experimentally validated hot-spot residues in protein-DNA complexes and the low diversity of the employed features limit the performance of existing methods. RESULTS Here, we report a new computational method for effectively predicting hot spots in protein-DNA binding interfaces. This method, called PreHots (the abbreviation of Predicting Hotspots), adopts an ensemble stacking classifier that integrates different machine learning classifiers to generate a robust model with 19 features selected by a sequential backward feature selection algorithm. To this end, we constructed two new and reliable datasets (one benchmark for model training and one independent dataset for validation), which totally consist of 123 hot spots and 137 non-hot spots from 89 protein-DNA complexes. The data were manually collected from the literature and existing databases with a strict process of redundancy removal. Our method achieves a sensitivity of 0.813 and an AUC score of 0.868 in 10-fold cross-validation on the benchmark dataset, and a sensitivity of 0.818 and an AUC score of 0.820 on the independent test dataset. The results show that our approach outperforms the existing ones. CONCLUSIONS PreHots, which is based on stack ensemble of boosting algorithms, can reliably predict hot spots at the protein-DNA binding interface on a large scale. Compared with the existing methods, PreHots can achieve better prediction performance. Both the webserver of PreHots and the datasets are freely available at: http://dmb.tongji.edu.cn/tools/PreHots/ .
Collapse
Affiliation(s)
- Yuliang Pan
- Department of Computer Science and Technology, Tongji University, No. 4800 Caoan Road, Shanghai, 201804, China
| | - Shuigeng Zhou
- Shanghai Key Laboratory of Intelligent Information Processing, and School of Computer Science, Fudan University, No. 220 Handan Road, Shanghai, 200433, China
| | - Jihong Guan
- Department of Computer Science and Technology, Tongji University, No. 4800 Caoan Road, Shanghai, 201804, China.
| |
Collapse
|
8
|
Grifone R, Shao M, Saquet A, Shi DL. RNA-Binding Protein Rbm24 as a Multifaceted Post-Transcriptional Regulator of Embryonic Lineage Differentiation and Cellular Homeostasis. Cells 2020; 9:E1891. [PMID: 32806768 PMCID: PMC7463526 DOI: 10.3390/cells9081891] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 08/06/2020] [Accepted: 08/07/2020] [Indexed: 12/12/2022] Open
Abstract
RNA-binding proteins control the metabolism of RNAs at all stages of their lifetime. They are critically required for the post-transcriptional regulation of gene expression in a wide variety of physiological and pathological processes. Rbm24 is a highly conserved RNA-binding protein that displays strongly regionalized expression patterns and exhibits dynamic changes in subcellular localization during early development. There is increasing evidence that it acts as a multifunctional regulator to switch cell fate determination and to maintain tissue homeostasis. Dysfunction of Rbm24 disrupts cell differentiation in nearly every tissue where it is expressed, such as skeletal and cardiac muscles, and different head sensory organs, but the molecular events that are affected may vary in a tissue-specific, or even a stage-specific manner. Recent works using different animal models have uncovered multiple post-transcriptional regulatory mechanisms by which Rbm24 functions in key developmental processes. In particular, it represents a major splicing factor in muscle cell development, and plays an essential role in cytoplasmic polyadenylation during lens fiber cell terminal differentiation. Here we review the advances in understanding the implication of Rbm24 during development and disease, by focusing on its regulatory roles in physiological and pathological conditions.
Collapse
Affiliation(s)
- Raphaëlle Grifone
- Developmental Biology Laboratory, CNRS-UMR7622, IBPS, Sorbonne University, 75005 Paris, France; (R.G.); (A.S.)
| | - Ming Shao
- Shandong Provincial Key Laboratory of Animal Cell and Developmental Biology, School of Life Sciences, Shandong University, Qingdao 266237, China;
| | - Audrey Saquet
- Developmental Biology Laboratory, CNRS-UMR7622, IBPS, Sorbonne University, 75005 Paris, France; (R.G.); (A.S.)
| | - De-Li Shi
- Developmental Biology Laboratory, CNRS-UMR7622, IBPS, Sorbonne University, 75005 Paris, France; (R.G.); (A.S.)
| |
Collapse
|
9
|
Structural basis for mRNA recognition by human RBM38. Biochem J 2020; 477:161-172. [PMID: 31860021 DOI: 10.1042/bcj20190652] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 12/17/2019] [Accepted: 12/19/2019] [Indexed: 12/29/2022]
Abstract
RNA-binding protein RBM38 was reported to bind the mRNA of several p53-related genes through its RRM domain and to up-regulate or down-regulate protein translation by increasing mRNA stability or recruitment of other effector proteins. The recognition mechanism, however, for RNA-binding of RBM38 remains unclear. Here, we report the crystal structure of the RRM domain of human RBM38 in complex with a single-stranded RNA. Our structural and biological results revealed that RBM38 recognizes G(U/C/A)GUG sequence single-stranded RNA in a sequence-specific and structure-specific manner. Two phenylalanine stacked with bases of RNA were crucial for RNA binding, and a series of hydrogen bonds between the base atoms of RNA and main-chain or side-chain atoms of RBM38 determine the sequence-specific recognition. Our results revealed the RNA-recognition mechanism of human RBM38 and provided structural information for understanding the RNA-binding property of RBM38.
Collapse
|
10
|
Arribere JA, Kuroyanagi H, Hundley HA. mRNA Editing, Processing and Quality Control in Caenorhabditis elegans. Genetics 2020; 215:531-568. [PMID: 32632025 PMCID: PMC7337075 DOI: 10.1534/genetics.119.301807] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 05/03/2020] [Indexed: 02/06/2023] Open
Abstract
While DNA serves as the blueprint of life, the distinct functions of each cell are determined by the dynamic expression of genes from the static genome. The amount and specific sequences of RNAs expressed in a given cell involves a number of regulated processes including RNA synthesis (transcription), processing, splicing, modification, polyadenylation, stability, translation, and degradation. As errors during mRNA production can create gene products that are deleterious to the organism, quality control mechanisms exist to survey and remove errors in mRNA expression and processing. Here, we will provide an overview of mRNA processing and quality control mechanisms that occur in Caenorhabditis elegans, with a focus on those that occur on protein-coding genes after transcription initiation. In addition, we will describe the genetic and technical approaches that have allowed studies in C. elegans to reveal important mechanistic insight into these processes.
Collapse
Affiliation(s)
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan, and
| | - Heather A Hundley
- Medical Sciences Program, Indiana University School of Medicine-Bloomington, Indiana 47405
| |
Collapse
|
11
|
Tourasse NJ, Millet JRM, Dupuy D. Quantitative RNA-seq meta-analysis of alternative exon usage in C. elegans. Genome Res 2017; 27:2120-2128. [PMID: 29089372 PMCID: PMC5741048 DOI: 10.1101/gr.224626.117] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 10/26/2017] [Indexed: 12/16/2022]
Abstract
Almost 20 years after the completion of the C. elegans genome sequence, gene structure annotation is still an ongoing process with new evidence for gene variants still being regularly uncovered by additional in-depth transcriptome studies. While alternative splice forms can allow a single gene to encode several functional isoforms, the question of how much spurious splicing is tolerated is still heavily debated. Here we gathered a compendium of 1682 publicly available C. elegans RNA-seq data sets to increase the dynamic range of detection of RNA isoforms, and obtained robust measurements of the relative abundance of each splicing event. While most of the splicing reads come from reproducibly detected splicing events, a large fraction of purported junctions is only supported by a very low number of reads. We devised an automated curation method that takes into account the expression level of each gene to discriminate robust splicing events from potential biological noise. We found that rarely used splice sites disproportionately come from highly expressed genes and are significantly less conserved in other nematode genomes than splice sites with a higher usage frequency. Our increased detection power confirmed trans-splicing for at least 84% of C. elegans protein coding genes. The genes for which trans-splicing was not observed are overwhelmingly low expression genes, suggesting that the mechanism is pervasive but not fully captured by organism-wide RNA-seq. We generated annotated gene models including quantitative exon usage information for the entire C. elegans genome. This allows users to visualize at a glance the relative expression of each isoform for their gene of interest.
Collapse
Affiliation(s)
- Nicolas J Tourasse
- Université de Bordeaux, Inserm U1212, CNRS UMR5320, Institut Européen de Chimie et Biologie (IECB), 33607 Pessac, France
| | - Jonathan R M Millet
- Université de Bordeaux, Inserm U1212, CNRS UMR5320, Institut Européen de Chimie et Biologie (IECB), 33607 Pessac, France
| | - Denis Dupuy
- Université de Bordeaux, Inserm U1212, CNRS UMR5320, Institut Européen de Chimie et Biologie (IECB), 33607 Pessac, France
| |
Collapse
|
12
|
Krepl M, Blatter M, Cléry A, Damberger FF, Allain FH, Sponer J. Structural study of the Fox-1 RRM protein hydration reveals a role for key water molecules in RRM-RNA recognition. Nucleic Acids Res 2017; 45:8046-8063. [PMID: 28505313 PMCID: PMC5737849 DOI: 10.1093/nar/gkx418] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Revised: 04/26/2017] [Accepted: 05/02/2017] [Indexed: 01/07/2023] Open
Abstract
The Fox-1 RNA recognition motif (RRM) domain is an important member of the RRM protein family. We report a 1.8 Å X-ray structure of the free Fox-1 containing six distinct monomers. We use this and the nuclear magnetic resonance (NMR) structure of the Fox-1 protein/RNA complex for molecular dynamics (MD) analyses of the structured hydration. The individual monomers of the X-ray structure show diverse hydration patterns, however, MD excellently reproduces the most occupied hydration sites. Simulations of the protein/RNA complex show hydration consistent with the isolated protein complemented by hydration sites specific to the protein/RNA interface. MD predicts intricate hydration sites with water-binding times extending up to hundreds of nanoseconds. We characterize two of them using NMR spectroscopy, RNA binding with switchSENSE and free-energy calculations of mutant proteins. Both hydration sites are experimentally confirmed and their abolishment reduces the binding free-energy. A quantitative agreement between theory and experiment is achieved for the S155A substitution but not for the S122A mutant. The S155 hydration site is evolutionarily conserved within the RRM domains. In conclusion, MD is an effective tool for predicting and interpreting the hydration patterns of protein/RNA complexes. Hydration is not easily detectable in NMR experiments but can affect stability of protein/RNA complexes.
Collapse
Affiliation(s)
- Miroslav Krepl
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska 135, 612 65 Brno, Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University Olomouc, 17. listopadu 12, 771 46 Olomouc, Czech Republic
| | - Markus Blatter
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, CH-8093 Zurich, Switzerland
- Present address: Global Discovery Chemistry, Novartis Institute for BioMedical Research, Basel CH-4002, Switzerland
| | - Antoine Cléry
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, CH-8093 Zurich, Switzerland
| | - Fred F. Damberger
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, CH-8093 Zurich, Switzerland
| | - Frédéric H.T. Allain
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, CH-8093 Zurich, Switzerland
| | - Jiri Sponer
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska 135, 612 65 Brno, Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University Olomouc, 17. listopadu 12, 771 46 Olomouc, Czech Republic
| |
Collapse
|
13
|
Wani S, Kuroyanagi H. An emerging model organism Caenorhabditis elegans for alternative pre-mRNA processing in vivo. WILEY INTERDISCIPLINARY REVIEWS-RNA 2017; 8. [PMID: 28703462 DOI: 10.1002/wrna.1428] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 05/02/2017] [Accepted: 05/02/2017] [Indexed: 12/13/2022]
Abstract
A nematode Caenorhabditis elegans is an intron-rich organism and up to 25% of its pre-mRNAs are estimated to be alternatively processed. Its compact genomic organization enables construction of fluorescence splicing reporters with intact genomic sequences and visualization of alternative processing patterns of interest in the transparent living animals with single-cell resolution. Genetic analysis with the reporter worms facilitated identification of trans-acting factors and cis-acting elements, which are highly conserved in mammals. Analysis of unspliced and partially spliced pre-mRNAs in vivo raised models for alternative splicing regulation relying on specific order of intron excision. RNA-seq analysis of splicing factor mutants and CLIP-seq analysis of the factors allow global search for target genes in the whole animal. An mRNA surveillance system is not essential for its viability or fertility, allowing analysis of unproductively spliced noncoding mRNAs. These features offer C. elegans as an ideal model organism for elucidating alternative pre-mRNA processing mechanisms in vivo. Examples of isoform-specific functions of alternatively processed genes are summarized. WIREs RNA 2017, 8:e1428. doi: 10.1002/wrna.1428 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Shotaro Wani
- Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Hidehito Kuroyanagi
- Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| |
Collapse
|
14
|
Soufari H, Mackereth CD. Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family. RNA (NEW YORK, N.Y.) 2017; 23:308-316. [PMID: 28003515 PMCID: PMC5311487 DOI: 10.1261/rna.059733.116] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 12/19/2016] [Indexed: 05/16/2023]
Abstract
Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets.
Collapse
Affiliation(s)
- Heddy Soufari
- University of Bordeaux, Institut Européen de Chimie et Biologie, F-33607 Pessac, France
- Inserm U1212, CNRS UMR 5320, ARNA Laboratory, F-33076 Bordeaux, France
| | - Cameron D Mackereth
- University of Bordeaux, Institut Européen de Chimie et Biologie, F-33607 Pessac, France
- Inserm U1212, CNRS UMR 5320, ARNA Laboratory, F-33076 Bordeaux, France
| |
Collapse
|
15
|
Yadav DK, Lukavsky PJ. NMR solution structure determination of large RNA-protein complexes. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2016; 97:57-81. [PMID: 27888840 DOI: 10.1016/j.pnmrs.2016.10.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 10/04/2016] [Accepted: 10/04/2016] [Indexed: 06/06/2023]
Abstract
Structure determination of RNA-protein complexes is essential for our understanding of the multiple layers of RNA-mediated posttranscriptional regulation of gene expression. Over the past 20years, NMR spectroscopy became a key tool for structural studies of RNA-protein interactions. Here, we review the progress being made in NMR structure determination of large ribonucleoprotein assemblies. We discuss approaches for the design of RNA-protein complexes for NMR structural studies, established and emerging isotope and segmental labeling schemes suitable for large RNPs and how to gain distance restraints from NOEs, PREs and EPR and orientational information from RDCs and SAXS/SANS in such systems. The new combination of NMR measurements with MD simulations and its potential will also be discussed. Application and combination of these various methods for structure determination of large RNPs will be illustrated with three large RNA-protein complexes (>40kDa) and other interesting complexes determined in the past six and a half years.
Collapse
Affiliation(s)
- Deepak Kumar Yadav
- Central European Institute of Technology, Masaryk University, Kamenice 753/5, 62500 Brno, Czech Republic
| | - Peter J Lukavsky
- Central European Institute of Technology, Masaryk University, Kamenice 753/5, 62500 Brno, Czech Republic.
| |
Collapse
|
16
|
Upadhyay SK, Mackereth CD. (1)H, (15)N and (13)C backbone and side chain resonance assignments of the RRM domain from human RBM24. BIOMOLECULAR NMR ASSIGNMENTS 2016; 10:237-240. [PMID: 27002326 DOI: 10.1007/s12104-016-9674-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 03/11/2016] [Indexed: 06/05/2023]
Abstract
Tissue development requires the expression of a regulated subset of genes, and it is becoming clear that the process of alternative splicing also plays an important role in the production of necessary tissue-specific isoforms. However, only a few of these tissue-specific splicing factors in mammals have so far been discovered. One of these factors is the RNA-binding protein RBM24 which has been recently identified as a major regulator of alternative splicing in cardiac and skeletal muscle development. The RBM24 protein contains an RNA recognition motif (RRM) domain that presumably mediates the binding to target pre-mRNA required for regulation of the splicing patterns. Here we report (1)H, (15)N and (13)C chemical shift assignments of the backbone and sidechain atoms for the RRM domain from human RBM24. Secondary chemical shift analysis and relaxation measurement confirm the canonical architecture of the RRM domain. The data will allow for atomic level studies aimed at understanding splicing regulation of target genes in heart and muscle development and investigation into a separate role of RBM24 in modulating mRNA stability of genes involved in the p53 tumor suppressor pathway.
Collapse
Affiliation(s)
| | - Cameron D Mackereth
- Institut Européen de Chimie et Biologie (IECB), University of Bordeaux, 2 rue Robert Escarpit, 33607, Pessac, France
- Inserm, U869, ARNA Laboratory, University of Bordeaux, 33076, Bordeaux, France
| |
Collapse
|
17
|
Gracida X, Norris AD, Calarco JA. Regulation of Tissue-Specific Alternative Splicing: C. elegans as a Model System. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 907:229-61. [DOI: 10.1007/978-3-319-29073-7_10] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
18
|
Abstract
RRM-containing proteins are involved in most of the RNA metabolism steps. Their functions are closely related to their mode of RNA recognition, which has been studied by structural biologists for more than 20 years. In this chapter, we report on high-resolution structures of single and multi RRM-RNA complexes to explain the numerous strategies used by these domains to interact specifically with a large repertoire of RNA sequences. We show that multiple variations of their canonical fold can be used to adapt to different single-stranded sequences with a large range of affinities. Furthermore, we describe the consequences on RNA binding of the different structural arrangements found in tandem RRMs and higher order RNPs. Importantly, these structures also reveal with very high accuracy the RNA motifs bound specifically by RRM-containing proteins, which correspond very often to consensus sequences identified with genome-wide approaches. Finally, we show how structural and cellular biology can benefit from each other and pave a way for understanding, defining, and predicting a code of RNA recognition by the RRMs.
Collapse
|
19
|
Mackereth CD. Splicing factor SUP-12 and the molecular complexity of apparent cooperativity. WORM 2015; 3:e991240. [PMID: 26430555 PMCID: PMC4588554 DOI: 10.4161/21624054.2014.991240] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2014] [Accepted: 11/20/2014] [Indexed: 12/17/2022]
Abstract
The splicing factor SUP-12 from C. elegans, in combination with either ASD-1 or FOX-1 from the Fox-1 (RBFOX) family, is required for generating a muscle-specific isoform of the fibroblast growth factor receptor EGL-15. Biophysical techniques have revealed the sequence preference for the RNA Recognition Motif (RRM) domain from SUP-12 as well as the structural details of the RNA-bound complex. Detailed genetics have identified a requisite need for the presence of both SUP-12 and ASD-1/FOX-1 to regulate the alternative splicing event, prompting speculation of a cooperative mechanism between these proteins on binding RNA. In contrast, the interplay between SUP-12 and ASD-1 suggests that although the RRM domains from each protein are in direct contact on the egl-15 pre-mRNA, there is no simple contribution of binding cooperativity. Evidence for an independent binding mechanism by SUP-12 and ASD-1 will be discussed, including a model in which both positive and negative contributions are balanced during complex assembly. The ability to monitor tissue-specific alternative splicing in live nematodes will continue to provide a powerful method to test in vivo mechanistic models derived from atomic-level investigation.
Collapse
Affiliation(s)
- Cameron D Mackereth
- Inserm U869; University of Bordeaux; Institut Européen de Chimie et Biologie Pessac ; France
| |
Collapse
|