Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Ellis JJ, Broom M, Jones S. Protein-RNA interactions: structural analysis and functional classes. Proteins 2007;66:903-11. [PMID: 17186525 DOI: 10.1002/prot.21211] [Citation(s) in RCA: 146] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Number

Cited by Other Article(s)

101

Ahmad S, Sarai A. Analysis of electric moments of RNA-binding proteins: implications for mechanism and prediction. BMC STRUCTURAL BIOLOGY 2011;11:8. [PMID: 21284850 PMCID: PMC3048485 DOI: 10.1186/1472-6807-11-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2010] [Accepted: 02/01/2011] [Indexed: 11/24/2022]

Abstract

Background

Protein-RNA interactions play important role in many biological processes such as gene regulation, replication, protein synthesis and virus assembly. Although many structures of various types of protein-RNA complexes have been determined, the mechanism of protein-RNA recognition remains elusive. We have earlier shown that the simplest electrostatic properties viz. charge, dipole and quadrupole moments, calculated from backbone atomic coordinates of proteins are biased relative to other proteins, and these quantities can be used to identify DNA-binding proteins. Closely related, RNA-binding proteins are investigated in this study. In particular, discrimination between various types of RNA-binding proteins, evolutionary conservation of these bulk electrostatic features and effect of conformational changes by complex formation are investigated. Basic binding mechanism of a putative RNA-binding protein (HI1333 from Haemophilus influenza) is suggested as a potential application of this study.

Results

We found that similar to DNA-binding proteins (DBPs), RNA-binding proteins (RBPs) also show significantly higher values of electric moments. However, higher moments in RBPs are found to strongly depend on their functional class: proteins binding to ribosomal RNA (rRNA) constitute the only class with all three of the properties (charge, dipole and quadrupole moments) being higher than control proteins. Neural networks were trained using leave-one-out cross-validation to predict RBPs from control data as well as pair-wise classification capacity between proteins binding to various RNA types. RBPs and control proteins reached up to 78% accuracy measured by the area under the ROC curve. Proteins binding to rRNA are found to be best distinguished (AUC = 79%). Changes in dipole and quadrupole moments between unbound and bound structures were small and these properties are found to be robust under complex formation.

Conclusions

Bulk electric moments of proteins considered here provide insights into target recognition by RNA-binding proteins, as well as ability to recognize one type of RBP from others. These results help in understanding the mechanism of protein-RNA recognition, and identifying RNA-binding proteins.

Collapse

102

Structural changes in the SL5 and SL6 leader sequences of HIV-1 RNA following interactions with the viral mGag protein. Virus Res 2011;155:98-105. [DOI: 10.1016/j.virusres.2010.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2010] [Revised: 08/31/2010] [Accepted: 09/07/2010] [Indexed: 11/20/2022]

103

Zhou P, Tian F, Ren Y, Shang Z. Systematic classification and analysis of themes in protein-DNA recognition. J Chem Inf Model 2010;50:1476-88. [PMID: 20726602 DOI: 10.1021/ci100145d] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

104

Huang YF, Chiu LY, Huang CC, Huang CK. Predicting RNA-binding residues from evolutionary information and sequence conservation. BMC Genomics 2010;11 Suppl 4:S2. [PMID: 21143803 PMCID: PMC3005934 DOI: 10.1186/1471-2164-11-s4-s2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

105

Speir JA, Taylor DJ, Natarajan P, Pringle FM, Ball LA, Johnson JE. Evolution in action: N and C termini of subunits in related T = 4 viruses exchange roles as molecular switches. Structure 2010;18:700-9. [PMID: 20541507 DOI: 10.1016/j.str.2010.03.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2009] [Revised: 03/04/2010] [Accepted: 03/06/2010] [Indexed: 10/19/2022]

106

Caly DL, O'Toole PW, Moore SA. The 2.2-Å structure of the HP0958 protein from Helicobacter pylori reveals a kinked anti-parallel coiled-coil hairpin domain and a highly conserved ZN-ribbon domain. J Mol Biol 2010;403:405-19. [PMID: 20826163 DOI: 10.1016/j.jmb.2010.08.051] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 08/24/2010] [Accepted: 08/26/2010] [Indexed: 11/25/2022]

107

Li X, Quon G, Lipshitz HD, Morris Q. Predicting in vivo binding sites of RNA-binding proteins using mRNA secondary structure. RNA (NEW YORK, N.Y.) 2010;16:1096-107. [PMID: 20418358 PMCID: PMC2874161 DOI: 10.1261/rna.2017210] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]

108

Beck K, Vannini A, Cramer P, Lipps G. The archaeo-eukaryotic primase of plasmid pRN1 requires a helix bundle domain for faithful primer synthesis. Nucleic Acids Res 2010;38:6707-18. [PMID: 20511586 PMCID: PMC2965215 DOI: 10.1093/nar/gkq447] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

109

Murakami Y, Spriggs RV, Nakamura H, Jones S. PiRaNhA: a server for the computational prediction of RNA-binding residues in protein sequences. Nucleic Acids Res 2010;38:W412-6. [PMID: 20507911 PMCID: PMC2896099 DOI: 10.1093/nar/gkq474] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

110

Liu ZP, Wu LY, Wang Y, Zhang XS, Chen L. Prediction of protein-RNA binding sites by a random forest method with combined features. ACTA ACUST UNITED AC 2010;26:1616-22. [PMID: 20483814 DOI: 10.1093/bioinformatics/btq253] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Abstract

MOTIVATION

Protein-RNA interactions play a key role in a number of biological processes, such as protein synthesis, mRNA processing, mRNA assembly, ribosome function and eukaryotic spliceosomes. As a result, a reliable identification of RNA binding site of a protein is important for functional annotation and site-directed mutagenesis. Accumulated data of experimental protein-RNA interactions reveal that a RNA binding residue with different neighbor amino acids often exhibits different preferences for its RNA partners, which in turn can be assessed by the interacting interdependence of the amino acid fragment and RNA nucleotide.

RESULTS

In this work, we propose a novel classification method to identify the RNA binding sites in proteins by combining a new interacting feature (interaction propensity) with other sequence- and structure-based features. Specifically, the interaction propensity represents a binding specificity of a protein residue to the interacting RNA nucleotide by considering its two-side neighborhood in a protein residue triplet. The sequence as well as the structure-based features of the residues are combined together to discriminate the interaction propensity of amino acids with RNA. We predict RNA interacting residues in proteins by implementing a well-built random forest classifier. The experiments show that our method is able to detect the annotated protein-RNA interaction sites in a high accuracy. Our method achieves an accuracy of 84.5%, F-measure of 0.85 and AUC of 0.92 prediction of the RNA binding residues for a dataset containing 205 non-homologous RNA binding proteins, and also outperforms several existing RNA binding residue predictors, such as RNABindR, BindN, RNAProB and PPRint, and some alternative machine learning methods, such as support vector machine, naive Bayes and neural network in the comparison study. Furthermore, we provide some biological insights into the roles of sequences and structures in protein-RNA interactions by both evaluating the importance of features for their contributions in predictive accuracy and analyzing the binding patterns of interacting residues.

AVAILABILITY

All the source data and code are available at http://www.aporc.org/doc/wiki/PRNA or http://www.sysbio.ac.cn/datatools.asp

CONTACT

lnchen@sibs.ac.cn

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

111

Janas T, Widmann JJ, Knight R, Yarus M. Simple, recurring RNA binding sites for L-arginine. RNA (NEW YORK, N.Y.) 2010;16:805-816. [PMID: 20194519 PMCID: PMC2844627 DOI: 10.1261/rna.1979410] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 12/09/2009] [Indexed: 05/28/2023]

112

Zhou P, Zou J, Tian F, Shang Z. Geometric similarity between protein-RNA interfaces. J Comput Chem 2010;30:2738-51. [PMID: 19399760 DOI: 10.1002/jcc.21300] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

113

Bahadur RP, Kannan S, Zacharias M. Binding of the bacteriophage P22 N-peptide to the boxB RNA motif studied by molecular dynamics simulations. Biophys J 2010;97:3139-49. [PMID: 20006951 DOI: 10.1016/j.bpj.2009.09.035] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2009] [Revised: 08/28/2009] [Accepted: 09/01/2009] [Indexed: 11/30/2022] Open

114

Davis FP, Sali A. The overlap of small molecule and protein binding sites within families of protein structures. PLoS Comput Biol 2010;6:e1000668. [PMID: 20140189 PMCID: PMC2816688 DOI: 10.1371/journal.pcbi.1000668] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2009] [Accepted: 12/31/2009] [Indexed: 02/03/2023] Open

Abstract

Protein–protein interactions are challenging targets for modulation by small molecules. Here, we propose an approach that harnesses the increasing structural coverage of protein complexes to identify small molecules that may target protein interactions. Specifically, we identify ligand and protein binding sites that overlap upon alignment of homologous proteins. Of the 2,619 protein structure families observed to bind proteins, 1,028 also bind small molecules (250–1000 Da), and 197 exhibit a statistically significant (p<0.01) overlap between ligand and protein binding positions. These “bi-functional positions”, which bind both ligands and proteins, are particularly enriched in tyrosine and tryptophan residues, similar to “energetic hotspots” described previously, and are significantly less conserved than mono-functional and solvent exposed positions. Homology transfer identifies ligands whose binding sites overlap at least 20% of the protein interface for 35% of domain–domain and 45% of domain–peptide mediated interactions. The analysis recovered known small-molecule modulators of protein interactions as well as predicted new interaction targets based on the sequence similarity of ligand binding sites. We illustrate the predictive utility of the method by suggesting structural mechanisms for the effects of sanglifehrin A on HIV virion production, bepridil on the cellular entry of anthrax edema factor, and fusicoccin on vertebrate developmental pathways. The results, available at http://pibase.janelia.org, represent a comprehensive collection of structurally characterized modulators of protein interactions, and suggest that homologous structures are a useful resource for the rational design of interaction modulators.

Proteins function through their interactions with other biological molecules, including other proteins. Often times, these interactions underlie cellular processes that go awry in disease. Therefore, modulating these interactions with small molecules is an active area of research for new drugs to treat diseases and new chemical tools to dissect cellular interaction networks. However, targeting protein–protein interactions has proven to be more challenging than the typical drug targets found on individual proteins. Here, we present a computational approach that aims to help in this challenge by identifying regions of protein–protein interfaces that may be amenable to targeting by small molecules. Through a comprehensive analysis of all known protein structures, we identify closely related proteins that in one case bind a protein and in another case bind a small molecule. We find that a significant number of protein–protein interactions occur through surface regions that bind small molecules in related proteins. These “bi-functional” positions, which can bind both proteins and ligands, will serve as an additional piece of structural information that can aid experimentalists in developing small molecules that modulate protein interactions.

Collapse

115

Ciriello G, Gallina C, Guerra C. Analysis of interactions between ribosomal proteins and RNA structural motifs. BMC Bioinformatics 2010;11 Suppl 1:S41. [PMID: 20122215 PMCID: PMC3009514 DOI: 10.1186/1471-2105-11-s1-s41] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

Abstract

Background

One important goal of structural bioinformatics is to recognize and predict the interactions between protein binding sites and RNA. Recently, a comprehensive analysis of ribosomal proteins and their interactions with rRNA has been done. Interesting results emerged from the comparison of r-proteins within the small subunit in T. thermophilus and E. coli, supporting the idea of a core made by both RNA and proteins, conserved by evolution. Recent work showed also that ribosomal RNA is modularly composed. Motifs are generally single-stranded sequences of consecutive nucleotides (ssRNA) with characteristic folding. The role of these motifs in protein-RNA interactions has been so far only sparsely investigated.

Results

This work explores the role of RNA structural motifs in the interaction of proteins with ribosomal RNA (rRNA). We analyze composition, local geometries and conformation of interface regions involving motifs such as tetraloops, kink turns and single extruded nucleotides. We construct an interaction map of protein binding sites that allows us to identify the common types of shared 3-D physicochemical binding patterns for tetraloops. Furthermore, we investigate the protein binding pockets that accommodate single extruded nucleotides either involved in kink-turns or in arbitrary RNA strands. This analysis reveals a new structural motif, called tripod.

It corresponds to small pockets consisting of three aminoacids arranged at the vertices of an almost equilateral triangle. We developed a search procedure for the recognition of tripods, based on an empirical tripod fingerprint.

Conclusion

A comparative analysis with the overall RNA surface and interfaces shows that contact surfaces involving RNA motifs have distinctive features that may be useful for the recognition and prediction of interactions.

Collapse

116

Eliahoo E, Ben Yosef R, Pérez-Cano L, Fernández-Recio J, Glaser F, Manor H. Mapping of interaction sites of the Schizosaccharomyces pombe protein Translin with nucleic acids and proteins: a combined molecular genetics and bioinformatics study. Nucleic Acids Res 2010;38:2975-89. [PMID: 20081200 PMCID: PMC2875027 DOI: 10.1093/nar/gkp1230] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

117

Identification and Analysis of Binding Site Residues in Protein Complexes: Energy Based Approach. ACTA ACUST UNITED AC 2010. [DOI: 10.1007/978-3-642-14922-1_78] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

118

Sutch BT, Chambers EJ, Bayramyan MZ, Gallaher TK, Haworth IS. Similarity of Protein-RNA Interfaces Based on Motif Analysis. J Chem Inf Model 2009;49:2139-46. [DOI: 10.1021/ci900154a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

119

Spriggs RV, Jones S. RNA-binding residues in sequence space: conservation and interaction patterns. Comput Biol Chem 2009;33:397-403. [PMID: 19700370 DOI: 10.1016/j.compbiolchem.2009.07.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2009] [Revised: 07/14/2009] [Accepted: 07/18/2009] [Indexed: 10/20/2022]

Abstract

RNA-binding proteins (RBPs) perform fundamental and diverse functions within the cell. Approximately 15% of proteins sequences are annotated as RNA-binding, but with a significant number of proteins without functional annotation, many RBPs are yet to be identified. A percentage of uncharacterised proteins can be annotated by transferring functional information from proteins sharing significant sequence homology. However, genomes contain a significant number of orphan open reading frames (ORFs) that do not share significant sequence similarity to other ORFs, but correspond to functional proteins. Hence methods for protein function annotation that go beyond sequence homology are essential. One method of annotation is the identification of ligands that bind to proteins, through the characterisation of binding site residues. In the current work RNA-binding residues (RBRs) are characterised in terms of their evolutionary conservation and the patterns they form in sequence space. The potential for such characteristics to be used to identify RBPs from sequence is then evaluated. In the current work the conservation of residues in 261 RBPs is compared for (a) RBRs vs. non-RBRs surface residues, and for (b) specific and non-specific RBRs. The analysis shows that RBRs are more conserved than other surface residues, and RBRs hydrogen-bonded to the RNA backbone are more conserved than those making hydrogen bonds to RNA bases. This observed conservation of RBRs was then used to inform the construction of RBR sequence patterns from known protein-RNA structures. A series of RBR patterns were generated for a case study protein aspartyl-tRNA synthetase bound to tRNA; and used to differentiate between RNA-binding and non-RNA-binding protein sequences. Six sequence patterns performed with high precision values of >80% and recall values 7 times that of an homology search. When the method was expanded to the complete dataset of 261 proteins, many patterns were of poor predictive value, as they had not been manipulated on a family-specific basis. However, two patterns with precision values > or = 85% were used to make function predictions for a set of hypothetical proteins. This revealed a number of potential RBPs that require experimental verification.

Collapse

120

Czyżnikowska Ż, Lipkowski P, Góra RW, Zaleśny R, Cheng AC. On the Nature of Intermolecular Interactions in Nucleic Acid Base−Amino Acid Side-Chain Complexes. J Phys Chem B 2009;113:11511-20. [DOI: 10.1021/jp904146m] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

121

Pérez-Cano L, Fernández-Recio J. Optimal protein-RNA area, OPRA: A propensity-based method to identify RNA-binding sites on proteins. Proteins 2009;78:25-35. [DOI: 10.1002/prot.22527] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

122

Ligand binding and substrate discrimination by UDP-galactopyranose mutase. J Mol Biol 2009;391:327-40. [PMID: 19500588 DOI: 10.1016/j.jmb.2009.05.081] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2009] [Revised: 05/20/2009] [Accepted: 05/27/2009] [Indexed: 11/23/2022]

Abstract

Galactofuranose (Galf) residues are present in cell wall glycoconjugates of numerous pathogenic microbes. Uridine 5'-diphosphate (UDP) Galf, the biosynthetic precursor of Galf-containing glycoconjugates, is produced from UDP-galactopyranose (UDP-Galp) by the flavoenzyme UDP-galactopyranose mutase (UGM). The gene encoding UGM (glf) is essential for the viability of pathogens, including Mycobacterium tuberculosis, and this finding underscores the need to understand how UGM functions. Considerable effort has been devoted to elucidating the catalytic mechanism of UGM, but progress has been hindered by a lack of structural data for an enzyme-substrate complex. Such data could reveal not only substrate binding interactions but how UGM can act preferentially on two very different substrates, UDP-Galp and UDP-Galf, yet avoid other structurally related UDP sugars present in the cell. Herein, we describe the first structure of a UGM-ligand complex, which provides insight into the catalytic mechanism and molecular basis for substrate selectivity. The structure of UGM from Klebsiella pneumoniae bound to the substrate analog UDP-glucose (UDP-Glc) was solved by X-ray crystallographic methods and refined to 2.5 A resolution. The ligand is proximal to the cofactor, a finding that is consistent with a proposed mechanism in which the reduced flavin engages in covalent catalysis. Despite this proximity, the glucose ring of the substrate analog is positioned such that it disfavors covalent catalysis. This orientation is consistent with data indicating that UDP-Glc is not a substrate for UGM. The relative binding orientations of UDP-Galp and UDP-Glc were compared using saturation transfer difference NMR. The results indicate that the uridine moiety occupies a similar location in both ligand complexes, and this relevant binding mode is defined by our structural data. In contrast, the orientations of the glucose and galactose sugar moieties differ. To understand the consequences of these differences, we derived a model for the productive UGM-substrate complex that highlights interactions that can contribute to catalysis and substrate discrimination.

Collapse

123

Sonavane S, Chakrabarti P. Cavities in protein-DNA and protein-RNA interfaces. Nucleic Acids Res 2009;37:4613-20. [PMID: 19494181 PMCID: PMC2724294 DOI: 10.1093/nar/gkp488] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

124

Lee SJ, Hyun S, Kieft JS, Yu J. An approach to the construction of tailor-made amphiphilic peptides that strongly and selectively bind to hairpin RNA targets. J Am Chem Soc 2009;131:2224-30. [PMID: 19199621 DOI: 10.1021/ja807609m] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Abstract

The hairpin RNA motif is one of the most frequently observed secondary structures and is often targeted by therapeutic agents. An amphiphilic peptide with seven lysine and eight leucine residues and its derivatives were designed for use as ligands against RNA hairpin motifs. We hypothesized that variations in both the hydrophobic leucine-rich and hydrophilic lysine-rich spheres of these amphiphilic peptides would create extra attractive interactions with hairpin RNA targets. A series of alanine-scanned peptides were probed to identify the most influential lysine residues in the hydrophilic sphere. The binding affinities of these modified peptides with several hairpins, such as RRE, TAR from HIV, a short hairpin from IRES of HCV, and a hairpin from the 16S A-site stem from rRNA, were determined. Since the hairpin from IRES of HCV was the most susceptible to the initial series of alanine-scanned peptides, studies investigating how further variations in the peptides effect binding employed the IRES hairpin. Next, the important Lys residues were substituted by shorter chain amines, such as ornithine, to place the peptide deeper into the hairpin groove. In a few cases, a 70-fold improved binding was observed for peptides that contained the specifically located shorter amine side chains. To further explore changes in binding affinities brought about by alterations in the hydrophobic sphere, tryptophan residues were introduced in place of leucine. A few peptides with tryptophan in specific positions also displayed 70-fold improved binding affinities. Finally, double mutant peptides incorporating both specifically located shorter amine side chains in the hydrophilic region and tryptophan residues in the hydrophobic region were synthesized. The binding affinities of peptides containing the simple double modification were observed to be 80 times lower, and their binding specificities were increased 40-fold. The results of this effort provide important information about strategies that can be used to prepare peptides that both strongly and selectively target hairpin RNAs. Specifically, the findings indicate that tailor-made amphiphilic peptide ligands against certain hairpin RNAs can be obtained if the RNA target possesses a deep groove in which both the hydrophobic and hydrophilic spheres of the peptide interact.

Collapse

125

Spriggs RV, Murakami Y, Nakamura H, Jones S. Protein function annotation from sequence: prediction of residues interacting with RNA. Bioinformatics 2009;25:1492-7. [DOI: 10.1093/bioinformatics/btp257] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

126

Investigation of atomic level patterns in protein--small ligand interactions. PLoS One 2009;4:e4473. [PMID: 19221587 PMCID: PMC2637420 DOI: 10.1371/journal.pone.0004473] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2008] [Accepted: 01/07/2009] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

Shape complementarity and non-covalent interactions are believed to drive protein-ligand interaction. To date protein-protein, protein-DNA, and protein-RNA interactions were systematically investigated, which is in contrast to interactions with small ligands. We investigate the role of covalent and non-covalent bonds in protein-small ligand interactions using a comprehensive dataset of 2,320 complexes.

METHODOLOGY AND PRINCIPAL FINDINGS

We show that protein-ligand interactions are governed by different forces for different ligand types, i.e., protein-organic compound interactions are governed by hydrogen bonds, van der Waals contacts, and covalent bonds; protein-metal ion interactions are dominated by electrostatic force and coordination bonds; protein-anion interactions are established with electrostatic force, hydrogen bonds, and van der Waals contacts; and protein-inorganic cluster interactions are driven by coordination bonds. We extracted several frequently occurring atomic-level patterns concerning these interactions. For instance, 73% of investigated covalent bonds were summarized with just three patterns in which bonds are formed between thiol of Cys and carbon or sulfur atoms of ligands, and nitrogen of Lys and carbon of ligands. Similar patterns were found for the coordination bonds. Hydrogen bonds occur in 67% of protein-organic compound complexes and 66% of them are formed between NH- group of protein residues and oxygen atom of ligands. We quantify relative abundance of specific interaction types and discuss their characteristic features. The extracted protein-organic compound patterns are shown to complement and improve a geometric approach for prediction of binding sites.

CONCLUSIONS AND SIGNIFICANCE

We show that for a given type (group) of ligands and type of the interaction force, majority of protein-ligand interactions are repetitive and could be summarized with several simple atomic-level patterns. We summarize and analyze 10 frequently occurring interaction patterns that cover 56% of all considered complexes and we show a practical application for the patterns that concerns interactions with organic compounds.

Collapse

127

Running WE, Reilly JP. Ribosomal Proteins of Deinococcus radiodurans: Their Solvent Accessibility and Reactivity. J Proteome Res 2009;8:1228-46. [DOI: 10.1021/pr800544y] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

128

Relating Macromolecular Function and Association: The Structural Basis of Protein–DNA and RNA Recognition. Cell Mol Bioeng 2008. [DOI: 10.1007/s12195-008-0032-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open

129

Chen YC, Lim C. Common physical basis of macromolecule-binding sites in proteins. Nucleic Acids Res 2008;36:7078-87. [PMID: 18988628 PMCID: PMC2602788 DOI: 10.1093/nar/gkn868] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

130

Yura K. [Trial to predict interactions between proteins and biomolecules based on their three-dimensional structures]. YAKUGAKU ZASSHI 2008;128:1547-55. [PMID: 18981688 DOI: 10.1248/yakushi.128.1547] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

131

Protein–protein interaction and quaternary structure. Q Rev Biophys 2008;41:133-80. [PMID: 18812015 DOI: 10.1017/s0033583508004708] [Citation(s) in RCA: 304] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

132

Tomovic A, Oakeley EJ. Computational structural analysis: multiple proteins bound to DNA. PLoS One 2008;3:e3243. [PMID: 18802470 PMCID: PMC2532747 DOI: 10.1371/journal.pone.0003243] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2008] [Accepted: 08/24/2008] [Indexed: 01/25/2023] Open

Abstract

Background

With increasing numbers of crystal structures of protein∶DNA and protein∶protein∶DNA complexes publically available, it is now possible to extract sufficient structural, physical-chemical and thermodynamic parameters to make general observations and predictions about their interactions. In particular, the properties of macromolecular assemblies of multiple proteins bound to DNA have not previously been investigated in detail.

Methodology/Principal Findings

We have performed computational structural analyses on macromolecular assemblies of multiple proteins bound to DNA using a variety of different computational tools: PISA; PROMOTIF; X3DNA; ReadOut; DDNA and DCOMPLEX. Additionally, we have developed and employed an algorithm for approximate collision detection and overlapping volume estimation of two macromolecules. An implementation of this algorithm is available at http://promoterplot.fmi.ch/Collision1/. The results obtained are compared with structural, physical-chemical and thermodynamic parameters from protein∶protein and single protein∶DNA complexes. Many of interface properties of multiple protein∶DNA complexes were found to be very similar to those observed in binary protein∶DNA and protein∶protein complexes. However, the conformational change of the DNA upon protein binding is significantly higher when multiple proteins bind to it than is observed when single proteins bind. The water mediated contacts are less important (found in less quantity) between the interfaces of components in ternary (protein∶protein∶DNA) complexes than in those of binary complexes (protein∶protein and protein∶DNA).The thermodynamic stability of ternary complexes is also higher than in the binary interactions. Greater specificity and affinity of multiple proteins binding to DNA in comparison with binary protein-DNA interactions were observed. However, protein-protein binding affinities are stronger in complexes without the presence of DNA.

Conclusions/Significance

Our results indicate that the interface properties: interface area; number of interface residues/atoms and hydrogen bonds; and the distribution of interface residues, hydrogen bonds, van der Walls contacts and secondary structure motifs are independent of whether or not a protein is in a binary or ternary complex with DNA. However, changes in the shape of the DNA reduce the off-rate of the proteins which greatly enhances the stability and specificity of ternary complexes compared to binary ones.

Collapse

133

Shazman S, Mandel-Gutfreund Y. Classifying RNA-binding proteins based on electrostatic properties. PLoS Comput Biol 2008;4:e1000146. [PMID: 18716674 PMCID: PMC2518515 DOI: 10.1371/journal.pcbi.1000146] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 06/26/2008] [Indexed: 01/15/2023] Open

Abstract

Protein structure can provide new insight into the biological function of a protein and can enable the design of better experiments to learn its biological roles. Moreover, deciphering the interactions of a protein with other molecules can contribute to the understanding of the protein's function within cellular processes. In this study, we apply a machine learning approach for classifying RNA-binding proteins based on their three-dimensional structures. The method is based on characterizing unique properties of electrostatic patches on the protein surface. Using an ensemble of general protein features and specific properties extracted from the electrostatic patches, we have trained a support vector machine (SVM) to distinguish RNA-binding proteins from other positively charged proteins that do not bind nucleic acids. Specifically, the method was applied on proteins possessing the RNA recognition motif (RRM) and successfully classified RNA-binding proteins from RRM domains involved in protein–protein interactions. Overall the method achieves 88% accuracy in classifying RNA-binding proteins, yet it cannot distinguish RNA from DNA binding proteins. Nevertheless, by applying a multiclass SVM approach we were able to classify the RNA-binding proteins based on their RNA targets, specifically, whether they bind a ribosomal RNA (rRNA), a transfer RNA (tRNA), or messenger RNA (mRNA). Finally, we present here an innovative approach that does not rely on sequence or structural homology and could be applied to identify novel RNA-binding proteins with unique folds and/or binding motifs.

Gene expression in all living organisms is regulated by a complex set of events at both transcriptional and posttranscriptional levels. RNA-binding proteins play a key role in posttranscriptional events including splicing, stability, transport, and translation. Nowadays, there is increasing evidence that many other cellular processes may be mediated by RNA. Identifying new proteins involved in interaction with RNA is thus essential to unraveling the cellular processes in which these interactions are involved. In the current study we present a successful computational approach for classifying RNA-binding proteins and distinguishing them from other proteins based on structural and electrostatic properties. We test the method on a unique protein domain, the RNA recognition motif (RRM), which mediates both RNA and protein interactions. We show that we can discriminate RNA-binding RRMs from protein-binding RRMs. Further, we demonstrate that we can classify known RNA-binding proteins based on their RNA target (mRNA, rRNA, or tRNA). Our method does not rely on any kind of evolutionary information and thus can be applied to identify RNA-binding proteins with novel modes of RNA recognition.

Collapse

134

Biswas S, Guharoy M, Chakrabarti P. Structural segments and residue propensities in protein-RNA interfaces: comparison with protein-protein and protein-DNA complexes. Bioinformation 2008;2:422-7. [PMID: 18841236 PMCID: PMC2561160 DOI: 10.6026/97320630002422] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Revised: 06/19/2008] [Accepted: 07/07/2008] [Indexed: 12/29/2022] Open

135

Ellis JJ, Jones S. Evaluating conformational changes in protein structures binding RNA. Proteins 2008;70:1518-26. [PMID: 17910059 DOI: 10.1002/prot.21647] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

136

Bahadur RP, Zacharias M, Janin J. Dissecting protein-RNA recognition sites. Nucleic Acids Res 2008;36:2705-16. [PMID: 18353859 PMCID: PMC2377425 DOI: 10.1093/nar/gkn102] [Citation(s) in RCA: 98] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

137

Chen YC, Lim C. Predicting RNA-binding sites from the protein structure based on electrostatics, evolution and geometry. Nucleic Acids Res 2008;36:e29. [PMID: 18276647 PMCID: PMC2275128 DOI: 10.1093/nar/gkn008] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open