1
|
Nagarajan R, Ahmad S, Gromiha MM. Novel approach for selecting the best predictor for identifying the binding sites in DNA binding proteins. Nucleic Acids Res 2013; 41:7606-14. [PMID: 23788679 PMCID: PMC3763535 DOI: 10.1093/nar/gkt544] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Protein-DNA complexes play vital roles in many cellular processes by the interactions of amino acids with DNA. Several computational methods have been developed for predicting the interacting residues in DNA-binding proteins using sequence and/or structural information. These methods showed different levels of accuracies, which may depend on the choice of data sets used in training, the feature sets selected for developing a predictive model, the ability of the models to capture information useful for prediction or a combination of these factors. In many cases, different methods are likely to produce similar results, whereas in others, the predictors may return contradictory predictions. In this situation, a priori estimates of prediction performance applicable to the system being investigated would be helpful for biologists to choose the best method for designing their experiments. In this work, we have constructed unbiased, stringent and diverse data sets for DNA-binding proteins based on various biologically relevant considerations: (i) seven structural classes, (ii) 86 folds, (iii) 106 superfamilies, (iv) 194 families, (v) 15 binding motifs, (vi) single/double-stranded DNA, (vii) DNA conformation (A, B, Z, etc.), (viii) three functions and (ix) disordered regions. These data sets were culled as non-redundant with sequence identities of 25 and 40% and used to evaluate the performance of 11 different methods in which online services or standalone programs are available. We observed that the best performing methods for each of the data sets showed significant biases toward the data sets selected for their benchmark. Our analysis revealed important data set features, which could be used to estimate these context-specific biases and hence suggest the best method to be used for a given problem. We have developed a web server, which considers these features on demand and displays the best method that the investigator should use. The web server is freely available at http://www.biotech.iitm.ac.in/DNA-protein/. Further, we have grouped the methods based on their complexity and analyzed the performance. The information gained in this work could be effectively used to select the best method for designing experiments.
Collapse
Affiliation(s)
- R Nagarajan
- Department of Biotechnology, Indian Institute of Technology Madras, Chennai 600036, India and National Institute of Biomedical Innovation, Osaka, Japan
| | | | | |
Collapse
|
2
|
Gromiha MM. Structure based sequence dependent stiffness scale for trinucleotides: a direct method. J Biol Phys 2013; 26:43-50. [PMID: 23345711 DOI: 10.1023/a:1005250718139] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A new set of stiffness parameters for all the 32trinucleotide units has been set up directly from thethree dimensional structures of DNA molecules. It wasobserved that GAC/GTC is the stiffest trinucleotideand ACC/GGT is the most flexible one. The averagestiffness values computed for a set of operatorsequences using the new parameters correlate very wellwith the protein-DNA binding specificity and bindingfree energy change of 434 repressor and Cro repressor,respectively. The new structure based stiffness scalecan explain the protein-DNA binding specificity to thelevel of 0.92.
Collapse
Affiliation(s)
- M M Gromiha
- The Institute of Physical and Chemical Research (RIKEN), Tsukuba Life Science Center, 3-1-1 Koyadai, Tsukuba, Ibaraki, 305-0074 Japan
| |
Collapse
|
3
|
Gromiha MM, Nagarajan R. Computational approaches for predicting the binding sites and understanding the recognition mechanism of protein-DNA complexes. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2013; 91:65-99. [PMID: 23790211 DOI: 10.1016/b978-0-12-411637-5.00003-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Protein-DNA recognition plays an important role in the regulation of gene expression. Understanding the influence of specific residues for protein-DNA interactions and the recognition mechanism of protein-DNA complexes is a challenging task in molecular and computational biology. Several computational approaches have been put forward to tackle these problems from different perspectives: (i) development of databases for the interactions between protein and DNA and binding specificity of protein-DNA complexes, (ii) structural analysis of protein-DNA complexes, (iii) discriminating DNA-binding proteins from amino acid sequence, (iv) prediction of DNA-binding sites and protein-DNA binding specificity using sequence and/or structural information, and (v) understanding the recognition mechanism of protein-DNA complexes. In this review, we focus on all these issues and extensively discuss the advancements on the development of comprehensive bioinformatics databases for protein-DNA interactions, efficient tools for identifying the binding sites, and plausible mechanisms for understanding the recognition of protein-DNA complexes. Further, the available online resources for understanding protein-DNA interactions are collectively listed, which will serve as ready-to-use information for the research community.
Collapse
Affiliation(s)
- M Michael Gromiha
- Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India.
| | | |
Collapse
|
4
|
Rawat K, Aswal VK, Bohidar HB. DNA–Gelatin Complex Coacervation, UCST and First-Order Phase Transition of Coacervate to Anisotropic ion gel in 1-Methyl-3-octylimidazolium Chloride Ionic Liquid Solutions. J Phys Chem B 2012. [DOI: 10.1021/jp3102089] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- Kamla Rawat
- Polymer and Biophysics Laboratory, School of Physical Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - V. K. Aswal
- Solid State Physics Division, Bhabha Atomic Research Centre, Mumbai 400094, India
| | - H. B. Bohidar
- Polymer and Biophysics Laboratory, School of Physical Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| |
Collapse
|
5
|
Gromiha MM, Fukui K. Scoring function based approach for locating binding sites and understanding recognition mechanism of protein-DNA complexes. J Chem Inf Model 2011; 51:721-9. [PMID: 21361378 DOI: 10.1021/ci1003703] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Protein-DNA recognition plays an essential role in the regulation of gene expression. Understanding the recognition mechanism of protein-DNA complexes is a challenging task in molecular and computational biology. In this work, a scoring function based approach has been developed for identifying the binding sites and delineating the important residues for binding in protein-DNA complexes. This approach considers both the repulsive interactions and the effect of distance between atoms in protein and DNA. The results showed that positively charged, polar, and aromatic residues are important for binding. These residues influence the formation of electrostatic, hydrogen bonding, and stacking interactions. Our observation has been verified with experimental binding specificity of protein-DNA complexes and found to be in good agreement with experiments. The comparison of protein-RNA and protein-DNA complexes reveals that the contribution of phosphate atoms in DNA is twice as large as in protein-RNA complexes. Furthermore, we observed that the positively charged, polar, and aromatic residues serve as hotspot residues in protein-RNA complexes, whereas other residues also altered the binding specificity in protein-DNA complexes. Based on the results obtained in the present study and related reports, a plausible mechanism has been proposed for the recognition of protein-DNA complexes.
Collapse
Affiliation(s)
- M Michael Gromiha
- Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamilnadu, India.
| | | |
Collapse
|
6
|
Abstract
In this paper we study the scaling behavior of nucleotide cluster in 11 chromosomes of Encephalitozoon cuniculi Genome. The statistical distribution of nucleotide clusters for 11 chromosomes is characterized by the scaling behavior of P(S) proportional, variant e(-alphaS), where S represents nucleotide cluster size. The cluster-size distribution P(S(1)+S(2)) with the total size of sequential C-G cluster and A-T cluster S(1)+S(2) were also studied. P(S(1)+S(2)) follows exponential decay. There does not exist the case of large C-G cluster following large A-T cluster or large A-T cluster following large C-G cluster. We also discuss the relatively random walk length function L(n) and the local compositional complexity of nucleotide sequences based on a new model. These investigations may provide some insight into nucleotide cluster of DNA sequence.
Collapse
Affiliation(s)
- Jun Cheng
- Department of Physics, Jinhua College of Profession and Technology, Jinhua 321017, China.
| | | | | |
Collapse
|
7
|
Lou C, Yang X, Liu X, He B, Ouyang Q. A quantitative study of lambda-phage SWITCH and its components. Biophys J 2007; 92:2685-93. [PMID: 17259278 PMCID: PMC1831702 DOI: 10.1529/biophysj.106.097089] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We propose what we believe is a new model to quantitatively describe the lambda-phage SWITCH system. The model incorporates facilitated transfer mechanism of transcription factor, which can be simplified into a two-step reaction. We first sequentially obtain two indispensable parameters by fitting our model to experimental data of two simple systems, and then apply them to study the natural lambda-SWITCH system. By incorporating the facilitated transfer mechanism, we find that in RecA(-) host Escherichia coli, the wild-type lambda-lysogenic state is in a monostable regime rather than in a bistable regime. Furthermore, the model explains the weak role of Cro protein and probably sheds light on the evolution of lambda-Cro protein, which is known to be structurally distinct from the other Cros in lambdoid family members.
Collapse
Affiliation(s)
- Chunbo Lou
- Center for Theoretical Biology and School of Physics, Peking University, Beijing, 100871, China
| | | | | | | | | |
Collapse
|
8
|
Becker NB, Wolff L, Everaers R. Indirect readout: detection of optimized subsequences and calculation of relative binding affinities using different DNA elastic potentials. Nucleic Acids Res 2006; 34:5638-49. [PMID: 17038333 PMCID: PMC1636474 DOI: 10.1093/nar/gkl683] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2006] [Revised: 09/05/2006] [Accepted: 09/06/2006] [Indexed: 01/23/2023] Open
Abstract
Essential biological processes require that proteins bind to a set of specific DNA sites with tuned relative affinities. We focus on the indirect readout mechanism and discuss its theoretical description in relation to the present understanding of DNA elasticity on the rigid base pair level. Combining existing parametrizations of elastic potentials for DNA, we derive elastic free energies directly related to competitive binding experiments, and propose a computationally inexpensive local marker for elastically optimized subsequences in protein-DNA co-crystals. We test our approach in an application to the bacteriophage 434 repressor. In agreement with known results we find that indirect readout dominates at the central, non-contacted bases of the binding site. Elastic optimization involves all deformation modes and is mainly due to the adapted equilibrium structure of the operator, while sequence-dependent elasticity plays a minor role. These qualitative observations are robust with respect to current parametrization uncertainties. Predictions for relative affinities mediated by indirect readout depend sensitively on the chosen parametrization. Their quantitative comparison with experimental data allows for a critical evaluation of DNA elastic potentials and of the correspondence between crystal and solution structures. The software written for the presented analysis is included as Supplementary Data.
Collapse
Affiliation(s)
- Nils B Becker
- Max-Planck-Institut für Physik komplexer Systeme, Nöthnitzer Strasse 38, 01187 Dresden, Germany.
| | | | | |
Collapse
|
9
|
Gromiha MM, Siebers JG, Selvaraj S, Kono H, Sarai A. Role of inter and intramolecular interactions in protein-DNA recognition. Gene 2005; 364:108-13. [PMID: 16249059 DOI: 10.1016/j.gene.2005.07.022] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2005] [Revised: 06/27/2005] [Accepted: 07/18/2005] [Indexed: 11/17/2022]
Abstract
Protein-DNA recognition plays an essential role in the regulation of gene expression. Regulatory proteins are known to recognize specific DNA sequences directly through atomic contacts between protein and DNA, and/or indirectly through the conformational properties of the DNA. In this work, we have analyzed the specificity of intermolecular interactions by statistical analysis of base-amino acid interactions within protein-DNA complexes as well as the computer simulations of base-amino acid interactions. The specificity of the intramolecular interactions was studied by statistical analysis of the sequence-dependent DNA conformational parameters and the elastic properties of DNA. Systematic comparison of these specificities in a large number of protein-DNA complexes revealed that both intermolecular and intramolecular interactions contribute to the specificity of protein-DNA recognition, and their relative contributions vary depending upon the protein-DNA complex. We demonstrated that combination of the intermolecular and intramolecular energies leads to enhanced specificity and the combined energy could explain experimental data on binding affinity changes caused by base mutations. These results provided new insight into the relationship between specificity and structure in the process of protein-DNA recognition, which would lead to prediction of specific protein-DNA binding sites.
Collapse
Affiliation(s)
- M Michael Gromiha
- Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), AIST Tokyo Waterfront Bio-IT Research Building, 2-42 Aomi, Koto-ku, Tokyo, 135-0064, Japan.
| | | | | | | | | |
Collapse
|
10
|
Gromiha MM. Influence of DNA stiffness in protein–DNA recognition. J Biotechnol 2005; 117:137-45. [PMID: 15823403 DOI: 10.1016/j.jbiotec.2004.12.016] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2004] [Revised: 11/02/2004] [Accepted: 12/06/2004] [Indexed: 11/21/2022]
Abstract
Protein-DNA recognition plays an essential role in the regulation of gene expression. The protein-DNA binding specificity is based on direct atomic contacts between protein and DNA and/or the conformational properties of DNA. In this work, we have analyzed the influence of DNA stiffness (E) to the specificity of protein-DNA complexes. The average DNA stiffness parameters for several protein-DNA complexes have been computed using the structure based sequence dependent stiffness scale. The relationship between DNA stiffness and experimental protein-DNA binding specificity has been brought out. We have investigated the importance of DNA stiffness with the aid of experimental free energy changes (DeltaDeltaG) due to binding in several protein-DNA complexes, such as, ETS proteins, 434, lambda, Mnt and trp repressors, 434 cro protein, EcoRV endonuclease V and zinc fingers. We found a correlation in the range 0.65-0.97 between DeltaDeltaG and E in these examples. Further, we have qualitatively analyzed the effect of mutations in the target sequence of lambda repressor and we observed that the DNA stiffness could correctly identify 70% of the correct bases among the considered nine positions.
Collapse
Affiliation(s)
- M Michael Gromiha
- Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan.
| |
Collapse
|
11
|
Michael Gromiha M, Siebers JG, Selvaraj S, Kono H, Sarai A. Intermolecular and Intramolecular Readout Mechanisms in Protein–DNA Recognition. J Mol Biol 2004; 337:285-94. [PMID: 15003447 DOI: 10.1016/j.jmb.2004.01.033] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2003] [Revised: 01/16/2004] [Accepted: 01/21/2004] [Indexed: 11/20/2022]
Abstract
Protein-DNA recognition plays an essential role in the regulation of gene expression. Regulatory proteins are known to recognize specific DNA sequences directly through atomic contacts (intermolecular readout) and/or indirectly through the conformational properties of the DNA (intramolecular readout). However, little is known about the respective contributions made by these so-called direct and indirect readout mechanisms. We addressed this question by making use of information extracted from a structural database containing many protein-DNA complexes. We quantified the specificity of intermolecular (direct) readout by statistical analysis of base-amino acid interactions within protein-DNA complexes. The specificity of the intramolecular (indirect) readout due to DNA was quantified by statistical analysis of the sequence-dependent DNA conformation. Systematic comparison of these specificities in a large number of protein-DNA complexes revealed that both intermolecular and intramolecular readouts contribute to the specificity of protein-DNA recognition, and that their relative contributions vary depending upon the protein-DNA complexes. We demonstrated that combination of the intermolecular and intramolecular energies derived from the statistical analyses lead to enhanced specificity, and that the combined energy could explain experimental data on binding affinity changes caused by base mutations. These results provided new insight into the relationship between specificity and structure in the process of protein-DNA recognition, which would lead to prediction of specific protein-DNA binding sites.
Collapse
Affiliation(s)
- M Michael Gromiha
- Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Aomi Frontier Building 17F, Aomi, Koto-ku, Tokyo 135-0064, Japan
| | | | | | | | | |
Collapse
|
12
|
Fonseca AF, de Aguiar MA. Near equilibrium dynamics of nonhomogeneous Kirchhoff filaments in viscous media. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2001; 63:016611. [PMID: 11304382 DOI: 10.1103/physreve.63.016611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2000] [Revised: 08/11/2000] [Indexed: 05/23/2023]
Abstract
We study the near equilibrium dynamics of nonhomogeneous elastic filaments in viscous media using the Kirchhoff model of rods. Viscosity is incorporated in the model as an external force, which we approximate by the resistance felt by an infinite cylinder immersed in a slowly moving fluid. We use the recently developed method of Goriely and Tabor [Phys. Rev. Lett. 77, 3537 (1996); Physica D 105, 20 (1997); 105, 45 (1997)] to study the dynamics in the vicinity of the simplest equilibrium solution for a closed rod with nonhomogeneous distribution of mass, namely, the planar ring configuration. We show that small variations of the mass density along the rod are sufficient to couple the symmetric modes of the homogeneous rod problem, producing asymmetric deformations that modify substantially the dynamical coiling, even at quite low Reynolds number. The higher-density segments of the rod tend to become more rigid and less coiled. We comment on possible applications to DNA.
Collapse
Affiliation(s)
- A F Fonseca
- Instituto de Física Gleb Wataghin, Universidade Estadual de Campinas, Unicamp 13083-970, Campinas, SP, Brazil
| | | |
Collapse
|
13
|
|