1
Dey U, Olymon K, Banik A, Abbas E, Yella VR, Kumar A. DNA structural properties of DNA binding sites for 21 transcription factors in the mycobacterial genome. Front Cell Infect Microbiol 2023; 13:1147544. [PMID: 37396305 PMCID: PMC10312376 DOI: 10.3389/fcimb.2023.1147544] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/19/2023] [Accepted: 05/19/2023] [Indexed: 07/04/2023] Open
Abstract
Mycobacterium tuberculosis, the causative agent of tuberculosis, has evolved over time into a multidrug resistance strain that poses a serious global pandemic health threat. The ability to survive and remain dormant within the host macrophage relies on multiple transcription factors contributing to virulence. To date, very limited structural insights from crystallographic and NMR studies are available for TFs and TF-DNA binding events. Understanding the role of DNA structure in TF binding is critical to deciphering MTB pathogenicity and has yet to be resolved at the genome scale. In this work, we analyzed the compositional and conformational preference of 21 mycobacterial TFs, evident at their DNA binding sites, in local and global scales. Results suggest that most TFs prefer binding to genomic regions characterized by unique DNA structural signatures, namely, high electrostatic potential, narrow minor grooves, high propeller twist, helical twist, intrinsic curvature, and DNA rigidity compared to the flanking sequences. Additionally, preference for specific trinucleotide motifs, with clear periodic signals of tetranucleotide motifs, are observed in the vicinity of the TF-DNA interactions. Altogether, our study reports nuanced DNA shape and structural preferences of 21 TFs.
Affiliation(s)
- Upalabdha Dey
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, India
- Kaushika Olymon
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, India
- Anikesh Banik
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, India
- Eshan Abbas
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, India
- Venkata Rajesh Yella
  - Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Guntur, India
- Aditya Kumar
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, India
2
Sarkar S, Dey U, Khohliwe TB, Yella VR, Kumar A. Analysis of nucleoid-associated protein-binding regions reveals DNA structural features influencing genome organization in Mycobacterium tuberculosis. FEBS Lett 2021; 595:2504-2521. [PMID: 34387867 DOI: 10.1002/1873-3468.14178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/11/2021] [Revised: 08/01/2021] [Accepted: 08/11/2021] [Indexed: 11/10/2022]
Abstract
Nucleoid-associated proteins (NAPs) maintain bacterial nucleoid configuration through their architectural properties of DNA bending, wrapping, and bridging. However, the contribution of DNA structural alterations to DNA-NAP recognition at the genomic scale remains unresolved. Present work dissects the DNA sequence, shape and altered structural preferences at a genomic scale for six NAPs in Mycobacterium tuberculosis. Results suggest narrower minor groove width (MGW) and higher DNA rigidity are marked for the binding sites of EspR and Lsr2, while mIHF, MtHU and NapM have heterogeneous DNA structural predilections. In contrast, WhiB4-DNA-binding sites were characterized by wider MGW, highly deformable and less curved DNA. This work provides systematic insight into NAP-mediated genome organization as a function of DNA structural features.
Affiliation(s)
- Sharmilee Sarkar
  - Department of Molecular Biology and Biotechnology, Tezpur University, India
- Upalabdha Dey
  - Department of Molecular Biology and Biotechnology, Tezpur University, India
- Venkata Rajesh Yella
  - Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Guntur, India
- Aditya Kumar
  - Department of Molecular Biology and Biotechnology, Tezpur University, India
3
Martinez GS, Sarkar S, Kumar A, Pérez‐Rueda E, de Avila e Silva S. Characterization of promoters in archaeal genomes based on DNA structural parameters. Microbiologyopen 2021; 10:e1230. [PMID: 34713600 PMCID: PMC8553660 DOI: 10.1002/mbo3.1230] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 07/15/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 11/10/2022] Open
Abstract
The transcription machinery of archaea can be roughly classified as a simplified version of eukaryotic organisms. The basal transcription factor machinery binds to the TATA box found around 28 nucleotides upstream of the transcription start site; however, some transcription units lack a clear TATA box and still have TBP/TFB binding over them. This apparent absence of conserved sequences could be a consequence of sequence divergence associated with the upstream region, operon, and gene organization. Furthermore, earlier studies have found that a structural analysis gains more information compared with a simple sequence inspection. In this work, we evaluated and coded 3630 archaeal promoter sequences of three organisms, Haloferax volcanii, Thermococcus kodakarensis, and Sulfolobus solfataricus into DNA duplex stability, enthalpy, curvature, and bendability parameters. We also split our dataset into conserved TATA and degenerated TATA promoters to identify differences among these two classes of promoters. The structural analysis reveals variations in archaeal promoter architecture, that is, a distinctive signal is observed in the TFB, TBP, and TFE binding sites independently of these being TATA-conserved or TATA-degenerated. In addition, the promoter encountering method was validated with upstream regions of 13 other archaea, suggesting that there might be promoter sequences among them. Therefore, we suggest a novel method for locating promoters within the genome of archaea based on DNA energetic/structural features.
Affiliation(s)
- Sharmilee Sarkar
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam, India
- Aditya Kumar
  - Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam, India
- Ernesto Pérez‐Rueda
  - Unidad Académica de Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Mérida, Yucatán, México
4
Zenil H, Minary P. Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences. Nucleic Acids Res 2019; 47:e129. [PMID: 31511887 PMCID: PMC6846163 DOI: 10.1093/nar/gkz750] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 07/10/2019] [Revised: 07/10/2019] [Accepted: 08/27/2019] [Indexed: 01/01/2023] Open
Abstract
We introduce and study a set of training-free methods of an information-theoretic and algorithmic complexity nature that we apply to DNA sequences to identify their potential to identify nucleosomal binding sites. We test the measures on well-studied genomic sequences of different sizes drawn from different sources. The measures reveal the known in vivo versus in vitro predictive discrepancies and uncover their potential to pinpoint high and low nucleosome occupancy. We explore different possible signals within and beyond the nucleosome length and find that the complexity indices are informative of nucleosome occupancy. We found that, while it is clear that the gold standard Kaplan model is driven by GC content (by design) and by k-mer training; for high occupancy, entropy and complexity-based scores are also informative and can complement the Kaplan model.
Affiliation(s)
- Hector Zenil
  - Oxford Immune Algorithmics, Oxford University Innovation, Oxford, UK
  - Algorithmic Dynamics Lab, Unit of Computational Medicine, SciLifeLab, Center for Molecular Medicine, Karolinska Institute, Stockholm, Sweden
  - Algorithmic Nature Group, LABORES for the Natural and Digital Sciences, Paris, France
  - Department of Computer Science, University of Oxford, Oxford, UK
- Peter Minary
  - Department of Computer Science, University of Oxford, Oxford, UK
5
Ghoshdastidar D, Bansal M. Dynamics of physiologically relevant noncanonical DNA structures: an overview from experimental and theoretical studies. Brief Funct Genomics 2018; 18:192-204. [DOI: 10.1093/bfgp/ely026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 02/07/2018] [Revised: 06/23/2018] [Accepted: 07/09/2018] [Indexed: 12/23/2022] Open
Abstract
DNA is a complex molecule with phenomenal inherent plasticity and the ability to form different hydrogen bonding patterns of varying stabilities. These properties enable DNA to attain a variety of structural and conformational polymorphic forms. Structurally, DNA can exist in single-stranded form or as higher-order structures, which include the canonical double helix as well as the noncanonical duplex, triplex and quadruplex species. Each of these structural forms in turn encompasses an ensemble of dynamically heterogeneous conformers depending on the sequence composition and environmental context. In vivo, the widely populated canonical B-DNA attains these noncanonical polymorphs during important cellular processes. While several investigations have focused on the structure of these noncanonical DNA, studying their dynamics has remained nontrivial. Here, we outline findings from some recent advanced experimental and molecular simulation techniques that have significantly contributed toward understanding the complex dynamics of physiologically relevant noncanonical forms of DNA.
Affiliation(s)
- Manju Bansal
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
6
Kumar A, Bansal M. Unveiling DNA structural features of promoters associated with various types of TSSs in prokaryotic transcriptomes and their role in gene expression. DNA Res 2017; 24:25-35. [PMID: 27803028 PMCID: PMC5381344 DOI: 10.1093/dnares/dsw045] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 05/12/2016] [Accepted: 09/23/2016] [Indexed: 01/28/2023] Open
Abstract
Next-generation sequencing studies have revealed that a variety of transcripts are present in the prokaryotic transcriptome and a significant fraction of them are functional, being involved in various regulatory activities apart from coding for proteins. Identification of promoters associated with different transcripts is necessary for characterization of the transcriptome. Promoter regions have been shown to have unique structural features as compared with their flanking region, in organisms covering all domains of life. Here we report an in silico analysis of DNA sequence dependent structural properties like stability, bendability and curvature in the promoter region of six different prokaryotic transcriptomes. Using these structural features, we predicted promoters associated with different categories of transcripts (mRNA, internal, antisense and non-coding), which constitute the transcriptome. Promoter annotation using structural features is fairly accurate and reliable with about 50% of the primary promoters being characterized by all three structural properties while at least one property identifies 95%. We also studied the relative differences of these structural features in terms of gene expression and found that the features, viz. lower stability, lesser bendability and higher curvature are more prominent in the promoter regions which are associated with high gene expression as compared with low expression genes. Hence, promoters, which are associated with higher gene expression, get annotated well using DNA structural features as compared with those, which are linked to lower gene expression.
Affiliation(s)
- Manju Bansal
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012 Karnataka, India
7
Mondal M, Halder S, Chakrabarti J, Bhattacharyya D. Hybrid simulation approach incorporating microscopic interaction along with rigid body degrees of freedom for stacking between base pairs. Biopolymers 2015; 105:212-26. [PMID: 26600167 DOI: 10.1002/bip.22787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 06/05/2015] [Revised: 10/19/2015] [Accepted: 11/17/2015] [Indexed: 11/07/2022]
Abstract
Stacking interaction between the aromatic heterocyclic bases plays an important role in the double helical structures of nucleic acids. Considering the base as rigid body, there are total of 18 degrees of freedom of a dinucleotide step. Some of these parameters show sequence preferences, indicating that the detailed atomic interactions are important in the stacking. Large variants of non-canonical base pairs have been seen in the crystallographic structures of RNA. However, their stacking preferences are not thoroughly deciphered yet from experimental results. The current theoretical approaches use either the rigid body degrees of freedom where the atomic information are lost or computationally expensive all atom simulations. We have used a hybrid simulation approach incorporating Monte-Carlo Metropolis sampling in the hyperspace of 18 stacking parameters where the interaction energies using AMBER-parm99bsc0 and CHARMM-36 force-fields were calculated from atomic positions. We have also performed stacking energy calculations for structures from Monte-Carlo ensemble by Dispersion corrected density functional theory. The available experimental data with Watson-Crick base pairs are compared to establish the validity of the method. Stacking interaction involving A:U and G:C base pairs with non-canonical G:U base pairs also were calculated and showed that these structures were also sequence dependent. This approach could be useful to generate multiscale modeling of nucleic acids in terms of coarse-grained parameters where the atomic interactions are preserved. This method would also be useful to predict structure and dynamics of different base pair steps containing non Watson-Crick base pairs, as found often in the non-coding RNA structures. © 2015 Wiley Periodicals, Inc. Biopolymers 105: 212-226, 2016.
Affiliation(s)
- Manas Mondal
  - Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700 064, India
- Sukanya Halder
  - Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700 064, India
- Jaydeb Chakrabarti
  - Department of Chemical, Biological and Macro-Molecular Sciences, S.N. Bose National Center for Basic Sciences, Sector III, Salt Lake, Kolkata, 700 098, India
- Dhananjay Bhattacharyya
  - Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700 064, India
8
Role of indirect readout mechanism in TATA box binding protein-DNA interaction. J Comput Aided Mol Des 2015; 29:283-95. [PMID: 25575717 DOI: 10.1007/s10822-014-9828-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 09/26/2014] [Accepted: 12/18/2014] [Indexed: 12/11/2022]
Abstract
Gene expression generally initiates from recognition of TATA-box binding protein (TBP) to the minor groove of DNA of TATA box sequence where the DNA structure is significantly different from B-DNA. We have carried out molecular dynamics simulation studies of TBP-DNA system to understand how the DNA structure alters for efficient binding. We observed rigid nature of the protein while the DNA of TATA box sequence has an inherent flexibility in terms of bending and minor groove widening. The bending analysis of the free DNA and the TBP bound DNA systems indicate presence of some similar structures. Principal coordinate ordination analysis also indicates some structural features of the protein bound and free DNA are similar. Thus we suggest that the DNA of TATA box sequence regularly oscillates between several alternate structures and the one suitable for TBP binding is induced further by the protein for proper complex formation.
9
Mukherjee S, Kundu S, Bhattacharyya D. Temperature effect on poly(dA).poly(dT): molecular dynamics simulation studies of polymeric and oligomeric constructs. J Comput Aided Mol Des 2014; 28:735-49. [PMID: 24865848 DOI: 10.1007/s10822-014-9755-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Received: 02/18/2014] [Accepted: 05/19/2014] [Indexed: 01/27/2023]
Abstract
Understanding unwinding and melting of double helical DNA is very important to characterize role of DNA in replication, transcription, translation etc. Sequence dependent melting thermodynamics is used extensively for detecting promoter regions but melting studies are generally done for short oligonucleotides. This study reports several molecular dynamics (MD) simulations of homopolymeric poly(dA).poly(dT) as regular oligonucleotide fragments as well as its corresponding polymeric constructs with water and charge-neutralizing counterions at different temperatures ranging from 300 to 400 K. We have eliminated the end-effect or terminal peeling propensity by employing MD simulation of DNA oligonucleotides in such a manner that gives rise to properties of polymeric DNA of infinite length. The dynamic properties such as basepairing and stacking geometry, groove width, backbone conformational parameters, bending, distribution of counter ions and number of hydrogen bonds of oligomeric and polymeric constructs of poly(dA).poly(dT) have been analyzed. The oligomer shows terminal fraying or peeling effect at temperatures above 340 K. The polymer shows partial melting at elevated temperatures although complete denaturations of basepairs do not take place. The analysis of cross strand hydrogen bonds shows that the number of N-H···O hydrogen bonds increases with increase in temperature while C-H···O hydrogen bond frequencies decrease with temperature. Restructuring of counterions in the minor groove with temperature appear as initiation of melting in duplex structures.
Affiliation(s)
- Sanchita Mukherjee
  - Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
10
Matyášek R, Fulneček J, Kovařík A. Evaluation of DNA bending models in their capacity to predict electrophoretic migration anomalies of satellite DNA sequences. Electrophoresis 2013; 34:2511-21. [PMID: 23784748 DOI: 10.1002/elps.201300227] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 01/21/2013] [Revised: 06/06/2013] [Accepted: 06/06/2013] [Indexed: 01/05/2023]
Abstract
DNA containing a sequence that generates a local curvature exhibits a pronounced retardation in electrophoretic mobility. Various theoretical models have been proposed to explain relationship between DNA structural features and migration anomaly. Here, we studied the capacity of 15 static wedge-bending models to predict electrophoretic behavior of 69 satellite monomers derived from four divergent families. All monomers exhibited retarded mobility in PAGE corresponding to retardation factors ranging 1.02-1.54. The curvature varied both within and across the groups and correlated with the number, position, and lengths of A-tracts. Two dinucleotide models provided strong correlation between gel mobility and curvature prediction; two trinucleotide models were satisfactory while remaining dinucleotide models provided intermediate results with reliable prediction for subsets of sequences only. In some cases, similarly shaped molecules exhibited relatively large differences in mobility and vice versa. Generally less accurate predictions were obtained in groups containing less homogeneous sequences possessing distinct structural features. In conclusion, relatively universal theoretical models were identified suitable for the analysis of natural sequences known to harbor relatively moderate curvature. These models could be potentially applied to genome wide studies. However, in silico predictions should be viewed in context of experimental measurement of intrinsic DNA curvature.
Affiliation(s)
- Roman Matyášek
  - Laboratory of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i, Brno, Czech Republic
11
Tan HK, Li D, Gray RK, Yang Z, Ng MTT, Zhang H, Tan JMR, Hiew SH, Lee JY, Li T. Interference of intrinsic curvature of DNA by DNA-intercalating agents. Org Biomol Chem 2012; 10:2227-30. [PMID: 22331171 DOI: 10.1039/c2ob06811g] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 02/05/2023]
Abstract
It has been demonstrated in our studies that the intrinsic curvature of DNA can be easily interrupted by low concentrations of chloroquine and ethidium bromide. In addition, the changes of DNA curvature caused by varying the concentration of these two DNA intercalators can be readily verified through using an atomic force microscope.
Affiliation(s)
- Hong Kee Tan
  - Division of Chemistry and Biological Chemistry, School of Physical and Mathematical Sciences, Nanyang Technological University, 21 Nanyang Link, Singapore 637371
12
Yang X, Yan Y. Statistical investigation of position-specific deformation pattern of nucleosome DNA based on multiple conformational properties. Bioinformation 2011; 7:120-4. [PMID: 22125381 PMCID: PMC3218313 DOI: 10.6026/97320630007120] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 09/06/2011] [Accepted: 09/11/2011] [Indexed: 11/23/2022] Open
Abstract
The histone octamer induced bending of DNA into the super-helix structure in nucleosome core particle, is very unique and vital for DNA packing into chromatin. We collected 48 nucleosome crystal structures from PDB and applied a multivariate analysis on the nucleosome structural data. Based on the anisotropic nature of DNA structure, a principal conformational subspace (PCS) is derived from multiple properties to represent the most significant variances of nucleosome DNA structures. The coupling of base pair-oriented parameters with sugar phosphate backbone parameters presented in principal dimensionalities reveals two main deformation modes that have supplemented the existing physical model. By using sequence alignment-based statistics, a position-dependent conformational map for the super-helical DNA path is established. The result shows that the crystal structures of nucleosome DNA have much consistency in position-specific structural variations and certain periodicity is found to exist in these variations. Thus, the positions with obvious deformation patterns along the DNA path in nucleosome core particle are relatively conservative from the perspective of statistics.
Affiliation(s)
- Xi Yang
  - Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
- Yan Yan
  - Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong
  - School of Electrical and Information Engineering, University of Sydney, NSW 2006, Australia
13
Marathe A, Bansal M. An ensemble of B-DNA dinucleotide geometries lead to characteristic nucleosomal DNA structure and provide plasticity required for gene expression. BMC Struct Biol 2011; 11:1. [PMID: 21208404 PMCID: PMC3031206 DOI: 10.1186/1472-6807-11-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Received: 07/09/2010] [Accepted: 01/05/2011] [Indexed: 12/14/2022]
Abstract
BACKGROUND A nucleosome is the fundamental repeating unit of the eukaryotic chromosome. It has been shown that the positioning of a majority of nucleosomes is primarily controlled by factors other than the intrinsic preference of the DNA sequence. One of the key questions in this context is the role, if any, that can be played by the variability of nucleosomal DNA structure. RESULTS In this study, we have addressed this question by analysing the variability at the dinucleotide and trinucleotide as well as longer length scales in a dataset of nucleosome X-ray crystal structures. We observe that the nucleosome structure displays remarkable local level structural versatility within the B-DNA family. The nucleosomal DNA also incorporates a large number of kinks. CONCLUSIONS Based on our results, we propose that the local and global level versatility of B-DNA structure may be a significant factor modulating the formation of nucleosomes in the vicinity of high-plasticity genes, and in varying the probability of binding by regulatory proteins. Hence, these factors should be incorporated in the prediction algorithms and there may not be a unique 'template' for predicting putative nucleosome sequences. In addition, the multimodal distribution of dinucleotide parameters for some steps and the presence of a large number of kinks in the nucleosomal DNA structure indicate that the linear elastic model, used by several algorithms to predict the energetic cost of nucleosome formation, may lead to incorrect results.
Affiliation(s)
- Arvind Marathe
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore - 12, India
- Manju Bansal
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore - 12, India
14
Marathe A, Karandur D, Bansal M. Small local variations in B-form DNA lead to a large variety of global geometries which can accommodate most DNA-binding protein motifs. BMC Struct Biol 2009; 9:24. [PMID: 19393049 PMCID: PMC2687451 DOI: 10.1186/1472-6807-9-24] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Received: 12/05/2008] [Accepted: 04/24/2009] [Indexed: 01/01/2023]
Abstract
BACKGROUND An important question of biological relevance is the polymorphism of the double-helical DNA structure in its free form, and the changes that it undergoes upon protein-binding. We have analysed a database of free DNA crystal structures to assess the inherent variability of the free DNA structure and have compared it with a database of protein-bound DNA crystal structures to ascertain the protein-induced variations. RESULTS Most of the dinucleotide steps in free DNA display high flexibility, assuming different conformations in a sequence-dependent fashion. With the exception of the AA/TT and GA/TC steps, which are 'A-phobic', and the GG/CC step, which is 'A-philic', the dinucleotide steps show no preference for A or B forms of DNA. Protein-bound DNA adopts the B-conformation most often. However, in certain cases, protein-binding causes the DNA backbone to take up energetically unfavourable conformations. At the gross structural level, several protein-bound DNA duplexes are observed to assume a curved conformation in the absence of any large distortions, indicating that a series of normal structural parameters at the dinucleotide and trinucleotide level, similar to the ones in free B-DNA, can give rise to curvature at the overall level. CONCLUSION The results illustrate that the free DNA molecule, even in the crystalline state, samples a large amount of conformational space, encompassing both the A and the B-forms, in the absence of any large ligands. A-form as well as some non-A, non-B, distorted geometries are observed for a small number of dinucleotide steps in DNA structures bound to the proteins belonging to a few specific families. 
However, for most of the bound DNA structures, across a wide variety of protein families, the average step parameters for various dinucleotide sequences as well as backbone torsion angles are observed to be quite close to the free 'B-like' DNA oligomer values, highlighting the flexibility and biological significance of this structural form.
Affiliation(s)
- Arvind Marathe
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
15
Rangannan V, Bansal M. Relative stability of DNA as a generic criterion for promoter prediction: whole genome annotation of microbial genomes with varying nucleotide base composition. Mol Biosyst 2009; 5:1758-69. [DOI: 10.1039/b906535k] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Indexed: 11/21/2022]
16
Cao XQ, Zeng J, Yan H. Structural property of regulatory elements in human promoters. Phys Rev E Stat Nonlin Soft Matter Phys 2008; 77:041908. [PMID: 18517657 DOI: 10.1103/physreve.77.041908] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 11/19/2007] [Indexed: 05/26/2023]
Abstract
The capacity of transcription factors to activate gene expression is encoded in the promoter sequences, which are composed of short regulatory motifs that function as transcription factor binding sites (TFBSs) for specific proteins. To the best of our knowledge, the structural property of TFBSs that controls transcription is still poorly understood. Rigidity is one of the important structural properties of DNA, and plays an important role in guiding DNA-binding proteins to the target sites efficiently. After analyzing the rigidity of 2897 TFBSs in 1871 human promoters, we show that TFBSs are generally more flexible than other genomic regions such as exons, introns, 3' untranslated regions, and TFBS-poor promoter regions. Furthermore, we find that the density of TFBSs is consistent with the average rigidity profile of human promoters upstream of the transcription start site, which implies that TFBSs directly influence the promoter structure. We also examine the local rigid regions probably caused by specific TFBSs such as the DNA sequence TATA(A/T)A(A/T) box, which may inhibit nucleosomes and thereby facilitate the access of transcription factors bound nearby. Our results suggest that the structural property of TFBSs accounts for the promoter structure as well as promoter activity.
Affiliation(s)
- Xiao-Qin Cao
- Department of Electronic Engineering, City University of Hong Kong, Tat Chee Avenue 83, Hong Kong
|
17
|
Dutta S, Singhal P, Agrawal P, Tomer R, Kritee K, Khurana E, Jayaram B. A physicochemical model for analyzing DNA sequences. J Chem Inf Model 2006; 46:78-85. [PMID: 16426042 DOI: 10.1021/ci050119x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Indexed: 11/30/2022]
Abstract
In search of an ab initio model to characterize DNA sequences as genes and nongenes, we examined some physicochemical properties of each trinucleotide (codon), which could accomplish this task. We constructed three-dimensional vectors for each double-helical trinucleotide sequence considering hydrogen-bonding energy, stacking energy, and a third parameter, which we provisionally identified with DNA-protein interactions. As this three-dimensional vector moves along any genome, the net orientation of the resultant vector should differ significantly for gene and nongene regions to make a distinction feasible, if the underlying model has some merits. An analysis of 331 prokaryotic genomes comprising a total of 294 786 experimentally verified genes (nonoverlapping) and an equal number of nongenes presents a proof of concept of the model without the need for further parametrization. Also, initial analyses on Saccharomyces cerevisiae and Arabidopsis thaliana suggest that the methodology is extendable to eukaryotes. The physicochemical model (ChemGenome1.0) introduced has the potential to be developed into a gene-finding algorithm and, more pressingly, could be employed for an independent assessment of the annotation of DNA sequences.
Affiliation(s)
- Samrat Dutta
- Department of Chemistry and Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology, Hauz Khas, New Delhi
|
18
|
Kanhere A, Bansal M. Structural properties of promoters: similarities and differences between prokaryotes and eukaryotes. Nucleic Acids Res 2005; 33:3165-75. [PMID: 15939933 PMCID: PMC1143579 DOI: 10.1093/nar/gki627] [Citation(s) in RCA: 91] [Impact Index Per Article: 4.8] [Indexed: 12/27/2022] Open
Abstract
During the process of transcription, RNA polymerase can exactly locate a promoter sequence in the complex maze of a genome. Several experimental studies and computational analyses have shown that the promoter sequences apparently possess some special properties, such as unusual DNA structures and low stability, which make them distinct from the rest of the genome. But most of these studies have been carried out on a particular set of promoter sequences or on promoter sequences from similar organisms. To examine whether the promoters from a wide variety of organisms share these special properties, we have carried out an analysis of sets of promoters from bacteria, vertebrates and plants. These promoters were analyzed with respect to the prediction of three different properties, such as DNA curvature, bendability and stability, which are relevant to transcription. All the promoter sequences are predicted to share certain features, such as stability and bendability profiles, but there are significant differences in DNA curvature profiles and nucleotide composition between the different organisms. These similarities and differences are correlated with some of the known facts about transcription process in the promoters from the three groups of organisms.
Affiliation(s)
- Manju Bansal
- To whom correspondence should be addressed. Tel: +91 80 2293 2534; Fax: +91 80 2360 0535;
|
19
|
Beveridge DL, Barreiro G, Byun KS, Case DA, Cheatham TE, Dixit SB, Giudice E, Lankas F, Lavery R, Maddocks JH, Osman R, Seibert E, Sklenar H, Stoll G, Thayer KM, Varnai P, Young MA. Molecular dynamics simulations of the 136 unique tetranucleotide sequences of DNA oligonucleotides. I. Research design and results on d(CpG) steps. Biophys J 2004; 87:3799-813. [PMID: 15326025 PMCID: PMC1304892 DOI: 10.1529/biophysj.104.045252] [Citation(s) in RCA: 218] [Impact Index Per Article: 10.9] [Received: 05/20/2004] [Accepted: 08/03/2004] [Indexed: 11/18/2022] Open
Abstract
We describe herein a computationally intensive project aimed at carrying out molecular dynamics (MD) simulations including water and counterions on B-DNA oligomers containing all 136 unique tetranucleotide base sequences. This initiative was undertaken by an international collaborative effort involving nine research groups, the "Ascona B-DNA Consortium" (ABC). Calculations were carried out on the 136 cases imbedded in 39 DNA oligomers with repeating tetranucleotide sequences, capped on both ends by GC pairs and each having a total length of 15 nucleotide pairs. All MD simulations were carried out using a well-defined protocol, the AMBER suite of programs, and the parm94 force field. Phase I of the ABC project involves a total of approximately 0.6 μs of simulation for systems containing approximately 24,000 atoms. The resulting trajectories involve 600,000 coordinate sets and represent approximately 400 gigabytes of data. In this article, the research design, details of the simulation protocol, informatics issues, and the organization of the results into a web-accessible database are described. Preliminary results from 15-ns MD trajectories are presented for the d(CpG) step in its 10 unique sequence contexts, and issues of stability and convergence, the extent of quasiergodic problems, and the possibility of long-lived conformational substates are discussed.
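The counts quoted in this abstract (136 unique tetranucleotides, 10 unique contexts for the d(CpG) step) can be reproduced with a short enumeration. The sketch below is illustrative only and is not taken from the paper: it assumes that two sequences count as the same case when one is the reverse complement of the other (the same duplex read from the opposite strand), and the helper names `reverse_complement` and `unique_duplex_sequences` are hypothetical.

```python
from itertools import product

# Watson-Crick complement table for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_duplex_sequences(candidates):
    """Collapse sequences that describe the same duplex read from either strand.

    Assumption: equivalence is defined by reverse complementarity only.
    Each duplex is represented by the lexicographically smaller strand.
    """
    return {min(s, reverse_complement(s)) for s in candidates}


# All 4^4 = 256 tetranucleotides, then merged with their reverse complements.
tetranucleotides = ["".join(p) for p in product("ACGT", repeat=4)]
unique_tetranucleotides = unique_duplex_sequences(tetranucleotides)

# The CpG step with one flanking base on each side: XCGY.
cpg_contexts = [x + "CG" + y for x, y in product("ACGT", repeat=2)]
unique_cpg_contexts = unique_duplex_sequences(cpg_contexts)

print(len(unique_tetranucleotides))  # 136
print(len(unique_cpg_contexts))      # 10
```

Under that assumption, the 16 self-complementary tetranucleotides each count once and the remaining 240 pair up, giving 16 + 120 = 136; likewise 4 self-complementary XCGY contexts plus 6 pairs give 10.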
Affiliation(s)
- David L Beveridge
- Chemistry Department, Molecular Biology & Biochemistry Department, and Molecular Biophysics Program, Wesleyan University, Middletown, Connecticut 06459, USA.
|
20
|
Beveridge DL, Dixit SB, Barreiro G, Thayer KM. Molecular dynamics simulations of DNA curvature and flexibility: helix phasing and premelting. Biopolymers 2004; 73:380-403. [PMID: 14755574 DOI: 10.1002/bip.20019] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Indexed: 11/11/2022]
Abstract
Recent studies of DNA axis curvature and flexibility based on molecular dynamics (MD) simulations on DNA are reviewed. The MD simulations are on DNA sequences up to 25 base pairs in length, including explicit consideration of counterions and waters in the computational model. MD studies are described for ApA steps, A-tracts, for sequences of A-tracts with helix phasing. In MD modeling, ApA steps and A-tracts in aqueous solution are essentially straight, relatively rigid, and exhibit the characteristic features associated with the B'-form of DNA. The results of MD modeling of A-tract oligonucleotides are validated by close accord with corresponding crystal structure results and nuclear magnetic resonance (NMR) nuclear Overhauser effect (NOE) and residual dipolar coupling (RDC) structures of d(CGCGAATTCGCG) and d(GGCAAAAAACGG). MD simulation successfully accounts for enhanced axis curvature in a set of three sequences with phased A-tracts studied to date. The primary origin of the axis curvature in the MD model is found at those pyrimidine/purine YpR "flexible hinge points" in a high roll, open hinge conformational substate. In the MD model of axis curvature in a DNA sequence with both phased A-tracts and YpR steps, the A-tracts appear to act as positioning elements that make the helix phasing more precise, and key YpR steps in the open hinge state serve as curvature elements. Our simulations on a phased A-tract sequence as a function of temperature show that the MD simulations exhibit a premelting transition in close accord with experiment, and predict that the mechanism involves a B'-to-B transition within A-tracts coupled with the prediction of a transition in key YpR steps from the high roll, open hinge, to a low roll, closed hinge substate. Diverse experimental observations on DNA curvature phenomena are examined in light of the MD model with no serious discrepancies. 
The collected MD results provide independent support for the "non-A-tract model" of DNA curvature. The "junction model" is indicated to be a special case of the non-A-tract model when there is a Y base at the 5' end of an A-tract. In accord with crystallography, the "ApA wedge model" is not supported by MD.
Affiliation(s)
- D L Beveridge
- Department of Chemistry, Wesleyan University, Middletown CT 06459, USA.
|