1
|
Roldán-Piñero C, Luengo-Márquez J, Assenza S, Pérez R. Systematic Comparison of Atomistic Force Fields for the Mechanical Properties of Double-Stranded DNA. J Chem Theory Comput 2024; 20:2261-2272. [PMID: 38411091 PMCID: PMC10938644 DOI: 10.1021/acs.jctc.3c01089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 02/14/2024] [Accepted: 02/14/2024] [Indexed: 02/28/2024]
Abstract
The response of double-stranded DNA to external mechanical stress plays a central role in its interactions with the protein machinery in the cell. Modern atomistic force fields have been shown to provide highly accurate predictions for the fine structural features of the duplex. In contrast, and despite their pivotal function, less attention has been devoted to the accuracy of the prediction of the elastic parameters. Several reports have addressed the flexibility of double-stranded DNA via all-atom molecular dynamics, yet the collected information is insufficient to have a clear understanding of the relative performance of the various force fields. In this work, we fill this gap by performing a systematic study in which several systems, characterized by different sequence contexts, are simulated with the most popular force fields within the AMBER family, bcs1 and OL15, as well as with CHARMM36. Analysis of our results, together with their comparison with previous work focused on bsc0, allows us to unveil the differences in the predicted rigidity between the newest force fields and suggests a roadmap to test their performance against experiments. In the case of the stretch modulus, we reconcile these differences, showing that a single mapping between sequence-dependent conformation and elasticity via the crookedness parameter captures simultaneously the results of all force fields, supporting the key role of crookedness in the mechanical response of double-stranded DNA.
Collapse
Affiliation(s)
- Carlos Roldán-Piñero
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
| | - Juan Luengo-Márquez
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
- Instituto
Nicolás Cabrera, Universidad Autónoma
de Madrid, E-28049 Madrid, Spain
| | - Salvatore Assenza
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
- Instituto
Nicolás Cabrera, Universidad Autónoma
de Madrid, E-28049 Madrid, Spain
- Condensed
Matter Physics Center (IFIMAC), Universidad
Autónoma de Madrid, E-28049 Madrid, Spain
| | - Rubén Pérez
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
- Condensed
Matter Physics Center (IFIMAC), Universidad
Autónoma de Madrid, E-28049 Madrid, Spain
| |
Collapse
|
2
|
Chhetri KB, Jang YH, Lansac Y, Maiti PK. DNA groove preference shift upon phosphorylation of a protamine-like cationic peptide. Phys Chem Chem Phys 2023; 25:31335-31345. [PMID: 37960891 DOI: 10.1039/d3cp03803c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Protamines, arginine-rich DNA-binding proteins, are responsible for chromatin compaction in sperm cells, but their DNA groove preference, major or minor, is not clearly identified. We herein study the DNA groove preference of a short protamine-like cationic peptide before and after phosphorylation, using all-atom molecular dynamics and umbrella sampling simulations. According to various thermodynamic and structural analyses, a peptide in its non-phosphorylated native state prefers the minor groove over the major groove, but phosphorylation of the peptide bound to the minor groove not only reduces its binding affinity but also brings a serious deformation of the minor groove, eliminating the minor-groove preference. As protamines are heavily phosphorylated before binding to DNA, we expect that the structurally disordered phosphorylated protamines would prefer major grooves to enter into DNA during spermatogenesis.
Collapse
Affiliation(s)
- Khadka B Chhetri
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India.
- Department of Physics, Prithvinarayan Campus, Tribhuvan University, Pokhara, Nepal
| | - Yun Hee Jang
- Department of Energy Science and Engineering, DGIST, Daegu 42988, Korea
- GREMAN, CNRS UMR 7347, Université de Tours, 37200 Tours, France
- Laboratoire de Physique des Solides, CNRS UMR 8502, Université Paris Saclay, 91405 Orsay, France
| | - Yves Lansac
- GREMAN, CNRS UMR 7347, Université de Tours, 37200 Tours, France
- Laboratoire de Physique des Solides, CNRS UMR 8502, Université Paris Saclay, 91405 Orsay, France
| | - Prabal K Maiti
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India.
| |
Collapse
|
3
|
Cooper BH, Dantas Machado AC, Gan Y, Aparicio O, Rohs R. DNA binding specificity of all four Saccharomyces cerevisiae forkhead transcription factors. Nucleic Acids Res 2023; 51:5621-5633. [PMID: 37177995 PMCID: PMC10287902 DOI: 10.1093/nar/gkad372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 04/19/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023] Open
Abstract
Quantifying the nucleotide preferences of DNA binding proteins is essential to understanding how transcription factors (TFs) interact with their targets in the genome. High-throughput in vitro binding assays have been used to identify the inherent DNA binding preferences of TFs in a controlled environment isolated from confounding factors such as genome accessibility, DNA methylation, and TF binding cooperativity. Unfortunately, many of the most common approaches for measuring binding preferences are not sensitive enough for the study of moderate-to-low affinity binding sites, and are unable to detect small-scale differences between closely related homologs. The Forkhead box (FOX) family of TFs is known to play a crucial role in regulating a variety of key processes from proliferation and development to tumor suppression and aging. By using the high-sequencing depth SELEX-seq approach to study all four FOX homologs in Saccharomyces cerevisiae, we have been able to precisely quantify the contribution and importance of nucleotide positions all along an extended binding site. Essential to this process was the alignment of our SELEX-seq reads to a set of candidate core sequences determined using a recently developed tool for the alignment of enriched k-mers and a newly developed approach for the reprioritization of candidate cores.
Collapse
Affiliation(s)
- Brendon H Cooper
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Ana Carolina Dantas Machado
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Yan Gan
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Oscar M Aparicio
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
| | - Remo Rohs
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- Departments of Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
4
|
Olson WK, Li Y, Fenley MO. Insights into DNA solvation found in protein-DNA structures. Biophys J 2022; 121:4749-4758. [PMID: 36380591 PMCID: PMC9808563 DOI: 10.1016/j.bpj.2022.11.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 10/31/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
The proteins that bind double-helical DNA present various microenvironments that sense and/or induce signals in the genetic material. The high-resolution structures of protein-DNA complexes reveal the nature of both the microenvironments and the conformational responses in DNA and protein. Complex networks of interactions within the structures somehow tie the protein and DNA together and induce the observed spatial forms. Here we show how the cumulative buildup of amino acid atoms around the sugars, phosphates, and bases in different protein-DNA complexes produces a binding cloud around the double helix and how different types of atoms fill that cloud. Rather than focusing on the principles of molecular binding and recognition suggested by the arrangements of amino acids and nucleotides in the macromolecular complexes, we consider the proteins in contact with DNA as organized solvents. We describe differences in the mix of atoms that come in closest contact with DNA, subtle sequence-dependent features in the microenvironment of the sugar-phosphate backbone, a direct link between the localized buildup of ionic species and the electrostatic potential surfaces of the DNA bases, and sites of atomic buildup above and below the basepair planes that transmit the unique features of the base environments along the chain backbone. The inferences about solvation that can be drawn from the survey provide new stimuli for improvement of nucleic acid force fields and fresh ideas for exploration of the properties of DNA in solution.
Collapse
Affiliation(s)
- Wilma K Olson
- Department of Chemistry and Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey.
| | - Yun Li
- Department of Chemistry and Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey
| | - Marcia O Fenley
- Department of Chemistry and Chemical Biology and Center for Quantitative Biology, Rutgers, the State University of New Jersey, Piscataway, New Jersey; Institute of Molecular Biophysics, Florida State University, Tallahassee, Florida
| |
Collapse
|
5
|
Ghoshdastidar D, Bansal M. Flexibility of flanking DNA is a key determinant of transcription factor affinity for the core motif. Biophys J 2022; 121:3987-4000. [PMID: 35978548 PMCID: PMC9674967 DOI: 10.1016/j.bpj.2022.08.015] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 07/28/2022] [Accepted: 08/15/2022] [Indexed: 11/02/2022] Open
Abstract
Selective gene regulation is mediated by recognition of specific DNA sequences by transcription factors (TFs). The extremely challenging task of searching out specific cognate DNA binding sites among several million putative sites within the eukaryotic genome is achieved by complex molecular recognition mechanisms. Elements of this recognition code include the core binding sequence, the flanking sequence context, and the shape and conformational flexibility of the composite binding site. To unravel the extent to which DNA flexibility modulates TF binding, in this study, we employed experimentally guided molecular dynamics simulations of ternary complex of closely related Hox heterodimers Exd-Ubx and Exd-Scr with DNA. Results demonstrate that flexibility signatures embedded in the flanking sequences impact TF binding at the cognate binding site. A DNA sequence has intrinsic shape and flexibility features. While shape features are localized, our analyses reveal that flexibility features of the flanking sequences percolate several basepairs and allosterically modulate TF binding at the core. We also show that lack of flexibility in the motif context can render the cognate site resistant to protein-induced shape changes and subsequently lower TF binding affinity. Overall, this study suggests that flexibility-guided DNA shape, and not merely the static shape, is a key unexplored component of the complex DNA-TF recognition code.
Collapse
Affiliation(s)
| | - Manju Bansal
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, Karnataka, India.
| |
Collapse
|
6
|
Beyond the double helix: DNA structural diversity and the PDB. J Biol Chem 2021; 296:100553. [PMID: 33744292 PMCID: PMC8063756 DOI: 10.1016/j.jbc.2021.100553] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Revised: 01/15/2021] [Accepted: 03/16/2021] [Indexed: 12/11/2022] Open
Abstract
The determination of the double helical structure of DNA in 1953 remains the landmark event in the development of modern biological and biomedical science. This structure has also been the starting point for the determination of some 2000 DNA crystal structures in the subsequent 68 years. Their structural diversity has extended to the demonstration of sequence-dependent local structure in duplex DNA, to DNA bending in short and long sequences and in the DNA wound round the nucleosome, and to left-handed duplex DNAs. Beyond the double helix itself, in circumstances where DNA sequences are or can be induced to unwind from being duplex, a wide variety of topologies and forms can exist. Quadruplex structures, based on four-stranded cores of stacked G-quartets, are prevalent though not randomly distributed in the human and other genomes and can play roles in transcription, translation, and replication. Yet more complex folds can result in DNAs with extended tertiary structures and enzymatic/catalytic activity. The Protein Data Bank is the depository of all these structures, and the resource where structures can be critically examined and validated, as well as compared one with another to facilitate analysis of conformational and base morphology features. This review will briefly survey the major structural classes of DNAs and illustrate their significance, together with some examples of how the use of the Protein Data Bank by for example, data mining, has illuminated DNA structural concepts.
Collapse
|
7
|
Afek A, Shi H, Rangadurai A, Sahay H, Senitzki A, Xhani S, Fang M, Salinas R, Mielko Z, Pufall MA, Poon GMK, Haran TE, Schumacher MA, Al-Hashimi HM, Gordân R. DNA mismatches reveal conformational penalties in protein-DNA recognition. Nature 2020; 587:291-296. [PMID: 33087930 PMCID: PMC7666076 DOI: 10.1038/s41586-020-2843-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2019] [Accepted: 09/17/2020] [Indexed: 12/17/2022]
Abstract
Transcription factors recognize specific genomic sequences to regulate complex gene-expression programs. Although it is well-established that transcription factors bind to specific DNA sequences using a combination of base readout and shape recognition, some fundamental aspects of protein-DNA binding remain poorly understood1,2. Many DNA-binding proteins induce changes in the structure of the DNA outside the intrinsic B-DNA envelope. However, how the energetic cost that is associated with distorting the DNA contributes to recognition has proven difficult to study, because the distorted DNA exists in low abundance in the unbound ensemble3-9. Here we use a high-throughput assay that we term SaMBA (saturation mismatch-binding assay) to investigate the role of DNA conformational penalties in transcription factor-DNA recognition. In SaMBA, mismatched base pairs are introduced to pre-induce structural distortions in the DNA that are much larger than those induced by changes in the Watson-Crick sequence. Notably, approximately 10% of mismatches increased transcription factor binding, and for each of the 22 transcription factors that were examined, at least one mismatch was found that increased the binding affinity. Mismatches also converted non-specific sites into high-affinity sites, and high-affinity sites into 'super sites' that exhibit stronger affinity than any known canonical binding site. Determination of high-resolution X-ray structures, combined with nuclear magnetic resonance measurements and structural analyses, showed that many of the DNA mismatches that increase binding induce distortions that are similar to those induced by protein binding-thus prepaying some of the energetic cost incurred from deforming the DNA. Our work indicates that conformational penalties are a major determinant of protein-DNA recognition, and reveals mechanisms by which mismatches can recruit transcription factors and thus modulate replication and repair activities in the cell10,11.
Collapse
Affiliation(s)
- Ariel Afek
- Center for Genomic and Computational Biology, Duke University School of Medicine, Durham, NC, USA
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, USA
| | - Honglue Shi
- Department of Chemistry, Duke University, Durham, NC, USA
| | - Atul Rangadurai
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | - Harshit Sahay
- Center for Genomic and Computational Biology, Duke University School of Medicine, Durham, NC, USA
- Program in Computational Biology and Bioinformatics, Duke University School of Medicine, Durham, NC, USA
| | - Alon Senitzki
- Department of Biology, Technion-Israel Institute of Technology, Haifa, Israel
| | - Suela Xhani
- Department of Chemistry, Georgia State University, Atlanta, GA, USA
| | - Mimi Fang
- Department of Biochemistry, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
- Holden Comprehensive Cancer Center, University of Iowa, Iowa City, IA, USA
| | - Raul Salinas
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | - Zachery Mielko
- Center for Genomic and Computational Biology, Duke University School of Medicine, Durham, NC, USA
- Program in Genetics and Genomics, Duke University School of Medicine, Durham, NC, USA
| | - Miles A Pufall
- Department of Biochemistry, Carver College of Medicine, University of Iowa, Iowa City, IA, USA
- Holden Comprehensive Cancer Center, University of Iowa, Iowa City, IA, USA
| | - Gregory M K Poon
- Department of Chemistry, Georgia State University, Atlanta, GA, USA
- Center for Diagnostics and Therapeutics, Georgia State University, Atlanta, GA, USA
| | - Tali E Haran
- Department of Biology, Technion-Israel Institute of Technology, Haifa, Israel
| | - Maria A Schumacher
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | - Hashim M Al-Hashimi
- Department of Chemistry, Duke University, Durham, NC, USA.
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA.
| | - Raluca Gordân
- Center for Genomic and Computational Biology, Duke University School of Medicine, Durham, NC, USA.
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, USA.
- Department of Computer Science, Duke University, Durham, NC, USA.
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC, USA.
| |
Collapse
|
8
|
Dantas Machado AC, Cooper BH, Lei X, Di Felice R, Chen L, Rohs R. Landscape of DNA binding signatures of myocyte enhancer factor-2B reveals a unique interplay of base and shape readout. Nucleic Acids Res 2020; 48:8529-8544. [PMID: 32738045 PMCID: PMC7470950 DOI: 10.1093/nar/gkaa642] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 07/16/2020] [Accepted: 07/22/2020] [Indexed: 01/08/2023] Open
Abstract
Myocyte enhancer factor-2B (MEF2B) has the unique capability of binding to its DNA target sites with a degenerate motif, while still functioning as a gene-specific transcriptional regulator. Identifying its DNA targets is crucial given regulatory roles exerted by members of the MEF2 family and MEF2B's involvement in B-cell lymphoma. Analyzing structural data and SELEX-seq experimental results, we deduced the DNA sequence and shape determinants of MEF2B target sites on a high-throughput basis in vitro for wild-type and mutant proteins. Quantitative modeling of MEF2B binding affinities and computational simulations exposed the DNA readout mechanisms of MEF2B. The resulting binding signature of MEF2B revealed distinct intricacies of DNA recognition compared to other transcription factors. MEF2B uses base readout at its half-sites combined with shape readout at the center of its degenerate motif, where A-tract polarity dictates nuances of binding. The predominant role of shape readout at the center of the core motif, with most contacts formed in the minor groove, differs from previously observed protein-DNA readout modes. MEF2B, therefore, represents a unique protein for studies of the role of DNA shape in achieving binding specificity. MEF2B-DNA recognition mechanisms are likely representative for other members of the MEF2 family.
Collapse
Affiliation(s)
- Ana Carolina Dantas Machado
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Brendon H Cooper
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Xiao Lei
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Rosa Di Felice
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Physics & Astronomy, University of Southern California, Los Angeles, CA 90089, USA
| | - Lin Chen
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
| | - Remo Rohs
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Physics & Astronomy, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
9
|
Chiu TP, Xin B, Markarian N, Wang Y, Rohs R. TFBSshape: an expanded motif database for DNA shape features of transcription factor binding sites. Nucleic Acids Res 2020; 48:D246-D255. [PMID: 31665425 PMCID: PMC7145579 DOI: 10.1093/nar/gkz970] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 10/08/2019] [Accepted: 10/11/2019] [Indexed: 12/31/2022] Open
Abstract
TFBSshape (https://tfbsshape.usc.edu) is a motif database for analyzing structural profiles of transcription factor binding sites (TFBSs). The main rationale for this database is to be able to derive mechanistic insights in protein-DNA readout modes from sequencing data without available structures. We extended the quantity and dimensionality of TFBSshape, from mostly in vitro to in vivo binding and from unmethylated to methylated DNA. This new release of TFBSshape improves its functionality and launches a responsive and user-friendly web interface for easy access to the data. The current expansion includes new entries from the most recent collections of transcription factors (TFs) from the JASPAR and UniPROBE databases, methylated TFBSs derived from in vitro high-throughput EpiSELEX-seq binding assays and in vivo methylated TFBSs from the MeDReaders database. TFBSshape content has increased to 2428 structural profiles for 1900 TFs from 39 different species. The structural profiles for each TFBS entry now include 13 shape features and minor groove electrostatic potential for standard DNA and four shape features for methylated DNA. We improved the flexibility and accuracy for the shape-based alignment of TFBSs and designed new tools to compare methylated and unmethylated structural profiles of TFs and methods to derive DNA shape-preserving nucleotide mutations in TFBSs.
Collapse
Affiliation(s)
- Tsu-Pei Chiu
- Quantitative and Computational Biology, Departments of Biological Sciences, Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Beibei Xin
- Quantitative and Computational Biology, Departments of Biological Sciences, Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Nicholas Markarian
- Quantitative and Computational Biology, Departments of Biological Sciences, Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Yingfei Wang
- Quantitative and Computational Biology, Departments of Biological Sciences, Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Remo Rohs
- Quantitative and Computational Biology, Departments of Biological Sciences, Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
10
|
Lin M, Guo JT. New insights into protein-DNA binding specificity from hydrogen bond based comparative study. Nucleic Acids Res 2020; 47:11103-11113. [PMID: 31665426 PMCID: PMC6868434 DOI: 10.1093/nar/gkz963] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 10/06/2019] [Accepted: 10/10/2019] [Indexed: 12/25/2022] Open
Abstract
Knowledge of protein-DNA binding specificity has important implications in understanding DNA metabolism, transcriptional regulation and developing therapeutic drugs. Previous studies demonstrated hydrogen bonds between amino acid side chains and DNA bases play major roles in specific protein-DNA interactions. In this paper, we investigated the roles of individual DNA strands and protein secondary structure types in specific protein-DNA recognition based on side chain-base hydrogen bonds. By comparing the contribution of each DNA strand to the overall binding specificity between DNA-binding proteins with different degrees of binding specificity, we found that highly specific DNA-binding proteins show balanced hydrogen bonding with each of the two DNA strands while multi-specific DNA binding proteins are generally biased towards one strand. Protein-base pair hydrogen bonds, in which both bases of a base pair are involved in forming hydrogen bonds with amino acid side chains, are more prevalent in the highly specific protein-DNA complexes than those in the multi-specific group. Amino acids involved in side chain-base hydrogen bonds favor strand and coil secondary structure types in highly specific DNA-binding proteins while multi-specific DNA-binding proteins prefer helices.
Collapse
Affiliation(s)
- Maoxuan Lin
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Jun-Tao Guo
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
11
|
Retureau R, Foloppe N, Elbahnsi A, Oguey C, Hartmann B. A dynamic view of DNA structure within the nucleosome: Biological implications. J Struct Biol 2020; 211:107511. [PMID: 32311461 DOI: 10.1016/j.jsb.2020.107511] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 04/10/2020] [Accepted: 04/13/2020] [Indexed: 01/21/2023]
Abstract
Most of eukaryotic cellular DNA is packed in nucleosome core particles (NCPs), in which the DNA (DNANCP) is wrapped around histones. The influence of this organization on the intrinsic local dynamics of DNA is largely unknown, in particular because capturing such information from experiments remains notoriously challenging. Given the importance of dynamical properties in DNA functions, we addressed this issue using CHARMM36 MD simulations of a nucleosome containing the NCP positioning 601 sequence and four related free dodecamers. Comparison between DNANCP and free DNA reveals a limited impact of the dense DNA-histone interface on correlated motions of dinucleotide constituents and on fluctuations of inter base pair parameters. A characteristic feature intimately associated with the DNANCP super-helical path is a set of structural periodicities that includes a marked alternation of regions enriched in backbone BI and BII conformers. This observation led to uncover a convincing correspondence between the sequence effect on BI/BII propensities in both DNANCP and free DNA, strengthening the idea that the histone preference for particular DNA sequences relies on those intrinsic structural properties. These results offer for the first time a detailed view of the DNA dynamical behavior within NCP. They show in particular that the DNANCP dynamics is substantial enough to preserve the ability to structurally adjust to external proteins, for instance remodelers. Also, fresh structural arguments highlight the relevance of relationships between DNA sequence and structural properties for NCP formation. Overall, our work offers a more rational framework to approach the functional, biological roles of NCP.
Collapse
Affiliation(s)
- Romain Retureau
- Université Paris-Saclay, ENS Paris-Saclay, CNRS, Laboratoire de biologie et pharmacologie appliquée, 61 avenue du Président Wilson, 94235 Cachan cedex, France
| | | | - Ahmad Elbahnsi
- Université Paris-Saclay, ENS Paris-Saclay, CNRS, Laboratoire de biologie et pharmacologie appliquée, 61 avenue du Président Wilson, 94235 Cachan cedex, France; LPTM, UMR8089, CNRS, CY Cergy Paris Université, 2 avenue Adolphe Chauvin, 95302 Cergy-Pontoise, France
| | - Christophe Oguey
- LPTM, UMR8089, CNRS, CY Cergy Paris Université, 2 avenue Adolphe Chauvin, 95302 Cergy-Pontoise, France
| | - Brigitte Hartmann
- Université Paris-Saclay, ENS Paris-Saclay, CNRS, Laboratoire de biologie et pharmacologie appliquée, 61 avenue du Président Wilson, 94235 Cachan cedex, France.
| |
Collapse
|
12
|
Zeiske T, Baburajendran N, Kaczynska A, Brasch J, Palmer AG, Shapiro L, Honig B, Mann RS. Intrinsic DNA Shape Accounts for Affinity Differences between Hox-Cofactor Binding Sites. Cell Rep 2020; 24:2221-2230. [PMID: 30157419 DOI: 10.1016/j.celrep.2018.07.100] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 06/14/2018] [Accepted: 07/28/2018] [Indexed: 11/26/2022] Open
Abstract
Transcription factors bind to their binding sites over a wide range of affinities, yet how differences in affinity are encoded in DNA sequences is not well understood. Here, we report X-ray crystal structures of four heterodimers of the Hox protein AbdominalB bound with its cofactor Extradenticle to four target DNA molecules that differ in affinity by up to ∼20-fold. Remarkably, despite large differences in affinity, the overall structures are very similar in all four complexes. In contrast, the predicted shapes of the DNA binding sites (i.e., the intrinsic DNA shape) in the absence of bound protein are strikingly different from each other and correlate with affinity: binding sites that must change conformations upon protein binding have lower affinities than binding sites that have more optimal conformations prior to binding. Together, these observations suggest that intrinsic differences in DNA shape provide a robust mechanism for modulating affinity without affecting other protein-DNA interactions.
Collapse
Affiliation(s)
- Tim Zeiske
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | - Nithya Baburajendran
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | - Anna Kaczynska
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | - Julia Brasch
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | - Arthur G Palmer
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | - Lawrence Shapiro
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
| | - Barry Honig
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA.
| | - Richard S Mann
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA.
| |
Collapse
|
13
|
Lai X, Stigliani A, Vachon G, Carles C, Smaczniak C, Zubieta C, Kaufmann K, Parcy F. Building Transcription Factor Binding Site Models to Understand Gene Regulation in Plants. MOLECULAR PLANT 2019; 12:743-763. [PMID: 30447332 DOI: 10.1016/j.molp.2018.10.010] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Revised: 09/20/2018] [Accepted: 10/30/2018] [Indexed: 06/09/2023]
Abstract
Transcription factors (TFs) are key cellular components that control gene expression. They recognize specific DNA sequences, the TF binding sites (TFBSs), and thus are targeted to specific regions of the genome where they can recruit transcriptional co-factors and/or chromatin regulators to fine-tune spatiotemporal gene regulation. Therefore, the identification of TFBSs in genomic sequences and their subsequent quantitative modeling is of crucial importance for understanding and predicting gene expression. Here, we review how TFBSs can be determined experimentally, how the TFBS models can be constructed in silico, and how they can be optimized by taking into account features such as position interdependence within TFBSs, DNA shape, and/or by introducing state-of-the-art computational algorithms such as deep learning methods. In addition, we discuss the integration of context variables into the TFBS modeling, including nucleosome positioning, chromatin states, methylation patterns, 3D genome architectures, and TF cooperative binding, in order to better predict TF binding under cellular contexts. Finally, we explore the possibilities of combining the optimized TFBS model with technological advances, such as targeted TFBS perturbation by CRISPR, to better understand gene regulation, evolution, and plant diversity.
Collapse
Affiliation(s)
- Xuelei Lai
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France.
| | - Arnaud Stigliani
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France
| | - Gilles Vachon
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France
| | - Cristel Carles
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France
| | - Cezary Smaczniak
- Department for Plant Cell and Molecular Biology, Institute for Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Chloe Zubieta
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France
| | - Kerstin Kaufmann
- Department for Plant Cell and Molecular Biology, Institute for Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - François Parcy
- CNRS, Univ. Grenoble Alpes, CEA, INRA, BIG-LPCV, 38000 Grenoble, France.
| |
Collapse
|
14
|
A New Generation of Minor-Groove-Binding-Heterocyclic Diamidines That Recognize G·C Base Pairs in an AT Sequence Context. Molecules 2019; 24:molecules24050946. [PMID: 30866557 PMCID: PMC6429135 DOI: 10.3390/molecules24050946] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 02/28/2019] [Accepted: 03/01/2019] [Indexed: 12/17/2022] Open
Abstract
We review the preparation of new compounds with good solution and cell uptake properties that can selectively recognize mixed A·T and G·C bp sequences of DNA. Our underlying aim is to show that these new compounds provide important new biotechnology reagents as well as a new class of therapeutic candidates with better properties and development potential than other currently available agents. In this review, entirely different ways to recognize mixed sequences of DNA by modifying AT selective heterocyclic cations are described. To selectively recognize a G·C base pair an H-bond acceptor must be incorporated with AT recognizing groups as with netropsin. We have used pyridine, azabenzimidazole and thiophene-N-methylbenzimidazole GC recognition units in modules crafted with both rational design and empirical optimization. These modules can selectively and strongly recognize a single G·C base pair in an AT sequence context. In some cases, a relatively simple change in substituents can convert a heterocyclic module from AT to GC recognition selectivity. Synthesis and DNA interaction results for initial example lead modules are described for single G·C base pair recognition compounds. The review concludes with a description of the initial efforts to prepare larger compounds to recognize sequences of DNA with more than one G·C base pairs. The challenges and initial successes are described along with future directions.
Collapse
|
15
|
Guo P, Farahat AA, Paul A, Harika NK, Boykin DW, Wilson WD. Compound Shape Effects in Minor Groove Binding Affinity and Specificity for Mixed Sequence DNA. J Am Chem Soc 2018; 140:14761-14769. [PMID: 30353731 PMCID: PMC6399738 DOI: 10.1021/jacs.8b08152] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
AT specific heterocyclic cations that bind in the DNA duplex minor groove have had major successes as cell and nuclear stains and as therapeutic agents which can effectively enter human cells. Expanding the DNA sequence recognition capability of the minor groove compounds could also expand their therapeutic targets and have an impact in many areas, such as modulation of transcription factor biological activity. Success in the design of mixed sequence binding compounds has been achieved with N-methylbenzimidazole ( N-MeBI) thiophenes which are preorganized to fit the shape of the DNA minor groove and H-bond to the -NH of G·C base pairs that project into the minor groove. Initial compounds bind strongly to a single G·C base pair in an AT context with a specificity ratio of 50 ( KD AT-GC/ KD AT) or less and this is somewhat low for biological use. We felt that modifications of compound shape could be used to probe local DNA microstructure in target mixed base pair sequences of DNA and potentially improve the compound binding selectivity. Modifications were made by increasing the size of the benzimidazole N-substituent, for example, by using N-isobutyl instead of N-Me, and by changing the molecular twist by introducing substitutions at specific positions on the aromatic core of the compounds. In both cases, we have been able to achieve a dramatic increase in binding specificity, including no detectible binding to pure AT sequences, without a significant loss in affinity to mixed base pair target sequences.
Collapse
Affiliation(s)
- Pu Guo
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
| | - Abdelbasset A Farahat
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
- Department of Pharmaceutical Organic Chemistry, Faculty of Pharmacy , Mansoura University , Mansoura 35516 , Egypt
| | - Ananya Paul
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
| | - Narinder K Harika
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
| | - David W Boykin
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
| | - W David Wilson
- Department of Chemistry and Center for Diagnostics and Therapeutics , Georgia State University , 50 Decatur Street South East , Atlanta , Georgia 30303 , United States
| |
Collapse
|