1
|
Zhang Y, Yang C, Xiong Y, Xiao Y. 3dDNAscoreA: A scoring function for evaluation of DNA 3D structures. Biophys J 2024; 123:2696-2704. [PMID: 38409781 PMCID: PMC11393702 DOI: 10.1016/j.bpj.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 01/31/2024] [Accepted: 02/21/2024] [Indexed: 02/28/2024] Open
Abstract
DNA molecules are vital macromolecules that play a fundamental role in many cellular processes and have broad applications in medicine. For example, DNA aptamers have been rapidly developed for diagnosis, biosensors, and clinical therapy. Recently, we proposed a computational method of predicting DNA 3D structures, called 3dDNA. However, it lacks a scoring function to evaluate the predicted DNA 3D structures, and so they are not ranked for users. Here, we report a scoring function, 3dDNAscoreA, for evaluation of DNA 3D structures based on a deep learning model ARES for RNA 3D structure evaluation but using a new strategy for training. 3dDNAscoreA is benchmarked on two test sets to show its ability to rank DNA 3D structures and select the native and near-native structures.
Collapse
Affiliation(s)
- Yi Zhang
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Chenxi Yang
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Yiduo Xiong
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Yi Xiao
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China.
| |
Collapse
|
2
|
Gupta A, Kulkarni M, Mukherjee A. Accurate prediction of B-form/A-form DNA conformation propensity from primary sequence: A machine learning and free energy handshake. PATTERNS 2021; 2:100329. [PMID: 34553171 PMCID: PMC8441556 DOI: 10.1016/j.patter.2021.100329] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 03/25/2021] [Accepted: 07/20/2021] [Indexed: 11/26/2022]
Abstract
DNA carries the genetic code of life, with different conformations associated with different biological functions. Predicting the conformation of DNA from its primary sequence, although desirable, is a challenging problem owing to the polymorphic nature of DNA. We have deployed a host of machine learning algorithms, including the popular state-of-the-art LightGBM (a gradient boosting model), for building prediction models. We used the nested cross-validation strategy to address the issues of “overfitting” and selection bias. This simultaneously provides an unbiased estimate of the generalization performance of a machine learning algorithm and allows us to tune the hyperparameters optimally. Furthermore, we built a secondary model based on SHAP (SHapley Additive exPlanations) that offers crucial insight into model interpretability. Our detailed model-building strategy and robust statistical validation protocols tackle the formidable challenge of working on small datasets, which is often the case in biological and medical data. A robust machine learning model to predict A- or B-DNA conformation Outcome of machine learning model is explained with free energy values Our approach works well under class imbalance and limited data constraints
The sequence in the genome of an organism encodes all the information of life. We combine a data-driven approach using machine learning (ML) and the results of free energy calculations to offer a fresh perspective on this long-standing problem of prediction of DNA conformation (A or B) from the sequence. We trained our ML model using sophisticated state-of-the art algorithms such as LightGBM along with a nested cross-validation strategy to overcome the common problems associated with data bias and overfitting when constrained by limited data size. Our study will serve the broader interest of researchers who are not only seeking accurate and reliable predictive models but also want to understand the physical and chemical origins behind the predictions.
Collapse
Affiliation(s)
- Abhijit Gupta
- Department of Chemistry, Indian Institute of Science Education and Research, Pune, Maharashtra 411008, India
| | - Mandar Kulkarni
- Division of Biophysical Chemistry, Lund University, Chemical Center, P.O.B. 124, 22100 Lund, Sweden
| | - Arnab Mukherjee
- Department of Chemistry, Indian Institute of Science Education and Research, Pune, Maharashtra 411008, India
| |
Collapse
|
3
|
Ratnasinghe BD, Salsbury AM, Lemkul JA. Ion Binding Properties and Dynamics of the bcl-2 G-Quadruplex Using a Polarizable Force Field. J Chem Inf Model 2020; 60:6476-6488. [PMID: 33264004 DOI: 10.1021/acs.jcim.0c01064] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
G-quadruplexes (GQs) are topologically diverse, highly thermostable noncanonical nucleic acid structures that form in guanine-rich sequences in DNA and RNA. GQs are implicated in transcriptional and translational regulation and genome maintenance, and deleterious alterations to their structures contribute to diseases such as cancer. The expression of the B-cell lymphoma 2 (Bcl-2) antiapoptotic protein, for example, is under transcriptional control of a GQ in the promoter of the bcl-2 gene. Modulation of the bcl-2 GQ by small molecules is of interest for chemotherapeutic development but doing so requires knowledge of the factors driving GQ folding and stabilization. To develop a greater understanding of the electrostatic properties of the bcl-2 promoter GQ, we performed molecular dynamics simulations using the Drude-2017 polarizable force field and compared relevant outcomes to the nonpolarizable CHARMM36 force field. Our simulation outcomes highlight the importance of dipole-dipole interactions in the bcl-2 GQ, particularly during the recruitment of a bulk K+ ion to the solvent-exposed face of the tetrad stem. We also predict and characterize an "electronegative pocket" at the tetrad-long loop junction that induces local backbone conformational change and may induce local conformational changes at cellular concentrations of K+. These outcomes suggest that moieties within the bcl-2 GQ can be targeted by small molecules to modulate bcl-2 GQ stability.
Collapse
Affiliation(s)
- Brian D Ratnasinghe
- Department of Biochemistry, Virginia Tech, 303 Engel Hall, 340 West Campus Dr., Blacksburg, Virginia 24061, United States
| | - Alexa M Salsbury
- Department of Biochemistry, Virginia Tech, 303 Engel Hall, 340 West Campus Dr., Blacksburg, Virginia 24061, United States
| | - Justin A Lemkul
- Department of Biochemistry and Center for Drug Discovery, Virginia Tech, 303 Engel Hall, 340 West Campus Dr., Blacksburg, Virginia 24061, United States
| |
Collapse
|
4
|
Fukal J, Páv O, Buděšínský M, Rosenberg I, Šebera J, Sychrovský V. Structural interpretation of the 31P NMR chemical shifts in thiophosphate and phosphate: key effects due to spin–orbit and explicit solvent. Phys Chem Chem Phys 2019; 21:9924-9934. [DOI: 10.1039/c9cp01460h] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Structural interpretation of the 31P NMR shifts measured in different molecules including thiophosphate or phosphate group was obtained by means of theoretical calculations including the effects of geometry, molecular dynamics, solvent, relativistic effects and the effect of NMR reference.
Collapse
Affiliation(s)
- J. Fukal
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - O. Páv
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - M. Buděšínský
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - I. Rosenberg
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - J. Šebera
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - V. Sychrovský
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
- Department of Electrotechnology
| |
Collapse
|
5
|
Eltzner B, Huckemann S, Mardia KV. Torus principal component analysis with applications to RNA structure. Ann Appl Stat 2018. [DOI: 10.1214/17-aoas1115] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
6
|
Biologically important conformational features of DNA as interpreted by quantum mechanics and molecular mechanics computations of its simple fragments. J Mol Model 2018; 24:46. [DOI: 10.1007/s00894-018-3589-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 01/15/2018] [Indexed: 10/18/2022]
|
7
|
Schneider B, Božíková P, Nečasová I, Čech P, Svozil D, Černý J. A DNA structural alphabet provides new insight into DNA flexibility. Acta Crystallogr D Struct Biol 2018; 74:52-64. [PMID: 29372899 PMCID: PMC5786007 DOI: 10.1107/s2059798318000050] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 01/02/2018] [Indexed: 11/10/2022] Open
Abstract
DNA is a structurally plastic molecule, and its biological function is enabled by adaptation to its binding partners. To identify the DNA structural polymorphisms that are possible in such adaptations, the dinucleotide structures of 60 000 DNA steps from sequentially nonredundant crystal structures were classified and an automated protocol assigning 44 distinct structural (conformational) classes called NtC (for Nucleotide Conformers) was developed. To further facilitate understanding of the DNA structure, the NtC were assembled into the DNA structural alphabet CANA (Conformational Alphabet of Nucleic Acids) and the projection of CANA onto the graphical representation of the molecular structure was proposed. The NtC classification was used to define a validation score called confal, which quantifies the conformity between an analyzed structure and the geometries of NtC. NtC and CANA assignment were applied to analyze the structural properties of typical DNA structures such as Dickerson-Drew dodecamers, guanine quadruplexes and structural models based on fibre diffraction. NtC, CANA and confal assignment, which is accessible at the website https://dnatco.org, allows the quantitative assessment and validation of DNA structures and their subsequent analysis by means of pseudo-sequence alignment. An animated Interactive 3D Complement (I3DC) is available in Proteopedia at http://proteopedia.org/w/Journal:Acta_Cryst_D:2.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Czechia
| | - Paulína Božíková
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Czechia
| | - Iva Nečasová
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Czechia
| | - Petr Čech
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, CZ-166 28 Prague, Czechia
| | - Daniel Svozil
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, CZ-166 28 Prague, Czechia
| | - Jiří Černý
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Czechia
| |
Collapse
|
8
|
Schneider B, Božíková P, Čech P, Svozil D, Černý J. A DNA Structural Alphabet Distinguishes Structural Features of DNA Bound to Regulatory Proteins and in the Nucleosome Core Particle. Genes (Basel) 2017; 8:E278. [PMID: 29057824 PMCID: PMC5664128 DOI: 10.3390/genes8100278] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 10/06/2017] [Accepted: 10/13/2017] [Indexed: 01/21/2023] Open
Abstract
We analyzed the structural behavior of DNA complexed with regulatory proteins and the nucleosome core particle (NCP). The three-dimensional structures of almost 25 thousand dinucleotide steps from more than 500 sequentially non-redundant crystal structures were classified by using DNA structural alphabet CANA (Conformational Alphabet of Nucleic Acids) and associations between ten CANA letters and sixteen dinucleotide sequences were investigated. The associations showed features discriminating between specific and non-specific binding of DNA to proteins. Important is the specific role of two DNA structural forms, A-DNA, and BII-DNA, represented by the CANA letters AAA and BB2: AAA structures are avoided in non-specific NCP complexes, where the wrapping of the DNA duplex is explained by the periodic occurrence of BB2 every 10.3 steps. In both regulatory and NCP complexes, the extent of bending of the DNA local helical axis does not influence proportional representation of the CANA alphabet letters, namely the relative incidences of AAA and BB2 remain constant in bent and straight duplexes.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Prague West, Czech Republic.
| | - Paulína Božíková
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Prague West, Czech Republic.
| | - Petr Čech
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic.
| | - Daniel Svozil
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic.
| | - Jiří Černý
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, CZ-252 50 Vestec, Prague West, Czech Republic.
| |
Collapse
|
9
|
Lemkul JA, MacKerell AD. Polarizable Force Field for DNA Based on the Classical Drude Oscillator: I. Refinement Using Quantum Mechanical Base Stacking and Conformational Energetics. J Chem Theory Comput 2017; 13:2053-2071. [PMID: 28399366 DOI: 10.1021/acs.jctc.7b00067] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Empirical force fields seek to relate the configuration of a set of atoms to its energy, thus yielding the forces governing its dynamics, using classical physics rather than more expensive quantum mechanical calculations that are computationally intractable for large systems. Most force fields used to simulate biomolecular systems use fixed atomic partial charges, neglecting the influence of electronic polarization, instead making use of a mean-field approximation that may not be transferable across environments. Recent hardware and software developments make polarizable simulations feasible, and to this end, polarizable force fields represent the next generation of molecular dynamics simulation technology. In this work, we describe the refinement of a polarizable force field for DNA based on the classical Drude oscillator model by targeting quantum mechanical interaction energies and conformational energy profiles of model compounds necessary to build a complete DNA force field. The parametrization strategy employed in the present work seeks to correct weak base stacking in A- and B-DNA and the unwinding of Z-DNA observed in the previous version of the force field, called Drude-2013. Refinement of base nonbonded terms and reparametrization of dihedral terms in the glycosidic linkage, deoxyribofuranose rings, and important backbone torsions resulted in improved agreement with quantum mechanical potential energy surfaces. Notably, we expand on previous efforts by explicitly including Z-DNA conformational energetics in the refinement.
Collapse
Affiliation(s)
- Justin A Lemkul
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland , Baltimore, Maryland 21201, United States
| | - Alexander D MacKerell
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland , Baltimore, Maryland 21201, United States
| |
Collapse
|
10
|
Zgarbová M, Jurečka P, Banáš P, Havrila M, Šponer J, Otyepka M. Noncanonical α/γ Backbone Conformations in RNA and the Accuracy of Their Description by the AMBER Force Field. J Phys Chem B 2017; 121:2420-2433. [PMID: 28290207 DOI: 10.1021/acs.jpcb.7b00262] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The sugar-phosphate backbone of RNA can exist in diverse rotameric substates, giving RNA molecules enormous conformational variability. The most frequent noncanonical backbone conformation in RNA is α/γ = t/t, which is derived from the canonical backbone by a crankshaft motion and largely preserves the standard geometry of the RNA duplex. A similar conformation also exists in DNA, where it has been extensively studied and shown to be involved in DNA-protein interactions. However, the function of the α/γ = t/t conformation in RNA is poorly understood. Here, we present molecular dynamics simulations of several prototypical RNA structures obtained from X-ray and NMR experiments, including canonical and mismatched RNA duplexes, UUCG and GAGA tetraloops, Loop E, the sarcin-ricin loop, a parallel guanine quadruplex, and a viral pseudoknot. The stability of various noncanonical α/γ backbone conformations was analyzed with two AMBER force fields, ff99bsc0χOL3 and ff99bsc0χOL3 with the recent εζOL1 and βOL1 corrections for DNA. Although some α/γ substates were stable with seemingly well-described equilibria, many were unstable in our simulations. Notably, the most frequent noncanonical conformer α/γ = t/t was unstable in both tested force fields. Possible reasons for this instability are discussed. Our work reveals a potentially important artifact in RNA force fields and highlights a need for further force field refinement.
Collapse
Affiliation(s)
- Marie Zgarbová
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Pavel Banáš
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Marek Havrila
- Institute of Biophysics, Academy of Sciences of the Czech Republic , Královopolská 135, 612 65 Brno, Czech Republic
| | - Jiří Šponer
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic.,Institute of Biophysics, Academy of Sciences of the Czech Republic , Královopolská 135, 612 65 Brno, Czech Republic
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| |
Collapse
|
11
|
Fukal J, Páv O, Buděšínský M, Šebera J, Sychrovský V. The benchmark of 31P NMR parameters in phosphate: a case study on structurally constrained and flexible phosphate. Phys Chem Chem Phys 2017; 19:31830-31841. [DOI: 10.1039/c7cp06969c] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
A benchmark for structural interpretation of the 31P NMR shift and the 2JP,C spin–spin coupling in the phosphate group was obtained by means of theoretical calculations and measurements in diethylphosphate and 5,5-dimethyl-2-hydroxy-1,3,2-dioxaphosphinane 2-oxide.
Collapse
Affiliation(s)
- Jiří Fukal
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - Ondřej Páv
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - Miloš Buděšínský
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - Jakub Šebera
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
| | - Vladimír Sychrovský
- Institute of Organic Chemistry and Biochemistry
- Academy of Sciences of the Czech Republic
- Praha 6
- Czech Republic
- Department of Electrotechnology
| |
Collapse
|
12
|
Galindo-Murillo R, Robertson JC, Zgarbová M, Šponer J, Otyepka M, Jurečka P, Cheatham TE. Assessing the Current State of Amber Force Field Modifications for DNA. J Chem Theory Comput 2016; 12:4114-27. [PMID: 27300587 PMCID: PMC4980684 DOI: 10.1021/acs.jctc.6b00186] [Citation(s) in RCA: 322] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
![]()
The utility of molecular
dynamics (MD) simulations to model biomolecular
structure, dynamics, and interactions has witnessed enormous advances
in recent years due to the availability of optimized MD software and
access to significant computational power, including GPU multicore
computing engines and other specialized hardware. This has led researchers
to routinely extend conformational sampling times to the microsecond
level and beyond. The extended sampling time has allowed the community
not only to converge conformational ensembles through complete sampling
but also to discover deficiencies and overcome problems with the force
fields. Accuracy of the force fields is a key component, along with
sampling, toward being able to generate accurate and stable structures
of biopolymers. The Amber force field for nucleic acids has been used
extensively since the 1990s, and multiple artifacts have been discovered,
corrected, and reassessed by different research groups. We present
a direct comparison of two of the most recent and state-of-the-art
Amber force field modifications, bsc1 and OL15, that focus on accurate
modeling of double-stranded DNA. After extensive MD simulations with
five test cases and two different water models, we conclude that both
modifications are a remarkable improvement over the previous bsc0
force field. Both force field modifications show better agreement
when compared to experimental structures. To ensure convergence, the
Drew–Dickerson dodecamer (DDD) system was simulated using 100
independent MD simulations, each extended to at least 10 μs,
and the independent MD simulations were concatenated into a single
1 ms long trajectory for each combination of force field and water
model. This is significantly beyond the time scale needed to converge
the conformational ensemble of the internal portions of a DNA helix
absent internal base pair opening. Considering all of the simulations
discussed in the current work, the MD simulations performed to assess
and validate the current force fields and water models aggregate over
14 ms of simulation time. The results suggest that both the bsc1 and
OL15 force fields render average structures that deviate significantly
less than 1 Å from the average experimental structures. This
can be compared to similar but less exhaustive simulations with the
CHARMM 36 force field that aggregate to the ∼90 μs time
scale and also perform well but do not produce structures as close
to the DDD NMR average structures (with root-mean-square deviations
of 1.3 Å) as the newer Amber force fields. On the basis of these
analyses, any future research involving double-stranded DNA simulations
using the Amber force fields should employ the bsc1 or OL15 modification.
Collapse
Affiliation(s)
- Rodrigo Galindo-Murillo
- Department of Medicinal Chemistry, University of Utah , 2000 East 30 South, Skaggs 105, Salt Lake City, Utah 84112, United States
| | - James C Robertson
- Department of Medicinal Chemistry, University of Utah , 2000 East 30 South, Skaggs 105, Salt Lake City, Utah 84112, United States
| | - Marie Zgarbová
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17 Listopadu 12, 771 46 Olomouc, Czech Republic
| | - Jiří Šponer
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17 Listopadu 12, 771 46 Olomouc, Czech Republic.,Institute of Biophysics, Academy of Sciences of the Czech Republic , Královopolská 135, 612 65 Brno, Czech Republic
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17 Listopadu 12, 771 46 Olomouc, Czech Republic
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17 Listopadu 12, 771 46 Olomouc, Czech Republic
| | - Thomas E Cheatham
- Department of Medicinal Chemistry, University of Utah , 2000 East 30 South, Skaggs 105, Salt Lake City, Utah 84112, United States
| |
Collapse
|
13
|
Černý J, Božíková P, Schneider B. DNATCO: assignment of DNA conformers at dnatco.org. Nucleic Acids Res 2016; 44:W284-7. [PMID: 27150812 PMCID: PMC4987927 DOI: 10.1093/nar/gkw381] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Accepted: 04/26/2016] [Indexed: 12/14/2022] Open
Abstract
The web service DNATCO (dnatco.org) classifies local conformations of DNA molecules beyond their traditional sorting to A, B and Z DNA forms. DNATCO provides an interface to robust algorithms assigning conformation classes called ntC to dinucleotides extracted from DNA-containing structures uploaded in PDB format version 3.1 or above. The assigned dinucleotide ntC classes are further grouped into DNA structural alphabet ntA, to the best of our knowledge the first DNA structural alphabet. The results are presented at two levels: in the form of user friendly visualization and analysis of the assignment, and in the form of a downloadable, more detailed table for further analysis offline. The website is free and open to all users and there is no login requirement.
Collapse
Affiliation(s)
- Jiří Černý
- Institute of Biotechnology CAS, BIOCEV, Průmyslová 595, 252 50 Vestec, Prague West, Czech Republic
| | - Paulína Božíková
- Institute of Biotechnology CAS, BIOCEV, Průmyslová 595, 252 50 Vestec, Prague West, Czech Republic
| | - Bohdan Schneider
- Institute of Biotechnology CAS, BIOCEV, Průmyslová 595, 252 50 Vestec, Prague West, Czech Republic
| |
Collapse
|
14
|
Zgarbová M, Šponer J, Otyepka M, Cheatham TE, Galindo-Murillo R, Jurečka P. Refinement of the Sugar-Phosphate Backbone Torsion Beta for AMBER Force Fields Improves the Description of Z- and B-DNA. J Chem Theory Comput 2015; 11:5723-36. [PMID: 26588601 DOI: 10.1021/acs.jctc.5b00716] [Citation(s) in RCA: 347] [Impact Index Per Article: 38.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Z-DNA duplexes are a particularly complicated test case for current force fields. We performed a set of explicit solvent molecular dynamics (MD) simulations with various AMBER force field parametrizations including our recent refinements of the ε/ζ and glycosidic torsions. None of these force fields described the ZI/ZII and other backbone substates correctly, and all of them underpredicted the population of the important ZI substate. We show that this underprediction can be attributed to an inaccurate potential for the sugar-phosphate backbone torsion angle β. We suggest a refinement of this potential, β(OL1), which was derived using our recently introduced methodology that includes conformation-dependent solvation effects. The new potential significantly increases the stability of the dominant ZI backbone substate and improves the overall description of the Z-DNA backbone. It also has a positive (albeit small) impact on another important DNA form, the antiparallel guanine quadruplex (G-DNA), and improves the description of the canonical B-DNA backbone by increasing the population of BII backbone substates, providing a better agreement with experiment. We recommend using β(OL1) in combination with our previously introduced corrections, εζ(OL1) and χ(OL4), (the combination being named OL15) as a possible alternative to the current β torsion potential for more accurate modeling of nucleic acids.
Collapse
Affiliation(s)
- Marie Zgarbová
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Jiří Šponer
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic.,Institute of Biophysics, Academy of Sciences of the Czech Republic , Královopolská 135, 612 65 Brno, Czech Republic
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Thomas E Cheatham
- Department of Medicinal Chemistry, University of Utah , 30 South 2000 East, Skaggs 105, Salt Lake City, Utah 84112, United States
| | - Rodrigo Galindo-Murillo
- Department of Medicinal Chemistry, University of Utah , 30 South 2000 East, Skaggs 105, Salt Lake City, Utah 84112, United States
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University , 17. listopadu 12, 77146 Olomouc, Czech Republic
| |
Collapse
|
15
|
Craveur P, Joseph AP, Esque J, Narwani TJ, Noël F, Shinada N, Goguet M, Leonard S, Poulain P, Bertrand O, Faure G, Rebehmed J, Ghozlane A, Swapna LS, Bhaskara RM, Barnoud J, Téletchéa S, Jallu V, Cerny J, Schneider B, Etchebest C, Srinivasan N, Gelly JC, de Brevern AG. Protein flexibility in the light of structural alphabets. Front Mol Biosci 2015; 2:20. [PMID: 26075209 PMCID: PMC4445325 DOI: 10.3389/fmolb.2015.00020] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2015] [Accepted: 04/30/2015] [Indexed: 01/01/2023] Open
Abstract
Protein structures are valuable tools to understand protein function. Nonetheless, proteins are often considered as rigid macromolecules while their structures exhibit specific flexibility, which is essential to complete their functions. Analyses of protein structures and dynamics are often performed with a simplified three-state description, i.e., the classical secondary structures. More precise and complete description of protein backbone conformation can be obtained using libraries of small protein fragments that are able to approximate every part of protein structures. These libraries, called structural alphabets (SAs), have been widely used in structure analysis field, from definition of ligand binding sites to superimposition of protein structures. SAs are also well suited to analyze the dynamics of protein structures. Here, we review innovative approaches that investigate protein flexibility based on SAs description. Coupled to various sources of experimental data (e.g., B-factor) and computational methodology (e.g., Molecular Dynamic simulation), SAs turn out to be powerful tools to analyze protein dynamics, e.g., to examine allosteric mechanisms in large set of structures in complexes, to identify order/disorder transition. SAs were also shown to be quite efficient to predict protein flexibility from amino-acid sequence. Finally, in this review, we exemplify the interest of SAs for studying flexibility with different cases of proteins implicated in pathologies and diseases.
Collapse
Affiliation(s)
- Pierrick Craveur
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Agnel P Joseph
- Rutherford Appleton Laboratory, Science and Technology Facilities Council Didcot, UK
| | - Jeremy Esque
- Institut National de la Santé et de la Recherche Médicale U964,7 UMR Centre National de la Recherche Scientifique 7104, IGBMC, Université de Strasbourg Illkirch, France
| | - Tarun J Narwani
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Floriane Noël
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Nicolas Shinada
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Matthieu Goguet
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Sylvain Leonard
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Pierre Poulain
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Ets Poulain Pointe-Noire, Congo
| | - Olivier Bertrand
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Guilhem Faure
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health Bethesda, MD, USA
| | - Joseph Rebehmed
- Centre National de la Recherche Scientifique UMR7590, Sorbonne Universités, Université Pierre et Marie Curie - MNHN - IRD - IUC Paris, France
| | | | - Lakshmipuram S Swapna
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore Bangalore, India ; Hospital for Sick Children, and Departments of Biochemistry and Molecular Genetics, University of Toronto Toronto, ON, Canada
| | - Ramachandra M Bhaskara
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore Bangalore, India ; Department of Theoretical Biophysics, Max Planck Institute of Biophysics Frankfurt, Germany
| | - Jonathan Barnoud
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Laboratoire de Physique, École Normale Supérieure de Lyon, Université de Lyon, Centre National de la Recherche Scientifique UMR 5672 Lyon, France
| | - Stéphane Téletchéa
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France ; Faculté des Sciences et Techniques, Université de Nantes, Unité Fonctionnalité et Ingénierie des Protéines, Centre National de la Recherche Scientifique UMR 6286, Université Nantes Nantes, France
| | - Vincent Jallu
- Platelet Unit, Institut National de la Transfusion Sanguine Paris, France
| | - Jiri Cerny
- Institute of Biotechnology, The Czech Academy of Sciences Prague, Czech Republic
| | - Bohdan Schneider
- Institute of Biotechnology, The Czech Academy of Sciences Prague, Czech Republic
| | - Catherine Etchebest
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | | | - Jean-Christophe Gelly
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| | - Alexandre G de Brevern
- Institut National de la Santé et de la Recherche Médicale U 1134 Paris, France ; UMR_S 1134, DSIMB, Université Paris Diderot, Sorbonne Paris Cite Paris, France ; Institut National de la Transfusion Sanguine, DSIMB Paris, France ; UMR_S 1134, DSIMB, Laboratory of Excellence GR-Ex Paris, France
| |
Collapse
|
16
|
Schneider B, Černý J, Svozil D, Čech P, Gelly JC, de Brevern AG. Bioinformatic analysis of the protein/DNA interface. Nucleic Acids Res 2014; 42:3381-94. [PMID: 24335080 PMCID: PMC3950675 DOI: 10.1093/nar/gkt1273] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 11/14/2013] [Accepted: 11/14/2013] [Indexed: 01/04/2023] Open
Abstract
To investigate the principles driving recognition between proteins and DNA, we analyzed more than thousand crystal structures of protein/DNA complexes. We classified protein and DNA conformations by structural alphabets, protein blocks [de Brevern, Etchebest and Hazout (2000) (Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Prots. Struct. Funct. Genet., 41:271-287)] and dinucleotide conformers [Svozil, Kalina, Omelka and Schneider (2008) (DNA conformations and their sequence preferences. Nucleic Acids Res., 36:3690-3706)], respectively. Assembling the mutually interacting protein blocks and dinucleotide conformers into 'interaction matrices' revealed their correlations and conformer preferences at the interface relative to their occurrence outside the interface. The analyzed data demonstrated important differences between complexes of various types of proteins such as transcription factors and nucleases, distinct interaction patterns for the DNA minor groove relative to the major groove and phosphate and importance of water-mediated contacts. Water molecules mediate proportionally the largest number of contacts in the minor groove and form the largest proportion of contacts in complexes of transcription factors. The generally known induction of A-DNA forms by complexation was more accurately attributed to A-like and intermediate A/B conformers rare in naked DNA molecules.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Jiří Černý
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Daniel Svozil
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Petr Čech
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Jean-Christophe Gelly
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Alexandre G. de Brevern
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| |
Collapse
|