1
|
Genta M, Ferrara G, Capelli R, Rondelli D, Sertic S, Bolognesi M, Rizzi M, Rossi F, Jeruzalmi D, Chaves-Sanjuan A, Miggiano R. Mechanistic understanding of UvrA damage detection and lesion hand-off to UvrB in Nucleotide Excision Repair. Nat Commun 2025; 16:3416. [PMID: 40210888 PMCID: PMC11985968 DOI: 10.1038/s41467-025-58670-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Accepted: 03/28/2025] [Indexed: 04/12/2025] Open
Abstract
Nucleotide excision repair (NER) represents one of the major molecular machineries that control chromosome stability in all living species. In Eubacteria, the initial stages of the repair process are carried out by the UvrABC excinuclease complex. Despite the wealth of structural data available, some crucial details of the pathway remain elusive. In this study, we present a structural investigation of the Mycobacterium tuberculosis UvrAUvrB complex and of the UvrA dimer, both in complex with damaged DNA. Our analyses yield insights into the DNA binding mode of UvrA, showing an unexplored conformation of Insertion Domains (IDs), underlying the essential role of these domains in DNA coordination. Furthermore, we observe an interplay between the ID and the UvrB Binding Domain (UBD): after the recognition of the damage, the IDs repositions with the concomitant reorganization of UBD, allowing the formation of the complex between UvrA and UvrB. These events are detected along the formation of the uncharacterized UvrA2UvrB1-DNA and the UvrA2UvrB2-DNA complexes which we interpret as hierarchical steps initiating the DNA repair cascade in the NER pathway, resulting in the formation of the pre-incision complex.
Collapse
Affiliation(s)
- Marianna Genta
- Department of Pharmaceutical Sciences, University of Piemonte Orientale, Via Bovio 6, 28100, Novara, Italy
| | - Giulia Ferrara
- Department of Pharmaceutical Sciences, University of Piemonte Orientale, Via Bovio 6, 28100, Novara, Italy
| | - Riccardo Capelli
- Department of Biosciences, University of Milan, Milan, 20133, Italy
| | - Diego Rondelli
- Department of Biosciences, University of Milan, Milan, 20133, Italy
| | - Sarah Sertic
- Department of Biosciences, University of Milan, Milan, 20133, Italy
| | - Martino Bolognesi
- Department of Biosciences, University of Milan, Milan, 20133, Italy
- Pediatric Clinical Research Center Romeo ed Enrica Invernizzi and NOLIMITS, University of Milan, Milan, 20133, Italy
| | - Menico Rizzi
- Department of Pharmaceutical Sciences, University of Piemonte Orientale, Via Bovio 6, 28100, Novara, Italy
| | - Franca Rossi
- Department of Pharmaceutical Sciences, University of Piemonte Orientale, Via Bovio 6, 28100, Novara, Italy
| | - David Jeruzalmi
- Department of Chemistry and Biochemistry, The City College of New York, New York, NY, 10031, USA
| | - Antonio Chaves-Sanjuan
- Department of Biosciences, University of Milan, Milan, 20133, Italy.
- Pediatric Clinical Research Center Romeo ed Enrica Invernizzi and NOLIMITS, University of Milan, Milan, 20133, Italy.
| | - Riccardo Miggiano
- Department of Pharmaceutical Sciences, University of Piemonte Orientale, Via Bovio 6, 28100, Novara, Italy.
| |
Collapse
|
2
|
Li T, Chen H, Ma N, Jiang D, Wu J, Zhang X, Li H, Su J, Chen P, Liu Q, Guan Y, Zhu X, Lin J, Zhang J, Wang Q, Guo H, Zhu F. Specificity landscapes of 40 R2R3-MYBs reveal how paralogs target different cis-elements by homodimeric binding. IMETA 2025; 4:e70009. [PMID: 40236784 PMCID: PMC11995187 DOI: 10.1002/imt2.70009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 02/11/2025] [Accepted: 02/17/2025] [Indexed: 04/17/2025]
Abstract
Paralogous transcription factors (TFs) frequently recognize highly similar DNA motifs. Homodimerization can help distinguish them according to their different dimeric configurations. Here, by studying R2R3-MYB TFs, we show that homodimerization can also directly change the recognized DNA motifs to distinguish between similar TFs. By high-throughput SELEX, we profiled the specificity landscape for 40 R2R3-MYBs of subfamily VIII and curated 833 motif models. The dimeric models show that homodimeric binding has evoked specificity changes for AtMYBs. Focusing on AtMYB2 as an example, we show that homodimerization has modified its specificity and allowed it to recognize additional cis-regulatory sequences that are different from the closely related CCWAA-box AtMYBs and are unique among all AtMYBs. Genomic sites described by the modified dimeric specificities of AtMYB2 are conserved in evolution and involved in AtMYB2-specific transcriptional activation. Collectively, this study provides rich data on sequence preferences of VIII R2R3-MYBs and suggests an alternative mechanism that guides closely related TFs to respective cis-regulatory sites.
Collapse
Affiliation(s)
- Tian Li
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Hao Chen
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Nana Ma
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
- College of Life ScienceFujian Agriculture and Forestry UniversityFuzhouChina
| | - Dingkun Jiang
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
- College of Life ScienceFujian Agriculture and Forestry UniversityFuzhouChina
| | - Jiacheng Wu
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Xinfeng Zhang
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Hao Li
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
- College of Life ScienceFujian Agriculture and Forestry UniversityFuzhouChina
| | - Jiaqing Su
- College of Resources and EnvironmentFujian Agriculture and Forestry UniversityFuzhouChina
| | - Piaojuan Chen
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Qing Liu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro‐BioresourcesSouth China Agricultural UniversityGuangzhouChina
| | - Yuefeng Guan
- College of Resources and EnvironmentFujian Agriculture and Forestry UniversityFuzhouChina
| | - Xiaoyue Zhu
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Juncheng Lin
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Jilin Zhang
- Department of Biomedical SciencesCity University of Hong KongHong KongChina
- Tung Biomedical Sciences CentreCity University of Hong KongHong KongChina
- Department of Precision Diagnostic and Therapeutic TechnologyThe City University of Hong Kong Shenzhen Futian Research InstituteShenzhenChina
| | - Qin Wang
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| | - Honghong Guo
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
- College of Life ScienceFujian Agriculture and Forestry UniversityFuzhouChina
| | - Fangjie Zhu
- Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, College of JUNCAO Science and Ecology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems BiologyFujian Agriculture and Forestry UniversityFuzhouChina
| |
Collapse
|
3
|
Singh SK, Mathur M, Kamboj H, Kaushik JK, Mohanty AK, Kumar S. Deciphering aptamer-protein interactions for bovine sperm sorting through in silico and in vitro studies. Mol Biol Rep 2025; 52:300. [PMID: 40085160 DOI: 10.1007/s11033-025-10402-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2024] [Accepted: 02/28/2025] [Indexed: 03/16/2025]
Abstract
BACKGROUND In recent years, aptamers have emerged as versatile molecular tools with promising applications in various fields, including diagnostics and therapeutics. In livestock reproduction, their application holds promise for improving the sorting and identification of X and Y chromosome-bearing sperm cells, which is essential for increasing productivity in the dairy and beef industries. METHOD This study utilized seven rounds of Cell-SELEX using bovine X and Y sperm cells to isolate specific aptamers that target these cells. A comprehensive in-silico analysis was conducted to evaluate the binding interactions between the selected aptamer sequences and the differentially expressed plasma membrane proteins of X and Y sperm cells. RESULT The analysis identified the aptamer sequences APT1X, APT2X, and APT5X as having the most stable interactions with the X sperm surface proteins TLR8 (Toll-like receptor 8), CLRN3, and TLR7 (Toll-like receptor 7), respectively. APT2Y exhibited a relatively high affinity for the protein SCAMP1, a Y-sperm-specific protein. Aptamer‒protein interactions are characterized by hydrogen bonds and hydrophobic contacts. Notably, APT1X formed the greatest number of hydrogen bonds with the polar residues of TLR8, whereas TLR7-APT5X interactions exhibited the greatest number of hydrophobic contacts. CONCLUSION The use of in-silico analysis for evaluating the interaction between candidate aptamer sequences and differentially expressed X and Y bovine sperm proteins provides valuable insights. This approach might facilitate the sorting of bovine X and Y sperm cells, contributing to advancements in livestock reproduction strategies.
Collapse
Affiliation(s)
| | - Manya Mathur
- National Dairy Research Institute (ICAR), Karnal, India
| | - Himanshu Kamboj
- Amity Institute of Biotechnology, Amity University, Noida, India
| | | | - Ashok Kumar Mohanty
- National Dairy Research Institute (ICAR), Karnal, India
- Central Institute for Research on Cattle (ICAR), Meerut, India
| | | |
Collapse
|
4
|
Jones MS, Khanna S, Ferguson AL. FlowBack: A Generalized Flow-Matching Approach for Biomolecular Backmapping. J Chem Inf Model 2025; 65:672-692. [PMID: 39772562 DOI: 10.1021/acs.jcim.4c02046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]
Abstract
Coarse-grained models have become ubiquitous in biomolecular modeling tasks aimed at studying slow dynamical processes such as protein folding and DNA hybridization. These models can considerably accelerate sampling but it remains challenging to accurately and efficiently restore all-atom detail to the coarse-grained trajectory, which can be vital for detailed understanding of molecular mechanisms and calculation of observables contingent on all-atom coordinates. In this work, we introduce FlowBack as a deep generative model employing a flow-matching objective to map samples from a coarse-grained prior distribution to an all-atom data distribution. We construct our prior distribution to be agnostic to the coarse-grained map and molecular type. A protein-specific model trained on ∼65k structures from the Protein Data Bank achieves state-of-the-art performance on structural metrics compared to previous generative and rules-based approaches in applications to static PDB structures, all-atom simulations of fast-folding proteins, and coarse-grained trajectories generated by a machine-learned force field. A DNA-protein model trained on ∼1.5k DNA-protein complexes achieves excellent reconstruction and generative capabilities on static DNA-protein complexes from the Protein Data Bank as well as on out-of-distribution coarse-grained dynamical simulations of DNA-protein complexation. FlowBack offers an accurate, efficient, and easy-to-use tool to recover all-atom structures from coarse-grained molecular simulations with higher robustness and fewer steric clashes than previous approaches. We make FlowBack freely available to the community as an open source Python package.
Collapse
Affiliation(s)
- Michael S Jones
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Smayan Khanna
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
- Department of Chemistry, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
5
|
Roy TB, Sarma SP. Insights into the solution structure and transcriptional regulation of the MazE9 antitoxin in Mycobacterium tuberculosis. Proteins 2025; 93:176-196. [PMID: 37737533 DOI: 10.1002/prot.26589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 08/21/2023] [Accepted: 09/07/2023] [Indexed: 09/23/2023]
Abstract
The present study endeavors to decode the details of the transcriptional autoregulation effected by the MazE9 antitoxin of the Mycobacterium tuberculosis MazEF9 toxin-antitoxin system. Regulation of this bicistronic operon at the level of transcription is a critical biochemical process that is key for the organism's stress adaptation and virulence. Here, we have reported the solution structure of the DNA binding domain of MazE9 and scrutinized the thermodynamic and kinetic parameters operational in its interaction with the promoter/operator region, specific to the mazEF9 operon. A HADDOCK model of MazE9 bound to its operator DNA has been calculated based on the information on interacting residues obtained from these studies. The thermodynamics and kinetics of the interaction of MazE9 with the functionally related mazEF6 operon indicate that the potential for intracellular cross-regulation is unlikely. An interesting feature of MazE9 is the cis ⇌ trans conformational isomerization of proline residues in the intrinsically disordered C-terminal domain of this antitoxin.
Collapse
Affiliation(s)
- Tanaya Basu Roy
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Siddhartha P Sarma
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| |
Collapse
|
6
|
Harini K, Sekijima M, Gromiha MM. Bioinformatics Approaches for Understanding the Binding Affinity of Protein-Nucleic Acid Complexes. Methods Mol Biol 2025; 2867:315-330. [PMID: 39576589 DOI: 10.1007/978-1-0716-4196-5_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2024]
Abstract
Protein-nucleic acid interactions are involved in various biological processes such as gene expression, replication, transcription, translation, and packaging. Understanding the recognition mechanism of the protein-nucleic acid complexes has been investigated from different perspectives, including the binding affinities of protein-DNA and protein-RNA complexes. Experimentally, protein-nucleic acid interactions are analyzed using X-ray crystallography, Isothermal Titration Calorimetry (ITC), DNA/RNA pull-down assays, DNA/RNA footprinting, and systematic evolution of ligands by exponential enrichment (SELEX). On the other hand, numerous databases and computational tools have been developed to study protein-nucleic acid complexes based on their binding sites, specific interactions between them, and binding affinity. In this chapter, we discuss various databases for protein-nucleic acid complex structures and the tools available to extract features from them. Further, we provide details on databases and prediction methods reported for exploring the binding affinity of protein-nucleic acid complexes along with important structure-based parameters, which govern the binding affinity.
Collapse
Affiliation(s)
- K Harini
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Masakazu Sekijima
- Department of Computer Science, Tokyo Institute of Technology, Yokohama, Japan
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India.
- International Research Frontiers Initiative, School of Computing, Tokyo Institute of Technology, Yokohama, Japan.
| |
Collapse
|
7
|
Weissmann F, Greiwe JF, Pühringer T, Eastwood EL, Couves EC, Miller TCR, Diffley JFX, Costa A. MCM double hexamer loading visualized with human proteins. Nature 2024; 636:499-508. [PMID: 39604733 PMCID: PMC11634765 DOI: 10.1038/s41586-024-08263-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 10/22/2024] [Indexed: 11/29/2024]
Abstract
Eukaryotic DNA replication begins with the loading of the MCM replicative DNA helicase as a head-to-head double hexamer at origins of DNA replication1-3. Our current understanding of how the double hexamer is assembled by the origin recognition complex (ORC), CDC6 and CDT1 comes mostly from budding yeast. Here we characterize human double hexamer (hDH) loading using biochemical reconstitution and cryo-electron microscopy with purified proteins. We show that the human double hexamer engages DNA differently from the yeast double hexamer (yDH), and generates approximately five base pairs of underwound DNA at the interface between hexamers, as seen in hDH isolated from cells4. We identify several differences from the yeast double hexamer in the order of factor recruitment and dependencies during hDH assembly. Unlike in yeast5-8, the ORC6 subunit of the ORC is not essential for initial MCM recruitment or hDH loading, but contributes to an alternative hDH assembly pathway that requires an intrinsically disordered region in ORC1, which may work through a MCM-ORC intermediate. Our work presents a detailed view of how double hexamers are assembled in an organism that uses sequence-independent replication origins, provides further evidence for diversity in eukaryotic double hexamer assembly mechanisms9, and represents a first step towards reconstitution of DNA replication initiation with purified human proteins.
Collapse
Affiliation(s)
- Florian Weissmann
- Chromosome Replication Laboratory, The Francis Crick Institute, London, UK
| | - Julia F Greiwe
- Macromolecular Machines Laboratory, The Francis Crick Institute, London, UK
| | - Thomas Pühringer
- Macromolecular Machines Laboratory, The Francis Crick Institute, London, UK
| | - Evelyn L Eastwood
- Chromosome Replication Laboratory, The Francis Crick Institute, London, UK
| | - Emma C Couves
- Macromolecular Machines Laboratory, The Francis Crick Institute, London, UK
| | - Thomas C R Miller
- Macromolecular Machines Laboratory, The Francis Crick Institute, London, UK
- DNRF Center for Chromosome Stability, Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | - John F X Diffley
- Chromosome Replication Laboratory, The Francis Crick Institute, London, UK.
| | - Alessandro Costa
- Macromolecular Machines Laboratory, The Francis Crick Institute, London, UK.
| |
Collapse
|
8
|
Xu X, Yang H, Dong H, Li X, Liu Q, Feng Y. Characterization of argonaute nucleases from mesophilic bacteria Pseudobutyrivibrio ruminis. BIORESOUR BIOPROCESS 2024; 11:94. [PMID: 39373873 PMCID: PMC11458871 DOI: 10.1186/s40643-024-00797-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 08/14/2024] [Indexed: 10/08/2024] Open
Abstract
Mesophilic Argonautes (Agos) from microbial resources have received significant attention due to their potential applications in genome editing and molecular diagnostics. This study characterizes a novel Ago from Pseudobutyrivibrio ruminis (PrAgo), which can cleave single-stranded DNA using guide DNA (gDNA). PrAgo, functioning as a multi-turnover enzyme, effectively cleaves DNA using 5'-phosphate gDNA, 14-30 nucleotides in length, in the presence of both Mn2+ and Mg2+ ions. PrAgo demonstrates DNA cleavage activity over a broad pH range (pH 4-12), with optimal activity at pH 11. As a mesophilic enzyme, PrAgo cleaves efficiently DNA at temperatures ranging from 25 to 65 °C, particularly at 65 °C. PrAgo does not show strong preferences for the 5'-nucleotide in gDNA. It shows high tolerance for single-base mismatches, except at positions 13 and 15 of gDNA. Continuous double-nucleotide mismatches at positions 10-16 of gDNA significantly reduce cleavage activity. Furthermore, PrAgo mediates DNA-guided DNA cleavage of AT-rich double stranded DNA at 65 °C. Additionally, molecular dynamic simulations suggest that interactions between the PAZ domain and different nucleic acids strongly influence cleavage efficiency. These findings expand our understanding of Protokaryotic Agos and their potential applications in biotechnology.
Collapse
Affiliation(s)
- Xiaoyi Xu
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China
| | - Hao Yang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China
| | - Huarong Dong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China
| | - Xiao Li
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China
| | - Qian Liu
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China.
| | - Yan Feng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China.
| |
Collapse
|
9
|
P P, Riyaz A, Choudhury A, Choudhury PR, Pradhan N, Singh A, Nakul M, Dudeja C, Yadav A, Nath SK, Khanna V, Sharma T, Pradhan G, Takkar S, Rawal K. DNASCANNER v2: A Web-Based Tool to Analyze the Characteristic Properties of Nucleotide Sequences. J Comput Biol 2024; 31:651-669. [PMID: 38662479 DOI: 10.1089/cmb.2023.0227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2024] Open
Abstract
Throughout the process of evolution, DNA undergoes the accumulation of distinct mutations, which can often result in highly organized patterns that serve various essential biological functions. These patterns encompass various genomic elements and provide valuable insights into the regulatory and functional aspects of DNA. The physicochemical, mechanical, thermodynamic, and structural properties of DNA sequences play a crucial role in the formation of specific patterns. These properties contribute to the three-dimensional structure of DNA and influence their interactions with proteins, regulatory elements, and other molecules. In this study, we introduce DNASCANNER v2, an advanced version of our previously published algorithm DNASCANNER for analyzing DNA properties. The current tool is built using the FLASK framework in Python language. Featuring a user-friendly interface tailored for nonspecialized researchers, it offers an extensive analysis of 158 DNA properties, including mono/di/trinucleotide frequencies, structural, physicochemical, thermodynamics, and mechanical properties of DNA sequences. The tool provides downloadable results and offers interactive plots for easy interpretation and comparison between different features. We also demonstrate the utility of DNASCANNER v2 in analyzing splice-site junctions, casposon insertion sequences, and transposon insertion sites (TIS) within the bacterial and human genomes, respectively. We also developed a deep learning module for the prediction of potential TIS in a given nucleotide sequence. In the future, we aim to optimize the performance of this prediction model through extensive training on larger data sets.
Collapse
Affiliation(s)
- Preeti P
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Azeen Riyaz
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Alakto Choudhury
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Priyanka Ray Choudhury
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Nischal Pradhan
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Abhishek Singh
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Mihir Nakul
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Chhavi Dudeja
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Abhijeet Yadav
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Swarsat Kaushik Nath
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Vrinda Khanna
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Trapti Sharma
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Gayatri Pradhan
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Simran Takkar
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| | - Kamal Rawal
- Center for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
| |
Collapse
|
10
|
Chen Q, Chuai G, Zhang H, Tang J, Duan L, Guan H, Li W, Li W, Wen J, Zuo E, Zhang Q, Liu Q. Genome-wide CRISPR off-target prediction and optimization using RNA-DNA interaction fingerprints. Nat Commun 2023; 14:7521. [PMID: 37980345 PMCID: PMC10657421 DOI: 10.1038/s41467-023-42695-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 10/19/2023] [Indexed: 11/20/2023] Open
Abstract
The powerful CRISPR genome editing system is hindered by its off-target effects, and existing computational tools achieved limited performance in genome-wide off-target prediction due to the lack of deep understanding of the CRISPR molecular mechanism. In this study, we propose to incorporate molecular dynamics (MD) simulations in the computational analysis of CRISPR system, and present CRISOT, an integrated tool suite containing four related modules, i.e., CRISOT-FP, CRISOT-Score, CRISOT-Spec, CRISORT-Opti for RNA-DNA molecular interaction fingerprint generation, genome-wide CRISPR off-target prediction, sgRNA specificity evaluation and sgRNA optimization of Cas9 system respectively. Our comprehensive computational and experimental tests reveal that CRISOT outperforms existing tools with extensive in silico validations and proof-of-concept experimental validations. In addition, CRISOT shows potential in accurately predicting off-target effects of the base editors and prime editors, indicating that the derived RNA-DNA molecular interaction fingerprint captures the underlying mechanisms of RNA-DNA interaction among distinct CRISPR systems. Collectively, CRISOT provides an efficient and generalizable framework for genome-wide CRISPR off-target prediction, evaluation and sgRNA optimization for improved targeting specificity in CRISPR genome editing.
Collapse
Affiliation(s)
- Qinchang Chen
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department of Tongji Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
| | - Guohui Chuai
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Haihang Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Gene Editing Technologies (Hainan), Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Jin Tang
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
| | - Liwen Duan
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
| | - Huan Guan
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
| | - Wenhui Li
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
| | - Wannian Li
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department of Tongji Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Jiaying Wen
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department of Tongji Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Erwei Zuo
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Gene Editing Technologies (Hainan), Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Qing Zhang
- Roche R&D Center (China) Ltd., China Innovation Center of Roche, Shanghai, 201203, China.
- Ailomics Therapeutics, Shanghai, 201203, China.
| | - Qi Liu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department of Tongji Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Frontier Science Center for Stem Cell Research, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China.
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China.
| |
Collapse
|
11
|
Jones MS, Shmilovich K, Ferguson AL. DiAMoNDBack: Diffusion-Denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces. J Chem Theory Comput 2023; 19:7908-7923. [PMID: 37906711 DOI: 10.1021/acs.jctc.3c00840] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long time scales, such as aggregation and folding. The reduced resolution realizes computational accelerations, but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of restoring all-atom resolution to coarse-grained molecular models. In this work, we report DiAMoNDBack (Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping) as an autoregressive denoising diffusion probability model to restore all-atom details to coarse-grained protein representations retaining only Cα coordinates. The autoregressive generation process proceeds from the protein N-terminus to C-terminus in a residue-by-residue fashion conditioned on the Cα trace and previously backmapped backbone and side-chain atoms within the local neighborhood. The local and autoregressive nature of our model makes it transferable between proteins. The stochastic nature of the denoising diffusion process means that the model generates a realistic ensemble of backbone and side-chain all-atom configurations consistent with the coarse-grained Cα trace. We train DiAMoNDBack over 65k+ structures from the Protein Data Bank (PDB) and validate it in applications to a hold-out PDB test set, intrinsically disordered protein structures from the Protein Ensemble Database (PED), molecular dynamics simulations of fast-folding mini-proteins from DE Shaw Research, and coarse-grained simulation data. We achieve state-of-the-art reconstruction performance in terms of correct bond formation, avoidance of side-chain clashes, and the diversity of the generated side-chain configurational states. We make the DiAMoNDBack model publicly available as a free and open-source Python package.
Collapse
Affiliation(s)
- Michael S Jones
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Kirill Shmilovich
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
12
|
Santiago-Frangos A, Henriques WS, Wiegand T, Gauvin CC, Buyukyoruk M, Graham AB, Wilkinson RA, Triem L, Neselu K, Eng ET, Lander GC, Wiedenheft B. Structure reveals why genome folding is necessary for site-specific integration of foreign DNA into CRISPR arrays. Nat Struct Mol Biol 2023; 30:1675-1685. [PMID: 37710013 PMCID: PMC10872659 DOI: 10.1038/s41594-023-01097-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 08/15/2023] [Indexed: 09/16/2023]
Abstract
Bacteria and archaea acquire resistance to viruses and plasmids by integrating fragments of foreign DNA into the first repeat of a CRISPR array. However, the mechanism of site-specific integration remains poorly understood. Here, we determine a 560-kDa integration complex structure that explains how Pseudomonas aeruginosa Cas (Cas1-Cas2/3) and non-Cas proteins (for example, integration host factor) fold 150 base pairs of host DNA into a U-shaped bend and a loop that protrude from Cas1-2/3 at right angles. The U-shaped bend traps foreign DNA on one face of the Cas1-2/3 integrase, while the loop places the first CRISPR repeat in the Cas1 active site. Both Cas3 proteins rotate 100 degrees to expose DNA-binding sites on either side of the Cas2 homodimer, which each bind an inverted repeat motif in the leader. Leader sequence motifs direct Cas1-2/3-mediated integration to diverse repeat sequences that have a 5'-GT. Collectively, this work reveals new DNA-binding surfaces on Cas2 that are critical for DNA folding and site-specific delivery of foreign DNA.
Collapse
Affiliation(s)
| | - William S Henriques
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Tanner Wiegand
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Colin C Gauvin
- Department of Chemistry and Biochemistry, Montana State University, Bozeman, MT, USA
- Thermal Biology Institute, Montana State University, Bozeman, MT, USA
| | - Murat Buyukyoruk
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Ava B Graham
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Royce A Wilkinson
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Lenny Triem
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Kasahun Neselu
- Simons Electron Microscopy Center, National Resource for Automated Molecular Microscopy, New York Structural Biology Center, New York, NY, USA
| | - Edward T Eng
- Simons Electron Microscopy Center, National Resource for Automated Molecular Microscopy, New York Structural Biology Center, New York, NY, USA
| | - Gabriel C Lander
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, USA
| | - Blake Wiedenheft
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA.
| |
Collapse
|
13
|
Park J, Herrmann GK, Mitchell PG, Sherman MB, Yin YW. Polγ coordinates DNA synthesis and proofreading to ensure mitochondrial genome integrity. Nat Struct Mol Biol 2023; 30:812-823. [PMID: 37202477 PMCID: PMC10920075 DOI: 10.1038/s41594-023-00980-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 03/28/2023] [Indexed: 05/20/2023]
Abstract
Accurate replication of mitochondrial DNA (mtDNA) by DNA polymerase γ (Polγ) is essential for maintaining cellular energy supplies, metabolism, and cell cycle control. To illustrate the structural mechanism for Polγ coordinating polymerase (pol) and exonuclease (exo) activities to ensure rapid and accurate DNA synthesis, we determined four cryo-EM structures of Polγ captured after accurate or erroneous incorporation to a resolution of 2.4-3.0 Å. The structures show that Polγ employs a dual-checkpoint mechanism to sense nucleotide misincorporation and initiate proofreading. The transition from replication to error editing is accompanied by increased dynamics in both DNA and enzyme, in which the polymerase relaxes its processivity and the primer-template DNA unwinds, rotates, and backtracks to shuttle the mismatch-containing primer terminus 32 Å to the exo site for editing. Our structural and functional studies also provide a foundation for analyses of Polγ mutation-induced human diseases and aging.
Collapse
Affiliation(s)
- Joon Park
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, USA
| | - Geoffrey K Herrmann
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, USA
| | - Patrick G Mitchell
- Division of CryoEM and Bioimaging, Stanford Synchrotron Radiation Lightsource, Stanford Linear Accelerator Center National Accelerator Laboratory, Stanford University, Menlo Park, CA, USA
| | - Michael B Sherman
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, USA
| | - Y Whitney Yin
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA.
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, USA.
- Department of Pharmacology and Toxicology, University of Texas Medical Branch, Galveston, TX, USA.
| |
Collapse
|
14
|
Kennedy MA, Hosford CJ, Azumaya CM, Luyten YA, Chen M, Morgan RD, Stoddard BL. Structures, activity and mechanism of the Type IIS restriction endonuclease PaqCI. Nucleic Acids Res 2023; 51:4467-4487. [PMID: 36987874 PMCID: PMC10201449 DOI: 10.1093/nar/gkad228] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 03/10/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023] Open
Abstract
Type IIS restriction endonucleases contain separate DNA recognition and catalytic domains and cleave their substrates at well-defined distances outside their target sequences. They are employed in biotechnology for a variety of purposes, including the creation of gene-targeting zinc finger and TAL effector nucleases and DNA synthesis applications such as Golden Gate assembly. The most thoroughly studied Type IIS enzyme, FokI, has been shown to require multimerization and engagement with multiple DNA targets for optimal cleavage activity; however, details of how it or similar enzymes forms a DNA-bound reaction complex have not been described at atomic resolution. Here we describe biochemical analyses of DNA cleavage by the Type IIS PaqCI restriction endonuclease and a series of molecular structures in the presence and absence of multiple bound DNA targets. The enzyme displays a similar tetrameric organization of target recognition domains in the absence or presence of bound substrate, with a significant repositioning of endonuclease domains in a trapped DNA-bound complex that is poised to deliver the first of a series of double-strand breaks. PaqCI and FokI share similar structural mechanisms of DNA cleavage, but considerable differences in their domain organization and quaternary architecture, facilitating comparisons between distinct Type IIS enzymes.
Collapse
Affiliation(s)
- Madison A Kennedy
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle,WA 98109, USA
| | | | - Caleigh M Azumaya
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle,WA 98109, USA
| | - Yvette A Luyten
- New England Biolabs, 240 County Road, Ipswich, MA 01938, USA
| | - Minyong Chen
- New England Biolabs, 240 County Road, Ipswich, MA 01938, USA
| | | | - Barry L Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle,WA 98109, USA
| |
Collapse
|
15
|
Smiley AT, Tompkins KJ, Pawlak MR, Krueger AJ, Evans RL, Shi K, Aihara H, Gordon WR. Watson-Crick Base-Pairing Requirements for ssDNA Recognition and Processing in Replication-Initiating HUH Endonucleases. mBio 2023; 14:e0258722. [PMID: 36541758 PMCID: PMC9973303 DOI: 10.1128/mbio.02587-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
Replication-initiating HUH endonucleases (Reps) are sequence-specific nucleases that cleave and rejoin single-stranded DNA (ssDNA) during rolling-circle replication. These functions are mediated by covalent linkage of the Rep to its substrate post cleavage. Here, we describe the structures of the endonuclease domain from the Muscovy duck circovirus Rep in complex with its cognate ssDNA 10-mer with and without manganese in the active site. Structural and functional analyses demonstrate that divalent cations play both catalytic and structural roles in Reps by polarizing and positioning their substrate. Further structural comparisons highlight the importance of an intramolecular substrate Watson-Crick (WC) base pairing between the -4 and +1 positions. Subsequent kinetic and functional analyses demonstrate a functional dependency on WC base pairing between these positions regardless of the pair's identity (i.e., A·T, T·A, G·C, or C·G), highlighting a structural specificity for substrate interaction. Finally, considering how well WC swaps were tolerated in vitro, we sought to determine to what extent the canonical -4T·+1A pairing is conserved in circular Rep-encoding single-stranded DNA viruses and found evidence of noncanonical pairings in a minority of these genomes. Altogether, our data suggest that substrate intramolecular WC base pairing is a universal requirement for separation and reunion of ssDNA in Reps. IMPORTANCE Circular Rep-encoding single-stranded DNA (CRESS-DNA) viruses are a ubiquitous group of viruses that infect organisms across all domains of life. These viruses negatively impact both agriculture and human health. All members of this viral family employ a multifunctional nuclease (Rep) to initiate replication. Reps are structurally similar throughout this family, making them targets of interest for viral inhibition strategies. Here, we investigate the functional dependencies of the Rep protein from Muscovy duck circovirus for ssDNA interaction. We demonstrate that this Rep requires an intramolecular Watson-Crick base pairing for origin of replication (Ori) recognition and interaction. We show that noncognate base pair swaps are well tolerated, highlighting a local structural specificity over sequence specificity. Bioinformatic analysis found that the vast majority of CRESS-DNA Oris form base pairs in conserved positions, suggesting this pairing is a universal requirement for replication initiation in the CRESS-DNA virus family.
Collapse
Affiliation(s)
- Adam T. Smiley
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Kassidy J. Tompkins
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Matthew R. Pawlak
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - August J. Krueger
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Robert L. Evans
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Ke Shi
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Hideki Aihara
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Wendy R. Gordon
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
16
|
Kang K, Wang L, Song C. ProtRAP: Predicting Lipid Accessibility Together with Solvent Accessibility of Proteins in One Run. J Chem Inf Model 2023; 63:1058-1065. [PMID: 36693122 DOI: 10.1021/acs.jcim.2c01235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Solvent accessibility has been extensively used to characterize and predict the chemical properties of the surface residues of soluble proteins. However, there is not yet a widely accepted quantity of the same dimension for the study of lipid-accessible residues of membrane proteins. In this study, we propose that lipid accessibility, defined in a similar way to solvent accessibility, can be used to characterize the lipid-accessible residues of membrane proteins. Moreover, we developed a deep learning-based method, ProtRAP (Protein Relative Accessibility Predictor), to predict the relative lipid accessibility and relative solvent accessibility of residues from a given protein sequence, which can infer which residues are likely accessible to lipids, accessible to solvent, or buried in the protein interior in one run.
Collapse
Affiliation(s)
- Kai Kang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China.,Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China
| | - Lei Wang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China.,Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China
| | - Chen Song
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China.,Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing100871, China
| |
Collapse
|
17
|
Physicochemical models of protein-DNA binding with standard and modified base pairs. Proc Natl Acad Sci U S A 2023; 120:e2205796120. [PMID: 36656856 PMCID: PMC9942898 DOI: 10.1073/pnas.2205796120] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein-DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson-Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k-mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson-Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level.
Collapse
|
18
|
Sharma N, Thompson MK, Arrington JF, Terry DM, Chakravarthy S, Prevelige PE, Prakash A. Novel interaction interfaces mediate the interaction between the NEIL1 DNA glycosylase and mitochondrial transcription factor A. Front Cell Dev Biol 2022; 10:893806. [PMID: 35938152 PMCID: PMC9354671 DOI: 10.3389/fcell.2022.893806] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 07/01/2022] [Indexed: 02/01/2023] Open
Abstract
The maintenance of human mitochondrial DNA (mtDNA) is critical for proper cellular function as damage to mtDNA, if left unrepaired, can lead to a diverse array of pathologies. Of the pathways identified to participate in DNA repair within the mitochondria, base excision repair (BER) is the most extensively studied. Protein-protein interactions drive the step-by-step coordination required for the successful completion of this pathway and are important for crosstalk with other mitochondrial factors involved in genome maintenance. Human NEIL1 is one of seven DNA glycosylases that initiates BER in both the nuclear and mitochondrial compartments. In the current work, we scrutinized the interaction between NEIL1 and mitochondrial transcription factor A (TFAM), a protein that is essential for various aspects of mtDNA metabolism. We note, for the first time, that both the N- and C- terminal domains of NEIL1 interact with TFAM revealing a unique NEIL1 protein-binding interface. The interaction between the two proteins, as observed biochemically, appears to be transient and is most apparent at concentrations of low salt. The presence of DNA (or RNA) also positively influences the interaction between the two proteins, and molar mass estimates indicate that duplex DNA is required for complex formation at higher salt concentrations. Hydrogen deuterium exchange mass spectrometry data reveal that both proteins exchange less deuterium upon DNA binding, indicative of an interaction, and the addition of NEIL1 to the TFAM-DNA complex alters the interaction landscape. The transcriptional activity of TFAM appears to be independent of NEIL1 expression under normal cellular conditions, however, in the presence of DNA damage, we observe a significant reduction in the mRNA expression of TFAM-transcribed mitochondrial genes in the absence of NEIL1. Overall, our data indicate that the interaction between NEIL1 and TFAM can be modulated by local environment such as salt concentrations, protein availability, the presence of nucleic acids, as well as the presence of DNA damage.
Collapse
Affiliation(s)
- Nidhi Sharma
- Department of Biochemistry and Molecular Biology, Mitchell Cancer Institute, University of South Alabama, Mobile, AL, United States
| | - Marlo K. Thompson
- Department of Biochemistry and Molecular Biology, Mitchell Cancer Institute, University of South Alabama, Mobile, AL, United States
| | - Jennifer F. Arrington
- Department of Biochemistry and Molecular Biology, Mitchell Cancer Institute, University of South Alabama, Mobile, AL, United States
| | - Dava M. Terry
- Department of Biochemistry and Molecular Biology, Mitchell Cancer Institute, University of South Alabama, Mobile, AL, United States
| | - Srinivas Chakravarthy
- Advanced Photon Source, Illinois Institute of Technology, Chicago, IL, United States
| | - Peter E. Prevelige
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, AL, United States
| | - Aishwarya Prakash
- Department of Biochemistry and Molecular Biology, Mitchell Cancer Institute, University of South Alabama, Mobile, AL, United States
| |
Collapse
|
19
|
Developing Community Resources for Nucleic Acid Structures. Life (Basel) 2022; 12:life12040540. [PMID: 35455031 PMCID: PMC9031032 DOI: 10.3390/life12040540] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 03/28/2022] [Accepted: 03/31/2022] [Indexed: 01/14/2023] Open
Abstract
In this review, we describe the creation of the Nucleic Acid Database (NDB) at Rutgers University and how it became a testbed for the current infrastructure of the RCSB Protein Data Bank. We describe some of the special features of the NDB and how it has been used to enable research. Plans for the next phase as the Nucleic Acid Knowledgebase (NAKB) are summarized.
Collapse
|
20
|
Shi H, Kimsey IJ, Gu S, Liu HF, Pham U, Schumacher MA, Al-Hashimi HM. Revealing A-T and G-C Hoogsteen base pairs in stressed protein-bound duplex DNA. Nucleic Acids Res 2021; 49:12540-12555. [PMID: 34792150 PMCID: PMC8643651 DOI: 10.1093/nar/gkab936] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 09/25/2021] [Accepted: 11/14/2021] [Indexed: 11/17/2022] Open
Abstract
Watson–Crick base pairs (bps) are the fundamental unit of genetic information and the building blocks of the DNA double helix. However, A-T and G-C can also form alternative ‘Hoogsteen’ bps, expanding the functional complexity of DNA. We developed ‘Hoog-finder’, which uses structural fingerprints to rapidly screen Hoogsteen bps, which may have been mismodeled as Watson–Crick in crystal structures of protein–DNA complexes. We uncovered 17 Hoogsteen bps, 7 of which were in complex with 6 proteins never before shown to bind Hoogsteen bps. The Hoogsteen bps occur near mismatches, nicks and lesions and some appear to participate in recognition and damage repair. Our results suggest a potentially broad role for Hoogsteen bps in stressed regions of the genome and call for a community-wide effort to identify these bps in current and future crystal structures of DNA and its complexes.
Collapse
Affiliation(s)
- Honglue Shi
- Department of Chemistry, Duke University, Durham, NC 27710, USA
| | - Isaac J Kimsey
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Stephanie Gu
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Hsuan-Fu Liu
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Uyen Pham
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Maria A Schumacher
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Hashim M Al-Hashimi
- Department of Chemistry, Duke University, Durham, NC 27710, USA.,Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| |
Collapse
|
21
|
Alves CC, Donadi EA, Giuliatti S. Structural Characterization of the Interaction of Hypoxia Inducible Factor-1 with Its Hypoxia Responsive Element at the -964G > A Variation Site of the HLA-G Promoter Region. Int J Mol Sci 2021; 22:ijms222313046. [PMID: 34884849 PMCID: PMC8657931 DOI: 10.3390/ijms222313046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 11/08/2021] [Accepted: 11/12/2021] [Indexed: 11/16/2022] Open
Abstract
Human Antigen Leukocyte-G (HLA-G) gene encodes an immune checkpoint molecule that has restricted tissue expression in physiological conditions; however, the gene may be induced in hypoxic conditions by the interaction with the hypoxia inducible factor-1 (HIF1). Hypoxia regulatory elements (HRE) located at the HLA-G promoter region and at exon 2 are the major HIF1 target sites. Since the G allele of the −964G > A transversion induces higher HLA-G expression when compared to the A allele in hypoxic conditions, here we analyzed HIF1-HRE complex interaction at the pair-atom level considering both −964G > A polymorphism alleles. Mouse HIF2 dimer crystal (Protein Data Bank ID: 4ZPK) was used as template to perform homology modelling of human HIF1 quaternary structure using MODELLER v9.14. Two 3D DNA structures were built from 5′GCRTG’3 HRE sequence containing the −964G/A alleles using x3DNA. Protein-DNA docking was performed using the HADDOCK v2.4 server, and non-covalent bonds were computed by DNAproDB server. Molecular dynamic simulation was carried out per 200 ns, using Gromacs v.2019. HIF1 binding in the HRE containing −964G allele results in more hydrogen bonds and van der Waals contact formation than HRE with −964A allele. Protein-DNA complex trajectory analysis revealed that HIF1-HRE-964G complex is more stable. In conclusion, HIF1 binds in a more stable and specific manner at the HRE with G allele.
Collapse
Affiliation(s)
- Cinthia C. Alves
- Department of Genetic, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto 14049-900, Brazil;
| | - Eduardo A. Donadi
- Department of Medicine, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto 14049-900, Brazil;
| | - Silvana Giuliatti
- Department of Genetic, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto 14049-900, Brazil;
- Correspondence:
| |
Collapse
|
22
|
Marcos-Torres FJ, Maurer D, Juniar L, Griese JJ. The bacterial iron sensor IdeR recognizes its DNA targets by indirect readout. Nucleic Acids Res 2021; 49:10120-10135. [PMID: 34417623 PMCID: PMC8464063 DOI: 10.1093/nar/gkab711] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 07/19/2021] [Accepted: 08/02/2021] [Indexed: 01/11/2023] Open
Abstract
The iron-dependent regulator IdeR is the main transcriptional regulator controlling iron homeostasis genes in Actinobacteria, including species from the Corynebacterium, Mycobacterium and Streptomyces genera, as well as the erythromycin-producing bacterium Saccharopolyspora erythraea. Despite being a well-studied transcription factor since the identification of the Diphtheria toxin repressor DtxR three decades ago, the details of how IdeR proteins recognize their highly conserved 19-bp DNA target remain to be elucidated. IdeR makes few direct contacts with DNA bases in its target sequence, and we show here that these contacts are not required for target recognition. The results of our structural and mutational studies support a model wherein IdeR mainly uses an indirect readout mechanism, identifying its targets via the sequence-dependent DNA backbone structure rather than through specific contacts with the DNA bases. Furthermore, we show that IdeR efficiently recognizes a shorter palindromic sequence corresponding to a half binding site as compared to the full 19-bp target previously reported, expanding the number of potential target genes controlled by IdeR proteins.
Collapse
Affiliation(s)
| | - Dirk Maurer
- Department of Cell and Molecular Biology, Uppsala University, SE-751 24 Uppsala, Sweden
| | - Linda Juniar
- Department of Cell and Molecular Biology, Uppsala University, SE-751 24 Uppsala, Sweden
| | - Julia J Griese
- Department of Cell and Molecular Biology, Uppsala University, SE-751 24 Uppsala, Sweden
| |
Collapse
|
23
|
Hashmi HF, Waseem M, Ali SS, Hussain Z, Chen K. Structural and Biophysical Investigation of the Key Hotspots on the Surface of Epstein-Barr Nuclear Antigen 1 Essential for DNA Recognition and Pathogenesis. Front Mol Biosci 2021; 8:664436. [PMID: 34268333 PMCID: PMC8275655 DOI: 10.3389/fmolb.2021.664436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 06/14/2021] [Indexed: 11/30/2022] Open
Abstract
Epstein-Barr Virus (EBV) is considered the most important human pathogen due to its role in infections and cellular malignancies. It has been reported that this Oncolytic virus infects 90% world’s population. EBNA1 is required for DNA binding and survival of the virus and is considered an essential drug target. The biochemical and structural properties of this protein are known, but it is still unclear which residues impart a critical role in the recognition of dsDNA. Intending to disclose only the essential residues in recognition of dsDNA, this study used a computational pipeline to generate an alanine mutant of each interacting residue and determine the impact on the binding. Our analysis revealed that R469A, K514A, Y518A, R521A and R522A are the key hotspots for the recognition of dsDNA by the EBNA1. The dynamics properties, i.e. stability, flexibility, structural compactness, hydrogen bonding frequency, binding affinity, are altered by disrupting the protein-DNA contacts, thereby decreases the binding affinity. In particular, the two arginine substitution, R521A and R522A, significantly affected the total binding energy. Thus, we hypothesize that these residues impart a critical role in the dsDNA recognition and pathogenesis. This study would help to design structure-based drugs against the EBV infections.
Collapse
Affiliation(s)
| | - Muhammad Waseem
- Faculty of Rehabilitation and Allied Health Science, Riphah International University, Islamabad, Pakistan
| | - Syed Shujait Ali
- Center for Biotechnology and Microbiology, University of Swat, Swat, Pakistan
| | - Zahid Hussain
- Center for Biotechnology and Microbiology, University of Swat, Swat, Pakistan
| | - Kaoshan Chen
- College of Life Sciences, Shandong University, Jinan, China
| |
Collapse
|
24
|
Huesa J, Giner-Lamia J, Pucciarelli MG, Paredes-Martínez F, García-del Portillo F, Marina A, Casino P. Structure-based analyses of Salmonella RcsB variants unravel new features of the Rcs regulon. Nucleic Acids Res 2021; 49:2357-2374. [PMID: 33638994 PMCID: PMC7913699 DOI: 10.1093/nar/gkab060] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 01/13/2021] [Accepted: 01/26/2021] [Indexed: 12/13/2022] Open
Abstract
RcsB is a transcriptional regulator that controls expression of numerous genes in enteric bacteria. RcsB accomplishes this role alone or in combination with auxiliary transcriptional factors independently or dependently of phosphorylation. To understand the mechanisms by which RcsB regulates such large number of genes, we performed structural studies as well as in vitro and in vivo functional studies with different RcsB variants. Our structural data reveal that RcsB binds promoters of target genes such as rprA and flhDC in a dimeric active conformation. In this state, the RcsB homodimer docks the DNA-binding domains into the major groove of the DNA, facilitating an initial weak read-out of the target sequence. Interestingly, comparative structural analyses also show that DNA binding may stabilize an active conformation in unphosphorylated RcsB. Furthermore, RNAseq performed in strains expressing wild-type or several RcsB variants provided new insights into the contribution of phosphorylation to gene regulation and assign a potential role of RcsB in controlling iron metabolism. Finally, we delimited the RcsB box for homodimeric active binding to DNA as the sequence TN(G/A)GAN4TC(T/C)NA. This RcsB box was found in promoter, intergenic and intragenic regions, facilitating both increased or decreased gene transcription.
Collapse
Affiliation(s)
- Juanjo Huesa
- Departamento de Bioquímica y Biología Molecular, Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain.,Instituto universitario de Biotecnologia i Biomedicina (BIOTECMED), Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain
| | - Joaquín Giner-Lamia
- Laboratorio de Patógenos Bacterianos Intracelulares. Centro Nacional de Biotecnología (CNB)-CSIC. Darwin 3, 28049 Madrid. Spain.,Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Campus Montegancedo, E-28223 Pozuelo de Alarcón, Madrid, Spain.,Departamento de Biotecnología y Biología Vegetal, ETSI Agronómica, Alimentaria y de Biosistemas, Universidad Politócnica de Madrid, 28040 Madrid, Spain
| | - M Graciela Pucciarelli
- Laboratorio de Patógenos Bacterianos Intracelulares. Centro Nacional de Biotecnología (CNB)-CSIC. Darwin 3, 28049 Madrid. Spain.,Centro de Biología Molecular 'Severo Ochoa' (CBMSO)-CSIC. Departamento de Biología Molecular. Universidad Autónoma de Madrid, Madrid, Spain
| | - Francisco Paredes-Martínez
- Departamento de Bioquímica y Biología Molecular, Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain.,Instituto universitario de Biotecnologia i Biomedicina (BIOTECMED), Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain
| | - Francisco García-del Portillo
- Laboratorio de Patógenos Bacterianos Intracelulares. Centro Nacional de Biotecnología (CNB)-CSIC. Darwin 3, 28049 Madrid. Spain
| | - Alberto Marina
- Department of Genomic and Proteomic, Instituto de Biomedicina de Valencia (IBV-CSIC), Jaume Roig 11, 46010 Valencia, Spain.,Group 739 of the Centro de Investigación Biomédica en Red sobre Enfermedades Raras (CIBERER) del Instituto de Salud Carlos III, Spain
| | - Patricia Casino
- Departamento de Bioquímica y Biología Molecular, Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain.,Instituto universitario de Biotecnologia i Biomedicina (BIOTECMED), Universitat de València. Dr Moliner 50, 46100 Burjassot, Spain.,Group 739 of the Centro de Investigación Biomédica en Red sobre Enfermedades Raras (CIBERER) del Instituto de Salud Carlos III, Spain
| |
Collapse
|
25
|
Lin M, Malik FK, Guo JT. A comparative study of protein-ssDNA interactions. NAR Genom Bioinform 2021; 3:lqab006. [PMID: 33655206 PMCID: PMC7902235 DOI: 10.1093/nargab/lqab006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 11/24/2020] [Accepted: 01/26/2021] [Indexed: 12/18/2022] Open
Abstract
Single-stranded DNA-binding proteins (SSBs) play crucial roles in DNA replication, recombination and repair, and serve as key players in the maintenance of genomic stability. While a number of SSBs bind single-stranded DNA (ssDNA) non-specifically, the others recognize and bind specific ssDNA sequences. The mechanisms underlying this binding discrepancy, however, are largely unknown. Here, we present a comparative study of protein-ssDNA interactions by annotating specific and non-specific SSBs and comparing structural features such as DNA-binding propensities and secondary structure types of residues in SSB-ssDNA interactions, protein-ssDNA hydrogen bonding and π-π interactions between specific and non-specific SSBs. Our results suggest that protein side chain-DNA base hydrogen bonds are the major contributors to protein-ssDNA binding specificity, while π-π interactions may mainly contribute to binding affinity. We also found the enrichment of aspartate in the specific SSBs, a key feature in specific protein-double-stranded DNA (dsDNA) interactions as reported in our previous study. In addition, no significant differences between specific and non-specific groups with respect of conformational changes upon ssDNA binding were found, suggesting that the flexibility of SSBs plays a lesser role than that of dsDNA-binding proteins in conferring binding specificity.
Collapse
Affiliation(s)
- Maoxuan Lin
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Fareeha K Malik
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, USA
- Research Center of Modeling and Simulation, National University of Science and Technology, Islamabad, 44000, Pakistan
| | - Jun-tao Guo
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
26
|
Tompkins KJ, Houtti M, Litzau LA, Aird EJ, Everett BA, Nelson AT, Pornschloegl L, Limón-Swanson LK, Evans RL, Evans K, Shi K, Aihara H, Gordon WR. Molecular underpinnings of ssDNA specificity by Rep HUH-endonucleases and implications for HUH-tag multiplexing and engineering. Nucleic Acids Res 2021; 49:1046-1064. [PMID: 33410911 PMCID: PMC7826260 DOI: 10.1093/nar/gkaa1248] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 12/08/2020] [Accepted: 12/14/2020] [Indexed: 12/11/2022] Open
Abstract
Replication initiator proteins (Reps) from the HUH-endonuclease superfamily process specific single-stranded DNA (ssDNA) sequences to initiate rolling circle/hairpin replication in viruses, such as crop ravaging geminiviruses and human disease causing parvoviruses. In biotechnology contexts, Reps are the basis for HUH-tag bioconjugation and a critical adeno-associated virus genome integration tool. We solved the first co-crystal structures of Reps complexed to ssDNA, revealing a key motif for conferring sequence specificity and for anchoring a bent DNA architecture. In combination, we developed a deep sequencing cleavage assay, termed HUH-seq, to interrogate subtleties in Rep specificity and demonstrate how differences can be exploited for multiplexed HUH-tagging. Together, our insights allowed engineering of only four amino acids in a Rep chimera to predictably alter sequence specificity. These results have important implications for modulating viral infections, developing Rep-based genomic integration tools, and enabling massively parallel HUH-tag barcoding and bioconjugation applications.
Collapse
Affiliation(s)
- Kassidy J Tompkins
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Mo Houtti
- Department of Computer Science and Engineering, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Lauren A Litzau
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Eric J Aird
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Blake A Everett
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Andrew T Nelson
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Leland Pornschloegl
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Lidia K Limón-Swanson
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Robert L Evans
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Karen Evans
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Ke Shi
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Hideki Aihara
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| | - Wendy R Gordon
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota Twin Cities, Minneapolis, MN 55455, USA
| |
Collapse
|
27
|
Gharouni M, Mosaddeghi H, Mehrzad J, Es-Haghi A, Motavalizadehkakhky A. In silico profiling and structural insights of zinc metal ion on O6-methylguanine methyl transferase and its interactions using molecular dynamics approach. J Mol Model 2021; 27:40. [PMID: 33454889 DOI: 10.1007/s00894-020-04631-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 12/01/2020] [Indexed: 10/22/2022]
Abstract
O6-methylguanine DNA methyl transferase (MGMT) is a metalloenzyme participating in the repair of alkylated DNA. In this research, we performed a comparative study for evaluating the impact of zinc metal ion on the behavior and interactions of MGMT in the both enzymatic forms of apo MGMT and holo MGMT. DNA and proliferating cell nuclear antigen (PCNA), as partners of MGMT, were utilized to evaluate molecular interactions by virtual microscopy of molecular dynamics simulation. The stability and conformational alterations of each forms (apo and holo) MGMT-PCNA, and (apo and holo) MGMT-DNA complexes were calculated by MM/PBSA method. A total of seven systems including apo MGMT, holo MGMT, free PCNA, apo MGMT-PCNA, holo MGMT-PCNA, apo MGMT-DNA, and holo MGMT-DNA complexes were simulated. In this study, we found that holo MGMT was more stable and had better folding and functional properties than that of apo MGMT. Simulation analysis of (apo and holo) MGMT-PCNA complexes displayed that the sequences of the amino acids involved in the interactions were different in the two forms of MGMT. The important amino acids of holo MGMT involved in its interaction with PCNA included E92, K101, A119, G122, N123, P124, and K125, whereas the important amino acids of apo MGMT included R128, R135, S152, N157, Y158, and L162. Virtual microscopy of molecular dynamics simulation showed that the R128 and its surrounding residues were important amino acids involved in the interaction of holo MGMT with DNA that was exactly consistent with X-ray crystallography structure. In the apo form of the protein, the N157 and its surrounding residues were important amino acids involved in the interaction with DNA. The binding free energies of - 387.976, - 396.226, - 622.227, and - 617.333 kcal/mol were obtained for holo MGMT-PCNA, apo MGMT-PCNA, holo MGMT-DNA, and apo MGMT-DNA complexes, respectively. The principle result of this research was that the area of molecular interactions differed between the two states of MGMT. Therefore, in investigations of metalloproteins, the metal ion must be preserved in their structures. Finally, it is recommended to use the holo form of metalloproteins in in vitro and in silico researches.
Collapse
Affiliation(s)
- Marzieh Gharouni
- Department of Biochemistry, Neyshabur Branch, Islamic Azad University, Neyshabur, Iran
| | - Hamid Mosaddeghi
- Department of Chemistry, Isfahan University of Technology, Isfahan, Iran. .,Department of Chemistry, Neyshabur Branch, Islamic Azad University, Neyshabur, Iran.
| | - Jamshid Mehrzad
- Department of Biochemistry, Neyshabur Branch, Islamic Azad University, Neyshabur, Iran.
| | - Ali Es-Haghi
- Department of Biology, Mashhad Branch, Islamic Azad University, Mashhad, Iran
| | | |
Collapse
|
28
|
Mallona I, Ilie IM, Karemaker ID, Butz S, Manzo M, Caflisch A, Baubec T. Flanking sequence preference modulates de novo DNA methylation in the mouse genome. Nucleic Acids Res 2021; 49:145-157. [PMID: 33290556 PMCID: PMC7797059 DOI: 10.1093/nar/gkaa1168] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 10/22/2020] [Accepted: 11/16/2020] [Indexed: 12/13/2022] Open
Abstract
Mammalian de novo DNA methyltransferases (DNMT) are responsible for the establishment of cell-type-specific DNA methylation in healthy and diseased tissues. Through genome-wide analysis of de novo methylation activity in murine stem cells we uncover that DNMT3A prefers to methylate CpGs followed by cytosines or thymines, while DNMT3B predominantly methylates CpGs followed by guanines or adenines. These signatures are further observed at non-CpG sites, resembling methylation context observed in specialised cell types, including neurons and oocytes. We further show that these preferences result from structural differences in the catalytic domains of the two de novo DNMTs and are not a consequence of differential recruitment to the genome. Molecular dynamics simulations suggest that, in case of human DNMT3A, the preference is due to favourable polar interactions between the flexible Arg836 side chain and the guanine that base-pairs with the cytosine following the CpG. By exchanging arginine to a lysine, the corresponding side chain in DNMT3B, the sequence preference is reversed, confirming the requirement for arginine at this position. This context-dependent enzymatic activity provides additional insights into the complex regulation of DNA methylation patterns.
Collapse
Affiliation(s)
- Izaskun Mallona
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich 8057, Switzerland
- Department of Molecular Life Sciences, University of Zurich, Zurich 8057, Switzerland
- SIB Swiss Institute of Bioinformatics, Zurich 8057, Switzerland
| | - Ioana Mariuca Ilie
- Department of Biochemistry, University of Zurich, Zurich 8057, Switzerland
| | - Ino Dominiek Karemaker
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich 8057, Switzerland
| | - Stefan Butz
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich 8057, Switzerland
- Life Science Zurich Graduate School, University of Zurich, Zurich 8057, Switzerland
| | - Massimiliano Manzo
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich 8057, Switzerland
- Life Science Zurich Graduate School, University of Zurich, Zurich 8057, Switzerland
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zurich, Zurich 8057, Switzerland
| | - Tuncay Baubec
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich 8057, Switzerland
| |
Collapse
|
29
|
Lara-Gonzalez S, Dantas Machado AC, Rao S, Napoli AA, Birktoft J, Di Felice R, Rohs R, Lawson CL. The RNA Polymerase α Subunit Recognizes the DNA Shape of the Upstream Promoter Element. Biochemistry 2020; 59:4523-4532. [PMID: 33205945 DOI: 10.1021/acs.biochem.0c00571] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
We demonstrate here that the α subunit C-terminal domain of Escherichia coli RNA polymerase (αCTD) recognizes the upstream promoter (UP) DNA element via its characteristic minor groove shape and electrostatic potential. In two compositionally distinct crystallized assemblies, a pair of αCTD subunits bind in tandem to the UP element consensus A-tract that is 6 bp in length (A6-tract), each with their arginine 265 guanidinium group inserted into the minor groove. The A6-tract minor groove is significantly narrowed in these crystal structures, as well as in computationally predicted structures of free and bound DNA duplexes derived by Monte Carlo and molecular dynamics simulations, respectively. The negative electrostatic potential of free A6-tract DNA is substantially enhanced compared to that of generic DNA. Shortening the A-tract by 1 bp is shown to "knock out" binding of the second αCTD through widening of the minor groove. Furthermore, in computationally derived structures with arginine 265 mutated to alanine in either αCTD, either with or without the "knockout" DNA mutation, contact with the DNA is perturbed, highlighting the importance of arginine 265 in achieving αCTD-DNA binding. These results demonstrate that the importance of the DNA shape in sequence-dependent recognition of DNA by RNA polymerase is comparable to that of certain transcription factors.
Collapse
Affiliation(s)
- Samuel Lara-Gonzalez
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, New Jersey 08854, United States
| | - Ana Carolina Dantas Machado
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Satyanarayan Rao
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Andrew A Napoli
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, New Jersey 08854, United States
| | - Jens Birktoft
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, New Jersey 08854, United States
| | - Rosa Di Felice
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States.,Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90089, United States.,CNR-NANO Modena, Via Campi 213/A, 41125 Modena, Italy
| | - Remo Rohs
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States.,Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90089, United States.,Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States.,Department of Computer Science, University of Southern California, Los Angeles, California 90089, United States
| | - Catherine L Lawson
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, New Jersey 08854, United States.,Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, New Jersey 08854, United States
| |
Collapse
|
30
|
Schnepf M, von Reutern M, Ludwig C, Jung C, Gaul U. Transcription Factor Binding Affinities and DNA Shape Readout. iScience 2020; 23:101694. [PMID: 33163946 PMCID: PMC7607496 DOI: 10.1016/j.isci.2020.101694] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 09/30/2020] [Accepted: 10/13/2020] [Indexed: 12/16/2022] Open
Abstract
An essential event in gene regulation is the binding of a transcription factor (TF) to its target DNA. Models considering the interactions between the TF and the DNA geometry proved to be successful approaches to describe this binding event, while conserving data interpretability. However, a direct characterization of the DNA shape contribution to binding is still missing due to the lack of accurate and large-scale binding affinity data. Here, we use a binding assay we recently established to measure with high sensitivity the binding specificities of 13 Drosophila TFs, including dinucleotide dependencies to capture non-independent amino acid-base interactions. Correlating the binding affinities with all DNA shape features, we find that shape readout is widely used by these factors. A shape readout/TF-DNA complex structure analysis validates our approach while providing biological insights such as positively charged or highly polar amino acids often contact nucleotides that exhibit strong shape readout. The DNA shape contribution to Drosophila TFs-DNA binding is directly characterized Zeroth- and first-order TF-DNA binding specificities are measured with high accuracy DNA shape readout is widely used by these TFs A shape readout/structural correlation analysis provides biological insights
Collapse
Affiliation(s)
- Max Schnepf
- Gene Center and Department of Biochemistry, Center for Protein Science Munich (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 München, Germany
| | - Marc von Reutern
- Gene Center and Department of Biochemistry, Center for Protein Science Munich (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 München, Germany
| | - Claudia Ludwig
- Gene Center and Department of Biochemistry, Center for Protein Science Munich (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 München, Germany
| | - Christophe Jung
- Gene Center and Department of Biochemistry, Center for Protein Science Munich (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 München, Germany
| | - Ulrike Gaul
- Gene Center and Department of Biochemistry, Center for Protein Science Munich (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 München, Germany
| |
Collapse
|
31
|
Adam K, Gyorgypal Z, Hegedus Z. DNA Readout Viewer (DRV): visualization of specificity determining patterns of protein-binding DNA segments. Bioinformatics 2020; 36:2286-2287. [PMID: 31793988 PMCID: PMC7141859 DOI: 10.1093/bioinformatics/btz906] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Revised: 11/27/2019] [Accepted: 11/29/2019] [Indexed: 11/14/2022] Open
Abstract
Summary The sequence specific recognition of DNA by regulatory proteins typically occurs by establishing hydrogen bonds and non-bonded contacts between chemical sub-structures of nucleotides and amino acids forming the compatible interacting surfaces. The recognition process is also influenced by the physicochemical and conformational character of the target oligonucleotide motif. Although the role of these mechanisms in DNA-protein interactions is well-established, bioinformatical methods rarely address them directly, instead binding specificity is mostly assessed at nucleotide level. DNA Readout Viewer (DRV) aims to provide a novel DNA representation, facilitating in-depth view into these mechanisms by the concurrent visualization of functional groups and a diverse collection of DNA descriptors. By applying its intuitive representation concept for various DNA recognition related visualization tasks, DRV can contribute to unravelling the binding specificity factors of DNA-protein interactions. Availability and implementation DRV is freely available at https://drv.brc.hu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Krisztian Adam
- Institute of Biophysics, Biological Research Centre, H-6726 Szeged, Hungary
| | - Zoltan Gyorgypal
- Institute of Biophysics, Biological Research Centre, H-6726 Szeged, Hungary
| | - Zoltan Hegedus
- Institute of Biophysics, Biological Research Centre, H-6726 Szeged, Hungary.,Department of Biochemistry and Medical Chemistry, Medical School, University of Pécs, H-7622 Pécs, Hungary
| |
Collapse
|
32
|
Crystal Structures of Ternary Complexes of MEF2 and NKX2-5 Bound to DNA Reveal a Disease Related Protein-Protein Interaction Interface. J Mol Biol 2020; 432:5499-5508. [PMID: 32681840 DOI: 10.1016/j.jmb.2020.07.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 07/08/2020] [Accepted: 07/09/2020] [Indexed: 11/22/2022]
Abstract
MEF2 and NKX2-5 transcription factors interact with each other in cardiogenesis and are necessary for normal heart formation. Despite evidence suggesting that these two transcription factors function synergistically and possibly through direct physical interactions, molecular mechanisms by which they interact are not clear. Here we determined the crystal structures of ternary complexes of MEF2 and NKX2-5 bound to myocardin enhancer DNA in two crystal forms. These crystal structures are the first example of human MADS-box/homeobox ternary complex structures involved in cardiogenesis. Our structures reveal two possible modes of interactions between MEF2 and NKX2-5: MEF2 and NKX bind to adjacent DNA sites to recognize DNA in cis; and MEF2 and NKX bind to different DNA strands to interact with each other in trans via a conserved protein-protein interface observed in both crystal forms. Disease-related mutations are mapped to the observed protein-protein interface. Our structural studies provide a starting point to understand and further study the molecular mechanisms of the interactions between MEF2 and NKX2.5 and their roles in cardiogenesis.
Collapse
|
33
|
Dantas Machado AC, Cooper BH, Lei X, Di Felice R, Chen L, Rohs R. Landscape of DNA binding signatures of myocyte enhancer factor-2B reveals a unique interplay of base and shape readout. Nucleic Acids Res 2020; 48:8529-8544. [PMID: 32738045 PMCID: PMC7470950 DOI: 10.1093/nar/gkaa642] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 07/16/2020] [Accepted: 07/22/2020] [Indexed: 01/08/2023] Open
Abstract
Myocyte enhancer factor-2B (MEF2B) has the unique capability of binding to its DNA target sites with a degenerate motif, while still functioning as a gene-specific transcriptional regulator. Identifying its DNA targets is crucial given regulatory roles exerted by members of the MEF2 family and MEF2B's involvement in B-cell lymphoma. Analyzing structural data and SELEX-seq experimental results, we deduced the DNA sequence and shape determinants of MEF2B target sites on a high-throughput basis in vitro for wild-type and mutant proteins. Quantitative modeling of MEF2B binding affinities and computational simulations exposed the DNA readout mechanisms of MEF2B. The resulting binding signature of MEF2B revealed distinct intricacies of DNA recognition compared to other transcription factors. MEF2B uses base readout at its half-sites combined with shape readout at the center of its degenerate motif, where A-tract polarity dictates nuances of binding. The predominant role of shape readout at the center of the core motif, with most contacts formed in the minor groove, differs from previously observed protein-DNA readout modes. MEF2B, therefore, represents a unique protein for studies of the role of DNA shape in achieving binding specificity. MEF2B-DNA recognition mechanisms are likely representative for other members of the MEF2 family.
Collapse
Affiliation(s)
- Ana Carolina Dantas Machado
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Brendon H Cooper
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Xiao Lei
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Rosa Di Felice
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Physics & Astronomy, University of Southern California, Los Angeles, CA 90089, USA
| | - Lin Chen
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
| | - Remo Rohs
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Department of Physics & Astronomy, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
34
|
Khan A, Tahir Khan M, Saleem S, Junaid M, Ali A, Shujait Ali S, Khan M, Wei DQ. Structural insights into the mechanism of RNA recognition by the N-terminal RNA-binding domain of the SARS-CoV-2 nucleocapsid phosphoprotein. Comput Struct Biotechnol J 2020; 18:2174-2184. [PMID: 32837710 PMCID: PMC7419326 DOI: 10.1016/j.csbj.2020.08.006] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/05/2020] [Accepted: 08/05/2020] [Indexed: 01/18/2023] Open
Abstract
The emergence of recent SARS-CoV-2 has become a global health issue. This single-stranded positive-sense RNA virus is continuously spreading with increasing morbidities and mortalities. The proteome of this virus contains four structural and sixteen nonstructural proteins that ensure the replication of the virus in the host cell. However, the role of phosphoprotein (N) in RNA recognition, replicating, transcribing the viral genome, and modulating the host immune response is indispensable. Recently, the NMR structure of the N-terminal domain of the Nucleocapsid Phosphoprotein has been reported, but its precise structural mechanism of how the ssRNA interacts with it is not reported yet. Therefore, here, we have used an integrated computational pipeline to identify the key residues, which play an essential role in RNA recognition. We generated multiple variants by using an alanine scanning strategy and performed an extensive simulation for each system to signify the role of each interfacial residue. Our analyses suggest that residues T57A, H59A, S105A, R107A, F171A, and Y172A significantly affected the dynamics and binding of RNA. Furthermore, per-residue energy decomposition analysis suggests that residues T57, H59, S105 and R107 are the key hotspots for drug discovery. Thus, these residues may be useful as potential pharmacophores in drug designing.
Collapse
Affiliation(s)
- Abbas Khan
- State Key Lab of Microbial Metabolism, Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Muhammad Tahir Khan
- Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan
| | - Shoaib Saleem
- National Center for Bioinformatics, Quaid-i-Azam University, 45320 Islamabad, Pakistan
| | - Muhammad Junaid
- State Key Lab of Microbial Metabolism, Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Arif Ali
- State Key Lab of Microbial Metabolism, Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Syed Shujait Ali
- Center for Biotechnology and Microbiology, University of Swat, Swat, Khyber Pakhtunkhwa, Pakistan
| | - Mazhar Khan
- The CAS Key Laboratory of Innate Immunity and Chronic Diseases, Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, CAS Center for Excellence in Molecular Cell Science, University of Science and Technology of China (USTC), Collaborative Innovation Center of Genetics and Development, Hefei 230027, Anhui, China
| | - Dong-Qing Wei
- State Key Lab of Microbial Metabolism, Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, China
- Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nashan District, Shenzhen, Guangdong 518055, China
| |
Collapse
|
35
|
Lu XJ. DSSR-enabled innovative schematics of 3D nucleic acid structures with PyMOL. Nucleic Acids Res 2020; 48:e74. [PMID: 32442277 PMCID: PMC7367123 DOI: 10.1093/nar/gkaa426] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Revised: 04/26/2020] [Accepted: 05/07/2020] [Indexed: 12/11/2022] Open
Abstract
Sophisticated analysis and simplified visualization are crucial for understanding complicated structures of biomacromolecules. DSSR (Dissecting the Spatial Structure of RNA) is an integrated computational tool that has streamlined the analysis and annotation of 3D nucleic acid structures. The program creates schematic block representations in diverse styles that can be seamlessly integrated into PyMOL and complement its other popular visualization options. In addition to portraying individual base blocks, DSSR can draw Watson-Crick pairs as long blocks and highlight the minor-groove edges. Notably, DSSR can dramatically simplify the depiction of G-quadruplexes by automatically detecting G-tetrads and treating them as large square blocks. The DSSR-enabled innovative schematics with PyMOL are aesthetically pleasing and highly informative: the base identity, pairing geometry, stacking interactions, double-helical stems, and G-quadruplexes are immediately obvious. These features can be accessed via four interfaces: the command-line interface, the DSSR plugin for PyMOL, the web application, and the web application programming interface. The supplemental PDF serves as a practical guide, with complete and reproducible examples. Thus, even beginners or occasional users can get started quickly, especially via the web application at http://skmatic.x3dna.org.
Collapse
Affiliation(s)
- Xiang-Jun Lu
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| |
Collapse
|
36
|
Kribelbauer JF, Lu XJ, Rohs R, Mann RS, Bussemaker HJ. Toward a Mechanistic Understanding of DNA Methylation Readout by Transcription Factors. J Mol Biol 2019:S0022-2836(19)30617-5. [PMID: 31689433 DOI: 10.1016/j.jmb.2019.10.021] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Revised: 10/23/2019] [Accepted: 10/24/2019] [Indexed: 01/09/2023]
Abstract
Epigenetic DNA modification impacts gene expression, but the underlying molecular mechanisms are only partly understood. Adding a methyl group to a cytosine base locally modifies the structural features of DNA in multiple ways, which may change the interaction with DNA-binding transcription factors (TFs) and trigger a cascade of downstream molecular events. Cells can be probed using various functional genomics assays, but it is difficult to disentangle the confounded effects of DNA modification on TF binding, chromatin accessibility, intranuclear variation in local TF concentration, and rate of transcription. Here we discuss how high-throughput in vitro profiling of protein-DNA interactions has enabled comprehensive characterization and quantification of the methylation sensitivity of TFs. Despite the limited structural data for DNA containing methylated cytosine, automated analysis of structural information in the Protein Data Bank (PDB) shows how 5-methylcytosine (5mC) can be recognized in various ways by amino acid side chains. We discuss how a context-dependent effect of methylation on DNA groove geometry can affect DNA binding by homeodomain proteins and how principled modeling of ChIP-seq data can overcome the confounding that makes the interpretation of in vivo data challenging. The emerging picture is that epigenetic modifications affect TF binding in a highly context-specific manner, with a direction and effect size that depend critically on their position within the TF binding site and the amino acid sequence of the TF. With this improved mechanistic knowledge, we have come closer to understanding how cells use DNA modification to acquire, retain, and change their identity.
Collapse
Affiliation(s)
- Judith F Kribelbauer
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Xiang-Jun Lu
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Remo Rohs
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA; Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA; Department of Physics & Astronomy, University of Southern California, Los Angeles, CA 90089, USA; Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Richard S Mann
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA; Department of Neuroscience, Columbia University, New York, NY 10027, USA
| | - Harmen J Bussemaker
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA; Department of Systems Biology, Columbia University, New York, NY 10032, USA.
| |
Collapse
|