1
|
Rinaldi S, Moroni E, Rozza R, Magistrato A. Frontiers and Challenges of Computing ncRNAs Biogenesis, Function and Modulation. J Chem Theory Comput 2024; 20:993-1018. [PMID: 38287883 DOI: 10.1021/acs.jctc.3c01239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
Abstract
Non-coding RNAs (ncRNAs), generated from nonprotein coding DNA sequences, constitute 98-99% of the human genome. Non-coding RNAs encompass diverse functional classes, including microRNAs, small interfering RNAs, PIWI-interacting RNAs, small nuclear RNAs, small nucleolar RNAs, and long non-coding RNAs. With critical involvement in gene expression and regulation across various biological and physiopathological contexts, such as neuronal disorders, immune responses, cardiovascular diseases, and cancer, non-coding RNAs are emerging as disease biomarkers and therapeutic targets. In this review, after providing an overview of non-coding RNAs' role in cell homeostasis, we illustrate the potential and the challenges of state-of-the-art computational methods exploited to study non-coding RNAs biogenesis, function, and modulation. This can be done by directly targeting them with small molecules or by altering their expression by targeting the cellular engines underlying their biosynthesis. Drawing from applications, also taken from our work, we showcase the significance and role of computer simulations in uncovering fundamental facets of ncRNA mechanisms and modulation. This information may set the basis to advance gene modulation tools and therapeutic strategies to address unmet medical needs.
Collapse
Affiliation(s)
- Silvia Rinaldi
- National Research Council of Italy (CNR) - Institute of Chemistry of OrganoMetallic Compounds (ICCOM), c/o Area di Ricerca CNR di Firenze Via Madonna del Piano 10, 50019 Sesto Fiorentino, Florence, Italy
| | - Elisabetta Moroni
- National Research Council of Italy (CNR) - Institute of Chemical Sciences and Technologies (SCITEC), via Mario Bianco 9, 20131 Milano, Italy
| | - Riccardo Rozza
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| | - Alessandra Magistrato
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| |
Collapse
|
2
|
Singh M, Kumar S. Effect of single nucleotide polymorphisms on the structure of long noncoding RNAs and their interaction with RNA binding proteins. Biosystems 2023; 233:105021. [PMID: 37703988 DOI: 10.1016/j.biosystems.2023.105021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 07/25/2023] [Accepted: 09/06/2023] [Indexed: 09/15/2023]
Abstract
Long non-coding RNAs (lncRNA) are emerging as a new class of regulatory RNAs with remarkable potential to be utilized as therapeutic targets against many human diseases. Several genome-wide association studies (GWAS) have catalogued Single Nucleotide Polymorphisms (SNPs) present in the noncoding regions of the genome from where lncRNAs originate. In this study, we have selected 67 lncRNAs with GWAS-tagged SNPs and have also investigated their role in affecting the local secondary structures. Majority of the SNPs lead to changes in the secondary structure of lncRNAs to a different extent by altering the base pairing patterns. These structural changes in lncRNA are also manifested in form of alteration in the binding site for RNA binding proteins (RBPs) along with affecting their binding efficacies. Ultimately, these structural modifications may influence the transcriptional and post-transcriptional pathways of these RNAs, leading to the causation of diseases. Hence, it is important to understand the possible underlying mechanism of RBPs in association with GWAS-tagged SNPs in human diseases.
Collapse
Affiliation(s)
- Mandakini Singh
- Department of Life Science, National Institute of Technology, Rourkela, Odisha, 769008, India
| | - Santosh Kumar
- Department of Life Science, National Institute of Technology, Rourkela, Odisha, 769008, India.
| |
Collapse
|
3
|
Adelusi TI, Oyedele AQK, Boyenle ID, Ogunlana AT, Adeyemi RO, Ukachi CD, Idris MO, Olaoba OT, Adedotun IO, Kolawole OE, Xiaoxing Y, Abdul-Hammed M. Molecular modeling in drug discovery. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.100880] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
4
|
3D Modeling of Non-coding RNA Interactions. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1385:281-317. [DOI: 10.1007/978-3-031-08356-3_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
5
|
In silico structural analysis of truncated 2’ fluoro-RNA aptamer: Elucidating EGF-1 and EGF-2 binding domains on factor IX protein. Process Biochem 2021. [DOI: 10.1016/j.procbio.2021.10.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
6
|
Oulas A, Richter J, Zanti M, Tomazou M, Michailidou K, Christodoulou K, Christodoulou C, Spyrou GM. In depth analysis of Cyprus-specific mutations of SARS-CoV-2 strains using computational approaches. BMC Genom Data 2021; 22:48. [PMID: 34773976 PMCID: PMC8590444 DOI: 10.1186/s12863-021-01007-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 10/26/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND This study aims to characterize SARS-CoV-2 mutations which are primarily prevalent in the Cypriot population. Moreover, using computational approaches, we assess whether these mutations are associated with changes in viral virulence. METHODS We utilize genetic data from 144 sequences of SARS-CoV-2 strains from the Cypriot population obtained between March 2020 and January 2021, as well as all data available from GISAID. We combine this with countries' regional information, such as deaths and cases per million, as well as COVID-19-related public health austerity measure response times. Initial indications of selective advantage of Cyprus-specific mutations are obtained by mutation tracking analysis. This entails calculating specific mutation frequencies within the Cypriot population and comparing these with their prevalence world-wide throughout the course of the pandemic. We further make use of linear regression models to extrapolate additional information that may be missed through standard statistical analysis. RESULTS We report a single mutation found in the ORF1ab gene (nucleotide position 18,440) that appears to be significantly enriched within the Cypriot population. The amino acid change is denoted as S6059F, which maps to the SARS-CoV-2 NSP14 protein. We further analyse this mutation using regression models to investigate possible associations with increased deaths and cases per million. Moreover, protein structure prediction tools show that the mutation infers a conformational change to the protein that significantly alters its structure when compared to the reference protein. CONCLUSIONS Investigating Cyprus-specific mutations for SARS-CoV-2 can lead to a better understanding of viral pathogenicity. Researching these mutations can generate potential links between viral-specific mutations and the unique genomics of the Cypriot population. This can not only lead to important findings from which to battle the pandemic on a national level, but also provide insights into viral virulence worldwide.
Collapse
Affiliation(s)
- Anastasis Oulas
- Bioinformatics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus.
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus.
| | - Jan Richter
- Molecular Virology Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Maria Zanti
- Bioinformatics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- Biostatistics Unit, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Marios Tomazou
- Bioinformatics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- Neurogenetics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Kyriaki Michailidou
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- Biostatistics Unit, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Kyproula Christodoulou
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- Neurogenetics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Christina Christodoulou
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- Molecular Virology Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - George M Spyrou
- Bioinformatics Department, Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
| |
Collapse
|
7
|
Mias-Lucquin D, Chauvot de Beauchene I. Conformational variability in proteins bound to single-stranded DNA: A new benchmark for new docking perspectives. Proteins 2021; 90:625-631. [PMID: 34617336 PMCID: PMC9292434 DOI: 10.1002/prot.26258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 09/15/2021] [Accepted: 09/27/2021] [Indexed: 12/19/2022]
Abstract
We explored the Protein Data Bank (PDB) to collect protein-ssDNA structures and create a multi-conformational docking benchmark including both bound and unbound protein structures. Due to ssDNA high flexibility when not bound, no ssDNA unbound structure is included in the benchmark. For the 91 sequence-identity groups identified as bound-unbound structures of the same protein, we studied the conformational changes in the protein induced by the ssDNA binding. Moreover, based on several bound or unbound protein structures in some groups, we also assessed the intrinsic conformational variability in either bound or unbound conditions and compared it to the supposedly binding-induced modifications. To illustrate a use case of this benchmark, we performed docking experiments using ATTRACT docking software. This benchmark is, to our knowledge, the first one made to peruse available structures of ssDNA-protein interactions to such an extent, aiming to improve computational docking tools dedicated to this kind of molecular interactions.
Collapse
|
8
|
Feng Y, Zhang K, Wu Q, Huang SY. NLDock: a Fast Nucleic Acid-Ligand Docking Algorithm for Modeling RNA/DNA-Ligand Complexes. J Chem Inf Model 2021; 61:4771-4782. [PMID: 34468128 DOI: 10.1021/acs.jcim.1c00341] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Nucleic acid-ligand interactions play an important role in numerous cellular processes such as gene function expression and regulation. Therefore, nucleic acids such as RNAs have become more and more important drug targets, where the structural determination of nucleic acid-ligand complexes is pivotal for understanding their functions and thus developing therapeutic interventions. Molecular docking has been a useful computational tool in predicting the complex structure between molecules. However, although a number of docking algorithms have been developed for protein-ligand interactions, only a few docking programs were presented for nucleic acid-ligand interactions. Here, we have developed a fast nucleic acid-ligand docking algorithm, named NLDock, by implementing our intrinsic scoring function ITScoreNL for nucleic acid-ligand interactions into a modified version of the MDock program. NLDock was extensively evaluated on four test sets and compared with five other state-of-the-art docking algorithms including AutoDock, DOCK 6, rDock, GOLD, and Glide. It was shown that our NLDock algorithm obtained a significantly better performance than the other docking programs in binding mode predictions and achieved the success rates of 73%, 36%, and 32% on the largest test set of 77 complexes for local rigid-, local flexible-, and global flexible-ligand docking, respectively. In addition, our NLDock approach is also computationally efficient and consumed an average of as short as 0.97 and 2.08 min for a local flexible-ligand docking job and a global flexible-ligand docking job, respectively. These results suggest the good performance of our NLDock in both docking accuracy and computational efficiency.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Keqiong Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Qilong Wu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
9
|
Philip M, Chen T, Tyagi S. A Survey of Current Resources to Study lncRNA-Protein Interactions. Noncoding RNA 2021; 7:ncrna7020033. [PMID: 34201302 PMCID: PMC8293367 DOI: 10.3390/ncrna7020033] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 05/28/2021] [Accepted: 06/07/2021] [Indexed: 12/15/2022] Open
Abstract
Phenotypes are driven by regulated gene expression, which in turn are mediated by complex interactions between diverse biological molecules. Protein-DNA interactions such as histone and transcription factor binding are well studied, along with RNA-RNA interactions in short RNA silencing of genes. In contrast, lncRNA-protein interaction (LPI) mechanisms are comparatively unknown, likely directed by the difficulties in studying LPI. However, LPI are emerging as key interactions in epigenetic mechanisms, playing a role in development and disease. Their importance is further highlighted by their conservation across kingdoms. Hence, interest in LPI research is increasing. We therefore review the current state of the art in lncRNA-protein interactions. We specifically surveyed recent computational methods and databases which researchers can exploit for LPI investigation. We discovered that algorithm development is heavily reliant on a few generic databases containing curated LPI information. Additionally, these databases house information at gene-level as opposed to transcript-level annotations. We show that early methods predict LPI using molecular docking, have limited scope and are slow, creating a data processing bottleneck. Recently, machine learning has become the strategy of choice in LPI prediction, likely due to the rapid growth in machine learning infrastructure and expertise. While many of these methods have notable limitations, machine learning is expected to be the basis of modern LPI prediction algorithms.
Collapse
Affiliation(s)
- Melcy Philip
- School of Biological Sciences, Monash University, 25 Rainforest Walk, Clayton, VIC 3800, Australia; (M.P.); (T.C.)
| | - Tyrone Chen
- School of Biological Sciences, Monash University, 25 Rainforest Walk, Clayton, VIC 3800, Australia; (M.P.); (T.C.)
| | - Sonika Tyagi
- School of Biological Sciences, Monash University, 25 Rainforest Walk, Clayton, VIC 3800, Australia; (M.P.); (T.C.)
- Monash eResearch Centre, Monash University, Clayton, VIC 3800, Australia
- Department of Infectious Disease, Monash University (Alfred Campus), 85 Commercial Road, Melbourne, VIC 3004, Australia
- Correspondence:
| |
Collapse
|
10
|
Cai D, Han JDJ. Aging-associated lncRNAs are evolutionarily conserved and participate in NFκB signaling. NATURE AGING 2021; 1:438-453. [PMID: 37118014 DOI: 10.1038/s43587-021-00056-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 03/10/2021] [Indexed: 04/30/2023]
Abstract
The transcriptome undergoes global changes during aging, including both protein-coding and noncoding RNAs. Using comparative genomics, we identify aging-associated long noncoding RNAs (lncRNAs) that are under evolutionary constraint and are more conserved than lncRNAs that do not change with age. Aging-associated lncRNAs are enriched for functional elements, including binding sites for RNA-binding proteins and transcription factors, in particular nuclear factor kappa B (NFκB). Using CRISPR screening, we discovered that 13 of the aging-associated lncRNAs were regulators of the NFκB pathway, and we named this family 'NFκB modulating aging-related lncRNAs (NFKBMARLs)'. Further characterization of NFκBMARL-1 reveals it can be traced to 29 Ma before humans and is induced by NFκB during aging, inflammation and senescence. Reciprocally, NFκBMARL-1 directly regulates transcription of the NFκB inhibitor NFKBIZ in cis within the same topologically associated domain by binding to the NFKBIZ enhancer and recruiting RELA to the NFKBIZ promoter. These findings reveal many aging-associated lncRNAs are evolutionarily conserved components of the NFκB pathway.
Collapse
Affiliation(s)
- Donghong Cai
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jing-Dong J Han
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, China.
| |
Collapse
|
11
|
Oulas A, Zanti M, Tomazou M, Zachariou M, Minadakis G, Bourdakou MM, Pavlidis P, Spyrou GM. Generalized linear models provide a measure of virulence for specific mutations in SARS-CoV-2 strains. PLoS One 2021; 16:e0238665. [PMID: 33497392 PMCID: PMC7837476 DOI: 10.1371/journal.pone.0238665] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 01/06/2021] [Indexed: 01/01/2023] Open
Abstract
This study aims to highlight SARS-COV-2 mutations which are associated with increased or decreased viral virulence. We utilize genetic data from all strains available from GISAID and countries' regional information, such as deaths and cases per million, as well as COVID-19-related public health austerity measure response times. Initial indications of selective advantage of specific mutations can be obtained from calculating their frequencies across viral strains. By applying modelling approaches, we provide additional information that is not evident from standard statistics or mutation frequencies alone. We therefore, propose a more precise way of selecting informative mutations. We highlight two interesting mutations found in genes N (P13L) and ORF3a (Q57H). The former appears to be significantly associated with decreased deaths and cases per million according to our models, while the latter shows an opposing association with decreased deaths and increased cases per million. Moreover, protein structure prediction tools show that the mutations infer conformational changes to the protein that significantly alter its structure when compared to the reference protein.
Collapse
Affiliation(s)
- Anastasis Oulas
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
- * E-mail:
| | - Maria Zanti
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
| | - Marios Tomazou
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
| | - Margarita Zachariou
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
| | - George Minadakis
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
| | - Marilena M. Bourdakou
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
| | - Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Crete, Greece
| | - George M. Spyrou
- Bioinformatics Department, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
- The Cyprus School of Molecular Medicine, Nicosia, Cyprus
| |
Collapse
|
12
|
Abstract
Biological processes are often mediated by complexes formed between proteins and various biomolecules. The 3D structures of such protein-biomolecule complexes provide insights into the molecular mechanism of their action. The structure of these complexes can be predicted by various computational methods. Choosing an appropriate method for modelling depends on the category of biomolecule that a protein interacts with and the availability of structural information about the protein and its interacting partner. We intend for the contents of this chapter to serve as a guide as to what software would be the most appropriate for the type of data at hand and the kind of 3D complex structure required. Particularly, we have dealt with protein-small molecule ligand, protein-peptide, protein-protein, and protein-nucleic acid interactions.Most, if not all, model building protocols perform some sampling and scoring. Typically, several alternate conformations and configurations of the interactors are sampled. Each such sample is then scored for optimization. To boost the confidence in these predicted models, their assessment using other independent scoring schemes besides the inbuilt/default ones would prove to be helpful. This chapter also lists such software and serves as a guide to gauge the fidelity of modelled structures of biomolecular complexes.
Collapse
|
13
|
Feng Y, Huang SY. ITScore-NL: An Iterative Knowledge-Based Scoring Function for Nucleic Acid-Ligand Interactions. J Chem Inf Model 2020; 60:6698-6708. [PMID: 33291885 DOI: 10.1021/acs.jcim.0c00974] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Nucleic acid-ligand complexes underlie numerous cellular processes, such as gene function expression and regulation, in which their three-dimensional structures are important to understand their functions and thus to develop therapeutic interventions. Given the high cost and technical difficulties in experimental methods, computational methods such as molecular docking have been actively used to investigate nucleic acid-ligand interactions in which an accurate scoring function is crucial. However, because of the limited number of experimental nucleic acid-ligand binding data and structures, the scoring function development for nucleic acid-ligand interactions falls far behind that for protein-protein and protein-ligand interactions. Here, based on our statistical mechanics-based iterative approach, we have developed an iterative knowledge-based scoring function for nucleic acid-ligand interactions, named as ITScore-NL, by explicitly including stacking and electrostatic potentials. Our ITScore-NL scoring function was extensively evaluated for its ability in the binding mode and binding affinity predictions on three diverse test sets and compared with state-of-the-art scoring functions. Overall, ITScore-NL obtained significantly better performance than the other 12 scoring functions and predicted near-native poses with rmsd ≤ 1.5 Å for 71.43% of the cases when the top three binding modes were considered and a good correlation of R = 0.64 in binding affinity prediction on the large test set of 77 nucleic acid-ligand complexes. These results suggested the accuracy of ITScore-NL and the necessity of explicitly including stacking and electrostatic potentials.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
14
|
Chen J, Qi Y, Duan Y, Duan M, Yang M. C1188D mutation abolishes specific recognition between MLL1-CXXC domain and CpG site by inducing conformational switch of flexible N-terminal. Proteins 2020; 88:1401-1412. [PMID: 32519403 DOI: 10.1002/prot.25960] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 05/22/2020] [Accepted: 06/06/2020] [Indexed: 01/19/2023]
Abstract
Mixed lineage leukemia protein (MLL1 protein) recognizes the CpG site via its CXXC domain and is frequently associated with leukemia. The specific recognition is abolished by C1188D mutation, which also prevents MLL-related leukemia. In this paper, multiple molecular dynamic (MD) simulations were performed to investigate the mechanism of recognition and influences of C1188D mutation. Started from fully dissociated DNA and MLL1-CXXC domain, remarkably, the center of mass (COM) of MLL1-CXXC domain quickly concentrates on the vicinity of the CpG site in all 53 short MD simulations. Extended simulations of the wild type showed that the native complex formed in 500 ns among 4 of 53 simulations. In contrast, the C1188D mutant COM distributed broadly around the DNA and the native complex was not observed in any of the extended simulations. Simulations on the apo MLL1-CXXC domain further suggest that the wild type protein remained predominantly in an open form that closely resembles its structure in the native complex whereas C1188D mutant formed predominantly compact structures in which the N- terminal bends to D1188. This conformational switch hinders the formation of encounter complex, thus abolishes the recognition. Our study also provides clues to the study mechanism of recognition, by the CXXC domain from proteins like DNA methyltransferase and ten-eleven translocation enzymes.
Collapse
Affiliation(s)
- Jiawen Chen
- Key Laboratory of Magnetic Resonance in Biological Systems, National Center for Magnetic Resonances in Wuhan, State Key laboratory of Magnetic Resonance and Atomic and Molecular Physics, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China
| | - Yanping Qi
- Key Laboratory of Magnetic Resonance in Biological Systems, National Center for Magnetic Resonances in Wuhan, State Key laboratory of Magnetic Resonance and Atomic and Molecular Physics, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China.,College of Physical Science and Technology, Central China Normal University, Wuhan, China
| | - Yong Duan
- Department of Biomedical Engineering and UC Davis Genome Center, University of California at Davis, Davis, California, USA
| | - Mojie Duan
- Key Laboratory of Magnetic Resonance in Biological Systems, National Center for Magnetic Resonances in Wuhan, State Key laboratory of Magnetic Resonance and Atomic and Molecular Physics, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China
| | - Minghui Yang
- Key Laboratory of Magnetic Resonance in Biological Systems, National Center for Magnetic Resonances in Wuhan, State Key laboratory of Magnetic Resonance and Atomic and Molecular Physics, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China.,Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
15
|
Exploring the Binding Mechanism and Dynamics of EndoMS/NucS to Mismatched dsDNA. Int J Mol Sci 2019; 20:ijms20205142. [PMID: 31627318 PMCID: PMC6829318 DOI: 10.3390/ijms20205142] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 10/03/2019] [Accepted: 10/12/2019] [Indexed: 12/15/2022] Open
Abstract
The well-known mismatch repair (MMR) machinery, MutS/MutL, is absent in numerous Archaea and some Bacteria. Recent studies have shown that EndoMS/NucS has the ability to cleave double-stranded DNA (dsDNA) containing a mismatched base pair, which suggests a novel mismatch repair process. However, the recognition mechanism and the binding process of EndoMS/NucS in the MMR pathway remain unclear. In this study, we investigate the binding dynamics of EndoMS/NucS to mismatched dsDNA and its energy as a function of the angle between the two C-terminal domains of EndoMS/NucS, through molecular docking and extensive molecular dynamics (MD) simulations. It is found that there exists a half-open transition state corresponding to an energy barrier (at an activation angle of approximately 80∘) between the open state and the closed state, according to the energy curve. When the angle is larger than the activation angle, the C-terminal domains can move freely and tend to change to the open state (local energy minimum). Otherwise, the C-terminal domains will interact with the mismatched dsDNA directly and converge to the closed state at the global energy minimum. As such, this two-state system enables the exposed N-terminal domains of EndoMS/NucS to recognize mismatched dsDNA during the open state and then stabilize the binding of the C-terminal domains of EndoMS/NucS to the mismatched dsDNA during the closed state. We also investigate how the EndoMS/NucS recognizes and binds to mismatched dsDNA, as well as the effects of K+ ions. The results provide insights into the recognition and binding mechanisms of EndoMS/NucS to mismatched dsDNA in the MMR pathway.
Collapse
|