1
|
Krupa MA, Krupa P. Free-Docking and Template-Based Docking: Physics Versus Knowledge-Based Docking. Methods Mol Biol 2024; 2780:27-41. [PMID: 38987462 DOI: 10.1007/978-1-0716-3985-6_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Docking methods can be used to predict the orientations of two or more molecules with respect of each other using a plethora of various algorithms, which can be based on the physics of interactions or can use information from databases and templates. The usability of these approaches depends on the type and size of the molecules, whose relative orientation will be estimated. The two most important limitations are (i) the computational cost of the prediction and (ii) the availability of the structural information for similar complexes. In general, if there is enough information about similar systems, knowledge-based and template-based methods can significantly reduce the computational cost while providing high accuracy of the prediction. However, if the information about the system topology and interactions between its partners is scarce, physics-based methods are more reliable or even the only choice. In this chapter, knowledge-, template-, and physics-based methods will be compared and briefly discussed providing examples of their usability with a special emphasis on physics-based protein-protein, protein-peptide, and protein-fullerene docking in the UNRES coarse-grained model.
Collapse
Affiliation(s)
- Magdalena A Krupa
- Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
| | - Paweł Krupa
- Institute of Physics, Polish Academy of Sciences, Warsaw, Poland.
| |
Collapse
|
2
|
Abstract
Major histocompatibility complex (MHC) proteins are the most polymorphic and polygenic proteins in humans. They bind peptides, derived from cleavage of different pathogenic antigens, and are responsible for presenting them to T cells. The peptides recognized by the T cell receptors are denoted as epitopes and they trigger an immune response.In this chapter, we describe a docking protocol for predicting the peptide binding to a given MHC protein using the software tool GOLD. The protocol starts with the construction of a combinatorial peptide library used in the docking and ends with the derivation of a quantitative matrix (QM) accounting for the contribution of each amino acid at each peptide position.
Collapse
|
3
|
Adelusi TI, Oyedele AQK, Boyenle ID, Ogunlana AT, Adeyemi RO, Ukachi CD, Idris MO, Olaoba OT, Adedotun IO, Kolawole OE, Xiaoxing Y, Abdul-Hammed M. Molecular modeling in drug discovery. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.100880] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
4
|
Aderinwale T, Christoffer CW, Sarkar D, Alnabati E, Kihara D. Computational structure modeling for diverse categories of macromolecular interactions. Curr Opin Struct Biol 2020; 64:1-8. [PMID: 32599506 PMCID: PMC7665979 DOI: 10.1016/j.sbi.2020.05.017] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 05/06/2020] [Accepted: 05/21/2020] [Indexed: 01/23/2023]
Abstract
Computational protein-protein docking is one of the most intensively studied topics in structural bioinformatics. The field has made substantial progress through over three decades of development. The development began with methods for rigid-body docking of two proteins, which have now been extended in different directions to cover the various macromolecular interactions observed in a cell. Here, we overview the recent developments of the variations of docking methods, including multiple protein docking, peptide-protein docking, and disordered protein docking methods.
Collapse
Affiliation(s)
- Tunde Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA; Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| |
Collapse
|
5
|
He J, Tao H, Huang SY. Protein-ensemble-RNA docking by efficient consideration of protein flexibility through homology models. Bioinformatics 2020; 35:4994-5002. [PMID: 31086984 DOI: 10.1093/bioinformatics/btz388] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Revised: 04/28/2019] [Accepted: 05/03/2019] [Indexed: 12/18/2022] Open
Abstract
MOTIVATION Given the importance of protein-ribonucleic acid (RNA) interactions in many biological processes, a variety of docking algorithms have been developed to predict the complex structure from individual protein and RNA partners in the past decade. However, due to the impact of molecular flexibility, the performance of current methods has hit a bottleneck in realistic unbound docking. Pushing the limit, we have proposed a protein-ensemble-RNA docking strategy to explicitly consider the protein flexibility in protein-RNA docking through an ensemble of multiple protein structures, which is referred to as MPRDock. Instead of taking conformations from MD simulations or experimental structures, we obtained the multiple structures of a protein by building models from its homologous templates in the Protein Data Bank (PDB). RESULTS Our approach can not only avoid the reliability issue of structures from MD simulations but also circumvent the limited number of experimental structures for a target protein in the PDB. Tested on 68 unbound-bound and 18 unbound-unbound protein-RNA complexes, our MPRDock/DITScorePR considerably improved the docking performance and achieved a significantly higher success rate than single-protein rigid docking whether pseudo-unbound templates are included or not. Similar improvements were also observed when combining our ensemble docking strategy with other scoring functions. The present homology model-based ensemble docking approach will have a general application in molecular docking for other interactions. AVAILABILITY AND IMPLEMENTATION http://huanglab.phys.hust.edu.cn/mprdock/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jiahua He
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Huanyu Tao
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Sheng-You Huang
- Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| |
Collapse
|
6
|
Roel-Touris J, Bonvin AM. Coarse-grained (hybrid) integrative modeling of biomolecular interactions. Comput Struct Biotechnol J 2020; 18:1182-1190. [PMID: 32514329 PMCID: PMC7264466 DOI: 10.1016/j.csbj.2020.05.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 04/23/2020] [Accepted: 05/06/2020] [Indexed: 12/23/2022] Open
Abstract
The computational modeling field has vastly evolved over the past decades. The early developments of simplified protein systems represented a stepping stone towards establishing more efficient approaches to sample intricated conformational landscapes. Downscaling the level of resolution of biomolecules to coarser representations allows for studying protein structure, dynamics and interactions that are not accessible by classical atomistic approaches. The combination of different resolutions, namely hybrid modeling, has also been proved as an alternative when mixed levels of details are required. In this review, we provide an overview of coarse-grained/hybrid models focusing on their applicability in the modeling of biomolecular interactions. We give a detailed list of ready-to-use modeling software for studying biomolecular interactions allowing various levels of coarse-graining and provide examples of complexes determined by integrative coarse-grained/hybrid approaches in combination with experimental information.
Collapse
|
7
|
Glashagen G, de Vries S, Uciechowska-Kaczmarzyk U, Samsonov SA, Murail S, Tuffery P, Zacharias M. Coarse-grained and atomic resolution biomolecular docking with the ATTRACT approach. Proteins 2019; 88:1018-1028. [PMID: 31785163 DOI: 10.1002/prot.25860] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 11/20/2019] [Accepted: 11/27/2019] [Indexed: 01/17/2023]
Abstract
The ATTRACT protein-protein docking program has been employed to predict protein-protein complex structures in CAPRI rounds 38-45. For 11 out of 16 targets acceptable or better quality solutions have been submitted (~70%). It includes also several cases of peptide-protein docking and the successful prediction of the geometry of carbohydrate-protein interactions. The option of combining rigid body minimization and simultaneous optimization in collective degrees of freedom based on elastic network modes was employed and systematically evaluated. Application to a large benchmark set indicates a modest improvement in docking performance compared to rigid docking. Possible further improvements of the docking approach in particular at the scoring and the flexible refinement steps are discussed.
Collapse
Affiliation(s)
- Glenn Glashagen
- Physik-Department T38, Technische Universität München, Garching, Germany
| | - Sjoerd de Vries
- Université de Paris, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | | | | | - Samuel Murail
- Université de Paris, CNRS UMR 8251, INSERM ERL U1133, Paris, France
| | - Pierre Tuffery
- Université de Paris, CNRS UMR 8251, INSERM ERL U1133, Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Martin Zacharias
- Physik-Department T38, Technische Universität München, Garching, Germany
| |
Collapse
|
8
|
Honorato RV, Roel-Touris J, Bonvin AMJJ. MARTINI-Based Protein-DNA Coarse-Grained HADDOCKing. Front Mol Biosci 2019; 6:102. [PMID: 31632986 PMCID: PMC6779769 DOI: 10.3389/fmolb.2019.00102] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 09/17/2019] [Indexed: 11/13/2022] Open
Abstract
Modeling biomolecular assemblies is an important field in computational structural biology. The inherent complexity of their energy landscape and the computational cost associated with modeling large and complex assemblies are major drawbacks for integrative modeling approaches. The so-called coarse-graining approaches, which reduce the degrees of freedom of the system by grouping several atoms into larger “pseudo-atoms,” have been shown to alleviate some of those limitations, facilitating the identification of the global energy minima assumed to correspond to the native state of the complex, while making the calculations more efficient. Here, we describe and assess the implementation of the MARTINI force field for DNA into HADDOCK, our integrative modeling platform. We combine it with our previous implementation for protein-protein coarse-grained docking, enabling coarse-grained modeling of protein-nucleic acid complexes. The system is modeled using MARTINI topologies and interaction parameters during the rigid body docking and semi-flexible refinement stages of HADDOCK, and the resulting models are then converted back to atomistic resolution by an atom-to-bead distance restraints-guided protocol. We first demonstrate the performance of this protocol using 44 complexes from the protein-DNA docking benchmark, which shows an overall ~6-fold speed increase and maintains similar accuracy as compared to standard atomistic calculations. As a proof of concept, we then model the interaction between the PRC1 and the nucleosome (a former CAPRI target in round 31), using the same information available at the time the target was offered, and compare all-atom and coarse-grained models.
Collapse
Affiliation(s)
- Rodrigo V Honorato
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands.,Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Brazil
| | - Jorge Roel-Touris
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| | - Alexandre M J J Bonvin
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
9
|
Maffeo C, Chou HY, Aksimentiev A. Molecular Mechanisms of DNA Replication and Repair Machinery: Insights from Microscopic Simulations. ADVANCED THEORY AND SIMULATIONS 2019; 2:1800191. [PMID: 31728433 PMCID: PMC6855400 DOI: 10.1002/adts.201800191] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Indexed: 12/15/2022]
Abstract
Reproduction, the hallmark of biological activity, requires making an accurate copy of the genetic material to allow the progeny to inherit parental traits. In all living cells, the process of DNA replication is carried out by a concerted action of multiple protein species forming a loose protein-nucleic acid complex, the replisome. Proofreading and error correction generally accompany replication but also occur independently, safeguarding genetic information through all phases of the cell cycle. Advances in biochemical characterization of intracellular processes, proteomics and the advent of single-molecule biophysics have brought about a treasure trove of information awaiting to be assembled into an accurate mechanistic model of the DNA replication process. In this review, we describe recent efforts to model elements of DNA replication and repair processes using computer simulations, an approach that has gained immense popularity in many areas of molecular biophysics but has yet to become mainstream in the DNA metabolism community. We highlight the use of diverse computational methods to address specific problems of the fields and discuss unexplored possibilities that lie ahead for the computational approaches in these areas.
Collapse
Affiliation(s)
- Christopher Maffeo
- Department of Physics, Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign,1110 W Green St, Urbana, IL 61801, USA
| | - Han-Yi Chou
- Department of Physics, Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign,1110 W Green St, Urbana, IL 61801, USA
| | - Aleksei Aksimentiev
- Department of Physics, Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign,1110 W Green St, Urbana, IL 61801, USA
| |
Collapse
|
10
|
Pal A, Levy Y. Structure, stability and specificity of the binding of ssDNA and ssRNA with proteins. PLoS Comput Biol 2019; 15:e1006768. [PMID: 30933978 PMCID: PMC6467422 DOI: 10.1371/journal.pcbi.1006768] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 04/16/2019] [Accepted: 01/01/2019] [Indexed: 02/06/2023] Open
Abstract
Recognition of single-stranded DNA (ssDNA) or single-stranded RNA (ssRNA) is important for many fundamental cellular functions. A variety of single-stranded DNA-binding proteins (ssDBPs) and single-stranded RNA-binding proteins (ssRBPs) have evolved that bind ssDNA and ssRNA, respectively, with varying degree of affinities and specificities to form complexes. Structural studies of these complexes provide key insights into their recognition mechanism. However, computational modeling of the specific recognition process and to predict the structure of the complex is challenging, primarily due to the heterogeneity of their binding energy landscape and the greater flexibility of ssDNA or ssRNA compared with double-stranded nucleic acids. Consequently, considerably fewer computational studies have explored interactions between proteins and single-stranded nucleic acids compared with protein interactions with double-stranded nucleic acids. Here, we report a newly developed energy-based coarse-grained model to predict the structure of ssDNA–ssDBP and ssRNA–ssRBP complexes and to assess their sequence-specific interactions and stabilities. We tuned two factors that can modulate specific recognition: base–aromatic stacking strength and the flexibility of the single-stranded nucleic acid. The model was successfully applied to predict the binding conformations of 12 distinct ssDBP and ssRBP structures with their cognate ssDNA and ssRNA partners having various sequences. Estimated binding energies agreed well with the corresponding experimental binding affinities. Bound conformations from the simulation showed a funnel-shaped binding energy distribution where the native-like conformations corresponded to the energy minima. The various ssDNA–protein and ssRNA–protein complexes differed in the balance of electrostatic and aromatic energies. The lower affinity of the ssRNA–ssRBP complexes compared with the ssDNA–ssDBP complexes stems from lower flexibility of ssRNA compared to ssDNA, which results in higher rate constants for the dissociation of the complex (koff) for complexes involving the former. Quantifying bimolecular self-assembly is pivotal to understanding cellular function. In recent years, a large progress has been made in understanding the structure and biophysics of protein-protein interactions. Particularly, various computational tools are available for predicting these structures and to estimate their stability and the driving forces of their formation. The understating of the interactions between proteins and nucleic acids, however, is still limited, presumably due to the involvement of non-specific interactions as well as the high conformational plasticity that may demand an induced-fit mechanism. In particular, the interactions between proteins and single-stranded nucleic acids (i.e., single-stranded DNA and RNA) is very challenging due to their high flexibility. Furthermore, the interface between proteins and single-stranded nucleic acids is often chemically more heterogeneous than the interface between proteins and double-stranded DNA. In this study, we developed a coarse-grained computational model to predict the structure of complexes between proteins and single-stranded nucleic acids. The model was applied to estimate binding affinities and the estimated binding energies agreed well with the corresponding experimental binding affinities. The kinetics of association as well as the specificity of the complexes between proteins and ssDNA are different than those with ssRNA, mostly due to differences in their conformational flexibility.
Collapse
Affiliation(s)
- Arumay Pal
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Yaakov Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
11
|
Emamjomeh A, Choobineh D, Hajieghrari B, MahdiNezhad N, Khodavirdipour A. DNA-protein interaction: identification, prediction and data analysis. Mol Biol Rep 2019; 46:3571-3596. [PMID: 30915687 DOI: 10.1007/s11033-019-04763-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Accepted: 03/14/2019] [Indexed: 12/30/2022]
Abstract
Life in living organisms is dependent on specific and purposeful interaction between other molecules. Such purposeful interactions make the various processes inside the cells and the bodies of living organisms possible. DNA-protein interactions, among all the types of interactions between different molecules, are of considerable importance. Currently, with the development of numerous experimental techniques, diverse methods are convenient for recognition and investigating such interactions. While the traditional experimental techniques to identify DNA-protein complexes are time-consuming and are unsuitable for genome-scale studies, the current high throughput approaches are more efficient in determining such interaction at a large-scale, but they are clearly too costly to be practice for daily applications. Hence, according to the availability of much information related to different biological sequences and clearing different dimensions of conditions in which such interactions are formed, with the developments related to the computer, mathematics, and statistics motivate scientists to develop bioinformatics tools for prediction the interaction site(s). Until now, there has been much progress in this field. In this review, the factors and conditions governing the interaction and the laboratory techniques for examining such interactions are addressed. In addition, developed bioinformatics tools are introduced and compared for this reason and, in the end, several suggestions are offered for the promotion of such tools in prediction with much more precision.
Collapse
Affiliation(s)
- Abbasali Emamjomeh
- Laboratory of Computational Biotechnology and Bioinformatics (CBB), Department of Plant Breeding and Biotechnology (PBB), University of Zabol, Zabol, 98615-538, Iran.
| | - Darush Choobineh
- Agricultural Biotechnology, Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Behzad Hajieghrari
- Department of Agricultural Biotechnology, College of Agriculture, Jahrom University, Jahrom, 74135-111, Iran.
| | - Nafiseh MahdiNezhad
- Laboratory of Computational Biotechnology and Bioinformatics (CBB), Department of Plant Breeding and Biotechnology (PBB), University of Zabol, Zabol, 98615-538, Iran
| | - Amir Khodavirdipour
- Division of Human Genetics, Department of Anatomy, St. John's hospital, Bangalore, India
| |
Collapse
|
12
|
Corona RI, Sudarshan S, Aluru S, Guo JT. An SVM-based method for assessment of transcription factor-DNA complex models. BMC Bioinformatics 2018; 19:506. [PMID: 30577740 PMCID: PMC6302363 DOI: 10.1186/s12859-018-2538-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Background Atomic details of protein-DNA complexes can provide insightful information for better understanding of the function and binding specificity of DNA binding proteins. In addition to experimental methods for solving protein-DNA complex structures, protein-DNA docking can be used to predict native or near-native complex models. A docking program typically generates a large number of complex conformations and predicts the complex model(s) based on interaction energies between protein and DNA. However, the prediction accuracy is hampered by current approaches to model assessment, especially when docking simulations fail to produce any near-native models. Results We present here a Support Vector Machine (SVM)-based approach for quality assessment of the predicted transcription factor (TF)-DNA complex models. Besides a knowledge-based protein-DNA interaction potential DDNA3, we applied several structural features that have been shown to play important roles in binding specificity between transcription factors and DNA molecules to quality assessment of complex models. To address the issue of unbalanced positive and negative cases in the training dataset, we applied hard-negative mining, an iterative training process that selects an initial training dataset by combining all of the positive cases and a random sample from the negative cases. Results show that the SVM model greatly improves prediction accuracy (84.2%) over two knowledge-based protein-DNA interaction potentials, orientation potential (60.8%) and DDNA3 (68.4%). The improvement is achieved through reducing the number of false positive predictions, especially for the hard docking cases, in which a docking algorithm fails to produce any near-native complex models. Conclusions A learning-based SVM scoring model with structural features for specific protein-DNA binding and an atomic-level protein-DNA interaction potential DDNA3 significantly improves prediction accuracy of complex models by successfully identifying cases without near-native structural models. Electronic supplementary material The online version of this article (10.1186/s12859-018-2538-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rosario I Corona
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223, USA
| | - Sanjana Sudarshan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223, USA
| | - Srinivas Aluru
- School of Computational Science and Engineering, Georgia Institute of Technology, 266 Ferst Drive, Atlanta, GA, 30332, USA
| | - Jun-Tao Guo
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC, 28223, USA.
| |
Collapse
|
13
|
Kinghorn AB, Fraser LA, Liang S, Shiu SCC, Tanner JA. Aptamer Bioinformatics. Int J Mol Sci 2017; 18:E2516. [PMID: 29186809 PMCID: PMC5751119 DOI: 10.3390/ijms18122516] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Revised: 11/17/2017] [Accepted: 11/20/2017] [Indexed: 02/07/2023] Open
Abstract
Aptamers are short nucleic acid sequences capable of specific, high-affinity molecular binding. They are isolated via SELEX (Systematic Evolution of Ligands by Exponential Enrichment), an evolutionary process that involves iterative rounds of selection and amplification before sequencing and aptamer characterization. As aptamers are genetic in nature, bioinformatic approaches have been used to improve both aptamers and their selection. This review will discuss the advancements made in several enclaves of aptamer bioinformatics, including simulation of aptamer selection, fragment-based aptamer design, patterning of libraries, identification of lead aptamers from high-throughput sequencing (HTS) data and in silico aptamer optimization.
Collapse
Affiliation(s)
| | | | | | | | - Julian A. Tanner
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR China; (A.B.K.); (L.A.F.); (S.L.); (S.C.-C.S.)
| |
Collapse
|
14
|
Hsieh PC, Lin HT, Chen WY, Tsai JJP, Hu WP. The Combination of Computational and Biosensing Technologies for Selecting Aptamer against Prostate Specific Antigen. BIOMED RESEARCH INTERNATIONAL 2017; 2017:5041683. [PMID: 28459059 PMCID: PMC5387809 DOI: 10.1155/2017/5041683] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2016] [Revised: 03/10/2017] [Accepted: 03/19/2017] [Indexed: 01/19/2023]
Abstract
Herein, we report a method of combining bioinformatics and biosensing technologies to select aptamers against prostate specific antigen (PSA). The main objective of this study is to select DNA aptamers with higher binding affinity for PSA by using the proposed method. Based on the five known sequences of PSA-binding aptamers, we adopted the functions of reproduction and crossover in the genetic algorithm to produce next-generation sequences for the computational and experimental analysis. RNAfold web server was utilized to analyze the secondary structures, and the 3-dimensional molecular models of aptamer sequences were generated by using RNAComposer web server. ZRANK scoring function was used to rerank the docking predictions from ZDOCK. The biosensors, the quartz crystal microbalance (QCM) and a surface plasmon resonance (SPR) instrument, were used to verify the binding ability of selected aptamer for PSA. By carrying out the simulations and experiments after two generations, we obtain one aptamer that can have the highest binding affinity with PSA, which generates almost 2-fold and 3-fold greater measured signals than the responses produced by the best known DNA sequence in the QCM and SPR experiments, respectively.
Collapse
Affiliation(s)
- Pi-Chou Hsieh
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung City 41354, Taiwan
| | - Hui-Ting Lin
- Department of Physical Therapy, I-Shou University, Kaohsiung City 82445, Taiwan
| | - Wen-Yih Chen
- Department of Chemical and Materials Engineering, National Central University, Jhongli 32001, Taiwan
| | - Jeffrey J. P. Tsai
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung City 41354, Taiwan
| | - Wen-Pin Hu
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung City 41354, Taiwan
- Department of Medical Laboratory Science and Biotechnology, China Medical University, Taichung City 40402, Taiwan
| |
Collapse
|
15
|
Application of the ATTRACT Coarse-Grained Docking and Atomistic Refinement for Predicting Peptide-Protein Interactions. Methods Mol Biol 2017. [PMID: 28236233 DOI: 10.1007/978-1-4939-6798-8_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
Peptide-protein interactions are abundant in the cell and form an important part of the interactome. Large-scale modeling of peptide-protein complexes requires a fully blind approach; i.e., simultaneously predicting the peptide-binding site and the peptide conformation to high accuracy. Here, we present one of the first fully blind peptide-protein docking protocols, pepATTRACT. It combines a coarse-grained ensemble docking search of the entire protein surface with two stages of atomistic flexible refinement. pepATTRACT yields high-quality predictions for 70 % of the cases when tested on a large benchmark of peptide-protein complexes. This performance in fully blind mode is similar to state-of-the-art local docking approaches that use information on the location of the binding site. Limiting the search to the peptide-binding region, the resulting pepATTRACT-local approach further improves the performance. Docking scripts for pepATTRACT and pepATTRACT-local can be generated via a web interface at www.attract.ph.tum.de/peptide.html . Here, we explain how to set up a docking run with the pepATTRACT web interface and demonstrate its usage by an application on binding of disordered regions from tumor suppressor p53 to a partner protein.
Collapse
|
16
|
Sasse A, de Vries SJ, Schindler CEM, de Beauchêne IC, Zacharias M. Rapid Design of Knowledge-Based Scoring Potentials for Enrichment of Near-Native Geometries in Protein-Protein Docking. PLoS One 2017; 12:e0170625. [PMID: 28118389 PMCID: PMC5261736 DOI: 10.1371/journal.pone.0170625] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 01/07/2017] [Indexed: 01/15/2023] Open
Abstract
Protein-protein docking protocols aim to predict the structures of protein-protein complexes based on the structure of individual partners. Docking protocols usually include several steps of sampling, clustering, refinement and re-scoring. The scoring step is one of the bottlenecks in the performance of many state-of-the-art protocols. The performance of scoring functions depends on the quality of the generated structures and its coupling to the sampling algorithm. A tool kit, GRADSCOPT (GRid Accelerated Directly SCoring OPTimizing), was designed to allow rapid development and optimization of different knowledge-based scoring potentials for specific objectives in protein-protein docking. Different atomistic and coarse-grained potentials can be created by a grid-accelerated directly scoring dependent Monte-Carlo annealing or by a linear regression optimization. We demonstrate that the scoring functions generated by our approach are similar to or even outperform state-of-the-art scoring functions for predicting near-native solutions. Of additional importance, we find that potentials specifically trained to identify the native bound complex perform rather poorly on identifying acceptable or medium quality (near-native) solutions. In contrast, atomistic long-range contact potentials can increase the average fraction of near-native poses by up to a factor 2.5 in the best scored 1% decoys (compared to existing scoring), emphasizing the need of specific docking potentials for different steps in the docking protocol.
Collapse
Affiliation(s)
- Alexander Sasse
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
| | - Sjoerd J. de Vries
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
| | | | | | - Martin Zacharias
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
- * E-mail:
| |
Collapse
|
17
|
Zhang Q, Wang C, Wan M, Wu Y, Ma Q. Streptococcus pneumoniae Genome-wide Identification and Characterization of BOX Element-binding Domains. Mol Inform 2016; 34:742-52. [PMID: 27491035 DOI: 10.1002/minf.201500044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2015] [Indexed: 11/11/2022]
Abstract
The BOX elements are short repetitive DNA sequences that distribute randomly in intergenic regions of the Streptococcus pneumoniae genome. The function and origin of such elements are still unknown, but they were found to modulate expression of neighboring genes. Evidences suggested that the modulation's mechanism can be fulfilled by sequence-specific interaction of BOX elements with transcription factor family proteins. However, the type and function of these BOX-binding proteins still remain largely unexplored to date. In the current study we described a synthetic protocol to investigate the recognition and interaction between a highly conserved site of BOX elements and the DNA-binding domains of a variety of putative transcription factors in the pneumococcal genome. With the protocol we were able to predict those high-affinity domain binders of the conserved BOX DNA site (BOX DNA) in a high-throughput manner, and analyzed sequence-specific interaction in the domainDNA recognition at molecular level. Consequently, a number of putative transcription factor domains with both high affinity and specificity for the BOX DNA were identified, from which the helix-turn-helix (HTH) motif of a small heat shock factor was selected as a case study and tested for its binding capability toward the double-stranded BOX DNA using fluorescence anisotropy analysis. As might be expected, a relatively high affinity was detected for the interaction of HTH motif with BOX DNA with dissociation constant at nanomolar level. Molecular dynamics simulation, atomic structure examination and binding energy analysis revealed a complicated network of intensive nonbonded interactions across the complex interface, which confers both stability and specificity for the complex architecture.
Collapse
Affiliation(s)
- Qiao Zhang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Changzheng Wang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Min Wan
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Yin Wu
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Qianli Ma
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| |
Collapse
|
18
|
Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A. Coarse-Grained Protein Models and Their Applications. Chem Rev 2016; 116:7898-936. [DOI: 10.1021/acs.chemrev.6b00163] [Citation(s) in RCA: 555] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Sebastian Kmiecik
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Dominik Gront
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Michal Kolinski
- Bioinformatics
Laboratory, Mossakowski Medical Research Center of the Polish Academy of Sciences, Pawinskiego 5, 02-106 Warsaw, Poland
| | - Lukasz Wieteska
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
- Department
of Medical Biochemistry, Medical University of Lodz, Mazowiecka 6/8, 92-215 Lodz, Poland
| | | | - Andrzej Kolinski
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
19
|
de Vries SJ, Schindler CEM, Chauvot de Beauchêne I, Zacharias M. A web interface for easy flexible protein-protein docking with ATTRACT. Biophys J 2015; 108:462-5. [PMID: 25650913 DOI: 10.1016/j.bpj.2014.12.015] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Revised: 12/06/2014] [Accepted: 12/10/2014] [Indexed: 01/03/2023] Open
Abstract
Protein-protein docking programs can give valuable insights into the structure of protein complexes in the absence of an experimental complex structure. Web interfaces can facilitate the use of docking programs by structural biologists. Here, we present an easy web interface for protein-protein docking with the ATTRACT program. While aimed at nonexpert users, the web interface still covers a considerable range of docking applications. The web interface supports systematic rigid-body protein docking with the ATTRACT coarse-grained force field, as well as various kinds of protein flexibility. The execution of a docking protocol takes up to a few hours on a standard desktop computer.
Collapse
Affiliation(s)
- Sjoerd J de Vries
- Physics Department, Technische Universität München, Garching, Germany.
| | | | | | - Martin Zacharias
- Physics Department, Technische Universität München, Garching, Germany
| |
Collapse
|
20
|
Barik A, Nithin C, Karampudi NBR, Mukherjee S, Bahadur RP. Probing binding hot spots at protein-RNA recognition sites. Nucleic Acids Res 2015; 44:e9. [PMID: 26365245 PMCID: PMC4737170 DOI: 10.1093/nar/gkv876] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 08/23/2015] [Indexed: 01/30/2023] Open
Abstract
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein–RNA interfaces to probe the binding hot spots at protein–RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein–protein and protein–RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein–RNA recognition sites with desired affinity.
Collapse
Affiliation(s)
- Amita Barik
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Chandran Nithin
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | | | - Sunandan Mukherjee
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India Advanced Technology Development Centre, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| |
Collapse
|
21
|
Liu L, Heermann DW. The interaction of DNA with multi-Cys2His2 zinc finger proteins. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2015; 27:064107. [PMID: 25563438 DOI: 10.1088/0953-8984/27/6/064107] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The multi-Cys2His2 (mC2H2) zinc finger protein, like CTCF, plays a central role in the three-dimensional organization of chromatin and gene regulation. The interaction between DNA and mC2H2 zinc finger proteins becomes crucial to better understand how CTCF dynamically shapes the chromatin structure. Here, we study a coarse-grained model of the mC2H2 zinc finger proteins in complexes with DNA, and in particular, we study how a mC2H2 zinc finger protein binds to and searches for its target DNA loci. On the basis of coarse-grained molecular dynamics simulations, we present several interesting kinetic conformational properties of the proteins, such as the rotation-coupled sliding, the asymmetrical roles of different zinc fingers and the partial binding partial dangling mode. In addition, two kinds of studied mC2H2 zinc finger proteins, of CG-rich and AT-rich binding motif each, were able to recognize their target sites and slide away from their non-target sites, which shows a proper sequence specificity in our model and the derived force field for mC2H2-DNA interaction. A further application to CTCF shows that the protein binds to a specific DNA duplex only with its central zinc fingers. The zinc finger domains of CTCF asymmetrically bend the DNA, but do not form a DNA loop alone in our simulations.
Collapse
Affiliation(s)
- Lei Liu
- Institute for Theoretical Physics, Heidelberg University, 69117 Heidelberg, Germany
| | | |
Collapse
|
22
|
Barik A, C N, Pilla SP, Bahadur RP. Molecular architecture of protein-RNA recognition sites. J Biomol Struct Dyn 2015; 33:2738-51. [PMID: 25562181 DOI: 10.1080/07391102.2015.1004652] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
The molecular architecture of protein-RNA interfaces are analyzed using a non-redundant dataset of 152 protein-RNA complexes. We find that an average protein-RNA interface is smaller than an average protein-DNA interface but larger than an average protein-protein interface. Among the different classes of protein-RNA complexes, interfaces with tRNA are the largest, while the interfaces with the single-stranded RNA are the smallest. Significantly, RNA contributes more to the interface area than its partner protein. Moreover, unlike protein-protein interfaces where the side chain contributes less to the interface area compared to the main chain, the main chain and side chain contributions flipped in protein-RNA interfaces. We find that the protein surface in contact with the RNA in protein-RNA complexes is better packed than that in contact with the DNA in protein-DNA complexes, but loosely packed than that in contact with the protein in protein-protein complexes. Shape complementarity and electrostatic potential are the two major factors that determine the specificity of the protein-RNA interaction. We find that the H-bond density at the protein-RNA interfaces is similar with that of protein-DNA interfaces but higher than the protein-protein interfaces. Unlike protein-DNA interfaces where the deoxyribose has little role in intermolecular H-bonds, due to the presence of an oxygen atom at the 2' position, the ribose in RNA plays significant role in protein-RNA H-bonds. We find that besides H-bonds, salt bridges and stacking interactions also play significant role in stabilizing protein-nucleic acids interfaces; however, their contribution at the protein-protein interfaces is insignificant.
Collapse
Affiliation(s)
- Amita Barik
- a Computational Structural Biology Laboratory, Department of Biotechnology , Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Nithin C
- a Computational Structural Biology Laboratory, Department of Biotechnology , Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Smita P Pilla
- a Computational Structural Biology Laboratory, Department of Biotechnology , Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Ranjit Prasad Bahadur
- a Computational Structural Biology Laboratory, Department of Biotechnology , Indian Institute of Technology Kharagpur , Kharagpur , India
| |
Collapse
|
23
|
Abstract
We investigate the role of water molecules in 89 protein–RNA complexes taken from the Protein Data Bank. Those with tRNA and single-stranded RNA are less hydrated than with duplex or ribosomal proteins. Protein–RNA interfaces are hydrated less than protein–DNA interfaces, but more than protein–protein interfaces. Majority of the waters at protein–RNA interfaces makes multiple H-bonds; however, a fraction do not make any. Those making H-bonds have preferences for the polar groups of RNA than its partner protein. The spatial distribution of waters makes interfaces with ribosomal proteins and single-stranded RNA relatively ‘dry’ than interfaces with tRNA and duplex RNA. In contrast to protein–DNA interfaces, mainly due to the presence of the 2′OH, the ribose in protein–RNA interfaces is hydrated more than the phosphate or the bases. The minor groove in protein–RNA interfaces is hydrated more than the major groove, while in protein–DNA interfaces it is reverse. The strands make the highest number of water-mediated H-bonds per unit interface area followed by the helices and the non-regular structures. The preserved waters at protein–RNA interfaces make higher number of H-bonds than the other waters. Preserved waters contribute toward the affinity in protein–RNA recognition and should be carefully treated while engineering protein–RNA interfaces.
Collapse
Affiliation(s)
- Amita Barik
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Ranjit Prasad Bahadur
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| |
Collapse
|
24
|
Abstract
By focusing on essential features, while averaging over less important details, coarse-grained (CG) models provide significant computational and conceptual advantages with respect to more detailed models. Consequently, despite dramatic advances in computational methodologies and resources, CG models enjoy surging popularity and are becoming increasingly equal partners to atomically detailed models. This perspective surveys the rapidly developing landscape of CG models for biomolecular systems. In particular, this review seeks to provide a balanced, coherent, and unified presentation of several distinct approaches for developing CG models, including top-down, network-based, native-centric, knowledge-based, and bottom-up modeling strategies. The review summarizes their basic philosophies, theoretical foundations, typical applications, and recent developments. Additionally, the review identifies fundamental inter-relationships among the diverse approaches and discusses outstanding challenges in the field. When carefully applied and assessed, current CG models provide highly efficient means for investigating the biological consequences of basic physicochemical principles. Moreover, rigorous bottom-up approaches hold great promise for further improving the accuracy and scope of CG models for biomolecular systems.
Collapse
Affiliation(s)
- W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
25
|
Setny P, Zacharias M. Elastic Network Models of Nucleic Acids Flexibility. J Chem Theory Comput 2013; 9:5460-70. [PMID: 26592282 DOI: 10.1021/ct400814n] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Elastic network models (ENMs) are a useful tool for describing large scale motions in protein systems. While they are well validated in the context of proteins, relatively little is known about their applicability to nucleic acids, whose different architecture does not necessarily warrant comparable performance. In this study we thoroughly evaluate and optimize the efficiency of popular ENMs for capturing RNA and DNA flexibility. We also introduce two alternative models in which the strength of elastic connections at a coarse-grained level is governed by distance distribution at atomic resolution. For each of the considered ENMs we report the optimal length of spring connections as well as the scaling of elastic force constants that provides the best agreement of vibrational frequencies with normal modes based on atomic force field. In order to determine the absolute values of force constants we introduce a novel method based on the overlap of pseudoinverse of Hessian matrices.
Collapse
Affiliation(s)
- Piotr Setny
- Centre for New Technologies, University of Warsaw , 00-927 Warsaw, Poland
| | - Martin Zacharias
- Physics Department T38, Technical University Munich , 85748 Garching, Germany
| |
Collapse
|
26
|
Sokkar P, Choi SM, Rhee YM. Simple Method for Simulating the Mixture of Atomistic and Coarse-Grained Molecular Systems. J Chem Theory Comput 2013; 9:3728-39. [DOI: 10.1021/ct400091a] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Pandian Sokkar
- Center for
Self-assembly and Complexity, Institute for Basic Science (IBS), Pohang 790-784, Korea
| | - Sun Mi Choi
- Center for
Self-assembly and Complexity, Institute for Basic Science (IBS), Pohang 790-784, Korea
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang 790-784,
Korea
| | - Young Min Rhee
- Center for
Self-assembly and Complexity, Institute for Basic Science (IBS), Pohang 790-784, Korea
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang 790-784,
Korea
| |
Collapse
|
27
|
Do TN, Carloni P, Varani G, Bussi G. RNA/Peptide Binding Driven by Electrostatics-Insight from Bidirectional Pulling Simulations. J Chem Theory Comput 2013; 9:1720-30. [PMID: 26587630 DOI: 10.1021/ct3009914] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
RNA/protein interactions play crucial roles in controlling gene expression. They are becoming important targets for pharmaceutical applications. Due to RNA flexibility and to the strength of electrostatic interactions, standard docking methods are insufficient. We here present a computational method which allows studying the binding of RNA molecules and charged peptides with atomistic, explicit-solvent molecular dynamics. In our method, a suitable estimate of the electrostatic interaction is used as an order parameter (collective variable) which is then accelerated using bidirectional pulling simulations. Since the electrostatic interaction is only used to enhance the sampling, the approximations used to compute it do not affect the final accuracy. The method is employed to characterize the binding of TAR RNA from HIV-1 and a small cyclic peptide. Our simulation protocol allows blindly predicting the binding pocket and pose as well as the binding affinity. The method is general and could be applied to study other electrostatics-driven binding events.
Collapse
Affiliation(s)
- Trang N Do
- SISSA/ISAS - International School for Advanced Studies, Trieste 34136, Italy
| | - Paolo Carloni
- Computational Biophysics, German Research School for Simulation Sciences, D-52425 Jülich, Germany and Institute for Advanced Simulation IAS-5, Computational Biomedicine, Forschungszentrum Jülich, D-52425 Jülich, Germany
| | - Gabriele Varani
- Department of Chemistry and Department of Biochemistry, University of Washington, Seattle, Washington 98195-1700, United States
| | - Giovanni Bussi
- SISSA/ISAS - International School for Advanced Studies, Trieste 34136, Italy
| |
Collapse
|