1
|
Kumar M, Rathore RS. Disallowed spots in protein structures. Biochim Biophys Acta Gen Subj 2023; 1867:130493. [PMID: 37865175 DOI: 10.1016/j.bbagen.2023.130493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 09/26/2023] [Accepted: 10/17/2023] [Indexed: 10/23/2023]
Abstract
Ramachandran (ϕ, ψ) steric map was introduced in 1963 to describe available conformation space for protein structures. Subsequently, residues were observed in high-energy disallowed regions of the map. To unequivocally identify the locations of disallowed conformations of residues, we got 36 noise-free protein structures (resolution ≤1 Å, Rwork/Rfree ≤ 0.10). These stringent criteria were applied to rule out data or model errors or any crystallographic disorders. No disallowed conformation was found in the dataset. Further, we also examined disallowed conformations in a larger dataset (resolution ≤1.5 Å, devoid of any model errors, or disorders). The observed locations of disallowed residues are referred as disallowed spots. These spots include short loops of 3-5 residues, and locations where residues participate in disulfide bonding or intramolecular interactions or inter-molecular interactions with neighboring water, metals or ligands. Conformational sampling revealed that short loops in between secondary structures hardly have any opportunity to relieve from conformational strain. Residues involved in interactions, which provide energetic compensation for high-energy conformational states, were relieved from strain once the causative interaction was removed. The present study aims to identify disallowed spots in the native state of proteins, wherein residues are forced to be trapped in high-energy disallowed conformations. Moreover, it was also observed that pre-Pro, Ser, Asp, trans-Pro, Val, Asn & Gly have higher tendency to occur in disallowed conformation, which could be attributed to factors such as conformational restrictions, residue propensity of secondary structures and compensating sidechain and mainchain interactions, stabilizing turn-mimics.
Collapse
Affiliation(s)
- Mayank Kumar
- Department of Bioinformatics, School of Earth, Biological and Environmental Sciences, Central University of South Bihar, Gaya, Bihar 824236, India
| | - R S Rathore
- Department of Bioinformatics, School of Earth, Biological and Environmental Sciences, Central University of South Bihar, Gaya, Bihar 824236, India.
| |
Collapse
|
2
|
Corbella M, Pinto GP, Kamerlin SCL. Loop dynamics and the evolution of enzyme activity. Nat Rev Chem 2023; 7:536-547. [PMID: 37225920 DOI: 10.1038/s41570-023-00495-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/06/2023] [Indexed: 05/26/2023]
Abstract
In the early 2000s, Tawfik presented his 'New View' on enzyme evolution, highlighting the role of conformational plasticity in expanding the functional diversity of limited repertoires of sequences. This view is gaining increasing traction with increasing evidence of the importance of conformational dynamics in both natural and laboratory evolution of enzymes. The past years have seen several elegant examples of harnessing conformational (particularly loop) dynamics to successfully manipulate protein function. This Review revisits flexible loops as critical participants in regulating enzyme activity. We showcase several systems of particular interest: triosephosphate isomerase barrel proteins, protein tyrosine phosphatases and β-lactamases, while briefly discussing other systems in which loop dynamics are important for selectivity and turnover. We then discuss the implications for engineering, presenting examples of successful loop manipulation in either improving catalytic efficiency, or changing selectivity completely. Overall, it is becoming clearer that mimicking nature by manipulating the conformational dynamics of key protein loops is a powerful method of tailoring enzyme activity, without needing to target active-site residues.
Collapse
Affiliation(s)
- Marina Corbella
- Department of Chemistry, Uppsala University, Uppsala, Sweden
| | - Gaspar P Pinto
- Department of Chemistry, Uppsala University, Uppsala, Sweden
- Cortex Discovery GmbH, Regensburg, Germany
| | - Shina C L Kamerlin
- Department of Chemistry, Uppsala University, Uppsala, Sweden.
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
3
|
Gonzalez B, Tare A, Ryu S, Johnson SC, Atzmon G, Barzilai N, Kaeberlein M, Suh Y. High-throughput sequencing analysis of nuclear-encoded mitochondrial genes reveals a genetic signature of human longevity. GeroScience 2023; 45:311-330. [PMID: 35948858 PMCID: PMC9886794 DOI: 10.1007/s11357-022-00634-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 07/28/2022] [Indexed: 02/03/2023] Open
Abstract
Mitochondrial dysfunction is a well-known contributor to aging and age-related diseases. The precise mechanisms through which mitochondria impact human lifespan, however, remain unclear. We hypothesize that humans with exceptional longevity harbor rare variants in nuclear-encoded mitochondrial genes (mitonuclear genes) that confer resistance against age-related mitochondrial dysfunction. Here we report an integrated functional genomics study to identify rare functional variants in ~ 660 mitonuclear candidate genes discovered by target capture sequencing analysis of 496 centenarians and 572 controls of Ashkenazi Jewish descent. We identify and prioritize longevity-associated variants, genes, and mitochondrial pathways that are enriched with rare variants. We provide functional gene variants such as those in MTOR (Y2396Lfs*29), CPS1 (T1406N), and MFN2 (G548*) as well as LRPPRC (S1378G) that is predicted to affect mitochondrial translation. Taken together, our results suggest a functional role for specific mitonuclear genes and pathways in human longevity.
Collapse
Affiliation(s)
- Brenda Gonzalez
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Archana Tare
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Seungjin Ryu
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
- Department of Pharmacology, College of Medicine, Hallym University, Chuncheon, Gangwon, 24252, Republic of Korea
| | - Simon C Johnson
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Gil Atzmon
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
- Department of Medicine, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
- Department of Biology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | - Nir Barzilai
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
- Department of Medicine, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Matt Kaeberlein
- Department of Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Yousin Suh
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.
- Department of Medicine, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.
- Departments of Obstetrics and Gynecology, and Genetics and Development, Columbia University, 630 West 168th Street, New York, NY, 10032, USA.
| |
Collapse
|
4
|
Mitusińska K, Skalski T, Góra A. Simple Selection Procedure to Distinguish between Static and Flexible Loops. Int J Mol Sci 2020; 21:ijms21072293. [PMID: 32225102 PMCID: PMC7177474 DOI: 10.3390/ijms21072293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 03/22/2020] [Accepted: 03/24/2020] [Indexed: 12/02/2022] Open
Abstract
Loops are the most variable and unorganized elements of the secondary structure of proteins. Their ability to shift their shape can play a role in the binding of small ligands, enzymatic catalysis, or protein–protein interactions. Due to the loop flexibility, the positions of their residues in solved structures show the largest B-factors, or in a worst-case scenario can be unknown. Based on the loops’ movements’ timeline, they can be divided into slow (static) and fast (flexible). Although most of the loops that are missing in experimental structures belong to the flexible loops group, the computational tools for loop reconstruction use a set of static loop conformations to predict the missing part of the structure and evaluate the model. We believe that these two loop types can adopt different conformations and that using scoring functions appropriate for static loops is not sufficient for flexible loops. We showed that common model evaluation methods, are insufficient in the case of flexible solvent-exposed loops. Instead, we recommend using the potential energy to evaluate such loop models. We provide a novel model selection method based on a set of geometrical parameters to distinguish between flexible and static loops without the use of molecular dynamics simulations. We have also pointed out the importance of water network and interactions with the solvent for the flexible loop modeling.
Collapse
Affiliation(s)
- Karolina Mitusińska
- Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
| | - Tomasz Skalski
- Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
| | - Artur Góra
- Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland;
- Correspondence: ; Tel.: +48-322371659
| |
Collapse
|
5
|
SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information. PLoS One 2018; 13:e0198854. [PMID: 29975698 PMCID: PMC6033379 DOI: 10.1371/journal.pone.0198854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 05/25/2018] [Indexed: 11/19/2022] Open
Abstract
In this paper, we describe SAFlex (Structural Alphabet Flexibility), an extension of an existing structural alphabet (HMM-SA), to better explore increasing protein three dimensional structure information by encoding conformations of proteins in case of missing residues or uncertainties. An SA aims to reduce three dimensional conformations of proteins as well as their analysis and comparison complexity by simplifying any conformation in a series of structural letters. Our methodology presents several novelties. Firstly, it can account for the encoding uncertainty by providing a wide range of encoding options: the maximum a posteriori, the marginal posterior distribution, and the effective number of letters at each given position. Secondly, our new algorithm deals with the missing data in the protein structure files (concerning more than 75% of the proteins from the Protein Data Bank) in a rigorous probabilistic framework. Thirdly, SAFlex is able to encode and to build a consensus encoding from different replicates of a single protein such as several homomer chains. This allows localizing structural differences between different chains and detecting structural variability, which is essential for protein flexibility identification. These improvements are illustrated on different proteins, such as the crystal structure of an eukaryotic small heat shock protein. They are promising to explore increasing protein redundancy data and obtain useful quantification of their flexibility.
Collapse
|
6
|
Triki D, Cano Contreras ME, Flatters D, Visseaux B, Descamps D, Camproux AC, Regad L. Analysis of the HIV-2 protease's adaptation to various ligands: characterization of backbone asymmetry using a structural alphabet. Sci Rep 2018; 8:710. [PMID: 29335428 PMCID: PMC5768731 DOI: 10.1038/s41598-017-18941-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 12/18/2017] [Indexed: 12/27/2022] Open
Abstract
The HIV-2 protease (PR2) is a homodimer of 99 residues with asymmetric assembly and binding various ligands. We propose an exhaustive study of the local structural asymmetry between the two monomers of all available PR2 structures complexed with various inhibitors using a structural alphabet approach. On average, PR2 exhibits asymmetry in 31% of its positions-i.e., exhibiting different backbone local conformations in the two monomers. This asymmetry was observed all along its structure, particularly in the elbow and flap regions. We first differentiated structural asymmetry conserved in most PR2 structures from the one specific to some PR2. Then, we explored the origin of the detected asymmetry in PR2. We localized asymmetry that could be induced by PR2's flexibility, allowing transition from the semi-open to closed conformations and the asymmetry potentially induced by ligand binding. This latter could be important for the PR2's adaptation to diverse ligands. Our results highlighted some differences between asymmetry of PR2 bound to darunavir and amprenavir that could explain their differences of affinity. This knowledge is critical for a better description of PR2's recognition and adaptation to various ligands and for a better understanding of the resistance of PR2 to most PR2 inhibitors, a major antiretroviral class.
Collapse
Affiliation(s)
- Dhoha Triki
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Mario Enrique Cano Contreras
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Delphine Flatters
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Benoit Visseaux
- IAME, INSERM UMR 1137, Laboratoire de Virologie, Hôpital Bichat, AP-HP, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Diane Descamps
- IAME, INSERM UMR 1137, Laboratoire de Virologie, Hôpital Bichat, AP-HP, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Anne-Claude Camproux
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France.,Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Leslie Regad
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France. .,Université Paris Diderot, Sorbonne Paris Cité, Paris, France.
| |
Collapse
|
7
|
Regad L, Chéron JB, Triki D, Senac C, Flatters D, Camproux AC. Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight. PLoS One 2017; 12:e0182972. [PMID: 28817602 PMCID: PMC5560695 DOI: 10.1371/journal.pone.0182972] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/27/2017] [Indexed: 11/18/2022] Open
Abstract
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.
Collapse
Affiliation(s)
- Leslie Regad
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| | - Jean-Baptiste Chéron
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Institut de Chimie de Nice, UMR-CNRS 7272, Faculté des Sciences, Université de Nice-Sophia Antipolis, Nice, France
| | - Dhoha Triki
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Caroline Senac
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, INSERM, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Delphine Flatters
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Anne-Claude Camproux
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| |
Collapse
|
8
|
Bartolowits M, Davisson VJ. Considerations of Protein Subpockets in Fragment-Based Drug Design. Chem Biol Drug Des 2015; 87:5-20. [PMID: 26307335 DOI: 10.1111/cbdd.12631] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While the fragment-based drug design approach continues to gain importance, gaps in the tools and methods available in the identification and accurate utilization of protein subpockets have limited the scope. The importance of these features of small molecule-protein recognition is highlighted with several examples. A generalized solution for the identification of subpockets and corresponding chemical fragments remains elusive, but there are numerous advancements in methods that can be used in combination to address subpockets. Finally, additional examples of approaches that consider the relative importance of small-molecule co-dependence of protein conformations are highlighted to emphasize an increased significance of subpockets, especially at protein interfaces.
Collapse
Affiliation(s)
- Matthew Bartolowits
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| | - V Jo Davisson
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| |
Collapse
|
9
|
Critical Role of a Loop at C-Terminal Domain on the Conformational Stability and Catalytic Efficiency of Chondroitinase ABC I. Mol Biotechnol 2015; 57:727-34. [DOI: 10.1007/s12033-015-9864-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
10
|
Bonet J, Fiser A, Oliva B, Fernandez-Fuentes N. Smotifs as structural local descriptors of supersecondary elements: classification, completeness and applications. BIO-ALGORITHMS AND MED-SYSTEMS 2014. [DOI: 10.1515/bams-2014-0016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractProtein structures are made up of periodic and aperiodic structural elements (i.e., α-helices, β-strands and loops). Despite the apparent lack of regular structure, loops have specific conformations and play a central role in the folding, dynamics, and function of proteins. In this article, we reviewed our previous works in the study of protein loops as local supersecondary structural motifs or Smotifs. We reexamined our works about the structural classification of loops (ArchDB) and its application to loop structure prediction (ArchPRED), including the assessment of the limits of knowledge-based loop structure prediction methods. We finalized this article by focusing on the modular nature of proteins and how the concept of Smotifs provides a convenient and practical approach to decompose proteins into strings of concatenated Smotifs and how can this be used in computational protein design and protein structure prediction.
Collapse
|
11
|
Bonet J, Planas-Iglesias J, Garcia-Garcia J, Marín-López MA, Fernandez-Fuentes N, Oliva B. ArchDB 2014: structural classification of loops in proteins. Nucleic Acids Res 2013; 42:D315-9. [PMID: 24265221 PMCID: PMC3964960 DOI: 10.1093/nar/gkt1189] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The function of a protein is determined by its three-dimensional structure, which is formed by regular (i.e. β-strands and α-helices) and non-periodic structural units such as loops. Compared to regular structural elements, non-periodic, non-repetitive conformational units enclose a much higher degree of variability—raising difficulties in the identification of regularities, and yet represent an important part of the structure of a protein. Indeed, loops often play a pivotal role in the function of a protein and different aspects of protein folding and dynamics. Therefore, the structural classification of protein loops is an important subject with clear applications in homology modelling, protein structure prediction, protein design (e.g. enzyme design and catalytic loops) and function prediction. ArchDB, the database presented here (freely available at http://sbi.imim.es/archdb), represents such a resource and has been an important asset for the scientific community throughout the years. In this article, we present a completely reworked and updated version of ArchDB. The new version of ArchDB features a novel, fast and user-friendly web-based interface, and a novel graph-based, computationally efficient, clustering algorithm. The current version of ArchDB classifies 149,134 loops in 5739 classes and 9608 subclasses.
Collapse
Affiliation(s)
- Jaume Bonet
- Structural Bioinformatics Lab (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Barcelona, Catalonia, 08950, Spain and Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, SY23 3DA Aberystwyth, Ceredigion, UK
| | | | | | | | | | | |
Collapse
|
12
|
Li Y. Conformational sampling in template-free protein loop structure modeling: an overview. Comput Struct Biotechnol J 2013; 5:e201302003. [PMID: 24688696 PMCID: PMC3962101 DOI: 10.5936/csbj.201302003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/23/2013] [Accepted: 01/28/2013] [Indexed: 01/04/2023] Open
Abstract
Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.
Collapse
Affiliation(s)
- Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| |
Collapse
|
13
|
Rorick M. Quantifying protein modularity and evolvability: a comparison of different techniques. Biosystems 2012; 110:22-33. [PMID: 22796584 DOI: 10.1016/j.biosystems.2012.06.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2011] [Revised: 06/20/2012] [Accepted: 06/27/2012] [Indexed: 10/28/2022]
Abstract
Modularity increases evolvability by reducing constraints on adaptation and by allowing preexisting parts to function in new contexts for novel uses. Protein evolution provides an excellent context to study the causes and consequences of biological modularity. In order to address such questions, however, an index for protein modularity is necessary. This paper proposes a simple index for protein modularity-"module density"-which is the number of evolutionarily independent modules that compose a protein divided by the number of amino acids in the protein. The decomposition of proteins into constituent modules can be accomplished by either of two classes of methods. The first class of methods relies on "suppositional" criteria to assign amino acids to modules, whereas the second class of methods relies on "coevolutionary" criteria for this task. One simple and practical method from the first class consists of approximating the number of modules in a protein as the number of regular secondary structure elements (i.e., helices and sheets). Methods based on coevolutionary criteria require more elaborate data, but they have the advantage of being able to specify modules without prior assumptions about why they exist. Given the increasing availability of datasets sampling protein mutational spectra (e.g., from comparative genomics, experimental evolution, and computational prediction), methods based on coevolutionary criteria will likely become more promising in the near future. The ability to meaningfully quantify protein modularity via simple indices has the potential to aid future efforts to understand protein evolutionary rate determinants, improve molecular evolution models and engineer novel proteins.
Collapse
Affiliation(s)
- Mary Rorick
- University of Michigan, Department of Ecology and Evolutionary Biology, Ann Arbor, MI 48109-1048, United States.
| |
Collapse
|
14
|
Novikova IV, Hennelly SP, Sanbonmatsu KY. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator. Nucleic Acids Res 2012; 40:5034-51. [PMID: 22362738 PMCID: PMC3367176 DOI: 10.1093/nar/gks071] [Citation(s) in RCA: 204] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features.
Collapse
Affiliation(s)
- Irina V Novikova
- Theoretical Biology and Biophysics, Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | | | | |
Collapse
|
15
|
Ku SY, Hu YJ. Structural alphabet motif discovery and a structural motif database. Comput Biol Med 2011; 42:93-105. [PMID: 22099701 DOI: 10.1016/j.compbiomed.2011.10.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 09/28/2011] [Accepted: 10/27/2011] [Indexed: 10/15/2022]
Abstract
This study proposes a general framework for structural motif discovery. The framework is based on a modular design in which the system components can be modified or replaced independently to increase its applicability to various studies. It is a two-stage approach that first converts protein 3D structures into structural alphabet sequences, and then applies a sequence motif-finding tool to these sequences to detect conserved motifs. We named the structural motif database we built the SA-Motifbase, which provides the structural information conserved at different hierarchical levels in SCOP. For each motif, SA-Motifbase presents its 3D view; alphabet letter preference; alphabet letter frequency distribution; and the significance. SA-Motifbase is available at http://bioinfo.cis.nctu.edu.tw/samotifbase/.
Collapse
Affiliation(s)
- Shih-Yen Ku
- Department of Computer Science, National Chiao Tung University, 1001 Tashuei Rd., Hsinchu, Taiwan
| | | |
Collapse
|
16
|
Regad L, Martin J, Camproux AC. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs. BMC Bioinformatics 2011; 12:247. [PMID: 21689388 PMCID: PMC3158783 DOI: 10.1186/1471-2105-12-247] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Accepted: 06/20/2011] [Indexed: 12/24/2022] Open
Abstract
Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
Collapse
|
17
|
Regad L, Saladin A, Maupetit J, Geneix C, Camproux AC. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops. Nucleic Acids Res 2011; 39:W203-9. [PMID: 21665924 PMCID: PMC3125790 DOI: 10.1093/nar/gkr410] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.
Collapse
Affiliation(s)
- Leslie Regad
- INSERM, U973, Université Paris 7-Paris Diderot, UMR-S973, MTi F-75013 Paris, France.
| | | | | | | | | |
Collapse
|
18
|
Rorick MM, Wagner GP. Protein structural modularity and robustness are associated with evolvability. Genome Biol Evol 2011; 3:456-75. [PMID: 21602570 PMCID: PMC3134980 DOI: 10.1093/gbe/evr046] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Theory suggests that biological modularity and robustness allow for maintenance of fitness under mutational change, and when this change is adaptive, for evolvability. Empirical demonstrations that these traits promote evolvability in nature remain scant however. This is in part because modularity, robustness, and evolvability are difficult to define and measure in real biological systems. Here, we address whether structural modularity and/or robustness confer evolvability at the level of proteins by looking for associations between indices of protein structural modularity, structural robustness, and evolvability. We propose a novel index for protein structural modularity: the number of regular secondary structure elements (helices and strands) divided by the number of residues in the structure. We index protein evolvability as the proportion of sites with evidence of being under positive selection multiplied by the average rate of adaptive evolution at these sites, and we measure this as an average over a phylogeny of 25 mammalian species. We use contact density as an index of protein designability, and thus, structural robustness. We find that protein evolvability is positively associated with structural modularity as well as structural robustness and that the effect of structural modularity on evolvability is independent of the structural robustness index. We interpret these associations to be the result of reduced constraints on amino acid substitutions in highly modular and robust protein structures, which results in faster adaptation through natural selection.
Collapse
Affiliation(s)
- Mary M Rorick
- Department of Genetics, Yale University, New Haven, Connecticut, USA.
| | | |
Collapse
|
19
|
Bastard K, Saladin A, Prévost C. Accounting for large amplitude protein deformation during in silico macromolecular docking. Int J Mol Sci 2011; 12:1316-33. [PMID: 21541061 PMCID: PMC3083708 DOI: 10.3390/ijms12021316] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2010] [Revised: 01/07/2011] [Accepted: 02/08/2011] [Indexed: 12/23/2022] Open
Abstract
Rapid progress of theoretical methods and computer calculation resources has turned in silico methods into a conceivable tool to predict the 3D structure of macromolecular assemblages, starting from the structure of their separate elements. Still, some classes of complexes represent a real challenge for macromolecular docking methods. In these complexes, protein parts like loops or domains undergo large amplitude deformations upon association, thus remodeling the surface accessible to the partner protein or DNA. We discuss the problems linked with managing such rearrangements in docking methods and we review strategies that are presently being explored, as well as their limitations and success.
Collapse
Affiliation(s)
- Karine Bastard
- LABIS, Genoscope, CEA, 2 rue Gaston Cremieux, F-91057 Evry Cedex, France; E-Mail:
| | - Adrien Saladin
- MTI, INSERM UMR-M 973, Paris Diderot-Paris 7 University, Bât Lamarck, 35 rue Hélène Brion, F-75205 Paris Cedex 13, France; E-Mail:
| | - Chantal Prévost
- LBT-UPR 9080 CNRS, IBPC, 13 rue Pierre et Marie Curie, F-75005 Paris, France
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +33-(0)1 58 41 51 71, Fax: +33-(0)1 58 415 026
| |
Collapse
|
20
|
Baussand J, Camproux AC. Deciphering the shape and deformation of secondary structures through local conformation analysis. BMC STRUCTURAL BIOLOGY 2011; 11:9. [PMID: 21284872 PMCID: PMC3224362 DOI: 10.1186/1472-6807-11-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2010] [Accepted: 02/01/2011] [Indexed: 12/30/2022]
Abstract
Background Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Results Using a structural alphabet, we translated the 3 D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. Conclusion The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons.
Collapse
Affiliation(s)
- Julie Baussand
- Molécules Thérapeutiques in silico, UMRS-973, Université Paris-Diderot Paris-7,36, rue Hélène Brion, 75013 Paris, France
| | | |
Collapse
|
21
|
Hirani TA, Tovar-Méndez A, Miernyk JA, Randall DD. Asp295 stabilizes the active-site loop structure of pyruvate dehydrogenase, facilitating phosphorylation of ser292 by pyruvate dehydrogenase-kinase. Enzyme Res 2011; 2011:939068. [PMID: 21318135 PMCID: PMC3034952 DOI: 10.4061/2011/939068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2010] [Accepted: 11/05/2010] [Indexed: 01/22/2023] Open
Abstract
We have developed an in vitro system for detailed analysis of reversible phosphorylation of the plant mitochondrial pyruvate dehydrogenase complex, comprising recombinant Arabidopsis thalianaα2β2-heterotetrameric pyruvate dehydrogenase (E1) plus A. thaliana E1-kinase (AtPDK). Upon addition of MgATP, Ser292, which is located within the active-site loop structure of E1α, is phosphorylated. In addition to Ser292, Asp295 and Gly297 are highly conserved in the E1α active-site loop sequences. Mutation of Asp295 to Ala, Asn, or Leu greatly reduced phosphorylation of Ser292, while mutation of Gly297 had relatively little effect. Quantitative two-hybrid analysis was used to show that mutation of Asp295 did not substantially affect binding of AtPDK to E1α. When using pyruvate as a variable substrate, the Asp295 mutant proteins had modest changes in kcat, Km, and kcat/Km values. Therefore, we propose that Asp295 plays an important role in stabilizing the active-site loop structure, facilitating transfer of the γ-phosphate from ATP to the Ser residue at regulatory site one of E1α.
Collapse
Affiliation(s)
- Tripty A Hirani
- Department of Biochemistry, University of Missouri, Columbia, MO 65211, USA
| | | | | | | |
Collapse
|
22
|
Abstract
Loop modeling is crucial for high-quality homology model construction outside conserved secondary structure elements. Dozens of loop modeling protocols involving a range of database and ab initio search algorithms and a variety of scoring functions have been proposed. Knowledge-based loop modeling methods are very fast and some can successfully and reliably predict loops up to about eight residues long. Several recent ab initio loop simulation methods can be used to construct accurate models of loops up to 12-13 residues long, albeit at a substantial computational cost. Major current challenges are the simulations of loops longer than 12-13 residues, the modeling of multiple interacting flexible loops, and the sensitivity of the loop predictions to the accuracy of the loop environment.
Collapse
|