1
|
βαβ Super-Secondary Motifs: Sequence, Structural Overview, and Pursuit of Potential Autonomously Folding βαβ Sequences from (β/α) 8/TIM Barrels. Methods Mol Biol 2019; 1958:221-236. [PMID: 30945221 DOI: 10.1007/978-1-4939-9161-7_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
βαβ super-secondary structures constitute the basic building blocks of (β/α)8 class of proteins. Despite the success in designing super-secondary structures, till date, there is not a single example of a natural βαβ sequence known to fold in isolation. In this chapter, to address the finding the "needles" in the haystack scenario, we have combined the sequence preferences and structural features of independent βαβ motifs, dictated by natural selection, with rationally derived parameters from a designed βαβ motif adopting stable fold in solution. Guided by this approach, a set of potential βαβ sequences from (β/α)8/TIM barrels are proposed as likely candidates for autonomously folding based on the assessment of their foldability.
Collapse
|
2
|
Stratmann D, Pathmanathan JS, Postic G, Rey J, Chomilier J. TEF 2.0: a graph-based method for decomposing protein structures into closed loops. J Biomol Struct Dyn 2018; 37:4140-4150. [PMID: 30585105 DOI: 10.1080/07391102.2018.1546230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Dirk Stratmann
- Sorbonne Université, UMR 7590 CNRS, MNHN, IRD, Institut de Minéralogie de Physique des Matériaux et de Cosmochimie (IMPMC) , Paris , France
| | - Jananan S Pathmanathan
- Sorbonne Université, UMR 7590 CNRS, MNHN, IRD, Institut de Minéralogie de Physique des Matériaux et de Cosmochimie (IMPMC) , Paris , France
| | - Guillaume Postic
- Sorbonne Université, UMR 7590 CNRS, MNHN, IRD, Institut de Minéralogie de Physique des Matériaux et de Cosmochimie (IMPMC) , Paris , France
| | - Julien Rey
- Sorbonne Paris Cité, Université Paris Diderot , Paris , France
| | - Jacques Chomilier
- Sorbonne Université, UMR 7590 CNRS, MNHN, IRD, Institut de Minéralogie de Physique des Matériaux et de Cosmochimie (IMPMC) , Paris , France
| |
Collapse
|
3
|
Rodriguez PM, Stratmann D, Duprat E, Papandreou N, Acuna R, Lacroix Z, Chomilier J. Correlating topology and thermodynamics to predict protein structure sensitivity to point mutations. BIO-ALGORITHMS AND MED-SYSTEMS 2018. [DOI: 10.1515/bams-2018-0026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractThe relation between distribution of hydrophobic amino acids along with protein chains and their structure is far from being completely understood. No reliable method allowsab initioprediction of the folded structure from this distribution of physicochemical properties, even when they are highly degenerated by considering only two classes: hydrophobic and polar. Establishment of long-range hydrophobic three dimension (3D) contacts is essential for the formation of the nucleus, a key process in the early steps of protein folding. Thus, a large number of 3D simulation studies were developed to challenge this issue. They are nowadays evaluated in a specific chapter of the molecular modeling competition, Critical Assessment of Protein Structure Prediction. We present here a simulation of the early steps of the folding process for 850 proteins, performed in a discrete 3D space, which results in peaks in the predicted distribution of intra-chain noncovalent contacts. The residues located at these peak positions tend to be buried in the core of the protein and are expected to correspond to critical positions in the sequence, important both for folding and structural (or similarly, energetic in the thermodynamic hypothesis) stability. The degree of stabilization or destabilization due to a point mutation at the critical positions involved in numerous contacts is estimated from the calculated folding free energy difference between mutated and native structures. The results show that these critical positions are not tolerant towards mutation. This simulation of the noncovalent contacts only needs a sequence as input, and this paper proposes a validation of the method by comparison with the prediction of stability by well-established programs.
Collapse
|
4
|
Berezovsky IN, Guarnera E, Zheng Z. Basic units of protein structure, folding, and function. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2016; 128:85-99. [PMID: 27697476 DOI: 10.1016/j.pbiomolbio.2016.09.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/05/2016] [Accepted: 09/26/2016] [Indexed: 10/20/2022]
Abstract
Study of the hierarchy of domain structure with alternative sets of domains and analysis of discontinuous domains, consisting of remote segments of the polypeptide chain, raised a question about the minimal structural unit of the protein domain. The hypothesis on the decisive role of the polypeptide backbone in determining the elementary units of globular proteins have led to the discovery of closed loops. It is reviewed here how closed loops form the loop-n-lock structure of proteins, providing the foundation for stability and designability of protein folds/domain and underlying their co-translational folding. Simplified protein sequences are considered here with the aim to explore the basic principles that presumably dominated the folding and stability of proteins in the early stages of structural evolution. Elementary functional loops (EFLs), closed loops with one or few catalytic residues, are, in turn, units of the protein function. They are apparent descendants of the prebiotic ring-like peptides, which gave rise to the first functional folds/domains being fused in the beginning of the evolution of protein structure. It is also shown how evolutionary relations between protein functional superfamilies and folds delineated with the help of EFLs can contribute to establishing the rules for design of desired enzymatic functions. Generalized descriptors of the elementary functions are proposed to be used as basic units in the future computational design.
Collapse
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Zejun Zheng
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| |
Collapse
|
5
|
Orevi T, Rahamim G, Shemesh S, Ben Ishay E, Amir D, Haas E. Fast closure of long loops at the initiation of the folding transition of globular proteins studied by time-resolved FRET-based methods. BIO-ALGORITHMS AND MED-SYSTEMS 2014. [DOI: 10.1515/bams-2014-0018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractThe protein folding problem would be considered “solved” when it will be possible to “read genes”, i.e., to predict the native fold of proteins, their dynamics, and the mechanism of fast folding based solely on sequence data. The long-term goal should be the creation of an algorithm that would simulate the stepwise mechanism of folding, which constrains the conformational space and in which random search for stable interactions is possible. Here, we focus attention on the initial phases of the folding transition starting with the compact disordered collapsed ensemble, in search of the initial sub-domain structural biases that direct the otherwise stochastic dynamics of the backbone. Our studies are designed to test the “loop hypothesis”, which suggests that fast closure of long loop structures by non-local interactions between clusters of mainly non-polar residues is an essential conformational step at the initiation of the folding transition of globular proteins. We developed and applied experimental methods based on time-resolved resonance excitation energy transfer (trFRET) measurements combined with fast mixing methods and studied the initial phases of the folding of
Collapse
|
6
|
Chintapalli SV, Illingworth CJR, Upton GJG, Sacquin-Mora S, Reeves PJ, Mohammedali HS, Reynolds CA. Assessing the effect of dynamics on the closed-loop protein-folding hypothesis. J R Soc Interface 2013; 11:20130935. [PMID: 24258160 PMCID: PMC3869168 DOI: 10.1098/rsif.2013.0935] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
The closed-loop (loop-n-lock) hypothesis of protein folding suggests that loops of about 25 residues, closed through interactions between the loop ends (locks), play an important role in protein structure. Coarse-grain elastic network simulations, and examination of loop lengths in a diverse set of proteins, each supports a bias towards loops of close to 25 residues in length between residues of high stability. Previous studies have established a correlation between total contact distance (TCD), a metric of sequence distances between contacting residues (cf. contact order), and the log-folding rate of a protein. In a set of 43 proteins, we identify an improved correlation (r2 = 0.76), when the metric is restricted to residues contacting the locks, compared to the equivalent result when all residues are considered (r2 = 0.65). This provides qualified support for the hypothesis, albeit with an increased emphasis upon the importance of a much larger set of residues surrounding the locks. Evidence of a similar-sized protein core/extended nucleus (with significant overlap) was obtained from TCD calculations in which residues were successively eliminated according to their hydrophobicity and connectivity, and from molecular dynamics simulations. Our results suggest that while folding is determined by a subset of residues that can be predicted by application of the closed-loop hypothesis, the original hypothesis is too simplistic; efficient protein folding is dependent on a considerably larger subset of residues than those involved in lock formation.
Collapse
Affiliation(s)
- Sree V Chintapalli
- School of Biological Sciences, University of Essex, , Wivenhoe Park, Colchester CO4 3SQ, UK
| | | | | | | | | | | | | |
Collapse
|
7
|
Orevi T, Rahamim G, Hazan G, Amir D, Haas E. The loop hypothesis: contribution of early formed specific non-local interactions to the determination of protein folding pathways. Biophys Rev 2013; 5:85-98. [PMID: 28510159 DOI: 10.1007/s12551-013-0113-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2013] [Accepted: 03/01/2013] [Indexed: 12/12/2022] Open
Abstract
The extremely fast and efficient folding transition (in seconds) of globular proteins led to the search for some unifying principles embedded in the physics of the folding polypeptides. Most of the proposed mechanisms highlight the role of local interactions that stabilize secondary structure elements or a folding nucleus as the starting point of the folding pathways, i.e., a "bottom-up" mechanism. Non-local interactions were assumed either to stabilize the nucleus or lead to the later steps of coalescence of the secondary structure elements. An alternative mechanism was proposed, an "up-down" mechanism in which it was assumed that folding starts with the formation of very few non-local interactions which form closed long loops at the initiation of folding. The possible biological advantage of this mechanism, the "loop hypothesis", is that the hydrophobic collapse is associated with ordered compactization which reduces the chance for degradation and misfolding. In the present review the experiments, simulations and theoretical consideration that either directly or indirectly support this mechanism are summarized. It is argued that experiments monitoring the time-dependent development of the formation of specifically targeted early-formed sub-domain structural elements, either long loops or secondary structure elements, are necessary. This can be achieved by the time-resolved FRET-based "double kinetics" method in combination with mutational studies. Yet, attempts to improve the time resolution of the folding initiation should be extended down to the sub-microsecond time regime in order to design experiments that would resolve the classes of proteins which first fold by local or non-local interactions.
Collapse
Affiliation(s)
- Tomer Orevi
- The Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel, 52900
| | - Gil Rahamim
- The Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel, 52900
| | - Gershon Hazan
- The Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel, 52900
| | - Dan Amir
- The Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel, 52900
| | - Elisha Haas
- The Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel, 52900.
| |
Collapse
|
8
|
Fernandez-Fuentes N, Fiser A. A modular perspective of protein structures: application to fragment based loop modeling. Methods Mol Biol 2013; 932:141-58. [PMID: 22987351 PMCID: PMC3635063 DOI: 10.1007/978-1-62703-065-6_9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Proteins can be decomposed into supersecondary structure modules. We used a generic definition of supersecondary structure elements, so-called Smotifs, which are composed of two flanking regular secondary structures connected by a loop, to explore the evolution and current variety of structure building blocks. Here, we discuss recent observations about the saturation of Smotif geometries in protein structures and how it opens new avenues in protein structure modeling and design. As a first application of these observations we describe our loop conformation modeling algorithm, ArchPred that takes advantage of Smotifs classification. In this application, instead of focusing on specific loop properties the method narrows down possible template conformations in other, often not homologous structures, by identifying the most likely supersecondary structure environment that cradles the loop. Beyond identifying the correct starting supersecondary structure geometry, it takes into account information of fit of anchor residues, sterical clashes, match of predicted and observed dihedral angle preferences, and local sequence signal.
Collapse
Affiliation(s)
- Narcis Fernandez-Fuentes
- Leeds Institute of Molecular Medicine, Section of Experimental Therapeutics, University of Leeds, St. James's University Hospital, Leeds LS9 7TF, UK
| | - Andras Fiser
- Department of Systems and Computational Biology, Department of Biochemistry Albert Einstein College of Medicine, 1301 Morris Park Ave, Bronx, NY 10461, USA
| |
Collapse
|
9
|
Piwowar M, Banach M, Konieczny L, Chomilier J, Roterman I. Structural role of exons in hemoglobin. BIO-ALGORITHMS AND MED-SYSTEMS 2013. [DOI: 10.1515/bams-2013-0009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
10
|
Sobolevsky Y, Guimarães RC, Trifonov EN. Towards functional repertoire of the earliest proteins. J Biomol Struct Dyn 2012; 31:1293-300. [PMID: 23140233 DOI: 10.1080/07391102.2012.735623] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The conserved protein sequence motifs present in all prokaryotic proteomes, "omnipresent motifs," presumably, correspond to the earliest proteins of the Last Universal Cellular Ancestor, from which all the proteomes have descended. Fifteen proteomes, each representing one of the total 15 diverse phyla of 131 Eubacteria and Archea, from which the omnipresent elements have been originally derived, are exhaustively screened. All those proteins which harbor the omnipresent motifs are identified. Six "omnipresent" protein types are revealed which are located in all 15 proteomes: ABC cassettes, FtsH proteases, translation initiation factors, translation elongation factors, isoleucyl-tRNA synthases, and RNA polymerases β'. In addition to the omnipresent motifs, these proteins also contain other highly conserved motifs, standing for additional modules of the proteins. Remarkably, the identified tentative earliest proteins are responsible for only three basic functions: supply of monomers (ABC transporters and proteases), protein synthesis (initiation and elongation factors, aminoacyl-tRNA synthases), and RNA synthesis (polymerases). No enzymes involved in metabolic activities are present in the list of the earliest proteins derived by this approach. Some of the omnipresent sequence motifs are found, indeed, in the metabolic enzymes (e.g. NTP binding motifs), but these enzymes do not make a sequence matching collection of 15 sequences, i.e. they are not omnipresent. Future analysis of less conserved sequence motifs may reveal at what degree of conservation (stage of evolution) the metabolic enzymes could have entered the scene.
Collapse
Affiliation(s)
- Y Sobolevsky
- a Faculdade de Gama , Universidade de Brasilia , Gama , 72405-610 , Brazil
| | | | | |
Collapse
|
11
|
Ben Ishay E, Rahamim G, Orevi T, Hazan G, Amir D, Haas E. Fast Subdomain Folding Prior to the Global Refolding Transition of E. coli Adenylate Kinase: A Double Kinetics Study. J Mol Biol 2012; 423:613-23. [DOI: 10.1016/j.jmb.2012.08.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2012] [Revised: 07/31/2012] [Accepted: 08/07/2012] [Indexed: 11/16/2022]
|
12
|
Goncearenco A, Berezovsky IN. Exploring the evolution of protein function in Archaea. BMC Evol Biol 2012; 12:75. [PMID: 22646318 PMCID: PMC3458885 DOI: 10.1186/1471-2148-12-75] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2011] [Accepted: 02/24/2012] [Indexed: 11/21/2022] Open
Abstract
Background Despite recent progress in studies of the evolution of protein function, the questions what were the first functional protein domains and what were their basic building blocks remain unresolved. Previously, we introduced the concept of elementary functional loops (EFLs), which are the functional units of enzymes that provide elementary reactions in biochemical transformations. They are presumably descendants of primordial catalytic peptides. Results We analyzed distant evolutionary connections between protein functions in Archaea based on the EFLs comprising them. We show examples of the involvement of EFLs in new functional domains, as well as reutilization of EFLs and functional domains in building multidomain structures and protein complexes. Conclusions Our analysis of the archaeal superkingdom yields the dominating mechanisms in different periods of protein evolution, which resulted in several levels of the organization of biochemical function. First, functional domains emerged as combinations of prebiotic peptides with the very basic functions, such as nucleotide/phosphate and metal cofactor binding. Second, domain recombination brought to the evolutionary scene the multidomain proteins and complexes. Later, reutilization and de novo design of functional domains and elementary functional loops complemented evolution of protein function.
Collapse
Affiliation(s)
- Alexander Goncearenco
- Computational Biology Unit, Uni Research, University of Bergen, N-5008 Bergen, Norway
| | | |
Collapse
|