1
|
Kargatov AM, Brazhnikov EV, Efimov AV. Structure and Features of Amino Acid Sequences of L-Modules in SH3-Like Folds. Mol Biol 2018. [DOI: 10.1134/s0026893318060092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
2
|
Efimov AV. Chirality and Handedness of Protein Structures. BIOCHEMISTRY (MOSCOW) 2018; 83:S103-S110. [DOI: 10.1134/s0006297918140092] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
3
|
Abstract
Modularity is known as one of the most important features of protein's robust and efficient design. The architecture and topology of proteins play a vital role by providing necessary robust scaffolds to support organism's growth and survival in constant evolutionary pressure. These complex biomolecules can be represented by several layers of modular architecture, but it is pivotal to understand and explore the smallest biologically relevant structural component. In the present study, we have developed a component-based method, using protein's secondary structures and their arrangements (i.e. patterns) in order to investigate its structural space. Our result on all-alpha protein shows that the known structural space is highly populated with limited set of structural patterns. We have also noticed that these frequently observed structural patterns are present as modules or "building blocks" in large proteins (i.e. higher secondary structure content). From structural descriptor analysis, observed patterns are found to be within similar deviation; however, frequent patterns are found to be distinctly occurring in diverse functions e.g. in enzymatic classes and reactions. In this study, we are introducing a simple approach to explore protein structural space using combinatorial- and graph-based geometry methods, which can be used to describe modularity in protein structures. Moreover, analysis indicates that protein function seems to be the driving force that shapes the known structure space.
Collapse
Affiliation(s)
- Taushif Khan
- a School of Computational & Integrative Sciences , Jawaharlal Nehru University , New Delhi 110067 , India
| | - Indira Ghosh
- a School of Computational & Integrative Sciences , Jawaharlal Nehru University , New Delhi 110067 , India
| |
Collapse
|
4
|
Gordeev AB, Efimov AV. Modeling of folds and folding pathways for some protein families of (α + β)- and (α/β)-classes. J Biomol Struct Dyn 2013; 31:4-16. [PMID: 22800569 DOI: 10.1080/07391102.2012.691341] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
5
|
Boshkova EA, Efimov AV. Structures closed into cycles in proteins containing 3β-corners. BIOCHEMISTRY (MOSCOW) 2010; 75:1258-63. [DOI: 10.1134/s000629791010007x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Gordeev AB, Kargatov AM, Efimov AV. PCBOST: Protein classification based on structural trees. Biochem Biophys Res Commun 2010; 397:470-1. [PMID: 20573601 DOI: 10.1016/j.bbrc.2010.05.136] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2010] [Accepted: 05/27/2010] [Indexed: 11/17/2022]
Abstract
In this paper, we present the protein classification based on structural trees (PCBOST). This is a novel hierarchical classification of proteins that is primarily based on similarity of overall folds of proteins as well as on the modeled folding pathways of proteins. Amino acid sequences, functions of proteins and their evolutionary relationship are not taken into account in this classification. To date the database includes 3847 proteins and domains grouped into six categories having structural similarity and forming six structural trees (total 10,547 PDB-entries). The work on extension of the database and construction of novel structural trees is in progress. The service is free for all users and available at the URL <http://strees.protres.ru/>.
Collapse
Affiliation(s)
- Alexey B Gordeev
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region 142290, Russian Federation
| | | | | |
Collapse
|
7
|
|
8
|
Abstract
In the present study, a novel structural motif of proteins referred to as the phi-motif is considered, and two novel structural trees in which the phi-motif is taken as the root structure have been constructed. The simplest phi-motif is formed by three adjacent beta-strands connected by loops and packed in one beta-sheet so that its overall fold resembles the Greek letter phi. Construction of the structural trees and modeling of folding pathways have shown that all structures of the protein superfamilies can be obtained by stepwise addition of alpha-helices and/or beta-strands to the root phi-motif taking into account a restricted set of rules inferred from known principles of protein structure. The structural trees are a good tool for structure comparison, structural classification of proteins, as well as for searching for all possible protein folds and folding pathways.
Collapse
Affiliation(s)
- A V Efimov
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia.
| |
Collapse
|
9
|
Gordeev AB, Kondratova MS, Efimov AV. Novel structural tree of β-proteins containing abcd units. Mol Biol 2008. [DOI: 10.1134/s0026893308020155] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
10
|
Garbuzynskiy S, Kondratova M. Structural features of protein folding nuclei. FEBS Lett 2008; 582:768-72. [DOI: 10.1016/j.febslet.2008.01.049] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2007] [Revised: 12/10/2007] [Accepted: 01/29/2008] [Indexed: 10/22/2022]
|
11
|
Fokas AS, Papatheodorou TS, Kister AE, Gelfand IM. A geometric construction determines all permissible strand arrangements of sandwich proteins. Proc Natl Acad Sci U S A 2005; 102:15851-3. [PMID: 16249331 PMCID: PMC1276083 DOI: 10.1073/pnas.0507335102] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
For a large class of proteins called sandwich-like proteins (SPs), the secondary structures consist of two beta-sheets packed face-to-face, with each beta-sheet consisting typically of three to five beta-strands. An important step in the prediction of the three-dimensional structure of a SP is the prediction of its supersecondary structure, namely the prediction of the arrangement of the beta-strands in the two beta-sheets. Recently, significant progress in this direction was made, where it was shown that 91% of observed SPs form what we here call "canonical motifs." Here, we show that all canonical motifs can be constructed in a simple manner that is based on thermodynamic considerations and uses certain geometric structures. The number of these structures is much smaller than the number of possible strand arrangements. For instance, whereas for SPs consisting of six strands there exist a priori 900 possible strand arrangements, there exist only five geometric structures. Furthermore, the few motifs that are noncanonial can be constructed from canonical motifs by a simple procedure.
Collapse
Affiliation(s)
- A S Fokas
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom.
| | | | | | | |
Collapse
|
12
|
Kister AE, Finkelstein AV, Gelfand IM. Common features in structures and sequences of sandwich-like proteins. Proc Natl Acad Sci U S A 2002; 99:14137-41. [PMID: 12384574 PMCID: PMC137850 DOI: 10.1073/pnas.212511499] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The goal of this work is to define the structural and sequence features common to sandwich-like proteins (SPs), a group of very different proteins now comprising 69 superfamilies in 38 protein folds. Analysis of the arrangements of strands within main sandwich sheets revealed a rigorously defined constraint on the supersecondary substructure that holds true for 94% of known SP structures. The invariant substructure consists of two interlocked pairs of neighboring beta-strands. It is even more typical for centers of SP than the well-known "Greek key" strands arrangement for their edges. As homology among these proteins is not usually detectable even with the most powerful sequence-comparing algorithms, we employed a structure-based approach to sequence alignment. Within the interlocked strands we found 12 positions with fixed structural roles in SP. A residue at any of these positions possesses similar structural properties with residues in the same position of other SPs. The 12 positions lie at the center of the interface between the beta-sheets and form the common geometrical core of SPs. Of the 12 positions, 8 are occupied by only four hydrophobic residues in 80% of all SPs.
Collapse
Affiliation(s)
- Alexander E Kister
- Department of Mathematics, Rutgers University, Piscataway, NJ 08854, USA.
| | | | | |
Collapse
|
13
|
Reddy BV, Li WW, Shindyalov IN, Bourne PE. Conserved key amino acid positions (CKAAPs) derived from the analysis of common substructures in proteins. Proteins 2001. [DOI: 10.1002/1097-0134(20010201)42:2%3c148::aid-prot20%3e3.0.co;2-r] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
14
|
Reddy BV, Li WW, Shindyalov IN, Bourne PE. Conserved key amino acid positions (CKAAPs) derived from the analysis of common substructures in proteins. Proteins 2001; 42:148-63. [PMID: 11119639 DOI: 10.1002/1097-0134(20010201)42:2<148::aid-prot20>3.0.co;2-r] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
An all-against-all protein structure comparison using the Combinatorial Extension (CE) algorithm applied to a representative set of PDB structures revealed a gallery of common substructures in proteins (http://cl.sdsc.edu/ce.html). These substructures represent commonly identified folds, domains, or components thereof. Most of the subsequences forming these similar substructures have no significant sequence similarity. We present a method to identify conserved amino acid positions and residue-dependent property clusters within these subsequences starting with structure alignments. Each of the subsequences is aligned to its homologues in SWALL, a nonredundant protein sequence database. The most similar sequences are purged into a common frequency matrix, and weighted homologues of each one of the subsequences are used in scoring for conserved key amino acid positions (CKAAPs). We have set the top 20% of the high-scoring positions in each substructure to be CKAAPs. It is hypothesized that CKAAPs may be responsible for the common folding patterns in either a local or global view of the protein-folding pathway. Where a significant number of structures exist, CKAAPs have also been identified in structure alignments of complete polypeptide chains from the same protein family or superfamily. Evidence to support the presence of CKAAPs comes from other computational approaches and experimental studies of mutation and protein-folding experiments, notably the Paracelsus challenge. Finally, the structural environment of CKAAPs versus non-CKAAPs is examined for solvent accessibility, hydrogen bonding, and secondary structure. The identification of CKAAPs has important implications for protein engineering, fold recognition, modeling, and structure prediction studies and is dependent on the availability of structures and an accurate structure alignment methodology. Proteins 2001;42:148-163.
Collapse
Affiliation(s)
- B V Reddy
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, California 92093-0505, USA
| | | | | | | |
Collapse
|
15
|
Alexandrescu AT, Jaravine VA, Dames SA, Lamour FP. NMR hydrogen exchange of the OB-fold protein LysN as a function of denaturant: the most conserved elements of structure are the most stable to unfolding. J Mol Biol 1999; 289:1041-54. [PMID: 10369781 DOI: 10.1006/jmbi.1999.2813] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The structure of LysN contains an OB-fold motif composed of a structurally conserved five-stranded beta-barrel capped by a poorly conserved alpha-helix between strands beta3 and beta4. Two additional alpha-helices, unique to the LysN structure, flank the N terminus of the OB-fold. The stability of LysN to unfolding has been investigated with NMR native state hydrogen exchange measurements as a function of guanidinium hydrochloride concentration, and equilibrium unfolding transitions monitored by ellipticity at 222 nm and fluorescence at 350 nm. The spectrophotometric measurements suggest an apparent two-state unfolding transition with DeltaGu(0) approximately 6 kcal/mol and m approximately 3 kcal/(molM). By contrast, NMR hydrogen exchange measurements manifest a distribution of DeltaGu(0) and m values which indicate that the protein can undergo subglobal unfolding. The largest DeltaGu(0) values from hydrogen exchange are for residues in the beta-sheet of the protein. These values, which reflect complete unfolding of the protein, are between 3 and 4 kcal/mol higher than those obtained from circular dichroism or fluorescence. This discrepancy may be due to the comparison of NMR hydrogen exchange parameters measured at residue-level resolution, with spectrophotometric parameters that reflect an unresolved super position of unfolding transitions of the alpha-helices and beta-strands. The largest DeltaGu(0) values obtained from hydrogen exchange for the subset of residues in the alpha-helices of the protein, agree with the DeltaGu(0) values obtained from circular dichroism or fluorescence. Based on the hydrogen exchange data, however, the three alpha-helices of LysN are on average 3 kcal/mol less stable than the beta-sheet. Consistent with the subglobal unfolding of LysN evinced by hydrogen exchange, a deletion mutant that lacks the first alpha-helix of the protein retains a cooperatively folded structure. Taken together with previous results on the OB-fold proteins SN and CspA, the present results for LysN suggest that the most conserved elements of structure in the OB-fold motif are the most resistant to denaturation. In all three proteins, stability to denaturation correlates with sequence hydrophobicity.
Collapse
Affiliation(s)
- A T Alexandrescu
- Department of Structural Biology, Biozentrum, University of Basel, Basel, CH-4056, Switzerland.
| | | | | | | |
Collapse
|