Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Boutonnet NS, Kajava AV, Rooman MJ. Structural classification of αββ and ββα supersecondary structure units in proteins. Proteins 1998. [DOI: 10.1002/(sici)1097-0134(19980201)30:2<193::aid-prot9>3.0.co;2-o] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

For:	Boutonnet NS, Kajava AV, Rooman MJ. Structural classification of αββ and ββα supersecondary structure units in proteins. Proteins 1998. [DOI: 10.1002/(sici)1097-0134(19980201)30:2<193::aid-prot9>3.0.co;2-o] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

Caetano-Anollés K, Aziz MF, Mughal F, Caetano-Anollés G. On Protein Loops, Prior Molecular States and Common Ancestors of Life. J Mol Evol 2024:10.1007/s00239-024-10167-y. [PMID: 38652291 DOI: 10.1007/s00239-024-10167-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 03/22/2024] [Indexed: 04/25/2024]

Extension of the classical classification of β-turns. Sci Rep 2016;6:33191. [PMID: 27627963 PMCID: PMC5024104 DOI: 10.1038/srep33191] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 08/22/2016] [Indexed: 11/29/2022] Open

Ho HK, Zhang L, Ramamohanarao K, Martin S. A survey of machine learning methods for secondary and supersecondary protein structure prediction. Methods Mol Biol 2013;932:87-106. [PMID: 22987348 DOI: 10.1007/978-1-62703-065-6_6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]

Fernandez-Fuentes N, Fiser A. A modular perspective of protein structures: application to fragment based loop modeling. Methods Mol Biol 2013;932:141-58. [PMID: 22987351 PMCID: PMC3635063 DOI: 10.1007/978-1-62703-065-6_9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Hu C, Koehl P, Max N. PackHelix: a tool for helix-sheet packing during protein structure prediction. Proteins 2011;79:2828-43. [PMID: 21905109 PMCID: PMC3172692 DOI: 10.1002/prot.23108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Revised: 04/18/2011] [Accepted: 05/13/2011] [Indexed: 11/09/2022]

Hu C, Koehl P. Helix-sheet packing in proteins. Proteins 2010;78:1736-47. [PMID: 20186972 PMCID: PMC2854864 DOI: 10.1002/prot.22688] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Fernandez-Fuentes N, Dybas JM, Fiser A. Structural characteristics of novel protein folds. PLoS Comput Biol 2010;6:e1000750. [PMID: 20421995 PMCID: PMC2858679 DOI: 10.1371/journal.pcbi.1000750] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2009] [Accepted: 03/19/2010] [Indexed: 11/29/2022] Open

Abstract

Folds are the basic building blocks of protein structures. Understanding the emergence of novel protein folds is an important step towards understanding the rules governing the evolution of protein structure and function and for developing tools for protein structure modeling and design. We explored the frequency of occurrences of an exhaustively classified library of supersecondary structural elements (Smotifs), in protein structures, in order to identify features that would define a fold as novel compared to previously known structures. We found that a surprisingly small set of Smotifs is sufficient to describe all known folds. Furthermore, novel folds do not require novel Smotifs, but rather are a new combination of existing ones. Novel folds can be typified by the inclusion of a relatively higher number of rarely occurring Smotifs in their structures and, to a lesser extent, by a novel topological combination of commonly occurring Smotifs. When investigating the structural features of Smotifs, we found that the top 10% of most frequent ones have a higher fraction of internal contacts, while some of the most rare motifs are larger, and contain a longer loop region.

Structural genomics efforts aim at exploring the repertoire of three-dimensional structures of protein molecules. While genome scale sequencing projects have already provided us with all the genes of many organisms, it is the three dimensional shape of gene encoded proteins that defines all the interactions among these components. Understanding the versatility and, ultimately, the role of all possible molecular shapes in the cell is a necessary step toward understanding how organisms function. In this work we explored the rules that identify certain shapes as novel compared to all already known structures. The findings of this work provide possible insights into the rules that can be used in future works to identify or design new molecular shapes or to relate folds with each other in a quantitative manner.

Collapse

Tyagi M, Bornot A, Offmann B, de Brevern AG. Analysis of loop boundaries using different local structure assignment methods. Protein Sci 2009;18:1869-81. [PMID: 19606500 DOI: 10.1002/pro.198] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Shi S, Chitturi B, Grishin NV. ProSMoS server: a pattern-based search using interaction matrix representation of protein structures. Nucleic Acids Res 2009;37:W526-31. [PMID: 19420061 PMCID: PMC2703969 DOI: 10.1093/nar/gkp316] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Paluszewski M, Winter P. Protein Decoy Generation Using Branch and Bound with Efficient Bounding. LECTURE NOTES IN COMPUTER SCIENCE 2008. [DOI: 10.1007/978-3-540-87361-7_32] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Kurgan L, Kedarisetti KD. Sequence representation and prediction of protein secondary structure for structural motifs in twilight zone proteins. Protein J 2007;25:463-74. [PMID: 17115254 DOI: 10.1007/s10930-006-9029-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Shi S, Zhong Y, Majumdar I, Sri Krishna S, Grishin NV. Searching for three-dimensional secondary structural patterns in proteins with ProSMoS. Bioinformatics 2007;23:1331-8. [PMID: 17384423 DOI: 10.1093/bioinformatics/btm121] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Abstract

MOTIVATION

Many evolutionarily distant, but functionally meaningful links between proteins come to light through comparison of spatial structures. Most programs that assess structural similarity compare two proteins to each other and find regions in common between them. Structural classification experts look for a particular structural motif instead. Programs base similarity scores on superposition or closeness of either Cartesian coordinates or inter-residue contacts. Experts pay more attention to the general orientation of the main chain and mutual spatial arrangement of secondary structural elements. There is a need for a computational tool to find proteins with the same secondary structures, topological connections and spatial architecture, regardless of subtle differences in 3D coordinates.

RESULTS

We developed ProSMoS--a Protein Structure Motif Search program that emulates an expert. Starting from a spatial structure, the program uses previously delineated secondary structural elements. A meta-matrix of interactions between the elements (parallel or antiparallel) minding handedness of connections (left or right) and other features (e.g. element lengths and hydrogen bonds) is constructed prior to or during the searches. All structures are reduced to such meta-matrices that contain just enough information to define a protein fold, but this definition remains very general and deviations in 3D coordinates are tolerated. User supplies a meta-matrix for a structural motif of interest, and ProSMoS finds all proteins in the protein data bank (PDB) that match the meta-matrix. ProSMoS performance is compared to other programs and is illustrated on a beta-Grasp motif. A brief analysis of all beta-Grasp-containing proteins is presented. Program availability: ProSMoS is freely available for non-commercial use from ftp://iole.swmed.edu/pub/ProSMoS.

Collapse

Kihara D, Skolnick J. The PDB is a covering set of small protein structures. J Mol Biol 2004;334:793-802. [PMID: 14636603 DOI: 10.1016/j.jmb.2003.10.027] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

Structure comparisons of all representative proteins have been done. Employing the relative root mean square deviation (RMSD) from native enables the assessment of the statistical significance of structure alignments of different lengths in terms of a Z-score. Two conclusions emerge: first, proteins with their native fold can be distinguished by their Z-score. Second and somewhat surprising, all small proteins up to 100 residues in length have significant structure alignments to other proteins in a different secondary structure and fold class; i.e. 24.0% of them have 60% coverage by a template protein with a RMSD below 3.5A and 6.0% have 70% coverage. If the restriction that we align proteins only having different secondary structure types is removed, then in a representative benchmark set of proteins of 200 residues or smaller, 93% can be aligned to a single template structure (with average sequence identity of 9.8%), with a RMSD less than 4A, and 79% average coverage. In this sense, the current Protein Data Bank (PDB) is almost a covering set of small protein structures. The length of the aligned region (relative to the whole protein length) does not differ among the top hit proteins, indicating that protein structure space is highly dense. For larger proteins, non-related proteins can cover a significant portion of the structure. Moreover, these top hit proteins are aligned to different parts of the target protein, so that almost the entire molecule can be covered when combined. The number of proteins required to cover a target protein is very small, e.g. the top ten hit proteins can give 90% coverage below a RMSD of 3.5A for proteins up to 320 residues long. These results give a new view of the nature of protein structure space, and its implications for protein structure prediction are discussed.

Collapse

Singh SK, Babu MM, Balaram P. Registering alpha-helices and beta-strands using backbone C-H...O interactions. Proteins 2003;51:167-71. [PMID: 12660986 DOI: 10.1002/prot.10245] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Iurcu-Mustata G, Van Belle D, Wintjens R, Prévost M, Rooman M. Role of salt bridges in homeodomains investigated by structural analyses and molecular dynamics simulations. Biopolymers 2001;59:145-59. [PMID: 11391564 DOI: 10.1002/1097-0282(200109)59:3<145::aid-bip1014>3.0.co;2-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Gilis D, Rooman M. Identification and ab initio simulations of early folding units in proteins. Proteins 2001;42:164-76. [PMID: 11119640 DOI: 10.1002/1097-0134(20010201)42:2<164::aid-prot30>3.0.co;2-#] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

de Brevern AG, Etchebest C, Hazout S. Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Proteins 2000;41:271-87. [PMID: 11025540 DOI: 10.1002/1097-0134(20001115)41:3<271::aid-prot10>3.0.co;2-z] [Citation(s) in RCA: 208] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Tsai CJ, Maizel JV, Nussinov R. Distinguishing between sequential and nonsequentially folded proteins: implications for folding and misfolding. Protein Sci 1999;8:1591-604. [PMID: 10452603 PMCID: PMC2144423 DOI: 10.1110/ps.8.8.1591] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Abstract

We describe here an algorithm for distinguishing sequential from nonsequentially folding proteins. Several experiments have recently suggested that most of the proteins that are synthesized in the eukaryotic cell may fold sequentially. This proposed folding mechanism in vivo is particularly advantageous to the organism. In the absence of chaperones, the probability that a sequentially folding protein will misfold is reduced significantly. The problem we address here is devising a procedure that would differentiate between the two types of folding patterns. Footprints of sequential folding may be found in structures where consecutive fragments of the chain interact with each other. In such cases, the folding complexity may be viewed as being lower. On the other hand, higher folding complexity suggests that at least a portion of the polypeptide backbone folds back upon itself to form three-dimensional (3D) interactions with noncontiguous portion(s) of the chain. Hence, we look at the mechanism of folding of the molecule via analysis of its complexity, that is, through the 3D interactions formed by contiguous segments on the polypeptide chain. To computationally splice the structure into consecutively interacting fragments, we either cut it into compact hydrophobic folding units or into a set of hypothetical, transient, highly populated, contiguous fragments ("building blocks" of the structure). In sequential folding, successive building blocks interact with each other from the amino to the carboxy terminus of the polypeptide chain. Consequently, the results of the parsing differentiate between sequentially vs. nonsequentially folded chains. The automated assessment of the folding complexity provides insight into both the likelihood of misfolding and the kinetic folding rate of the given protein. In terms of the funnel free energy landscape theory, a protein that truly follows the mechanism of sequential folding, in principle, encounters smoother free energy barriers. A simple sequentially folded protein should, therefore, be less error prone and fold faster than a protein with a complex folding pattern.

Collapse

Reddy BV, Nagarajaram HA, Blundell TL. Analysis of interactive packing of secondary structural elements in alpha/beta units in proteins. Protein Sci 1999;8:573-86. [PMID: 10091660 PMCID: PMC2144285 DOI: 10.1110/ps.8.3.573] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]