Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gibrat JF, Madej T, Bryant SH. Surprising similarities in structure comparison. Curr Opin Struct Biol 1996;6:377-85. [PMID: 8804824 DOI: 10.1016/s0959-440x(96)80058-3] [Citation(s) in RCA: 685] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]


Number

Cited by Other Article(s)

151

Hangasky JA, Taabazuing CY, Valliere MA, Knapp MJ. Imposing function down a (cupin)-barrel: secondary structure and metal stereochemistry in the αKG-dependent oxygenases. Metallomics 2013;5:287-301. [PMID: 23446356 PMCID: PMC4109655 DOI: 10.1039/c3mt20153h] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

152

Thomas JC, O'Hara JM, Hu L, Gao FP, Joshi SB, Volkin DB, Brey RN, Fang J, Karanicolas J, Mantis NJ, Middaugh CR. Effect of single-point mutations on the stability and immunogenicity of a recombinant ricin A chain subunit vaccine antigen. Hum Vaccin Immunother 2013;9:744-52. [PMID: 23563512 DOI: 10.4161/hv.22998] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

153

Description of local and global shape properties of protein helices. J Mol Model 2013;19:2901-11. [PMID: 23529181 DOI: 10.1007/s00894-013-1819-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Accepted: 03/05/2013] [Indexed: 10/27/2022]

154

Implementation of a parallel protein structure alignment service on cloud. Int J Genomics 2013;2013:439681. [PMID: 23671842 PMCID: PMC3647543 DOI: 10.1155/2013/439681] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2013] [Accepted: 02/20/2013] [Indexed: 12/20/2022] Open

155

Ashby C, Johnson D, Walker K, Kanj IA, Xia G, Huang X. New enumeration algorithm for protein structure comparison and classification. BMC Genomics 2013;14 Suppl 2:S1. [PMID: 23445440 PMCID: PMC3582452 DOI: 10.1186/1471-2164-14-s2-s1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

156

Wiegels T, Bienert S, Torda AE. Fast alignment and comparison of RNA structures. Bioinformatics 2013;29:588-96. [PMID: 23314325 DOI: 10.1093/bioinformatics/btt006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

157

Amela I, Delicado P, Gómez A, Querol E, Cedano J. A dynamic model of the proteins that form the initial iron-sulfur cluster biogenesis machinery in yeast mitochondria. Protein J 2013;32:183-96. [PMID: 23463383 DOI: 10.1007/s10930-013-9475-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

158

von Behren MM, Volkamer A, Henzler AM, Schomburg KT, Urbaczek S, Rarey M. Fast protein binding site comparison via an index-based screening technology. J Chem Inf Model 2013;53:411-22. [PMID: 23390978 DOI: 10.1021/ci300469h] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

159

Torshin IY, Esipova NG, Tumanyan VG. Alternatingly twisted β-hairpins and nonglycine residues in the disallowed II′ region of the Ramachandran plot. J Biomol Struct Dyn 2013;32:198-208. [DOI: 10.1080/07391102.2012.759451] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

160

Awwad K, Desai A, Smith C, Sommerhalter M. Structural and functional characterization of a noncanonical nucleoside triphosphate pyrophosphatase from Thermotoga maritima. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013;69:184-93. [PMID: 23385455 PMCID: PMC3565439 DOI: 10.1107/s0907444912044630] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 10/29/2012] [Indexed: 11/11/2022]

161

Devi PP, Adhikari S. Homology modeling and functional sites prediction of azoreductase enzyme from the cyanobacterium Nostoc sp. PCC7120. Interdiscip Sci 2013;4:310-8. [DOI: 10.1007/s12539-012-0140-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2012] [Revised: 05/02/2012] [Accepted: 07/30/2012] [Indexed: 10/27/2022]

162

Li SC. The difficulty of protein structure alignment under the RMSD. Algorithms Mol Biol 2013;8:1. [PMID: 23286762 PMCID: PMC3599502 DOI: 10.1186/1748-7188-8-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 12/17/2012] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Protein structure alignment is often modeled as the largest common point set (LCP) problem based on the Root Mean Square Deviation (RMSD), a measure commonly used to evaluate structural similarity. In the problem, each residue is represented by the coordinate of the Cα atom, and a structure is modeled as a sequence of 3D points. Out of two such sequences, one is to find two equal-sized subsequences of the maximum length, and a bijection between the points of the subsequences which gives an RMSD within a given threshold. The problem is considered to be difficult in terms of time complexity, but the reasons for its difficulty are not well understood. Improving this time complexity is considered important in protein structure prediction and structural comparison, where the task of comparing very large numbers of structures is commonly encountered.

RESULTS

To study why the LCP problem is difficult, we define a natural variant of the problem, called the minimum aligned distance (MAD). In the MAD problem, the length of the subsequences to obtain is specified in the input; and instead of fulfilling a threshold, the RMSD between the points of the two subsequences is to be minimized. Our results show that the difficulty of the two problems does not lie solely in the combinatorial complexity of finding the optimal subsequences, or in the task of superimposing the structures. By placing a limit on the distance between consecutive points, and assuming that the points are specified as integral values, we show that both problems are equally difficult, in the sense that they are reducible to each other. In this case, both problems can be exactly solved in polynomial time, although the time complexity remains high.

CONCLUSIONS

We present insights and techniques which we hope will lead to practical algorithms for the LCP problem for protein structures. The study identified two important factors in the problem's complexity: (1) the lack of a limit on the distance between consecutive points of a structure; (2) the arbitrariness of the precision allowed in the input values. Both issues are of little practical concern for the purpose of protein structure alignment. When these factors are removed, the LCP problem is as hard as that of minimizing the RMSD (MAD problem), and can be solved exactly in polynomial time.
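
The two problems named in this abstract can be stated compactly as below. The notation is an assumption introduced here for illustration and is not taken from the paper: A' and B' are the chosen equal-sized subsequences of the two Cα point sequences, k is their common length, π is the bijection between them, T ranges over rigid transformations (superposition is assumed to be part of the RMSD, as is conventional), and δ is the RMSD threshold.

```latex
\mathrm{RMSD}(A', B', \pi) \;=\; \min_{T}\, \sqrt{\frac{1}{k} \sum_{a \in A'} \bigl\lVert a - T(\pi(a)) \bigr\rVert^{2}}, \qquad k = |A'| = |B'|

\textbf{LCP:}\quad \text{given } \delta,\ \text{maximize } k \ \text{ subject to } \ \mathrm{RMSD}(A', B', \pi) \le \delta

\textbf{MAD:}\quad \text{given } k,\ \text{minimize } \mathrm{RMSD}(A', B', \pi)
```

Read this way, LCP fixes the error budget and asks for the largest match, while MAD fixes the match size and asks for the smallest error, which is why the abstract can describe them as reducible to each other under its stated restrictions.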


163

Hitaoka S, Shibata Y, Matoba H, Kawano A, Harada M, Rahman MM, Tsuji D, Hirokawa T, Itoh K, Yoshida T, Chuman H. Modeling of Human Neuraminidase-1 and Its Validation by LERE-Correlation Analysis. CHEM-BIO INFORMATICS JOURNAL 2013. [DOI: 10.1273/cbij.13.30] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

164

Production of bulk chemicals via novel metabolic pathways in microorganisms. Biotechnol Adv 2012;31:925-35. [PMID: 23280013 DOI: 10.1016/j.biotechadv.2012.12.008] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Revised: 12/09/2012] [Accepted: 12/23/2012] [Indexed: 02/05/2023]

165

Volkamer A, Kuhn D, Rippmann F, Rarey M. Predicting enzymatic function from global binding site descriptors. Proteins 2012;81:479-89. [DOI: 10.1002/prot.24205] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Revised: 09/21/2012] [Accepted: 10/11/2012] [Indexed: 11/09/2022]

166

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2012. [PMID: 23193264 PMCID: PMC3531099 DOI: 10.1093/nar/gks1189] [Citation(s) in RCA: 362] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

167

Structure of the type III secretion effector protein ExoU in complex with its chaperone SpcU. PLoS One 2012;7:e49388. [PMID: 23166655 PMCID: PMC3498133 DOI: 10.1371/journal.pone.0049388] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Accepted: 10/10/2012] [Indexed: 11/21/2022] Open

168

Chen BY, Bandyopadhyay S. A regionalizable statistical model of intersecting regions in protein-ligand binding cavities. J Bioinform Comput Biol 2012;10:1242004. [PMID: 22809380 DOI: 10.1142/s0219720012420048] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

169

Structure and function of a unique pore-forming protein from a pathogenic acanthamoeba. Nat Chem Biol 2012;9:37-42. [PMID: 23143413 DOI: 10.1038/nchembio.1116] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Accepted: 10/15/2012] [Indexed: 11/08/2022]

170

Xu D. Protein databases on the internet. CURRENT PROTOCOLS IN PROTEIN SCIENCE 2012;Chapter 2:2.6.1-2.6.17. [PMID: 23151744 DOI: 10.1002/0471140864.ps0206s70] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

171

Feldman HJ. Identifying structural domains of proteins using clustering. BMC Bioinformatics 2012;13:286. [PMID: 23116496 PMCID: PMC3534501 DOI: 10.1186/1471-2105-13-286] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2012] [Accepted: 10/29/2012] [Indexed: 11/16/2022] Open

172

Ritchie DW, Ghoorah AW, Mavridis L, Venkatraman V. Fast protein structure alignment using Gaussian overlap scoring of backbone peptide fragment similarity. Bioinformatics 2012;28:3274-81. [DOI: 10.1093/bioinformatics/bts618] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

173

Meux E, Prosper P, Masai E, Mulliert G, Dumarçay S, Morel M, Didierjean C, Gelhaye E, Favier F. Sphingobium sp. SYK-6 LigG involved in lignin degradation is structurally and biochemically related to the glutathione transferase ω class. FEBS Lett 2012;586:3944-50. [PMID: 23058289 DOI: 10.1016/j.febslet.2012.09.036] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2012] [Revised: 09/13/2012] [Accepted: 09/21/2012] [Indexed: 10/27/2022]

174

Santini G, Soldano H, Pothier J. Automatic classification of protein structures relying on similarities between alignments. BMC Bioinformatics 2012;13:233. [PMID: 22974051 PMCID: PMC3534633 DOI: 10.1186/1471-2105-13-233] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Accepted: 08/20/2012] [Indexed: 11/10/2022] Open

Abstract

Background

Identification of protein structural cores requires isolation of sets of proteins that all share the same subset of structural motifs. In the context of an ever-growing number of available 3D protein structures, standard and automatic clustering algorithms require adaptations so as to allow for efficient identification of such sets of proteins.

Results

A pair of 3D structures is declared similar or not according to the local similarities of their matching substructures in a structural alignment. This binary relation can be represented in a graph of similarities where a node represents a 3D protein structure and an edge states that two 3D protein structures are similar. Therefore, classifying proteins into structural families can be viewed as a graph clustering task. Unfortunately, because such a graph encodes only pairwise similarity information, clustering algorithms may include in the same cluster a subset of 3D structures that do not share a common substructure. In order to overcome this drawback we first define a ternary similarity on a triple of 3D structures as a constraint to be satisfied by the graph of similarities. Such a ternary constraint takes into account similarities between pairwise alignments, so as to ensure that the three involved protein structures do have some common substructure. We propose a modification algorithm that eliminates edges from the original graph of similarities and gives a reduced graph in which no ternary constraints are violated. Our approach is then first to build a graph of similarities, then to reduce the graph according to the modification algorithm, and finally to apply to the reduced graph a standard graph clustering algorithm. This method was used for classifying ASTRAL-40 non-redundant protein domains, identifying significant pairwise similarities with Yakusa, a program devised for rapid 3D structure alignments.

Conclusions

We show that filtering similarities prior to a standard graph-based clustering process by applying ternary similarity constraints i) improves the separation of proteins of different classes and consequently ii) improves the classification quality of standard graph-based clustering algorithms according to the reference classification SCOP.
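
The graph-reduction step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the abstract does not say which edge is eliminated from a violating triple, so dropping the weakest edge is a choice made here, and `ternary_ok` is a hypothetical predicate standing in for the paper's alignment-based ternary similarity test.

```python
from itertools import combinations


def reduce_graph(edges, ternary_ok):
    """Remove edges until no triangle violates the ternary constraint.

    edges: dict mapping frozenset({u, v}) -> similarity score (higher = more similar).
    ternary_ok: hypothetical predicate (u, v, w) -> bool; True when the three
        structures are judged to share a common substructure.
    Returns a new dict containing only the surviving edges.
    """
    kept = dict(edges)
    changed = True
    while changed:
        changed = False
        nodes = sorted({n for e in kept for n in e})
        for u, v, w in combinations(nodes, 3):
            triangle = [frozenset(p) for p in ((u, v), (v, w), (u, w))]
            # Only fully connected triples can violate the constraint.
            if all(e in kept for e in triangle) and not ternary_ok(u, v, w):
                # Assumption: break the triangle at its weakest edge.
                weakest = min(triangle, key=lambda e: kept[e])
                del kept[weakest]
                changed = True
    return kept


# Toy similarity graph over four structures (scores are made up).
edges = {
    frozenset(("A", "B")): 0.9,
    frozenset(("B", "C")): 0.8,
    frozenset(("A", "C")): 0.5,
    frozenset(("C", "D")): 0.7,
}
# Pretend A, B and C are pairwise similar but share no common substructure.
reduced = reduce_graph(edges, lambda u, v, w: {u, v, w} != {"A", "B", "C"})
```

In the toy graph the only triangle is A-B-C, so the reduction removes its weakest edge (A-C) and leaves the other three edges for whatever standard clustering algorithm is applied next.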


175

Manjasetty BA, Yu XH, Panjikar S, Taguchi G, Chance MR, Liu CJ. Structural basis for modification of flavonol and naphthol glucoconjugates by Nicotiana tabacum malonyltransferase (NtMaT1). PLANTA 2012;236:781-93. [PMID: 22610270 DOI: 10.1007/s00425-012-1660-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Accepted: 04/23/2012] [Indexed: 06/01/2023]

Abstract

Plant HXXXD acyltransferase-catalyzed malonylation is an important modification reaction in elaborating the structural diversity of flavonoids and anthocyanins, and a universal adaptive mechanism to detoxify xenobiotics. Nicotiana tabacum malonyltransferase 1 (NtMaT1) is a member of the anthocyanin acyltransferase subfamily that uses malonyl-CoA (MLC) as donor, catalyzing transacylation in a range of flavonoid and naphthol glucosides. To gain insights into the molecular basis underlying its catalytic mechanism and versatile substrate specificity, we resolved the X-ray crystal structure of NtMaT1 to 3.1 Å resolution. The structure comprises two α/β mixed subdomains, as typically found in the HXXXD acyltransferases. The partial electron density map of malonyl-CoA allowed us to reliably dock the entire molecule into the solvent channel and subsequently define the binding sites for both donor and acceptor substrates. MLC bound to NtMaT1 occupies one end of the long solvent channel between two subdomains. Superimposing and comparing the structure of NtMaT1 with that of an enzyme of the anthocyanin acyltransferase subfamily from red chrysanthemum (Dm3Mat3) revealed large architectural variation in the binding sites, both for the acyl donor and for the acceptor, although their overall protein folds are structurally conserved. Consequently, the shape and the interactions of malonyl-CoA with the binding sites' amino acid residues differ substantially. These major local architectural disparities point to the independent, divergent evolution of plant HXXXD acyltransferases in different species. The structural flexibility of the enzyme and the amendable binding pattern of the substrates provide a basis for the evolution of the distinct, versatile substrate specificity of plant HXXXD acyltransferases.


176

Bonnel N, Marteau PF. LNA: fast protein structural comparison using a Laplacian characterization of tertiary structure. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012;9:1451-1458. [PMID: 22547433 DOI: 10.1109/tcbb.2012.64] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

177

Burrell M, Hanfrey CC, Kinch LN, Elliott KA, Michael AJ. Evolution of a novel lysine decarboxylase in siderophore biosynthesis. Mol Microbiol 2012;86:485-99. [PMID: 22906379 DOI: 10.1111/j.1365-2958.2012.08208.x] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/07/2012] [Indexed: 12/30/2022]

178

Sousounis K, Haney CE, Cao J, Sunchu B, Tsonis PA. Conservation of the three-dimensional structure in non-homologous or unrelated proteins. Hum Genomics 2012;6:10. [PMID: 23244440 PMCID: PMC3500211 DOI: 10.1186/1479-7364-6-10] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 05/14/2012] [Indexed: 12/12/2022] Open

179

Horton JR, Mabuchi MY, Cohen-Karni D, Zhang X, Griggs RM, Samaranayake M, Roberts RJ, Zheng Y, Cheng X. Structure and cleavage activity of the tetrameric MspJI DNA modification-dependent restriction endonuclease. Nucleic Acids Res 2012;40:9763-73. [PMID: 22848107 PMCID: PMC3479186 DOI: 10.1093/nar/gks719] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

180

Mitin N, Rossman KL, Der CJ. Identification of a novel actin-binding domain within the Rho guanine nucleotide exchange factor TEM4. PLoS One 2012;7:e41876. [PMID: 22911862 PMCID: PMC3404065 DOI: 10.1371/journal.pone.0041876] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Accepted: 06/27/2012] [Indexed: 11/19/2022] Open

181

Aiello D, Caffrey DR. Evolution of specific protein-protein interaction sites following gene duplication. J Mol Biol 2012;423:257-72. [PMID: 22789570 DOI: 10.1016/j.jmb.2012.06.039] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Revised: 05/16/2012] [Accepted: 06/29/2012] [Indexed: 11/15/2022]

182

Mirceva G, Cingovska I, Dimov Z, Davcev D. Efficient approaches for retrieving protein tertiary structures. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012;9:1166-1179. [PMID: 22025763 DOI: 10.1109/tcbb.2011.138] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

183

Hung K, Wang JC, Chen CW, Chuang CL, Tsai KN, Chen CM. Enhancement of initial equivalency for protein structure alignment based on encoded local structures. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE : A PUBLICATION OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY 2012;16:1185-92. [PMID: 22717522 DOI: 10.1109/titb.2012.2204892] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

184

Joseph AP, Valadié H, Srinivasan N, de Brevern AG. Local structural differences in homologous proteins: specificities in different SCOP classes. PLoS One 2012;7:e38805. [PMID: 22745680 PMCID: PMC3382195 DOI: 10.1371/journal.pone.0038805] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2011] [Accepted: 05/10/2012] [Indexed: 11/19/2022] Open

Abstract

The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly applied to studying protein function and to drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of a limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use the PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions among groups of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class-specific variations can improve the performance of a PB-based structure comparison approach. The preference for the indel sites also seems to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I’ and II’ are also identified as preferred sites for insertions.


185

Chen BY, Bandyopadhyay S. Modeling regionalized volumetric differences in protein-ligand binding cavities. Proteome Sci 2012;10 Suppl 1:S6. [PMID: 22759583 PMCID: PMC3390949 DOI: 10.1186/1477-5956-10-s1-s6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

186

Launay G, Téletchéa S, Wade F, Pajot-Augy E, Gibrat JF, Sanz G. Automatic modeling of mammalian olfactory receptors and docking of odorants. Protein Eng Des Sel 2012;25:377-86. [PMID: 22691703 DOI: 10.1093/protein/gzs037] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

187

Shah SB, Sahinidis NV. SAS-Pro: simultaneous residue assignment and structure superposition for protein structure alignment. PLoS One 2012;7:e37493. [PMID: 22662161 PMCID: PMC3360771 DOI: 10.1371/journal.pone.0037493] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2011] [Accepted: 04/24/2012] [Indexed: 11/19/2022] Open

188

Anand P, Yeturu K, Chandra N. PocketAnnotate: towards site-based function annotation. Nucleic Acids Res 2012;40:W400-8. [PMID: 22618878 PMCID: PMC3394344 DOI: 10.1093/nar/gks421] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

189

Wang J, Gao X, Wang Q, Li Y. ProDis-ContSHC: learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval. BMC Bioinformatics 2012;13 Suppl 7:S2. [PMID: 22594999 PMCID: PMC3348016 DOI: 10.1186/1471-2105-13-s7-s2] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

Abstract

BACKGROUND

The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. Pairwise measures of this kind neglect the distribution of other proteins and thus cannot satisfy the need for high accuracy in retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database.

RESULTS

In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure d_ij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is, for a pair of proteins (i, j), if their contexts N(i) and N(j) are similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing d_ij by a factor learned from the contexts N(i) and N(j). Moreover, we divide the context into hierarchical sub-contexts and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update the Protein Hierarchical Context Coherently in an iterative algorithm, ProDis-ContSHC. We test the performance of ProDis-ContSHC on two benchmark sets, i.e., the ASTRAL 1.73 database and the FSSP/DALI database. Experimental results demonstrate that plugging our supervised contextual dissimilarity measures into the retrieval systems significantly outperforms the context-free dissimilarity/similarity measures and other unsupervised contextual dissimilarity measures that do not use the class label information.

CONCLUSIONS

Using the contextual proteins with their class labels in the database, we can improve the accuracy of the pairwise dissimilarity/similarity measures dramatically for the protein retrieval tasks. In this work, for the first time, we propose the idea of supervised contextual dissimilarity learning, resulting in the ProDis-ContSHC algorithm. Among different contextual dissimilarity learning approaches that can be used to compare a pair of proteins, ProDis-ContSHC provides the highest accuracy. Finally, ProDis-ContSHC compares favorably with other methods reported in the recent literature.
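
The basic idea in this abstract, shrinking a pairwise dissimilarity when two proteins have similar neighborhoods, can be illustrated with a short unsupervised sketch. Everything specific here is an assumption made for illustration: the paper learns its regularizing factor with an SVM over hierarchical sub-contexts, whereas this sketch substitutes a plain Jaccard overlap of k-nearest-neighbor sets, and the names `context`, `contextual_dissimilarity`, `k`, and `strength` are hypothetical.

```python
def context(d, i, k):
    """Context of item i: i itself plus its k nearest items under d."""
    others = [j for j in range(len(d)) if j != i]
    nearest = sorted(others, key=lambda j: d[i][j])[:k]
    return {i, *nearest}


def contextual_dissimilarity(d, k, strength=0.5):
    """Scale each d[i][j] down in proportion to the overlap of the two contexts.

    d: symmetric matrix (list of lists) of pairwise dissimilarities.
    strength: hypothetical knob in [0, 1]; 0 leaves d unchanged.
    The Jaccard-based factor is a stand-in, not the learned factor of the paper.
    """
    n = len(d)
    ctx = [context(d, i, k) for i in range(n)]
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                overlap = len(ctx[i] & ctx[j]) / len(ctx[i] | ctx[j])
                out[i][j] = d[i][j] * (1.0 - strength * overlap)
    return out


# Toy data: items 0-2 form a tight group, item 3 is an outlier.
d = [
    [0, 1, 2, 9],
    [1, 0, 2, 9],
    [2, 2, 0, 9],
    [9, 9, 9, 0],
]
regularized = contextual_dissimilarity(d, k=2)
```

With this toy matrix, items 0 and 1 have identical contexts and their dissimilarity is halved, while the outlier's dissimilarity to item 0 shrinks by a smaller fraction because the contexts only partly overlap.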


190

Schlenker C, Goel A, Tripet BP, Menon S, Willi T, Dlakić M, Young MJ, Lawrence CM, Copié V. Structural studies of E73 from a hyperthermophilic archaeal virus identify the "RH3" domain, an elaborated ribbon-helix-helix motif involved in DNA recognition. Biochemistry 2012;51:2899-910. [PMID: 22409376 PMCID: PMC3326356 DOI: 10.1021/bi201791s] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

191

Derbyshire MK, Lanczycki CJ, Bryant SH, Marchler-Bauer A. Annotation of functional sites with the Conserved Domain Database. Database (Oxford) 2012;2012:bar058. [PMID: 22434827 PMCID: PMC3308149 DOI: 10.1093/database/bar058] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2011] [Revised: 11/21/2011] [Accepted: 11/23/2011] [Indexed: 11/13/2022]

Abstract

The overwhelming fraction of proteins whose sequences have been collected in comprehensive databases may never be assessed for function experimentally. Commonly, putative function is assigned based on similarity to experimentally characterized homologs, either on the level of the entire protein or for single evolutionarily conserved domains. The annotation of individual sites provides more detailed insights regarding the correspondence between sequence and function, as well as context for the interpretation of sequence variation and the outcomes of experiments. In general, site annotation has to be extracted from the published literature, and can often be transferred to closely related sequence neighbors. The National Center for Biotechnology Information's Conserved Domain Database (CDD) provides a system for curators to record functional (such as active sites or binding sites for cofactors) or characteristic sites (such as signature motifs), which are conserved across domain families, and for the transfer of that annotation to protein database sequences via high-confidence domain matches. Recently, CDD curators have begun to sort site annotations into seven categories (active, polypeptide binding, nucleic acid binding, ion binding, chemical binding, post-translational modification and other) and here we present a first comparative analysis of sites obtained via domain model matches, juxtaposed with existing site annotation encountered in high-quality data sets. Site annotation derived from domain annotation has the potential to cover large fractions of protein sequences, and we observe that CDD-based site annotation complements existing site annotation in many cases, which may, in part, originate from CDD's curation practice of collecting sites conserved across diverse taxa and supported by evidence from multiple 3D structures.

192

Sadowski MI, Taylor WR. Evolutionary inaccuracy of pairwise structural alignments. Bioinformatics 2012;28:1209-15. [PMID: 22399676 PMCID: PMC3338010 DOI: 10.1093/bioinformatics/bts103] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

Abstract

Motivation: Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary history of most proteins. Since homology is an equivalence relation, an upper bound on alignment quality can be found by assessing the consistency of alignments. Measuring the consistency of current methods of structure alignment and determining the causes of inconsistencies can, therefore, provide information on the quality of current methods and suggest possibilities for further improvement.

Results: We analyze the self-consistency of seven widely-used structural alignment methods (SAP, TM-align, Fr-TM-align, MAMMOTH, DALI, CE and FATCAT) on a diverse, non-redundant set of 1863 domains from the SCOP database and demonstrate that even for relatively similar proteins the degree of inconsistency of the alignments on a residue level is high (30%). We further show that levels of consistency vary substantially between methods, with two methods (SAP and Fr-TM-align) producing more consistent alignments than the rest. Inconsistency is found to be higher near gaps and for proteins of low structural complexity, as well as for helices. The ability of the methods to identify good structural alignments is also assessed using geometric measures, for which FATCAT (flexible mode) is found to be the best performer despite being highly inconsistent. We conclude that there is substantial scope for improving the consistency of structural alignment methods.

Contact: msadows@nimr.mrc.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

193

Alvarez MA, Yan C. A new protein graph model for function prediction. Comput Biol Chem 2012;37:6-10. [PMID: 22381922 DOI: 10.1016/j.compbiolchem.2012.01.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2011] [Revised: 01/02/2012] [Accepted: 01/04/2012] [Indexed: 11/27/2022]

194

Sleator RD. Proteins: form and function. Bioeng Bugs 2012;3:80-5. [PMID: 22095055 DOI: 10.4161/bbug.18303] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

195

Tyagi M, Hashimoto K, Shoemaker BA, Wuchty S, Panchenko AR. Large-scale mapping of human protein interactome using structural complexes. EMBO Rep 2012;13:266-71. [PMID: 22261719 PMCID: PMC3296913 DOI: 10.1038/embor.2011.261] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2011] [Revised: 11/23/2011] [Accepted: 12/09/2011] [Indexed: 11/09/2022] Open

196

Samson F, Shrager R, Tai CH, Sam V, Lee B, Munson PJ, Gibrat JF, Garnier J. DOMIRE: a web server for identifying structural domains and their neighbors in proteins. Bioinformatics 2012;28:1040-1. [PMID: 22345617 DOI: 10.1093/bioinformatics/bts076] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

197

Tyagi M, Thangudu RR, Zhang D, Bryant SH, Madej T, Panchenko AR. Homology inference of protein-protein interactions via conserved binding sites. PLoS One 2012;7:e28896. [PMID: 22303436 PMCID: PMC3269416 DOI: 10.1371/journal.pone.0028896] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Accepted: 11/16/2011] [Indexed: 11/18/2022] Open

198

Tomii K, Sawada Y, Honda S. Convergent evolution in structural elements of proteins investigated using cross profile analysis. BMC Bioinformatics 2012;13:11. [PMID: 22244085 PMCID: PMC3398312 DOI: 10.1186/1471-2105-13-11] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Accepted: 01/16/2012] [Indexed: 11/10/2022] Open

Abstract

Background

Evolutionary relations of similar segments shared by different protein folds remain controversial, even though many examples of such segments have been found. To date, several methods such as those based on the results of structure comparisons, sequence-based classifications, and sequence-based profile-profile comparisons have been applied to identify such protein segments that possess local similarities in both sequence and structure across protein folds. However, to capture more precise sequence-structure relations, no method reported to date combines structure-based profiles, and sequence-based profiles based on evolutionary information. The former are generally regarded as representing the amino acid preferences at each position of a specific conformation of protein segment. They might reflect the nature of ancient short peptide ancestors, using the results of structural classifications of protein segments.

Results

This report describes the development and use of "Cross Profile Analysis" to compare sequence-based profiles and structure-based profiles based on amino acid occurrences at each position within a protein segment cluster. Using systematic cross profile analysis, we found structural clusters of 9-residue and 15-residue segments showing remarkably strong correlation with particular sequence profiles. These correlations reflect structural similarities among constituent segments of both sequence-based and structure-based profiles. We also report previously undetectable sequence-structure patterns that transcend protein family and fold boundaries, and present results of the conformational analysis of the deduced peptide of a segment cluster. These results suggest the existence of ancient short-peptide ancestors.

Conclusions

Cross profile analysis reveals the polyphyletic and convergent evolution of β-hairpin-like structures, which were verified both experimentally and computationally. The results presented here give us new insights into the evolution of short protein segments.

199

Gibney G, Baxevanis AD. Searching NCBI Databases Using Entrez. Curr Protoc Hum Genet 2012;Chapter 6:Unit6.10. [PMID: 21975942 DOI: 10.1002/0471142905.hg0610s71] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

200

Andreeva A. Classification of proteins: available structural space for molecular modeling. Methods Mol Biol 2012;857:1-31. [PMID: 22323215 DOI: 10.1007/978-1-61779-588-6_1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]