Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: May AC. Toward more meaningful hierarchical classification of protein three-dimensional structures. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(19991001)37:1<20::aid-prot3>3.0.co;2-v] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

For:	May AC. Toward more meaningful hierarchical classification of protein three-dimensional structures. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(19991001)37:1<20::aid-prot3>3.0.co;2-v] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Number

Cited by Other Article(s)

Xu S, Zou S, Wang L. A geometric clustering algorithm with applications to structural data. J Comput Biol 2014;22:436-50. [PMID: 25517067 DOI: 10.1089/cmb.2014.0162] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Pelta DA, González JR, Moreno Vega M. A simple and fast heuristic for protein structure comparison. BMC Bioinformatics 2008;9:161. [PMID: 18366735 PMCID: PMC2335283 DOI: 10.1186/1471-2105-9-161] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2007] [Accepted: 03/25/2008] [Indexed: 11/20/2022] Open

Theobald DL, Wuttke DS. Divergent evolution within protein superfolds inferred from profile-based phylogenetics. J Mol Biol 2005;354:722-37. [PMID: 16266719 PMCID: PMC1769326 DOI: 10.1016/j.jmb.2005.08.071] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2005] [Revised: 08/29/2005] [Accepted: 08/30/2005] [Indexed: 11/19/2022]

Bostick DL, Shen M, Vaisman II. A simple topological representation of protein structure: implications for new, fast, and robust structural classification. Proteins 2004;56:487-501. [PMID: 15229882 DOI: 10.1002/prot.20146] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Day R, Beck DAC, Armen RS, Daggett V. A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary. Protein Sci 2004;12:2150-60. [PMID: 14500873 PMCID: PMC2366924 DOI: 10.1110/ps.0306803] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Wintjens R, Noël C, May ACW, Gerbod D, Dufernez F, Capron M, Viscogliosi E, Rooman M. Specificity and Phenetic Relationships of Iron- and Manganese-containing Superoxide Dismutases on the Basis of Structure and Sequence Comparisons. J Biol Chem 2004;279:9248-54. [PMID: 14672935 DOI: 10.1074/jbc.m312329200] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Molecular Phylogenetics of Restriction Endonucleases. ACTA ACUST UNITED AC 2004. [DOI: 10.1007/978-3-642-18851-0_3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/24/2023]

May ACW. Definition of the tempo of sequence diversity across an alignment and automatic identification of sequence motifs: Application to protein homologous families and superfamilies. Protein Sci 2002;11:2825-35. [PMID: 12441381 PMCID: PMC2373737 DOI: 10.1110/ps.0211202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Orengo CA, Sillitoe I, Reeves G, Pearl FM. Review: what can structural classifications reveal about protein evolution? J Struct Biol 2001;134:145-65. [PMID: 11551176 DOI: 10.1006/jsbi.2001.4398] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Grishin NV. Fold change in evolution of protein structures. J Struct Biol 2001;134:167-85. [PMID: 11551177 DOI: 10.1006/jsbi.2001.4335] [Citation(s) in RCA: 334] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

May AC. Optimal classification of protein sequences and selection of representative sets from multiple alignments: application to homologous families and lessons for structural genomics. PROTEIN ENGINEERING 2001;14:209-17. [PMID: 11391012 DOI: 10.1093/protein/14.4.209] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

Hierarchical classification is probably the most popular approach to group related proteins. However, there are a number of problems associated with its use for this purpose. One is that the resulting tree showing a nested sequence of groups may not be the most suitable representation of the data. Another is that visual inspection is the most common method to decide the most appropriate number of subsets from a tree. In fact, classification of proteins in general is bedevilled with the need for subjective thresholds to define group membership (e.g., 'significant' sequence identity for homologous families). Such arbitrariness is not only intellectually unsatisfying but also has important practical consequences. For instance, it hinders meaningful identification of protein targets for structural genomics. I describe an alternative approach to cluster related proteins without the need for an a priori threshold: one, through its use of dynamic programming, which is guaranteed to produce globally optimal solutions at all levels of partition granularity. Grouping proteins according to weights assigned to their aligned sequences makes it possible to delineate dynamically a 'core-periphery' structure within families. The 'core' of a protein family comprises the most typical sequences while the 'periphery' consists of the atypical ones. Further, a new sequence weighting scheme that combines the information in all the multiply aligned positions of an alignment in a novel way is put forward. Instead of averaging over all positions, this procedure takes into account directly the distribution of sequence variability along an alignment. The relationships between sequence weights and sequence identity are investigated for 168 families taken from HOMSTRAD, a database of protein structure alignments for homologous families. An exact solution is presented for the problem of how to select the most representative pair of sequences for a protein family. Extension of this approach by a greedy algorithm allows automatic identification of a minimal set of aligned sequences. The results of this analysis are available on the Web at http://mathbio.nimr.mrc.ac.uk/~amay.

Collapse

Copley RR, Bork P. Homology among (betaalpha)(8) barrels: implications for the evolution of metabolic pathways. J Mol Biol 2000;303:627-41. [PMID: 11054297 DOI: 10.1006/jmbi.2000.4152] [Citation(s) in RCA: 163] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Gutiérrez G, Ganfornina MD, Sánchez D. Evolution of the lipocalin family as inferred from a protein sequence phylogeny. BIOCHIMICA ET BIOPHYSICA ACTA 2000;1482:35-45. [PMID: 11058745 DOI: 10.1016/s0167-4838(00)00151-5] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]