Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fischer D, Eisenberg D. Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium. Proc Natl Acad Sci U S A 1997;94:11929-34. [PMID: 9342339 PMCID: PMC23659 DOI: 10.1073/pnas.94.22.11929] [Citation(s) in RCA: 83] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

For:	Fischer D, Eisenberg D. Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium. Proc Natl Acad Sci U S A 1997;94:11929-34. [PMID: 9342339 PMCID: PMC23659 DOI: 10.1073/pnas.94.22.11929] [Citation(s) in RCA: 83] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Number

Cited by Other Article(s)

Müller A, MacCallum RM, Sternberg MJ. Benchmarking PSI-BLAST in genome annotation. J Mol Biol 1999;293:1257-71. [PMID: 10547299 DOI: 10.1006/jmbi.1999.3233] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Burley SK, Almo SC, Bonanno JB, Capel M, Chance MR, Gaasterland T, Lin D, Sali A, Studier FW, Swaminathan S. Structural genomics: beyond the human genome project. Nat Genet 1999;23:151-7. [PMID: 10508510 DOI: 10.1038/13783] [Citation(s) in RCA: 275] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Stohl EA, Brady SF, Clardy J, Handelsman J. ZmaR, a novel and widespread antibiotic resistance determinant that acetylates zwittermicin A. J Bacteriol 1999;181:5455-60. [PMID: 10464220 PMCID: PMC94055 DOI: 10.1128/jb.181.17.5455-5460.1999] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Frishman D, Mewes HW. Genome-based structural biology. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 1999;72:1-17. [PMID: 10446500 DOI: 10.1016/s0079-6107(98)00057-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Paw?owski K, Zhang B, Rychlewski L, Godzik A. TheHelicobacter pylori genome: From sequence analysis to structural and functional predictions. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(19990701)36:1<20::aid-prot2>3.0.co;2-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Zhang B, Rychlewski L, Pawłowski K, Fetrow JS, Skolnick J, Godzik A. From fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions. Protein Sci 1999;8:1104-15. [PMID: 10338021 PMCID: PMC2144342 DOI: 10.1110/ps.8.5.1104] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Abstract

A database of functional sites for proteins with known structures, SITE, is constructed and used in conjunction with a simple pattern matching program SiteMatch to evaluate possible function conservation in a recently constructed database of fold predictions for Escherichia coli proteins (Rychlewski L et al., 1999, Protein Sci 8:614-624). In this and other prediction databases, fold predictions are based on algorithms that can recognize weak sequence similarities and putatively assign new proteins into already characterized protein families. It is not clear whether such sequence similarities arise from distant homologies or general similarity of physicochemical features along the sequence. Leaving aside the important question of nature of relations within fold superfamilies, it is possible to assess possible function conservation by looking at the pattern of conservation of crucial functional residues. SITE consists of a multilevel function description based on structure annotations and structure analyses. In particular, active site residues, ligand binding residues, and patterns of hydrophobic residues on the protein surface are used to describe different functional features. SiteMatch, a simple pattern matching program, is designed to check the conservation of residues involved in protein activity in alignments generated by any alignment method. Here, this procedure is used to study conservation of functional features in alignments between protein sequences from the E. coli genome and their optimal structural templates. The optimal templates were identified and alignments taken from the database of genomic structural predictions was described in a previous publication (Rychlewski L et al., 1999, Protein Sci 8:614-624). An automated assessment of function conservation is used to analyze the relation between fold and function similarity for a large number of fold predictions. For instance, it is shown that identifying low significance predictions with a high level of functional residue conservations can be used to extend the prediction sensitivity for fold prediction methods. Over 100 new fold/function predictions in this class were obtained in the E. coli genome. At the same time, about 30% of our previous fold predictions are not confirmed as function predictions, further highlighting the problem of function divergence in fold superfamilies.

Collapse

Ota M, Nishikawa K. Feasibility in the inverse protein folding protocol. Protein Sci 1999;8:1001-9. [PMID: 10338011 PMCID: PMC2144338 DOI: 10.1110/ps.8.5.1001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Jones DT. GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J Mol Biol 1999;287:797-815. [PMID: 10191147 DOI: 10.1006/jmbi.1999.2583] [Citation(s) in RCA: 614] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Salamov AA, Suwa M, Orengo CA, Swindells MB. Genome analysis: Assigning protein coding regions to three-dimensional structures. Protein Sci 1999;8:771-7. [PMID: 10211823 PMCID: PMC2144302 DOI: 10.1110/ps.8.4.771] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Rychlewski L, Zhang B, Godzik A. Functional insights from structural predictions: analysis of the Escherichia coli genome. Protein Sci 1999;8:614-24. [PMID: 10091664 PMCID: PMC2144289 DOI: 10.1110/ps.8.3.614] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Fischer D. Modeling three-dimensional protein structures for amino acid sequences of the CASP3 experiment using sequence-derived predictions. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(1999)37:3+<61::aid-prot9>3.0.co;2-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Wolf YI, Brenner SE, Bash PA, Koonin EV. Distribution of Protein Folds in the Three Superkingdoms of Life. Genome Res 1999. [DOI: 10.1101/gr.9.1.17] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Ota M, Kawabata T, Kinjo AR, Nishikawa K. Cooperative approach for the protein fold recognition. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(1999)37:3+<126::aid-prot17>3.0.co;2-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Fischer D, Barret C, Bryson K, Elofsson A, Godzik A, Jones D, Karplus KJ, Kelley LA, MacCallum RM, Pawowski K, Rost B, Rychlewski L, Sternberg M. CAFASP-1: Critical assessment of fully automated structure prediction methods. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(1999)37:3+<209::aid-prot27>3.0.co;2-y] [Citation(s) in RCA: 107] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Teichmann SA, Park J, Chothia C. Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements. Proc Natl Acad Sci U S A 1998;95:14658-63. [PMID: 9843945 PMCID: PMC24505 DOI: 10.1073/pnas.95.25.14658] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Gerstein M. Patterns of protein-fold usage in eight microbial genomes: a comprehensive structural census. Proteins 1998;33:518-34. [PMID: 9849936 DOI: 10.1002/(sici)1097-0134(19981201)33:4<518::aid-prot5>3.0.co;2-j] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Razin S, Yogev D, Naot Y. Molecular biology and pathogenicity of mycoplasmas. Microbiol Mol Biol Rev 1998;62:1094-156. [PMID: 9841667 PMCID: PMC98941 DOI: 10.1128/mmbr.62.4.1094-1156.1998] [Citation(s) in RCA: 1065] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Abstract

The recent sequencing of the entire genomes of Mycoplasma genitalium and M. pneumoniae has attracted considerable attention to the molecular biology of mycoplasmas, the smallest self-replicating organisms. It appears that we are now much closer to the goal of defining, in molecular terms, the entire machinery of a self-replicating cell. Comparative genomics based on comparison of the genomic makeup of mycoplasmal genomes with those of other bacteria, has opened new ways of looking at the evolutionary history of the mycoplasmas. There is now solid genetic support for the hypothesis that mycoplasmas have evolved as a branch of gram-positive bacteria by a process of reductive evolution. During this process, the mycoplasmas lost considerable portions of their ancestors' chromosomes but retained the genes essential for life. Thus, the mycoplasmal genomes carry a high percentage of conserved genes, greatly facilitating gene annotation. The significant genome compaction that occurred in mycoplasmas was made possible by adopting a parasitic mode of life. The supply of nutrients from their hosts apparently enabled mycoplasmas to lose, during evolution, the genes for many assimilative processes. During their evolution and adaptation to a parasitic mode of life, the mycoplasmas have developed various genetic systems providing a highly plastic set of variable surface proteins to evade the host immune system. The uniqueness of the mycoplasmal systems is manifested by the presence of highly mutable modules combined with an ability to expand the antigenic repertoire by generating structural alternatives, all compressed into limited genomic sequences. In the absence of a cell wall and a periplasmic space, the majority of surface variable antigens in mycoplasmas are lipoproteins. Apart from providing specific antimycoplasmal defense, the host immune system is also involved in the development of pathogenic lesions and exacerbation of mycoplasma induced diseases. Mycoplasmas are able to stimulate as well as suppress lymphocytes in a nonspecific, polyclonal manner, both in vitro and in vivo. As well as to affecting various subsets of lymphocytes, mycoplasmas and mycoplasma-derived cell components modulate the activities of monocytes/macrophages and NK cells and trigger the production of a wide variety of up-regulating and down-regulating cytokines and chemokines. Mycoplasma-mediated secretion of proinflammatory cytokines, such as tumor necrosis factor alpha, interleukin-1 (IL-1), and IL-6, by macrophages and of up-regulating cytokines by mitogenically stimulated lymphocytes plays a major role in mycoplasma-induced immune system modulation and inflammatory responses.

Collapse

Sánchez R, Sali A. Large-scale protein structure modeling of the Saccharomyces cerevisiae genome. Proc Natl Acad Sci U S A 1998;95:13597-602. [PMID: 9811845 PMCID: PMC24864 DOI: 10.1073/pnas.95.23.13597] [Citation(s) in RCA: 282] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/1998] [Indexed: 11/18/2022] Open

Bork P, Dandekar T, Diaz-Lazcoz Y, Eisenhaber F, Huynen M, Yuan Y. Predicting function: from genes to genomes and back. J Mol Biol 1998;283:707-25. [PMID: 9790834 DOI: 10.1006/jmbi.1998.2144] [Citation(s) in RCA: 262] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Dubchak I, Muchnik I, Kim SH. Assignment of folds for proteins of unknown function in three microbial genomes. MICROBIAL & COMPARATIVE GENOMICS 1998;3:171-5. [PMID: 9775387 DOI: 10.1089/omi.1.1998.3.171] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Russell RB, Sasieni PD, Sternberg MJ. Supersites within superfolds. Binding site similarity in the absence of homology. J Mol Biol 1998;282:903-18. [PMID: 9743635 DOI: 10.1006/jmbi.1998.2043] [Citation(s) in RCA: 162] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Holm L, Sander C. Dictionary of recurrent domains in protein structures. Proteins 1998;33:88-96. [PMID: 9741847 DOI: 10.1002/(sici)1097-0134(19981001)33:1<88::aid-prot8>3.0.co;2-h] [Citation(s) in RCA: 145] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Herrmann R, Reiner B. Mycoplasma pneumoniae and Mycoplasma genitalium: a comparison of two closely related bacterial species. Curr Opin Microbiol 1998;1:572-9. [PMID: 10066529 DOI: 10.1016/s1369-5274(98)80091-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Gerstein M, Hegyi H. Comparing genomes in terms of protein structure: surveys of a finite parts list. FEMS Microbiol Rev 1998;22:277-304. [PMID: 10357579 DOI: 10.1111/j.1574-6976.1998.tb00371.x] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Abstract

We give an overview of the emerging field of structural genomics, describing how genomes can be compared in terms of protein structure. As the number of genes in a genome and the total number of protein folds are both quite limited, these comparisons take the form of surveys of a finite parts list, similar in respects to demographic censuses. Fold surveys have many similarities with other whole-genome characterizations, e.g., analyses of motifs or pathways. However, structure has a number of aspects that make it particularly suitable for comparing genomes, namely the way it allows for the precise definition of a basic protein module and the fact that it has a better defined relationship to sequence similarity than does protein function. An essential requirement for a structure survey is a library of folds, which groups the known structures into 'fold families.' This library can be built up automatically using a structure comparison program, and we described how important objective statistical measures are for assessing similarities within the library and between the library and genome sequences. After building the library, one can use it to count the number of folds in genomes, expressing the results in the form of Venn diagrams and 'top-10' statistics for shared and common folds. Depending on the counting methodology employed, these statistics can reflect different aspects of the genome, such as the amount of internal duplication or gene expression. Previous analyses have shown that the common folds shared between very different microorganisms, i.e., in different kingdoms, have a remarkably similar structure, being comprised of repeated strand-helix-strand super-secondary structure units. A major difficulty with this sort of 'fold-counting' is that only a small subset of the structures in a complete genome are currently known and this subset is prone to sampling bias. One way of overcoming biases is through structure prediction, which can be applied uniformly and comprehensively to a whole genome. Various investigators have, in fact, already applied many of the existing techniques for predicting secondary structure and transmembrane (TM) helices to the recently sequenced genomes. The results have been consistent: microbial genomes have similar fractions of strands and helices even though they have significantly different amino acid composition. The fraction of membrane proteins with a given number of TM helices falls off rapidly with more TM elements, approximately according to a Zipf law. This latter finding indicates that there is no preference for the highly studied 7-TM proteins in microbial genomes. Continuously updated tables and further information pertinent to this review are available over the web at http://bioinfo.mbb.yale.edu/genome.

Collapse

Rychlewski L, Zhang B, Godzik A. Fold and function predictions for Mycoplasma genitalium proteins. FOLDING & DESIGN 1998;3:229-38. [PMID: 9710568 DOI: 10.1016/s1359-0278(98)00034-0] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Kim SH. Shining a light on structural genomics. NATURE STRUCTURAL BIOLOGY 1998;5 Suppl:643-5. [PMID: 9699614 DOI: 10.1038/1334] [Citation(s) in RCA: 94] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Huynen M, Doerks T, Eisenhaber F, Orengo C, Sunyaev S, Yuan Y, Bork P. Homology-based fold predictions for Mycoplasma genitalium proteins. J Mol Biol 1998;280:323-6. [PMID: 9665839 DOI: 10.1006/jmbi.1998.1884] [Citation(s) in RCA: 88] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Beamer LJ, Fischer D, Eisenberg D. Detecting distant relatives of mammalian LPS-binding and lipid transport proteins. Protein Sci 1998;7:1643-6. [PMID: 9684900 PMCID: PMC2144061 DOI: 10.1002/pro.5560070721] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Koonin EV, Tatusov RL, Galperin MY. Beyond complete genomes: from sequence to structure and function. Curr Opin Struct Biol 1998;8:355-63. [PMID: 9666332 DOI: 10.1016/s0959-440x(98)80070-5] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Holm L. Unification of protein families. Curr Opin Struct Biol 1998;8:372-9. [PMID: 9666334 DOI: 10.1016/s0959-440x(98)80072-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Bork P, Koonin EV. Predicting functions from protein sequences--where are the bottlenecks? Nat Genet 1998;18:313-8. [PMID: 9537411 DOI: 10.1038/ng0498-313] [Citation(s) in RCA: 236] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Rost B. Marrying structure and genomics. Structure 1998;6:259-63. [PMID: 9551548 DOI: 10.1016/s0969-2126(98)00029-x] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Léonetti JP, Wong K, Geiduschek EP. Core-sigma interaction: probing the interaction of the bacteriophage T4 gene 55 promoter recognition protein with E.coli RNA polymerase core. EMBO J 1998;17:1467-75. [PMID: 9482743 PMCID: PMC1170494 DOI: 10.1093/emboj/17.5.1467] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open