Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sander C, Schneider R. The HSSP data base of protein structure-sequence alignments. Nucleic Acids Res 1993;21:3105-9. [PMID: 8332531 PMCID: PMC309738 DOI: 10.1093/nar/21.13.3105] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.5] [Indexed: 01/29/2023] Open

Cited by Other Article(s)

van Beusekom B, Perrakis A, Joosten RP. Data Mining of Macromolecular Structures. Methods Mol Biol 2016;1415:107-38. [PMID: 27115630 DOI: 10.1007/978-1-4939-3572-7_6] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 12/24/2022]

Touw WG, Baakman C, Black J, te Beek TAH, Krieger E, Joosten RP, Vriend G. A series of PDB-related databanks for everyday needs. Nucleic Acids Res 2014;43:D364-8. [PMID: 25352545 PMCID: PMC4383885 DOI: 10.1093/nar/gku1028] [Citation(s) in RCA: 623] [Impact Index Per Article: 62.3] [Indexed: 12/02/2022] Open

Dey S, Pal A, Guharoy M, Sonavane S, Chakrabarti P. Characterization and prediction of the binding site in DNA-binding proteins: improvement of accuracy by combining residue composition, evolutionary conservation and structural parameters. Nucleic Acids Res 2012;40:7150-61. [PMID: 22641851 PMCID: PMC3424558 DOI: 10.1093/nar/gks405] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Indexed: 12/25/2022] Open

Juritz E, Palopoli N, Fornasari MS, Fernandez-Alberti S, Parisi G. Protein Conformational Diversity Modulates Sequence Divergence. Mol Biol Evol 2012;30:79-87. [DOI: 10.1093/molbev/mss080] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Indexed: 02/06/2023] Open

Rational design of DNA sequence-specific zinc fingers. FEBS Lett 2012;586:918-23. [DOI: 10.1016/j.febslet.2012.02.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 11/07/2011] [Revised: 02/10/2012] [Accepted: 02/15/2012] [Indexed: 11/22/2022]

Satagopam VP, Theodoropoulou MC, Stampolakis CK, Pavlopoulos GA, Papandreou NC, Bagos PG, Schneider R, Hamodrakas SJ. GPCRs, G-proteins, effectors and their interactions: human-gpDB, a database employing visualization tools and data integration techniques. Database (Oxford) 2010;2010:baq019. [PMID: 20689020 PMCID: PMC2931634 DOI: 10.1093/database/baq019] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Indexed: 11/15/2022]

Dey S, Pal A, Chakrabarti P, Janin J. The subunit interfaces of weakly associated homodimeric proteins. J Mol Biol 2010;398:146-60. [PMID: 20156457 DOI: 10.1016/j.jmb.2010.02.020] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.6] [Received: 09/16/2009] [Revised: 02/10/2010] [Accepted: 02/10/2010] [Indexed: 02/07/2023]

Galperin MY, Cochrane GR. Nucleic Acids Research annual Database Issue and the NAR online Molecular Biology Database Collection in 2009. Nucleic Acids Res 2008;37:D1-4. [PMID: 19033364 PMCID: PMC2686608 DOI: 10.1093/nar/gkn942] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.1] [Indexed: 12/05/2022] Open

The UniProtKB/Swiss-Prot knowledgebase and its Plant Proteome Annotation Program. J Proteomics 2008;72:567-73. [PMID: 19084081 DOI: 10.1016/j.jprot.2008.11.010] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Received: 09/18/2008] [Revised: 11/04/2008] [Accepted: 11/10/2008] [Indexed: 11/21/2022]

Protein–protein interaction and quaternary structure. Q Rev Biophys 2008;41:133-80. [PMID: 18812015 DOI: 10.1017/s0033583508004708] [Citation(s) in RCA: 289] [Impact Index Per Article: 18.1] [Indexed: 11/06/2022]

Bahadur RP, Janin J. Residue conservation in viral capsid assembly. Proteins 2008;71:407-14. [PMID: 17957774 DOI: 10.1002/prot.21710] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 12/31/2022]

Codoñer FM, Fares MA. Why should we care about molecular coevolution? Evol Bioinform Online 2008;4:29-38. [PMID: 19204805 PMCID: PMC2614197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022] Open

Codoñer FM, Fares MA. Why Should We Care about Molecular Coevolution? Evol Bioinform Online 2008. [DOI: 10.1177/117693430800400003] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Indexed: 11/17/2022] Open

Dobson RJ, Munroe PB, Caulfield MJ, Saqi MAS. Predicting deleterious nsSNPs: an analysis of sequence and structural attributes. BMC Bioinformatics 2006;7:217. [PMID: 16630345 PMCID: PMC1489951 DOI: 10.1186/1471-2105-7-217] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.6] [Received: 12/19/2005] [Accepted: 04/21/2006] [Indexed: 11/10/2022] Open

Pugalenthi G, Bhaduri A, Sowdhamini R. GenDiS: Genomic Distribution of protein structural domain Superfamilies. Nucleic Acids Res 2005;33:D252-5. [PMID: 15608190 PMCID: PMC540041 DOI: 10.1093/nar/gki087] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Indexed: 11/23/2022] Open

Worthey EA, Myler PJ. Protozoan genomes: gene identification and annotation. Int J Parasitol 2005;35:495-512. [PMID: 15826642 DOI: 10.1016/j.ijpara.2005.02.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Received: 11/15/2004] [Revised: 01/25/2005] [Accepted: 02/06/2005] [Indexed: 12/01/2022]

Pazos F, Sternberg MJE. Automated prediction of protein function and detection of functional sites from structure. Proc Natl Acad Sci U S A 2004;101:14754-9. [PMID: 15456910 PMCID: PMC522026 DOI: 10.1073/pnas.0404569101] [Citation(s) in RCA: 139] [Impact Index Per Article: 7.0] [Received: 06/25/2004] [Indexed: 11/18/2022] Open

Emes RD, Beatson SA, Ponting CP, Goodstadt L. Evolution and comparative genomics of odorant- and pheromone-associated genes in rodents. Genome Res 2004;14:591-602. [PMID: 15060000 PMCID: PMC383303 DOI: 10.1101/gr.1940604] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.9] [Indexed: 11/24/2022]

Oberg KA, Ruysschaert JM, Goormaghtigh E. Rationally selected basis proteins: a new approach to selecting proteins for spectroscopic secondary structure analysis. Protein Sci 2003;12:2015-31. [PMID: 12931000 PMCID: PMC2323998 DOI: 10.1110/ps.0354703] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.7] [Indexed: 10/27/2022]

del Sol A, del Sol Mesa A, Pazos F, Valencia A. Automatic methods for predicting functionally important residues. J Mol Biol 2003;326:1289-302. [PMID: 12589769 DOI: 10.1016/s0022-2836(02)01451-1] [Citation(s) in RCA: 169] [Impact Index Per Article: 8.0] [Indexed: 11/18/2022]

Abstract

Sequence analysis is often the first guide for the prediction of residues in a protein family that may have functional significance. A few methods have been proposed which use the division of protein families into subfamilies in the search for those positions that could have some functional significance for the whole family, but at the same time which exhibit the specificity of each subfamily ("Tree-determinant residues"). However, there are still many unsolved questions like the best division of a protein family into subfamilies, or the accurate detection of sequence variation patterns characteristic of different subfamilies. Here we present a systematic study in a significant number of protein families, testing the statistical meaning of the Tree-determinant residues predicted by three different methods that represent the range of available approaches. The first method takes as a starting point a phylogenetic representation of a protein family and, following the principle of Relative Entropy from Information Theory, automatically searches for the optimal division of the family into subfamilies. The second method looks for positions whose mutational behavior is reminiscent of the mutational behavior of the full-length proteins, by directly comparing the corresponding distance matrices. The third method is an automation of the analysis of distribution of sequences and amino acid positions in the corresponding multidimensional spaces using a vector-based principal component analysis. These three methods have been tested on two non-redundant lists of protein families: one composed by proteins that bind a variety of ligand groups, and the other composed by proteins with annotated functionally relevant sites. In most cases, the residues predicted by the three methods show a clear tendency to be close to bound ligands of biological relevance and to those amino acids described as participants in key aspects of protein function. These three automatic methods provide a wide range of possibilities for biologists to analyze their families of interest, in a similar way to the one presented here for the family of proteins related with ras-p21.

Melo FR, Rigden DJ, Franco OL, Mello LV, Ary MB, Grossi de Sá MF, Bloch C. Inhibition of trypsin by cowpea thionin: characterization, molecular modeling, and docking. Proteins 2002;48:311-9. [PMID: 12112698 DOI: 10.1002/prot.10142] [Citation(s) in RCA: 95] [Impact Index Per Article: 4.3] [Indexed: 11/05/2022]

Pazos F, Valencia A. In silico two-hybrid system for the selection of physically interacting protein pairs. Proteins 2002;47:219-27. [PMID: 11933068 DOI: 10.1002/prot.10074] [Citation(s) in RCA: 183] [Impact Index Per Article: 8.3] [Indexed: 11/07/2022]

Mallika V, Bhaduri A, Sowdhamini R. PASS2: a semi-automated database of protein alignments organised as structural superfamilies. Nucleic Acids Res 2002;30:284-8. [PMID: 11752316 PMCID: PMC99156 DOI: 10.1093/nar/30.1.284] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Indexed: 11/12/2022] Open

Fariselli P, Olmea O, Valencia A, Casadio R. Prediction of contact maps with neural networks and correlated mutations. Protein Eng 2001;14:835-43. [PMID: 11742102 DOI: 10.1093/protein/14.11.835] [Citation(s) in RCA: 149] [Impact Index Per Article: 6.5] [Indexed: 11/12/2022]

Pazos F, Valencia A. Similarity of phylogenetic trees as indicator of protein-protein interaction. Protein Eng 2001;14:609-14. [PMID: 11707606 DOI: 10.1093/protein/14.9.609] [Citation(s) in RCA: 303] [Impact Index Per Article: 13.2] [Indexed: 02/06/2023]

Bonneau R, Strauss CE, Baker D. Improving the performance of Rosetta using multiple sequence alignment information and global measures of hydrophobic core formation. Proteins 2001;43:1-11. [PMID: 11170209 DOI: 10.1002/1097-0134(20010401)43:1<1::aid-prot1012>3.0.co;2-a] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.9] [Indexed: 11/06/2022]

Elcock AH, McCammon JA. Identification of protein oligomerization states by analysis of interface conservation. Proc Natl Acad Sci U S A 2001;98:2990-4. [PMID: 11248019 PMCID: PMC30594 DOI: 10.1073/pnas.061411798] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.3] [Indexed: 11/18/2022] Open

Mandel-Gutfreund Y, Zaremba SM, Gregoret LM. Contributions of residue pairing to beta-sheet formation: conservation and covariation of amino acid residue pairs on antiparallel beta-strands. J Mol Biol 2001;305:1145-59. [PMID: 11162120 DOI: 10.1006/jmbi.2000.4364] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.8] [Indexed: 11/22/2022]

Rychlewski L, Jaroszewski L, Li W, Godzik A. Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 2000;9:232-41. [PMID: 10716175 PMCID: PMC2144550 DOI: 10.1110/ps.9.2.232] [Citation(s) in RCA: 385] [Impact Index Per Article: 16.0] [Indexed: 10/19/2022]

Olmea O, Rost B, Valencia A. Effective use of sequence correlation and conservation in fold recognition. J Mol Biol 1999;293:1221-39. [PMID: 10547297 DOI: 10.1006/jmbi.1999.3208] [Citation(s) in RCA: 125] [Impact Index Per Article: 5.0] [Indexed: 11/22/2022]

Lebeda FJ, Olson MA. Prediction of a conserved, neutralizing epitope in ribosome-inactivating proteins. Int J Biol Macromol 1999;24:19-26. [PMID: 10077268 DOI: 10.1016/s0141-8130(98)00059-2] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.5] [Indexed: 11/20/2022]

Tsugita A, Kamo M, Miyazaki K, Takayama M, Kawakami T, Shen R, Nozawa T. Additional possible tools for identification of proteins on one- or two-dimensional electrophoresis. Electrophoresis 1998;19:928-38. [PMID: 9638939 DOI: 10.1002/elps.1150190608] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Indexed: 11/06/2022]

Pazos F, Helmer-Citterich M, Ausiello G, Valencia A. Correlated mutations contain information about protein-protein interaction. J Mol Biol 1997;271:511-23. [PMID: 9281423 DOI: 10.1006/jmbi.1997.1198] [Citation(s) in RCA: 345] [Impact Index Per Article: 12.8] [Indexed: 02/05/2023]

Olmea O, Valencia A. Improving contact predictions by the combination of correlated mutations and other sources of sequence information. Fold Des 1997;2:S25-32. [PMID: 9218963 DOI: 10.1016/s1359-0278(97)00060-6] [Citation(s) in RCA: 157] [Impact Index Per Article: 5.8] [Indexed: 02/04/2023]

Pearson BM, Hernando Y, Payne J, Wolf SS, Kalogeropoulos A, Schweizer M. Sequencing of a 35·71 kb DNA segment on the right arm of yeast chromosome XV reveals regions of similarity to chromosomes I and XIII. Yeast 1996. [DOI: 10.1002/(sici)1097-0061(199609)12:10b<1021::aid-yea981>3.0.co;2-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 11/10/2022] Open

Pearson BM, Hernando Y, Payne J, Wolf SS, Kalogeropoulos A, Schweizer M. Sequencing of a 35.71 kb DNA segment on the right arm of yeast chromosome XV reveals regions of similarity to chromosomes I and XIII. Yeast 1996;12:1021-31. [PMID: 8896266 DOI: 10.1002/(sici)1097-0061(199609)12:10b<1021::aid-yea981>3.0.co;2-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 02/02/2023] Open

Casari G, Sander C, Valencia A. A method to predict functional residues in proteins. Nat Struct Biol 1995;2:171-8. [PMID: 7749921 DOI: 10.1038/nsb0295-171] [Citation(s) in RCA: 294] [Impact Index Per Article: 10.1] [Indexed: 01/26/2023]

Voss H, Tamames J, Teodoru C, Valencia A, Sensen C, Wiemann S, Schwager C, Zimmermann J, Sander C, Ansorge W. Nucleotide sequence and analysis of the centromeric region of yeast chromosome IX. Yeast 1995;11:61-78. [PMID: 7762303 DOI: 10.1002/yea.320110109] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.8] [Indexed: 01/27/2023] Open

Rost B, Sander C. Conservation and prediction of solvent accessibility in protein families. Proteins 1994;20:216-26. [PMID: 7892171 DOI: 10.1002/prot.340200303] [Citation(s) in RCA: 428] [Impact Index Per Article: 14.3] [Indexed: 01/27/2023]

Abstract

Currently, the prediction of three-dimensional (3D) protein structure from sequence alone is an exceedingly difficult task. As an intermediate step, a much simpler task has been pursued extensively: predicting 1D strings of secondary structure. Here, we present an analysis of another 1D projection from 3D structure: the relative solvent accessibility of each residue. We show that solvent accessibility is less conserved in 3D homologues than is secondary structure, and hence is predicted less accurately from automatic homology modeling; the correlation coefficient of relative solvent accessibility between 3D homologues is only 0.77, and the average accuracy of predictions based on sequence alignments is only 0.68. The latter number provides an effective upper limit on the accuracy of predicting accessibility from sequence when homology modeling is not possible. We introduce a neural network system that predicts relative solvent accessibility (projected onto ten discrete states) using evolutionary profiles of amino acid substitutions derived from multiple sequence alignments. Evaluated in a cross-validation test on 238 unique proteins, the correlation between predicted and observed relative accessibility is 0.54. Interpreted in terms of a three-state (buried, intermediate, exposed) description of relative accessibility, the fraction of correctly predicted residue states is about 58%. In absolute terms this accuracy appears poor, but given the relatively low conservation of accessibility in 3D families, the network system is not far from its likely optimal performance. The most reliably predicted fraction of the residues (50%) is predicted as accurately as by automatic homology modeling. Prediction is best for buried residues, e.g., 86% of the completely buried sites are correctly predicted as having 0% relative accessibility.

Emmert DB, Stoehr PJ, Stoesser G, Cameron GN. The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res 1994;22:3445-9. [PMID: 7937043 PMCID: PMC308299 DOI: 10.1093/nar/22.17.3445] [Citation(s) in RCA: 78] [Impact Index Per Article: 2.6] [Indexed: 01/27/2023] Open

Rost B, Sander C. Structure prediction of proteins--where are we now? Curr Opin Biotechnol 1994;5:372-80. [PMID: 7765169 DOI: 10.1016/0958-1669(94)90045-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Indexed: 01/27/2023]

Bork P, Ouzounis C, Sander C. From genome sequences to protein function. Curr Opin Struct Biol 1994. [DOI: 10.1016/s0959-440x(94)90109-0] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.3] [Indexed: 11/29/2022]

Rost B, Sander C. Combining evolutionary information and neural networks to predict protein secondary structure. Proteins 1994;19:55-72. [PMID: 8066087 DOI: 10.1002/prot.340190108] [Citation(s) in RCA: 1157] [Impact Index Per Article: 38.6] [Indexed: 01/28/2023]

Abstract

Using evolutionary information contained in multiple sequence alignments as input to neural networks, secondary structure can be predicted at significantly increased accuracy. Here, we extend our previous three-level system of neural networks by using additional input information derived from multiple alignments. Using a position-specific conservation weight as part of the input increases performance. Using the number of insertions and deletions reduces the tendency for overprediction and increases overall accuracy. Addition of the global amino acid content yields a further improvement, mainly in predicting structural class. The final network system has sustained overall accuracy of 71.6% in a multiple cross-validation test on 126 unique protein chains. A test on a new set of 124 recently solved protein structures that have no significant sequence similarity to the learning set confirms the high level of accuracy. The average cross-validated accuracy for all 250 sequence-unique chains is above 72%. Using various data sets, the method is compared to alternative prediction methods, some of which also use multiple alignments: the performance advantage of the network system is at least 6 percentage points in three-state accuracy. In addition, the network estimates secondary structure content from multiple sequence alignments about as well as circular dichroism spectroscopy on a single protein and classifies 75% of the 250 proteins correctly into one of four protein structural classes. Of particular practical importance is the definition of a position-specific reliability index. For 40% of all residues the method has a sustained three-state accuracy of 88%, as high as the overall average for homology modelling. A further strength of the method is greatly increased accuracy in predicting the placement of secondary structure segments.

Taylor WR. Remotely related sequences and structures: analysis and predictive modelling. Trends Biotechnol 1994;12:154-8. [PMID: 7764896 DOI: 10.1016/0167-7799(94)90075-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 01/27/2023]