Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pei J, Grishin NV. MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information. Nucleic Acids Res 2006;34:4364-74. [PMID: 16936316 PMCID: PMC1636350 DOI: 10.1093/nar/gkl514] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Indexed: 11/14/2022] Open

Cited by Other Article(s)

Pei J, Kim BH, Grishin NV. PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res 2008;36:2295-300. [PMID: 18287115 PMCID: PMC2367709 DOI: 10.1093/nar/gkn072] [Citation(s) in RCA: 1038] [Impact Index Per Article: 64.9] [Indexed: 11/17/2022] Open

Kaminska KH, Baraniak U, Boniecki M, Nowaczyk K, Czerwoniec A, Bujnicki JM. Structural bioinformatics analysis of enzymes involved in the biosynthesis pathway of the hypermodified nucleoside ms(2)io(6)A37 in tRNA. Proteins 2008;70:1-18. [PMID: 17910062 DOI: 10.1002/prot.21640] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Indexed: 11/11/2022]

Abstract

tRNAs from all organisms contain posttranscriptionally modified nucleosides, which are derived from the four canonical nucleosides. In most tRNAs that read codons beginning with U, adenosine in the position 37 adjacent to the 3' position of the anticodon is modified to N(6)-(Delta(2)-isopentenyl) adenosine (i(6)A). In many bacteria, such as Escherichia coli, this residue is typically hypermodified to N(6)-isopentenyl-2-thiomethyladenosine (ms(2)i(6)A). In a few bacteria, such as Salmonella typhimurium, ms(2)i(6)A can be further hydroxylated to N(6)-(cis-4-hydroxyisopentenyl)-2-thiomethyladenosine (ms(2)io(6)A). Although the enzymes that introduce the respective modifications (prenyltransferase MiaA, methylthiotransferase MiaB, and hydroxylase MiaE) have been identified, their structures remain unknown and sequence-function relationships remain obscure. We carried out sequence analysis and structure prediction of MiaA, MiaB, and MiaE, using the protein fold-recognition approach. Three-dimensional models of all three proteins were then built using a new modeling protocol designed to overcome uncertainties in the alignments and divergence between the templates. For MiaA and MiaB, the catalytic core was built based on the templates from the P-loop NTPase and Radical-SAM superfamilies, respectively. For MiaB, we have also modeled the C-terminal TRAM domain and the newly predicted N-terminal flavodoxin-fold domain. For MiaE, we confidently predict that it shares the three-dimensional fold with the ferritin-like four-helix bundle proteins and that it has a similar active site and mechanism of action to diiron carboxylate enzymes, in particular, methane monooxygenase (E.C.1.14.13.25) that catalyses the biological hydroxylation of alkanes. Our models provide the first structural platform for enzymes involved in the biosynthesis of i(6)A, ms(2)i(6)A, and ms(2)io(6)A, explain the data available from the literature and will help to design further experiments and interpret their results.

Do CB, Katoh K. Protein multiple sequence alignment. Methods Mol Biol 2008;484:379-413. [PMID: 18592193 DOI: 10.1007/978-1-59745-398-1_25] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Indexed: 05/26/2023]

Notredame C. Recent evolutions of multiple sequence alignment algorithms. PLoS Comput Biol 2007;3:e123. [PMID: 17784778 PMCID: PMC1963500 DOI: 10.1371/journal.pcbi.0030123] [Citation(s) in RCA: 153] [Impact Index Per Article: 9.0] [Indexed: 11/25/2022] Open

Maravić Vlahovicek G, Cubrilo S, Tkaczuk KL, Bujnicki JM. Modeling and experimental analyses reveal a two-domain structure and amino acids important for the activity of aminoglycoside resistance methyltransferase Sgm. Biochimica et Biophysica Acta - Proteins and Proteomics 2007;1784:582-90. [PMID: 18343347 DOI: 10.1016/j.bbapap.2007.09.009] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Received: 07/31/2007] [Revised: 09/18/2007] [Accepted: 09/19/2007] [Indexed: 12/19/2022]

Qi Y, Sadreyev RI, Wang Y, Kim BH, Grishin NV. A comprehensive system for evaluation of remote sequence similarity detection. BMC Bioinformatics 2007;8:314. [PMID: 17725841 PMCID: PMC2031906 DOI: 10.1186/1471-2105-8-314] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Received: 04/04/2007] [Accepted: 08/28/2007] [Indexed: 11/25/2022] Open

Abstract

Background

Accurate and sensitive performance evaluation is crucial for both effective development of better structure prediction methods based on sequence similarity, and for the comparative analysis of existing methods. Up to date, there has been no satisfactory comprehensive evaluation method that (i) is based on a large and statistically unbiased set of proteins with clearly defined relationships; and (ii) covers all performance aspects of sequence-based structure predictors, such as sensitivity and specificity, alignment accuracy and coverage, and structure template quality.

Results

With the aim of designing such a method, we (i) select a statistically balanced set of divergent protein domains from SCOP, and define similarity relationships for the majority of these domains by complementing the best of information available in SCOP with a rigorous SVM-based algorithm; and (ii) develop protocols for the assessment of similarity detection and alignment quality from several complementary perspectives. The evaluation of similarity detection is based on ROC-like curves and includes several complementary approaches to the definition of true/false positives. Reference-dependent approaches use the 'gold standard' of pre-defined domain relationships and structure-based alignments. Reference-independent approaches assess the quality of structural match predicted by the sequence alignment, with respect to the whole domain length (global mode) or to the aligned region only (local mode). Similarly, the evaluation of alignment quality includes several reference-dependent and -independent measures, in global and local modes. As an illustration, we use our benchmark to compare the performance of several methods for the detection of remote sequence similarities, and show that different aspects of evaluation reveal different properties of the evaluated methods, highlighting their advantages, weaknesses, and potential for further development.

Conclusion

The presented benchmark provides a new tool for a statistically unbiased assessment of methods for remote sequence similarity detection, from various complementary perspectives. This tool should be useful both for users choosing the best method for a given purpose, and for developers designing new, more powerful methods. The benchmark set, reference alignments, and evaluation codes can be downloaded from .

Pei J, Kim BH, Tang M, Grishin NV. PROMALS web server for accurate multiple protein sequence alignments. Nucleic Acids Res 2007;35:W649-52. [PMID: 17452345 PMCID: PMC1933189 DOI: 10.1093/nar/gkm227] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Indexed: 11/13/2022] Open

Structural and evolutionary bioinformatics of the SPOUT superfamily of methyltransferases. BMC Bioinformatics 2007;8:73. [PMID: 17338813 PMCID: PMC1829167 DOI: 10.1186/1471-2105-8-73] [Citation(s) in RCA: 128] [Impact Index Per Article: 7.5] [Received: 11/20/2006] [Accepted: 03/05/2007] [Indexed: 11/29/2022] Open

Abstract

Background

SPOUT methyltransferases (MTases) are a large class of S-adenosyl-L-methionine-dependent enzymes that exhibit an unusual alpha/beta fold with a very deep topological knot. In 2001, when no crystal structures were available for any of these proteins, Anantharaman, Koonin, and Aravind identified homology between SpoU and TrmD MTases and defined the SPOUT superfamily. Since then, multiple crystal structures of knotted MTases have been solved and numerous new homologous sequences appeared in the databases. However, no comprehensive comparative analysis of these proteins has been carried out to classify them based on structural and evolutionary criteria and to guide functional predictions.

Results

We carried out extensive searches of databases of protein structures and sequences to collect all members of previously identified SPOUT MTases, and to identify previously unknown homologs. Based on sequence clustering, characterization of domain architecture, structure predictions and sequence/structure comparisons, we re-defined families within the SPOUT superfamily and predicted putative active sites and biochemical functions for the so far uncharacterized members. We have also delineated the common core of SPOUT MTases and inferred a multiple sequence alignment for the conserved knot region, from which we calculated the phylogenetic tree of the superfamily. We have also studied phylogenetic distribution of different families, and used this information to infer the evolutionary history of the SPOUT superfamily.

Conclusion

We present the first phylogenetic tree of the SPOUT superfamily since it was defined, together with a new scheme for its classification, and discussion about conservation of sequence and structure in different families, and their functional implications. We identified four protein families as new members of the SPOUT superfamily. Three of these families are functionally uncharacterized (COG1772, COG1901, and COG4080), and one (COG1756 represented by Nep1p) has been already implicated in RNA metabolism, but its biochemical function has been unknown. Based on the inference of orthologous and paralogous relationships between all SPOUT families we propose that the Last Universal Common Ancestor (LUCA) of all extant organisms contained at least three SPOUT members, ancestors of contemporary RNA MTases that carry out m1G, m3U, and 2'-O-ribose methylation, respectively. In this work we also speculate on the origin of the knot and propose possible 'unknotted' ancestors. The results of our analysis provide a comprehensive 'roadmap' for experimental characterization of SPOUT MTases and interpretation of functional studies in the light of sequence-structure relationships.

Pei J, Grishin NV. PROMALS: towards accurate multiple sequence alignments of distantly related proteins. Bioinformatics 2007;23:802-8. [PMID: 17267437 DOI: 10.1093/bioinformatics/btm017] [Citation(s) in RCA: 230] [Impact Index Per Article: 13.5] [Indexed: 11/13/2022]