Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Jaroszewski L, Rychlewski L, Godzik A. Improving the quality of twilight-zone alignments. Protein Sci 2000;9:1487-96. [PMID: 10975570 PMCID: PMC2144727 DOI: 10.1110/ps.9.8.1487] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

For:	Jaroszewski L, Rychlewski L, Godzik A. Improving the quality of twilight-zone alignments. Protein Sci 2000;9:1487-96. [PMID: 10975570 PMCID: PMC2144727 DOI: 10.1110/ps.9.8.1487] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Number

Cited by Other Article(s)

Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, Pouriyeh S, Dai Z, Chen J, Xie CY. AI-Driven Deep Learning Techniques in Protein Structure Prediction. Int J Mol Sci 2024;25:8426. [PMID: 39125995 PMCID: PMC11313475 DOI: 10.3390/ijms25158426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 07/29/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024] Open

Abstract

Protein structure prediction is important for understanding their function and behavior. This review study presents a comprehensive review of the computational models used in predicting protein structure. It covers the progression from established protein modeling to state-of-the-art artificial intelligence (AI) frameworks. The paper will start with a brief introduction to protein structures, protein modeling, and AI. The section on established protein modeling will discuss homology modeling, ab initio modeling, and threading. The next section is deep learning-based models. It introduces some state-of-the-art AI models, such as AlphaFold (AlphaFold, AlphaFold2, AlphaFold3), RoseTTAFold, ProteinBERT, etc. This section also discusses how AI techniques have been integrated into established frameworks like Swiss-Model, Rosetta, and I-TASSER. The model performance is compared using the rankings of CASP14 (Critical Assessment of Structure Prediction) and CASP15. CASP16 is ongoing, and its results are not included in this review. Continuous Automated Model EvaluatiOn (CAMEO) complements the biennial CASP experiment. Template modeling score (TM-score), global distance test total score (GDT_TS), and Local Distance Difference Test (lDDT) score are discussed too. This paper then acknowledges the ongoing difficulties in predicting protein structure and emphasizes the necessity of additional searches like dynamic protein behavior, conformational changes, and protein-protein interactions. In the application section, this paper introduces some applications in various fields like drug design, industry, education, and novel protein development. In summary, this paper provides a comprehensive overview of the latest advancements in established protein modeling and deep learning-based models for protein structure predictions. It emphasizes the significant advancements achieved by AI and identifies potential areas for further investigation.

Collapse

Heinzinger M, Littmann M, Sillitoe I, Bordin N, Orengo C, Rost B. Contrastive learning on protein embeddings enlightens midnight zone. NAR Genom Bioinform 2022;4:lqac043. [PMID: 35702380 PMCID: PMC9188115 DOI: 10.1093/nargab/lqac043] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 03/25/2022] [Accepted: 05/17/2022] [Indexed: 12/23/2022] Open

Rajapaksa S, Sumanaweera D, Lesk AM, Allison L, Stuckey PJ, Garcia de la Banda M, Abramson D, Konagurthu AS. OUP accepted manuscript. Bioinformatics 2022;38:i255-i263. [PMID: 35758808 PMCID: PMC9235515 DOI: 10.1093/bioinformatics/btac247] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/09/2022] [Indexed: 11/13/2022] Open

Abbass J, Nebel JC. Rosetta and the Journey to Predict Proteins’ Structures, 20 Years on. Curr Bioinform 2020. [DOI: 10.2174/1574893615999200504103643] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Conway JM, Crosby JR, Hren AP, Southerland RT, Lee LL, Lunin VV, Alahuhta P, Himmel ME, Bomble YJ, Adams MWW, Kelly RM. Novel multidomain, multifunctional glycoside hydrolases from highly lignocellulolytic Caldicellulosiruptor species. AIChE J 2018. [DOI: 10.1002/aic.16354] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

A selective class of inhibitors for the CLC-Ka chloride ion channel. Proc Natl Acad Sci U S A 2018;115:E4900-E4909. [PMID: 29669921 DOI: 10.1073/pnas.1720584115] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Structural characterization of ANGPTL8 (betatrophin) with its interacting partner lipoprotein lipase. Comput Biol Chem 2016;61:210-20. [PMID: 26908254 DOI: 10.1016/j.compbiolchem.2016.01.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 01/07/2016] [Accepted: 01/21/2016] [Indexed: 12/20/2022]

Abstract

Angiopoietin-like protein 8 (ANGPTL8) (also known as betatrophin) is a newly identified secretory protein with a potential role in autophagy, lipid metabolism and pancreatic beta-cell proliferation. Its structural characterization is required to enhance our current understanding of its mechanism of action which could help in identifying its receptor and/or other binding partners. Based on the physiological significance and necessity of exploring structural features of ANGPTL8, the present study is conducted with a specific aim to model the structure of ANGPTL8 and study its possible interactions with Lipoprotein Lipase (LPL). To the best of our knowledge, this is the first attempt to predict 3-dimensional (3D) structure of ANGPTL8. Three different approaches were used for modeling of ANGPTL8 including homology modeling, de-novo structure prediction and their amalgam which is then proceeded by structure verification using ERRATT, PROSA, Qmean and Ramachandran plot scores. The selected models of ANGPTL8 were further evaluated for protein-protein interaction (PPI) analysis with LPL using CPORT and HADDOCK server. Our results have shown that the crystal structure of iSH2 domain of Phosphatidylinositol 3-kinase (PI3K) p85β subunit (PDB entry: 3mtt) is a good candidate for homology modeling of ANGPTL8. Analysis of inter-molecular interactions between the structure of ANGPTL8 and LPL revealed existence of several non-covalent interactions. The residues of LPL involved in these interactions belong from its lid region, thrombospondin (TSP) region and heparin binding site which is suggestive of a possible role of ANGPTL8 in regulating the proteolysis, motility and localization of LPL. Besides, the conserved residues of SE1 region of ANGPTL8 formed interactions with the residues around the hinge region of LPL. Overall, our results support a model of inhibition of LPL by ANGPTL8 through the steric block of its catalytic site which will be further explored using wet lab studies in future.

Collapse

Shabelnikov S, Kiselev A. Cysteine-Rich Atrial Secretory Protein from the Snail Achatina achatina: Purification and Structural Characterization. PLoS One 2015;10:e0138787. [PMID: 26444993 PMCID: PMC4596865 DOI: 10.1371/journal.pone.0138787] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 09/03/2015] [Indexed: 11/28/2022] Open

PvdP is a tyrosinase that drives maturation of the pyoverdine chromophore in Pseudomonas aeruginosa. J Bacteriol 2014;196:2681-90. [PMID: 24816606 DOI: 10.1128/jb.01376-13] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Fiser A. Protein structure modeling in the proteomics era. Expert Rev Proteomics 2014;1:97-110. [PMID: 15966803 DOI: 10.1586/14789450.1.1.97] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Xu D, Jaroszewski L, Li Z, Godzik A. FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking. ACTA ACUST UNITED AC 2013;30:660-7. [PMID: 24130308 DOI: 10.1093/bioinformatics/btt578] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Mishra S, Saxena A, Sangwan RS. Fundamentals of Homology Modeling Steps and Comparison among Important Bioinformatics Tools: An Overview. ACTA ACUST UNITED AC 2013. [DOI: 10.17311/sciintl.2013.237.252] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

A novel predicted calcium-regulated kinase family implicated in neurological disorders. PLoS One 2013;8:e66427. [PMID: 23840464 PMCID: PMC3696010 DOI: 10.1371/journal.pone.0066427] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Accepted: 05/08/2013] [Indexed: 12/03/2022] Open

Dhingra P, Jayaram B. A homology/ab initio hybrid algorithm for sampling near-native protein conformations. J Comput Chem 2013;34:1925-36. [PMID: 23728619 DOI: 10.1002/jcc.23339] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2012] [Revised: 03/09/2013] [Accepted: 04/21/2013] [Indexed: 12/19/2022]

Vishnepolsky B, Managadze G, Grigolava M, Pirtskhalava M. Evaluation performance of substitution matrices, based on contacts between residue terminal groups. J Biomol Struct Dyn 2012;30:180-90. [PMID: 22702729 DOI: 10.1080/07391102.2012.677769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Zhou H, Skolnick J. FINDSITE(X): a structure-based, small molecule virtual screening approach with application to all identified human GPCRs. Mol Pharm 2012;9:1775-84. [PMID: 22574683 DOI: 10.1021/mp3000716] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Abstract

We have developed FINDSITE(X), an extension of FINDSITE, a protein threading based algorithm for the inference of protein binding sites, biochemical function and virtual ligand screening, that removes the limitation that holo protein structures (those containing bound ligands) of a sufficiently large set of distant evolutionarily related proteins to the target be solved; rather, predicted protein structures and experimental ligand binding information are employed. To provide the predicted protein structures, a fast and accurate version of our recently developed TASSER(VMT), TASSER(VMT)-lite, for template-based protein structural modeling applicable up to 1000 residues is developed and tested, with comparable performance to the top CASP9 servers. Then, a hybrid approach that combines structure alignments with an evolutionary similarity score for identifying functional relationships between target and proteins with binding data has been developed. By way of illustration, FINDSITE(X) is applied to 998 identified human G-protein coupled receptors (GPCRs). First, TASSER(VMT)-lite provides updates of all human GPCR structures previously modeled in our lab. We then use these structures and the new function similarity detection algorithm to screen all human GPCRs against the ZINC8 nonredundant (TC < 0.7) ligand set combined with ligands from the GLIDA database (a total of 88,949 compounds). Testing (excluding GPCRs whose sequence identity > 30% to the target from the binding data library) on a 168 human GPCR set with known binding data, the average enrichment factor in the top 1% of the compound library (EF(0.01)) is 22.7, whereas EF(0.01) by FINDSITE is 7.1. For virtual screening when just the target and its native ligands are excluded, the average EF(0.01) reaches 41.4. We also analyze off-target interactions for the 168 protein test set. All predicted structures, virtual screening data and off-target interactions for the 998 human GPCRs are available at http://cssb.biology.gatech.edu/skolnick/webservice/gpcr/index.html .

Collapse

Pentony MM, Winters P, Penfold-Brown D, Drew K, Narechania A, DeSalle R, Bonneau R, Purugganan MD. The plant proteome folding project: structure and positive selection in plant protein families. Genome Biol Evol 2012;4:360-71. [PMID: 22345424 PMCID: PMC3318447 DOI: 10.1093/gbe/evs015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Dudkiewicz M, Szczepińska T, Grynberg M, Pawłowski K. A novel protein kinase-like domain in a selenoprotein, widespread in the tree of life. PLoS One 2012;7:e32138. [PMID: 22359664 PMCID: PMC3281104 DOI: 10.1371/journal.pone.0032138] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Accepted: 01/24/2012] [Indexed: 12/21/2022] Open

Abstract

Selenoproteins serve important functions in many organisms, usually providing essential oxidoreductase enzymatic activity, often for defense against toxic xenobiotic substances. Most eukaryotic genomes possess a small number of these proteins, usually not more than 20. Selenoproteins belong to various structural classes, often related to oxidoreductase function, yet a few of them are completely uncharacterised.

Here, the structural and functional prediction for the uncharacterised selenoprotein O (SELO) is presented. Using bioinformatics tools, we predict that SELO protein adopts a three-dimensional fold similar to protein kinases. Furthermore, we argue that despite the lack of conservation of the “classic” catalytic aspartate residue of the archetypical His-Arg-Asp motif, SELO kinases might have retained catalytic phosphotransferase activity, albeit with an atypical active site. Lastly, the role of the selenocysteine residue is considered and the possibility of an oxidoreductase-regulated kinase function for SELO is discussed.

The novel kinase prediction is discussed in the context of functional data on SELO orthologues in model organisms, FMP40 a.k.a.YPL222W (yeast), and ydiU (bacteria). Expression data from bacteria and yeast suggest a role in oxidative stress response. Analysis of genomic neighbourhoods of SELO homologues in the three domains of life points toward a role in regulation of ABC transport, in oxidative stress response, or in basic metabolism regulation. Among bacteria possessing SELO homologues, there is a significant over-representation of aquatic organisms, also of aerobic ones. The selenocysteine residue in SELO proteins occurs only in few members of this protein family, including proteins from Metazoa, and few small eukaryotes (Ostreococcus, stramenopiles). It is also demonstrated that enterobacterial mchC proteins involved in maturation of bactericidal antibiotics, microcins, form a distant subfamily of the SELO proteins.

The new protein structural domain, with a putative kinase function assigned, expands the known kinome and deserves experimental determination of its biological role within the cell-signaling network.

Collapse

Structural correlates of selectivity and inactivation in potassium channels. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2011;1818:272-85. [PMID: 21958666 DOI: 10.1016/j.bbamem.2011.09.007] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2011] [Revised: 09/07/2011] [Accepted: 09/09/2011] [Indexed: 12/23/2022]

Cai XH, Jaroszewski L, Wooley J, Godzik A. Internal organization of large protein families: relationship between the sequence, structure, and function-based clustering. Proteins 2011;79:2389-402. [PMID: 21671455 DOI: 10.1002/prot.23049] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2010] [Revised: 02/12/2011] [Accepted: 03/13/2011] [Indexed: 12/14/2022]

Han GW, Elsliger MA, Yeates TO, Xu Q, Murzin AG, Krishna SS, Jaroszewski L, Abdubek P, Astakhova T, Axelrod HL, Carlton D, Chen C, Chiu HJ, Clayton T, Das D, Deller MC, Duan L, Ernst D, Feuerhelm J, Grant JC, Grzechnik A, Jin KK, Johnson HA, Klock HE, Knuth MW, Kozbial P, Kumar A, Lam WW, Marciano D, McMullan D, Miller MD, Morse AT, Nigoghossian E, Okach L, Reyes R, Rife CL, Sefcovic N, Tien HJ, Trame CB, van den Bedem H, Weekes D, Hodgson KO, Wooley J, Deacon AM, Godzik A, Lesley SA, Wilson IA. Structure of a putative NTP pyrophosphohydrolase: YP_001813558.1 from Exiguobacterium sibiricum 255-15. Acta Crystallogr Sect F Struct Biol Cryst Commun 2010;66:1237-44. [PMID: 20944217 PMCID: PMC2954211 DOI: 10.1107/s1744309110025534] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2010] [Accepted: 06/29/2010] [Indexed: 11/24/2022]

Affiliation(s)

Gye Won Han Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Marc-André Elsliger Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Todd O. Yeates Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, CA, USA
Qingping Xu Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Alexey G. Murzin MRC Laboratory of Molecular Biology, Hills Road, Cambridge, England
S. Sri Krishna Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Lukasz Jaroszewski Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Polat Abdubek Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Tamara Astakhova Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA
Herbert L. Axelrod Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Dennis Carlton Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Connie Chen Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Hsiu-Ju Chiu Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Thomas Clayton Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Debanu Das Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Marc C. Deller Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Lian Duan Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA
Dustin Ernst Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Julie Feuerhelm Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Joanna C. Grant Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Anna Grzechnik Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Kevin K. Jin Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Hope A. Johnson Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Heath E. Klock Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Mark W. Knuth Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Piotr Kozbial Joint Center for Structural Genomics, http://www.jcsg.org, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Abhinav Kumar Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Winnie W. Lam Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
David Marciano Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Daniel McMullan Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Mitchell D. Miller Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Andrew T. Morse Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA
Edward Nigoghossian Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Linda Okach Joint Center for Structural Genomics, http://www.jcsg.org, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Ron Reyes Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Christopher L. Rife Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Natasha Sefcovic Joint Center for Structural Genomics, http://www.jcsg.org, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Henry J. Tien Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA
Christine B. Trame Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Henry van den Bedem Joint Center for Structural Genomics, http://www.jcsg.org, USA Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
Dana Weekes Joint Center for Structural Genomics, http://www.jcsg.org, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Keith O. Hodgson Joint Center for Structural Genomics, http://www.jcsg.org, USA Photon Science, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
John Wooley Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA
Ashley M. Deacon Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA
Adam Godzik Joint Center for Structural Genomics, http://www.jcsg.org, USA Center for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA Program on Bioinformatics and Systems Biology, Sanford–Burnham Medical Research Institute, La Jolla, CA, USA
Scott A. Lesley Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA Protein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA
Ian A. Wilson Joint Center for Structural Genomics, http://www.jcsg.org, USA Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA

Collapse

Nielsen M, Lundegaard C, Lund O, Petersen TN. CPHmodels-3.0--remote homology modeling using structure-guided sequence profiles. Nucleic Acids Res 2010;38:W576-81. [PMID: 20542909 PMCID: PMC2896139 DOI: 10.1093/nar/gkq535] [Citation(s) in RCA: 235] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Fiser A. Template-based protein structure modeling. Methods Mol Biol 2010;673:73-94. [PMID: 20835794 DOI: 10.1007/978-1-60761-842-3_6] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Protein structure prediction based on sequence similarity. Methods Mol Biol 2009;569:129-56. [PMID: 19623489 DOI: 10.1007/978-1-59745-524-4_7] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]

Zhou H, Skolnick J. Protein structure prediction by pro-Sp3-TASSER. Biophys J 2009;96:2119-27. [PMID: 19289038 DOI: 10.1016/j.bpj.2008.12.3898] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2008] [Revised: 11/12/2008] [Accepted: 12/03/2008] [Indexed: 12/29/2022] Open

Abstract

An automated protein structure prediction algorithm, pro-sp3-Threading/ASSEmbly/Refinement (TASSER), is described and benchmarked. Structural templates are identified using five different scoring functions derived from the previously developed threading methods PROSPECTOR_3 and SP(3). Top templates identified by each scoring function are combined to derive contact and distant restraints for subsequent model refinement by short TASSER simulations. For Medium/Hard targets (those with moderate to poor quality templates and/or alignments), alternative template alignments are also generated by parametric alignment and the top models selected by TASSER-QA are included in the contact and distance restraint derivation. Then, multiple short TASSER simulations are used to generate an ensemble of full-length models. Subsequently, the top models are selected from the ensemble by TASSER-QA and used to derive TASSER contacts and distant restraints for another round of full TASSER refinement. The final models are selected from both rounds of TASSER simulations by TASSER-QA. We compare pro-sp3-TASSER with our previously developed MetaTASSER method (enhanced with chunk-TASSER for Medium/Hard targets) on a representative test data set of 723 proteins <250 residues in length. For the 348 proteins classified as easy targets (those templates with good alignments and global structure similarity to the target), the cumulative TM-score of the best of top five models by pro-sp3-TASSER shows a 2.1% improvement over MetaTASSER. For the 155/220 medium/hard targets, the improvements in TM-score are 2.8% and 2.2%, respectively. All improvements are statistically significant. More importantly, the number of foldable targets (those having models whose TM-score to native >0.4 in the top five clusters) increases from 472 to 497 for all targets, and the relative increases for medium and hard targets are 10% and 15%, respectively. A server that implements the above algorithm is available at http://cssb.biology.gatech.edu/skolnick/webservice/pro-sp3-TASSER/. The source code is also available upon request.

Collapse

Zhu J, Fan H, Periole X, Honig B, Mark AE. Refining homology models by combining replica-exchange molecular dynamics and statistical potentials. Proteins 2009;72:1171-88. [PMID: 18338384 DOI: 10.1002/prot.22005] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Pacheco B, Maccarana M, Goodlett DR, Malmström A, Malmström L. Identification of the active site of DS-epimerase 1 and requirement of N-glycosylation for enzyme function. J Biol Chem 2008;284:1741-7. [PMID: 19004833 DOI: 10.1074/jbc.m805479200] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Arnon TI, Kaiser JT, West AP, Olson R, Diskin R, Viertlboeck BC, Göbel TW, Bjorkman PJ. The crystal structure of CHIR-AB1: a primordial avian classical Fc receptor. J Mol Biol 2008;381:1012-24. [PMID: 18625238 DOI: 10.1016/j.jmb.2008.06.082] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2008] [Revised: 06/25/2008] [Accepted: 06/26/2008] [Indexed: 01/22/2023]

Zhou H, Pandit SB, Lee SY, Borreguero J, Chen H, Wroblewska L, Skolnick J. Analysis of TASSER-based CASP7 protein structure prediction results. Proteins 2008;69 Suppl 8:90-7. [PMID: 17705276 DOI: 10.1002/prot.21649] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Axelrod HL, McMullan D, Krishna SS, Miller MD, Elsliger MA, Abdubek P, Ambing E, Astakhova T, Carlton D, Chiu HJ, Clayton T, Duan L, Feuerhelm J, Grzechnik SK, Hale J, Han GW, Haugen J, Jaroszewski L, Jin KK, Klock HE, Knuth MW, Koesema E, Morse AT, Nigoghossian E, Okach L, Oommachen S, Paulsen J, Quijano K, Reyes R, Rife CL, van den Bedem H, Weekes D, White A, Wolf G, Xu Q, Hodgson KO, Wooley J, Deacon AM, Godzik A, Lesley SA, Wilson IA. Crystal structure of AICAR transformylase IMP cyclohydrolase (TM1249) fromThermotoga maritima at 1.88 Å resolution. Proteins 2008;71:1042-9. [DOI: 10.1002/prot.21967] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Sterner B, Singh R, Berger B. Predicting and annotating catalytic residues: an information theoretic approach. J Comput Biol 2007;14:1058-73. [PMID: 17887954 DOI: 10.1089/cmb.2007.0042] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Abstract

We introduce a computational method to predict and annotate the catalytic residues of a protein using only its sequence information, so that we describe both the residues' sequence locations (prediction) and their specific biochemical roles in the catalyzed reaction (annotation). While knowing the chemistry of an enzyme's catalytic residues is essential to understanding its function, the challenges of prediction and annotation have remained difficult, especially when only the enzyme's sequence and no homologous structures are available. Our sequence-based approach follows the guiding principle that catalytic residues performing the same biochemical function should have similar chemical environments; it detects specific conservation patterns near in sequence to known catalytic residues and accordingly constrains what combination of amino acids can be present near a predicted catalytic residue. We associate with each catalytic residue a short sequence profile and define a Kullback-Leibler (KL) distance measure between these profiles, which, as we show, effectively captures even subtle biochemical variations. We apply the method to the class of glycohydrolase enzymes. This class includes proteins from 96 families with very different sequences and folds, many of which perform important functions. In a cross-validation test, our approach correctly predicts the location of the enzymes' catalytic residues with a sensitivity of 80% at a specificity of 99.4%, and in a separate cross-validation we also correctly annotate the biochemical role of 80% of the catalytic residues. Our results compare favorably to existing methods. Moreover, our method is more broadly applicable because it relies on sequence and not structure information; it may, furthermore, be used in conjunction with structure-based methods.

Collapse

Friedberg I, Godzik A. Connecting the protein structure universe by using sparse recurring fragments. Structure 2007;13:1213-24. [PMID: 16084393 DOI: 10.1016/j.str.2005.05.009] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2005] [Revised: 04/22/2005] [Accepted: 05/11/2005] [Indexed: 10/25/2022]

Fernandez-Fuentes N, Rai BK, Madrid-Aliste CJ, Fajardo JE, Fiser A. Comparative protein structure modeling by combining multiple templates and optimizing sequence-to-structure alignments. Bioinformatics 2007;23:2558-65. [PMID: 17823132 DOI: 10.1093/bioinformatics/btm377] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Zhou H, Skolnick J. Ab initio protein structure prediction using chunk-TASSER. Biophys J 2007;93:1510-8. [PMID: 17496016 PMCID: PMC1948038 DOI: 10.1529/biophysj.107.109959] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

We have developed an ab initio protein structure prediction method called chunk-TASSER that uses ab initio folded supersecondary structure chunks of a given target as well as threading templates for obtaining contact potentials and distance restraints. The predicted chunks, selected on the basis of a new fragment comparison method, are folded by a fragment insertion method. Full-length models are built and refined by the TASSER methodology, which searches conformational space via parallel hyperbolic Monte Carlo. We employ an optimized reduced force field that includes knowledge-based statistical potentials and restraints derived from the chunks as well as threading templates. The method is tested on a dataset of 425 hard target proteins < or =250 amino acids in length. The average TM-scores of the best of top five models per target are 0.266, 0.336, and 0.362 by the threading algorithm SP(3), original TASSER and chunk-TASSER, respectively. For a subset of 80 proteins with predicted alpha-helix content > or =50%, these averages are 0.284, 0.356, and 0.403, respectively. The percentages of proteins with the best of top five models having TM-score > or =0.4 (a statistically significant threshold for structural similarity) are 3.76, 20.94, and 28.94% by SP(3), TASSER, and chunk-TASSER, respectively, overall, while for the subset of 80 predominantly helical proteins, these percentages are 2.50, 23.75, and 41.25%. Thus, chunk-TASSER shows a significant improvement over TASSER for modeling hard targets where no good template can be identified. We also tested chunk-TASSER on 21 medium/hard targets <200 amino-acids-long from CASP7. Chunk-TASSER is approximately 11% (10%) better than TASSER for the total TM-score of the first (best of top five) models. Chunk-TASSER is fully automated and can be used in proteome scale protein structure prediction.

Collapse

López-Viñas E, Bentebibel A, Gurunathan C, Morillas M, de Arriaga D, Serra D, Asins G, Hegardt FG, Gómez-Puertas P. Definition by functional and structural analysis of two malonyl-CoA sites in carnitine palmitoyltransferase 1A. J Biol Chem 2007;282:18212-18224. [PMID: 17452323 DOI: 10.1074/jbc.m700885200] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Affiliation(s)

Eduardo López-Viñas Centro de Biología Molecular "Severo Ochoa" (Consejo Superior de Investigaciones Científicas-Universidad Autónoma de Madrid), Cantoblanco, E-28049 Madrid, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Assia Bentebibel Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Chandrashekaran Gurunathan Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Montserrat Morillas Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Dolores de Arriaga Departamento de Biología Molecular, Universidad de León, E-24071 León, Spain
Dolors Serra Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Guillermina Asins Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain
Fausto G Hegardt Departamento de Bioquímica y Biología Molecular, Facultad de Farmacia, Universidad de Barcelona, E-08028 Barcelona, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain.
Paulino Gómez-Puertas Centro de Biología Molecular "Severo Ochoa" (Consejo Superior de Investigaciones Científicas-Universidad Autónoma de Madrid), Cantoblanco, E-28049 Madrid, Spain; CIBER Institute of Fisiopatología de la Obesidad y Nutrición (CB06/03), Instituto de Salud Carlos III, 28049 Madrid, Spain

Collapse

Chovancová E, Kosinski J, Bujnicki JM, Damborský J. Phylogenetic analysis of haloalkane dehalogenases. Proteins 2007;67:305-16. [PMID: 17295320 DOI: 10.1002/prot.21313] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Abstract

Haloalkane dehalogenases (HLDs) are enzymes that catalyze the cleavage of carbon-halogen bonds by a hydrolytic mechanism. Although comparative biochemical analyses have been published, no classification system has been proposed for HLDs, to date, that reconciles their phylogenetic and functional relationships. In the study presented here, we have analyzed all sequences and structures of genuine HLDs and their homologs detectable by database searches. Phylogenetic analyses revealed that the HLD family can be divided into three subfamilies denoted HLD-I, HLD-II, and HLD-III, of which HLD-I and HLD-III are predicted to be sister-groups. A mismatch between the HLD protein tree and the tree of species, as well as the presence of more than one HLD gene in a few genomes, suggest that horizontal gene transfers, and perhaps also multiple gene duplications and losses have been involved in the evolution of this family. Most of the biochemically characterized HLDs are found in the HLD-II subfamily. The dehalogenating activity of two members of the newly identified HLD-III subfamily has only recently been confirmed, in a study motivated by this phylogenetic analysis. A novel type of the catalytic pentad (Asp-His-Asp+Asn-Trp) was predicted for members of the HLD-III subfamily. Calculation of the evolutionary rates and lineage-specific innovations revealed a common conserved core as well as a set of residues that characterizes each HLD subfamily. The N-terminal part of the cap domain is one of the most variable regions within the whole family as well as within individual subfamilies, and serves as a preferential site for the location of relatively long insertions. The highest variability of discrete sites was observed among residues that are structural components of the access channels. Mutations at these sites modify the anatomy of the channels, which are important for the exchange of ligands between the buried active site and the bulk solvent, thus creating a structural basis for the molecular evolution of new substrate specificities. Our analysis sheds light on the evolutionary history of HLDs and provides a structural framework for designing enzymes with new specificities.

Collapse

Zhu J, Xie L, Honig B. Structural refinement of protein segments containing secondary structure elements: Local sampling, knowledge-based potentials, and clustering. Proteins 2006;65:463-79. [PMID: 16927337 DOI: 10.1002/prot.21085] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Tan YH, Huang H, Kihara D. Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences. Proteins 2006;64:587-600. [PMID: 16799934 DOI: 10.1002/prot.21020] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Chivian D, Baker D. Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection. Nucleic Acids Res 2006;34:e112. [PMID: 16971460 PMCID: PMC1635247 DOI: 10.1093/nar/gkl480] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Pons T, González B, Ceciliani F, Galizzi A. FlgM anti-sigma factors: identification of novel members of the family, evolutionary analysis, homology modeling, and analysis of sequence-structure-function relationships. J Mol Model 2006;12:973-83. [PMID: 16673084 DOI: 10.1007/s00894-005-0096-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2005] [Accepted: 12/02/2005] [Indexed: 10/24/2022]

Qiu J, Elber R. SSALN: an alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs. Proteins 2006;62:881-91. [PMID: 16385554 DOI: 10.1002/prot.20854] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Skowronek KJ, Kosinski J, Bujnicki JM. Theoretical model of restriction endonuclease HpaI in complex with DNA, predicted by fold recognition and validated by site-directed mutagenesis. Proteins 2006;63:1059-68. [PMID: 16498623 DOI: 10.1002/prot.20920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Abstract

Type II restriction enzymes are commercially important deoxyribonucleases and very attractive targets for protein engineering of new specificities. At the same time they are a very challenging test bed for protein structure prediction methods. Typically, enzymes that recognize different sequences show little or no amino acid sequence similarity to each other and to other proteins. Based on crystallographic analyses that revealed the same PD-(D/E)XK fold for more than a dozen case studies, they were nevertheless considered to be related until the combination of bioinformatics and mutational analyses has demonstrated that some of these proteins belong to other, unrelated folds PLD, HNH, and GIY-YIG. As a part of a large-scale project aiming at identification of a three-dimensional fold for all type II REases with known sequences (currently approximately 1000 proteins), we carried out preliminary structure prediction and selected candidates for experimental validation. Here, we present the analysis of HpaI REase, an ORFan with no detectable homologs, for which we detected a structural template by protein fold recognition, constructed a model using the FRankenstein monster approach and identified a number of residues important for the DNA binding and catalysis. These predictions were confirmed by site-directed mutagenesis and in vitro analysis of the mutant proteins. The experimentally validated model of HpaI will serve as a low-resolution structural platform for evolutionary considerations in the subgroup of blunt-cutting REases with different specificities. The research protocol developed in the course of this work represents a streamlined version of the previously used techniques and can be used in a high-throughput fashion to build and validate models for other enzymes, especially ORFans that exhibit no sequence similarity to any other protein in the database.

Collapse

Petrey D, Honig B. Protein Structure Prediction: Inroads to Biology. Mol Cell 2005;20:811-9. [PMID: 16364908 DOI: 10.1016/j.molcel.2005.12.005] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Shachar O, Linial M. A robust method to detect structural and functional remote homologues. Proteins 2005;57:531-8. [PMID: 15382232 DOI: 10.1002/prot.20235] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Bitto E, Bingman CA, Robinson H, Allard STM, Wesenberg GE, Phillips GN. The structure at 2.5 A resolution of human basophilic leukemia-expressed protein BLES03. Acta Crystallogr Sect F Struct Biol Cryst Commun 2005;61:812-7. [PMID: 16511166 PMCID: PMC1978119 DOI: 10.1107/s1744309105023845] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2005] [Accepted: 07/25/2005] [Indexed: 11/10/2022]

Jaroszewski L, Rychlewski L, Li Z, Li W, Godzik A. FFAS03: a server for profile--profile sequence alignments. Nucleic Acids Res 2005;33:W284-8. [PMID: 15980471 PMCID: PMC1160179 DOI: 10.1093/nar/gki418] [Citation(s) in RCA: 456] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Shah PK, Aloy P, Bork P, Russell RB. Structural similarity to bridge sequence space: finding new families on the bridges. Protein Sci 2005;14:1305-14. [PMID: 15840833 PMCID: PMC2253280 DOI: 10.1110/ps.041187405] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Johnston CR, Shields DC. A sequence sub-sampling algorithm increases the power to detect distant homologues. Nucleic Acids Res 2005;33:3772-8. [PMID: 16006623 PMCID: PMC1174907 DOI: 10.1093/nar/gki687] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Bitto E, Bingman CA, Allard STM, Wesenberg GE, Phillips GN. The structure at 1.7 A resolution of the protein product of the At2g17340 gene from Arabidopsis thaliana. Acta Crystallogr Sect F Struct Biol Cryst Commun 2005;61:630-5. [PMID: 16511115 PMCID: PMC1952457 DOI: 10.1107/s1744309105017690] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Accepted: 06/03/2005] [Indexed: 04/10/2023]

Allard STM, Bingman CA, Johnson KA, Wesenberg GE, Bitto E, Jeon WB, Phillips GN. Structure at 1.6 A resolution of the protein from gene locus At3g22680 from Arabidopsis thaliana. Acta Crystallogr Sect F Struct Biol Cryst Commun 2005;61:647-50. [PMID: 16511118 PMCID: PMC1952470 DOI: 10.1107/s1744309105019743] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2005] [Accepted: 06/22/2005] [Indexed: 11/10/2022]