1
|
Olek AT, Rushton PS, Kihara D, Ciesielski P, Aryal UK, Zhang Z, Stauffacher CV, McCann MC, Carpita NC. Essential amino acids in the Plant-Conserved and Class-Specific Regions of cellulose synthases. PLANT PHYSIOLOGY 2023; 191:142-160. [PMID: 36250895 PMCID: PMC9806608 DOI: 10.1093/plphys/kiac479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 09/24/2022] [Indexed: 05/05/2023]
Abstract
The Plant-Conserved Region (P-CR) and the Class-Specific Region (CSR) are two plant-unique sequences in the catalytic core of cellulose synthases (CESAs) for which specific functions have not been established. Here, we used site-directed mutagenesis to replace amino acids and motifs within these sequences predicted to be essential for assembly and function of CESAs. We developed an in vivo method to determine the ability of mutated CesA1 transgenes to complement an Arabidopsis (Arabidopsis thaliana) temperature-sensitive root-swelling1 (rsw1) mutant. Replacement of a Cys residue in the CSR, which blocks dimerization in vitro, rendered the AtCesA1 transgene unable to complement the rsw1 mutation. Examination of the CSR sequences from 33 diverse angiosperm species showed domains of high-sequence conservation in a class-specific manner but with variation in the degrees of disorder, indicating a nonredundant role of the CSR structures in different CESA isoform classes. The Cys residue essential for dimerization was not always located in domains of intrinsic disorder. Expression of AtCesA1 transgene constructs, in which Pro417 and Arg453 were substituted for Ala or Lys in the coiled-coil of the P-CR, were also unable to complement the rsw1 mutation. Despite an expected role for Arg457 in trimerization of CESA proteins, AtCesA1 transgenes with Arg457Ala mutations were able to fully restore the wild-type phenotype in rsw1. Our data support that Cys662 within the CSR and Pro417 and Arg453 within the P-CR of Arabidopsis CESA1 are essential residues for functional synthase complex formation, but our data do not support a specific role for Arg457 in trimerization in native CESA complexes.
Collapse
Affiliation(s)
- Anna T Olek
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
| | - Phillip S Rushton
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, USA
| | - Peter Ciesielski
- Renewable Resources & Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| | - Uma K Aryal
- Bindley Biosciences Center, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Comparative Pathobiology, College of Veterinary Medicine, Purdue University, West Lafayette, Indiana 47907, USA
| | - Zicong Zhang
- Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, USA
| | - Cynthia V Stauffacher
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| | - Maureen C McCann
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| | - Nicholas C Carpita
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, USA
| |
Collapse
|
2
|
Discovering the Ultimate Limits of Protein Secondary Structure Prediction. Biomolecules 2021; 11:biom11111627. [PMID: 34827624 PMCID: PMC8615938 DOI: 10.3390/biom11111627] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 10/25/2021] [Accepted: 10/28/2021] [Indexed: 12/29/2022] Open
Abstract
Secondary structure prediction (SSP) of proteins is an important structural biology technique with many applications. There have been ~300 algorithms published in the past seven decades with fierce competition in accuracy. In the first 60 years, the accuracy of three-state SSP rose from ~56% to 81%; after that, it has long stayed at 81–86%. In the 1990s, the theoretical limit of three-state SSP accuracy had been estimated to be 88%. Thus, SSP is now generally considered not challenging or too challenging to improve. However, we found that the limit of three-state SSP might be underestimated. Besides, there is still much room for improving segment-based and eight-state SSPs, but the limits of these emerging topics have not been determined. This work performs large-scale sequence and structural analyses to estimate SSP accuracy limits and assess state-of-the-art SSP methods. The limit of three-state SSP is re-estimated to be ~92%, 4–5% higher than previously expected, indicating that SSP is still challenging. The estimated limit of eight-state SSP is 84–87%. Several proposals for improving future SSP algorithms are made based on our results. We hope that these findings will help move forward the development of SSP and all its applications.
Collapse
|
3
|
Chen TR, Lin YC, Huang YW, Chen CC, Lo WC. CirPred, the first structure modeling and linker design system for circularly permuted proteins. BMC Bioinformatics 2021; 22:494. [PMID: 34641789 PMCID: PMC8513176 DOI: 10.1186/s12859-021-04403-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2021] [Accepted: 09/24/2021] [Indexed: 11/16/2022] Open
Abstract
Background This work aims to help develop new protein engineering techniques based on a structural rearrangement phenomenon called circular permutation (CP), equivalent to connecting the native termini of a protein followed by creating new termini at another site. Although CP has been applied in many fields, its implementation is still costly because of inevitable trials and errors.
Results Here we present CirPred, a structure modeling and termini linker design method for circularly permuted proteins. Compared with state-of-the-art protein structure modeling methods, CirPred is the only one fully capable of both circularly-permuted modeling and traditional co-linear modeling. CirPred performs well when the permutant shares low sequence identity with the native protein and even when the permutant adopts a different conformation from the native protein because of three-dimensional (3D) domain swapping. Linker redesign experiments demonstrated that the linker design algorithm of CirPred achieved subangstrom accuracy. Conclusions The CirPred system is capable of (1) predicting the structure of circular permutants, (2) designing termini linkers, (3) performing traditional co-linear protein structure modeling, and (4) identifying the CP-induced occurrence of 3D domain swapping. This method is supposed helpful for broadening the application of CP, and its web server is available at http://10.life.nctu.edu.tw/CirPred/ and http://lo.life.nctu.edu.tw/CirPred/. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04403-1.
Collapse
Affiliation(s)
- Teng-Ruei Chen
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan.,Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yen-Cheng Lin
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan.,Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yu-Wei Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan.,Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Chih-Chieh Chen
- Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung, Taiwan
| | - Wei-Cheng Lo
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan. .,Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan. .,Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan. .,Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan. .,The Center for Bioinformatics Research, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.
| |
Collapse
|
4
|
Beaudoin CA, Jamasb AR, Alsulami AF, Copoiu L, van Tonder AJ, Hala S, Bannerman BP, Thomas SE, Vedithi SC, Torres PH, Blundell TL. Predicted structural mimicry of spike receptor-binding motifs from highly pathogenic human coronaviruses. Comput Struct Biotechnol J 2021; 19:3938-3953. [PMID: 34234921 PMCID: PMC8249111 DOI: 10.1016/j.csbj.2021.06.041] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 06/26/2021] [Accepted: 06/27/2021] [Indexed: 12/19/2022] Open
Abstract
Potential coronavirus spike protein mimicry revealed by structural comparison. Human and non-human protein potential interactions with virus identified. Predicted structural mimicry corroborated by protein–protein docking. Epitope-based alignments may help guide vaccine efforts.
Viruses often encode proteins that mimic host proteins in order to facilitate infection. Little work has been done to understand the potential mimicry of the SARS-CoV-2, SARS-CoV, and MERS-CoV spike proteins, particularly the receptor-binding motifs, which could be important in determining tropism and druggability of the virus. Peptide and epitope motifs have been detected on coronavirus spike proteins using sequence homology approaches; however, comparing the three-dimensional shape of the protein has been shown as more informative in predicting mimicry than sequence-based comparisons. Here, we use structural bioinformatics software to characterize potential mimicry of the three coronavirus spike protein receptor-binding motifs. We utilize sequence-independent alignment tools to compare structurally known protein models with the receptor-binding motifs and verify potential mimicked interactions with protein docking simulations. Both human and non-human proteins were returned for all three receptor-binding motifs. For example, all three were similar to several proteins containing EGF-like domains: some of which are endogenous to humans, such as thrombomodulin, and others exogenous, such as Plasmodium falciparum MSP-1. Similarity to human proteins may reveal which pathways the spike protein is co-opting, while analogous non-human proteins may indicate shared host interaction partners and overlapping antibody cross-reactivity. These findings can help guide experimental efforts to further understand potential interactions between human and coronavirus proteins.
Collapse
Affiliation(s)
- Christopher A. Beaudoin
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
- Corresponding authors.
| | - Arian R. Jamasb
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
- Department of Computer Science & Technology, University of Cambridge, JJ Thomson Ave, Cambridge CB3 0FD, United Kingdom
| | - Ali F. Alsulami
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
| | - Liviu Copoiu
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
| | - Andries J. van Tonder
- Department of Veterinary Medicine, University of Cambridge, Madingley Rd, Cambridge CB3 0ES, United Kingdom
| | - Sharif Hala
- King Abdullah International Medical Research Centre – Ministry of National Guard Health Affairs, Jeddah, Saudi Arabia
- King Saud bin Abdulaziz University for Health Sciences, Jeddah, Saudi Arabia
| | - Bridget P. Bannerman
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
| | - Sherine E. Thomas
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
| | - Sundeep Chaitanya Vedithi
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
| | - Pedro H.M. Torres
- Laboratório de Modelagem e Dinâmica Molecular, Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Tom L. Blundell
- Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, United Kingdom
- Corresponding authors.
| |
Collapse
|
5
|
Juan SH, Chen TR, Lo WC. A simple strategy to enhance the speed of protein secondary structure prediction without sacrificing accuracy. PLoS One 2020; 15:e0235153. [PMID: 32603341 PMCID: PMC7326220 DOI: 10.1371/journal.pone.0235153] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 06/09/2020] [Indexed: 01/06/2023] Open
Abstract
The secondary structure prediction of proteins is a classic topic of computational structural biology with a variety of applications. During the past decade, the accuracy of prediction achieved by state-of-the-art algorithms has been >80%; meanwhile, the time cost of prediction increased rapidly because of the exponential growth of fundamental protein sequence data. Based on literature studies and preliminary observations on the relationships between the size/homology of the fundamental protein dataset and the speed/accuracy of predictions, we raised two hypotheses that might be helpful to determine the main influence factors of the efficiency of secondary structure prediction. Experimental results of size and homology reductions of the fundamental protein dataset supported those hypotheses. They revealed that shrinking the size of the dataset could substantially cut down the time cost of prediction with a slight decrease of accuracy, which could be increased on the contrary by homology reduction of the dataset. Moreover, the Shannon information entropy could be applied to explain how accuracy was influenced by the size and homology of the dataset. Based on these findings, we proposed that a proper combination of size and homology reductions of the protein dataset could speed up the secondary structure prediction while preserving the high accuracy of state-of-the-art algorithms. Testing the proposed strategy with the fundamental protein dataset of the year 2018 provided by the Universal Protein Resource, the speed of prediction was enhanced over 20 folds while all accuracy measures remained equivalently high. These findings are supposed helpful for improving the efficiency of researches and applications depending on the secondary structure prediction of proteins. To make future implementations of the proposed strategy easy, we have established a database of size and homology reduced protein datasets at http://10.life.nctu.edu.tw/UniRefNR.
Collapse
Affiliation(s)
- Sheng-Hung Juan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
| | - Teng-Ruei Chen
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
| | - Wei-Cheng Lo
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
- The Center for Bioinformatics Research, National Chiao Tung University, Hsinchu, Taiwan
| |
Collapse
|
6
|
Zhang Q, Wang C, Wan M, Wu Y, Ma Q. Streptococcus pneumoniae Genome-wide Identification and Characterization of BOX Element-binding Domains. Mol Inform 2016; 34:742-52. [PMID: 27491035 DOI: 10.1002/minf.201500044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2015] [Indexed: 11/11/2022]
Abstract
The BOX elements are short repetitive DNA sequences that distribute randomly in intergenic regions of the Streptococcus pneumoniae genome. The function and origin of such elements are still unknown, but they were found to modulate expression of neighboring genes. Evidences suggested that the modulation's mechanism can be fulfilled by sequence-specific interaction of BOX elements with transcription factor family proteins. However, the type and function of these BOX-binding proteins still remain largely unexplored to date. In the current study we described a synthetic protocol to investigate the recognition and interaction between a highly conserved site of BOX elements and the DNA-binding domains of a variety of putative transcription factors in the pneumococcal genome. With the protocol we were able to predict those high-affinity domain binders of the conserved BOX DNA site (BOX DNA) in a high-throughput manner, and analyzed sequence-specific interaction in the domainDNA recognition at molecular level. Consequently, a number of putative transcription factor domains with both high affinity and specificity for the BOX DNA were identified, from which the helix-turn-helix (HTH) motif of a small heat shock factor was selected as a case study and tested for its binding capability toward the double-stranded BOX DNA using fluorescence anisotropy analysis. As might be expected, a relatively high affinity was detected for the interaction of HTH motif with BOX DNA with dissociation constant at nanomolar level. Molecular dynamics simulation, atomic structure examination and binding energy analysis revealed a complicated network of intensive nonbonded interactions across the complex interface, which confers both stability and specificity for the complex architecture.
Collapse
Affiliation(s)
- Qiao Zhang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Changzheng Wang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Min Wan
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Yin Wu
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Qianli Ma
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| |
Collapse
|
7
|
Tyurin А, Sadovskaya N, Nikiforova K, Mustafaev О, Komakhin R, Fadeev V, Goldenkova-Pavlova I. Clostridium thermocellum thermostable lichenase with circular permutations and modifications in the N-terminal region retains its activity and thermostability. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2015; 1854:10-9. [DOI: 10.1016/j.bbapap.2014.10.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 09/25/2014] [Accepted: 10/15/2014] [Indexed: 11/30/2022]
|
8
|
Production of bulk chemicals via novel metabolic pathways in microorganisms. Biotechnol Adv 2012; 31:925-35. [PMID: 23280013 DOI: 10.1016/j.biotechadv.2012.12.008] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Revised: 12/09/2012] [Accepted: 12/23/2012] [Indexed: 02/05/2023]
Abstract
Metabolic engineering has been playing important roles in developing high performance microorganisms capable of producing various chemicals and materials from renewable biomass in a sustainable manner. Synthetic and systems biology are also contributing significantly to the creation of novel pathways and the whole cell-wide optimization of metabolic performance, respectively. In order to expand the spectrum of chemicals that can be produced biotechnologically, it is necessary to broaden the metabolic capacities of microorganisms. Expanding the metabolic pathways for biosynthesizing the target chemicals requires not only the enumeration of a series of known enzymes, but also the identification of biochemical gaps whose corresponding enzymes might not actually exist in nature; this issue is the focus of this paper. First, pathway prediction tools, effectively combining reactions that lead to the production of a target chemical, are analyzed in terms of logics representing chemical information, and designing and ranking the proposed metabolic pathways. Then, several approaches for potentially filling in the gaps of the novel metabolic pathway are suggested along with relevant examples, including the use of promiscuous enzymes that flexibly utilize different substrates, design of novel enzymes for non-natural reactions, and exploration of hypothetical proteins. Finally, strain optimization by systems metabolic engineering in the context of novel metabolic pathways constructed is briefly described. It is hoped that this review paper will provide logical ways of efficiently utilizing 'big' biological data to design and develop novel metabolic pathways for the production of various bulk chemicals that are currently produced from fossil resources.
Collapse
|
9
|
Stephen P, Cheng KC, Lyu PC. Crystal structure of circular permuted RoCBM21 (CP90): dimerisation and proximity of binding sites. PLoS One 2012; 7:e50488. [PMID: 23226294 PMCID: PMC3511584 DOI: 10.1371/journal.pone.0050488] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2012] [Accepted: 10/22/2012] [Indexed: 11/20/2022] Open
Abstract
Glucoamylases, containing starch-binding domains (SBD), have a wide range of scientific and industrial applications. Random mutagenesis and DNA shuffling of the gene encoding a starch-binding domain have resulted in only minor improvements in the affinities of the corresponding protein to their ligands, whereas circular permutation of the RoCBM21 substantially improved its binding affinity and selectivity towards longer-chain carbohydrates. For the study reported herein, we used a standard soluble ligand (amylose EX-I) to characterize the functional and structural aspects of circularly permuted RoCBM21 (CP90). Site-directed mutagenesis and the analysis of crystal structure reveal the dimerisation and an altered binding path, which may be responsible for improved affinity and altered selectivity of this newly created starch-binding domain. The functional and structural characterization of CP90 suggests that it has significant potential in industrial applications.
Collapse
Affiliation(s)
- Preyesh Stephen
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan
| | - Kuo-Chang Cheng
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan
| | - Ping-Chiang Lyu
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan
- Department of Medical Sciences, National Tsing Hua University, Hsinchu, Taiwan
- Graduate Institute of Molecular Systems Biomedicine, China Medical University, Taichung, Taiwan
- * E-mail:
| |
Collapse
|
10
|
Crystal structure of the dopamine N-acetyltransferase-acetyl-CoA complex provides insights into the catalytic mechanism. Biochem J 2012; 446:395-404. [PMID: 22716280 PMCID: PMC3429931 DOI: 10.1042/bj20120520] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The daily cycle of melatonin biosynthesis in mammals is regulated by AANAT (arylalkylamine N-acetyltransferase; EC 2.3.1.87), making it an attractive target for therapeutic control of abnormal melatonin production in mood and sleep disorders. Drosophila melanogaster Dat (dopamine N-acetyltransferase) is an AANAT. Until the present study, no insect Dat structure had been solved, and, consequently, the structural basis for its acetyl-transfer activity was not well understood. We report in the present paper the high-resolution crystal structure for a D. melanogaster Dat-AcCoA (acetyl-CoA) complex obtained using one-edge (selenium) single-wavelength anomalous diffraction. A binding study using isothermal titration calorimetry suggested that the cofactor bound to Dat first before substrate. Examination of the complex structure and a substrate-docked model indicated that Dat contains a novel AANAT catalytic triad. Site-directed mutagenesis, kinetic studies and pH-rate profiles confirmed that Glu(47), Ser(182) and Ser(186) were critical for catalysis. Collectively, the results of the present study suggest that Dat possesses a specialized active site structure dedicated to a catalytic mechanism.
Collapse
|
11
|
Bonnel N, Marteau PF. LNA: fast protein structural comparison using a Laplacian characterization of tertiary structure. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1451-1458. [PMID: 22547433 DOI: 10.1109/tcbb.2012.64] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Abstract—In the last two decades, a lot of protein 3D shapes have been discovered, characterized, and made available thanks to the Protein Data Bank (PDB), that is nevertheless growing very quickly. New scalable methods are thus urgently required to search through the PDB efficiently. This paper presents an approach entitled LNA (Laplacian Norm Alignment) that performs a structural comparison of two proteins with dynamic programming algorithms. This is achieved by characterizing each residue in the protein with scalar features. The feature values are calculated using a Laplacian operator applied on the graph corresponding to the adjacency matrix of the residues. The weighted Laplacian operator we use estimates, at various scales, local deformations of the topology where each residue is located. On some benchmarks, which are widely shared by the community, we obtain qualitatively similar results compared to other competing approaches, but with an algorithm one or two order of magnitudes faster. 180,000 protein comparisons can be done within 1 second with a single recent Graphical Processing Unit (GPU), which makes our algorithm very scalable and suitable for real-time database querying across the web.
Collapse
Affiliation(s)
- Nicolas Bonnel
- IRISA, Université de Bretagne Sud, Campus de Tohannic, Vannes 56000, France.
| | | |
Collapse
|
12
|
Li N, Zhang Z, Zhang L, Wang S, Zou Z, Wang G, Wang Y. Insulin-like growth factor binding protein 7, a member of insulin-like growth factor signal pathway, involved in immune response of small abalone Haliotis diversicolor. FISH & SHELLFISH IMMUNOLOGY 2012; 33:229-242. [PMID: 22584203 DOI: 10.1016/j.fsi.2012.04.016] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Revised: 04/28/2012] [Accepted: 04/30/2012] [Indexed: 05/31/2023]
Abstract
Insulin-like growth factor binding protein 7 (IGFBP7), the only member of the IGFBP superfamily that binds strongly to insulin, may have different functions from other IGFBPs. Unlike other IGFBPs, there is no knowledge available on aquatic invertebrate IGFBP7. In this study, a molluscan IGFBP7 gene, saIGFBP7, was cloned for the first time from the small abalone Haliotis diversicolor. Its full-length cDNA sequence is 1812 bp, with a 720 bp open reading frame encoding a protein of 239 aa. The molecular mass of the deduced protein is approximately 25.37 kDa with an estimated pI of 5.00, and it shares highest 41% identity to IGFBP7 of Amblyomma americanum. Analysis of conserved domains revealed the presence of an IGFBP N-terminal domain (IB), a kazal-type serine proteinase inhibitor domain (KI), and an immunoglobulin-like C2 domain (IgC2) in saIGFBP7. Furthermore, the 12 cysteine residues and the signature amino acid motif 'xCGCCxxC' which are characterized by the amino terminus region of the IGFBP superfamily are all presented in saIGFBP7. Quantitative real-time PCR and western blot were employed to investigate the tissue distribution of saIGFBP7, and its expression under bacterial challenge. The saIGFBP7 mRNA and protein could be detected in all examined tissues, with the highest expression level in hemocytes, higher expression level in gills, and was up-regulated in hemocytes and gills after bacterial injection. In addition, saIGFBP7 mRNA transcripts were observed in a subset of the branchial epithelium and the nucleus of hemocytes using the in situ hybridization method. Interestingly, saIGFBP7 was detected mainly in the goblet-like cell of the branchial epithelium by immunohistochemistry. These results suggested that saIGFBP7 was likely to be involved in a function associated with pathogenic infection and may play an important role in the adult abalone immune system.
Collapse
Affiliation(s)
- Na Li
- Key Laboratory of Healthy Mariculture for East China Sea, Ministry of Agriculture, Fisheries College, Jimei University, Jimei, Xiamen, China
| | | | | | | | | | | | | |
Collapse
|
13
|
Chruszcz M, Chapman MD, Osinski T, Solberg R, Demas M, Porebski PJ, Majorek KA, Pomés A, Minor W. Alternaria alternata allergen Alt a 1: a unique β-barrel protein dimer found exclusively in fungi. J Allergy Clin Immunol 2012; 130:241-7.e9. [PMID: 22664167 DOI: 10.1016/j.jaci.2012.03.047] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2012] [Revised: 02/21/2012] [Accepted: 03/27/2012] [Indexed: 12/16/2022]
Abstract
BACKGROUND Alternaria species is one of the most common molds associated with allergic diseases, and 80% of Alternaria species-sensitive patients produce IgE antibodies to a major protein allergen, Alt a 1. The structure and function of Alt a 1 is unknown. OBJECTIVE We sought to obtain a high-resolution structure of Alt a 1 using x-ray crystallography and to investigate structural relationships between Alt a 1 and other allergens and proteins reported in the Protein Data Bank. METHODS X-ray crystallography was used to determine the structure of Alt a 1 by using a custom-designed set of crystallization conditions. An initial Alt a 1 model was determined by the application of a Ta(6)Br(12)(2+) cluster and single-wavelength anomalous diffraction. Bioinformatic analyses were used to compare the Alt a 1 sequence and structure with that of other proteins. RESULTS Alt a 1 is a unique β-barrel comprising 11 β-strands and forms a "butterfly-like" dimer linked by a single disulfide bond with a large (1345 Å(2)) dimer interface. Intramolecular disulfide bonds are conserved among Alt a 1 homologs. Currently, the Alt a 1 structure has no equivalent in the Protein Data Bank. Bioinformatics analyses suggest that the structure is found exclusively in fungi. Four previously reported putative IgE-binding peptides have been located on the Alt a 1 structure. CONCLUSIONS Alt a 1 has a unique, dimeric β-barrel structure that appears to define a new protein family with unknown function found exclusively in fungi. The location of IgE antibody-binding epitopes is in agreement with the structural analysis of Alt a 1. The Alt a 1 structure will allow mechanistic structure/function studies and immunologic studies directed toward new forms of immunotherapy for Alternaria species-sensitive allergic patients.
Collapse
Affiliation(s)
- Maksymilian Chruszcz
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Ge H, Wang G, Zhang L, Zhang Z, Wang S, Zou Z, Yan S, Wang Y. Molecular cloning and expression of interleukin-1 receptor-associated kinase 4, an important mediator of Toll-like receptor signal pathway, from small abalone Haliotis diversicolor. FISH & SHELLFISH IMMUNOLOGY 2011; 30:1138-1146. [PMID: 21362486 DOI: 10.1016/j.fsi.2011.02.018] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Revised: 02/15/2011] [Accepted: 02/20/2011] [Indexed: 05/30/2023]
Abstract
Mammal interleukin-1 receptor-associated kinases (IRAKs) have been demonstrated to play important functions in TLRs (Toll-like receptor) signal pathway and T cell proliferation, but there is less knowledge available on mollusc IRAKs. In this study, a molluscan IRAK-4 gene, saIRAK-4, was cloned for the first time from the small abalone (Haliotis diversicolor). Its full-length cDNA sequence was 2062 bp, with a 1548 bp open reading frame encoding a protein of 516 aa. The molecular mass of the deduced protein was approximately 57.8 kDa with an estimated pI of 5.23, and showed highest identity (47%) to acorn worm Saccoglossus kowalevskii. Amino acid sequence analysis revealed saIRAK-4 shares conserved signature motifs with other IRAK-4 proteins, including the death domain (DD), serine/threonine/tyrosine protein kinase domain (STYKc), protein kinases ATP-binding region signature, serine/threonine protein kinases active-site signature and prokaryotic membrane lipoprotein lipid attachment site. Quantitative real-time PCR was employed to investigate the tissue distribution of saIRAK-4 mRNA, and its expression in abalone under bacteria challenge and larvae at different developmental stages. The saIRAK-4 mRNA could be detected in all examined tissues, with the highest expression level in gills, and was up-regulated in hemocytes and gills after bacteria injection. Additionally, saIRAK-4 was constitutively expressed at all examined developmental stages. These results indicate that saIRAK-4 could respond to pathogenic infection and may play an important role in the adult abalone immune system and early innate immunity in the process of abalone larval development.
Collapse
Affiliation(s)
- Hui Ge
- The Key Laboratory of Science and Technology for Aquaculture and Food Safety, Fisheries College, Jimei University, Xiamen, Fujian 361021, China
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Chu CH, Lo WC, Wang HW, Hsu YC, Hwang JK, Lyu PC, Pai TW, Tang CY. Detection and alignment of 3D domain swapping proteins using angle-distance image-based secondary structural matching techniques. PLoS One 2010; 5:e13361. [PMID: 20976204 PMCID: PMC2955075 DOI: 10.1371/journal.pone.0013361] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2010] [Accepted: 09/13/2010] [Indexed: 11/18/2022] Open
Abstract
This work presents a novel detection method for three-dimensional domain swapping (DS), a mechanism for forming protein quaternary structures that can be visualized as if monomers had “opened” their “closed” structures and exchanged the opened portion to form intertwined oligomers. Since the first report of DS in the mid 1990s, an increasing number of identified cases has led to the postulation that DS might occur in a protein with an unconstrained terminus under appropriate conditions. DS may play important roles in the molecular evolution and functional regulation of proteins and the formation of depositions in Alzheimer's and prion diseases. Moreover, it is promising for designing auto-assembling biomaterials. Despite the increasing interest in DS, related bioinformatics methods are rarely available. Owing to a dramatic conformational difference between the monomeric/closed and oligomeric/open forms, conventional structural comparison methods are inadequate for detecting DS. Hence, there is also a lack of comprehensive datasets for studying DS. Based on angle-distance (A-D) image transformations of secondary structural elements (SSEs), specific patterns within A-D images can be recognized and classified for structural similarities. In this work, a matching algorithm to extract corresponding SSE pairs from A-D images and a novel DS score have been designed and demonstrated to be applicable to the detection of DS relationships. The Matthews correlation coefficient (MCC) and sensitivity of the proposed DS-detecting method were higher than 0.81 even when the sequence identities of the proteins examined were lower than 10%. On average, the alignment percentage and root-mean-square distance (RMSD) computed by the proposed method were 90% and 1.8Å for a set of 1,211 DS-related pairs of proteins. The performances of structural alignments remain high and stable for DS-related homologs with less than 10% sequence identities. In addition, the quality of its hinge loop determination is comparable to that of manual inspection. This method has been implemented as a web-based tool, which requires two protein structures as the input and then the type and/or existence of DS relationships between the input structures are determined according to the A-D image-based structural alignments and the DS score. The proposed method is expected to trigger large-scale studies of this interesting structural phenomenon and facilitate related applications.
Collapse
Affiliation(s)
- Chia-Han Chu
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, Republic of China
| | - Wei-Cheng Lo
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan, Republic of China
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, Republic of China
| | - Hsin-Wei Wang
- Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan, Republic of China
| | - Yen-Chu Hsu
- Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan, Republic of China
| | - Jenn-Kang Hwang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, Republic of China
| | - Ping-Chiang Lyu
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan, Republic of China
| | - Tun-Wen Pai
- Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan, Republic of China
- * E-mail: (T-WP); (CYT)
| | - Chuan Yi Tang
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, Republic of China
- Department of Computer Science and Information Engineering, Providence University, Taichung, Taiwan, Republic of China
- * E-mail: (T-WP); (CYT)
| |
Collapse
|
16
|
Shyu CR, Pang B, Chi PH, Zhao N, Korkin D, Xu D. ProteinDBS v2.0: a web server for global and local protein structure search. Nucleic Acids Res 2010; 38:W53-8. [PMID: 20538653 PMCID: PMC2896110 DOI: 10.1093/nar/gkq522] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
ProteinDBS v2.0 is a web server designed for efficient and accurate comparisons and searches of structurally similar proteins from a large-scale database. It provides two comparison methods, global-to-global and local-to-local, to facilitate the searches of protein structures or substructures. ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a high-running speed while preserving reasonably high precision of structural comparison. The experimental results show that our system is able to return results of global comparisons in seconds from a complete Protein Data Bank (PDB) database of 152,959 protein chains and that it takes much less time to complete local comparisons from a non-redundant database of 3276 proteins than other accurate comparison methods. ProteinDBS v2.0 supports query by PDB protein ID and by new structures uploaded by users. To our knowledge, this is the only search engine that can simultaneously support global and local comparisons. ProteinDBS v2.0 is a useful tool to investigate functional or evolutional relationships among proteins. Moreover, the common substructures identified by local comparison can be potentially used to assist the human curation process in discovering new domains or folds from the ever-growing protein structure databases. The system is hosted at http://ProteinDBS.rnet.missouri.edu.
Collapse
Affiliation(s)
- Chi-Ren Shyu
- Informatics Institute, University of Missouri, Columbia, MO 65211, USA.
| | | | | | | | | | | |
Collapse
|
17
|
Zhang ZH, Bharatham K, Sherman WA, Mihalek I. deconSTRUCT: general purpose protein database search on the substructure level. Nucleic Acids Res 2010; 38:W590-4. [PMID: 20522512 PMCID: PMC2896154 DOI: 10.1093/nar/gkq489] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
deconSTRUCT webserver offers an interface to a protein database search engine, usable for a general purpose detection of similar protein (sub)structures. Initially, it deconstructs the query structure into its secondary structure elements (SSEs) and reassembles the match to the target by requiring a (tunable) degree of similarity in the direction and sequential order of SSEs. Hierarchical organization and judicious use of the information about protein structure enables deconSTRUCT to achieve the sensitivity and specificity of the established search engines at orders of magnitude increased speed, without tying up irretrievably the substructure information in the form of a hash. In a post-processing step, a match on the level of the backbone atoms is constructed. The results presented to the user consist of the list of the matched SSEs, the transformation matrix for rigid superposition of the structures and several ways of visualization, both downloadable and implemented as a web-browser plug-in. The server is available at http://epsf.bmad.bii.a-star.edu.sg/struct_server.html.
Collapse
Affiliation(s)
- Zong Hong Zhang
- Bioinformatics Institute 30 Biopolis Street, #07-01 Matrix, Singapore
| | | | | | | |
Collapse
|
18
|
Abstract
BACKGROUND Protein structure comparison is a fundamental task in structural biology. While the number of known protein structures has grown rapidly over the last decade, searching a large database of protein structures is still relatively slow using existing methods. There is a need for new techniques which can rapidly compare protein structures, whilst maintaining high matching accuracy. RESULTS We have developed IR Tableau, a fast protein comparison algorithm, which leverages the tableau representation to compare protein tertiary structures. IR tableau compares tableaux using information retrieval style feature indexing techniques. Experimental analysis on the ASTRAL SCOP protein structural domain database demonstrates that IR Tableau achieves two orders of magnitude speedup over the search times of existing methods, while producing search results of comparable accuracy. CONCLUSION We show that it is possible to obtain very significant speedups for the protein structure comparison problem, by employing an information retrieval style approach for indexing proteins. The comparison accuracy achieved is also strong, thus opening the way for large scale processing of very large protein structure databases.
Collapse
|