1
|
Negi SS, Schein CH, Braun W. The updated Structural Database of Allergenic Proteins (SDAP 2.0) provides 3D models for allergens and incorporated bioinformatics tools. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. GLOBAL 2023; 2:100162. [PMID: 37781674 PMCID: PMC10509899 DOI: 10.1016/j.jacig.2023.100162] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 07/07/2023] [Accepted: 07/08/2023] [Indexed: 10/03/2023]
Abstract
Background Allergenic proteins can cause IgE-mediated adverse reactions in sensitized individuals. Although the sequences of many allergenic proteins have been identified, bioinformatics data analysis with advanced computational methods and modeling is needed to identify the basis for IgE binding and cross-reactivity. Objective We aim to present the features and use of the updated Structural Database of Allergenic Proteins 2.0 (SDAP 2.0) webserver, a unique, publicly available resource to compare allergens using specially designed computational tools and new high-quality 3-D models for most known allergens. Methods Previously developed and novel software tools for identifying cross-reactive allergens using sequence and structure similarity are implemented in SDAP 2.0. A comprehensive set of high-quality 3-D models of most allergens was generated with the state-of-the-art AlphaFold 2 software. A graphics tool enables the interactive visualization of IgE epitopes on experimentally determined and modeled 3-D structures. Results A user can search for allergens similar to a given input sequence with the FASTA algorithm or the window-based World Health Organization/International Union of Immunological Societies (WHO/IUIS) guidelines on safety concerns of novel food products. Peptides similar to known IgE epitopes can be identified with the property distance tool and conformational epitopes by the Cross-React method. The updated database contains 1657 manually curated sequences including all allergens from the IUIS database, 334 experimentally determined X-ray or NMR structures, and 1565 3-D models. Each allergen/isoallergen is classified according to its protein family. Conclusions SDAP provides access to the steadily increasing information on allergenic structures and epitopes with integrated bioinformatics tools to identify and analyze their similarities. In addition to serving the research and regulatory community, it provides clinicians with tools to identify potential coallergies in a sensitive patient and can help companies to design hypoallergenic foods and immunotherapies.
Collapse
Affiliation(s)
- Surendra S. Negi
- Sealy Center for Structural Biology and Molecular Biophysics, The University of Texas Medical Branch, Galveston, Tex
- Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Tex
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex
| | - Catherine H. Schein
- Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Tex
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex
| | - Werner Braun
- Sealy Center for Structural Biology and Molecular Biophysics, The University of Texas Medical Branch, Galveston, Tex
- Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Tex
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex
| |
Collapse
|
2
|
Dreskin SC, Koppelman SJ, Andorf S, Nadeau KC, Kalra A, Braun W, Negi SS, Chen X, Schein CH. The importance of the 2S albumins for allergenicity and cross-reactivity of peanuts, tree nuts, and sesame seeds. J Allergy Clin Immunol 2021; 147:1154-1163. [PMID: 33217410 PMCID: PMC8035160 DOI: 10.1016/j.jaci.2020.11.004] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 11/06/2020] [Accepted: 11/10/2020] [Indexed: 12/17/2022]
Abstract
Allergies to peanuts, tree nuts, and sesame seeds are among the most important food-related causes of anaphylaxis. Important clinical questions include: Why is there a variable occurrence of coallergy among these foods and Is this immunologically mediated? The clinical and immunologic data summarized here suggest an immunologic basis for these coallergies that is based on similarities among the 2S albumins. Data from component resolved diagnostics have highlighted the relationship between IgE binding to these allergens and the presence of IgE-mediated food allergy. Furthermore, in vitro and in vivo experiments provide strong evidence that the 2S albumins are the most important allergens in peanuts for inducing an allergic effector response. Although the 2S albumins are diverse, they have a common disulfide-linked core with similar physicochemical properties that make them prime candidates to explain much of the observed coallergy among peanuts, tree nuts, and sesame seeds. The well-established frequency of cashew and pistachio nut coallergy (64%-100%) highlights how the structural similarities among their 2S albumins may account for observed clinical cross-reactivity. A complete understanding of the physicochemical properties of the 2S albumins in peanuts, tree nuts, and sesame seeds will enhance our ability to diagnose, treat, and ultimately prevent these allergies.
Collapse
Affiliation(s)
- Stephen C Dreskin
- Division of Allergy and Clinical Immunology, Department of Medicine, University of Colorado Denver, Aurora, Colo.
| | - Stef J Koppelman
- Food Allergy Research and Resource Program, Department of Food Science and Technology, University of Nebraska, Lincoln, Neb
| | - Sandra Andorf
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio; Division of Allergy and Immunology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio; Sean N. Parker Center for Allergy and Asthma Research, Stanford University School of Medicine, Stanford, Calif
| | - Kari C Nadeau
- Sean N. Parker Center for Allergy and Asthma Research, Stanford University School of Medicine, Stanford, Calif
| | - Anjeli Kalra
- Division of Allergy and Clinical Immunology, Department of Medicine, University of Colorado Denver, Aurora, Colo
| | - Werner Braun
- Sealy Center for Structural Biology and Molecular Biophysics, The University of Texas Medical Branch, Galveston, Tex; Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex
| | - Surendra S Negi
- Sealy Center for Structural Biology and Molecular Biophysics, The University of Texas Medical Branch, Galveston, Tex; Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex
| | - Xueni Chen
- Division of Allergy and Clinical Immunology, Department of Medicine, University of Colorado Denver, Aurora, Colo
| | - Catherine H Schein
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Tex; Institute for Human Infection and Immunity, The University of Texas Medical Branch, Galveston, Tex.
| |
Collapse
|
3
|
Nesbit JB, Schein CH, Braun BA, Gipson SAY, Cheng H, Hurlburt BK, Maleki SJ. Epitopes with similar physicochemical properties contribute to cross reactivity between peanut and tree nuts. Mol Immunol 2020; 122:223-231. [PMID: 32442779 DOI: 10.1016/j.molimm.2020.03.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 02/11/2020] [Accepted: 03/26/2020] [Indexed: 12/28/2022]
Abstract
Many individuals with peanut (PN) allergy have severe reactions to tree nuts (TN) such as walnuts or cashews. Although allergenic proteins in TN and PN have overall low identity, they share discrete sequences similar in physicochemical properties (PCP) to known IgE epitopes. Here, PCP-consensus peptides (cp, 13 aa and 31 aa) were identified from an alignment of epitope rich regions of walnut vicilin, Jug r 2, leader sequence (J2LS) and cross-reactive epitopes in the 2S albumins of peanut and synthesized. A peptide similarity search in the Structural Database of Allergenic Proteins (SDAP) revealed a network of peptides similar (low property distance, PD) to the 13 aa cp (13cp) in many different plant allergens. Peptides similar to the 13cp in PN and TN allergens bound IgE from sera of patients allergic to PN and TN in peptide microarray analysis. The 13cp was used to produce a rabbit consensus peptide antibody (cpAB) that detected proteins containing repeats similar to the 13cp in western blots of various nut extracts, in which reactive proteins were identified by mass spectrometry. The cpAB bound more specifically to allergens and nut extracts containing multiple repeats similar to the 13 cp, such as almond (Pru du 6), peanut (Ara h 2) and walnut (Jug r 2). IgE binding to various nut extracts is inhibited by recombinant J2LS sequence and synthetic 31cp. Thus, several repeated sequences similar to the 13cp are bound by IgE. Multiple similar repeats in several allergens could account for reaction severity and clinically relevant cross-reactivity to PN and TN. These findings may help improve detection, diagnostic, and therapeutic tools.
Collapse
Affiliation(s)
- Jacqueline B Nesbit
- Dept of Agriculture-Agricultural Research Service-Southern Regional Research Center (USDA-ARS-SRRC), New Orleans, LA, United States
| | - Catherine H Schein
- Department of Biochemistry and Molecular Biology, Institute for Human Infection and Immunity, University of Texas Medical Branch at Galveston (UTMB), TX, United States.
| | - Benjamin A Braun
- Department of Computer Science, Stanford University, United States
| | - Stephen A Y Gipson
- Dept of Agriculture-Agricultural Research Service-Southern Regional Research Center (USDA-ARS-SRRC), New Orleans, LA, United States
| | - Hsiaopo Cheng
- Dept of Agriculture-Agricultural Research Service-Southern Regional Research Center (USDA-ARS-SRRC), New Orleans, LA, United States
| | - Barry K Hurlburt
- Dept of Agriculture-Agricultural Research Service-Southern Regional Research Center (USDA-ARS-SRRC), New Orleans, LA, United States
| | - Soheila J Maleki
- Dept of Agriculture-Agricultural Research Service-Southern Regional Research Center (USDA-ARS-SRRC), New Orleans, LA, United States.
| |
Collapse
|
4
|
Braun W, Schein CH. Membrane interaction and functional plasticity of inositol polyphosphate 5-phosphatases. Structure 2015; 22:664-6. [PMID: 24807076 DOI: 10.1016/j.str.2014.04.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In this issue of Structure, Trésaugues and colleagues determined the interaction of membrane-bound phosphoinositides with three clinically significant human inositol polyphosphate 5-phosphatases (I5Ps). A comparison to the structures determined with soluble substrates revealed differences in the binding mode and suggested how the I5Ps and apurinic endonuclease (APE1) activities evolved from the same metal-binding active center.
Collapse
Affiliation(s)
- Werner Braun
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX 77555, USA; Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555, USA.
| | - Catherine H Schein
- Foundation for Applied Molecular Evolution, 720 SW 2nd Avenue Suite 201, Gainesville, FL 32601, USA
| |
Collapse
|
5
|
Dimitrov I, Bangov I, Flower DR, Doytchinova I. AllerTOP v.2--a server for in silico prediction of allergens. J Mol Model 2014; 20:2278. [PMID: 24878803 DOI: 10.1007/s00894-014-2278-5] [Citation(s) in RCA: 582] [Impact Index Per Article: 58.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 04/25/2014] [Indexed: 11/29/2022]
Abstract
Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance--typically proteins--resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).
Collapse
Affiliation(s)
- Ivan Dimitrov
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav st., 1000, Sofia, Bulgaria
| | | | | | | |
Collapse
|
6
|
Schein CH, Bowen DM, Lewis JA, Choi K, Paul A, van der Heden van Noort GJ, Lu W, Filippov DV. Physicochemical property consensus sequences for functional analysis, design of multivalent antigens and targeted antivirals. BMC Bioinformatics 2012; 13 Suppl 13:S9. [PMID: 23320474 PMCID: PMC3426803 DOI: 10.1186/1471-2105-13-s13-s9] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Background Analysis of large sets of biological sequence data from related strains or organisms is complicated by superficial redundancy in the set, which may contain many members that are identical except at one or two positions. Thus a new method, based on deriving physicochemical property (PCP)-consensus sequences, was tested for its ability to generate reference sequences and distinguish functionally significant changes from background variability. Methods The PCP consensus program was used to automatically derive consensus sequences starting from sequence alignments of proteins from Flaviviruses (from the Flavitrack database) and human enteroviruses, using a five dimensional set of Eigenvectors that summarize over 200 different scalar values for the PCPs of the amino acids. A PCP-consensus protein of a Dengue virus envelope protein was produced recombinantly and tested for its ability to bind antibodies to strains using ELISA. Results PCP-consensus sequences of the flavivirus family could be used to classify them into five discrete groups and distinguish areas of the envelope proteins that correlate with host specificity and disease type. A multivalent Dengue virus antigen was designed and shown to bind antibodies against all four DENV types. A consensus enteroviral VPg protein had the same distinctive high pKa as wild type proteins and was recognized by two different polymerases. Conclusions The process for deriving PCP-consensus sequences for any group of aligned similar sequences, has been validated for sequences with up to 50% diversity. Ongoing projects have shown that the method identifies residues that significantly alter PCPs at a given position, and might thus cause changes in function or immunogenicity. Other potential applications include deriving target proteins for drug design and diagnostic kits.
Collapse
Affiliation(s)
- Catherine H Schein
- Institute for Translational Sciences, Computational Biology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Texas 77555-0857, USA.
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Bowen DM, Lewis JA, Lu W, Schein CH. Simplifying complex sequence information: a PCP-consensus protein binds antibodies against all four Dengue serotypes. Vaccine 2012; 30:6081-7. [PMID: 22863657 DOI: 10.1016/j.vaccine.2012.07.042] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2012] [Revised: 07/13/2012] [Accepted: 07/18/2012] [Indexed: 12/15/2022]
Abstract
Designing proteins that reflect the natural variability of a pathogen is essential for developing novel vaccines and drugs. Flaviviruses, including Dengue (DENV) and West Nile (WNV), evolve rapidly and can "escape" neutralizing monoclonal antibodies by mutation. Designing antigens that represent many distinct strains is important for DENV, where infection with a strain from one of the four serotypes may lead to severe hemorrhagic disease on subsequent infection with a strain from another serotype. Here, a DENV physicochemical property (PCP)-consensus sequence was derived from 671 unique sequences from the Flavitrack database. PCP-consensus proteins for domain 3 of the envelope protein (EdomIII) were expressed from synthetic genes in Escherichia coli. The ability of the purified consensus proteins to bind polyclonal antibodies generated in response to infection with strains from each of the four DENV serotypes was determined. The initial consensus protein bound antibodies from DENV-1-3 in ELISA and Western blot assays. This sequence was altered in 3 steps to incorporate regions of maximum variability, identified as significant changes in the PCPs, characteristic of DENV-4 strains. The final protein was recognized by antibodies against all four serotypes. Two amino acids essential for efficient binding to all DENV antibodies are part of a discontinuous epitope previously defined for a neutralizing monoclonal antibody. The PCP-consensus method can significantly reduce the number of experiments required to define a multivalent antigen, which is particularly important when dealing with pathogens that must be tested at higher biosafety levels.
Collapse
Affiliation(s)
- David M Bowen
- Computational Biology, Sealy Center for Structural Biology and Molecular Biophysics, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-0857, United States
| | | | | | | |
Collapse
|
8
|
Lu W, Negi SS, Oberhauser AF, Braun W. Engineering proteins with enhanced mechanical stability by force-specific sequence motifs. Proteins 2012; 80:1308-15. [PMID: 22274941 DOI: 10.1002/prot.24027] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Revised: 12/22/2011] [Accepted: 01/02/2012] [Indexed: 11/09/2022]
Abstract
Use of atomic force microscopy (AFM) has recently led to a better understanding of the molecular mechanisms of the unfolding process by mechanical forces; however, the rational design of novel proteins with specific mechanical strength remains challenging. We have approached this problem from a new perspective that generates linear physical-chemical properties (PCP) motifs from a limited AFM data set. Guided by our linear sequence analysis, we designed and analyzed four new mutants of the titin I1 domain with the goal of increasing the domain's mechanical strength. All four mutants could be cloned and expressed as soluble proteins. AFM data indicate that at least two of the mutants have increased molecular mechanical strength. This observation suggests that the PCP method is useful to graft sequences specific for high mechanical stability to weak proteins to increase their mechanical stability, and represents an additional tool in the design of novel proteins besides steered molecular dynamics calculations, coarse grained simulations, and ϕ-value analysis of the transition state.
Collapse
Affiliation(s)
- Wenzhe Lu
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, Texas 77555-0857, USA
| | | | | | | |
Collapse
|
9
|
Oezguen N, Mantha AK, Izumi T, Schein CH, Mitra S, Braun W. MD simulation and experimental evidence for Mg²+ binding at the B site in human AP endonuclease 1. Bioinformation 2011; 7:184-98. [PMID: 22102776 PMCID: PMC3218521 DOI: 10.6026/97320630007184] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2011] [Accepted: 10/03/2011] [Indexed: 11/23/2022] Open
Abstract
Apurinic/apyrimidinic endonuclease 1 (APE1), a central enzyme in the base excision repair pathway, cleaves damaged DNA in Mg(2+) dependent reaction. Despite characterization of nine X-ray crystallographic structures of human APE1, in some cases, bound to various metal ions and substrate/product, the position of the metal ion and its stoichiometry for the cleavage reaction are still being debated. While a mutation of the active site E96Q was proposed to eliminate Mg(2+) binding at the "A" site, we show experimentally that this mutant still requires Mg(2+) at concentration similar to that for the wild type enzyme to cleave the AP site in DNA. Molecular dynamics simulations of the wild type APE1, E96Q and a double missense mutant E96Q + D210N indicate that Mg(2+) placed at the A-site destabilizes the bound AP site-containing DNA. In these simulations, the H-bond chain D238-H309-AP site oxygen is broken and the substrate DNA is shifted away from its crystal structure position (1DE9). In contrast, simulations with the Mg(2+) at site B or A+B sites leave the substrate DNA at the position shown in the crystal structure (1DE9). Taken together our MD simulations and biochemical analysis suggests that Mg(2+) binding at the B site is involved in the reaction mechanism associated with endonuclease function of APE1.
Collapse
Affiliation(s)
- Numan Oezguen
- Internal Medicine-Endocrinology Department, University of Texas Medical Branch, Galveston, TX 77555-1060, USA
| | - Anil K Mantha
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1079, USA
| | - Tadahide Izumi
- Stanley S. Scott Cancer Center and Department of Otolaryngology, 533 Bolivar St., University Health Sciences Center, New Orleans, LA 70112, USA
| | - Catherine H Schein
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1079, USA
| | - Sankar Mitra
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1079, USA
| | - Werner Braun
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1079, USA
| |
Collapse
|
10
|
AllerML: markup language for allergens. Regul Toxicol Pharmacol 2011; 60:151-60. [PMID: 21420460 DOI: 10.1016/j.yrtph.2011.03.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2011] [Revised: 03/14/2011] [Accepted: 03/16/2011] [Indexed: 02/01/2023]
Abstract
Many concerns have been raised about the potential allergenicity of novel, recombinant proteins into food crops. Guidelines, proposed by WHO/FAO and EFSA, include the use of bioinformatics screening to assess the risk of potential allergenicity or cross-reactivities of all proteins introduced, for example, to improve nutritional value or promote crop resistance. However, there are no universally accepted standards that can be used to encode data on the biology of allergens to facilitate using data from multiple databases in this screening. Therefore, we developed AllerML a markup language for allergens to assist in the automated exchange of information between databases and in the integration of the bioinformatics tools that are used to investigate allergenicity and cross-reactivity. As proof of concept, AllerML was implemented using the Structural Database of Allergenic Proteins (SDAP; http://fermi.utmb.edu/SDAP/) database. General implementation of AllerML will promote automatic flow of validated data that will aid in allergy research and regulatory analysis.
Collapse
|
11
|
Danecek P, Lu W, Schein CH. PCP consensus sequences of flaviviruses: correlating variance with vector competence and disease phenotype. J Mol Biol 2009; 396:550-63. [PMID: 19969003 DOI: 10.1016/j.jmb.2009.11.070] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2009] [Revised: 11/18/2009] [Accepted: 11/30/2009] [Indexed: 01/11/2023]
Abstract
BACKGROUND Computational methods are needed to design multivalent vaccines against flaviviruses (FVs) such as the West Nile virus or the dengue virus (DENV). OBJECTIVE We aimed to use physicochemical property (PCP) consensus sequences of FV strains to delineate conserved motifs, areas of maximum variability, and specific loci that correlate with arthropod vector, serotype, and disease severity. METHODS PCP consensus sequences for 27 species were prepared from 928 annotated sequences catalogued in Flavitrack. Alignments of these correlated well with the known structures of the NS3 protease domain and envelope (E) proteins. The PCPMer suite was used to identify motifs common to all FVs. Areas of PCP variability that correlated with phenotype were plotted on the structures. RESULTS Despite considerable diversity at the amino acid level, PCPs for both proteins were well conserved throughout the FVs. A series of insertions in E separated tick- from mosquito-borne viruses and all arthropod-borne viruses from isolates with no known vector or directly from insects. Comparison of a PCP consensus sequence of E derived from 600 DENV strains (DENV600) with individual ones for DENV1-DENV4 showed that most major serotype-specific variation occurs near these insertions. The DENV600 differed from one prepared from eight hemorrhagic or fatal strains from four DENV serotypes at only three positions, two of which overlap known escape mutant sites. CONCLUSIONS Comparing consensus sequences showed that substantial changes occur in only a few areas of the E protein. PCP consensus sequences can contribute to the design of multivalent vaccines.
Collapse
Affiliation(s)
- Petr Danecek
- Sealy Center for Structural Biology and Molecular Biophysics, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, 301 University Boulevard, Galveston, TX 77555-0857, USA
| | | | | |
Collapse
|
12
|
Garcia TI, Oberhauser AF, Braun W. Mechanical stability and differentially conserved physical-chemical properties of titin Ig-domains. Proteins 2009; 75:706-18. [PMID: 19003986 DOI: 10.1002/prot.22281] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
The mechanisms that determine mechanical stabilities of protein folds remain elusive. Our understanding of these mechanisms is vital to both bioengineering efforts and to the better understanding and eventual treatment of pathogenic mutations affecting mechanically important proteins such as titin. We present a new approach to analyze data from single-molecule force spectroscopy for different domains of the giant muscle protein titin. The region of titin found in the I-band of a sarcomere is composed of about 40 Ig-domains and is exposed to force under normal physiological conditions and connects the free-hanging ends of the myosin filaments to the Z-disc. Recent single-molecule force spectroscopy data show a mechanical hierarchy in the I-band domains. Domains near the C-terminus in this region unfold at forces two to three times greater than domains near the beginning of the I-band. Though all of these Ig-domains are thought to share a fold and topology common to members of the Ig-like fold family, the sequences of neighboring domains vary greatly with an average sequence identity of only 25%. We examine in this study the relation of these unique mechanical stabilities of each I-band Ig domain to specific, conserved physical-chemical properties of amino acid sequences in related Ig domains. We find that the sequences of each individual titin Ig domain are very highly conserved, with an average sequence identity of 79% across species that are divergent as humans, chickens, and zebra fish. This indicates that the mechanical properties of each domain are well conserved and tailored to its unique position in the titin molecule. We used the PCPMer software to determine the conservation of amino acid properties in titin Ig domains grouped by unfolding forces into "strong" and "weak" families. We found two motifs unique to each family that may have some role in determining the mechanical properties of these Ig domains. A detailed statistical analysis of properties of individual residues revealed several positions that displayed differentially conserved properties in strong and weak families. In contrast to previous studies, we find evidence that suggests that the mechanical stability of Ig domains is determined by several residues scattered across the beta-sandwich fold, and force sensitive residues are not only confined to the A'-G region.
Collapse
Affiliation(s)
- Tzintzuni I Garcia
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555, USA
| | | | | |
Collapse
|
13
|
Characteristic motifs for families of allergenic proteins. Mol Immunol 2008; 46:559-68. [PMID: 18951633 DOI: 10.1016/j.molimm.2008.07.034] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Revised: 07/22/2008] [Accepted: 07/23/2008] [Indexed: 12/16/2022]
Abstract
The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver MotifMate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins.
Collapse
|
14
|
Ivanciuc O, Midoro-Horiuti T, Schein CH, Xie L, Hillman GR, Goldblum RM, Braun W. The property distance index PD predicts peptides that cross-react with IgE antibodies. Mol Immunol 2008; 46:873-83. [PMID: 18950868 DOI: 10.1016/j.molimm.2008.09.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2008] [Accepted: 09/08/2008] [Indexed: 11/15/2022]
Abstract
Similarities in the sequence and structure of allergens can explain clinically observed cross-reactivities. Distinguishing sequences that bind IgE in patient sera can be used to identify potentially allergenic protein sequences and aid in the design of hypo-allergenic proteins. The property distance index PD, incorporated in our Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/), may identify potentially cross-reactive segments of proteins, based on their similarity to known IgE epitopes. We sought to obtain experimental validation of the PD index as a quantitative predictor of IgE cross-reactivity, by designing peptide variants with predetermined PD scores relative to three linear IgE epitopes of Jun a 1, the dominant allergen from mountain cedar pollen. For each of the three epitopes, 60 peptides were designed with increasing PD values (decreasing physicochemical similarity) to the starting sequence. The peptides synthesized on a derivatized cellulose membrane were probed with sera from patients who were allergic to Jun a 1, and the experimental data were interpreted with a PD classification method. Peptides with low PD values relative to a given epitope were more likely to bind IgE from the sera than were those with PD values larger than 6. Control sequences, with PD values between 18 and 20 to all the three epitopes, did not bind patient IgE, thus validating our procedure for identifying negative control peptides. The PD index is a statistically validated method to detect discrete regions of proteins that have a high probability of cross-reacting with IgE from allergic patients.
Collapse
Affiliation(s)
- Ovidiu Ivanciuc
- Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, 301 University Boulevard, Galveston, TX 77555-0857, United States
| | | | | | | | | | | | | |
Collapse
|
15
|
Oezguen N, Kumar S, Hindupur A, Braun W, Muralidhara BK, Halpert JR. Identification and analysis of conserved sequence motifs in cytochrome P450 family 2. Functional and structural role of a motif 187RFDYKD192 in CYP2B enzymes. J Biol Chem 2008; 283:21808-16. [PMID: 18495666 PMCID: PMC2490781 DOI: 10.1074/jbc.m708582200] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2007] [Revised: 05/20/2008] [Indexed: 11/06/2022] Open
Abstract
Using a multiple alignment of 175 cytochrome P450 (CYP) family 2 sequences, 20 conserved sequence motifs (CSMs) were identified with the program PCPMer. Functional importance of the CSM in CYP2B enzymes was assessed from available data on site-directed mutants and genetic variants. These analyses suggested an important role of the CSM 8, which corresponds to(187)RFDYKD(192) in CYP2B4. Further analysis showed that residues 187, 188, 190, and 192 have a very high rank order of conservation compared with 189 and 191. Therefore, eight mutants (R187A, R187K, F188A, D189A, Y190A, K191A, D192A, and a negative control K186A) were made in an N-terminal truncated and modified form of CYP2B4 with an internal mutation, which is termed 2B4dH/H226Y. Function was examined with the substrates 7-methoxy-4-(trifluoromethyl)coumarin (7-MFC), 7-ethoxy-4-(trifluoromethyl)coumarin (7-EFC), 7-benzyloxy-4-(trifluoromethyl)coumarin (7-BFC), and testosterone and with the inhibitors 4-(4-chlorophenyl)imidazole (4-CPI) and bifonazole (BIF). Compared with the template and K186A, the mutants R187A, R187K, F188A, Y190A, and D192A showed > or =2-fold altered substrate specificity, k(cat), K(m), and/or k(cat)/K(m) for 7-MFC and 7-EFC and 3- to 6-fold decreases in differential inhibition (IC(50,BIF)/IC(50,4-CPI)). Subsequently, these mutants displayed 5-12 degrees C decreases in thermal stability (T(m)) and 2-8 degrees C decreases in catalytic tolerance to temperature (T(50)) compared with the template and K186A. Furthermore, when R187A and D192A were introduced in CYP2B1dH, the P450 expression and thermal stability were decreased. In addition, R187A showed increased activity with 7-EFC and decreased IC(50,BIF)/IC(50,4-CPI) compared with 2B1dH. Analysis of long range residue-residue interactions in the CYP2B4 crystal structures indicated strong hydrogen bonds involving Glu(149)-Asn(177)-Arg(187)-Tyr(190) and Asp(192)-Val(194), which were significantly-reduced/abolished by the Arg(187)-->Ala and Asp(192)-->Alasubstitutions, respectively.
Collapse
Affiliation(s)
- Numan Oezguen
- Skaggs School of Pharmacy & Pharmaceutical Sciences, University of California-San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | | | | | | | | | | |
Collapse
|
16
|
Lipton AS, Heck RW, Primak S, McNeill DR, Wilson DM, Ellis PD. Characterization of Mg2+ binding to the DNA repair protein apurinic/apyrimidic endonuclease 1 via solid-state 25Mg NMR spectroscopy. J Am Chem Soc 2008; 130:9332-41. [PMID: 18576638 PMCID: PMC2564828 DOI: 10.1021/ja0776881] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Apurinic/apyrimidinic endonuclease 1 (APE1), a member of the divalent cation-dependent phosphoesterase superfamily of proteins that retain the conserved four-layered alpha/beta-sandwich structural core, is an essential protein that functions as part of base excision repair to remove mutagenic and cytotoxic abasic sites from DNA. Using low-temperature solid-state (25)Mg NMR spectroscopy and various mutants of APE1, we demonstrate that Mg(2+) binds to APE1 and a functional APE1-substrate DNA complex with an overall stoichiometry of one Mg(2+) per mole of APE1 as predicted by the X-ray work of Tainer and co-workers (Mol, C. D.; Kuo, C. F.; Thayer, M. M.; Cunningham, R. P.; Tainer, J. A. Nature 1995, 374 , 381-386). However, the NMR spectra show that the single Mg(2+) site is disordered. We discuss the probable reasons for the disorder at the Mg(2+) binding site. The most likely source of this disorder is arrangement of the protein-ligands about the Mg(2+) (cis and trans isomers). The existence of these isomers reinforces the notion of the plasticity of the metal binding site within APE1.
Collapse
Affiliation(s)
- A S Lipton
- The Biological Sciences Directorate, Pacific Northwest National Laboratory, K8-98, 902 Battelle Boulevard, Richland, Washington 99352, USA
| | | | | | | | | | | |
Collapse
|
17
|
Chen D, Misra M, Sower L, Peterson JW, Kellogg GE, Schein CH. Novel inhibitors of anthrax edema factor. Bioorg Med Chem 2008; 16:7225-33. [PMID: 18620864 DOI: 10.1016/j.bmc.2008.06.036] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2008] [Revised: 06/18/2008] [Accepted: 06/20/2008] [Indexed: 01/13/2023]
Abstract
Several pathogenic bacteria produce adenylyl cyclase toxins, such as the edema factor (EF) of Bacillus anthracis. These disturb cellular metabolism by catalyzing production of excessive amounts of the regulatory molecule cAMP. Here, a structure-based method, where a 3D-pharmacophore that fit the active site of EF was constructed from fragments, was used to identify non-nucleotide inhibitors of EF. A library of small molecule fragments was docked to the EF-active site in existing crystal structures, and those with the highest HINT scores were assembled into a 3D-pharmacophore. About 10,000 compounds, from over 2.7 million compounds in the ZINC database, had a similar molecular framework. These were ranked according to their docking scores, using methodology that was shown to achieve maximum accuracy (i.e., how well the docked position matched the experimentally determined site for ATP analogues in crystal structures of the complex). Finally, 19 diverse compounds with the best AutoDock binding/docking scores were assayed in a cell-based assay for their ability to reduce cAMP secretion induced by EF. Four of the test compounds, from different structural groups, inhibited in the low micromolar range. One of these has a core structure common to phosphatase inhibitors previously identified by high-throughput assays of a diversity library. Thus, the fragment-based pharmacophore identified a small number of diverse compounds for assay, and greatly enhanced the selection process of advanced lead compounds for combinatorial design.
Collapse
Affiliation(s)
- Deliang Chen
- Sealy Center for Structural Biology and Molecular Biophysics, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, 301 University Boulevard, Galveston, TX 77555-0857, USA
| | | | | | | | | | | |
Collapse
|
18
|
Torrance JW, Macarthur MW, Thornton JM. Evolution of binding sites for zinc and calcium ions playing structural roles. Proteins 2008; 71:813-30. [PMID: 18004751 DOI: 10.1002/prot.21741] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The geometry of metal coordination by proteins is well understood, but the evolution of metal binding sites has been less studied. Here we present a study on a small number of well-documented structural calcium and zinc binding sites, concerning how the geometry diverges between relatives, how often nonrelatives converge towards the same structure, and how often these metal binding sites are lost in the course of evolution. Both calcium and zinc binding site structure is observed to be conserved; structural differences between those atoms directly involved in metal binding in related proteins are typically less than 0.5 A root mean square deviation, even in distant relatives. Structural templates representing these conserved calcium and zinc binding sites were used to search the Protein Data Bank for cases where unrelated proteins have converged upon the same residue selection and geometry for metal binding. This allowed us to identify six "archetypal" metal binding site structures: two archetypal zinc binding sites, both of which had independently evolved on a large number of occasions, and four diverse archetypal calcium binding sites, where each had evolved independently on only a handful of occasions. We found that it was common for distant relatives of metal-binding proteins to lack metal-binding capacity. This occurred for 13 of the 18 metal binding sites we studied, even though in some of these cases the original metal had been classified as "essential for protein folding." For most of the calcium binding sites studied (seven out of eleven cases), the lack of metal binding in relatives was due to point mutation of the metal-binding residues, whilst for zinc binding sites, lack of metal binding in relatives always involved more extensive changes, with loss of secondary structural elements or loops around the binding site.
Collapse
Affiliation(s)
- James W Torrance
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB101SD, United Kingdom.
| | | | | |
Collapse
|
19
|
Abstract
Sequence databases can be searched for homologies of zinc coordination motifs with characteristic ligand signatures. Ensuing predictions that 3-10 % of the human genes encodes zinc proteins are most remarkable. But they seem conservative when considering that database mining cannot discover new signatures or coordination environments that employ nonsequential binding of ligands and sulfur-ligand bridges. Predictions also fall short for zinc/protein interactions at protein interfaces and for inhibitory zinc sites. Zinc ions transiently target proteins that are not known to be zinc proteins, adding a hitherto unrecognized dimension to the human zinc proteome. Predicted zinc sites need to be verified experimentally. The metal can be absent or sites may bind metal ions other than zinc because protein coordination environments do not have absolute specificity for zinc. The metaphor of the "galvanization of biology" continues to gain prominence in terms of the sheer number of approximately 3000 human zinc proteins and their annotation with new functions. Clearly, description of zinc proteomes cannot be pursued solely in silico and requires zinc proteomics, an integrated scientific approach. Progress hinges on a combination of bioinformatics, biology, and significantly, analytical and structural chemistry.
Collapse
|
20
|
Oezguen N, Schein CH, Peddi SR, Power TD, Izumi T, Braun W. A "moving metal mechanism" for substrate cleavage by the DNA repair endonuclease APE-1. Proteins 2007; 68:313-23. [PMID: 17427952 DOI: 10.1002/prot.21397] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Apurinic/apyrimidinic endonuclease (APE-1) is essential for base excision repair (BER) of damaged DNA. Here molecular dynamics (MD) simulations of APE1 complexed with cleaved and uncleaved damaged DNA were used to determine the role and position of the metal ion(s) in the active site before and after DNA cleavage. The simulations started from an energy minimized wild-type structure of the metal-free APE1/damaged-DNA complex (1DE8). A grid search with one Mg2+ ion located two low energy clusters of Mg2+ consistent with the experimentally determined metal ion positions. At the start of the longer MD simulations, Mg2+ ions were placed at different positions as seen in the crystal structures and the movement of the ion was followed over the course of the trajectory. Our analysis suggests a "moving metal mechanism" in which one Mg2+ ion moves from the B- (more buried) to the A-site during substrate cleavage. The anticipated inversion of the phosphate oxygens occurs during the in-line cleavage reaction. Experimental results, which show competition between Ca2+ and Mg2+ for catalyzing the reaction, and high concentrations of Mg2+ are inhibitory, indicate that both sites cannot be simultaneously occupied for maximal activity.
Collapse
Affiliation(s)
- Numan Oezguen
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas 77555-0857, USA
| | | | | | | | | | | |
Collapse
|
21
|
Negi SS, Schein CH, Oezguen N, Power TD, Braun W. InterProSurf: a web server for predicting interacting sites on protein surfaces. Bioinformatics 2007; 23:3397-9. [PMID: 17933856 DOI: 10.1093/bioinformatics/btm474] [Citation(s) in RCA: 127] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
UNLABELLED A new web server, InterProSurf, predicts interacting amino acid residues in proteins that are most likely to interact with other proteins, given the 3D structures of subunits of a protein complex. The prediction method is based on solvent accessible surface area of residues in the isolated subunits, a propensity scale for interface residues and a clustering algorithm to identify surface regions with residues of high interface propensities. Here we illustrate the application of InterProSurf to determine which areas of Bacillus anthracis toxins and measles virus hemagglutinin protein interact with their respective cell surface receptors. The computationally predicted regions overlap with those regions previously identified as interface regions by sequence analysis and mutagenesis experiments. AVAILABILITY The InterProSurf web server is available at http://curie.utmb.edu/
Collapse
Affiliation(s)
- Surendra S Negi
- Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, Texas 77555-0857, USA
| | | | | | | | | |
Collapse
|
22
|
Cummins SF, Xie F, de Vries MR, Annangudi SP, Misra M, Degnan BM, Sweedler JV, Nagle GT, Schein CH. Aplysia temptin - the 'glue' in the water-borne attractin pheromone complex. FEBS J 2007; 274:5425-37. [PMID: 17894821 DOI: 10.1111/j.1742-4658.2007.06070.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Temptin, a component of the complex of water-borne protein pheromones that stimulate attraction and mating behavior in the marine mollusk Aplysia, has sequence homology to the epidermal growth factor (EGF)-like domains of higher organisms that mediate protein-cell surface contact during fertilization and blood coagulation. In this work, recombinant temptin for structural and functional studies was produced in Escherichia coli using a cold shock promoter and purified by RP-HPLC. CD spectra confirmed a predominantly beta-sheet structure. Two disulfide bonds were determined via limited proteolysis and MS. One internal disulfide (Cys57-Cys77) was predicted from initial alignments with class I EGF-like domains; the second, between Cys18 and Cys103, could protect temptin against proteolysis in seawater and stabilize its interacting surface. A three-dimensional model of temptin was prepared with our MPACK suite, based on the Ca(2+)-binding, EGF-like domain of the extracellular matrix protein fibrillin. Two temptin residues, Trp52 and Trp79, which align with cysteine residues conserved in fibrillins, lie adjacent to and could stabilize the disulfide bonds and a proposed metal-binding loop. The water-borne pheromone attractin in egg cordon eluates is complexed with other proteins. Docking results with our model and the NMR structure of attractin suggest that one face of temptin interacts with the pheromone, perhaps controlling its access to the cellular receptors. Gel shifts confirmed that temptin complexes with wild-type attractin. These results indicate that temptin, analogous to the role of fibrillin in controlling transforming growth factor-beta concentration, modulates pheromone signaling by direct binding to attractin.
Collapse
Affiliation(s)
- Scott F Cummins
- Department of Neuroscience and Cell Biology, University of Texas Medical Branch, Galveston, TX 77555-0857, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Abstract
MOTIVATION Properly annotated sequence data for flaviviruses, which cause diseases, such as tick-borne encephalitis (TBE), dengue fever (DF), West Nile (WN) and yellow fever (YF), can aid in the design of antiviral drugs and vaccines to prevent their spread. Flavitrack was designed to help identify conserved sequence motifs, interpret mutational and structural data and track evolution of phenotypic properties. SUMMARY Flavitrack contains over 590 complete flavivirus genome/protein sequences and information on known mutations and literature references. Each sequence has been manually annotated according to its date and place of isolation, phenotype and lethality. Internal tools are provided to rapidly determine relationships between viruses in Flavitrack and sequences provided by the user.
Collapse
Affiliation(s)
- Milind Misra
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas 77555-0857, USA.
| | | |
Collapse
|
24
|
Chen D, Menche G, Power TD, Sower L, Peterson JW, Schein CH. Accounting for ligand-bound metal ions in docking small molecules on adenylyl cyclase toxins. Proteins 2007; 67:593-605. [PMID: 17311351 DOI: 10.1002/prot.21249] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The adenylyl cyclase toxins produced by bacteria (such as the edema factor (EF) of Bacillus anthracis and CyaA of Bordetella pertussis) are important virulence factors in anthrax and whooping cough. Co-crystal structures of these proteins differ in the number and positioning of metal ions in the active site. Metal ions bound only to the ligands in the crystal structures are not included during the docking. To determine what effect these "missing" metals have on docking results, the AutoDock, LigandFit/Cerius2, and FlexX programs were compared for their ability to correctly place substrate analogues and inhibitors into the active sites of the crystal structures of EF, CyaA, and mammalian adenylate cyclase. Protonating the phosphates of substrate analogues improved the accuracy of docking into the active site of CyaA, where the grid did not account for one of the three Mg2+ ions in the crystal structure. The AutoDock ranking (based on docking energies) of a test group of compounds was relatively unaffected by protonation of carboxyl groups. However, the ranking by FlexX-ChemScore varied significantly, especially for docking to CyaA, suggesting that alternate protonation states should be tested when screening compound libraries with this program. When the charges on the bound metal were set correctly, AutoDock was the most reliable program of the three tested with respect to positioning substrate analogues and ranking compounds according to their experimentally determined ability to inhibit EF.
Collapse
Affiliation(s)
- Deliang Chen
- Sealy Center for Structural Biology and Molecular Biophysics, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas 77555-0857, USA
| | | | | | | | | | | |
Collapse
|
25
|
Discovering structural motifs using a structural alphabet: application to magnesium-binding sites. BMC Bioinformatics 2007; 8:106. [PMID: 17389049 PMCID: PMC1851716 DOI: 10.1186/1471-2105-8-106] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2006] [Accepted: 03/28/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND For many metalloproteins, sequence motifs characteristic of metal-binding sites have not been found or are so short that they would not be expected to be metal-specific. Striking examples of such metalloproteins are those containing Mg2+, one of the most versatile metal cofactors in cellular biochemistry. Even when Mg2+-proteins share insufficient sequence homology to identify Mg2+-specific sequence motifs, they may still share similarity in the Mg2+-binding site structure. However, no structural motifs characteristic of Mg2+-binding sites have been reported. Thus, our aims are (i) to develop a general method for discovering structural patterns/motifs characteristic of ligand-binding sites, given the 3D protein structures, and (ii) to apply it to Mg2+-proteins sharing <30% sequence identity. Our motif discovery method employs structural alphabet encoding to convert 3D structures to the corresponding 1D structural letter sequences, where the Mg2+-structural motifs are identified as recurring structural patterns. RESULTS The structural alphabet-based motif discovery method has revealed the structural preference of Mg2+-binding sites for certain local/secondary structures: compared to all residues in the Mg2+-proteins, both first and second-shell Mg2+-ligands prefer loops to helices. Even when the Mg2+-proteins share no significant sequence homology, some of them share a similar Mg2+-binding site structure: 4 Mg2+-structural motifs, comprising 21% of the binding sites, were found. In particular, one of the Mg2+-structural motifs found maps to a specific functional group, namely, hydrolases. Furthermore, 2 of the motifs were not found in non metalloproteins or in Ca2+-binding proteins. The structural motifs discovered thus capture some essential biochemical and/or evolutionary properties, and hence may be useful for discovering proteins where Mg2+ plays an important biological role. CONCLUSION The structural motif discovery method presented herein is general and can be applied to any set of proteins with known 3D structures. This new method is timely considering the increasing number of structures for proteins with unknown function that are being solved from structural genomics incentives. For such proteins, which share no significant sequence homology to proteins of known function, the presence of a structural motif that maps to a specific protein function in the structure would suggest likely active/binding sites and a particular biological function.
Collapse
|
26
|
Negi SS, Kolokoltsov AA, Schein CH, Davey RA, Braun W. Determining functionally important amino acid residues of the E1 protein of Venezuelan equine encephalitis virus. J Mol Model 2006; 12:921-9. [PMID: 16607494 DOI: 10.1007/s00894-006-0101-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2005] [Accepted: 01/05/2006] [Indexed: 10/24/2022]
Abstract
A new method for predicting interacting residues in protein complexes, InterProSurf, was applied to the E1 envelope protein of Venezuelan equine encephalitis (VEEV). Monomeric and trimeric models of VEEV-E1 were constructed with our MPACK program, using the crystal structure of the E1 protein of Semliki forest virus as a template. An alignment of the E1 sequences from representative alphavirus sequences was used to determine physical chemical property motifs (likely functional areas) with our PCPMer program. Information on residue variability, propensity to be in protein interfaces, and surface exposure on the model was combined to predict surface clusters likely to interact with other viral or cellular proteins. Mutagenesis of these clusters indicated that the predictions accurately detected areas crucial for virus infection. In addition to the fusion peptide area in domain 2, at least two other surface areas play an important role in virus infection. We propose that these may be sites of interaction between the E1-E1 and E1-E2 subdomains of the envelope proteins that are required to assemble the functional unit. The InterProSurf method is, thus, an important new tool for predicting viral protein interactions. These results can aid in the design of new vaccines against alphaviruses and other viruses.
Collapse
Affiliation(s)
- Surendra S Negi
- Sealy Center for Structural Biology, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-0857, USA
| | | | | | | | | |
Collapse
|
27
|
Schein CH, Zhou B, Braun W. Stereophysicochemical variability plots highlight conserved antigenic areas in Flaviviruses. Virol J 2005; 2:40. [PMID: 15845145 PMCID: PMC1112618 DOI: 10.1186/1743-422x-2-40] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2005] [Accepted: 04/21/2005] [Indexed: 11/18/2022] Open
Abstract
Background Flaviviruses, which include Dengue (DV) and West Nile (WN), mutate in response to immune system pressure. Identifying escape mutants, variant progeny that replicate in the presence of neutralizing antibodies, is a common way to identify functionally important residues of viral proteins. However, the mutations typically occur at variable positions on the viral surface that are not essential for viral replication. Methods are needed to determine the true targets of the neutralizing antibodies. Results Stereophysicochemical variability plots (SVPs), 3-D images of protein structures colored according to variability, as determined by our PCPMer program, were used to visualize residues conserved in their physical chemical properties (PCPs) near escape mutant positions. The analysis showed 1) that escape mutations in the flavivirus envelope protein are variable residues by our criteria and 2) two escape mutants found at the same position in many flaviviruses sit above clusters of conserved residues from different regions of the linear sequence. Conservation patterns in T-cell epitopes in the NS3- protease suggest a similar mechanism of immune system evasion. Conclusion The SVPs add another dimension to structurally defining the binding sites of neutralizing antibodies. They provide a useful aid for determining antigenically important regions and designing vaccines.
Collapse
Affiliation(s)
- Catherine H Schein
- Sealy Center for Structural Biology, Department of Human Biology, Chemistry and Genetics, University of Texas Medical Branch at Galveston, TX, USA
- Department of Microbiology and Immunology, University of Texas Medical Branch at Galveston, TX, USA
| | - Bin Zhou
- Sealy Center for Structural Biology, Department of Human Biology, Chemistry and Genetics, University of Texas Medical Branch at Galveston, TX, USA
| | - Werner Braun
- Sealy Center for Structural Biology, Department of Human Biology, Chemistry and Genetics, University of Texas Medical Branch at Galveston, TX, USA
- Sealy Center for Vaccine Development, Department of Human Biology, Chemistry and Genetics, University of Texas Medical Branch at Galveston, TX, USA
| |
Collapse
|