1
|
Abstract
Antibodies and T cell receptors (TCRs) are the fundamental building blocks of adaptive immunity. Repertoire-scale functionality derives from their epitope-binding properties, just as macroscopic properties like temperature derive from microscopic molecular properties. However, most approaches to repertoire-scale measurement, including sequence diversity and entropy, are not based on antibody or TCR function in this way. Thus, they potentially overlook key features of immunological function. Here we present a framework that describes repertoires in terms of the epitope-binding properties of their constituent antibodies and TCRs, based on analysis of thousands of antibody-antigen and TCR-peptide-major-histocompatibility-complex binding interactions and over 400 high-throughput repertoires. We show that repertoires consist of loose overlapping classes of antibodies and TCRs with similar binding properties. We demonstrate the potential of this framework to distinguish specific responses vs. bystander activation in influenza vaccinees, stratify cytomegalovirus (CMV)-infected cohorts, and identify potential immunological "super-agers." Classes add a valuable dimension to the assessment of immune function.
Collapse
|
2
|
Heads JT, Kelm S, Tyson K, Lawson ADG. A computational method for predicting the aggregation propensity of IgG1 and IgG4(P) mAbs in common storage buffers. MAbs 2022; 14:2138092. [PMID: 36418193 PMCID: PMC9704409 DOI: 10.1080/19420862.2022.2138092] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
The propensity for some monoclonal antibodies (mAbs) to aggregate at physiological and manufacturing pH values can prevent their use as therapeutic molecules or delay time to market. Consequently, developability assessments are essential to select optimum candidates, or inform on mitigation strategies to avoid potential late-stage failures. These studies are typically performed in a range of buffer solutions because factors such as pH can dramatically alter the aggregation propensity of the test mAbs (up to 100-fold in extreme cases). A computational method capable of robustly predicting the aggregation propensity at the pH values of common storage buffers would have substantial value. Here, we describe a mAb aggregation prediction tool (MAPT) that builds on our previously published isotype-dependent, charge-based model of aggregation. We show that the addition of a homology model-derived hydrophobicity descriptor to our electrostatic aggregation model enabled the generation of a robust mAb developability indicator. To contextualize our aggregation scoring system, we analyzed 97 clinical-stage therapeutic mAbs. To further validate our approach, we focused on six mAbs (infliximab, tocilizumab, rituximab, CNTO607, MEDI1912 and MEDI1912_STT) which have been reported to cover a large range of aggregation propensities. The different aggregation propensities of the case study molecules at neutral and slightly acidic pH were correctly predicted, verifying the utility of our computational method.
Collapse
Affiliation(s)
- James T. Heads
- UCB Pharma, 208 Bath Road, SloughSL1 3WE, UK,CONTACT James T. Heads UCB Pharma, 208 Bath Road, SloughSL1 3WE, UK
| | | | - Kerry Tyson
- UCB Pharma, 208 Bath Road, SloughSL1 3WE, UK
| | | |
Collapse
|
3
|
Regy RM, Thompson J, Kim YC, Mittal J. Improved coarse-grained model for studying sequence dependent phase separation of disordered proteins. Protein Sci 2021; 30:1371-1379. [PMID: 33934416 DOI: 10.1002/pro.4094] [Citation(s) in RCA: 81] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 04/27/2021] [Accepted: 04/30/2021] [Indexed: 12/28/2022]
Abstract
We present improvements to the hydropathy scale (HPS) coarse-grained (CG) model for simulating sequence-specific behavior of intrinsically disordered proteins (IDPs), including their liquid-liquid phase separation (LLPS). The previous model based on an atomistic hydropathy scale by Kapcha and Rossky (KR scale) is not able to capture some well-known LLPS trends such as reduced phase separation propensity upon mutations (R-to-K and Y-to-F). Here, we propose to use the Urry hydropathy scale instead, which was derived from the inverse temperature transitions in a model polypeptide with guest residues X. We introduce two free parameters to shift (Δ) and scale (µ) the overall interaction strengths for the new model (HPS-Urry) and use the experimental radius of gyration for a diverse group of IDPs to find their optimal values. Interestingly, many possible (Δ, µ) combinations can be used for typical IDPs, but the phase behavior of a low-complexity (LC) sequence FUS is only well described by one of these models, which highlights the need for a careful validation strategy based on multiple proteins. The CG HPS-Urry model should enable accurate simulations of protein LLPS and provide a microscopically detailed view of molecular interactions.
Collapse
Affiliation(s)
- Roshan Mammen Regy
- Department of Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, Pennsylvania, USA
| | - Jacob Thompson
- Department of Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, Pennsylvania, USA
| | - Young C Kim
- Center for Materials Physics and Technology, Naval Research Laboratory, Washington, District of Columbia, USA
| | - Jeetain Mittal
- Department of Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, Pennsylvania, USA
| |
Collapse
|
4
|
Chwastyk M, Panek EA, Malinowski J, Jaskólski M, Cieplak M. Properties of Cavities in Biological Structures-A Survey of the Protein Data Bank. Front Mol Biosci 2020; 7:591381. [PMID: 33240933 PMCID: PMC7677499 DOI: 10.3389/fmolb.2020.591381] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 10/06/2020] [Indexed: 11/13/2022] Open
Abstract
We performed a PDB-wide survey of proteins to assess their cavity content, using the SPACEBALL algorithm to calculate the cavity volumes. In addition, we determined the hydropathy character of the cavities. We demonstrate that the cavities of most proteins are hydrophilic, but smaller proteins tend to have cavities with hydrophobic walls. We propose criteria for distinguishing between cavities and pockets, and single out proteins with the largest cavities.
Collapse
Affiliation(s)
- Mateusz Chwastyk
- Institute of Physics, Polish Academy of Sciences, Warsaw, Poland
| | - Ewa A Panek
- Department of Biometry, Warsaw University of Life Sciences, Warsaw, Poland
| | - Jan Malinowski
- Faculty of Physics, University of Warsaw, Warsaw, Poland
| | - Mariusz Jaskólski
- Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznań, Poland.,Department of Crystallography, Faculty of Chemistry, Adam Mickiewicz University, Poznań, Poland
| | - Marek Cieplak
- Institute of Physics, Polish Academy of Sciences, Warsaw, Poland
| |
Collapse
|
5
|
Zhao Y, Cieplak M. Proteins at curved fluid-fluid interfaces in a coarse-grained model. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2020; 32:404003. [PMID: 32434179 DOI: 10.1088/1361-648x/ab94f6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 05/20/2020] [Indexed: 06/11/2023]
Abstract
We employ an empirical coarse-grained model with a proposed Gaussian-like interfacial potential to describe proteins at curved fluid-fluid interfaces such as occurring in bubbles and droplets. We consider the air-water and oil-water interfaces. We study the mass distributions and the geometry of the aqueous proteins as a function of the radius of curvature for protein G and two lipid transfer proteins. At curved interfaces the distortion of the proteins is different than at flat interfaces. We find that the proteins come closer to the surface of a bubble than to the surface of similarly curved droplet. In addition, the bubbles adsorb more proteins. We identify the pinning residues. We demonstrate the existence of the second layer in the density profile for sufficiently dense solutions.
Collapse
Affiliation(s)
- Yani Zhao
- Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Marek Cieplak
- Institute of Physics, Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland
| |
Collapse
|
6
|
Hoffmann W, Langenhan J, Huhmann S, Moschner J, Chang R, Accorsi M, Seo J, Rademann J, Meijer G, Koksch B, Bowers MT, von Helden G, Pagel K. Eine intrinsische Hydrophobieskala für Aminosäuren und ihre Anwendung auf fluorierte Verbindungen. Angew Chem Int Ed Engl 2019. [DOI: 10.1002/ange.201813954] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Waldemar Hoffmann
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| | - Jennifer Langenhan
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| | - Susanne Huhmann
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
| | - Johann Moschner
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
| | - Rayoon Chang
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| | - Matteo Accorsi
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
| | - Jongcheol Seo
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
- aktuelle Adresse: University of Science and Technology (POSTECH) Fachbereich Chemie 77 Cheongam-ro Pohang 37673 Republik Korea
| | - Jörg Rademann
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
| | - Gerard Meijer
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| | - Beate Koksch
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
| | - Michael T. Bowers
- University of California Santa Barbara Department of Chemistry & Biochemistry Santa Barbara California 93106 USA
| | - Gert von Helden
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| | - Kevin Pagel
- Freie Universität Berlin Fachbereich für Biologie, Chemie und Pharmazie Takustraße 3 / Königin-Luise-Straße 2+4 14195 Berlin Deutschland
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Abteilung Molekülphysik Faradayweg 4–6 14195 Berlin Deutschland
| |
Collapse
|
7
|
Hoffmann W, Langenhan J, Huhmann S, Moschner J, Chang R, Accorsi M, Seo J, Rademann J, Meijer G, Koksch B, Bowers MT, von Helden G, Pagel K. An Intrinsic Hydrophobicity Scale for Amino Acids and Its Application to Fluorinated Compounds. Angew Chem Int Ed Engl 2019; 58:8216-8220. [PMID: 30958917 DOI: 10.1002/anie.201813954] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 03/01/2019] [Indexed: 11/10/2022]
Abstract
More than 100 hydrophobicity scales have been introduced, with each being based on a distinct condensed-phase approach. However, a comparison of the hydrophobicity values gained from different techniques, and their relative ranking, is not straightforward, as the interactions between the environment and the amino acid are unique to each method. Here, we overcome this limitation by studying the properties of amino acids in the clean-room environment of the gas phase. In the gas phase, entropic contributions from the hydrophobic effect are by default absent and only the polarity of the side chain dictates the self-assembly. This allows for the derivation of a novel hydrophobicity scale, which is based solely on the interaction between individual amino acid units within the cluster and thus more accurately reflects the intrinsic nature of a side chain. This principle can be further applied to classify non-natural derivatives, as shown here for fluorinated amino acid variants.
Collapse
Affiliation(s)
- Waldemar Hoffmann
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany.,Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| | - Jennifer Langenhan
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| | - Susanne Huhmann
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany
| | - Johann Moschner
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany
| | - Rayoon Chang
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany.,Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| | - Matteo Accorsi
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany
| | - Jongcheol Seo
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany.,present address: University of Science and Technology (POSTECH), Department of Chemistry, 77 Cheongam-ro, Pohang, 37673, Korea
| | - Jörg Rademann
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany
| | - Gerard Meijer
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| | - Beate Koksch
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany
| | - Michael T Bowers
- University of California Santa Barbara, Department of Chemistry & Biochemistry, Santa Barbara, California, 93106, USA
| | - Gert von Helden
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| | - Kevin Pagel
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 3/Königin-Luise-Strasse 2+4, 14195, Berlin, Germany.,Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Molecular Physics, Faradayweg 4-6, 14195, Berlin, Germany
| |
Collapse
|
8
|
Topological transformations in proteins: effects of heating and proximity of an interface. Sci Rep 2017; 7:39851. [PMID: 28051124 PMCID: PMC5209716 DOI: 10.1038/srep39851] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2016] [Accepted: 11/28/2016] [Indexed: 01/04/2023] Open
Abstract
Using a structure-based coarse-grained model of proteins, we study the mechanism of unfolding of knotted proteins through heating. We find that the dominant mechanisms of unfolding depend on the temperature applied and are generally distinct from those identified for folding at its optimal temperature. In particular, for shallowly knotted proteins, folding usually involves formation of two loops whereas unfolding through high-temperature heating is dominated by untying of single loops. Untying the knots is found to generally precede unfolding unless the protein is deeply knotted and the heating temperature exceeds a threshold value. We then use a phenomenological model of the air-water interface to show that such an interface can untie shallow knots, but it can also make knots in proteins that are natively unknotted.
Collapse
|
9
|
Zhao Y, Cieplak M. Proteins at air–water and oil–water interfaces in an all-atom model. Phys Chem Chem Phys 2017; 19:25197-25206. [DOI: 10.1039/c7cp03829a] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Proteins with different hydrophobicities are studied at the air–water and oil–water interfaces. The all-atom simulating results are consistent with the coarse-grained interfacial model. Proteins are found to be coupled stronger but diffused slower at the oil–water interface than the air–water one.
Collapse
Affiliation(s)
- Yani Zhao
- Institute of Physics
- Polish Academy of Sciences
- 02-668 Warsaw
- Poland
| | - Marek Cieplak
- Institute of Physics
- Polish Academy of Sciences
- 02-668 Warsaw
- Poland
| |
Collapse
|
10
|
Cieplak M, Allan DB, Leheny RL, Reich DH. Proteins at air-water interfaces: a coarse-grained model. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2014; 30:12888-96. [PMID: 25310625 DOI: 10.1021/la502465m] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
We present a coarse-grained model to describe the adsorption and deformation of proteins at an air-water interface. The interface is introduced empirically in the form of a localized field that couples to a hydropathy scale of amino acids. We consider three kinds of proteins: protein G, egg-white lysozyme, and hydrophobin. We characterize the nature of the deformation and the orientation of the proteins induced by their proximity to and association with the interface. We also study protein diffusion in the layer formed at the interface and show that the diffusion slows with increasing concentration in a manner similar to that for a colloidal suspension approaching the glass transition.
Collapse
Affiliation(s)
- Marek Cieplak
- Institute of Physics, Polish Academy of Sciences , 02-668 Warsaw, Poland
| | | | | | | |
Collapse
|
11
|
Mishra A, Rana PS, Mittal A, Jayaram B. D2N: Distance to the native. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014; 1844:1798-807. [DOI: 10.1016/j.bbapap.2014.07.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Revised: 07/03/2014] [Accepted: 07/15/2014] [Indexed: 12/26/2022]
|
12
|
Capturing native/native like structures with a physico-chemical metric (pcSM) in protein folding. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1834:1520-31. [PMID: 23665455 DOI: 10.1016/j.bbapap.2013.04.023] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2013] [Revised: 04/12/2013] [Accepted: 04/15/2013] [Indexed: 12/15/2022]
Abstract
Specification of the three dimensional structure of a protein from its amino acid sequence, also called a "Grand Challenge" problem, has eluded a solution for over six decades. A modestly successful strategy has evolved over the last couple of decades based on development of scoring functions (e.g. mimicking free energy) that can capture native or native-like structures from an ensemble of decoys generated as plausible candidates for the native structure. A scoring function must be fast enough in discriminating the native from unfolded/misfolded structures, and requires validation on a large data set(s) to generate sufficient confidence in the score. Here we develop a scoring function called pcSM that detects true native structure in the top 5 with 93% accuracy from an ensemble of candidate structures. If we eliminate the native from ensemble of decoys then pcSM is able to capture near native structure (RMSD<=5Ǻ) in top 10 with 86% accuracy. The parameters considered in pcSM are a C-alpha Euclidean metric, secondary structural propensity, surface areas and an intramolecular energy function. pcSM has been tested on 415 systems consisting 142,698 decoys (public and CASP-largest reported hitherto in literature). The average rank for the native is 2.38, a significant improvement over that existing in literature. In-silico protein structure prediction requires robust scoring technique(s). Therefore, pcSM is easily amenable to integration into a successful protein structure prediction strategy. The tool is freely available at http://www.scfbio-iitd.res.in/software/pcsm.jsp.
Collapse
|
13
|
Abstract
Membrane proteins currently receive a lot of attention, in large part thanks to a steady stream of high-resolution X-ray structures. Although the first few structures showed proteins composed of tightly packed bundles of very hydrophobic more or less straight transmembrane α-helices, we now know that helix-bundle membrane proteins can be both highly flexible and contain transmembrane segments that are neither very hydrophobic nor necessarily helical throughout their lengths. This raises questions regarding how membrane proteins are inserted into the membrane and fold in vivo, and also complicates life for bioinformaticians trying to predict membrane protein topology and structure.
Collapse
|
14
|
Abstract
Simulaid performs a large number of simulation-related tasks: interconversion and modification of structure and trajectory files, optimization of orientation, and a large variety of analysis functions. The program can handle structures in PDB (Berman et al., Nucleic Acids Res 2000, 28, 235), Charmm (Brooks et al., J Comput Chem 4, 187) CRD, Amber (Case et al.), Macromodel (Mohamadi et al., J Comput Chem 1990, 11, 440), Gromos/Gromacs (Hess et al.), InsightII (InsightII. Accelrys Inc.: San Diego, 2005), Grasp (Nicholls et al., Proteins: Struct Funct Genet 1991, 11, 281) .crg, Tripos (Tripos International, S. H. R., St. Louis, MO) .mol2 (input only), and in the MMC (Mezei, M.; MMC: Monte Carlo program for molecular assemblies. Available at: http://inka.mssm.edu/~mezei/mmc) formats; and trajectories in the formats of Charmm, Amber, Macromodel, and MMC. Analysis features include (but are not limited to): (1) simple distance calculations and hydrogen-bond analysis, (2) calculation of 2-D RMSD maps (produced both as text file with the data and as a color-coded matrix) and cross RMSD maps between trajectories, (3) clustering based on RMSD maps, (4) analysis of torsion angles, Ramachandran (Ramachandran and Sasiskharan, Adv Protein Chem 1968, 23, 283) angles, proline kink (Visiers et al., Protein Eng 2000, 13, 603) angles, pseudorotational (Altona and Sundaralingam, J Am Chem Soc 1972, 94, 8205; Cremer and Pople, J Am Chem Soc 1975, 97, 1354) angles, and (5) analysis based on circular variance (Mezei, J Mol Graphics Model 2003, 21, 463). Torsion angle evolutions are presented in dial plots (Ravishanker et al., J Biomol Struct Dyn 1989, 6, 669). Several of these features are unique to Simulaid.
Collapse
Affiliation(s)
- Mihaly Mezei
- Department of Structural and Chemical Biology, Mount Sinai School of Medicine, New York, New York 10029, USA.
| |
Collapse
|
15
|
Evidence that the translocon may function as a hydropathy partitioning filter. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2010; 1798:1995-8. [DOI: 10.1016/j.bbamem.2010.07.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2010] [Revised: 06/28/2010] [Accepted: 07/12/2010] [Indexed: 10/19/2022]
|
16
|
Narang P, Bhushan K, Bose S, Jayaram B. A computational pathway for bracketing native-like structures fo small alpha helical globular proteins. Phys Chem Chem Phys 2009; 7:2364-75. [PMID: 19785123 DOI: 10.1039/b502226f] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Impressive advances in the applications of bioinformatics for protein structure prediction coupled with growing structural databases on one hand and the insurmountable time-scale problem with ab initio computational methods on the other continue to raise doubts whether a computational solution to the protein folding problem--categorized as an NP-hard problem--is within reach in the near future. Combining some specially designed biophysical filters and vector algebra tools with ab initio methods, we present here a promising computational pathway for bracketing native-like structures of small alpha helical globular proteins departing from secondary structural information. The automated protocol is initiated by generating multiple structures around the loops between secondary structural elements. A set of knowledge-based biophysical filters namely persistence length and radius of gyration, developed and calibrated on approximately 1000 globular proteins, is introduced to screen the trial structures to filter out improbable candidates for the native and reduce the size of the library of probable structures. The ensemble so generated encompasses a few structures with native-like topology. Monte Carlo optimizations of the loop dihedrals are then carried out to remove steric clashes. The resultant structures are energy minimized and ranked according to a scoring function tested previously on a series of decoy sets vis-a-vis their corresponding natives. We find that the 100 lowest energy structures culled from the ensemble of energy optimized trial structures comprise at least a few to within 3-5 angstroms of the native. Thus the formidable "needle in a haystack" problem is narrowed down to finding an optimal solution amongst a computationally tractable number of alternatives. Encouraging results obtained on twelve small alpha helical globular proteins with the above outlined pathway are presented and discussed.
Collapse
Affiliation(s)
- Pooja Narang
- Department of Chemistry, Indian Institute of Technology, Hauz Khas, New Delhi 110016, India
| | | | | | | |
Collapse
|
17
|
Thorvaldsen S, Ytterstad E. Environmental adaptation of proteins: regression models with simple physicochemical properties. Comput Biol Chem 2009; 33:351-6. [PMID: 19679511 DOI: 10.1016/j.compbiolchem.2009.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2009] [Accepted: 07/01/2009] [Indexed: 10/20/2022]
Abstract
Bio-sequences from ortholog proteins are well suited for statistical inference when the sequences can be divided into ordinal groups based on known environmental features or traits of the host organisms. In this paper two new regression models are described for extracting proteomic trends of extreme environments. The approach is based on physicochemical properties of the amino acids, and may also utilise stratification of the data. We are especially looking for connections of temperature adaptation between the organism and its molecular level. To show the applicability of the methods, we present analyses of genomic data from proteobacteria orders, where we examine the cold adaptation of membrane proteins, intracellular proteins, and the enzyme endonuclease I. Our results confirm earlier findings that redistribution of charge and increase of surface hydrophobicity might be some of the most important signatures for cold adaptation.
Collapse
Affiliation(s)
- Steinar Thorvaldsen
- Department of Mathematics and Statistics, Faculty of Science, University of Tromsø, 9037 Tromsø, Norway.
| | | |
Collapse
|
18
|
Mant CT, Kovacs JM, Kim HM, Pollock DD, Hodges RS. Intrinsic amino acid side-chain hydrophilicity/hydrophobicity coefficients determined by reversed-phase high-performance liquid chromatography of model peptides: comparison with other hydrophilicity/hydrophobicity scales. Biopolymers 2009; 92:573-95. [PMID: 19795449 PMCID: PMC2792893 DOI: 10.1002/bip.21316] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
An accurate determination of the intrinsic hydrophilicity/hydrophobicity of amino acid side-chains in peptides and proteins is fundamental in understanding many area of research, including protein folding and stability, peptide and protein function, protein-protein interactions and peptide/protein oligomerization, as well as the design of protocols for purification and characterization of peptides and proteins. Our definition of intrinsic hydrophilicity/hydrophobicity of side-chains is the maximum possible hydrophilicity/hydrophobicity of side-chains in the absence of any nearest-neighbor effects and/or any conformational effects of the polypeptide chain that prevent full expression of side-chain hydrophilicity/hydrophobicity. In this review, we have compared an experimentally derived intrinsic side-chain hydrophilicity/hydrophobicity scale generated from RP-HPLC retention behavior of de novo designed synthetic model peptides at pH 2 and pH 7 with other RP-HPLC-derived scales, as well as scales generated from classic experimental and calculation-based methods of octanol/water partitioning of Nalpha-acetyl-amino-acid amides or free energy of transfer of free amino acids. Generally poor correlation was found with previous RP-HPLC-derived scales, likely due to the random nature of the peptide mixtures in terms of varying peptide size, conformation and frequency of particular amino acids. In addition, generally poor correlation with the classical approaches served to underline the importance of the presence of a polypeptide backbone when generating intrinsic values. We have shown that the intrinsic scale determined here is in full agreement with the structural characteristics of amino acid side-chains.
Collapse
Affiliation(s)
- Colin T. Mant
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, School of Medicine, Aurora, CO 80045, USA
| | - James M. Kovacs
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, School of Medicine, Aurora, CO 80045, USA
| | - Hyun-Min Kim
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, School of Medicine, Aurora, CO 80045, USA
| | - David D. Pollock
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, School of Medicine, Aurora, CO 80045, USA
| | - Robert S. Hodges
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, School of Medicine, Aurora, CO 80045, USA
| |
Collapse
|
19
|
Momen-Roknabadi A, Sadeghi M, Pezeshk H, Marashi SA. Impact of residue accessible surface area on the prediction of protein secondary structures. BMC Bioinformatics 2008; 9:357. [PMID: 18759992 PMCID: PMC2553345 DOI: 10.1186/1471-2105-9-357] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2007] [Accepted: 08/31/2008] [Indexed: 12/02/2022] Open
Abstract
Background The problem of accurate prediction of protein secondary structure continues to be one of the challenging problems in Bioinformatics. It has been previously suggested that amino acid relative solvent accessibility (RSA) might be an effective factor for increasing the accuracy of protein secondary structure prediction. Previous studies have either used a single constant threshold to classify residues into discrete classes (buries vs. exposed), or used the real-value predicted RSAs in their prediction method. Results We studied the effect of applying different RSA threshold types (namely, fixed thresholds vs. residue-dependent thresholds) on a variety of secondary structure prediction methods. With the consideration of DSSP-assigned RSA values we realized that improvement in the accuracy of prediction strictly depends on the selected threshold(s). Furthermore, we showed that choosing a single threshold for all amino acids is not the best possible parameter. We therefore used residue-dependent thresholds and most of residues showed improvement in prediction. Next, we tried to consider predicted RSA values, since in the real-world problem, protein sequence is the only available information. We first predicted the RSA classes by RVP-net program and then used these data in our method. Using this approach, improvement in prediction was also obtained. Conclusion The success of applying the RSA information on different secondary structure prediction methods suggest that prediction accuracy can be improved independent of prediction approaches. Thus, solvent accessibility can be considered as a rich source of information to help the improvement of these methods.
Collapse
Affiliation(s)
- Amir Momen-Roknabadi
- Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran.
| | | | | | | |
Collapse
|
20
|
Bastolla U, Porto M, Ortíz AR. Local interactions in protein folding determined through an inverse folding model. Proteins 2008; 71:278-99. [DOI: 10.1002/prot.21730] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
21
|
A generalized analysis of hydrophobic and loop clusters within globular protein sequences. BMC STRUCTURAL BIOLOGY 2007; 7:2. [PMID: 17210072 PMCID: PMC1774571 DOI: 10.1186/1472-6807-7-2] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2006] [Accepted: 01/08/2007] [Indexed: 11/14/2022]
Abstract
Background Hydrophobic Cluster Analysis (HCA) is an efficient way to compare highly divergent sequences through the implicit secondary structure information directly derived from hydrophobic clusters. However, its efficiency and application are currently limited by the need of user expertise. In order to help the analysis of HCA plots, we report here the structural preferences of hydrophobic cluster species, which are frequently encountered in globular domains of proteins. These species are characterized only by their hydrophobic/non-hydrophobic dichotomy. This analysis has been extended to loop-forming clusters, using an appropriate loop alphabet. Results The structural behavior of hydrophobic cluster species, which are typical of protein globular domains, was investigated within banks of experimental structures, considered at different levels of sequence redundancy. The 294 more frequent hydrophobic cluster species were analyzed with regard to their association with the different secondary structures (frequencies of association with secondary structures and secondary structure propensities). Hydrophobic cluster species are predominantly associated with regular secondary structures, and a large part (60 %) reveals preferences for α-helices or β-strands. Moreover, the analysis of the hydrophobic cluster amino acid composition generally allows for finer prediction of the regular secondary structure associated with the considered cluster within a cluster species. We also investigated the behavior of loop forming clusters, using a "PGDNS" alphabet. These loop clusters do not overlap with hydrophobic clusters and are highly associated with coils. Finally, the structural information contained in the hydrophobic structural words, as deduced from experimental structures, was compared to the PSI-PRED predictions, revealing that β-strands and especially α-helices are generally over-predicted within the limits of typical β and α hydrophobic clusters. Conclusion The dictionary of hydrophobic clusters described here can help the HCA user to interpret and compare the HCA plots of globular protein sequences, as well as provides an original fundamental insight into the structural bricks of protein folds. Moreover, the novel loop cluster analysis brings additional information for secondary structure prediction on the whole sequence through a generalized cluster analysis (GCA), and not only on regular secondary structures. Such information lays the foundations for developing a new and original tool for secondary structure prediction.
Collapse
|
22
|
Smith TA, Parry DAD. Sequence analyses of Type I and Type II chains in human hair and epithelial keratin intermediate filaments: promiscuous obligate heterodimers, Type II template for molecule formation and a rationale for heterodimer formation. J Struct Biol 2006; 158:344-57. [PMID: 17306560 DOI: 10.1016/j.jsb.2006.12.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2006] [Revised: 12/11/2006] [Accepted: 12/12/2006] [Indexed: 11/16/2022]
Abstract
Sequence comparisons have been undertaken for all hair and epithelial keratin IF chains from a single species--human. The results lead to several new proposals. First, it is clear that not only is the chain structure of the molecule an obligate heterodimer but promiscuous association of Type I and Type II chains must occur in vivo. Second, the higher predicted content of alpha-helix in Type II chains in solution relative to that expected for Type I chains suggests that it is the Type II chains that precede their Type I counterparts and that they may serve as templates for molecule formation. Third, heterodimer formation leads naturally to greater structural and functional specificity, and this may be required not only because keratin IF have more interacting partners in its cell type than other types of IF have in theirs but also because hair and skin IF have two distinct structures that relate to the "reducing" or "oxidizing" environment in which they can find themselves. The transition between the two forms may require specific head/tail interactions and this, it is proposed, would be more easily accomplished by a heterodimer structure with its greater in-built specificity.
Collapse
Affiliation(s)
- Thomasin A Smith
- Institute of Fundamental Sciences, Massey University, Private Bag 11-222, Palmerston North, New Zealand
| | | |
Collapse
|
23
|
Gu J, Gribskov M, Bourne PE. Wiggle-predicting functionally flexible regions from primary sequence. PLoS Comput Biol 2006; 2:e90. [PMID: 16839194 PMCID: PMC1500818 DOI: 10.1371/journal.pcbi.0020090] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2006] [Accepted: 06/02/2006] [Indexed: 11/18/2022] Open
Abstract
The Wiggle series are support vector machine-based predictors that identify regions of functional flexibility using only protein sequence information. Functionally flexible regions are defined as regions that can adopt different conformational states and are assumed to be necessary for bioactivity. Many advances have been made in understanding the relationship between protein sequence and structure. This work contributes to those efforts by making strides to understand the relationship between protein sequence and flexibility. A coarse-grained protein dynamic modeling approach was used to generate the dataset required for support vector machine training. We define our regions of interest based on the participation of residues in correlated large-scale fluctuations. Even with this structure-based approach to computationally define regions of functional flexibility, predictors successfully extract sequence-flexibility relationships that have been experimentally confirmed to be functionally important. Thus, a sequence-based tool to identify flexible regions important for protein function has been created. The ability to identify functional flexibility using a sequence based approach complements structure-based definitions and will be especially useful for the large majority of proteins with unknown structures. The methodology offers promise to identify structural genomics targets amenable to crystallization and the possibility to engineer more flexible or rigid regions within proteins to modify their bioactivity.
Collapse
Affiliation(s)
- Jenny Gu
- Department of Pharmacology and Biomedical Sciences Graduate Program, University of California San Diego, La Jolla, California, USA.
| | | | | |
Collapse
|
24
|
Bastolla U, Porto M, Roman HE, Vendruscolo M. A protein evolution model with independent sites that reproduces site-specific amino acid distributions from the Protein Data Bank. BMC Evol Biol 2006; 6:43. [PMID: 16737532 PMCID: PMC1570368 DOI: 10.1186/1471-2148-6-43] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2005] [Accepted: 05/31/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Since thermodynamic stability is a global property of proteins that has to be conserved during evolution, the selective pressure at a given site of a protein sequence depends on the amino acids present at other sites. However, models of molecular evolution that aim at reconstructing the evolutionary history of macromolecules become computationally intractable if such correlations between sites are explicitly taken into account. RESULTS We introduce an evolutionary model with sites evolving independently under a global constraint on the conservation of structural stability. This model consists of a selection process, which depends on two hydrophobicity parameters that can be computed from protein sequences without any fit, and a mutation process for which we consider various models. It reproduces quantitatively the results of Structurally Constrained Neutral (SCN) simulations of protein evolution in which the stability of the native state is explicitly computed and conserved. We then compare the predicted site-specific amino acid distributions with those sampled from the Protein Data Bank (PDB). The parameters of the mutation model, whose number varies between zero and five, are fitted from the data. The mean correlation coefficient between predicted and observed site-specific amino acid distributions is larger than <r> = 0.70 for a mutation model with no free parameters and no genetic code. In contrast, considering only the mutation process with no selection yields a mean correlation coefficient of <r> = 0.56 with three fitted parameters. The mutation model that best fits the data takes into account increased mutation rate at CpG dinucleotides, yielding <r> = 0.90 with five parameters. CONCLUSION The effective selection process that we propose reproduces well amino acid distributions as observed in the protein sequences in the PDB. Its simplicity makes it very promising for likelihood calculations in phylogenetic studies. Interestingly, in this approach the mutation process influences the effective selection process, i.e. selection and mutation must be entangled in order to obtain effectively independent sites. This interdependence between mutation and selection reflects the deep influence that mutation has on the evolutionary process: The bias in the mutation influences the thermodynamic properties of the evolving proteins, in agreement with comparative studies of bacterial proteomes, and it also influences the rate of accepted mutations.
Collapse
Affiliation(s)
- Ugo Bastolla
- Centro de Biología Molecular "Severo Ochoa", (CSIC-UAM), Cantoblanco, 28049 Madrid, Spain
| | - Markus Porto
- Institut für Festkörperphysik, Technische Universität Darmstadt, Hochschulstr. 8, 64289 Darmstadt, Germany
| | - H Eduardo Roman
- Dipartimento di Fisica, Università di Milano Bicocca, Piazza della Scienza 3, 20126 Milano, Italy
| | - Michele Vendruscolo
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| |
Collapse
|
25
|
Parry DAD. Hendecad repeat in segment 2A and linker L2 of intermediate filament chains implies the possibility of a right-handed coiled-coil structure. J Struct Biol 2006; 155:370-4. [PMID: 16713299 DOI: 10.1016/j.jsb.2006.03.017] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2006] [Accepted: 03/07/2006] [Indexed: 11/15/2022]
Abstract
The conformation adopted by intermediate filament chains (IF) has been described in terms of a central rod domain with four, alpha-helical, left-handed coiled-coil segments (1A, 1B, 2A, and 2B) joined by linkers (L1, L12, and L2, respectively). The rod domain is terminated at its N- and C-terminal ends by "globular" head and tail domains, respectively. This analysis, initially undertaken about 20-25 years ago, was based on the recognition of an underlying heptad substructure in the sequence of the rod domain, the presence of which can be directly associated with an alpha-helical coiled-coil structure. In this work, a hendecad sequence motif that is closely related to the heptad repeat but which is nonetheless significantly different from it has been recognized in the primary structure of segments 2A and linker L2. This motif, which is 11 residues long and structurally equivalent to a true heptad plus another heptad with an inclusive stutter, is consistent with the chains adopting a continuous right-handed coiled-coil structure with a long-period pitch length. It is therefore predicted that segment 2 as a whole may have a coiled-coil conformation with both right-handed (2A+L2) and left-handed (2B) regions. The changeover in handedness would be expected to occur at the C-terminal end of linker L2 and N-terminal end of segment 2B.
Collapse
Affiliation(s)
- David A D Parry
- Institute of Fundamental Sciences, Massey University, Private Bag 11-222, Palmerston North, New Zealand.
| |
Collapse
|
26
|
Abstract
The estrogen receptor-alpha is a wonderfully complex protein important in normal biology, breast cancer, and as a target for anti-cancer agents. We are using the available structures of the hERalpha as well as secondary structure predictions to guide site-directed mutagenesis in order to test the importance of specific interactions and regions in the ligand-regulated activity of the protein. In one area of interest, we are investigating the role of the F domain in the ligand-stimulated activity of the hERalpha. Results from our laboratory and others suggest that the F domain modulates the activity of the hERalpha. In order to better understand the role of the F domain in the hERalpha, we have constructed mutants within this region. Mutations within a predicted alpha-helical region alter the response of the ER to estradiol (E2), eliminate or impair the agonist activity of 4-hydroxytamoxifen (4-OHT), and alter the ability of E2 to overcome 4-OHT's antagonist activity. Deleting the F domain increases the affinity of the receptor for E2; by contrast, mutating a residue in the middle of the predicted helix to a proline does not alter the affinity for E2, but does change the binding mechanism from a positive cooperative to a noncooperative interaction. These and other results show the F domain exhibits substantial functional complexity, and support the idea that this domain modulates the activity of the hERalpha. In a second area of interest, we are investigating the role of hydrophobic and hydrogen-bonding interactions at the start of helix 12 in the activity of the hERalpha. Leucine-536 (L536) has been proposed to participate in hydrophobic interactions that form part of a capping motif stabilizing the start of helix 12. When mutated, the resulting receptors exhibit a reduced response, or even an inverted response, to E2 and 4-OHT on both ERE-driven and AP-1-driven promoters. Interestingly, these mutated receptors also exhibit altered interactions with probes that recognize the agonist-bound and 4-OHT-bound conformations of the ERalpha. Thus, L536 couples the binding of ligand with the conformation of the receptor. Overall, these results show that combining structure-based hypotheses with functional tests of the ER's activity can identify regions and interactions that are important in the ligand-stimulated activity of the protein.
Collapse
Affiliation(s)
- Debra F Skafar
- Department of Physiology, Wayne State University School of Medicine, 540 E. Canfield, Detroit, MI 48201, USA.
| | | |
Collapse
|
27
|
Kovacs JM, Mant CT, Hodges RS. Determination of intrinsic hydrophilicity/hydrophobicity of amino acid side chains in peptides in the absence of nearest-neighbor or conformational effects. Biopolymers 2006; 84:283-97. [PMID: 16315143 PMCID: PMC2744689 DOI: 10.1002/bip.20417] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Understanding the hydrophilicity/hydrophobicity of amino acid side chains in peptides/proteins is one the most important aspects of biology. Though many hydrophilicity/hydrophobicity scales have been generated, an "intrinsic" scale has yet to be achieved. "Intrinsic" implies the maximum possible hydrophilicity/hydrophobicity of side chains in the absence of nearest-neighbor or conformational effects that would decrease the full expression of the side-chain hydrophilicity/hydrophobicity when the side chain is in a polypeptide chain. Such a scale is the fundamental starting point for determining the parameters that affect side-chain hydrophobicity and for quantifying such effects in peptides and proteins. A 10-residue peptide sequence, Ac-X-G-A-K-G-A-G-V-G-L-amide, was designed to enable the determination of the intrinsic values, where position X was substituted by all 20 naturally occurring amino acids and norvaline, norleucine, and ornithine. The coefficients were determined by reversed-phase high-performance liquid chromatography using six different mobile phase conditions involving different pH values (2, 5, and 7), ion-pairing reagents, and the presence and absence of different salts. The results show that the intrinsic hydrophilicity/hydrophobicity of amino acid side chains in peptides (proteins) is independent of pH, buffer conditions, or whether C(8) or C(18) reversed-phase columns were used for 17 side chains (Gly, Ala, Cys, Pro, Val, nVal, Leu, nLeu, Ile, Met, Tyr, Phe, Trp, Ser, Thr, Asn, and Gln) and dependent on pH and buffer conditions, including the type of salt or ion-pairing reagent for potentially charged side chains (Orn, Lys, His, Arg, Asp, and Glu).
Collapse
Affiliation(s)
- James M Kovacs
- Department of Biochemistry and Molecular Genetics, University of Colorado at Denver and Health Sciences Center, Aurora, 80045, USA
| | | | | |
Collapse
|
28
|
Najafabadi HS, Goodarzi H, Torabi N. Optimality of codon usage in Escherichia coli due to load minimization. J Theor Biol 2005; 237:203-9. [PMID: 15932760 DOI: 10.1016/j.jtbi.2005.04.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2005] [Revised: 04/02/2005] [Accepted: 04/04/2005] [Indexed: 11/19/2022]
Abstract
The canonical genetic code is known to be highly efficient in minimizing the effects of mistranslational errors and point mutations, an ability which in term is designated "load minimization". One parameter involved in calculating the load minimizing property of the genetic code is codon usage. In most bacteria, synonymous codons are not used with equal frequencies. Different factors have been proposed to contribute to codon usage preference. It has been shown that the codon preference is correlated with the composition of the tRNA pool. Selection for translational efficiency and translational accuracy both result in such a correlation. In this work, it is shown that codon usage bias in Escherichia coli works so as to minimize the consequences of translational errors, i.e. optimized for load minimization.
Collapse
Affiliation(s)
- Hamed Shateri Najafabadi
- Department of Biotechnology, Faculty of Science, University of Tehran, Enghelab Ave., Tehran, Iran.
| | | | | |
Collapse
|
29
|
Bastolla U, Porto M, Roman HE, Vendruscolo M. Looking at structure, stability, and evolution of proteins through the principal eigenvector of contact matrices and hydrophobicity profiles. Gene 2005; 347:219-30. [PMID: 15777696 DOI: 10.1016/j.gene.2004.12.015] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2004] [Revised: 11/29/2004] [Accepted: 12/10/2004] [Indexed: 11/28/2022]
Abstract
We review and further develop an analytical model that describes how thermodynamic constraints on the stability of the native state influence protein evolution in a site-specific manner. To this end, we represent both protein sequences and protein structures as vectors: structures are represented by the principal eigenvector (PE) of the protein contact matrix, a quantity that resembles closely the effective connectivity of each site; sequences are represented through the "interactivity" of each amino acid type, using novel parameters that are correlated with hydropathy scales. These interactivity parameters are more strongly correlated than the other hydropathy scales that we examine with: (1) the change upon mutations of the unfolding free energy of proteins with two-states thermodynamics; (2) genomic properties as the genome-size and the genome-wide GC content; (3) the main eigenvectors of the substitution matrices. The evolutionary average of the interactivity vector correlates very strongly with the PE of a protein structure. Using this result, we derive an analytic expression for site-specific distributions of amino acids across protein families in the form of Boltzmann distributions whose "inverse temperature" is a function of the PE component. We show that our predictions are in agreement with site-specific amino acid distributions obtained from the Protein Data Bank, and we determine the mutational model that best fits the observed site-specific amino acid distributions. Interestingly, the optimal model almost minimizes the rate at which deleterious mutations are eliminated by natural selection.
Collapse
Affiliation(s)
- Ugo Bastolla
- Centro de Astrobiología, INTA-CSIC, c.tra de Ajalvir km.4, E-28850, Torrejón de Ardoz, Madrid, Spain.
| | | | | | | |
Collapse
|
30
|
Bastolla U, Moya A, Viguera E, van Ham RCHJ. Genomic determinants of protein folding thermodynamics in prokaryotic organisms. J Mol Biol 2004; 343:1451-66. [PMID: 15491623 DOI: 10.1016/j.jmb.2004.08.086] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2004] [Revised: 08/24/2004] [Accepted: 08/27/2004] [Indexed: 02/07/2023]
Abstract
Here we investigate how thermodynamic properties of orthologous proteins are influenced by the genomic environment in which they evolve. We performed a comparative computational study of 21 protein families in 73 prokaryotic species and obtained the following main results. (i) Protein stability with respect to the unfolded state and with respect to misfolding are anticorrelated. There appears to be a trade-off between these two properties, which cannot be optimized simultaneously. (ii) Folding thermodynamic parameters are strongly correlated with two genomic features, genome size and G+C composition. In particular, the normalized energy gap, an indicator of folding efficiency in statistical mechanical models of protein folding, is smaller in proteins of organisms with a small genome size and a compositional bias towards A+T. Such genomic features are characteristic for bacteria with an intracellular lifestyle. We interpret these correlations in light of mutation pressure and natural selection. A mutational bias toward A+T at the DNA level translates into a mutational bias toward more hydrophobic (and in general more interactive) proteins, a consequence of the structure of the genetic code. Increased hydrophobicity renders proteins more stable against unfolding but less stable against misfolding. Proteins with high hydrophobicity and low stability against misfolding occur in organisms with reduced genomes, like obligate intracellular bacteria. We argue that they are fixed because these organisms experience weaker purifying selection due to their small effective population sizes. This interpretation is supported by the observation of a high expression level of chaperones in these bacteria. Our results indicate that the mutational spectrum of a genome and the strength of selection significantly influence protein folding thermodynamics.
Collapse
Affiliation(s)
- Ugo Bastolla
- Centro de Astrobiología (CSIC-INTA), E-28850 Torrejón de Ardoz, Spain.
| | | | | | | |
Collapse
|
31
|
Abstract
Hydrophobic interactions are highly selective, and differences in surface hydrophobicities between proteins can be used as an efficient handle to facilitate protein isolation. Aromatic amino acid residues are of particular importance for molecular recognition because they have a key role in several biological functions. The hydrophobicity of a protein can easily be altered with minor genetic modifications, such as site-directed mutagenesis or fusions of hydrophobic peptide tags. An important advantage of hydrophobic peptide tags over traditional affinity tags is the possibility of exploring simple and inexpensive bioseparation materials. Recent results demonstrate the potential of hydrophobic interaction chromatography and aqueous two-phase systems as tools to study relative hydrophobicities of recombinant proteins with only minor alterations. This review focuses on hydrophobic peptide tags as fusion partners, which can be used as important tools in bioseparation.
Collapse
Affiliation(s)
- Sara Fexby
- Laboratoire de Technologie Enzymatique, UMR 6022 du CNRS, Université de Technologie de Compiègne, B.P. 20529, F-60205 Compiègne Cedex, France.
| | | |
Collapse
|
32
|
Porrello A, Soddu S, Zbilut JP, Crescenzi M, Giuliani A. Discrimination of single amino acid mutations of the p53 protein by means of deterministic singularities of recurrence quantification analysis. Proteins 2004; 55:743-55. [PMID: 15103636 DOI: 10.1002/prot.20075] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
p53 is mutated in roughly 50% of all human tumors, predominantly in the DNA-binding domain codons. Structural, biochemical, and functional studies have reported that the different p53 mutants possess a broad range of behaviors that include the elimination of the tumor-suppression function of wild-type protein, the acquisition of dominant-negative function over the wild-type form, and the establishment of gain-of-function activities. The contribution of each of these types of mutations to tumor progression, grade of malignancy, and response to anticancer treatments has been so far analyzed only for a few "hot-spots." In an attempt to identify new approaches to systematically characterize the complete spectrum of p53 mutations, we applied recurrence quantification analysis (RQA), a non-linear signal analysis technique, to p53 primary structure. Moving from the study of the p53 hydrophobicity pattern, which revealed important similarities with the singular deterministic structuring of prions, we could statistically discriminate, on a pure amino acid sequence basis, between experimentally characterized DNA-contact defective and conformational p53 mutants with a very high percentage of success. This result indicates that RQA is a mathematical tool particularly advantageous for the development of a database of p53 mutations that integrates epidemiological data with structural and functional categorizations.
Collapse
Affiliation(s)
- Alessandro Porrello
- Department of Experimental Oncology, Regina Elena Cancer Institute, Via delle Messi d'Oro, Rome, Italy.
| | | | | | | | | |
Collapse
|
33
|
Biswas KM, DeVido DR, Dorsey JG. Evaluation of methods for measuring amino acid hydrophobicities and interactions. J Chromatogr A 2003; 1000:637-55. [PMID: 12877193 DOI: 10.1016/s0021-9673(03)00182-1] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The concept of hydrophobicity has been addressed by researchers in all aspects of science, particularly in the fields of biology and chemistry. Over the past several decades, the study of the hydrophobicity of biomolecules, particularly amino acids has resulted in the development of a variety of hydrophobicity scales. In this review, we discuss the various methods of measuring amino acid hydrophobicity and provide explanations for the wide range of rankings that exist among these published scales. A discussion of the literature on amino acid interactions is also presented. Only a surprisingly small number of papers exist in this rather important area of research; measuring pairwise amino acid interactions will aid in understanding structural aspects of proteins.
Collapse
Affiliation(s)
- Kallol M Biswas
- Department of Chemistry and Biochemistry, Florida State University, Tallahassee, FL 32306-4390, USA
| | | | | |
Collapse
|
34
|
Zhou H, Zhou Y. Stability scale and atomic solvation parameters extracted from 1023 mutation experiments. Proteins 2002; 49:483-92. [PMID: 12402358 DOI: 10.1002/prot.10241] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The stability scale of 20 amino acid residues is derived from a database of 1023 mutation experiments on 35 proteins. The resulting scale of hydrophobic residues has an excellent correlation with the octanol-to-water transfer free energy corrected with an additional Flory-Huggins molar-volume term (correlation coefficient r = 0.95, slope = 1.05, and a near zero intercept). Thus, hydrophobic contribution to folding stability is characterized remarkably well by transfer experiments. However, no corresponding correlation is found for hydrophilic residues. Both the hydrophilic portion and the entire scale, however, correlate strongly with average burial accessible surface (r = 0.76 and 0.97, respectively). Such a strong correlation leads to a near uniform value of the atomic solvation parameters for atoms C, S, O/N, O(-0.5), and N(+0.5,1). All are in the range of 12-28 cal x mol(-1) A(-2), close to the original estimate of hydrophobic contribution of 25-30 cal x mol(-1) A(-2) to folding stability. Without any adjustable parameters, the new stability scale and new atomic solvation parameters yielded an accurate prediction of protein-protein binding free energy for a separate database of 21 protein-protein complexes (r = 0.80 and slope = 1.06, and r = 0.83 and slope = 0.93, respectively).
Collapse
Affiliation(s)
- Hongyi Zhou
- Howard Hughes Medical Institute Center for Single Molecule Biophysics, Department of Physiology & Biophysics, State University of New York at Buffalo, Buffalo, New York 14214, USA
| | | |
Collapse
|
35
|
Cserzö M, Eisenhaber F, Eisenhaber B, Simon I. On filtering false positive transmembrane protein predictions. Protein Eng Des Sel 2002; 15:745-52. [PMID: 12456873 DOI: 10.1093/protein/15.9.745] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
While helical transmembrane (TM) region prediction tools achieve high (>90%) success rates for real integral membrane proteins, they produce a considerable number of false positive hits in sequences of known nontransmembrane queries. We propose a modification of the dense alignment surface (DAS) method that achieves a substantial decrease in the false positive error rate. Essentially, a sequence that includes possible transmembrane regions is compared in a second step with TM segments in a sequence library of documented transmembrane proteins. If the performance of the query sequence against the library of documented TM segment-containing sequences in this test is lower than an empirical threshold, it is classified as a non-transmembrane protein. The probability of false positive prediction for trusted TM region hits is expressed in terms of E-values. The modified DAS method, the DAS-TMfilter algorithm, has an unchanged high sensitivity for TM segments ( approximately 95% detected in a learning set of 128 documented transmembrane proteins). At the same time, the selectivity measured over a non-redundant set of 526 soluble proteins with known 3D structure is approximately 99%, mainly because a large number of falsely predicted single membrane-pass proteins are eliminated by the DAS-TMfilter algorithm.
Collapse
Affiliation(s)
- Miklos Cserzö
- University of Birmingham, School of Biosciences, Edgbaston, Birmingham B15 2TT, UK.
| | | | | | | |
Collapse
|
36
|
Qi Y, Pei J, Grishin NV. C-terminal domain of gyrase A is predicted to have a beta-propeller structure. Proteins 2002; 47:258-64. [PMID: 11948780 DOI: 10.1002/prot.10090] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Two different type II topoisomerases are known in bacteria. DNA gyrase (Gyr) introduces negative supercoils into DNA. Topoisomerase IV (Par) relaxes DNA supercoils. GyrA and ParC subunits of bacterial type II topoisomerases are involved in breakage and reunion of DNA. The spatial structure of the C-terminal fragment in GyrA/ParC is not available. We infer homology between the C-terminal domain of GyrA/ParC and a regulator of chromosome condensation (RCC1), a eukaryotic protein that functions as a guanine-nucleotide-exchange factor for the nuclear G protein Ran. This homology, complemented by detection of 6 sequence repeats with 4 predicted beta-strands each in GyrA/ParC sequences, allows us to predict that the GyrA/ParC C-terminal domain folds into a 6-bladed beta-propeller. The prediction rationalizes available experimental data and sheds light on the spatial properties of the largest topoisomerase domain that lacks structural information.
Collapse
Affiliation(s)
- Yuan Qi
- Department of Biochemistry, University of Texas Southwestern Medical Center, Texas 75390-9050, USA
| | | | | |
Collapse
|
37
|
Giuliani A, Benigni R, Zbilut JP, Webber CL, Sirabella P, Colosimo A. Nonlinear signal analysis methods in the elucidation of protein sequence-structure relationships. Chem Rev 2002; 102:1471-92. [PMID: 11996541 DOI: 10.1021/cr0101499] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Alessandro Giuliani
- Istituto Superiore di Sanità, TCE Laboratory, Viale Regina Elena 299, 00161 Rome, Italy.
| | | | | | | | | | | |
Collapse
|
38
|
Smith TA, Strelkov SV, Burkhard P, Aebi U, Parry DAD. Sequence comparisons of intermediate filament chains: evidence of a unique functional/structural role for coiled-coil segment 1A and linker L1. J Struct Biol 2002; 137:128-45. [PMID: 12064940 DOI: 10.1006/jsbi.2002.4438] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A comprehensive analysis of the sequences of all types of intermediate filament chains has been undertaken with a particular emphasis on those of segment 1A and linker L1. This has been done to assess whether structural characteristics can be recognized in the sequences that would be consistent with the role of each region in the recently proposed "swinging head" hypothesis. The analyses show that linker L1 is the most flexible rod domain region, that it is the most elongated structure (on a per residue basis), and that it is the most variable region as regards sequence and length. Segment 1A has one of the two most highly conserved regions of sequence in the rod domain (the other being at the end of segment 2B), with seven particular residues conserved across all chain types. It also contains one of the very few potential interchain ionic interactions that could be conserved across all chain types. However, the aggregation of chains in segment 1A is specified less precisely overall by interchain ionic interactions than are the other coiled-coil segments. The apolar residue contents in positions a and d of the heptad substructure are the highest of any coiled-coil segment in the intermediate filament family. Segment 1A also displays an amino acid composition atypical of not only coiled-coil segments 1B and 2B, but indeed of two-stranded coiled coils in general. Nonetheless, molecular modeling based on the crystal structure of the monomeric 1A fragment from human vimentin shows that coiled-coil formation is plausible. The most extensive regions of apolar/aromatic residues lie at the C-terminal end of segment 2B in the helix termination motif and in segment 1A in and close to the helix initiation motif. The predicted stability of the individual alpha-helices in segment 1A is greater than in those comprising segments 1B and 2B, though potential intrachain ionic interactions are either lacking or are minimal in number. Analysis of the 1A sequence and those regions immediately N- and C-terminal to it has shown that the capping residues are near optimal close to the previously predicted ends, thus adding to the likely stability of the alpha-helical structure. However, a second terminating sequence is predicted in 1A (about 10 residues back from the C-terminus). This allows the possibility of some unwinding of the alpha-helical structure of 1A immediately adjacent to linker L1 when the head domains no longer stabilize the coiled-coil structure. All of these data are consistent with the concept of a flexible hinge at L1 and with the ability of the two alpha-helical coiled-coil strands to separate under appropriate conditions and partly unwind at their C-terminal ends to allow the head domains a greater degree of mobility, thus facilitating function.
Collapse
Affiliation(s)
- Thomasin A Smith
- Institute of Fundamental Sciences, Massey University, Palmerston North, New Zealand
| | | | | | | | | |
Collapse
|
39
|
Simon I, Fiser A, Tusnády GE. Predicting protein conformation by statistical methods. BIOCHIMICA ET BIOPHYSICA ACTA 2001; 1549:123-36. [PMID: 11690649 DOI: 10.1016/s0167-4838(01)00253-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The unique folded structure makes a polypeptide a functional protein. The number of known sequences is about a hundred times larger than the number of known structures and the gap is increasing rapidly. The primary goal of all structure prediction methods is to obtain structure-related information on proteins, whose structures have not been determined experimentally. Besides this goal, the development of accurate prediction methods helps to reveal principles of protein folding. Here we present a brief survey of protein structure predictions based on statistical analyses of known sequence and structure data. We discuss the background of these methods and attempt to elucidate principles, which govern structure formation of soluble and membrane proteins.
Collapse
Affiliation(s)
- I Simon
- Institute of Enzymology, BRC, Hungarian Academy of Sciences, Budapest, Hungary.
| | | | | |
Collapse
|