1
|
Chidyausiku TM, Mendes SR, Klima JC, Nadal M, Eckhard U, Roel-Touris J, Houliston S, Guevara T, Haddox HK, Moyer A, Arrowsmith CH, Gomis-Rüth FX, Baker D, Marcos E. De novo design of immunoglobulin-like domains. Nat Commun 2022; 13:5661. [PMID: 36192397 PMCID: PMC9530121 DOI: 10.1038/s41467-022-33004-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 08/17/2022] [Indexed: 11/21/2022] Open
Abstract
Antibodies, and antibody derivatives such as nanobodies, contain immunoglobulin-like (Ig) β-sandwich scaffolds which anchor the hypervariable antigen-binding loops and constitute the largest growing class of drugs. Current engineering strategies for this class of compounds rely on naturally existing Ig frameworks, which can be hard to modify and have limitations in manufacturability, designability and range of action. Here, we develop design rules for the central feature of the Ig fold architecture—the non-local cross-β structure connecting the two β-sheets—and use these to design highly stable Ig domains de novo, confirm their structures through X-ray crystallography, and show they can correctly scaffold functional loops. Our approach opens the door to the design of antibody-like scaffolds with tailored structures and superior biophysical properties. The immunoglobulin domain framework of antibodies has been a long standing design challenge. Here, the authors describe design rules for tailoring these domains and show they can be accurately designed, de novo, with high stability and the ability to scaffold functional loops.
Collapse
Affiliation(s)
- Tamuka M Chidyausiku
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA.,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.,Novartis Institutes for BioMedical Research Inc., San Diego, CA, 92121, USA
| | - Soraia R Mendes
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Jason C Klima
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA.,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA.,Encodia, Inc., San Diego, CA, 92121, USA
| | - Marta Nadal
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Ulrich Eckhard
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Jorge Roel-Touris
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Scott Houliston
- Structural Genomics Consortium, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 2M9, Canada
| | - Tibisay Guevara
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Hugh K Haddox
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Adam Moyer
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Cheryl H Arrowsmith
- Structural Genomics Consortium, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 2M9, Canada
| | - F Xavier Gomis-Rüth
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain.
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA. .,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA. .,Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
| | - Enrique Marcos
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain.
| |
Collapse
|
2
|
Decoding an Amino Acid Sequence to Extract Information on Protein Folding. Molecules 2022; 27:molecules27093020. [PMID: 35566370 PMCID: PMC9106047 DOI: 10.3390/molecules27093020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/01/2022] [Accepted: 05/05/2022] [Indexed: 01/27/2023] Open
Abstract
Protein folding is a complicated phenomenon including various time scales (μs to several s), and various structural indices are required to analyze it. The methodologies used to study this phenomenon also have a wide variety and employ various experimental and computational techniques. Thus, a simple speculation does not serve to understand the folding mechanism of a protein. In the present review, we discuss the recent studies conducted by the author and their colleagues to decode amino acid sequences to obtain information on protein folding. We investigate globin-like proteins, ferredoxin-like fold proteins, IgG-like beta-sandwich fold proteins, lysozyme-like fold proteins and β-trefoil-like fold proteins. Our techniques are based on statistics relating to the inter-residue average distance, and our studies performed so far indicate that the information obtained from these analyses includes data on the protein folding mechanism. The relationships between our results and the actual protein folding phenomena are also discussed.
Collapse
|
3
|
Parry DAD. Structures of the ß-Keratin Filaments and Keratin Intermediate Filaments in the Epidermal Appendages of Birds and Reptiles (Sauropsids). Genes (Basel) 2021; 12:591. [PMID: 33920614 PMCID: PMC8072682 DOI: 10.3390/genes12040591] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 04/13/2021] [Accepted: 04/15/2021] [Indexed: 01/14/2023] Open
Abstract
The epidermal appendages of birds and reptiles (the sauropsids) include claws, scales, and feathers. Each has specialized physical properties that facilitate movement, thermal insulation, defence mechanisms, and/or the catching of prey. The mechanical attributes of each of these appendages originate from its fibril-matrix texture, where the two filamentous structures present, i.e., the corneous ß-proteins (CBP or ß-keratins) that form 3.4 nm diameter filaments and the α-fibrous molecules that form the 7-10 nm diameter keratin intermediate filaments (KIF), provide much of the required tensile properties. The matrix, which is composed of the terminal domains of the KIF molecules and the proteins of the epidermal differentiation complex (EDC) (and which include the terminal domains of the CBP), provides the appendages, with their ability to resist compression and torsion. Only by knowing the detailed structures of the individual components and the manner in which they interact with one another will a full understanding be gained of the physical properties of the tissues as a whole. Towards that end, newly-derived aspects of the detailed conformations of the two filamentous structures will be discussed and then placed in the context of former knowledge.
Collapse
Affiliation(s)
- David A D Parry
- School of Fundamental Sciences, Massey University, Private Bag 11-222, Palmerston North 4442, New Zealand
| |
Collapse
|
4
|
Vance TDR, Ye Q, Conroy B, Davies PL. Essential role of calcium in extending RTX adhesins to their target. JOURNAL OF STRUCTURAL BIOLOGY-X 2020; 4:100036. [PMID: 32984811 PMCID: PMC7493085 DOI: 10.1016/j.yjsbx.2020.100036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 08/12/2020] [Accepted: 09/02/2020] [Indexed: 11/25/2022]
Abstract
Elongated beta-sandwich repeats are a major part of bacterial RTX adhesins. The repeats are arranged in tandem to extend away from the bacterial surface. Calcium ions are coordinated in the linkers between repeats to stiffen the protein. Rigidification of the tandem repeats further helps extension of the adhesin. The repeats differ greatly between species, but all have Ca2+ in their linkers.
RTX adhesins are long, multi-domain proteins present on the outer membrane of many Gram-negative bacteria. From this vantage point, adhesins use their distal ligand-binding domains for surface attachment leading to biofilm formation. To expand the reach of the ligand-binding domains, RTX adhesins maintain a central extender region of multiple tandem repeats, which makes up most of the proteins’ large molecular weight. Alignments of the 10-15-kDa extender domains show low sequence identity between adhesins. Here we have produced and structurally characterized protein constructs of four tandem repeats (tetra-tandemers) from two different RTX adhesins. In comparing the tetra-tandemers to each other and already solved structures from Marinomonas primoryensis and Salmonella enterica, the extender domains fold as diverse beta-sandwich structures with widely differing calcium contents. However, all the tetra-tandemers have at least one calcium ion coordinated in the linker region between beta-sandwich domains whose role appears to be the rigidification of the extender region to help the adhesin extend its reach.
Collapse
Affiliation(s)
- Tyler D R Vance
- Department of Biomedical and Molecular Science, Queen's University Kingston ON, Canada
| | - Qilu Ye
- Department of Biomedical and Molecular Science, Queen's University Kingston ON, Canada
| | - Brigid Conroy
- Department of Biomedical and Molecular Science, Queen's University Kingston ON, Canada
| | - Peter L Davies
- Department of Biomedical and Molecular Science, Queen's University Kingston ON, Canada
| |
Collapse
|
5
|
Aumpuchin P, Hamaue S, Kikuchi T. Prediction of the initial folding sites and the entire folding processes for Ig-like beta-sandwich proteins. Proteins 2019; 88:740-758. [PMID: 31833097 DOI: 10.1002/prot.25862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Revised: 11/13/2019] [Accepted: 12/06/2019] [Indexed: 11/12/2022]
Abstract
Describing the whole story of protein folding is currently the main enigmatic problem in molecular bioinformatics study. Protein folding mechanisms have been intensively investigated with experimental as well as simulation techniques. Since a protein folds into its specific 3D structure from a unique amino acid sequence, it is interesting to extract as much information as possible from the amino acid sequence of a protein. Analyses based on inter-residue average distance statistics and a coarse-grained Gō-model simulation were conducted on Ig and FN3 domains of a titin protein to decode the folding mechanisms from their sequence data and native structure data, respectively. The central region of all domains was predicted to be an initial folding unit, that is, stable in an early state of folding. This common feature coincides well with the experimental results and underscores the significance of the β-sandwich proteins' common structure, namely, the key strands for folding and the Greek-key motif, which is located in the central region. We confirmed that our sequence-based techniques were able to predict the initial folding event just next to the denatured state and that a 3D-based Gō-model simulation can be used to investigate the whole process of protein folding.
Collapse
Affiliation(s)
- Panyavut Aumpuchin
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan
| | - Shoya Hamaue
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan
| | - Takeshi Kikuchi
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan
| |
Collapse
|
6
|
Wang S, Mao C, Liu S. Peptides encoded by noncoding genes: challenges and perspectives. Signal Transduct Target Ther 2019; 4:57. [PMID: 31871775 PMCID: PMC6908703 DOI: 10.1038/s41392-019-0092-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Revised: 10/17/2019] [Accepted: 10/27/2019] [Indexed: 01/01/2023] Open
Abstract
In recent years, noncoding gene (NCG) translation events have been frequently discovered. The resultant peptides, as novel findings in the life sciences, perform unexpected functions of increasingly recognized importance in many fundamental biological and pathological processes. The emergence of these novel peptides, in turn, has advanced the field of genomics while indispensably aiding living organisms. The peptides from NCGs serve as important links between extracellular stimuli and intracellular adjustment mechanisms. These peptides are also important entry points for further exploration of the mysteries of life that may trigger a new round of revolutionary biotechnological discoveries. Insights into NCG-derived peptides will assist in understanding the secrets of life and the causes of diseases, and will also open up new paths to the treatment of diseases such as cancer. Here, a critical review is presented on the action modes and biological functions of the peptides encoded by NCGs. The challenges and future trends in searching for and studying NCG peptides are also critically discussed.
Collapse
Affiliation(s)
- Shuo Wang
- Changhai Hospital, Shanghai, 200433 China
| | - Chuanbin Mao
- Department of Chemistry and Biochemistry, Stephenson Life Sciences Research Center, Institute for Biomedical Engineering, Science and Technology, University of Oklahoma, 101 Stephenson Parkway, Norman, OK 73019-5300 USA
| | | |
Collapse
|
7
|
Press-Sandler O, Miller Y. Distinct Primary Nucleation of Polymorphic Aβ Dimers Yields to Distinguished Fibrillation Pathways. ACS Chem Neurosci 2019; 10:4407-4413. [PMID: 31532176 DOI: 10.1021/acschemneuro.9b00437] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Polymorphic Aβ dimers are the smallest toxic species that play a role in the pathology of Alzheimer's disease. There is great interest in understanding the malfunctions that yield to these toxic species and in providing insights into the molecular mechanisms of the primary nucleation. Herein, we present a first work that demonstrates two distant edges states of Aβ dimers. The first is the so-called "random coil" state dimer that mimics the primary seeding/nucleation that is far from a fibrillation state. The second is the "fibril-like" state dimer that is structurally in close proximity to the fibril, a well-organized state into a fibril-like structure. We show for the first time that a conformational change of one monomer within the dimer impedes primary nucleation, while less fluctuations and relatively large number of interactions in nucleation domains induce the primary nucleation to produce toxic stable species. Overall, the current study exhibits a diversity of primary nucleation in each dimer state, suggesting distinct molecular mechanisms of fibril formation. The conformations of the early stage Aβ dimers that were achieved may provide crucial data for designing inhibitors to impede the primary nucleation.
Collapse
Affiliation(s)
- Olga Press-Sandler
- Department of Chemistry, Ben-Gurion University of the Negev, Be’er Sheva 84105, Israel
- The Ilse Katz Institute for Nanoscale Science & Technology, Ben-Gurion University of the Negev, Be’er Sheva 84105, Israel
| | - Yifat Miller
- Department of Chemistry, Ben-Gurion University of the Negev, Be’er Sheva 84105, Israel
- The Ilse Katz Institute for Nanoscale Science & Technology, Ben-Gurion University of the Negev, Be’er Sheva 84105, Israel
| |
Collapse
|
8
|
Abstract
Copper is a redox-active transition metal ion required for the function of many essential human proteins. For biosynthesis of proteins coordinating copper, the metal may bind before, during or after folding of the polypeptide. If the metal binds to unfolded or partially folded structures of the protein, such coordination may modulate the folding reaction. The molecular understanding of how copper is incorporated into proteins requires descriptions of chemical, thermodynamic, kinetic and structural parameters involved in the formation of protein-metal complexes. Because free copper ions are toxic, living systems have elaborate copper-transport systems that include particular proteins that facilitate efficient and specific delivery of copper ions to target proteins. Therefore, these pathways become an integral part of copper protein folding in vivo. This review summarizes biophysical-molecular in vitro work assessing the role of copper in folding and stability of copper-binding proteins as well as protein-protein copper exchange reactions between human copper transport proteins. We also describe some recent findings about the participation of copper ions and copper proteins in protein misfolding and aggregation reactions in vitro.
Collapse
|
9
|
Sequence Pattern for Supersecondary Structure of Sandwich-Like Proteins. Methods Mol Biol 2019. [PMID: 30945226 DOI: 10.1007/978-1-4939-9161-7_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The goal is to define sequence characteristics of beta-sandwich proteins that are unique for the beta-sandwich supersecondary structure (SSS). Finding of the conserved residues that are critical for protein structure can often be accomplished with homology methods, but these methods are not always adequate as residues with similar structural role do not always occupy the same position as determined by sequence alignment. In this paper, we show how to identify residues that play the same structural role in the different proteins of the same SSS, even when these residue positions cannot be aligned with sequence alignment methods. The SSS characteristics are (a) a set of positions in each strand that are involved in the formation of a hydrophobic core, residue content, and correlations of residues at these key positions, (b) maximum allowable number of "low-frequency residues" for each strand, (c) minimum allowed number of "high-frequency" residues for each loop, and (d) minimum and maximum lengths of each loop. These sequence characteristics are referred to as "sequence pattern" for their respective SSS. The high specificity and sensitivity for a particular SSS are confirmed by applying this pattern to all protein structures in the SCOP data bank. We present here the pattern for one of the most common SSS of beta-sandwich proteins.
Collapse
|
10
|
Aumpuchin P, Kikuchi T. Prediction of folding mechanisms for Ig-like beta sandwich proteins based on inter-residue average distance statistics methods. Proteins 2018; 87:120-135. [PMID: 30520530 DOI: 10.1002/prot.25637] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 10/05/2018] [Accepted: 11/29/2018] [Indexed: 11/11/2022]
Abstract
To understand the folding mechanism of a protein is one of the goals in bioinformatics study. Nowadays, it is enigmatic and difficult to extract folding information from amino acid sequence using standard bioinformatics techniques or even experimental protocols which can be time consuming. To overcome these problems, we aim to extract the initial folding unit for titin protein (Ig and fnIII domains) by means of inter-residue average distance statistics, Average Distance Map (ADM) and contact frequency analysis (F-value). TI I27 and TNfn3 domains are used to represent the Ig-domain and fnIII-domain, respectively. Beta-strands 2, 3, 5, and 6 are significant for the initial folding processes of TI I27. The central strands of TNfn3 were predicted as a primary folding segment. Known 3D structure and unknown 3D structure domains were investigated by structure or non-structure based multiple sequence alignment, respectively, to learn the conserved hydrophobic residues and predicted compact region relevant to evolution. Our results show good correspondence to experimental data, phi-value and protection factor from H-D exchange experiments. The significance of conserved hydrophobic residues near F-value peaks for structural stability using hydrophobic packing is confirmed. Our prediction methods once again could extract a folding mechanism only knowing the amino acid sequence.
Collapse
Affiliation(s)
- Panyavut Aumpuchin
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan
| | - Takeshi Kikuchi
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Shiga, Japan
| |
Collapse
|
11
|
Minami S, Chikenji G, Ota M. Rules for connectivity of secondary structure elements in protein: Two-layer αβ sandwiches. Protein Sci 2017; 26:2257-2267. [PMID: 28856751 DOI: 10.1002/pro.3285] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Revised: 08/21/2017] [Accepted: 08/26/2017] [Indexed: 11/09/2022]
Abstract
In protein structures, the fold is described according to the spatial arrangement of secondary structure elements (SSEs: α-helices and β-strands) and their connectivity. The connectivity or the pattern of links among SSEs is one of the most important factors for understanding the variety of protein folds. In this study, we introduced the connectivity strings that encode the connectivities by using the types, positions, and connections of SSEs, and computationally enumerated all the connectivities of two-layer αβ sandwiches. The calculated connectivities were compared with those in natural proteins determined using MICAN, a nonsequential structure comparison method. For 2α-4β, among 23,000 of all connectivities, only 48 were free from irregular connectivities such as loop crossing. Of these, only 20 were found in natural proteins and the superfamilies were biased toward certain types of connectivities. A similar disproportional distribution was confirmed for most of other spatial arrangements of SSEs in the two-layer αβ sandwiches. We found two connectivity rules that explain the bias well: the abundances of interlayer connecting loops that bridge SSEs in the distinct layers; and nonlocal β-strand pairs, two spatially adjacent β-strands located at discontinuous positions in the amino acid sequence. A two-dimensional plot of these two properties indicated that the two connectivity rules are not independent, which may be interpreted as a rule for the cooperativity of proteins.
Collapse
Affiliation(s)
- Shintaro Minami
- Department of Complex Systems Science, Graduate School of Informatics, Nagoya University, Nagoya, 464-8601, Japan
| | - George Chikenji
- Department of Computational Science and Engineering, Graduate School of Engineering, Nagoya University, Nagoya, 464-8601, Japan
| | - Motonori Ota
- Department of Complex Systems Science, Graduate School of Informatics, Nagoya University, Nagoya, 464-8601, Japan
| |
Collapse
|
12
|
Fraser RDB, Parry DAD. Filamentous Structure of Hard β-Keratins in the Epidermal Appendages of Birds and Reptiles. Subcell Biochem 2017; 82:231-252. [PMID: 28101864 DOI: 10.1007/978-3-319-49674-0_8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The structures of avian and reptilian epidermal appendages, such as feathers, claws and scales, have been modelled using X-ray diffraction and electron microscopy data, combined with sequence analyses. In most cases, a family of closely related molecules makes up the bulk of the appendage, and each of these molecules contains a central β-rich 34-residue segment, which has been identified as the principal component of the framework of the 3.4 nm diameter filaments. The N- and C-terminal segments form the matrix component of the filament/matrix complex. The 34-residue β-rich central domains occur in pairs, related by either a parallel dyad or a perpendicular dyad axis, and form a β-sandwich stabilized by apolar interactions. They are also twisted in a right-handed manner. In feather, the filaments are packed into small sheets and it is possible to determine their likely orientation within the sheets from the low-angle X-ray diffraction data. The physical properties of the various epidermal appendages can be related to the amino acid sequence and composition of defined molecular segments characteristic of the chains concerned.
Collapse
Affiliation(s)
- R D Bruce Fraser
- Institute of Fundamental Sciences, Massey University, Private Bag 11-222, Palmerston North, 4442, New Zealand
- , 28 Satinay Drive, Noosa Parklands, Tewantin, Qld, 4565, Australia
| | - David A D Parry
- Institute of Fundamental Sciences and Riddet Institute, Massey University, Private Bag 11-222, Palmerston North, 4442, New Zealand.
| |
Collapse
|
13
|
Affiliation(s)
- Pernilla Wittung-Stafshede
- Department of Biology and Biological Engineering; Chalmers University of Technology; 41296 Gothenburg Sweden
| |
Collapse
|
14
|
Kister A. Amino acid distribution rules predict protein fold: protein grammar for beta-strand sandwich-like structures. Biomolecules 2015; 5:41-59. [PMID: 25625198 PMCID: PMC4384110 DOI: 10.3390/biom5010041] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Accepted: 12/31/2014] [Indexed: 11/16/2022] Open
Abstract
We present an alternative approach to protein 3D folding prediction based on determination of rules that specify distribution of "favorable" residues, that are mainly responsible for a given fold formation, and "unfavorable" residues, that are incompatible with that fold, in polypeptide sequences. The process of determining favorable and unfavorable residues is iterative. The starting assumptions are based on the general principles of protein structure formation as well as structural features peculiar to a protein fold under investigation. The initial assumptions are tested one-by-one for a set of all known proteins with a given structure. The assumption is accepted as a "rule of amino acid distribution" for the protein fold if it holds true for all, or near all, structures. If the assumption is not accepted as a rule, it can be modified to better fit the data and then tested again in the next step of the iterative search algorithm, or rejected. We determined the set of amino acid distribution rules for a large group of beta sandwich-like proteins characterized by a specific arrangement of strands in two beta sheets. It was shown that this set of rules is highly sensitive (~90%) and very specific (~99%) for identifying sequences of proteins with specified beta sandwich fold structure. The advantage of the proposed approach is that it does not require that query proteins have a high degree of homology to proteins with known structure. So long as the query protein satisfies residue distribution rules, it can be confidently assigned to its respective protein fold. Another advantage of our approach is that it allows for a better understanding of which residues play an essential role in protein fold formation. It may, therefore, facilitate rational protein engineering design.
Collapse
Affiliation(s)
- Alexander Kister
- Department of Mathematics, Rutgers University, Piscataway, NJ 08854, USA.
| |
Collapse
|
15
|
Aksianov E. Motif Analyzer for protein 3D structures. J Struct Biol 2014; 186:62-7. [PMID: 24607867 DOI: 10.1016/j.jsb.2014.02.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 02/26/2014] [Indexed: 11/16/2022]
Abstract
The topology of the protein structure of all-β- or α/β-class is a special arrangement of β-strands within β-sheets (and α-helices surrounding β-sheets) and the order of them along the polypeptide chain. Structural motifs are a subset of strands and/or helices with widely spread topology. Structural motifs are used for classification of protein structure. Because of an increasing variety of known structures, an automatic tool for motif detection is needed. MotAn is an algorithmic detector of structural motifs in a given 3D protein structure. It detects β-hairpins, β-meanders, β-helices, Greek keys, interlocks, jellyrolls, β-α-β-motifs and β-α-β-helices. MotAn was tested on selected SCOP families and shown to be more sensitive detector than the PTGL and PROMOTIF programs. MotAn is available at http://mouse.belozersky.msu.ru/motan.
Collapse
Affiliation(s)
- Evgeniy Aksianov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskye Gory, 1/40, 119992 Moscow, Russia.
| |
Collapse
|
16
|
Abstract
In the present article, we provide a brief overview of the main approaches to analysing the sequence-structure relationship of proteins and outline a novel method of structure prediction. The proposed method involves finding a set of rules that describes a correlation between the distribution of residues in a sequence and the essential structural characteristics of a protein structure. The residue distribution rules specify the 'favourable' residues that are required in certain positions of a polypeptide chain in order for it to assume a particular protein fold, and the 'unfavourable' residues incompatible with the given fold. Identification of amino acid distribution rules derives from examination of inter-residue contacts. We describe residue distribution rules for a large group of β-sandwich-like proteins characterized by a specific arrangement of strands in their two β-sheets. It was shown that this method has very high accuracy (approximately 85%). The advantage of the residue rule approach is that it makes possible prediction of protein folding even in polypeptide chains that have very low global sequence similarities, as low as 18%. Another potential benefit is that a better understanding of which residues play essential roles in a given protein fold may facilitate rational protein engineering design.
Collapse
|
17
|
Guilloux A, Caudron B, Jestin JL. A method to predict edge strands in beta-sheets from protein sequences. Comput Struct Biotechnol J 2013; 7:e201305001. [PMID: 24688737 PMCID: PMC3962219 DOI: 10.5936/csbj.201305001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2013] [Revised: 05/27/2013] [Accepted: 05/30/2013] [Indexed: 12/15/2022] Open
Abstract
There is a need for rules allowing three-dimensional structure information to be derived from protein sequences. In this work, consideration of an elementary protein folding step allows protein sub-sequences which optimize folding to be derived for any given protein sequence. Classical mechanics applied to this system and the energy conservation law during the elementary folding step yields an equation whose solutions are taken over the field of rational numbers. This formalism is applied to beta-sheets containing two edge strands and at least two central strands. The number of protein sub-sequences optimized for folding per amino acid in beta-strands is shown in particular to predict edge strands from protein sequences. Topological information on beta-strands and loops connecting them is derived for protein sequences with a prediction accuracy of 75%. The statistical significance of the finding is given. Applications in protein structure prediction are envisioned such as for the quality assessment of protein structure models.
Collapse
Affiliation(s)
- Antonin Guilloux
- Analyse algébrique, Institut de Mathématiques de Jussieu, Université Pierre et Marie Curie, Paris VI, France
| | - Bernard Caudron
- Centre d'Informatique pour la Biologie, Institut Pasteur, Paris, France
| | | |
Collapse
|
18
|
Subramani A, Floudas CA. β-sheet topology prediction with high precision and recall for β and mixed α/β proteins. PLoS One 2012; 7:e32461. [PMID: 22427840 PMCID: PMC3302896 DOI: 10.1371/journal.pone.0032461] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Accepted: 01/26/2012] [Indexed: 11/19/2022] Open
Abstract
The prediction of the correct -sheet topology for pure and mixed proteins is a critical intermediate step toward the three dimensional protein structure prediction. The predicted beta sheet topology provides distance constraints between sequentially separated residues, which reduces the three dimensional search space for a protein structure prediction algorithm. Here, we present a novel mixed integer linear optimization based framework for the prediction of -sheet topology in and mixed proteins. The objective is to maximize the total strand-to-strand contact potential of the protein. A large number of physical constraints are applied to provide biologically meaningful topology results. The formulation permits the creation of a rank-ordered list of preferred -sheet arrangements. Finally, the generated topologies are re-ranked using a fully atomistic approach involving torsion angle dynamics and clustering. For a large, non-redundant data set of 2102 and mixed proteins with at least 3 strands taken from the PDB, the proposed approach provides the top 5 solutions with average precision and recall greater than 78%. Consistent results are obtained in the -sheet topology prediction for blind targets provided during the CASP8 and CASP9 experiments, as well as for actual and predicted secondary structures. The -sheet topology prediction algorithm, BeST, is available to the scientific community at http://selene.princeton.edu/BeST/.
Collapse
Affiliation(s)
| | - Christodoulos A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
- * E-mail:
| |
Collapse
|
19
|
Role of metal in folding and stability of copper proteins in vitro. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2012; 1823:1594-603. [PMID: 22306006 DOI: 10.1016/j.bbamcr.2012.01.013] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2011] [Revised: 01/09/2012] [Accepted: 01/18/2012] [Indexed: 01/04/2023]
Abstract
Metal coordination is required for function of many proteins. For biosynthesis of proteins coordinating a metal, the question arises if the metal binds before, during or after folding of the polypeptide. Moreover, when the metal is bound to the protein, how does its coordination affect biophysical properties such as stability and dynamics? Understanding how metals are utilized by proteins in cells on a molecular level requires accurate descriptions of the thermodynamic and kinetic parameters involved in protein-metal complexes. Copper is one of the essential transition metals found in the active sites of many key proteins. To avoid toxicity of free copper ions, living systems have developed elaborate copper-transport systems that involve dedicated proteins that facilitate efficient and specific delivery of copper to target proteins. This review describes in vitro and in silico biophysical work assessing the role of copper in folding and stability of copper-binding proteins. Examples of proteins discussed are: a blue-copper protein (Pseudomonas aeruginosa azurin), members of copper-transport systems (bacterial CopZ, human Atox1 and ATP7B domains) and multi-copper ferroxidases (yeast Fet3p and human ceruloplasmin). The consequences of interactions between copper proteins and platinum-complexes are also discussed. This article is part of a Special Issue entitled: Cell Biology of Metals.
Collapse
|
20
|
Abstract
A characteristic feature of the polypeptide chain is its ability to form a restricted set of commonly occurring folding units composed of two or more elements of secondary structure that are adjacent along the chain. Some of these super-secondary structures exhibit a unique handedness and a unique overall fold irrespective of whether they occur in homologous or nonhomologous proteins. Such super-secondary structures are of particular value since they can be used as starting structures in protein modeling. The larger protein folds can be obtained by stepwise addition of other secondary structural elements to the starting structures taking into account a set of simple rules inferred from known principles of protein structure.
Collapse
Affiliation(s)
- Alexander V Efimov
- Institute of Protein Research, Russian Academy of Sciences, Moscow Region, Russia.
| |
Collapse
|
21
|
On the evolutionary origins of "Fold Space Continuity": a study of topological convergence and divergence in mixed alpha-beta domains. J Struct Biol 2010; 172:244-52. [PMID: 20691788 DOI: 10.1016/j.jsb.2010.07.016] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Revised: 06/25/2010] [Accepted: 07/31/2010] [Indexed: 11/21/2022]
Abstract
Existing protein structure classifications group proteins by overall structural similarity at the highest level and by evolutionary relationships at the lowest level, deriving higher-level groups by pairwise structure comparison. For this to be successful requires that large changes in structure are relatively rare in evolution and that proteins with no detectable evolutionary relationship do not converge on similar global chain conformations since this creates conflicts between structural and evolutionary consistency. Analysis of global structural changes using core topological descriptions for 4261 domains from classes C and D of the SCOP database and new measures of topological distance and consistency of classification showed that the topological consistency of SCOP folds is highly variable with some folds having no consistent description and significant overlaps between groups including some members of separate folds with identical topological descriptions. Topological clustering shows that including sufficient indels to allow family members to be joined would also require joining several distinct folds. We conclude that evolutionary changes in the global topology of protein domains are the root cause of many difficulties for present approaches to structure classification using pairwise comparison. As a resolution we propose that a purely structural classification should be created using an approach similar to that adopted by the Gene Ontology in which proteins are assigned labels describing structure.
Collapse
|
22
|
Hills RD, Kathuria SV, Wallace LA, Day IJ, Brooks CL, Matthews CR. Topological frustration in beta alpha-repeat proteins: sequence diversity modulates the conserved folding mechanisms of alpha/beta/alpha sandwich proteins. J Mol Biol 2010; 398:332-50. [PMID: 20226790 DOI: 10.1016/j.jmb.2010.03.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2009] [Revised: 02/27/2010] [Accepted: 03/03/2010] [Indexed: 10/19/2022]
Abstract
The thermodynamic hypothesis of Anfinsen postulates that structures and stabilities of globular proteins are determined by their amino acid sequences. Chain topology, however, is known to influence the folding reaction, in that motifs with a preponderance of local interactions typically fold more rapidly than those with a larger fraction of nonlocal interactions. Together, the topology and sequence can modulate the energy landscape and influence the rate at which the protein folds to the native conformation. To explore the relationship of sequence and topology in the folding of beta alpha-repeat proteins, which are dominated by local interactions, we performed a combined experimental and simulation analysis on two members of the flavodoxin-like, alpha/beta/alpha sandwich fold. Spo0F and the N-terminal receiver domain of NtrC (NT-NtrC) have similar topologies but low sequence identity, enabling a test of the effects of sequence on folding. Experimental results demonstrated that both response-regulator proteins fold via parallel channels through highly structured submillisecond intermediates before accessing their cis prolyl peptide bond-containing native conformations. Global analysis of the experimental results preferentially places these intermediates off the productive folding pathway. Sequence-sensitive Gō-model simulations conclude that frustration in the folding in Spo0F, corresponding to the appearance of the off-pathway intermediate, reflects competition for intra-subdomain van der Waals contacts between its N- and C-terminal subdomains. The extent of transient, premature structure appears to correlate with the number of isoleucine, leucine, and valine (ILV) side chains that form a large sequence-local cluster involving the central beta-sheet and helices alpha2, alpha 3, and alpha 4. The failure to detect the off-pathway species in the simulations of NT-NtrC may reflect the reduced number of ILV side chains in its corresponding hydrophobic cluster. The location of the hydrophobic clusters in the structure may also be related to the differing functional properties of these response regulators. Comparison with the results of previous experimental and simulation analyses on the homologous CheY argues that prematurely folded unproductive intermediates are a common property of the beta alpha-repeat motif.
Collapse
Affiliation(s)
- Ronald D Hills
- Department of Molecular Biology and Kellogg School of Science and Technology, The Scripps Research Institute, 10550 North Torrey Pines Road TPC6, La Jolla, CA 92037, USA
| | | | | | | | | | | |
Collapse
|
23
|
Papatheodorou TS, Fokas AS. Systematic construction and prediction of the arrangement of the strands of sandwich proteins. J R Soc Interface 2009; 6:63-73. [PMID: 18586634 DOI: 10.1098/rsif.2008.0192] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The problem of predicting the three-dimensional structure of a protein starting from its amino acid sequence is regarded as one of the most important open problems in biology. Here, we solve aspects of this problem for the so-called sandwich proteins that constitute a large class of proteins consisting of only beta-strands arranged in two sheets. A breakthrough for this class of proteins was announced in Kister et al. (Kister et al. 2002 Proc. Natl Acad. Sci. USA 99, 14 137-14 141), in which it was shown that sandwich proteins contain a certain invariant substructure called interlock. It was later noted that approximately 90% of the observed sandwich proteins are canonical, namely they are generated by certain geometrical structures. Here, employing a topological investigation, we prove that interlocks and geometrical structures are the direct consequence of certain biologically motivated fundamental principles. Furthermore, we construct all possible canonical motifs involving 6-10 strands. This construction limits dramatically the number of possible motifs. For example, for sandwich proteins with nine strands, the a priori number of possible canonical motifs exceeds 360000, whereas our construction yields only 49 geometrical structures and 625 canonical motifs.
Collapse
Affiliation(s)
- T S Papatheodorou
- Department of Computer Engineering and Informatics, University of Patras, 265 04 Patras, Greece.
| | | |
Collapse
|
24
|
Hussain F, Sedlak E, Wittung-Stafshede P. Role of copper in folding and stability of cupredoxin-like copper-carrier protein CopC. Arch Biochem Biophys 2007; 467:58-66. [PMID: 17889826 DOI: 10.1016/j.abb.2007.08.014] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2007] [Revised: 08/06/2007] [Accepted: 08/10/2007] [Indexed: 11/29/2022]
Abstract
CopC is a periplasmic copper carrier that, in contrast to cytoplasmic copper chaperones, has a beta-barrel fold and two metal-binding sites distinct for Cu(II) and Cu(I). The copper sites are located in each end of the molecule: the Cu(I) site involves His and Met coordination whereas the Cu(II) site consists of charged residues. To reveal biophysical properties of this protein, we have explored the effects of the cofactors on CopC unfolding in vitro. We demonstrate that Cu(II) coordination affects both protein stability and unfolding pathway, whereas Cu(I) has only a small effect on stability. Apo-CopC unfolds in a two-state reaction between pH 4 and 7.5 with maximal stability at pH 6. In contrast, Cu(II)-CopC unfolds in a three-state reaction at pH6 that involves a partly folded intermediate that retains Cu(II). This intermediate exhibits high thermal and chemical stability. Unique energetic and structural properties of different metalated CopC forms may help facilitate metal transport to many partners in vivo.
Collapse
Affiliation(s)
- Faiza Hussain
- Department of Biochemistry and Cell Biology, Rice University, 6100 Main Street, Houston, TX 77251, USA
| | | | | |
Collapse
|
25
|
Chiang YS, Gelfand TI, Kister AE, Gelfand IM. New classification of supersecondary structures of sandwich-like proteins uncovers strict patterns of strand assemblage. Proteins 2007; 68:915-21. [PMID: 17557333 DOI: 10.1002/prot.21473] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
To describe the supersecondary structure (SSS) of beta sandwich-like proteins (SPs), we introduce a structural unit called the "strandon." A strandon is defined as a set of sequentially consecutive strands connected by hydrogen bonds in 3D structures. Representing beta-proteins as the assembly of strandons exposes the underlying similarities in their SSS and enables us to construct a novel classification scheme of SPs. Classification of all known SPs is based on shared supersecondary structural features and is presented in the SSS database (http://binfs.umdnj.edu/sssdb/). Analysis of the SSS reveals two common specific patterns. The first pattern defines the arrangement of strandons and was found in 95% of all examined SPs. The second pattern establishes the ordering of strands in the protein domain and was observed in 82% of the analyzed SPs. Knowledge of these two patterns that uncover the spatial arrangement of strands will likely prove useful in protein structure prediction.
Collapse
Affiliation(s)
- Yih-Shien Chiang
- Department of Health Informatics, SHRP, University of Medicine and Dentistry of New Jersey, Newark, New Jersey 07107, USA
| | | | | | | |
Collapse
|
26
|
Stout J, Van Driessche G, Savvides SN, Van Beeumen J. X-ray crystallographic analysis of the sulfur carrier protein SoxY from Chlorobium limicola f. thiosulfatophilum reveals a tetrameric structure. Protein Sci 2007; 16:589-601. [PMID: 17327392 PMCID: PMC2203348 DOI: 10.1110/ps.062633607] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Dissimilatory oxidation of thiosulfate in the green sulfur bacterium Chlorobium limicola f. thiosulfatophilum is carried out by the ubiquitous sulfur-oxidizing (Sox) multi-enzyme system. In this system, SoxY plays a key role, functioning as the sulfur substrate-binding protein that offers its sulfur substrate, which is covalently bound to a conserved C-terminal cysteine, to another oxidizing Sox enzyme. Here, we report the crystal structures of a stand-alone SoxY protein of C. limicola f. thiosulfatophilum, solved at 2.15 A and 2.40 A resolution using X-ray diffraction data collected at 100 K and room temperature, respectively. The structure reveals a monomeric Ig-like protein, with an N-terminal alpha-helix, that oligomerizes into a tetramer via conserved contact regions between the monomers. The tetramer can be described as a dimer of dimers that exhibits one large hydrophobic contact region in each dimer and two small hydrophilic interface patches in the tetramer. At the tetramer interface patch, two conserved redox-active C-terminal cysteines form an intersubunit disulfide bridge. Intriguingly, SoxY exhibits a dimer/tetramer equilibrium that is dependent on the redox state of the cysteines and on the type of sulfur substrate component bound to them. Taken together, the dimer/tetramer equilibrium, the specific interactions between the subunits in the tetramer, and the significant conservation level of the interfaces strongly indicate that these SoxY oligomers are biologically relevant.
Collapse
Affiliation(s)
- Jan Stout
- Laboratory of Protein Biochemistry and Protein Engineering, Department of Biochemistry, Microbiology and Physiology, Ghent University, 9000 Ghent, B-Belgium
| | | | | | | |
Collapse
|
27
|
Bradley P, Baker D. Improved beta-protein structure prediction by multilevel optimization of nonlocal strand pairings and local backbone conformation. Proteins 2007; 65:922-9. [PMID: 17034045 DOI: 10.1002/prot.21133] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Proteins with complex, nonlocal beta-sheets are challenging for de novo structure prediction, due in part to the difficulty of efficiently sampling long-range strand pairings. We present a new, multilevel approach to beta-sheet structure prediction that circumvents this difficulty by reformulating structure generation in terms of a folding tree. Nonlocal connections in this tree allow us to explicitly sample alternative beta-strand pairings while simultaneously exploring local conformational space using backbone torsion-space moves. An iterative, energy-biased resampling strategy is used to explore the space of beta-strand pairings; we expect that such a strategy will be generally useful for searching large conformational spaces with a high degree of combinatorial complexity.
Collapse
Affiliation(s)
- Philip Bradley
- University of Washington, Seattle, Washington 98195, USA
| | | |
Collapse
|
28
|
Karyagina A, Ershova A, Titov M, Olovnikov I, Aksianov E, Ryazanova A, Kubareva E, Spirin S, Alexeevski A. Analysis of conserved hydrophobic cores in proteins and supramolecular complexes. J Bioinform Comput Biol 2006; 4:357-72. [PMID: 16819788 DOI: 10.1142/s0219720006001837] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2005] [Revised: 11/24/2005] [Accepted: 12/09/2005] [Indexed: 11/18/2022]
Abstract
The conserved hydrophobic core is an important feature of a family of protein domains. We suggest a procedure for finding and the analysis of conserved hydrophobic cores. The procedure is based on using an original program called CluD (http://monkey.belozersky.msu.ru/CluD/cgi-bin/hftri.pl). Conserved hydrophobic cores of several families including homeodomains and interlock-containing domains are described. Hydrophobic clusters on some protein-DNA and protein-protein interfaces were also analyzed.
Collapse
Affiliation(s)
- Anna Karyagina
- Gamaleya Institute of Epidemiology and Microbiology, Gamaleya Str. 18, Moscow 123098, Russia.
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Chen M, Wilson CJ, Wu Y, Wittung-Stafshede P, Ma J. Correlation between Protein Stability Cores and Protein Folding Kinetics: A Case Study on Pseudomonas aeruginosa Apo-Azurin. Structure 2006; 14:1401-10. [PMID: 16962971 DOI: 10.1016/j.str.2006.07.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2006] [Revised: 06/30/2006] [Accepted: 07/13/2006] [Indexed: 10/24/2022]
Abstract
This paper reports a combined computational and experimental study of the correlation between protein stability cores and folding kinetics. An empirical potential function was developed, and it was used for analyzing interaction energies among secondary structure elements. Studies on a beta sandwich protein, Pseudomonas aeruginosa azurin, showed that the computationally identified substructure with the strongest interactions in the native state is identical to the "interlocked pair" of beta strands, an invariant motif found in most sandwich-like proteins. Moreover, previous and new in vitro folding results revealed that the identified substructure harbors most residues that form native-like interactions in the folding transition state. These observations demonstrate that the potential function is effective in revealing the relative strength of interactions among various protein parts; they also strengthen the suggestion that the most stable regions in native proteins favor stable interactions early during folding.
Collapse
Affiliation(s)
- Mingzhi Chen
- Graduate Program of Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas 77030, USA
| | | | | | | | | |
Collapse
|
30
|
Kister AE, Fokas AS, Papatheodorou TS, Gelfand IM. Strict rules determine arrangements of strands in sandwich proteins. Proc Natl Acad Sci U S A 2006; 103:4107-10. [PMID: 16537492 PMCID: PMC1449654 DOI: 10.1073/pnas.0510747103] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
From a computer analysis of the spatial organization of the secondary structures of beta-sandwich proteins, we find certain sets of consecutive strands that are connected by hydrogen bonds, which we call "strandons." The analysis of the arrangements of strandons in 491 protein structures that come from 69 different superfamilies reveals strict regularities in the arrangements of strandons and the formation of what we call "canonical supermotifs." Six such supermotifs account for approximately 90% of all observed structures. Simple geometric rules are described that dictate the formation of these supermotifs.
Collapse
Affiliation(s)
- A. E. Kister
- *Department of Health Informatics, School of Health Related Professions, University of Medicine and Dentistry of New Jersey, Newark, NJ 07107
- To whom correspondence may be addressed. E-mail:
or
| | - A. S. Fokas
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom
| | - T. S. Papatheodorou
- High Performance Computing Laboratory, Department of Computer Engineering and Informatics, University of Patras, Patras 26500, Greece; and
| | - I. M. Gelfand
- Department of Mathematics, Rutgers, The State University of New Jersey, Piscataway, NJ 08855
- To whom correspondence may be addressed. E-mail:
or
| |
Collapse
|
31
|
Fokas AS, Papatheodorou TS, Kister AE, Gelfand IM. A geometric construction determines all permissible strand arrangements of sandwich proteins. Proc Natl Acad Sci U S A 2005; 102:15851-3. [PMID: 16249331 PMCID: PMC1276083 DOI: 10.1073/pnas.0507335102] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
For a large class of proteins called sandwich-like proteins (SPs), the secondary structures consist of two beta-sheets packed face-to-face, with each beta-sheet consisting typically of three to five beta-strands. An important step in the prediction of the three-dimensional structure of a SP is the prediction of its supersecondary structure, namely the prediction of the arrangement of the beta-strands in the two beta-sheets. Recently, significant progress in this direction was made, where it was shown that 91% of observed SPs form what we here call "canonical motifs." Here, we show that all canonical motifs can be constructed in a simple manner that is based on thermodynamic considerations and uses certain geometric structures. The number of these structures is much smaller than the number of possible strand arrangements. For instance, whereas for SPs consisting of six strands there exist a priori 900 possible strand arrangements, there exist only five geometric structures. Furthermore, the few motifs that are noncanonial can be constructed from canonical motifs by a simple procedure.
Collapse
Affiliation(s)
- A S Fokas
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom.
| | | | | | | |
Collapse
|
32
|
Sandberg A, Leckner J, Karlsson BG. Apo-azurin folds via an intermediate that resembles the molten-globule. Protein Sci 2005; 13:2628-38. [PMID: 15388858 PMCID: PMC2286560 DOI: 10.1110/ps.04848204] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The folding of Pseudomonas aeruginosa apo-azurin was investigated with the intent of identifying putative intermediates. Two apo-mutants were constructed by replacing the main metal-binding ligand C112 with a serine (C112S) and an alanine (C112A). The guanidinium-induced unfolding free energies (DeltaG(U-N)(H2O)) of the C112S and C112A mutants were measured to 36.8 +/- 1 kJ mole(-1) and 26.1 +/- 1 kJ mole(-1), respectively, and the m-value of the transition to 23.5 +/- 0.7 kJ mole(-1) M(-1). The difference in folding free energy (DeltaDeltaG(U-N)(H2O)) is largely attributed to the intramolecular hydrogen bonding properties of the serine Ogamma in the C112S mutant, which is lacking in the C112A structure. Furthermore, only the unfolding rates differ between the two mutants, thus pointing to the energy of the native state as the source of the observed Delta DeltaG(U-N)(H2O). This also indicates that the formation of the hydrogen bonds present in C112S but absent in C112A is a late event in the folding of the apo-protein, thus suggesting that formation of the metal-binding site occurs after the rate-limiting formation of the transition state. In both mutants we also noted a burst-phase intermediate. Because this intermediate was capable of binding 1-anilinonaphtalene-8-sulfonate (ANS), as were an acid-induced species at pH 2.6, we ascribe it molten globule-like status. However, despite the presence of an intermediate, the folding of apo-azurin C112S is well approximated by a two-state kinetic mechanism.
Collapse
Affiliation(s)
- Anders Sandberg
- Department of Chemistry, Göteborg University, Göteborg, Sweden
| | | | | |
Collapse
|
33
|
Wilson CJ, Wittung-Stafshede P. Role of structural determinants in folding of the sandwich-like protein Pseudomonas aeruginosa azurin. Proc Natl Acad Sci U S A 2005; 102:3984-7. [PMID: 15753320 PMCID: PMC554837 DOI: 10.1073/pnas.0501038102] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An invariant substructure that forms two interlocked pairs of neighboring beta-strands occurs in essentially all known sandwich-like proteins. Eight conserved positions in these strands were recently shown to act as structural determinants. To test whether the residues at these invariant positions are conserved for mechanistic (i.e., part of folding nucleus) or energetic (i.e., governing native-state stability) reasons, we characterized the folding behavior of eight point-mutated variants of the sandwich-like protein Pseudomonas aeruginosa apo-azurin. We find a simple relationship among the conserved positions: half of the residues form native-like interactions in the folding transition state, whereas the others do not participate in the folding nucleus but govern high native-state stability. Thus, evolutionary preservation of these specific positions gives both mechanistic and energetic advantages to members of the sandwich-like protein family.
Collapse
Affiliation(s)
- Corey J Wilson
- Department of Biochemistry and Cell Biology, Keck Center for Structural Computational Biology, Rice University, 6100 Main Street, Houston, TX 77251, USA
| | | |
Collapse
|
34
|
Abstract
Many proteins in living cells coordinate cofactors, such as metal ions, to attain their activity. Since the cofactors in such cases often can interact with their corresponding unfolded polypeptides in vitro, it is important to unravel how cofactors modulate protein folding. In this review, I will discuss the role of cofactors in folding of the blue-copper protein Pseudomonas aeruginosa azurin. In the case of both copper (Cu(II) and Cu(I)) and zinc (Zn(II)), the metal can bind to unfolded azurin. The residues involved in copper (Cu(II) and Cu(I)) coordination in the unfolded state have been identified as Cys112, His117, and Met121. The affinities of Cu(II), Cu(I), and Zn(II) are all higher for the folded than for the unfolded azurin polypeptide, resulting in metal stabilization of the native state as compared to the stability of apo-azurin. Cu(II), Zn(II), and several apo forms of azurin all fold in two-state kinetic reactions with roughly identical polypeptide-folding speeds. This suggests that the native-state beta-barrel topology, not cofactor interactions or thermodynamic stability, determines azurin's folding barrier. Nonetheless, copper binds much more rapidly (i.e., 4 orders of magnitude) to unfolded azurin than to folded azurin. Therefore, the fastest route to functional azurin is through copper binding before polypeptide folding; this sequence of events may be the relevant biological pathway.
Collapse
Affiliation(s)
- Pernilla Wittung-Stafshede
- Department of Biochemistry & Cell Biology and Department of Chemistry, Rice University, Houston, Texas 77251, USA.
| |
Collapse
|
35
|
Fokas AS, Gelfand IM, Kister AE. Prediction of the structural motifs of sandwich proteins. Proc Natl Acad Sci U S A 2004; 101:16780-3. [PMID: 15550537 PMCID: PMC534733 DOI: 10.1073/pnas.0407570101] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We investigate the supersecondary structure of a large group of proteins, the so-called sandwich proteins. The analysis of a large number of such proteins has led us to propose a set of rules that can be used to predict the possible arrangements of strands in the two beta-sheets forming a given sandwich structure. These rules imply the existence of certain invariant supersecondary substructures common to all sandwich proteins. Furthermore, they dramatically restrict the number of permissible arrangements. For example, whereas for proteins consisting of three strands in each beta-sheet 180 possible strand arrangements exist a priori, our rules imply that only 15 of them are permissible. Five of these predicted arrangements describe all currently known sandwich proteins with six strands.
Collapse
Affiliation(s)
- A S Fokas
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom
| | | | | |
Collapse
|
36
|
Potapov V, Sobolev V, Edelman M, Kister A, Gelfand I. Protein--protein recognition: juxtaposition of domain and interface cores in immunoglobulins and other sandwich-like proteins. J Mol Biol 2004; 342:665-79. [PMID: 15327963 DOI: 10.1016/j.jmb.2004.06.072] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Revised: 06/23/2004] [Accepted: 06/28/2004] [Indexed: 11/16/2022]
Abstract
Structural analysis of a non-redundant data set of 47 immunoglobulin (Ig) proteins was carried out using a combination of criteria: atom--atom contact compatibility, position occupancy rate, conservation of residue type and positional conservation in 3D space. Our analysis shows that roughly half of the interface positions between the light and heavy chains are specific to individual structures while the other half are conserved across the database. The tendency for conservation of a primary subset of positions holds true for the intra-domain faces as well. These subsets, with an average of 12 conserved positions and a contact surface of 630 A(2), delineate the inter- and intra-domain core, a refined instrument with a reduced target for analysis of sheet--sheet interactions in sandwich-like proteins. Employing this instrument, we find that a majority of Ig interface core positions are adjoined in sequence to domain core positions. This was derived independent of geometric considerations, however beta-sheet side-chain geometry clearly dictates it. The geometric wedding of the domain and interface cores supports the concept of a rigid-like substructure on the protein surface involved in complex formation and indicates a close relationship between surface determinants and those involved in protein folding of Ig domains. The definitions developed for the Ig interface and domain cores proved satisfactory to extract first-approximation cores for a group of 24 non-Ig sandwich-like proteins, treated as individual structures due to their diverse strand topologies. We show that the same rule of positional connectivity between the rigid domain core and interface core extends generally to sandwich-like proteins interacting in a sheet--sheet fashion. The non-Ig structures were used as templates to analyze sandwich-like interfaces of unresolved homologous proteins using a database merging structure and sequence conservation.
Collapse
Affiliation(s)
- Vladimir Potapov
- Department of Plant Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | | | | | |
Collapse
|
37
|
Engman KC, Sandberg A, Leckner J, Karlsson BG. Probing the influence on folding behavior of structurally conserved core residues in P. aeruginosa apo-azurin. Protein Sci 2004; 13:2706-15. [PMID: 15340166 PMCID: PMC2286540 DOI: 10.1110/ps.04849004] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The effects on folding kinetics and equilibrium stability of core mutations in the apo-mutant C112S of azurin from Pseudomonas aeruginosa were studied. A number of conserved residues within the cupredoxin family were recognized by sequential alignment as constituting a common hydrophobic core: I7, F15, L33, W48, F110, L50, V95, and V31. Of these, I7, V31, L33, and L50 were mutated for the purpose of obtaining information on the transition state and a potential folding nucleus. In addition, residue V5 in the immediate vicinity of the common core, as well as T52, separate from the core, were mutated as controls. All mutants exhibited a nonlinear dependence of activation free energy of folding on denaturant concentration, although the refolding kinetics of the V31A/C112S mutant indicated that the V31A mutation destabilizes the transition state enough to allow folding via a parallel transition state ensemble. Phi-values could be calculated for three of the six mutants, V31A/C112S, L33A/C112S, and L50A/C112S, and the fractional values of 0.63, 0.33, and 0.50 (respectively) obtained at 0.5 M GdmCl suggest that these residues are important for stabilizing the transition state. Furthermore, a linear dependence of ln k(obs)(H2O) on DeltaG(U-N)(H2O) of the core mutations and the putative involvement of ground-state effects suggest the presence of native-like residual interactions in the denatured state that bias this ensemble toward a folding-competent state.
Collapse
Affiliation(s)
- K Cecilia Engman
- Department of Chemistry, Göteborg University, Box 465, SE 405 30 Göteborg, Sweden
| | | | | | | |
Collapse
|
38
|
Pozdnyakova I, Wittung-Stafshede P. Approaching the speed limit for Greek Key β-barrel formation: transition-state movement tunes folding rate of zinc-substituted azurin. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2003; 1651:1-4. [PMID: 14499583 DOI: 10.1016/s1570-9639(03)00240-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Azurin is a blue-copper protein with a beta-barrel structure of Greek Key topology. In vitro, copper can be substituted with zinc without change in protein structure. We here analyze the kinetic folding behavior of zinc-substituted Pseudomonas aeruginosa azurin. Our findings can be summarized in three key conclusions: first, zinc remains strongly bound to the polypeptide upon unfolding, suggesting that the cofactor may bind to the protein before polypeptide folding in vivo. Second, the semi-logarithmic plot of folding and unfolding rates for zinc-substituted azurin as a function of denaturant concentration exhibits curvature due to a changing transition-state structure. Third, the extrapolated folding speed in water for zinc-substituted azurin is similar to that of other proteins with the same topology, implying that there is a speed limit that can be modulated by stability-driven transition-state movement for formation of beta-barrel structures with Greek Key topology.
Collapse
Affiliation(s)
- Irina Pozdnyakova
- Department of Chemistry, Tulane University, New Orleans, LA 70118-5698, USA
| | | |
Collapse
|