1
|
Lopez-Martinez E, Manteca A, Ferruz N, Cortajarena AL. Statistical Analysis and Tokenization of Epitopes to Construct Artificial Neoepitope Libraries. ACS Synth Biol 2023; 12:2812-2818. [PMID: 37703075 PMCID: PMC10594869 DOI: 10.1021/acssynbio.3c00201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Indexed: 09/14/2023]
Abstract
Epitopes are specific regions on an antigen's surface that the immune system recognizes. Epitopes are usually protein regions on foreign immune-stimulating entities such as viruses and bacteria, and in some cases, endogenous proteins may act as antigens. Identifying epitopes is crucial for accelerating the development of vaccines and immunotherapies. However, mapping epitopes in pathogen proteomes is challenging using conventional methods. Screening artificial neoepitope libraries against antibodies can overcome this issue. Here, we applied conventional sequence analysis and methods inspired in natural language processing to reveal specific sequence patterns in the linear epitopes deposited in the Immune Epitope Database (www.iedb.org) that can serve as building blocks for the design of universal epitope libraries. Our results reveal that amino acid frequency in annotated linear epitopes differs from that in the human proteome. Aromatic residues are overrepresented, while the presence of cysteines is practically null in epitopes. Byte pair encoding tokenization shows high frequencies of tryptophan in tokens of 5, 6, and 7 amino acids, corroborating the findings of the conventional sequence analysis. These results can be applied to reduce the diversity of linear epitope libraries by orders of magnitude.
Collapse
Affiliation(s)
- Elena Lopez-Martinez
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
| | - Aitor Manteca
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
| | - Noelia Ferruz
- Molecular
Biology Institute of Barcelona (IBMB-CSIC), Barcelona Science Park, Baldiri Reixac, 15-21, 08028, Barcelona, Spain
| | - Aitziber L. Cortajarena
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
- IKERBASQUE, Basque
Foundation for Science, Plaza Euskadi 5, 48009 Bilbao, Spain
| |
Collapse
|
2
|
Aires A, Fernández-Afonso Y, Guedes G, Guisasola E, Gutiérrez L, Cortajarena AL. Engineered Protein-Driven Synthesis of Tunable Iron Oxide Nanoparticles as T1 and T2 Magnetic Resonance Imaging Contrast Agents. CHEMISTRY OF MATERIALS : A PUBLICATION OF THE AMERICAN CHEMICAL SOCIETY 2022; 34:10832-10841. [PMID: 36590706 PMCID: PMC9798829 DOI: 10.1021/acs.chemmater.2c01746] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 11/15/2022] [Indexed: 05/14/2023]
Abstract
Iron oxide nanoparticles (IONPs) have become one of the most promising nanomaterials for biomedical applications because of their biocompatibility and physicochemical properties. This study demonstrates the use of protein engineering as a novel approach to design scaffolds for the tunable synthesis of ultrasmall IONPs. Rationally designed proteins, containing different number of metal-coordination sites, were evaluated to control the size and the physicochemical and magnetic properties of a set of protein-stabilized IONPs (Prot-IONPs). Prot-IONPs, synthesized through an optimized coprecipitation approach, presented good T1 and T2 relaxivity values, stability, and biocompatibility, showing potential for magnetic resonance imaging (MRI) applications.
Collapse
Affiliation(s)
- Antonio Aires
- Center
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián 20014, Spain
| | - Yilian Fernández-Afonso
- Instituto
de Nanociencia y Materiales de Aragón (INMA), CSIC-Universidad de Zaragoza, 50018 Zaragoza, Spain
| | - Gabriela Guedes
- Center
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián 20014, Spain
- University
of the Basque Country (UPV/EHU), 48940 Leioa, Spain
| | - Eduardo Guisasola
- Center
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián 20014, Spain
| | - Lucía Gutiérrez
- Instituto
de Nanociencia y Materiales de Aragón (INMA), CSIC-Universidad de Zaragoza, 50018 Zaragoza, Spain
- Centro
de Investigación Biomédica en Red de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), 50018 Zaragoza, Spain
| | - Aitziber L. Cortajarena
- Center
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián 20014, Spain
- Ikerbasque, Basque Foundation for Science, 48009 Bilbao, Spain
| |
Collapse
|
3
|
Synakewicz M, Eapen RS, Perez-Riba A, Rowling PJE, Bauer D, Weißl A, Fischer G, Hyvönen M, Rief M, Itzhaki LS, Stigler J. Unraveling the Mechanics of a Repeat-Protein Nanospring: From Folding of Individual Repeats to Fluctuations of the Superhelix. ACS NANO 2022. [PMID: 35258937 DOI: 10.1101/2021.03.27.437344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Tandem-repeat proteins comprise small secondary structure motifs that stack to form one-dimensional arrays with distinctive mechanical properties that are proposed to direct their cellular functions. Here, we use single-molecule optical tweezers to study the folding of consensus-designed tetratricopeptide repeats (CTPRs), superhelical arrays of short helix-turn-helix motifs. We find that CTPRs display a spring-like mechanical response in which individual repeats undergo rapid equilibrium fluctuations between partially folded and unfolded conformations. We rationalize the force response using Ising models and dissect the folding pathway of CTPRs under mechanical load, revealing how the repeat arrays form from the center toward both termini simultaneously. Most strikingly, we also directly observe the protein's superhelical tertiary structure in the force signal. Using protein engineering, crystallography, and single-molecule experiments, we show that the superhelical geometry can be altered by carefully placed amino acid substitutions, and we examine how these sequence changes affect intrinsic repeat stability and inter-repeat coupling. Our findings provide the means to dissect and modulate repeat-protein stability and dynamics, which will be essential for researchers to understand the function of natural repeat proteins and to exploit artificial repeats proteins in nanotechnology and biomedical applications.
Collapse
Affiliation(s)
- Marie Synakewicz
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Rohan S Eapen
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Albert Perez-Riba
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Pamela J E Rowling
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Daniela Bauer
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Andreas Weißl
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Gerhard Fischer
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Marko Hyvönen
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Matthias Rief
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Johannes Stigler
- Gene Center Munich, Ludwig-Maximilians-Universität München, Feodor-Lynen-Straße 25, 81377 München, Germany
| |
Collapse
|
4
|
Synakewicz M, Eapen RS, Perez-Riba A, Rowling PJE, Bauer D, Weißl A, Fischer G, Hyvönen M, Rief M, Itzhaki LS, Stigler J. Unraveling the Mechanics of a Repeat-Protein Nanospring: From Folding of Individual Repeats to Fluctuations of the Superhelix. ACS NANO 2022; 16:3895-3905. [PMID: 35258937 PMCID: PMC8944806 DOI: 10.1021/acsnano.1c09162] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 01/27/2022] [Indexed: 06/14/2023]
Abstract
Tandem-repeat proteins comprise small secondary structure motifs that stack to form one-dimensional arrays with distinctive mechanical properties that are proposed to direct their cellular functions. Here, we use single-molecule optical tweezers to study the folding of consensus-designed tetratricopeptide repeats (CTPRs), superhelical arrays of short helix-turn-helix motifs. We find that CTPRs display a spring-like mechanical response in which individual repeats undergo rapid equilibrium fluctuations between partially folded and unfolded conformations. We rationalize the force response using Ising models and dissect the folding pathway of CTPRs under mechanical load, revealing how the repeat arrays form from the center toward both termini simultaneously. Most strikingly, we also directly observe the protein's superhelical tertiary structure in the force signal. Using protein engineering, crystallography, and single-molecule experiments, we show that the superhelical geometry can be altered by carefully placed amino acid substitutions, and we examine how these sequence changes affect intrinsic repeat stability and inter-repeat coupling. Our findings provide the means to dissect and modulate repeat-protein stability and dynamics, which will be essential for researchers to understand the function of natural repeat proteins and to exploit artificial repeats proteins in nanotechnology and biomedical applications.
Collapse
Affiliation(s)
- Marie Synakewicz
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Rohan S. Eapen
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Albert Perez-Riba
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Pamela J. E. Rowling
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Daniela Bauer
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Andreas Weißl
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Gerhard Fischer
- Department
of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Marko Hyvönen
- Department
of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Matthias Rief
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Laura S. Itzhaki
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Johannes Stigler
- Gene
Center Munich, Ludwig-Maximilians-Universität
München, Feodor-Lynen-Straße 25, 81377 München, Germany
| |
Collapse
|
5
|
Wang Y, Lei R, Nourmohammad A, Wu NC. Antigenic evolution of human influenza H3N2 neuraminidase is constrained by charge balancing. eLife 2021; 10:e72516. [PMID: 34878407 PMCID: PMC8683081 DOI: 10.7554/elife.72516] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 12/07/2021] [Indexed: 11/13/2022] Open
Abstract
As one of the main influenza antigens, neuraminidase (NA) in H3N2 virus has evolved extensively for more than 50 years due to continuous immune pressure. While NA has recently emerged as an effective vaccine target, biophysical constraints on the antigenic evolution of NA remain largely elusive. Here, we apply combinatorial mutagenesis and next-generation sequencing to characterize the local fitness landscape in an antigenic region of NA in six different human H3N2 strains that were isolated around 10 years apart. The local fitness landscape correlates well among strains and the pairwise epistasis is highly conserved. Our analysis further demonstrates that local net charge governs the pairwise epistasis in this antigenic region. In addition, we show that residue coevolution in this antigenic region is correlated with the pairwise epistasis between charge states. Overall, this study demonstrates the importance of quantifying epistasis and the underlying biophysical constraint for building a model of influenza evolution.
Collapse
Affiliation(s)
- Yiquan Wang
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
| | - Ruipeng Lei
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
| | - Armita Nourmohammad
- Department of Physics, University of WashingtonSeattleUnited States
- Max Planck Institute for Dynamics and Self-OrganizationGöttingenGermany
- Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Nicholas C Wu
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Carle Illinois College of Medicine, University of Illinois at Urbana-ChampaignUrbanaUnited States
| |
Collapse
|
6
|
Kozuka K, Nakano S, Asano Y, Ito S. Partial Consensus Design and Enhancement of Protein Function by Secondary-Structure-Guided Consensus Mutations. Biochemistry 2021; 60:2309-2319. [PMID: 34254784 DOI: 10.1021/acs.biochem.1c00309] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Consensus design (CD) is a representative sequence-based protein design method that enables the design of highly functional proteins by analyzing vast amounts of protein sequence data. This study proposes a partial consensus design (PCD) of a protein as a derivative approach of CD. The method replaces the target protein sequence with a consensus sequence in a secondary-structure-dependent manner (i.e., regionally dependent and divided into α-helix, β-sheet, and loop regions). In this study, we generated several artificial partial consensus l-threonine 3-dehydrogenases (PcTDHs) by PCD using the TDH from Cupriavidus necator (CnTDH) as a target protein. Structural and functional analysis of PcTDHs suggested that thermostability would be independently improved when consensus mutations are introduced into the loop region of TDHs. On the other hand, enzyme kinetic parameters (kcat/Km) and average productivity would be synergistically enhanced by changing the combination of the mutations-replacement of one region of CnTDH with a consensus sequence provided only negative effects, but the negative effects were nullified when the two regions were replaced simultaneously. Taken together, we propose the hypothesis that there are protein regions that encode individual protein properties, such as thermostability and activity, and that the introduction of consensus mutations into these regions could additively or synergistically modify their functions.
Collapse
Affiliation(s)
- Kohei Kozuka
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| | - Shogo Nakano
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan.,PREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
| | - Yasuhisa Asano
- Biotechnology Research Center and Department of Biotechnology, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama 939-0398, Japan
| | - Sohei Ito
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| |
Collapse
|
7
|
Izert MA, Szybowska PE, Górna MW, Merski M. The Effect of Mutations in the TPR and Ankyrin Families of Alpha Solenoid Repeat Proteins. FRONTIERS IN BIOINFORMATICS 2021; 1:696368. [PMID: 36303725 PMCID: PMC9581033 DOI: 10.3389/fbinf.2021.696368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 06/22/2021] [Indexed: 11/20/2022] Open
Abstract
Protein repeats are short, highly similar peptide motifs that occur several times within a single protein, for example the TPR and Ankyrin repeats. Understanding the role of mutation in these proteins is complicated by the competing facts that 1) the repeats are much more restricted to a set sequence than non-repeat proteins, so mutations should be harmful much more often because there are more residues that are heavily restricted due to the need of the sequence to repeat and 2) the symmetry of the repeats in allows the distribution of functional contributions over a number of residues so that sometimes no specific site is singularly responsible for function (unlike enzymatic active site catalytic residues). To address this issue, we review the effects of mutations in a number of natural repeat proteins from the tetratricopeptide and Ankyrin repeat families. We find that mutations are context dependent. Some mutations are indeed highly disruptive to the function of the protein repeats while mutations in identical positions in other repeats in the same protein have little to no effect on structure or function.
Collapse
Affiliation(s)
| | | | | | - Matthew Merski
- *Correspondence: Maria Wiktoria Górna, ; Matthew Merski,
| |
Collapse
|
8
|
Khong GN, Le NT, Pham MT, Adam H, Gauron C, Le HQ, Pham DT, Colonges K, Pham XH, Do VN, Lebrun M, Jouannic S. A cluster of Ankyrin and Ankyrin-TPR repeat genes is associated with panicle branching diversity in rice. PLoS Genet 2021; 17:e1009594. [PMID: 34097698 PMCID: PMC8211194 DOI: 10.1371/journal.pgen.1009594] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 06/17/2021] [Accepted: 05/10/2021] [Indexed: 12/13/2022] Open
Abstract
The number of grains per panicle is an important yield-related trait in cereals which depends in part on panicle branching complexity. One component of this complexity is the number of secondary branches per panicle. Previously, a GWAS site associated with secondary branch and spikelet numbers per panicle in rice was identified. Here we combined gene capture, bi-parental genetic population analysis, expression profiling and transgenic approaches in order to investigate the functional significance of a cluster of 6 ANK and ANK-TPR genes within the QTL. Four of the ANK and ANK-TPR genes present a differential expression associated with panicle secondary branch number in contrasted accessions. These differential expression patterns correlate in the different alleles of these genes with specific deletions of potential cis-regulatory sequences in their promoters. Two of these genes were confirmed through functional analysis as playing a role in the control of panicle architecture. Our findings indicate that secondary branching diversity in the rice panicle is governed in part by differentially expressed genes within this cluster encoding ANK and ANK-TPR domain proteins that may act as positive or negative regulators of panicle meristem’s identity transition from indeterminate to determinate state. Grain yield is one of the most important indexes in rice breeding, which is controlled in part by panicle branching complexity. A new QTL with co-location of spikelet number (SpN) and secondary branch number (SBN) traits was identified by genome-wide association study in a Vietnamese rice landrace panel. A set of four Ankyrin and Tetratricopeptide repeat domain-encoding genes was identified from this QTL based on their difference of expression levels between two contrasted haplotypes for the SpN and SBN traits. The differential expression is correlated with deletions in the promoter regions of these genes. Two of the genes act as negative regulators of the panicle meristem’s identity transition from indeterminate to determinate state while the other two act as positive regulators of this meristem fate transition. Based on the different phenotypes between overexpressed and mutant plants, two of these genes were confirmed as playing a role in the control of panicle architecture. These findings can be directly used to assist selection for grain yield improvement.
Collapse
Affiliation(s)
- Giang Ngan Khong
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
- * E-mail: (GNK); (SJ)
| | - Nhu Thi Le
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
| | - Mai Thi Pham
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
| | - Helene Adam
- UMR DIADE, University of Montpellier, IRD, Montpellier, France
| | - Carole Gauron
- UMR DIADE, University of Montpellier, IRD, Montpellier, France
| | - Hoa Quang Le
- School of Biotechnology and Food Technology, Hanoi University of Science and Technology, Hanoi, Vietnam
| | - Dung Tien Pham
- School of Biotechnology and Food Technology, Hanoi University of Science and Technology, Hanoi, Vietnam
| | - Kelly Colonges
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
| | - Xuan Hoi Pham
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
| | - Vinh Nang Do
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
| | - Michel Lebrun
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
- UMR LSTM, University of Montpellier, IRD, CIRAD, INRAE, SupAgro, Montpellier, France
| | - Stefan Jouannic
- LMI RICE, National Key Laboratory for Plant Cell Biotechnology, Agronomical Genetics Institute, Hanoi, Vietnam
- UMR DIADE, University of Montpellier, IRD, Montpellier, France
- * E-mail: (GNK); (SJ)
| |
Collapse
|
9
|
Barik S. An Analytical Review of the Structural Features of Pentatricopeptide Repeats: Strategic Amino Acids, Repeat Arrangements and Superhelical Architecture. Int J Mol Sci 2021; 22:ijms22105407. [PMID: 34065603 PMCID: PMC8160929 DOI: 10.3390/ijms22105407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 05/18/2021] [Accepted: 05/19/2021] [Indexed: 12/27/2022] Open
Abstract
Tricopeptide repeats are common in natural proteins, and are exemplified by 34- and 35-residue repeats, known respectively as tetratricopeptide repeats (TPRs) and pentatricopeptide repeats (PPRs). In both classes, each repeat unit forms an antiparallel bihelical structure, so that multiple such units in a polypeptide are arranged in a parallel fashion. The primary structures of the motifs are nonidentical, but amino acids of similar properties occur in strategic positions. The focus of the present work was on PPR, but TPR, its better-studied cousin, is often included for comparison. The analyses revealed that critical amino acids, namely Gly, Pro, Ala and Trp, were placed at distinct locations in the higher order structure of PPR domains. While most TPRs occur in repeats of three, the PPRs exhibited a much greater diversity in repeat numbers, from 1 to 30 or more, separated by spacers of various sequences and lengths. Studies of PPR strings in proteins showed that the majority of PPR units are single, and that the longer tandems (i.e., without space in between) occurred in decreasing order. The multi-PPR domains also formed superhelical vortices, likely governed by interhelical angles rather than the spacers. These findings should be useful in designing and understanding the PPR domains.
Collapse
Affiliation(s)
- Sailen Barik
- EonBio, 3780 Pelham Drive, Mobile, AL 36619, USA
| |
Collapse
|
10
|
Bai X, Li D, Ma F, Deng X, Luo M, Feng Y, Yang G. Improved thermostability of creatinase from Alcaligenes Faecalis through non-biased phylogenetic consensus-guided mutagenesis. Microb Cell Fact 2020; 19:194. [PMID: 33069232 PMCID: PMC7568399 DOI: 10.1186/s12934-020-01451-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2020] [Accepted: 10/07/2020] [Indexed: 02/06/2023] Open
Abstract
Background Enzymatic quantification of creatinine has become an essential method for clinical evaluation of renal function. Although creatinase (CR) is frequently used for this purpose, its poor thermostability severely limits industrial applications. Herein, we report a novel creatinase from Alcaligenes faecalis (afCR) with higher catalytic activity and lower KM value, than currently used creatinases. Furthermore, we developed a non-biased phylogenetic consensus method to improve the thermostability of afCR. Results We applied a non-biased phylogenetic consensus method to identify 59 candidate consensus residues from 24 creatinase family homologs for screening afCR mutants with improved thermostability. Twenty-one amino acids of afCR were selected to mutagenesis and 11 of them exhibited improved thermostability compared to the parent enzyme (afCR-M0). Combination of single-site mutations in sequential screens resulted in a quadruple mutant D17V/T199S/L6P/T251C (M4-2) which showed ~ 1700-fold enhanced half-life at 57 °C and a 4.2 °C higher T5015 than that of afCR-M0. The mutant retained catalytic activity equivalent to afCR-M0, and thus showed strong promise for application in creatinine detection. Structural homology modeling revealed a wide range of potential molecular interactions associated with individual mutations that contributed to improving afCR thermostability. Conclusions Results of this study clearly demonstrated that the non-biased-phylogenetic consensus design for improvement of thermostability in afCR is effective and promising in improving the thermostability of more enzymes.
Collapse
Affiliation(s)
- Xue Bai
- Institute of Biothermal Science and Technology, University of Shanghai for Science and Technology, Shanghai, 200093, People's Republic of China
| | - Daixi Li
- Institute of Biothermal Science and Technology, University of Shanghai for Science and Technology, Shanghai, 200093, People's Republic of China.
| | - Fuqiang Ma
- CAS Key Lab of Bio-Medical Diagnostics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, Jiangsu, People's Republic of China
| | - Xi Deng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China
| | - Manjie Luo
- Wuhan Hzymes Biotechnology Co., Ltd., Wuhan, 430000, Hubei, People's Republic of China
| | - Yan Feng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China
| | - Guangyu Yang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China.
| |
Collapse
|
11
|
Motoyama T, Hiramatsu N, Asano Y, Nakano S, Ito S. Protein Sequence Selection Method That Enables Full Consensus Design of Artificial l-Threonine 3-Dehydrogenases with Unique Enzymatic Properties. Biochemistry 2020; 59:3823-3833. [PMID: 32945652 DOI: 10.1021/acs.biochem.0c00570] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Exponentially increasing protein sequence data enables artificial enzyme design using sequence-based protein design methods, including full-consensus protein design (FCD). The success of artificial enzyme design is strongly dependent on the nature of the sequences used. Hence, sequences must be selected from databases and curated libraries prepared to enable a successful design by FCD. In this study, we proposed a selection approach regarding several key residues as sequence motifs. We used l-threonine 3-dehydrogenase (TDH) as a model to test the validity of this approach. In the classification, four residues (143, 174, 188, and 214) were used as key residues. We classified thousands of TDH homologous sequences into five groups containing hundreds of sequences. Utilizing sequences in the libraries, we designed five artificial TDHs by FCD. Among the five, we successfully expressed four in soluble form. Biochemical analysis of artificial TDHs indicated that their enzymatic properties vary; half of the maximum measured enzyme activity (t1/2) and activation energies were distributed from 53 to 65 °C and from 38 to 125 kJ/mol, respectively. The artificial TDHs had unique kinetic parameters, distinct from one another. Structural analysis indicates that consensus mutations are mainly introduced in the secondary or outer shell. The functional diversity of the artificial TDHs is due to the accumulation of mutations that affect their physicochemical properties. Taken together, our findings indicate that our proposed approach can help generate artificial enzymes with unique enzymatic properties.
Collapse
Affiliation(s)
- Tomoharu Motoyama
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Nozomi Hiramatsu
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Yasuhisa Asano
- Biotechnology Research Center and Department of Biotechnology, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama 939-0398, Japan
| | - Shogo Nakano
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Sohei Ito
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| |
Collapse
|
12
|
Georgoulis A, Louka M, Mylonas S, Stavros P, Nounesis G, Vorgias CE. Consensus protein engineering on the thermostable histone-like bacterial protein HUs significantly improves stability and DNA binding affinity. Extremophiles 2020; 24:293-306. [PMID: 31980943 DOI: 10.1007/s00792-020-01154-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 01/06/2020] [Indexed: 11/28/2022]
Abstract
Consensus-based protein engineering strategy has been applied to various proteins and it can lead to the design of proteins with enhanced biological performance. Histone-like HUs comprise a protein family with sequence variety within a highly conserved 3D-fold. HU function includes compacting and regulating bacterial DNA in a wide range of biological conditions in bacteria. To explore the possible impact of consensus-based design in the thermodynamic stability of HU proteins, the approach was applied using a dataset of sequences derived from a group of 40 mesostable, thermostable, and hyperthermostable HUs. The consensus-derived HU protein was named HUBest, since it is expected to perform best. The synthetic HU gene was overexpressed in E. coli and the recombinant protein was purified. Subsequently, HUBest was characterized concerning its correct folding and thermodynamic stability, as well as its ability to interact with plasmid DNA. A substantial increase in HUBest stability at high temperatures is observed. HUBest has significantly improved biological performance at ambience temperature, presenting very low Kd values for binding plasmid DNA as indicated from the Gibbs energy profile of HUBest. This Kd may be associated to conformational changes leading to decreased thermodynamic stability and, therefore, higher flexibility at ambient temperature.
Collapse
Affiliation(s)
- Anastasios Georgoulis
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, 157 01, Zografou, Greece
| | - Maria Louka
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, 157 01, Zografou, Greece
| | - Stratos Mylonas
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, 157 01, Zografou, Greece
| | - Philemon Stavros
- Biomolecular Physics Laboratory, INRASTES, National Centre for Scientific Research "Demokritos", 153 10, Agia Paraskevi, Greece
| | - George Nounesis
- Biomolecular Physics Laboratory, INRASTES, National Centre for Scientific Research "Demokritos", 153 10, Agia Paraskevi, Greece
| | - Constantinos E Vorgias
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, 157 01, Zografou, Greece.
| |
Collapse
|
13
|
Dave K, Gasic AG, Cheung MS, Gruebele M. Competition of individual domain folding with inter-domain interaction in WW domain engineered repeat proteins. Phys Chem Chem Phys 2019; 21:24393-24405. [PMID: 31663524 DOI: 10.1039/c8cp07775d] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Engineered repeat proteins have proven to be a fertile ground for studying the competition between folding, misfolding and transient aggregation of tethered protein domains. We examine the interplay between folding and inter-domain interactions of engineered FiP35 WW domain repeat proteins with n = 1 through 5 repeats. We characterize protein expression, thermal and guanidium melts, as well as laser T-jump kinetics. All experimental data is fitted by a global fitting model with two states per domain (U, N), plus a third state M to account for non-native states due to domain interactions present in all but the monomer. A detailed structural model is provided by coarse-grained simulated annealing using the AWSEM Hamiltonian. Tethered FiP35 WW domains with n = 2 and 3 domains are just slightly less stable than the monomer. The n = 4 oligomer is yet less stable, its expression yield is much lower than the monomer's, and depends on the purification tag used. The n = 5 plasmid did not express at all, indicating the sudden onset of aggregation past n = 4. Thus, tethered FiP35 has a critical nucleus size for inter-domain aggregation of n ≈ 4. According to our simulations, misfolded structures become increasingly prevalent as one proceeds from monomer to pentamer, with extended inter-domain beta sheets appearing first, then multi-sheet 'intramolecular amyloid' structures, and finally novel motifs containing alpha helices. We discuss the implications of our results for oligomeric aggregate formation and structure, transient aggregation of proteins whilst folding, as well as for protein evolution that starts with repeat proteins.
Collapse
Affiliation(s)
- Kapil Dave
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Champaign, IL 61801, USA
| | | | | | | |
Collapse
|
14
|
Goyal VD, Sullivan BJ, Magliery TJ. Phylogenetic spread of sequence data affects fitness of consensus enzymes: Insights from triosephosphate isomerase. Proteins 2019; 88:274-283. [PMID: 31407418 DOI: 10.1002/prot.25799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 07/26/2019] [Accepted: 08/08/2019] [Indexed: 11/08/2022]
Abstract
The concept of consensus in multiple sequence alignments (MSAs) has been used to design and engineer proteins previously with some success. However, consensus design implicitly assumes that all amino acid positions function independently, whereas in reality, the amino acids in a protein interact with each other and work cooperatively to produce the optimum structure required for its function. Correlation analysis is a tool that can capture the effect of such interactions. In a previously published study, we made consensus variants of the triosephosphate isomerase (TIM) protein using MSAs that included sequences form both prokaryotic and eukaryotic organisms. These variants were not completely native-like and were also surprisingly different from each other in terms of oligomeric state, structural dynamics, and activity. Extensive correlation analysis of the TIM database has revealed some clues about factors leading to the unusual behavior of the previously constructed consensus proteins. Among other things, we have found that the more ill-behaved consensus mutant had more broken correlations than the better-behaved consensus variant. Moreover, we report three correlation and phylogeny-based consensus variants of TIM. These variants were more native-like than the previous consensus mutants and considerably more stable than a wild-type TIM from a mesophilic organism. This study highlights the importance of choosing the appropriate diversity of MSA for consensus analysis and provides information that can be used to engineer stable enzymes.
Collapse
Affiliation(s)
- Venuka Durani Goyal
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| | - Brandon J Sullivan
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio.,Ohio State Biochemistry Program, The Ohio State University, Columbus, Ohio
| | - Thomas J Magliery
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| |
Collapse
|
15
|
Nakano S, Motoyama T, Miyashita Y, Ishizuka Y, Matsuo N, Tokiwa H, Shinoda S, Asano Y, Ito S. Benchmark Analysis of Native and Artificial NAD +-Dependent Enzymes Generated by a Sequence-Based Design Method with or without Phylogenetic Data. Biochemistry 2018; 57:3722-3732. [PMID: 29787243 DOI: 10.1021/acs.biochem.8b00339] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The expansion of protein sequence databases has enabled us to design artificial proteins by sequence-based design methods, such as full-consensus design (FCD) and ancestral-sequence reconstruction (ASR). Artificial proteins with enhanced activity levels compared with native ones can potentially be generated by such methods, but successful design is rare because preparing a sequence library by curating the database and selecting a method is difficult. Utilizing a curated library prepared by reducing conservation energies, we successfully designed two artificial l-threonine 3-dehydrogenases (SDR-TDH) with higher activity levels than native SDR-TDH, FcTDH-N1, and AncTDH, using FCD and ASR, respectively. The artificial SDR-TDHs had excellent thermal stability and NAD+ recognition compared to native SDR-TDH from Cupriavidus necator (CnTDH); the melting temperatures of FcTDH-N1 and AncTDH were about 10 and 5 °C higher than that of CnTDH, respectively, and the dissociation constants toward NAD+ of FcTDH-N1 and AncTDH were 2- and 7-fold lower than that of CnTDH, respectively. Enzymatic efficiency of the artificial SDR-TDHs were comparable to that of CnTDH. Crystal structures of FcTDH-N1 and AncTDH were determined at 2.8 and 2.1 Å resolution, respectively. Structural and MD simulation analysis of the SDR-TDHs indicated that only the flexibility at specific regions was changed, suggesting that multiple mutations introduced in the artificial SDR-TDHs altered their flexibility and thereby affected their enzymatic properties. Benchmark analysis of the SDR-TDHs indicated that both FCD and ASR can generate highly functional proteins if a curated library is prepared appropriately.
Collapse
Affiliation(s)
- Shogo Nakano
- Graduate Division of Nutritional and Environmental Sciences , University of Shizuoka , 52-1 Yada , Suruga-ku, Shizuoka 422-8526 , Japan.,Asano Active Enzyme Molecule Project , ERATO, JST , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan
| | - Tomoharu Motoyama
- Graduate Division of Nutritional and Environmental Sciences , University of Shizuoka , 52-1 Yada , Suruga-ku, Shizuoka 422-8526 , Japan
| | - Yurina Miyashita
- Department of Chemistry , Rikkyo University , Nishi-ikebukuro , Toshima-ku, Tokyo 171-8501 , Japan
| | - Yuki Ishizuka
- Graduate Division of Nutritional and Environmental Sciences , University of Shizuoka , 52-1 Yada , Suruga-ku, Shizuoka 422-8526 , Japan
| | - Naoya Matsuo
- Department of Chemistry , Rikkyo University , Nishi-ikebukuro , Toshima-ku, Tokyo 171-8501 , Japan
| | - Hiroaki Tokiwa
- Department of Chemistry , Rikkyo University , Nishi-ikebukuro , Toshima-ku, Tokyo 171-8501 , Japan
| | - Suguru Shinoda
- Asano Active Enzyme Molecule Project , ERATO, JST , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan.,Biotechnology Research Center and Department of Biotechnology , Toyama Prefectural University , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan
| | - Yasuhisa Asano
- Asano Active Enzyme Molecule Project , ERATO, JST , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan.,Biotechnology Research Center and Department of Biotechnology , Toyama Prefectural University , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan
| | - Sohei Ito
- Graduate Division of Nutritional and Environmental Sciences , University of Shizuoka , 52-1 Yada , Suruga-ku, Shizuoka 422-8526 , Japan.,Asano Active Enzyme Molecule Project , ERATO, JST , 5180 Kurokawa , Imizu, Toyama 939-0398 , Japan
| |
Collapse
|
16
|
Sanchez-deAlcazar D, Mejias SH, Erazo K, Sot B, Cortajarena AL. Self-assembly of repeat proteins: Concepts and design of new interfaces. J Struct Biol 2018; 201:118-129. [DOI: 10.1016/j.jsb.2017.09.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Revised: 08/09/2017] [Accepted: 09/02/2017] [Indexed: 11/25/2022]
|
17
|
Rahimi H, Shokrgozar MA, Madadkar-Sobhani A, Mahdian R, Foroumadi A, Karimipoor M. Structural Insight into Anaphase Promoting Complex 3 Structure and Docking with a Natural Inhibitory Compound. Adv Biomed Res 2017; 6:26. [PMID: 28401073 PMCID: PMC5359995 DOI: 10.4103/2277-9175.201683] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND Anaphase promoting complex (APC) is the biggest Cullin-RING E3 ligase and is very important in cell cycle control; many anti-cancer agents target this. APC controls the onset of chromosome separation and mitotic exit through securin and cyclin B degradation, respectively. Its APC3 subunit identifies the APC activators-Cdh1 and Cdc20. MATERIALS AND METHODS The structural model of the APC3 subunit of APC was developed by means of computational techniques; the binding of a natural inhibitory compound to APC3 was also investigated. RESULTS It was found that APC3 structure consists of numerous helices organized in anti-parallel and the overall model is superhelical of tetratrico-peptide repeat (TPR) domains. Furthermore, binding pocket of the natural inhibitory compound as APC3 inhibitor was shown. CONCLUSION The findings are beneficial to understand the mechanism of the APC activation and design inhibitory compounds.
Collapse
Affiliation(s)
- Hamzeh Rahimi
- Department of Molecular Medicine, Biotechnology Research Center, Pasteur Institute of Iran, Tehran, Iran
| | | | - Armin Madadkar-Sobhani
- Department of Life Sciences, Barcelona Supercomputing Center, Barcelona, Spain; Department of Bioinformatics, Institute of Biophysics and Biochemistry, University of Tehran, Tehran, Iran
| | - Reza Mahdian
- Department of Molecular Medicine, Biotechnology Research Center, Pasteur Institute of Iran, Tehran, Iran
| | - Alireza Foroumadi
- Department of Medicinal Chemistry, Faculty of Pharmacy and Pharmaceutical Sciences Research Center, Tehran University of Medical Sciences, Tehran, Iran
| | - Morteza Karimipoor
- Department of Molecular Medicine, Biotechnology Research Center, Pasteur Institute of Iran, Tehran, Iran
| |
Collapse
|
18
|
Chen J, Wang B, Regan L, Gerstein M. Intensification: A Resource for Amplifying Population-Genetic Signals with Protein Repeats. J Mol Biol 2016; 429:435-445. [PMID: 27939289 DOI: 10.1016/j.jmb.2016.12.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 11/16/2016] [Accepted: 12/03/2016] [Indexed: 11/16/2022]
Abstract
Large-scale genome sequencing holds great promise for the interpretation of protein structures through the discovery of many, rare functional variants in the human population. However, because protein-coding regions are under high selective constraints, these variants occur at low frequencies, such that there is often insufficient statistics for downstream calculations. To address this problem, we develop the Intensification approach, which uses the modular structure of repeat protein domains to amplify signals of selection from population genetics and traditional interspecies conservation. In particular, we are able to aggregate variants at the codon level to identify important positions in repeat domains that show strong conservation signals. This allows us to compare conservation over different evolutionary timescales. It also enables us to visualize population-genetic measures on protein structures. We make available the Intensification results as an online resource (http://intensification.gersteinlab.org) and illustrate the approach through a case study on the tetratricopeptide repeat.
Collapse
Affiliation(s)
- Jieming Chen
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Bo Wang
- Department of Chemistry, Yale University, New Haven, CT 06520, USA
| | - Lynne Regan
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Chemistry, Yale University, New Haven, CT 06520, USA
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Computer Science, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|
19
|
Zhu H, Sepulveda E, Hartmann MD, Kogenaru M, Ursinus A, Sulz E, Albrecht R, Coles M, Martin J, Lupas AN. Origin of a folded repeat protein from an intrinsically disordered ancestor. eLife 2016; 5:e16761. [PMID: 27623012 PMCID: PMC5074805 DOI: 10.7554/elife.16761] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 09/09/2016] [Indexed: 01/03/2023] Open
Abstract
Repetitive proteins are thought to have arisen through the amplification of subdomain-sized peptides. Many of these originated in a non-repetitive context as cofactors of RNA-based replication and catalysis, and required the RNA to assume their active conformation. In search of the origins of one of the most widespread repeat protein families, the tetratricopeptide repeat (TPR), we identified several potential homologs of its repeated helical hairpin in non-repetitive proteins, including the putatively ancient ribosomal protein S20 (RPS20), which only becomes structured in the context of the ribosome. We evaluated the ability of the RPS20 hairpin to form a TPR fold by amplification and obtained structures identical to natural TPRs for variants with 2-5 point mutations per repeat. The mutations were neutral in the parent organism, suggesting that they could have been sampled in the course of evolution. TPRs could thus have plausibly arisen by amplification from an ancestral helical hairpin.
Collapse
Affiliation(s)
- Hongbo Zhu
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Edgardo Sepulveda
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Marcus D Hartmann
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Manjunatha Kogenaru
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Astrid Ursinus
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Eva Sulz
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Reinhard Albrecht
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Murray Coles
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Jörg Martin
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
20
|
Combining Design and Selection to Create Novel Protein-Peptide Interactions. Methods Enzymol 2016. [PMID: 27586335 DOI: 10.1016/bs.mie.2016.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The ability to design new protein-protein interactions (PPIs) has many applications in biotechnology and medicine. The goal of designed PPIs is to achieve both high affinity and specificity for the target protein. A great challenge in protein design is to identify such proteins from an enormous number of potential sequences. Many computational and experimental methods have been developed to contend with this challenge. Here we describe one particularly powerful approach-semirational design-that combines design and selection. This approach has been applied to generate new PPIs for many applications, including novel affinity reagents for protein detection/purification and bioorthogonal modules for synthetic biology (Jackrel, Valverde, & Regan, 2009; Sawyer et al., 2014; Speltz, Brown, Hajare, Schlieker, & Regan, 2015; Speltz, Nathan, & Regan, 2015).
Collapse
|
21
|
Protein stability: computation, sequence statistics, and new experimental methods. Curr Opin Struct Biol 2016; 33:161-8. [PMID: 26497286 DOI: 10.1016/j.sbi.2015.09.002] [Citation(s) in RCA: 112] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 09/22/2015] [Accepted: 09/24/2015] [Indexed: 11/22/2022]
Abstract
Calculating protein stability and predicting stabilizing mutations remain exceedingly difficult tasks, largely due to the inadequacy of potential functions, the difficulty of modeling entropy and the unfolded state, and challenges of sampling, particularly of backbone conformations. Yet, computational design has produced some remarkably stable proteins in recent years, apparently owing to near ideality in structure and sequence features. With caveats, computational prediction of stability can be used to guide mutation, and mutations derived from consensus sequence analysis, especially improved by recent co-variation filters, are very likely to stabilize without sacrificing function. The combination of computational and statistical approaches with library approaches, including new technologies such as deep sequencing and high throughput stability measurements, point to a very exciting near term future for stability engineering, even with difficult computational issues remaining.
Collapse
|
22
|
Biomolecular templating of functional hybrid nanostructures using repeat protein scaffolds. Biochem Soc Trans 2016; 43:825-31. [PMID: 26517889 DOI: 10.1042/bst20150077] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The precise synthesis of materials and devices with tailored complex structures and properties is a requisite for the development of the next generation of products based on nanotechnology. Nowadays, the technology for the generation of this type of devices lacks the precision to determine their properties and is accomplished mostly by 'trial and error' experimental approaches. The use of bottom-up approaches that rely on highly specific biomolecular interactions of small and simple components is an attractive approach for the templating of nanoscale elements. In nature, protein assemblies define complex structures and functions. Engineering novel bio-inspired assemblies by exploiting the same rules and interactions that encode the natural diversity is an emerging field that opens the door to create nanostructures with numerous potential applications in synthetic biology and nanotechnology. Self-assembly of biological molecules into defined functional structures has a tremendous potential in nano-patterning and the design of novel materials and functional devices. Molecular self-assembly is a process by which complex 3D structures with specified functions are constructed from simple molecular building blocks. Here we discuss the basis of biomolecular templating, the great potential of repeat proteins as building blocks for biomolecular templating and nano-patterning. In particular, we focus on the designed consensus tetratricopeptide repeats (CTPRs), the control on the assembly of these proteins into higher order structures and their potential as building blocks in order to generate functional nanostructures and materials.
Collapse
|
23
|
Abriata LA, Bovigny C, Dal Peraro M. Detection and sequence/structure mapping of biophysical constraints to protein variation in saturated mutational libraries and protein sequence alignments with a dedicated server. BMC Bioinformatics 2016; 17:242. [PMID: 27315797 PMCID: PMC4912743 DOI: 10.1186/s12859-016-1124-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 06/07/2016] [Indexed: 11/21/2022] Open
Abstract
Background Protein variability can now be studied by measuring high-resolution tolerance-to-substitution maps and fitness landscapes in saturated mutational libraries. But these rich and expensive datasets are typically interpreted coarsely, restricting detailed analyses to positions of extremely high or low variability or dubbed important beforehand based on existing knowledge about active sites, interaction surfaces, (de)stabilizing mutations, etc. Results Our new webserver PsychoProt (freely available without registration at http://psychoprot.epfl.ch or at http://lucianoabriata.altervista.org/psychoprot/index.html) helps to detect, quantify, and sequence/structure map the biophysical and biochemical traits that shape amino acid preferences throughout a protein as determined by deep-sequencing of saturated mutational libraries or from large alignments of naturally occurring variants. Discussion We exemplify how PsychoProt helps to (i) unveil protein structure-function relationships from experiments and from alignments that are consistent with structures according to coevolution analysis, (ii) recall global information about structural and functional features and identify hitherto unknown constraints to variation in alignments, and (iii) point at different sources of variation among related experimental datasets or between experimental and alignment-based data. Remarkably, metabolic costs of the amino acids pose strong constraints to variability at protein surfaces in nature but not in the laboratory. This and other differences call for caution when extrapolating results from in vitro experiments to natural scenarios in, for example, studies of protein evolution. Conclusion We show through examples how PsychoProt can be a useful tool for the broad communities of structural biology and molecular evolution, particularly for studies about protein modeling, evolution and design. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1124-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Luciano A Abriata
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, and Swiss Institute of Bioinformatics, AAB014 Station 19, Lausanne, 1015, Switzerland.
| | - Christophe Bovigny
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, and Swiss Institute of Bioinformatics, AAB014 Station 19, Lausanne, 1015, Switzerland.,Present address: Molecular Modeling Group, Swiss Institute of Bioinformatics, UNIL, Bâtiment Génopode, Lausanne, 1015, Switzerland
| | - Matteo Dal Peraro
- Laboratory for Biomolecular Modeling, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, and Swiss Institute of Bioinformatics, AAB014 Station 19, Lausanne, 1015, Switzerland
| |
Collapse
|
24
|
Abstract
A popular and successful strategy in semi-rational design of protein stability is the use of evolutionary information encapsulated in homologous protein sequences. Consensus design is based on the hypothesis that at a given position, the respective consensus amino acid contributes more than average to the stability of the protein than non-conserved amino acids. Here, we review the consensus design approach, its theoretical underpinnings, successes, limitations and challenges, as well as providing a detailed guide to its application in protein engineering.
Collapse
Affiliation(s)
- Benjamin T Porebski
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Clayton, Victoria 3800, Australia Medical Research Council Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Ashley M Buckle
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Clayton, Victoria 3800, Australia
| |
Collapse
|
25
|
Designed Repeat Proteins as Building Blocks for Nanofabrication. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 940:61-81. [DOI: 10.1007/978-3-319-39196-0_4] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
26
|
Speltz EB, Nathan A, Regan L. Design of Protein-Peptide Interaction Modules for Assembling Supramolecular Structures in Vivo and in Vitro. ACS Chem Biol 2015; 10:2108-15. [PMID: 26131725 DOI: 10.1021/acschembio.5b00415] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Synthetic biology and protein origami both require protein building blocks that behave in a reliable, predictable fashion. In particular, we require protein interaction modules with known specificity and affinity. Here, we describe three designed TRAP (Tetratricopeptide Repeat Affinity Protein)-peptide interaction pairs that are functional in vivo. We show that each TRAP binds to its cognate peptide and exhibits low cross-reactivity with the peptides bound by the other TRAPs. In addition, we demonstrate that the TRAP-peptide interactions are functional in many cellular contexts. In extensions of these designs, we show that the binding affinity of a TRAP-peptide pair can be systematically varied. The TRAP-peptide pairs we present thus represent a powerful set of new building blocks that are suitable for a variety of applications.
Collapse
Affiliation(s)
- Elizabeth B. Speltz
- Department
of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511, United States
| | - Aparna Nathan
- Department
of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511, United States
| | - Lynne Regan
- Department
of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511, United States
- Department
of Chemistry, Yale University, New Haven, Connecticut 06511, United States
- Integrated Graduate Program in Physical and Engineering Biology, New Haven, Connecticut 06511, United States
| |
Collapse
|
27
|
Protein evolution analysis of S-hydroxynitrile lyase by complete sequence design utilizing the INTMSAlign software. Sci Rep 2015; 5:8193. [PMID: 25645341 PMCID: PMC4648443 DOI: 10.1038/srep08193] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Accepted: 01/12/2015] [Indexed: 01/05/2023] Open
Abstract
Development of software and methods for design of complete sequences of functional proteins could contribute to studies of protein engineering and protein evolution. To this end, we developed the INTMSAlign software, and used it to design functional proteins and evaluate their usefulness. The software could assign both consensus and correlation residues of target proteins. We generated three protein sequences with S-selective hydroxynitrile lyase (S-HNL) activity, which we call designed S-HNLs; these proteins folded as efficiently as the native S-HNL. Sequence and biochemical analysis of the designed S-HNLs suggested that accumulation of neutral mutations occurs during the process of S-HNLs evolution from a low-activity form to a high-activity (native) form. Taken together, our results demonstrate that our software and the associated methods could be applied not only to design of complete sequences, but also to predictions of protein evolution, especially within families such as esterases and S-HNLs.
Collapse
|
28
|
Parmeggiani F, Huang PS, Vorobiev S, Xiao R, Park K, Caprari S, Su M, Seetharaman J, Mao L, Janjua H, Montelione GT, Hunt J, Baker D. A general computational approach for repeat protein design. J Mol Biol 2014; 427:563-75. [PMID: 25451037 PMCID: PMC4303030 DOI: 10.1016/j.jmb.2014.11.005] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Revised: 10/08/2014] [Accepted: 11/07/2014] [Indexed: 01/12/2023]
Abstract
Repeat proteins have considerable potential for use as modular binding reagents or biomaterials in biomedical and nanotechnology applications. Here we describe a general computational method for building idealized repeats that integrates available family sequences and structural information with Rosetta de novo protein design calculations. Idealized designs from six different repeat families were generated and experimentally characterized; 80% of the proteins were expressed and soluble and more than 40% were folded and monomeric with high thermal stability. Crystal structures determined for members of three families are within 1Å root-mean-square deviation to the design models. The method provides a general approach for fast and reliable generation of stable modular repeat protein scaffolds.
Collapse
Affiliation(s)
- Fabio Parmeggiani
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA; Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA; Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Sergey Vorobiev
- Department of Biological Sciences, Northeast Structural Genomics Consortium, Columbia University, New York, NY 10027, USA
| | - Rong Xiao
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry and Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Northeast Structural Genomics Consortium, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Keunwan Park
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA; Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Silvia Caprari
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Min Su
- Department of Biological Sciences, Northeast Structural Genomics Consortium, Columbia University, New York, NY 10027, USA
| | - Jayaraman Seetharaman
- Department of Biological Sciences, Northeast Structural Genomics Consortium, Columbia University, New York, NY 10027, USA
| | - Lei Mao
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry and Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Northeast Structural Genomics Consortium, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Haleema Janjua
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry and Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Northeast Structural Genomics Consortium, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry and Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Northeast Structural Genomics Consortium, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - John Hunt
- Department of Biological Sciences, Northeast Structural Genomics Consortium, Columbia University, New York, NY 10027, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA; Institute for Protein Design, University of Washington, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
29
|
Goyal VD, Yadav P, Kumar A, Ghosh B, Makde RD. Crystallization and preliminary X-ray crystallographic analysis of an artificial molten-globular-like triosephosphate isomerase protein of mixed phylogenetic origin. Acta Crystallogr F Struct Biol Commun 2014; 70:1521-5. [PMID: 25372821 PMCID: PMC4231856 DOI: 10.1107/s2053230x14020755] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 09/16/2014] [Indexed: 11/10/2022] Open
Abstract
A bioinformatics-based protein-engineering approach called consensus design led to the construction of a chimeric triosephosphate isomerase (TIM) protein called ccTIM (curated consensus TIM) which is as active as Saccharomyces cerevisiae TIM despite sharing only 58% sequence identity with it. The amino-acid sequence of this novel protein is as identical to native sequences from eukaryotes as to those from prokaryotes and shares some biophysical traits with a molten globular protein. Solving its crystal structure would help in understanding the physical implications of its bioinformatics-based sequence. In this report, the ccTIM protein was successfully crystallized using the microbatch-under-oil method and a full X-ray diffraction data set was collected to 2.2 Å resolution using a synchrotron-radiation source. The crystals belonged to space group C2221, with unit-cell parameters a=107.97, b=187.21, c=288.22 Å. Matthews coefficient calculations indicated the presence of six dimers in the asymmetric unit, with an approximate solvent content of 46.2%.
Collapse
Affiliation(s)
| | - Pooja Yadav
- High Pressure and Synchrotron Radiation Physics Division, Bhabha Atomic Research Centre, Trombay, Mumbai 400 085, India
| | - Ashwani Kumar
- High Pressure and Synchrotron Radiation Physics Division, Bhabha Atomic Research Centre, Trombay, Mumbai 400 085, India
| | - Biplab Ghosh
- High Pressure and Synchrotron Radiation Physics Division, Bhabha Atomic Research Centre, Trombay, Mumbai 400 085, India
| | - Ravindra D. Makde
- High Pressure and Synchrotron Radiation Physics Division, Bhabha Atomic Research Centre, Trombay, Mumbai 400 085, India
| |
Collapse
|
30
|
Parker R, Mercedes-Camacho A, Grove TZ. Consensus design of a NOD receptor leucine rich repeat domain with binding affinity for a muramyl dipeptide, a bacterial cell wall fragment. Protein Sci 2014; 23:790-800. [PMID: 24659515 DOI: 10.1002/pro.2461] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 03/18/2014] [Accepted: 03/20/2014] [Indexed: 12/19/2022]
Abstract
Repeat proteins have recently emerged as especially well-suited alternative binding scaffolds due to their modular architecture and biophysical properties. Here we present the design of a scaffold based on the consensus sequence of the leucine rich repeat (LRR) domain of the NOD family of cytoplasmic innate immune system receptors. Consensus sequence design has emerged as a protein design tool to create de novo proteins that capture sequence-structure relationships and interactions present in nature. The multiple sequence alignment of 311 individual LRRs, which are the putative ligand-recognition domain in NOD proteins, resulted in a consensus sequence protein containing two internal and N- and C-capping repeats named CLRR2. CLRR2 protein is a stable, monomeric, and cysteine free scaffold that without any affinity maturation displays micromolar binding to muramyl dipeptide, a bacterial cell wall fragment. To our knowledge, this is the first report of direct interaction of a NOD LRR with a physiologically relevant ligand.
Collapse
Affiliation(s)
- Rachael Parker
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia, 24060
| | | | | |
Collapse
|
31
|
Kim JS, Kim BH, Jang JI, Eom JS, Kim HG, Bang IS, Park YK. Functional insight from the tetratricopeptide repeat-like motifs of the type III secretion chaperone SicA in Salmonella enterica serovar Typhimurium. FEMS Microbiol Lett 2013; 350:146-53. [PMID: 24224875 DOI: 10.1111/1574-6968.12315] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2013] [Revised: 10/24/2013] [Accepted: 10/24/2013] [Indexed: 01/23/2023] Open
Abstract
SicA functions both as a class II chaperone for SipB and SipC of the type III secretion system (T3SS)-1 and as a transcriptional cofactor for the AraC-type transcription factor InvF in Salmonella enterica subsp. enterica serovar Typhimurium. Bioinformatic analysis has predicted that SicA possesses three tetratricopeptide repeat (TPR)-like motifs, which are important for protein-protein interactions and serve as multiprotein complex mediators. To investigate whether the TPR-like motifs in SicA are critical for its transcriptional cofactor function, the canonical residues in these motifs were mutated to glutamate (SicAA44E , SicAA78E , and SicAG112E ). None of these mutants except SicAA44E were able to activate the expression of the sipB and sigD genes. SicAA44E still has a capacity to interact with InvF in vitro, and despite its instability in cell, it could activate the sigDE operon. This suggests that TPR motifs are important for the transcriptional cofactor function of the SicA chaperone.
Collapse
Affiliation(s)
- Jin Seok Kim
- School of Life Sciences and Biotechnology, Korea University, Seoul, Korea
| | | | | | | | | | | | | |
Collapse
|
32
|
Proctor EA, Kota P, Demarest SJ, Caravella JA, Dokholyan NV. Highly covarying residues have a functional role in antibody constant domains. Proteins 2013; 81:884-95. [PMID: 23280585 DOI: 10.1002/prot.24247] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Revised: 12/05/2012] [Accepted: 12/14/2012] [Indexed: 01/25/2023]
Abstract
The ability to generate and design antibodies recognizing specific targets has revolutionized the pharmaceutical industry and medical imaging. Engineering antibody therapeutics in some cases requires modifying their constant domains to enable new and altered interactions. Engineering novel specificities into antibody constant domains has proved challenging due to the complexity of inter-domain interactions. Covarying networks of residues that tend to cluster on the protein surface and near binding sites have been identified in some proteins. However, the underlying role these networks play in the protein resulting in their conservation remains unclear in most cases. Resolving their role is crucial, because residues in these networks are not viable design targets if their role is to maintain the fold of the protein. Conversely, these networks of residues are ideal candidates for manipulating specificity if they are primarily involved in binding, such as the myriad interdomain interactions maintained within antibodies. Here, we identify networks of evolutionarily-related residues in C-class antibody domains by evaluating covariation, a measure of propensity with which residue pairs vary dependently during evolution. We computationally test whether mutation of residues in these networks affects stability of the folded antibody domain, determining their viability as design candidates. We find that members of covarying networks cluster at domain-domain interfaces, and that mutations to these residues are diverse and frequent during evolution, precluding their importance to domain stability. These results indicate that networks of covarying residues exist in antibody domains for functional reasons unrelated to thermodynamic stability, making them ideal targets for antibody design.
Collapse
Affiliation(s)
- Elizabeth A Proctor
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599-7260, USA
| | | | | | | | | |
Collapse
|
33
|
Sawyer N, Chen J, Regan L. All repeats are not equal: a module-based approach to guide repeat protein design. J Mol Biol 2013; 425:1826-1838. [PMID: 23434848 DOI: 10.1016/j.jmb.2013.02.013] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2012] [Revised: 02/11/2013] [Accepted: 02/12/2013] [Indexed: 12/30/2022]
Abstract
Repeat proteins composed of tandem arrays of a short structural motif often mediate protein-protein interactions. Past efforts to design repeat protein-based molecular recognition tools have focused on the creation of templates from the consensus of individual repeats, regardless of their natural context. Such an approach assumes that all repeats are essentially equivalent. In this study, we present the results of a "module-based" approach in which modules composed of tandem repeats are aligned to identify repeat-specific features. Using this approach to analyze tetratricopeptide repeat modules that contain three tandem repeats (3TPRs), we identify two classes of 3TPR modules with distinct structural signatures that are correlated with different sets of functional residues. Our analyses also reveal a high degree of correlation between positions across the entire ligand-binding surface, indicative of a coordinated, coevolving binding surface. Extension of our analyses to different repeat protein modules reveals more examples of repeat-specific features, especially in armadillo repeat modules. In summary, the module-based analyses that we present effectively capture key repeat-specific features that will be important to include in future repeat protein design templates.
Collapse
Affiliation(s)
- Nicholas Sawyer
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA.,Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA
| | - Jieming Chen
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA.,Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA
| | - Lynne Regan
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA.,Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA.,Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA.,Department of Chemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA
| |
Collapse
|
34
|
Durani V, Magliery TJ. Protein engineering and stabilization from sequence statistics: variation and covariation analysis. Methods Enzymol 2013; 523:237-56. [PMID: 23422433 DOI: 10.1016/b978-0-12-394292-0.00011-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The concepts of consensus and correlation in multiple sequence alignments (MSAs) have been used in the past to understand and engineer proteins. However, there are multiple ways of acquiring MSA databases and also numerous mathematical metrics that can be applied to calculate each of the parameters. This chapter describes an overall methodology that we have chosen to employ for acquiring and statistically analyzing MSAs. We have provided a step-by-step protocol for calculating relative entropy and mutual information metrics and describe how they can be used to predict mutations that have a high probability of stabilizing a protein. This protocol allows for flexibility for modification of formulae and parameters without using anything more complicated than Microsoft Excel. We have also demonstrated various aspects of data analysis by carrying out a sample analysis on the BPTI-Kunitz family of proteins and identified mutations that would be predicted to stabilize this protein based on consensus and correlation values.
Collapse
Affiliation(s)
- Venuka Durani
- Department of Chemistry, The Ohio State University, Columbus, Ohio, USA
| | | |
Collapse
|
35
|
Rockenbauch U, Ritz AM, Sacristan C, Roncero C, Spang A. The complex interactions of Chs5p, the ChAPs, and the cargo Chs3p. Mol Biol Cell 2012; 23:4402-15. [PMID: 23015758 PMCID: PMC3496614 DOI: 10.1091/mbc.e11-12-1015] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The exomer complex, consisting of ChAPs and Chs5p, exports specialized cargoes from the TGN. ChAPs bind to Chs5p through TPR repeats, whereas cargo specificity of the ChAPs is outside these interaction modules. Chs3p and Chs6p may require a complex interaction to form a complex. The exomer complex is a putative vesicle coat required for the direct transport of a subset of cargoes from the trans-Golgi network (TGN) to the plasma membrane. Exomer comprises Chs5p and the ChAPs family of proteins (Chs6p, Bud7p, Bch1p, and Bch2p), which are believed to act as cargo receptors. In particular, Chs6p is required for the transport of the chitin synthase Chs3p to the bud neck. However, how the ChAPs associate with Chs5p and recognize cargo is not well understood. Using domain-switch chimeras of Chs6p and Bch2p, we show that four tetratricopeptide repeats (TPRs) are involved in interaction with Chs5p. Because these roles are conserved among the ChAPs, the TPRs are interchangeable among different ChAP proteins. In contrast, the N-terminal and the central parts of the ChAPs contribute to cargo specificity. Although the entire N-terminal domain of Chs6p is required for Chs3p export at all cell cycle stages, the central part seems to predominantly favor Chs3p export in small-budded cells. The cargo Chs3p probably also uses a complex motif for the interaction with Chs6, as the C-terminus of Chs3p interacts with Chs6p and is necessary, but not sufficient, for TGN export.
Collapse
|
36
|
Speck J, Hecky J, Tam HK, Arndt KM, Einsle O, Müller KM. Exploring the molecular linkage of protein stability traits for enzyme optimization by iterative truncation and evolution. Biochemistry 2012; 51:4850-67. [PMID: 22545913 DOI: 10.1021/bi2018738] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The stability of proteins is paramount for their therapeutic and industrial use and, thus, is a major task for protein engineering. Several types of chemical and physical stabilities are desired, and discussion revolves around whether each stability trait needs to be addressed separately and how specific and compatible stabilizing mutations act. We demonstrate a stepwise perturbation-compensation strategy, which identifies mutations rescuing the activity of a truncated TEM β-lactamase. Analyses relating structural stress with the external stresses of heat, denaturants, and proteases reveal our second-site suppressors as general stability centers that also improve the full-length enzyme. A library of lactamase variants truncated by 15 N-terminal and three C-terminal residues (Bla-NΔ15CΔ3) was subjected to activity selection and DNA shuffling. The resulting clone with the best in vivo performance harbored eight mutations, surpassed the full-length wild-type protein by 5.3 °C in T(m), displayed significantly higher catalytic activity at elevated temperatures, and showed delayed guanidine-induced denaturation. The crystal structure of this mutant was determined and provided insights into its stability determinants. Stepwise reconstitution of the N- and C-termini increased its thermal, denaturant, and proteolytic resistance successively, leading to a full-length enzyme with a T(m) increased by 15.3 °C and a half-denaturation concentration shifted from 0.53 to 1.75 M guanidinium relative to that of the wild type. These improvements demonstrate that iterative truncation-optimization cycles can exploit stability-trait linkages in proteins and are exceptionally suited for the creation of progressively stabilized variants and/or downsized proteins without the need for detailed structural or mechanistic information.
Collapse
Affiliation(s)
- Janina Speck
- Institute for Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | | | | | | | | | | |
Collapse
|
37
|
Evidence that the kinesin light chain domain contains tetratricopeptide repeat units. J Struct Biol 2012; 177:602-12. [DOI: 10.1016/j.jsb.2012.01.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2011] [Revised: 12/20/2011] [Accepted: 01/19/2012] [Indexed: 12/26/2022]
|
38
|
Itzhaki LS, Lowe AR. From artificial antibodies to nanosprings: the biophysical properties of repeat proteins. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 747:153-66. [PMID: 22949117 DOI: 10.1007/978-1-4614-3229-6_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
In this chapter we review recent studies of repeat proteins, a class of proteins consisting of tandem arrays of small structural motifs that stack approximately linearly to produce elongated structures. We discuss the observation that, despite lacking the long-range tertiary interactions that are thought to be the hallmark of globular protein stability, repeat proteins can be as stable and as co-orperatively folded as their globular counterparts. The symmetry inherent in the structures of repeat arrays, however, means there can be many partly folded species (whether it be intermediates or transition states) that have similar stabilities. Consequently they do have distinct folding properties compared with globular proteins and these are manifest in their behaviour both at equilibrium and under kinetic conditions. Thus, when studying repeat proteins one appears to be probing a moving target: a relatively small perturbation, by mutation for example, can result in a shift to a different intermediate or transition state. The growing literature on these proteins illustrates how their modular architecture can be adapted to a remarkable array of biological and physical roles, both in vivo and in vitro. Further, their simple architecture makes them uniquely amenable to redesign-of their stability, folding and function-promising exciting possibilities for future research.
Collapse
Affiliation(s)
- Laura S Itzhaki
- Department of Chemistry, University of Cambridge, Cambridge, UK.
| | | |
Collapse
|
39
|
Jiménez B, Ugwu F, Zhao R, Ortí L, Makhnevych T, Pineda-Lucena A, Houry WA. Structure of minimal tetratricopeptide repeat domain protein Tah1 reveals mechanism of its interaction with Pih1 and Hsp90. J Biol Chem 2011; 287:5698-709. [PMID: 22179618 DOI: 10.1074/jbc.m111.287458] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Tah1 and Pih1 are novel Hsp90 interactors. Tah1 acts as a cofactor of Hsp90 to stabilize Pih1. In yeast, Hsp90, Tah1, and Pih1 were found to form a complex that is required for ribosomal RNA processing through their effect on box C/D small nucleolar ribonucleoprotein assembly. Tah1 is a minimal tetratricopeptide repeat protein of 111 amino acid residues that binds to the C terminus of the Hsp90 molecular chaperone, whereas Pih1 consists of 344 residues of unknown fold. The NMR structure of Tah1 has been solved, and this structure shows the presence of two tetratricopeptide repeat motifs followed by a C helix and an unstructured region. The binding of Tah1 to Hsp90 is mediated by the EEVD C-terminal residues of Hsp90, which bind to a positively charged channel formed by Tah1. Five highly conserved residues, which form a two-carboxylate clamp that tightly interacts with the ultimate Asp-0 residue of the bound peptide, are also present in Tah1. Tah1 was found to bind to the C terminus of Pih1 through the C helix and the unstructured region. The C terminus of Pih1 destabilizes the protein in vitro and in vivo, whereas the binding of Tah1 to Pih1 allows for the formation of a stable complex. Based on our data, a model for an Hsp90-Tah1-Pih1 ternary complex is proposed.
Collapse
Affiliation(s)
- Beatriz Jiménez
- Structural Biochemistry Laboratory, Medicinal Chemistry Department, Centro de Investigación Príncipe Felipe, E-46012 Valencia, Spain
| | | | | | | | | | | | | |
Collapse
|
40
|
Triosephosphate Isomerase by Consensus Design: Dramatic Differences in Physical Properties and Activity of Related Variants. J Mol Biol 2011; 413:195-208. [DOI: 10.1016/j.jmb.2011.08.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2011] [Revised: 07/23/2011] [Accepted: 08/01/2011] [Indexed: 11/23/2022]
|
41
|
Cortajarena AL, Mochrie SGJ, Regan L. Modulating repeat protein stability: the effect of individual helix stability on the collective behavior of the ensemble. Protein Sci 2011; 20:1042-7. [PMID: 21495096 DOI: 10.1002/pro.638] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2011] [Accepted: 04/07/2011] [Indexed: 11/07/2022]
Abstract
Repeat proteins are tandem arrays of a small structural motif, in which tertiary structure is stabilized by interactions within a repeat and between neighboring repeats. Several studies have shown that this modular structure is manifest in modular thermodynamic properties. Specifically, the global stability of a repeat protein can be described by simple linear models, considering only two parameters: the stability of the individual repeated units (H) and the coupling interaction between the units (J). If the repeat units are identical, single values of H and J, together with the number of repeated units, is sufficient to completely describe the thermodynamic behavior of any protein within a series. In this work, we demonstrate how the global stability of a repeat protein can be changed, in a predictable fashion, by modifying only the H parameter. Taking a previously characterized series of consensus tetratricopeptide repeats (TPR) (CTPRa) proteins, we introduced mutations into the basic repeating unit, such that the stability of the individual repeat unit was increased, but its interaction with neighboring units was unchanged. In other words, we increased H but kept J constant. We demonstrated that the denaturation curves for a series of such repeat proteins can be fit and additional curves can be predicted by the one-dimensional Ising model in which only H has changed from the original fit for the CTPRa series. Our results show that we can significantly increase the stability of a repeat protein by rationally increasing the stability of the units (H), whereas the interaction between repeats (J) remains unchanged.
Collapse
Affiliation(s)
- Aitziber L Cortajarena
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | | | | |
Collapse
|
42
|
Magliery TJ, Lavinder JJ, Sullivan BJ. Protein stability by number: high-throughput and statistical approaches to one of protein science's most difficult problems. Curr Opin Chem Biol 2011; 15:443-51. [PMID: 21498105 DOI: 10.1016/j.cbpa.2011.03.015] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2011] [Revised: 03/18/2011] [Accepted: 03/18/2011] [Indexed: 01/24/2023]
Abstract
Most proteins are only barely stable, which impedes research, complicates therapeutic applications, and makes proteins susceptible to pathologically destabilizing mutations. Our ability to predict the thermodynamic consequences of even single point mutations is still surprisingly limited, and established methods of measuring stability are slow. Recent advances are bringing protein stability studies into the high-throughput realm. Some methods are based on inferential read-outs such as activity, proteolytic resistance or split-protein fragment reassembly. Other methods use miniaturization of direct measurements, such as intrinsic fluorescence, H/D exchange, cysteine reactivity, aggregation and hydrophobic dye binding (DSF). Protein engineering based on statistical analysis (consensus and correlated occurrences of amino acids) is promising, but much work remains to understand and implement these methods.
Collapse
Affiliation(s)
- Thomas J Magliery
- Department of Chemistry, The Ohio State University, 100 West 18th Avenue, Columbus, OH 43210, USA.
| | | | | |
Collapse
|
43
|
Jäckel C, Hilvert D. Biocatalysts by evolution. Curr Opin Biotechnol 2010; 21:753-9. [DOI: 10.1016/j.copbio.2010.08.008] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2010] [Revised: 08/15/2010] [Accepted: 08/19/2010] [Indexed: 11/28/2022]
|
44
|
Urvoas A, Guellouz A, Valerio-Lepiniec M, Graille M, Durand D, Desravines DC, van Tilbeurgh H, Desmadril M, Minard P. Design, Production and Molecular Structure of a New Family of Artificial Alpha-helicoidal Repeat Proteins (αRep) Based on Thermostable HEAT-like Repeats. J Mol Biol 2010; 404:307-27. [DOI: 10.1016/j.jmb.2010.09.048] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2010] [Revised: 09/15/2010] [Accepted: 09/21/2010] [Indexed: 01/07/2023]
|
45
|
Cortajarena AL, Liu TY, Hochstrasser M, Regan L. Designed proteins to modulate cellular networks. ACS Chem Biol 2010; 5:545-52. [PMID: 20020775 DOI: 10.1021/cb9002464] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A major challenge of protein design is to create useful new proteins that interact specifically with biological targets in living cells. Such binding modules have many potential applications, including the targeted perturbation of protein networks. As a general approach to create such modules, we designed a library with approximately 10(9) different binding specificities based on a small 3-tetratricopeptide repeat (TPR) motif framework. We employed a novel strategy, based on split GFP reassembly, to screen the library for modules with the desired binding specificity. Using this approach, we identified modules that bind tightly and specifically to Dss1, a small human protein that interacts with the tumor suppressor protein BRCA2. We showed that these modules also bind the yeast homologue of Dss1, Sem1. Furthermore, we demonstrated that these modules inhibit Sem1 activity in yeast. This strategy will be generally applicable to make novel genetically encoded tools for systems/synthetic biology applications.
Collapse
Affiliation(s)
| | - Tina Y. Liu
- Department of Molecular Biophysics & Biochemistry
| | | | - Lynne Regan
- Department of Molecular Biophysics & Biochemistry
- Department of Chemistry, Yale University, 266 Whitney Avenue, New Haven, Connecticut 06520
| |
Collapse
|
46
|
Jackrel ME, Cortajarena AL, Liu TY, Regan L. Screening libraries to identify proteins with desired binding activities using a split-GFP reassembly assay. ACS Chem Biol 2010; 5:553-62. [PMID: 20038141 DOI: 10.1021/cb900272j] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Designer protein modules, which bind specifically to a desired target, have numerous potential applications. One approach to creating such proteins is to construct and screen libraries. Here we present a detailed description of using a split-GFP reassembly assay to screen libraries and identify proteins with novel binding properties. Attractive features of the split-GFP based screen are the absence of false positives and the simplicity, robustness, and ease of automation of the screen. Here, we describe both the construction of a naive protein library, and screening of the library using the split-GFP assay to identify proteins that bind specifically to chosen peptide sequences.
Collapse
Affiliation(s)
| | | | - Tina Y. Liu
- Molecular Biophysics & Biochemistry, Yale University, New Haven, Connecticut 06520
| | - Lynne Regan
- Departments of Chemistry
- Molecular Biophysics & Biochemistry, Yale University, New Haven, Connecticut 06520
| |
Collapse
|
47
|
Grove TZ, Hands M, Regan L. Creating novel proteins by combining design and selection. Protein Eng Des Sel 2010; 23:449-55. [PMID: 20304973 PMCID: PMC2865361 DOI: 10.1093/protein/gzq015] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2010] [Revised: 02/15/2010] [Accepted: 02/20/2010] [Indexed: 11/13/2022] Open
Abstract
We present the results of combining design and selection to remodel a protein-peptide binding interface, using the peptide PTIEEVD and the TPR1 module interaction as our test case. We initially used the program Rosetta to interrogate possible TPR1 sequences compatible with binding the peptide PTIEEVD. Based on these results, we screened a small library of TPR1 variants, using a split GFP fluorescent assay to identify proteins that are able to bind to the PTIEEVD peptide. We discuss the similarities and differences between the modeling and selection results at each position. We show that a new 'consensus' TPR1, created based on the results of the sequences identified in the screen, indeed binds to the PTIEEVD peptide. These results demonstrate the utility of combining design and selection in a synergistic fashion to remodel protein recognition interfaces.
Collapse
Affiliation(s)
- Tijana Z. Grove
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven 06511, USA
| | - Michael Hands
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven 06511, USA
| | - Lynne Regan
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven 06511, USA
- Department of Chemistry, Yale University, New Haven 06511, USA
| |
Collapse
|
48
|
Blenner MA, Shur O, Szilvay GR, Cropek DM, Banta S. Calcium-induced folding of a beta roll motif requires C-terminal entropic stabilization. J Mol Biol 2010; 400:244-56. [PMID: 20438736 DOI: 10.1016/j.jmb.2010.04.056] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Revised: 04/22/2010] [Accepted: 04/27/2010] [Indexed: 11/30/2022]
Abstract
Beta roll motifs are associated with several proteins secreted by the type 1 secretion system (T1SS). Located just upstream of the C-terminal T1SS secretion signal, they are believed to act as calcium-induced switches that prevent folding before secretion. Bordetella pertussis adenylate cyclase (CyaA) toxin has five blocks of beta roll motifs (or repeats-in-toxin motifs) separated by linkers. The block V motif on its own has been reported to be non-responsive to calcium. Only when the N- and C-terminal linkers, or flanking groups, were fused did the motif bind calcium and fold. In an effort to understand the requirements for beta roll folding, we have truncated the N- and C-terminal flanks at several locations to determine the minimal essential sequences. Calcium-responsive beta roll folding occurred even in the absence of the natural N-terminal flank. The natural C-terminal flank could not be truncated without decreased calcium affinity and only partially truncated before losing calcium-responsiveness. Globular protein fusion at the C-terminus likewise enabled calcium-induced folding but fusions solely at the N-terminus failed. This demonstrates that calcium-induced folding is an inherent property of the beta roll motif rather than the flanking groups. Given the disparate nature of the observed functional flanking groups, C-terminal fusions appear to confer calcium-responsiveness to the beta roll motif via a non-specific mechanism, suggesting that entropic stabilization of the unstructured C-terminus can enable beta roll folding. Increased calcium affinity was observed when the natural C-terminal flank was used to enable calcium-induced folding, pointing to its cooperative participation in beta roll formation. This work indicates that a general principle of C-terminal entropic stabilization can enable stimulus-responsive repeat protein folding, while the C-terminal flank has a specific role in tuning calcium-responsive beta roll formation. These observations are in stark contrast to what has been reported for other repeat proteins.
Collapse
Affiliation(s)
- Mark A Blenner
- Department of Chemical Engineering, Columbia University, 500 W 120th Street, New York, NY 10027, USA
| | | | | | | | | |
Collapse
|
49
|
Jäckel C, Bloom JD, Kast P, Arnold FH, Hilvert D. Consensus protein design without phylogenetic bias. J Mol Biol 2010; 399:541-6. [PMID: 20433850 DOI: 10.1016/j.jmb.2010.04.039] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2010] [Revised: 04/20/2010] [Accepted: 04/22/2010] [Indexed: 11/17/2022]
Abstract
Consensus design is an appealing strategy for the stabilization of proteins. It exploits amino acid conservation in sets of homologous proteins to identify likely beneficial mutations. Nevertheless, its success depends on the phylogenetic diversity of the sequence set available. Here, we show that randomization of a single protein represents a reliable alternative source of sequence diversity that is essentially free of phylogenetic bias. A small number of functional protein sequences selected from binary-patterned libraries suffice as input for the consensus design of active enzymes that are easier to produce and substantially more stable than individual members of the starting data set. Although catalytic activity correlates less consistently with sequence conservation in these extensively randomized proteins, less extreme mutagenesis strategies might be adopted in practice to augment stability while maintaining function.
Collapse
Affiliation(s)
- Christian Jäckel
- Laboratory of Organic Chemistry, ETH Zurich, Wolfgang-Pauli-Strasse 10, 8093 Zurich, Switzerland
| | | | | | | | | |
Collapse
|
50
|
Vural A, Oner S, An N, Simon V, Ma D, Blumer JB, Lanier SM. Distribution of activator of G-protein signaling 3 within the aggresomal pathway: role of specific residues in the tetratricopeptide repeat domain and differential regulation by the AGS3 binding partners Gi(alpha) and mammalian inscuteable. Mol Cell Biol 2010; 30:1528-40. [PMID: 20065032 PMCID: PMC2832490 DOI: 10.1128/mcb.01018-09] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2009] [Revised: 09/10/2009] [Accepted: 12/30/2009] [Indexed: 11/20/2022] Open
Abstract
AGS3, a receptor-independent activator of G-protein signaling, is involved in unexpected functional diversity for G-protein signaling systems. AGS3 has seven tetratricopeptide (TPR) motifs upstream of four G-protein regulatory (GPR) motifs that serve as docking sites for Gialpha-GDP. The positioning of AGS3 within the cell and the intramolecular dynamics between different domains of the proteins are likely key determinants of their ability to influence G-protein signaling. We report that AGS3 enters into the aggresome pathway and that distribution of the protein is regulated by the AGS3 binding partners Gialpha and mammalian Inscuteable (mInsc). Gialpha rescues AGS3 from the aggresome, whereas mInsc augments the aggresome-like distribution of AGS3. The distribution of AGS3 to the aggresome is dependent upon the TPR domain, and it is accelerated by disruption of the TPR organizational structure or introduction of a nonsynonymous single-nucleotide polymorphism. These data present AGS3, G-proteins, and mInsc as candidate proteins involved in regulating cellular stress associated with protein-processing pathologies.
Collapse
Affiliation(s)
- Ali Vural
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Sadik Oner
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Ningfei An
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Violaine Simon
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Dzwokai Ma
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Joe B. Blumer
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| | - Stephen M. Lanier
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, South Carolina 29425, Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, California 93106
| |
Collapse
|