1
|
Ren C, Wen X, Mencius J, Quan S. Selection and screening strategies in directed evolution to improve protein stability. BIORESOUR BIOPROCESS 2019. [DOI: 10.1186/s40643-019-0288-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
AbstractProtein stability is not only fundamental for experimental, industrial, and therapeutic applications, but is also the baseline for evolving novel protein functions. For decades, stability engineering armed with directed evolution has continued its rapid development and inevitably poses challenges. Generally, in directed evolution, establishing a reliable link between a genotype and any interpretable phenotype is more challenging than diversifying genetic libraries. Consequently, we set forth in a small picture to emphasize the screening or selection techniques in protein stability-directed evolution to secure the link. For a more systematic review, two main branches of these techniques, namely cellular or cell-free display and stability biosensors, are expounded with informative examples.
Collapse
|
2
|
Schellenberg MJ, Petrovich RM, Malone CC, Williams RS. Selectable high-yield recombinant protein production in human cells using a GFP/YFP nanobody affinity support. Protein Sci 2018; 27:1083-1092. [PMID: 29577475 DOI: 10.1002/pro.3409] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 03/18/2018] [Accepted: 03/21/2018] [Indexed: 01/10/2023]
Abstract
Recombinant protein expression systems that produce high yields of pure proteins and multi-protein complexes are essential to meet the needs of biologists, biochemists, and structural biologists using X-ray crystallography and cryo-electron microscopy. An ideal expression system for recombinant human proteins is cultured human cells where the correct translation and chaperone machinery are present. However, compared to bacterial expression systems, human cell cultures present several technical challenges to their use as an expression system. We developed a method that utilizes a YFP fusion-tag to generate recombinant proteins using suspension-cultured HEK293F cells. YFP is a dual-function tag that enables direct visualization and fluorescence-based selection of high expressing clones for and rapid purification using a high-stringency, high-affinity anti-GFP/YFP nanobody support. We demonstrate the utility of this system by expressing two large human proteins, TOP2α (340 KDa dimer) and a TOP2β catalytic core (260 KDa dimer). This robustly and reproducibly yields >10 mg/L liter of cell culture using transient expression or 2.5 mg/L using stable expression.
Collapse
Affiliation(s)
- Matthew J Schellenberg
- Structural Cell Biology Group, Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Department of Health and Human Services, Research Triangle Park, North Carolina, 27709
| | - Robert M Petrovich
- Structural Cell Biology Group, Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Department of Health and Human Services, Research Triangle Park, North Carolina, 27709
| | - Christine C Malone
- Structural Cell Biology Group, Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Department of Health and Human Services, Research Triangle Park, North Carolina, 27709
| | - R Scott Williams
- Structural Cell Biology Group, Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, US National Institutes of Health, Department of Health and Human Services, Research Triangle Park, North Carolina, 27709
| |
Collapse
|
3
|
Boone M, De Koker A, Callewaert N. Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction. Nucleic Acids Res 2018; 46:2701-2721. [PMID: 29514322 PMCID: PMC5888575 DOI: 10.1093/nar/gky167] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 02/05/2018] [Accepted: 02/23/2018] [Indexed: 12/14/2022] Open
Abstract
All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application.
Collapse
Affiliation(s)
- Morgane Boone
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| | - Andries De Koker
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| | - Nico Callewaert
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| |
Collapse
|
4
|
Sachsenhauser V, Bardwell JC. Directed evolution to improve protein folding in vivo. Curr Opin Struct Biol 2018; 48:117-123. [PMID: 29278775 PMCID: PMC5880552 DOI: 10.1016/j.sbi.2017.12.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 12/13/2017] [Indexed: 02/06/2023]
Abstract
Recently, several innovative approaches have been developed that allow one to directly screen or select for improved protein folding in the cellular context. These methods have the potential of not just leading to a better understanding of the in vivo folding process, they may also allow for improved production of proteins of biotechnological interest.
Collapse
Affiliation(s)
- Veronika Sachsenhauser
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, 830 N. University, Ann Arbor, MI 48109, USA
| | - James Ca Bardwell
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, 830 N. University, Ann Arbor, MI 48109, USA; Howard Hughes Medical Institute, University of Michigan, 830 N. University, Ann Arbor, MI 48109, USA.
| |
Collapse
|
5
|
Molecular evolution of versatile derivatives from a GFP-like protein in the marine copepod Chiridius poppei. PLoS One 2017; 12:e0181186. [PMID: 28700734 PMCID: PMC5507436 DOI: 10.1371/journal.pone.0181186] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2017] [Accepted: 06/26/2017] [Indexed: 12/20/2022] Open
Abstract
Fluorescent proteins are now indispensable tools in molecular research. They have also been adapted for a wide variety of uses in cases involving creative applications, including textiles, aquarium fish, and ornamental plants. Our colleagues have previously cloned a yellow GFP-like protein derived from the marine copepod Chiridius poppei (YGFP), and moreover, succeeded in generating transgenic flowers with clearly visible fluorescence, without the need for high-sensitivity imaging equipment. However, due to the low Stokes shift of YGFP (10 nm), it is difficult to separate emitted light of a labeled object from the light used for excitation; hence, limitations for various applications remain. In this study, which was aimed at developing YGFP mutants with increased Stokes shifts, we conducted stepwise molecular evolution experiments on YGFP by screening random mutations at three key amino acids, based on their fluorescent characteristics and structural stabilities, followed by optimization of their fluorescence output by DNA shuffling of the entire coding sequence. We successfully identified an eYGFPuv that had an excitation maximum in UV wavelengths and a 24-fold increase in fluorescence intensity compared to the previously reported YGFP mutant (H52D). In addition, eYGFPuv exhibited almost 9-fold higher fluorescence intensity compared to the commercially available GFPuv when expressed in human colon carcinoma HCT116 cells and without any differences in cytotoxicity. Thus, this novel mutant with the desirable characteristics of bright fluorescence, long Stokes shift, and low cytotoxity, may be particularly well suited to a variety of molecular and biological applications.
Collapse
|
6
|
Keller H, Kiosze K, Sachsenweger J, Haumann S, Ohlenschläger O, Nuutinen T, Syväoja JE, Görlach M, Grosse F, Pospiech H. The intrinsically disordered amino-terminal region of human RecQL4: multiple DNA-binding domains confer annealing, strand exchange and G4 DNA binding. Nucleic Acids Res 2014; 42:12614-27. [PMID: 25336622 PMCID: PMC4227796 DOI: 10.1093/nar/gku993] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Revised: 09/20/2014] [Accepted: 10/06/2014] [Indexed: 01/01/2023] Open
Abstract
Human RecQL4 belongs to the ubiquitous RecQ helicase family. Its N-terminal region represents the only homologue of the essential DNA replication initiation factor Sld2 of Saccharomyces cerevisiae, and also participates in the vertebrate initiation of DNA replication. Here, we utilized a random screen to identify N-terminal fragments of human RecQL4 that could be stably expressed in and purified from Escherichia coli. Biophysical characterization of these fragments revealed that the Sld2 homologous RecQL4 N-terminal domain carries large intrinsically disordered regions. The N-terminal fragments were sufficient for the strong annealing activity of RecQL4. Moreover, this activity appeared to be the basis for an ATP-independent strand exchange activity. Both activities relied on multiple DNA-binding sites with affinities to single-stranded, double-stranded and Y-structured DNA. Finally, we found a remarkable affinity of the N-terminus for guanine quadruplex (G4) DNA, exceeding the affinities for other DNA structures by at least 60-fold. Together, these findings suggest that the DNA interactions mediated by the N-terminal region of human RecQL4 represent a central function at the replication fork. The presented data may also provide a mechanistic explanation for the role of elements with a G4-forming propensity identified in the vicinity of vertebrate origins of DNA replication.
Collapse
Affiliation(s)
- Heidi Keller
- Research Group Biochemistry, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Kristin Kiosze
- Research Group Biochemistry, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Juliane Sachsenweger
- Research Group Biochemistry, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Sebastian Haumann
- Research Group Biomolecular NMR Spectroscopy, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Oliver Ohlenschläger
- Research Group Biomolecular NMR Spectroscopy, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Tarmo Nuutinen
- Department of Biology, University of Eastern Finland, PO Box 111, FI-80101 Joensuu, Finland
| | - Juhani E Syväoja
- Institute of Biomedicine, University of Eastern Finland, PO Box 1627, FI-70211 Kuopio, Finland
| | - Matthias Görlach
- Research Group Biomolecular NMR Spectroscopy, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany
| | - Frank Grosse
- Research Group Biochemistry, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany Center for Molecular Biomedicine, Friedrich-Schiller University, Jena, Germany
| | - Helmut Pospiech
- Research Group Biochemistry, Leibniz Institute for Age Research-Fritz Lipmann Institute, Beutenbergstrasse 11, D-07745 Jena, Germany Faculty of Biochemistry and Molecular Medicine, PO Box 5000, FI-90014 University of Oulu, Finland
| |
Collapse
|
7
|
Bulloch EM, Kingston RL. Identifying protein domains by global analysis of soluble fragment data. Anal Biochem 2014; 465:53-62. [DOI: 10.1016/j.ab.2014.06.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Revised: 06/17/2014] [Accepted: 06/25/2014] [Indexed: 01/11/2023]
|
8
|
Overman RC, Green I, Truman CM, Read JA, Embrey KJ, McAlister MSB, Attwood TK. Stability and solubility engineering of the EphB4 tyrosine kinase catalytic domain using a rationally designed synthetic library. Protein Eng Des Sel 2013; 26:695-704. [DOI: 10.1093/protein/gzt032] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
9
|
Hart DJ, Waldo GS. Library methods for structural biology of challenging proteins and their complexes. Curr Opin Struct Biol 2013; 23:403-8. [PMID: 23602357 DOI: 10.1016/j.sbi.2013.03.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Revised: 03/18/2013] [Accepted: 03/19/2013] [Indexed: 02/08/2023]
Abstract
Genetic engineering of constructs to improve solubility or stability is a common approach, but it is often unclear how to obtain improvements. When the domain composition of a target is poorly understood, or if there are insufficient structure data to guide sited directed mutagenesis, long iterative phases of subcloning or mutation and expression often prove unsuccessful despite much effort. Random library approaches can offer a solution to this problem and involve construction of large libraries of construct variants that are analysed via screens or selections for the desired phenotype. Huge improvements in construct behaviour can be achieved rapidly with no requirement for prior knowledge of the target. Here we review the development of these experimental strategies and recent successes.
Collapse
Affiliation(s)
- Darren J Hart
- EMBL Grenoble Outstation and Unit of Virus Host-Cell Interactions, UMI3265 UJF-EMBL-CNRS, Grenoble, France.
| | | |
Collapse
|
10
|
Abstract
Molecular chaperones assist de novo protein folding and facilitate the refolding of stress-denatured proteins. The molecular chaperone concept was coined nearly 35 years ago, and since then, tremendous strides have been made in understanding how these factors support protein folding. Here, we focus on how various chaperone proteins were first identified to play roles in protein folding. Examples are used to illustrate traditional routes of chaperone discovery and point out their advantages and limitations. Recent advances, including the development of folding biosensors and promising methods for the stabilization of proteins in vivo, provide new routes for chaperone discovery.
Collapse
Affiliation(s)
- Shu Quan
- Department of Molecular, Cellular, and Developmental Biology, Howard Hughes Medical Institute, University of Michigan, Ann Arbor, MI, USA.
| | | |
Collapse
|
11
|
Saha S, Pollard TD. Characterization of structural and functional domains of the anillin-related protein Mid1p that contribute to cytokinesis in fission yeast. Mol Biol Cell 2012; 23:3993-4007. [PMID: 22918954 PMCID: PMC3469515 DOI: 10.1091/mbc.e12-07-0536] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Fission yeast cells depend on the anillin-related protein Mid1p for reliable cytokinesis. Insolubility limits the purification of full-length Mid1p for biophysical analysis, and lack of knowledge about the structural domains of Mid1p limits functional analysis. We addressed these limitations by identifying in a bacterial expression screen of random Mid1p fragments five soluble segments that can be purified and one insoluble segment. Using complementation experiments in Δmid1 cells, we tested the biological functions of these six putative domains that account for full-length Mid1p. The N-terminal domain (residues 1-149) is essential for correct positioning and orientation of septa. The third domain (residues 309-452) allows the construct composed of the first three domains (residues 1-452) to form hydrodynamically well-behaved octamers. Constructs consisting of residues 1-452 or 1-578 carry out most functions of full-length Mid1p, including concentration at the equatorial cortex in nodes that accumulate myosin-II and other contractile ring proteins during mitosis. However, cells depending on these constructs without the insoluble domain (residues 579-797) form equatorially located rings slowly from strands rather than by direct condensation of nodes. We conclude that residues 1-578 assemble node components myosin-II, Rng2p, and Cdc15p, and the insoluble domain facilitates the normal, efficient condensation of nodes into rings.
Collapse
Affiliation(s)
- Shambaditya Saha
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8103, USA
| | | |
Collapse
|
12
|
Yumerefendi H, Desravines DC, Hart DJ. Library-based methods for identification of soluble expression constructs. Methods 2011; 55:38-43. [DOI: 10.1016/j.ymeth.2011.06.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2011] [Revised: 06/09/2011] [Accepted: 06/11/2011] [Indexed: 01/10/2023] Open
|
13
|
ORF-selector ESPRIT: A second generation library screen for soluble protein expression employing precise open reading frame selection. J Struct Biol 2011; 175:189-97. [DOI: 10.1016/j.jsb.2011.04.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2010] [Revised: 04/09/2011] [Accepted: 04/10/2011] [Indexed: 12/24/2022]
|
14
|
Pedelacq JD, Nguyen HB, Cabantous S, Mark BL, Listwan P, Bell C, Friedland N, Lockard M, Faille A, Mourey L, Terwilliger TC, Waldo GS. Experimental mapping of soluble protein domains using a hierarchical approach. Nucleic Acids Res 2011; 39:e125. [PMID: 21771856 PMCID: PMC3185438 DOI: 10.1093/nar/gkr548] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Exploring the function and 3D space of large multidomain protein targets often requires sophisticated experimentation to obtain the targets in a form suitable for structure determination. Screening methods capable of selecting well-expressed, soluble fragments from DNA libraries exist, but require the use of automation to maximize chances of picking a few good candidates. Here, we describe the use of an insertion dihydrofolate reductase (DHFR) vector to select in-frame fragments and a split-GFP assay technology to filter-out constructs that express insoluble protein fragments. With the incorporation of an IPCR step to create high density, focused sublibraries of fragments, this cost-effective method can be performed manually with no a priori knowledge of domain boundaries while permitting single amino acid resolution boundary mapping. We used it on the well-characterized p85α subunit of the phosphoinositide-3-kinase to demonstrate the robustness and efficiency of our methodology. We then successfully tested it onto the polyketide synthase PpsC from Mycobacterium tuberculosis, a potential drug target involved in the biosynthesis of complex lipids in the cell envelope. X-ray quality crystals from the acyl-transferase (AT), dehydratase (DH) and enoyl-reductase (ER) domains have been obtained.
Collapse
Affiliation(s)
- Jean-Denis Pedelacq
- CNRS; IPBS (Institut de Pharmacologie et de Biologie Structurale), 205 route de Narbonne, F-31077 Toulouse, France.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Lockard MA, Listwan P, Pedelacq JD, Cabantous S, Nguyen HB, Terwilliger TC, Waldo GS. A high-throughput immobilized bead screen for stable proteins and multi-protein complexes. Protein Eng Des Sel 2011; 24:565-78. [PMID: 21642284 PMCID: PMC3118733 DOI: 10.1093/protein/gzr021] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We describe an in vitro colony screen to identify Escherichia coli expressing soluble proteins and stable, assembled multiprotein complexes. Proteins with an N-terminal 6His tag and C-terminal green fluorescent protein (GFP) S11 tag are fluorescently labeled in cells by complementation with a coexpressed GFP 1-10 fragment. After partial colony lysis, the fluorescent soluble proteins or complexes diffuse through a supporting filtration membrane and are captured on Talon(®) resin metal affinity beads immobilized in agarose. Images of the fluorescent colonies convey total expression and the level of fluorescence bound to the beads indicates how much protein is soluble. Both pieces of information can be used together when selecting clones. After the assay, colonies can be picked and propagated, eliminating the need to make replica plates. We used the method to screen a DNA fragment library of the human protein p85 and preferentially obtained clones expressing the full-length 'breakpoint cluster region-homology' and NSH2 domains. The assay also distinguished clones expressing stable multi-protein complexes from those that are unstable due to missing subunits. Clones expressing stable, intact heterotrimeric E.coli YheNML complexes were readily identified in libraries dominated by complexes of YheML missing the N subunit.
Collapse
Affiliation(s)
- Meghan A Lockard
- Biosciences Division, MS-M888, Los Alamos National Laboratory, PO Box 1663, Los Alamos, NM 87545, USA
| | | | | | | | | | | | | |
Collapse
|
16
|
A combinatorial method to enable detailed investigation of protein–protein interactions. Future Med Chem 2011; 3:271-82. [DOI: 10.4155/fmc.10.289] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Background: Successful structural investigations of protein–protein interactions can be facilitated by studying only the core interacting regions of the constituent proteins. However, attempting the discovery of stable core complexes using informed trial-and-error approaches can prove time and resource intensive. Methods: We describe a valuable extension of combinatorial domain hunting (CDH), a technology for the timely elucidation of soluble protein truncations. The new method, CDH2, enables empirical discovery of stable protein–protein core complexes. CDH2 is demonstrated ab initio using a previously well-characterized Hsp90/Cdc37 complex. Results: Without using a priori information, we demonstrate the isolation of stable protein–protein complexes, suitable for further analyses. Discussion: This resource-efficient process can provide protein complexes for screening of compounds designed to modulate protein–protein interactions, thus facilitating novel drug discovery.
Collapse
|
17
|
Sabogal A, Rio DC. A green fluorescent protein solubility screen in E. coli reveals domain boundaries of the GTP-binding domain in the P element transposase. Protein Sci 2011; 19:2210-8. [PMID: 20842711 DOI: 10.1002/pro.499] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Guanosine triphosphate (GTP) binding and hydrolysis events often act as molecular switches in proteins, modulating conformational changes between active and inactive states in many signaling molecules and transport systems. The P element transposase of Drosophila melanogaster requires GTP binding to proceed along its reaction pathway, following initial site-specific DNA binding. GTP binding is unique to P elements and may represent a novel form of transpositional regulation, allowing the bound transposase to find a second site, looping the transposon DNA for strand cleavage and excision. The GTP-binding activity has been previously mapped to the central portion of the transposase protein; however, the P element transposase contains little sequence identity with known GTP-binding folds. To identify soluble, active transposase domains, a GFP solubility screen was used testing the solubility of random P element gene fragments in E. coli. The screen produced a single clone spanning known GTP-binding residues in the central portion of the transposase coding region. This clone, amino acids 275-409 in the P element transposase, was soluble, highly expressed in E.coli and active for GTP-binding activity, therefore is a candidate for future biochemical and structural studies. In addition, the chimeric screen revealed a minimal N-terminal THAP DNA-binding domain attached to an extended leucine zipper coiled-coil dimerization domain in the P element transposase, precisely delineating the DNA-binding and dimerization activities on the primary sequence. This study highlights the use of a GFP-based solubility screen on a large multidomain protein to identify highly expressed, soluble truncated domain subregions.
Collapse
Affiliation(s)
- Alex Sabogal
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | | |
Collapse
|
18
|
An Y, Meresse P, Mas PJ, Hart DJ. CoESPRIT: a library-based construct screening method for identification and expression of soluble protein complexes. PLoS One 2011; 6:e16261. [PMID: 21364980 PMCID: PMC3043051 DOI: 10.1371/journal.pone.0016261] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2010] [Accepted: 12/13/2010] [Indexed: 11/29/2022] Open
Abstract
Structural and biophysical studies of protein complexes require multi-milligram quantities of soluble material. Subunits are often unstable when expressed separately so co-expression strategies are commonly employed since in vivo complex formation can provide stabilising effects. Defining constructs for subunit co-expression experiments is difficult if the proteins are poorly understood. Even more problematic is when subunit polypeptide chains co-fold since individually they do not have predictable domains. We have developed CoESPRIT, a modified version of the ESPRIT random library construct screen used previously on single proteins, to express soluble protein complexes. A random library of target constructs is screened against a fixed bait protein to identify stable complexes. In a proof-of-principle study, C-terminal fragments of the influenza polymerase PB2 subunit containing folded domains were isolated using importin alpha as bait. Separately, a C-terminal fragment of the PB1 subunit was used as bait to trap N-terminal fragments of PB2 resulting in co-folded complexes. Subsequent expression of the target protein without the bait indicates whether the target is independently stable, or co-folds with its partner. This highly automated method provides an efficient strategy for obtaining recombinant protein complexes at yields compatible with structural, biophysical and functional studies.
Collapse
Affiliation(s)
- Yingfeng An
- Grenoble Outstation, European Molecular Biology Laboratory, BP181, Grenoble, France
- Unit of Virus Host-Cell Interactions, UJF-EMBL-CNRS, UMI3265, Grenoble, France
| | - Patrick Meresse
- Unit of Virus Host-Cell Interactions, UJF-EMBL-CNRS, UMI3265, Grenoble, France
| | - Philippe J. Mas
- Grenoble Outstation, European Molecular Biology Laboratory, BP181, Grenoble, France
- Unit of Virus Host-Cell Interactions, UJF-EMBL-CNRS, UMI3265, Grenoble, France
| | - Darren J. Hart
- Grenoble Outstation, European Molecular Biology Laboratory, BP181, Grenoble, France
- Unit of Virus Host-Cell Interactions, UJF-EMBL-CNRS, UMI3265, Grenoble, France
- * E-mail:
| |
Collapse
|
19
|
Enhancing the Stability and Solubility of the Glucocorticoid Receptor Ligand-Binding Domain by High-Throughput Library Screening. J Mol Biol 2010; 403:562-77. [DOI: 10.1016/j.jmb.2010.08.048] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2010] [Revised: 08/20/2010] [Accepted: 08/24/2010] [Indexed: 11/23/2022]
|
20
|
Selection of soluble protein expression constructs: the experimental determination of protein domain boundaries. Biochem Soc Trans 2010; 38:908-13. [DOI: 10.1042/bst0380908] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Proteins can contain multiple domains each of which is capable of possessing a separate independent function and three-dimensional structure. It is often useful to clone and express individual protein domains to study their biochemical properties and for structure determination. However, the annotated domain boundaries in databases such as Pfam or SMART are not always accurate. The present review summarizes various strategies for the experimental determination of protein domain boundaries.
Collapse
|
21
|
Yumerefendi H, Tarendeau F, Mas PJ, Hart DJ. ESPRIT: an automated, library-based method for mapping and soluble expression of protein domains from challenging targets. J Struct Biol 2010; 172:66-74. [PMID: 20206698 DOI: 10.1016/j.jsb.2010.02.021] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2010] [Revised: 02/24/2010] [Accepted: 02/28/2010] [Indexed: 01/28/2023]
Abstract
Expression of sufficient quantities of soluble protein for structural biology and other applications is often a very difficult task, especially when multimilligram quantities are required. In order to improve yield, solubility or crystallisability of a protein, it is common to subclone shorter genetic constructs corresponding to single- or multi-domain fragments. However, it is not always clear where domain boundaries are located, especially when working on novel targets with little or no sequence similarity to other proteins. Several methods have been described employing aspects of directed evolution to the recombinant expression of challenging proteins. These combine the construction of a random library of genetic constructs of a target with a screening or selection process to identify solubly expressing protein fragments. Here we review several datasets from the ESPRIT (Expression of Soluble Proteins by Random Incremental Truncation) technology to provide a view on its capabilities. Firstly, we demonstrate how it functions using the well-characterised NF-kappaB p50 transcription factor as a model system. Secondly, application of ESPRIT to the challenging PB2 subunit of influenza polymerase has led to several novel atomic resolution structures; here we present an overview of the screening phase of that project. Thirdly, analysis of the human kinase TBK1 is presented to show how the ESPRIT technology rapidly addresses the compatibility of challenging targets with the Escherichia coli expression system.
Collapse
Affiliation(s)
- Hayretin Yumerefendi
- Unit of Virus Host-Cell Interactions, UJF-EMBL-CNRS, UMI 3265, 6 rue Jules Horowitz, Grenoble Cedex 9, France
| | | | | | | |
Collapse
|
22
|
Chen Y, Li S, Chen T, Hua H, Lin Z. Random dissection to select for protein split sites and its application in protein fragment complementation. Protein Sci 2009; 18:399-409. [PMID: 19165722 PMCID: PMC2708047 DOI: 10.1002/pro.42] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2008] [Revised: 10/23/2008] [Accepted: 11/25/2008] [Indexed: 11/11/2022]
Abstract
To identify protein split sites quickly, a selection procedure by using chloramphenicol acetyl transferase (CAT) as reporter was introduced to search for folded protein fragments from libraries generated by random digestion and reassembly of the target gene, which yielded an abundant amount of DNA fragments with controllable lengths. Experimental results of tryptophan synthase alpha subunit (TSalpha) and TEM-1 beta-lactamase agreed well with what the literature has reported. The solubility of these fragments correlated roughly with the minimum inhibitory concentrations of the CAT fusions. The application of this dissection protocol to protein fragment complementation assay (PCA) was evaluated using aminoglycoside-3'-phosphotransferase I (APH(3')-I) as a model protein. Three nearly bisectional sites and a number of possible split points were identified, and guided by this result, four novel pairs of fragments were tested for complementation. Three out of four pairs partially restored the APH activity with the help of leucine zippers, and a truncated but active APH(3')-I (Delta1-25) was also found. Finally, the weakly active APH(3')-I-(1-253)NZ/CZ (254-271) containing a short 18 residue tag was further improved by error-prone PCR, and a best mutant was obtained showing a fourfold improvement after just one round of evolution. These results demonstrate that protein random dissection based on the CAT selection can provide an efficient search for protein breakage points and guide the design of fragments for protein complementation assay. Furthermore, more active fragment pairs can be achieved with the classical directed evolution approach.
Collapse
Affiliation(s)
| | | | | | | | - Zhanglin Lin
- Department of Chemical Engineering, Tsinghua University1 Tsinghua Garden Road, Beijing 100084, China
| |
Collapse
|
23
|
Cabantous S, Rogers Y, Terwilliger TC, Waldo GS. New molecular reporters for rapid protein folding assays. PLoS One 2008; 3:e2387. [PMID: 18545698 PMCID: PMC2408556 DOI: 10.1371/journal.pone.0002387] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2008] [Accepted: 04/11/2008] [Indexed: 11/26/2022] Open
Abstract
The GFP folding reporter assay [1] uses a C-terminal GFP fusion to report on the folding success of upstream fused polypeptides. The GFP folding assay is widely-used for screening protein variants with improved folding and solubility [2]–[8], but truncation artifacts may arise during evolution, i.e. from de novo internal ribosome entry sites [9]. One way to reduce such artifacts would be to insert target genes within the scaffolding of GFP circular permuted variants. Circular permutants of fluorescent proteins often misfold and are non-fluorescent, and do not readily tolerate fused polypeptides within the fluorescent protein scaffolding [10]–[12]. To overcome these limitations, and to increase the dynamic range for reporting on protein misfolding, we have created eight GFP insertion reporters with different sensitivities to protein misfolding using chimeras of two previously described GFP variants, the GFP folding reporter [1] and the robustly-folding “superfolder” GFP [13]. We applied this technology to engineer soluble variants of Rv0113, a protein from Mycobacterium tuberculosis initially expressed as inclusion bodies in Escherichia coli. Using GFP insertion reporters with increasing stringency for each cycle of mutagenesis and selection led to a variant that produced large amounts of soluble protein at 37°C in Escherichia coli. The new reporter constructs discriminate against truncation artifacts previously isolated during directed evolution of Rv0113 using the original C-terminal GFP folding reporter. Using GFP insertion reporters with variable stringency should prove useful for engineering protein variants with improved folding and solubility, while reducing the number of artifacts arising from internal cryptic ribosome initiation sites.
Collapse
Affiliation(s)
- Stéphanie Cabantous
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | | | | | | |
Collapse
|
24
|
Dyson MR, Perera RL, Shadbolt SP, Biderman L, Bromek K, Murzina NV, McCafferty J. Identification of soluble protein fragments by gene fragmentation and genetic selection. Nucleic Acids Res 2008; 36:e51. [PMID: 18420658 PMCID: PMC2396403 DOI: 10.1093/nar/gkn151] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
We describe a new method, which identifies protein fragments for soluble expression in Escherichia coli from a randomly fragmented gene library. Inhibition of E. coli dihydrofolate reductase (DHFR) by trimethoprim (TMP) prevents growth, but this can be relieved by murine DHFR (mDHFR). Bacterial strains expressing mDHFR fusions with the soluble proteins green fluroscent protein (GFP) or EphB2 (SAM domain) displayed markedly increased growth rates with TMP compared to strains expressing insoluble EphB2 (TK domain) or ketosteroid isomerase (KSI). Therefore, mDHFR is affected by the solubility of fusion partners and can act as a reporter of soluble protein expression. Random fragment libraries of the transcription factor Fli1 were generated by deoxyuridine incorporation and endonuclease V cleavage. The fragments were cloned upstream of mDHFR and TMP resistant clones expressing soluble protein were identified. These were found to cluster around the DNA binding ETS domain. A selected Fli1 fragment was expressed independently of mDHFR and was judged to be correctly folded by various biophysical methods including NMR. Soluble fragments of the cell-surface receptor Pecam1 were also identified. This genetic selection method was shown to generate expression clones useful for both structural studies and antibody generation and does not require a priori knowledge of domain architecture.
Collapse
Affiliation(s)
- Michael R Dyson
- Department of Biochemistry, University of Cambridge, Downing Site, Cambridge CB2 1QW, UK.
| | | | | | | | | | | | | |
Collapse
|
25
|
Small-scale, semi-automated purification of eukaryotic proteins for structure determination. ACTA ACUST UNITED AC 2007; 8:153-66. [PMID: 17985212 PMCID: PMC2668602 DOI: 10.1007/s10969-007-9032-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2007] [Accepted: 10/16/2007] [Indexed: 11/07/2022]
Abstract
A simple approach that allows cost-effective automated purification of recombinant proteins in levels sufficient for functional characterization or structural studies is described. Studies with four human stem cell proteins, an engineered version of green fluorescent protein, and other proteins are included. The method combines an expression vector (pVP62K) that provides in vivo cleavage of an initial fusion protein, a factorial designed auto-induction medium that improves the performance of small-scale production, and rapid, automated metal affinity purification of His8-tagged proteins. For initial small-scale production screening, single colony transformants were grown overnight in 0.4 ml of auto-induction medium, produced proteins were purified using the Promega Maxwell 16, and purification results were analyzed by Caliper LC90 capillary electrophoresis. The yield of purified [U-15N]-His8-Tcl-1 was 7.5 μg/ml of culture medium, of purified [U-15N]-His8-GFP was 68 μg/ml, and of purified selenomethione-labeled AIA–GFP (His8 removed by treatment with TEV protease) was 172 μg/ml. The yield information obtained from a successful automated purification from 0.4 ml was used to inform the decision to scale-up for a second meso-scale (10–50 ml) cell growth and automated purification. 1H–15N NMR HSQC spectra of His8-Tcl-1 and of His8-GFP prepared from 50 ml cultures showed excellent chemical shift dispersion, consistent with well folded states in solution suitable for structure determination. Moreover, AIA–GFP obtained by proteolytic removal of the His8 tag was subjected to crystallization screening, and yielded crystals under several conditions. Single crystals were subsequently produced and optimized by the hanging drop method. The structure was solved by molecular replacement at a resolution of 1.7 Å. This approach provides an efficient way to carry out several key target screening steps that are essential for successful operation of proteomics pipelines with eukaryotic proteins: examination of total expression, determination of proteolysis of fusion tags, quantification of the yield of purified protein, and suitability for structure determination.
Collapse
|
26
|
Savva R, Prodromou C, Driscoll PC. DNA fragmentation based combinatorial approaches to soluble protein expression. Drug Discov Today 2007; 12:939-47. [DOI: 10.1016/j.drudis.2007.08.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2007] [Revised: 08/21/2007] [Accepted: 08/28/2007] [Indexed: 11/26/2022]
|
27
|
Prodromou C, Savva R, Driscoll PC. DNA fragmentation-based combinatorial approaches to soluble protein expression Part I. Generating DNA fragment libraries. Drug Discov Today 2007; 12:931-8. [PMID: 17993411 DOI: 10.1016/j.drudis.2007.08.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2007] [Revised: 08/16/2007] [Accepted: 08/21/2007] [Indexed: 11/28/2022]
Abstract
In addressing a new drug discovery target, the generation of tractable protein substrates for functional and structural analyses can represent a significant hurdle. Traditional approaches rely on protein expression trials of multiple variants in various systems, frequently with limited success. The increasing knowledge base derived from genomics and structural proteomics initiatives assists the bioinformatics-led design of these experiments. Nevertheless, for many eukaryotic polypeptides, particularly those with relatively few homologues, the generation of useful protein products can still be a major challenge. This review describes the basis of efforts to forge an alternative 'domain-hunting' paradigm, based upon combinatorial sampling of expression construct libraries derived by fragmentation of the encoding DNA template, namely the methods and considerations in generating fragment length DNA from target genes. An accompanying review focuses upon the expression screening of such combinatorial DNA libraries for the sampling of the corresponding set of protein fragments.
Collapse
Affiliation(s)
- Chrisostomos Prodromou
- Section of Structural Biology, Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Road, London SW3 6JB, United Kingdom
| | | | | |
Collapse
|
28
|
Kang J, Kang S, Yoo SH, Park S. Identification of residues participating in the interaction between an intraluminal loop of inositol 1,4,5-trisphosphate receptor and a conserved N-terminal region of chromogranin B. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2007; 1774:502-9. [PMID: 17395556 DOI: 10.1016/j.bbapap.2007.02.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Revised: 01/24/2007] [Accepted: 02/02/2007] [Indexed: 11/30/2022]
Abstract
The inositol 1,4,5-trisphosphate receptor (IP3R) is a membrane channel that conducts calcium ions from the intracellular calcium stores. Despite a wealth of information on the cytoplasmic regulation of the IP3R, little is known about its regulation on the luminal side of the calcium stores. Here, we report studies on the IP3R intraluminal loop L3-2 and a conserved N-terminal region of chromogranin B. The IP3R loop is an important part of the channel's pore-forming region, and the chromogranin peptide has been shown to competitively inhibit calcium signaling by IP3R. Using the NMR titration approach, we showed that a part of the L3-2 is involved in a specific interaction with the chromogranin B peptide. Further NMR resonance assignments revealed that the 14th-20th residues of L3-2 are the keys to the binding to the chromogranin B peptide. Through detailed analysis of the data, we suggest a mechanism of IP3R regulation by chromogranin B involving conformational exchanges of the L3-2 region. Our report presents the findings of the first study on the interaction between the luminal loop of the IP3 receptor and its regulator at residue-resolution. The approaches described here should help to guide further studies on the interactions between the IP3R and other luminal side regulators.
Collapse
Affiliation(s)
- Jinho Kang
- Department of Biochemistry and Center for Advanced Medical Education by BK21 Project, School of Medicine, Inha University, Shinheung-Dong, Chung-Gu, Incheon, Korea
| | | | | | | |
Collapse
|
29
|
Pfeffer J, Rusnak M, Hansen CE, Rhlid RB, Schmid RD, Maurer SC. Functional expression of lipase A from Candida antarctica in Escherichia coli—A prerequisite for high-throughput screening and directed evolution. ACTA ACUST UNITED AC 2007. [DOI: 10.1016/j.molcatb.2006.11.006] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
30
|
Reich S, Puckey LH, Cheetham CL, Harris R, Ali AAE, Bhattacharyya U, Maclagan K, Powell KA, Prodromou C, Pearl LH, Driscoll PC, Savva R. Combinatorial Domain Hunting: An effective approach for the identification of soluble protein domains adaptable to high-throughput applications. Protein Sci 2007; 15:2356-65. [PMID: 17008718 PMCID: PMC2242398 DOI: 10.1110/ps.062082606] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Exploitation of potential new targets for drug and vaccine development has an absolute requirement for multimilligram quantities of soluble protein. While recombinant expression of full-length proteins is frequently problematic, high-yield soluble expression of functional subconstructs is an effective alternative, so long as appropriate termini can be identified. Bioinformatics localizes domains, but doesn't predict boundaries with sufficient accuracy, so that subconstructs are typically found by trial and error. Combinatorial Domain Hunting (CDH) is a technology for discovering soluble, highly expressed constructs of target proteins. CDH combines unbiased, finely sampled gene-fragment libraries, with a screening protocol that provides "holistic" readout of solubility and yield for thousands of protein fragments. CDH is free of the "passenger solubilization" and out-of-frame translational start artifacts of fusion-protein systems, and hits are ready for scale-up expression. As a proof of principle, we applied CDH to p85alpha, successfully identifying soluble and highly expressed constructs encapsulating all the known globular domains, and immediately suitable for downstream applications.
Collapse
Affiliation(s)
- Stefanie Reich
- School of Crystallography, Birkbeck College, London WC1E 7HX, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Abstract
This chapter focuses on protein engineering strategies that aim to increase the chances of obtaining crystals suitable for X-ray diffraction. The chapter is divided into three main parts: one dealing with protein engineering through a bioinformatics approach, the second focusing on DNA modifications via random mutagenesis, and the third describing a nonexhaustive number of in vitro modifications based on site-directed mutagenesis.
Collapse
Affiliation(s)
- Sonia Longhi
- Architecture et Fonction des Macromolécules Biologiques, Centre National de la Recherche Scientifique, Universités Aix-Marseille I et II, Marseille, France
| | | | | |
Collapse
|
32
|
Kang S, Kang J, Yoo SH, Park S. Recombinant preparation and characterization of interactions for a calmodulin-binding chromogranin A peptide and calmodulin. J Pept Sci 2007; 13:237-44. [PMID: 17269132 DOI: 10.1002/psc.837] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Chromogranin-derived peptides have important and varied biological activities. They affect a wide spectrum of targets such as fungal membranes, blood vessels, myocardial cells, and pancreatic cells. Despite the biological significance and the diverse activities, the molecular mechanisms of the interactions between the peptides and the target proteins have not been well understood. Here, we studied the interaction between a chromogranin A-derived peptide (CGA40-65) and its target protein, calmodulin, with NMR spectroscopy. Calmodulin was easily prepared with standard recombinant technology, but CGA40-65 posed challenges requiring multistep procedures. The recombinantly produced peptide retained the calmodulin-binding property of the full-length CGA, as shown by the HSQC binding experiment. By applying resonance assignments, we identified the residues in calmodulin involved in the CGA40-65 binding. We also found that the peak changes are close to those exhibited by the peptides having the wrap-around binding mechanism. Further analysis revealed that the CGA40-65-induced changes are more similar to those by CaMKIp peptide than those by smMLCKp peptide among the wrap-around binding peptides, suggesting that CGA40-65 can be categorized as a CaMKIp-like peptide. Our report is the first residue-resolution mechanistic study involving chromogranin peptides and their target proteins. Our approaches should be applicable to interaction studies involving other chromogranin-derived peptides and their cellular target proteins.
Collapse
Affiliation(s)
- Sunmi Kang
- Department of Biochemistry, Center for Advanced Medical Education by BK21 project, School of Medicine, Inha University, Shinheung-dong, Chung-gu, Incheon, Korea
| | | | | | | |
Collapse
|
33
|
Cornvik T, Dahlroth SL, Magnusdottir A, Flodin S, Engvall B, Lieu V, Ekberg M, Nordlund P. An efficient and generic strategy for producing soluble human proteins and domains in E. coli by screening construct libraries. Proteins 2006; 65:266-73. [PMID: 16948159 DOI: 10.1002/prot.21090] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The implementation of generic and efficient technologies for the production of recombinant eukaryotic proteins remains an outstanding challenge in structural genomics programs. We have recently developed a new method for rapid identification of soluble protein expression in E. coli, the colony filtration blot (CoFi blot). In this study, the CoFi blot was used to screen libraries where the N-terminal translation start point was randomized. To investigate the efficiency of this strategy, we have attributed a large number of proteins to this process. In a set of 32 mammalian proteins, we were able to double the success rate (from 34 to 68%) of producing soluble and readily purifiable proteins in E. coli. Most of the selected constructs had their N-termini close to predicted domain borders and the method therefore provides a mean for experimental "domain foot printing." Surprisingly, for most of the targets, we also observed expressing constructs that were close to full-length. In summary this strategy constitutes a generic and efficient method for producing mammalian proteins for structural and functional studies.
Collapse
Affiliation(s)
- Tobias Cornvik
- Department of Biochemistry and Biophysics, Stockholm University, S-106 91 Stockholm, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Jacobs SA, Podell ER, Wuttke DS, Cech TR. Soluble domains of telomerase reverse transcriptase identified by high-throughput screening. Protein Sci 2005; 14:2051-8. [PMID: 16046627 PMCID: PMC2279316 DOI: 10.1110/ps.051532105] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Telomerase is a ribonucleoprotein complex responsible for extending the ends of eukaryotic chromosomes. Structural and biophysical studies of this enzyme have been limited by the inability to produce large amounts of recombinant protein. Here we perform a high-throughput screen to map regions of the Tetrahymena thermophila TERT (Telomerase Reverse Transcriptase) protein that are overexpressed in a soluble form in Escherichia coli using a GFP-fusion system. Many of the soluble protein domains identified do not coincide with domains inferred from multiple sequence alignment, so screening for fluorescent colonies provided information not otherwise readily obtained. The method revealed an essential, independently folded N-terminal domain that was expressed and purified with high yield and found to be suitable for structural analysis. These results provide a tool for future structural and biophysical studies of TERT.
Collapse
Affiliation(s)
- Steven A Jacobs
- Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309-0215, USA
| | | | | | | |
Collapse
|
35
|
Chen M, Cortay JC, Logan IR, Sapountzi V, Robson CN, Gerlier D. Inhibition of ubiquitination and stabilization of human ubiquitin E3 ligase PIRH2 by measles virus phosphoprotein. J Virol 2005; 79:11824-36. [PMID: 16140759 PMCID: PMC1212616 DOI: 10.1128/jvi.79.18.11824-11836.2005] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Using a C-terminal domain (PCT) of the measles virus (MV) phosphoprotein (P protein) as bait in a yeast two-hybrid screen, a cDNA identical to the recently described human p53-induced-RING-H2 (hPIRH2) cDNA was isolated. A glutathione S-transferase-hPIRH2 fusion protein expressed in bacteria was able to pull down P protein when mixed with an extract from P-expressing HeLa cells in vitro, and myc-tagged hPIRH2 could be reciprocally co-immunoprecipitated with MV P protein from human cells. Additionally, immunoprecipitation experiments demonstrated that hPIRH2-myc, MV P, and nucleocapsid (N) proteins form a ternary complex. The hPIRH2 binding site was mapped to the C-terminal X domain region of the P protein by using a yeast two-hybrid assay. The PCT binding site was mapped on hPIRH2 by using a novel yeast two-hybrid tagged PCR approach and by co-immunoprecipitation of hPIRH2 cysteine mutants and mouse/human PIRH2 chimeras. The hPIRH2 C terminus could mediate the interaction with MV P which was favored by the RING-H2 motif. When coexpressed with an enhanced green fluorescent protein-tagged hPIRH2 protein, MV P alone or in a complex with MV N was able to redistribute hPIRH2 to outside the nucleus, within intracellular aggregates. Finally, MV P efficiently stabilized hPIRH2-myc expression and prevented its ubiquitination in vivo but had no effect on the stability or ubiquitination of an alternative ubiquitin E3 ligase, Mdm2. Thus, MV P protein is the first protein from a pathogen that is able to specifically interact with and stabilize the ubiquitin E3 ligase hPIRH2 by preventing its ubiquitination.
Collapse
Affiliation(s)
- Mingzhou Chen
- Immunité & Infections Virales, CNRS--Univ-Lyon 1 UMR 5537, IFR Laennec, 69372 Lyon Cedex 08, France
| | | | | | | | | | | |
Collapse
|
36
|
van den Berg S, Löfdahl PA, Härd T, Berglund H. Improved solubility of TEV protease by directed evolution. J Biotechnol 2005; 121:291-8. [PMID: 16150509 DOI: 10.1016/j.jbiotec.2005.08.006] [Citation(s) in RCA: 249] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2005] [Accepted: 08/04/2005] [Indexed: 11/28/2022]
Abstract
The efficiency and high specificity of tobacco etch virus (TEV) protease has made it widely used for cleavage of recombinant fusion proteins. However, the production of TEV protease in E. coli is hampered by low solubility. We have subjected the gene encoding TEV protease to directed evolution to improve the yield of soluble protein. Libraries of mutated genes obtained by error-prone PCR and gene shuffling were introduced into the Gateway cloning system for facilitated transfer between vectors for screening, purification, or other applications. Fluorescence based in vivo solubility screening was carried out by cloning the libraries into a plasmid encoding a C-terminal GFP fusion. Mutant genes giving rise to high GFP fluorescence intensity indicating high levels of soluble TEV-GFP were subsequently transferred to a vector providing a C-terminal histidine tag for expression, purification, and activity tests of mutated TEV. We identified a mutant, TEV(SH), in which three amino acid substitutions result in a five-fold increase in the yield of purified protease with retained activity.
Collapse
Affiliation(s)
- Susanne van den Berg
- Department of Medical Biochemistry and Biophysics, Karolinska Institute, SE 171 77 Stockholm, Sweden
| | | | | | | |
Collapse
|
37
|
Koschorreck M, Fischer M, Barth S, Pleiss J. How to find soluble proteins: a comprehensive analysis of alpha/beta hydrolases for recombinant expression in E. coli. BMC Genomics 2005; 6:49. [PMID: 15804363 PMCID: PMC1079826 DOI: 10.1186/1471-2164-6-49] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2004] [Accepted: 04/02/2005] [Indexed: 11/15/2022] Open
Abstract
Background In screening of libraries derived by expression cloning, expression of active proteins in E. coli can be limited by formation of inclusion bodies. In these cases it would be desirable to enrich gene libraries for coding sequences with soluble gene products in E. coli and thus to improve the efficiency of screening. Previously Wilkinson and Harrison showed that solubility can be predicted from amino acid composition (Biotechnology 1991, 9(5):443–448). We have applied this analysis to members of the alpha/beta hydrolase fold family to predict their solubility in E. coli. alpha/beta hydrolases are a highly diverse family with more than 1800 proteins which have been grouped into homologous families and superfamilies. Results The predicted solubility in E. coli depends on hydrolase size, phylogenetic origin of the host organism, the homologous family and the superfamily, to which the hydrolase belongs. In general small hydrolases are predicted to be more soluble than large hydrolases, and eukaryotic hydrolases are predicted to be less soluble in E. coli than prokaryotic ones. However, combining phylogenetic origin and size leads to more complex conclusions. Hydrolases from prokaryotic, fungal and metazoan origin are predicted to be most soluble if they are of small, medium and large size, respectively. We observed large variations of predicted solubility between hydrolases from different homologous families and from different taxa. Conclusion A comprehensive analysis of all alpha/beta hydrolase sequences allows more efficient screenings for new soluble alpha/beta hydrolases by the use of libraries which contain more soluble gene products. Screening of hydrolases from families whose members are hard to express as soluble proteins in E. coli should first be done in coding sequences of organisms from phylogenetic groups with the highest average of predicted solubility for proteins of this family. The tools developed here can be used to identify attractive target genes for expression using protein sequences published in databases. This analysis also directs the design of degenerate, family- specific primers to amplify new members from homologous families or superfamilies with a high probability of soluble alpha/beta hydrolases.
Collapse
Affiliation(s)
- Markus Koschorreck
- Institute of Technical Biochemistry, Allmandring 31, 70569 Stuttgart, Gemany
| | - Markus Fischer
- Institute of Technical Biochemistry, Allmandring 31, 70569 Stuttgart, Gemany
| | - Sandra Barth
- Institute of Technical Biochemistry, Allmandring 31, 70569 Stuttgart, Gemany
| | - Jürgen Pleiss
- Institute of Technical Biochemistry, Allmandring 31, 70569 Stuttgart, Gemany
| |
Collapse
|
38
|
Goda N, Tenno T, Takasu H, Hiroaki H, Shirakawa M. The PRESAT-vector: asymmetric T-vector for high-throughput screening of soluble protein domains for structural proteomics. Protein Sci 2004; 13:652-8. [PMID: 14978305 PMCID: PMC2286733 DOI: 10.1110/ps.03439004] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
A rapid unidirectional method for cloning PCR-amplified cDNA fragments into virtually any fusion protein expression vector is described. The method, termed PRESAT-vector cloning, is based on a T-vector technique that does not require restriction endonuclease digestion of the PCR product. Subsequently, we applied a novel ORF selection method of the ligated plasmid products. This second step involves restriction endonuclease treatment that eliminates the plasmids containing an ORF in the wrong orientation prior to transformation into the bacterial host for further protein expression studies. To achieve this selection, we customized the 5'-sequence of the "rear" PCR primer corresponding to the C terminus of the protein to be expressed. The colonies harbored only the ligated products of the desired orientation at >90% efficiency. This method is applied to a GST fusion expression system, and an HTS system for soluble proteins from an expression library was tested.
Collapse
Affiliation(s)
- Natsuko Goda
- Graduate School of Integrated Science, Yokohama City University, 1-7-29 Suehirocho, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | |
Collapse
|
39
|
Yao Y, Ren J, Jones IM. Amino terminal interaction in the prion protein identified using fusion to green fluorescent protein. J Neurochem 2003; 87:1057-65. [PMID: 14622086 DOI: 10.1046/j.1471-4159.2003.02039.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
In contrast to the well-characterized carboxyl domain, the amino terminal half of the mature cellular prion protein has no defined structure. Here, following fusion of mouse prion protein fragments to green fluorescence protein as a reporter of protein stability, we report extreme variability in fluorescence level that is dependent on the prion fragment expressed. In particular, exposure of the extreme amino terminus in the context of a truncated prion protein molecule led to rapid degradation, whereas the loss of only six amino terminal residues rescued high level fluorescence. Study of the precise endpoints and residue identity associated with high fluorescence suggested a domain within the amino terminal half of the molecule defined by a long-range intramolecular interaction between 23KKRPKP28 and 143DWED146 and dependent upon the anti-parallel beta-sheet ending at residue 169 and normally associated with the structurally defined carboxyl terminal domain. This previously unreported interaction may be significant for understanding prion bioactivity and for structural studies aimed at the complete prion structure.
Collapse
Affiliation(s)
- Yongxiu Yao
- School of Animal and Microbial Sciences, The University of Reading, Reading RG6 6AJ, UK
| | | | | |
Collapse
|
40
|
Abstract
Overexpressed proteins are often insoluble, and can be recalcitrant to conventional solubilization techniques such as refolding. Directed evolution methods, in which protein diversity libraries are screened for soluble variants, offer an alternative route to obtaining soluble proteins. Recently, several new protein solubility screens have been developed that do not require structural or functional information about the target protein. Soluble protein can be detected in vivo and in vitro by fusion reporter tags. Protein misfolding can be measured in vivo using the bacterial response to protein misfolding. Finally, soluble protein can be monitored by immunological detection. Efficient, well-established strategies for generating and recombining genetic diversity, driven by new screening and selection methods, can furnish correctly folded, soluble protein.
Collapse
Affiliation(s)
- Geoffrey S Waldo
- BN-2, MS M888, Los Alamos National Laboratories, Los Alamos, NM 87545, USA
| |
Collapse
|
41
|
Abstract
One of the key steps of structural genomics and proteomics is high-throughput expression of many target proteins. Gene cloning, especially by ligation-independent cloning techniques, and recombinant protein expression using microbial hosts such as Escherichia coli and the yeast Pichia pastoris are well optimized and further robotized. Cell-free protein synthesis systems have been developed for large-scale production of protein samples for NMR (stable-isotope labeling) and X-ray crystallography (selenomethionine substitution). Protein folding is still a major bottleneck in protein expression. Cell-based and cell-free methods for screening of suitable samples for structure determination have been developed for achieving a high success rate.
Collapse
Affiliation(s)
- Shigeyuki Yokoyama
- RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan.
| |
Collapse
|