1
Rahmatabadi SS, Mobini K, Askari S, Najafian J, Karami K, Soleymani B, Mostafaie A. In silico characterization of fructosyl peptide oxidase properties from Eupenicillium terrenum. J Mol Recognit 2022; 35:e2980. [PMID: 35657361 DOI: 10.1002/jmr.2980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/06/2022] [Revised: 05/23/2022] [Accepted: 06/01/2022] [Indexed: 12/24/2022]
Abstract
Fructosyl peptide oxidase (FPOX) from Eupenicillium terrenum has high potential as a diagnostic enzyme. The aim of the present study was to characterize FPOX from E. terrenum using different bioinformatics tools. The RNA and protein secondary structures of FPOX, its solubility profile in Escherichia coli, and its stability, domains, and functional properties were predicted computationally. Six motifs were detected in the FPOX protein. The most important was the D-amino acid oxidase motif, a FAD-dependent oxidoreductase motif. Cysteines 97, 154, 234, 280, and 360 scored below -10, indicating a low probability of participating in disulfide (S-S) bond formation. Of the FPOX amino acids, 56.52% are nonpolar. Random coils are dominant in the FPOX sequence, followed by alpha-helices and extended strands. The fpox gene is capable of generating a stable RNA secondary structure (-423.90 kcal/mol) in E. coli. FPOX has a large number of hydrophobic amino acids. FPOX showed low solubility in E. coli and has several aggregation-prone sites in its 3-D structure. According to the scores, the best mutation candidate for increasing solubility was the conversion of methionine 302 to arginine. The melting temperature of FPOX predicted from its amino acid sequence was 55°C to 65°C. The predicted thermodynamic parameters for the FPOX enzyme were -137.4 kcal/mol, -3.59 kcal/(mol K), and -6.8 kcal/mol for standard folding enthalpy, heat capacity, and folding free energy, respectively. In conclusion, the in silico study of proteins can provide a valuable method for better understanding protein properties and functions for applied use.
Affiliation(s)
- Keivan Mobini
- Department of Hematology, Faculty of Allied Medical Science, Bushehr University of Medical Sciences, Bushehr, Iran
- Soudabeh Askari
- Department of Biotechnology, Applied Razi Biotechnology, Kermanshah, Iran
- Javad Najafian
- Department of Biology, Faculty of Basic Science, University of Mazandaran, Baboulsar, Iran
- Keyvan Karami
- Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
- Bijan Soleymani
- Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
- Ali Mostafaie
- Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
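The abstract for this entry reports that 56.52% of FPOX residues are nonpolar. As a rough illustration of how such a composition figure can be derived from a sequence, here is a minimal Python sketch. The residue grouping `NONPOLAR`, the function name `nonpolar_percent`, and the short example peptide are assumptions made for illustration only: the tool used in the paper may classify residues differently, and the peptide is not the FPOX sequence.

```python
# Assumption: "nonpolar" means these nine residues. Other classifications
# treat glycine, proline, or cysteine differently.
NONPOLAR = frozenset("AVLIMFWPG")
STANDARD = frozenset("ACDEFGHIKLMNPQRSTVWY")


def nonpolar_percent(sequence: str) -> float:
    """Return the percentage of residues in `sequence` that are nonpolar."""
    residues = [aa for aa in sequence.upper() if not aa.isspace()]
    if not residues:
        raise ValueError("sequence is empty")
    unknown = set(residues) - STANDARD
    if unknown:
        raise ValueError(f"unexpected residue codes: {sorted(unknown)}")
    hits = sum(1 for aa in residues if aa in NONPOLAR)
    return 100.0 * hits / len(residues)


# Hypothetical 10-residue peptide (6 of 10 residues fall in NONPOLAR).
example = "MAVLKDESGF"
print(f"{nonpolar_percent(example):.2f}% nonpolar")
```

Because the result depends entirely on which residues are counted as nonpolar, a figure such as 56.52% is only comparable across studies that use the same grouping.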
2
Boone M, Ramasamy P, Zuallaert J, Bouwmeester R, Van Moer B, Maddelein D, Turan D, Hulstaert N, Eeckhaut H, Vandermarliere E, Martens L, Degroeve S, De Neve W, Vranken W, Callewaert N. Massively parallel interrogation of protein fragment secretability using SECRiFY reveals features influencing secretory system transit. Nat Commun 2021; 12:6414. [PMID: 34741024 PMCID: PMC8571348 DOI: 10.1038/s41467-021-26720-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/25/2020] [Accepted: 10/15/2021] [Indexed: 11/09/2022] [Open Access]
Abstract
While transcriptome- and proteome-wide technologies to assess processes in protein biogenesis are now widely available, we still lack global approaches to assay post-ribosomal biogenesis events, in particular those occurring in the eukaryotic secretory system. We here develop a method, SECRiFY, to simultaneously assess the secretability of >10^5 protein fragments by two yeast species, S. cerevisiae and P. pastoris, using custom fragment libraries, surface display and a sequencing-based readout. Screening human proteome fragments with a median size of 50-100 amino acids, we generate datasets that enable data mining into protein features underlying secretability, revealing a striking role for intrinsic disorder and chain flexibility. The SECRiFY methodology generates sufficient amounts of annotated data for advanced machine learning methods to deduce secretability patterns. The finding that secretability is indeed a learnable feature of protein sequences provides a solid base for application-focused studies.
Affiliation(s)
- Morgane Boone
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium; Department of Biochemistry and Biophysics, UCSF, San Francisco, CA, USA
- Pathmanaban Ramasamy
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium; Structural Biology Brussels, VUB, Brussels, Belgium; Structural Biology Research Center, VIB, Brussels, Belgium; Interuniversity Institute of Bioinformatics in Brussels (IB)2, ULB-VUB, Brussels, Belgium
- Jasper Zuallaert
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium; Center for Biotech Data Science, Ghent University Global Campus, Songdo, Incheon, South Korea; IDLab, ELIS, UGent, Ghent, Belgium
- Robbin Bouwmeester
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Berre Van Moer
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium
- Davy Maddelein
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Demet Turan
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Niels Hulstaert
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Hannah Eeckhaut
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium
- Elien Vandermarliere
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Lennart Martens
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Sven Degroeve
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
- Wesley De Neve
- Center for Biotech Data Science, Ghent University Global Campus, Songdo, Incheon, South Korea; IDLab, ELIS, UGent, Ghent, Belgium
- Wim Vranken
- Structural Biology Brussels, VUB, Brussels, Belgium; Structural Biology Research Center, VIB, Brussels, Belgium; Interuniversity Institute of Bioinformatics in Brussels (IB)2, ULB-VUB, Brussels, Belgium
- Nico Callewaert
- Center for Medical Biotechnology, VIB, Zwijnaarde, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium
3
Syu GD, Dunn J, Zhu H. Developments and Applications of Functional Protein Microarrays. Mol Cell Proteomics 2020; 19:916-927. [PMID: 32303587 PMCID: PMC7261817 DOI: 10.1074/mcp.r120.001936] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Received: 01/10/2020] [Revised: 03/24/2020] [Indexed: 12/19/2022] [Open Access]
Abstract
Protein microarrays are crucial tools in the study of proteins in an unbiased, high-throughput manner, as they allow for characterization of up to thousands of individually purified proteins in parallel. The adaptability of this technology has enabled its use in a wide variety of applications, including the study of proteome-wide molecular interactions, analysis of post-translational modifications, identification of novel drug targets, and examination of pathogen-host interactions. In addition, the technology has been shown to be useful in profiling antibody specificity, as well as in the discovery of novel biomarkers, especially for autoimmune diseases and cancers. In this review, we will summarize the developments that have been made in protein microarray technology in both basic and translational research over the past decade. We will also introduce a novel membrane protein array, the GPCR-VirD array, and discuss the future directions of functional protein microarrays.
Affiliation(s)
- Guan-Da Syu
- Department of Biotechnology and Bioindustry Sciences, National Cheng Kung University, Tainan 701, Taiwan, R.O.C.
- Jessica Dunn
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
- Heng Zhu
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205; Center for High-Throughput Biology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205; Viral Oncology Program, Department of Oncology, The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, Maryland 21231
4
Hot CoFi Blot: A High-Throughput Colony-Based Screen for Identifying More Thermally Stable Protein Variants. Methods Mol Biol 2019. [PMID: 31267459 DOI: 10.1007/978-1-4939-9624-7_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]
Abstract
Highly soluble and stable proteins are desirable for many different applications, from basic science to reaching a cancer patient in the form of a biological drug. For X-ray crystallography, where production of a protein crystal might take weeks or even months, a stable protein sample of high purity and concentration can greatly increase the chances of producing a well-diffracting crystal. For a patient receiving a specific protein drug, its safety, efficacy, and even cost are factors affected by its solubility and stability. Increased protein expression and protein stability can be achieved by randomly altering the coding sequence. As the number of mutants generated might be overwhelming, a powerful protein expression and stability screen is required. In this chapter, we describe a colony filtration technology, the Hot CoFi blot, which allows us to screen random mutagenesis libraries for increased thermal stability. We share how to create the random mutagenesis library, how to perform the Hot CoFi blot, and how to identify more thermally stable clones. We use the Tobacco Etch Virus protease as a target to exemplify the procedure.
5
Sharafi E, Farmani J, Parizi AP, Dehestani A. In Search of Engineered Prokaryotic Chlorophyllases: A Bioinformatics Approach. BIOTECHNOL BIOPROC E 2018. [DOI: 10.1007/s12257-018-0143-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 10/27/2022]
6
Yang Y, Liu G, Liu M, Bai Z, Liu X, Dai X, Guo W. Correlation Between Protein Primary Structure and Soluble Expression Level of HSA dAb in Escherichia coli. Food Technol Biotechnol 2018; 56:101-109. [PMID: 29796003 DOI: 10.17113/ftb.56.01.18.5445] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 11/12/2022] [Open Access]
Abstract
It is widely accepted that features such as pI, length, molecular mass and amino acid (AA) sequence have a significant influence on protein solubility. Here, we mainly focused on AA composition and explored those that most affected the soluble expression level of human serum albumin (HSA) domain antibody (dAb). The soluble expression and sequence of 65 dAb variants were analysed using clustering and linear modelling. Certain AAs significantly affected the soluble expression level of dAb, with the specific AA combinations being (S, R, N, D, Q), (G, R, C, N, S) and (R, S, G); these combinations respectively affected the dAb expression level in the broth supernatant, the level in the pellet lysate and total soluble dAb. Among the 20 AAs, R displayed a negative influence on the soluble expression level, whereas G and S showed positive effects. A linear model was built to predict the soluble expression level from the sequence; this model had a prediction accuracy of 80%. In summary, increasing the content of polar AAs, especially G and S, and decreasing the content of R, was helpful to improve the soluble expression level of HSA dAb.
Affiliation(s)
- Yankun Yang
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, School of Biotechnology, Jiangnan University, Ministry of Education, 1800 Lihu Avenue, 214122 Wuxi, PR China; National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Guoqiang Liu
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, School of Biotechnology, Jiangnan University, Ministry of Education, 1800 Lihu Avenue, 214122 Wuxi, PR China; National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Meng Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Zhonghu Bai
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China; Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Xiuxia Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China; Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Xiaofeng Dai
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China; Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
- Wenwen Guo
- Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China; The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
7
Rational identification of aggregation hotspots based on secondary structure and amino acid hydrophobicity. Sci Rep 2017; 7:9558. [PMID: 28842596 PMCID: PMC5573320 DOI: 10.1038/s41598-017-09749-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Received: 09/30/2016] [Accepted: 07/28/2017] [Indexed: 11/12/2022] [Open Access]
Abstract
Insolubility of proteins expressed in the Escherichia coli expression system hinders the progress of both basic and applied research. Insoluble proteins contain residues that decrease their solubility (aggregation hotspots). Mutating these hotspots to optimal amino acids is expected to improve protein solubility. To date, however, the identification of these hotspots has proven difficult. In this study, using a combination of approaches involving directed evolution and primary sequence analysis, we found two rules to help inductively identify hotspots: the α-helix rule, which focuses on the hydrophobicity of amino acids in the α-helix structure, and the hydropathy contradiction rule, which focuses on the difference in hydrophobicity relative to the corresponding amino acid in the consensus protein. By properly applying these two rules, we succeeded in improving the probability that expressed proteins would be soluble. Our methods should facilitate research on various insoluble proteins that were previously difficult to study due to their low solubility.
8
Bonacci S, Buccato S, Maione D, Petracca R. Successful completion of a semi-automated enzyme-free cloning method. J Struct Funct Genomics 2016; 17:57-66. [PMID: 27507291 DOI: 10.1007/s10969-016-9207-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 02/19/2016] [Accepted: 08/02/2016] [Indexed: 12/13/2022]
Abstract
In scientific fields such as structural biology and vaccinology, there is an increasing need for fast, effective and reproducible gene cloning and expression processes. Consequently, the implementation of robotic platforms enabling the automation of protocols is becoming a pressing demand. The main goal of our study was to set up a robotic platform devoted to the high-throughput automation of the polymerase incomplete primer extension cloning method, and to evaluate its efficiency compared to that achieved manually, by selecting a set of bacterial genes that were processed either in the automated platform (330) or manually (94). Here we show that we successfully set up a platform able to complete, with high efficiency, a wide range of molecular biology and biochemical steps. A total of 329 gene targets (99 %) were effectively amplified using the automated procedure and 286 (87 %) of these PCR products were successfully cloned in expression vectors, with cloning success rates being higher for the automated protocols than for the manual procedure (93.6 and 74.5 %, respectively).
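The success rates quoted in this abstract can be re-derived from the counts it gives. The short Python sketch below does that arithmetic; the helper name `percent` is an assumption for illustration, and only counts stated in the abstract (330 targets, 329 amplified, 286 cloned) are used. The counts behind the 93.6 % and 74.5 % figures are not given in the abstract, so they are not recomputed here.

```python
def percent(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


# Counts taken from the abstract for the automated procedure.
targets = 330      # genes processed on the automated platform
amplified = 329    # targets effectively amplified
cloned = 286       # PCR products cloned into expression vectors

amplification_rate = percent(amplified, targets)
cloning_rate = percent(cloned, amplified)

print(f"amplified: {amplification_rate:.1f} % of targets")
print(f"cloned:    {cloning_rate:.1f} % of amplified products")
```

The amplification rate works out slightly above the 99 % quoted, which suggests the abstract truncates rather than rounds; the cloning rate agrees with the quoted 87 % after rounding.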
9
Cairns TC, Studholme DJ, Talbot NJ, Haynes K. New and Improved Techniques for the Study of Pathogenic Fungi. Trends Microbiol 2015; 24:35-50. [PMID: 26549580 DOI: 10.1016/j.tim.2015.09.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Received: 08/22/2015] [Revised: 09/29/2015] [Accepted: 09/30/2015] [Indexed: 02/05/2023]
Abstract
Fungal pathogens pose serious threats to human, plant, and ecosystem health. Improved diagnostics and antifungal strategies are therefore urgently required. Here, we review recent developments in online bioinformatic tools and associated interactive data archives, which enable sophisticated comparative genomics and functional analysis of fungal pathogens in silico. Additionally, we highlight cutting-edge experimental techniques, including conditional expression systems, recyclable markers, RNA interference, genome editing, compound screens, infection models, and robotic automation, which are promising to revolutionize the study of both human and plant pathogenic fungi. These novel techniques will allow vital knowledge gaps to be addressed with regard to the evolution of virulence, host-pathogen interactions and antifungal drug therapies in both the clinic and agriculture. This, in turn, will enable delivery of improved diagnosis and durable disease-control strategies.
Affiliation(s)
- Timothy C Cairns
- Institut für Biotechnologie, Technische Universität Berlin, Gustav-Meyer Allee 22, Berlin, Germany
- Ken Haynes
- Biosciences, University of Exeter, Stocker Road, Exeter EX4 4QD, UK
10
Habibi N, Norouzi A, Mohd Hashim SZ, Shamsir MS, Samian R. Prediction of recombinant protein overexpression in Escherichia coli using a machine learning based model (RPOLP). Comput Biol Med 2015; 66:330-6. [DOI: 10.1016/j.compbiomed.2015.09.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 05/02/2015] [Revised: 09/18/2015] [Accepted: 09/19/2015] [Indexed: 01/28/2023]
11
Kurotani A, Yamada Y, Shinozaki K, Kuroda Y, Sakurai T. Plant-PrAS: a database of physicochemical and structural properties and novel functional regions in plant proteomes. PLANT & CELL PHYSIOLOGY 2015; 56:e11. [PMID: 25435546 PMCID: PMC4301743 DOI: 10.1093/pcp/pcu176] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Received: 08/28/2014] [Accepted: 10/31/2014] [Indexed: 05/21/2023]
Abstract
Arabidopsis thaliana is an important model species for studies of plant gene functions. Research on Arabidopsis has resulted in the generation of high-quality genome sequences, annotations and related post-genomic studies. The amount of annotation, such as gene-coding regions and structures, is steadily growing in the field of plant research. In contrast to the genomics resource of animals and microorganisms, there are still some difficulties with characterization of some gene functions in plant genomics studies. The acquisition of information on protein structure can help elucidate the corresponding gene function because proteins encoded in the genome possess highly specific structures and functions. In this study, we calculated multiple physicochemical and secondary structural parameters of protein sequences, including length, hydrophobicity, the amount of secondary structure, the number of intrinsically disordered regions (IDRs) and the predicted presence of transmembrane helices and signal peptides, using a total of 208,333 protein sequences from the genomes of six representative plant species, Arabidopsis thaliana, Glycine max (soybean), Populus trichocarpa (poplar), Oryza sativa (rice), Physcomitrella patens (moss) and Cyanidioschyzon merolae (alga). Using the PASS tool and the Rosetta Stone method, we annotated the presence of novel functional regions in 1,732 protein sequences that included unannotated sequences from the Arabidopsis and rice proteomes. These results were organized into the Plant Protein Annotation Suite database (Plant-PrAS), which can be freely accessed online at http://plant-pras.riken.jp/.
Affiliation(s)
- Atsushi Kurotani
- RIKEN Center for Sustainable Resource Science, Yokohama, Kanagawa, 230-0045 Japan; Department of Biotechnology and Life Sciences, Faculty of Technology, Tokyo University of Agriculture and Technology, Koganei, Tokyo, 184-8588 Japan
- Yutaka Yamada
- RIKEN Center for Sustainable Resource Science, Yokohama, Kanagawa, 230-0045 Japan
- Kazuo Shinozaki
- RIKEN Center for Sustainable Resource Science, Yokohama, Kanagawa, 230-0045 Japan
- Yutaka Kuroda
- Department of Biotechnology and Life Sciences, Faculty of Technology, Tokyo University of Agriculture and Technology, Koganei, Tokyo, 184-8588 Japan
- Tetsuya Sakurai
- RIKEN Center for Sustainable Resource Science, Yokohama, Kanagawa, 230-0045 Japan
12
Abstract
The expression and screening of the solubility of recombinant proteins is an important step in the high-throughput (HT) production of target proteins. For many applications, E. coli remains the most widely used expression system due to the relative ease of adapting it to HT pipelines. Herein is described a platform using a 96-well format for efficient expression and solubility screening of target proteins.
Affiliation(s)
- Keehwan Kwon
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
13
Prediction of soluble heterologous protein expression levels in Escherichia coli from sequence-based features and its potential in biopharmaceutical process development. Pharmaceutical Bioprocessing 2014. [DOI: 10.4155/pbp.14.23] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/17/2022]
14
A review of machine learning methods to predict the solubility of overexpressed recombinant proteins in Escherichia coli. BMC Bioinformatics 2014; 15:134. [PMID: 24885721 PMCID: PMC4098780 DOI: 10.1186/1471-2105-15-134] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Received: 09/04/2013] [Accepted: 03/25/2014] [Indexed: 12/14/2022] [Open Access]
Abstract
Background: Over the last 20 years in biotechnology, the production of recombinant proteins has been a crucial bioprocess in both the biopharmaceutical and research arenas in terms of human health, scientific impact and economic volume. Although logical strategies of genetic engineering have been established, protein overexpression is still an art. In particular, heterologous expression is often hindered by low production levels and frequent failure for opaque reasons. The problem is accentuated because there is no generic solution available to enhance heterologous overexpression. For a given protein, the extent of its solubility can indicate the quality of its function. Over 30% of synthesized proteins are not soluble. Under given experimental conditions, including temperature, expression host, etc., protein solubility is a feature ultimately defined by its sequence. To date, numerous machine learning methods have been proposed to predict the solubility of a protein merely from its amino acid sequence. In spite of 20 years of research on the matter, no comprehensive review is available on the published methods. Results: This paper presents an extensive review of the existing models to predict protein solubility in the Escherichia coli recombinant protein overexpression system. The models are investigated and compared regarding the datasets used, features, feature selection methods, machine learning techniques and accuracy of prediction. A discussion on the models is provided at the end. Conclusions: This study aims to investigate extensively the machine learning based methods to predict recombinant protein solubility, so as to offer a general as well as a detailed understanding for researchers in the field. Some of the models present acceptable prediction performances and convenient user interfaces. These models can be considered as valuable tools to predict recombinant protein overexpression results before performing real laboratory experiments, thus saving labour, time and cost.
15
16
Paik YK, Jeong SK, Lee EY, Jeong PY, Shim YH. C. elegans: an invaluable model organism for the proteomics studies of the cholesterol-mediated signaling pathway. Expert Rev Proteomics 2014; 3:439-53. [PMID: 16901202 DOI: 10.1586/14789450.3.4.439] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Indexed: 11/08/2022]
Abstract
With the availability of its complete genome sequence and unique biological features relevant to human disease, Caenorhabditis elegans has become an invaluable model organism for proteomics studies, leading to the elucidation of nematode gene function. A journey from the genome to the proteome of C. elegans may begin with preparation of expressed proteins, which enables a large-scale analysis of all possible proteins expressed under specific physiological conditions. Although various techniques have been used for proteomic analysis of C. elegans, systematic high-throughput analysis is still to come in order to accommodate studies of post-translational modification and quantitative analysis. Given that no integrated C. elegans protein expression database is available, it is about time that a global C. elegans proteome project is launched through which datasets of transcriptomes, protein-protein interaction and functional annotation can be integrated. As an initial target of a pilot project of the C. elegans proteome project, the cholesterol-mediated signaling pathway will be an excellent example since, as in other organisms, it is one of the key controlling pathways in cell growth and development in C. elegans. As this field tends to broaden to functional proteomics, there is a high demand to develop versatile proteome informatics tools that can manage many different data types in an integrative manner.
Affiliation(s)
- Young-Ki Paik
- Yonsei University, Department of Biochemistry, 134 Shinchon-dong, Sudamoon-Ku, Seoul, 120-749, Korea
17
Hirose S, Noguchi T. ESPRESSO: a system for estimating protein expression and solubility in protein expression systems. Proteomics 2013; 13:1444-56. [PMID: 23436767 DOI: 10.1002/pmic.201200175] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Received: 04/29/2012] [Revised: 01/27/2013] [Accepted: 02/06/2013] [Indexed: 11/11/2022]
Abstract
Recombinant protein technology is essential for conducting protein science and using proteins as materials in pharmaceutical or industrial applications. Although obtaining soluble proteins is still a major experimental obstacle, knowledge about protein expression/solubility under standard conditions may increase the efficiency and reduce the cost of proteomics studies. In this study, we present a computational approach to estimate the probability of protein expression and solubility for two different protein expression systems: in vivo Escherichia coli and wheat germ cell-free, from only the sequence information. It implements two kinds of methods: a sequence/predicted structural property-based method that uses both the sequence and predicted structural features, and a sequence pattern-based method that utilizes the occurrence frequencies of sequence patterns. In the benchmark test, the proposed methods obtained F-scores of around 70%, and outperformed publicly available servers. Applying the proposed methods to genomic data revealed that proteins associated with translation or transcription have a strong tendency to be expressed as soluble proteins by the in vivo E. coli expression system. The sequence pattern-based method also has the potential to indicate a candidate region for modification, to increase protein solubility. All methods are available for free at the ESPRESSO server (http://mbs.cbrc.jp/ESPRESSO).
Affiliation(s)
- Shuichi Hirose
- Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan.
18
Huang HC, Yao LL, Song ZM, Li XP, Hua QQ, Li Q, Pan CW, Xia CM. Development-specific differences in the proteomics of Angiostrongylus cantonensis. PLoS One 2013; 8:e76982. [PMID: 24204717 PMCID: PMC3808366 DOI: 10.1371/journal.pone.0076982] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Received: 06/14/2013] [Accepted: 08/27/2013] [Indexed: 11/18/2022] Open
Abstract
Angiostrongyliasis is an emerging communicable disease. Several different hosts are required to complete the life cycle of Angiostrongylus cantonensis. However, we lack a complete understanding of variability of proteins across different developmental stages and their contribution to parasite survival and progression. In this study, we extracted soluble proteins from various stages of the A. cantonensis life cycle [female adults, male adults, the fifth-stage female larvae (FL5), the fifth-stage male larvae (ML5) and third-stage larvae (L3)], separated those proteins using two-dimensional difference gel electrophoresis (2D-DIGE) at pH 4-7, and analyzed the gel images using DeCyder 7.0 software. This proteomic analysis produced a total of 183 different dominant protein spots. Thirty-seven protein spots were found to have high confidence scores (>95%) by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS). Comparative proteomic analyses revealed that 29 spots represented cytoskeleton-associated proteins and functional proteins. Eight spots were unnamed proteins. Twelve protein spots that were matched to the EST of different-stage larvae of A. cantonensis were identified. Two genes and the internal control 18s were chosen for quantitative real-time PCR (qPCR) and the qPCR results were consistent with those of the DIGE studies. These findings will provide a new basis for understanding the characteristics of growth and development of A. cantonensis and the host-parasite relationship. They may also assist searches for candidate proteins suitable for use in diagnostic assays and as drug targets for the control of eosinophilic meningitis caused by A. cantonensis.
Affiliation(s)
- Hui-Cong Huang
- Department of Parasitology, Medical College of Soochow University, Suzhou, Jiangsu, P. R. China
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Li-Li Yao
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Zeng-Mei Song
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Xing-Pan Li
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Qian-Qian Hua
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Qiang Li
- Department of Laboratory Diagnosis, The Third Affiliated Hospital of Wenzhou Medical University, Ruian, Zhejiang, P. R. China
- Chang-Wang Pan
- Department of Parasitology, School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, P. R. China
- Chao-Ming Xia
- Department of Parasitology, Medical College of Soochow University, Suzhou, Jiangsu, P. R. China
19
Singh GP, Dash D. Electrostatic mis-interactions cause overexpression toxicity of proteins in E. coli. PLoS One 2013; 8:e64893. [PMID: 23734225 PMCID: PMC3667126 DOI: 10.1371/journal.pone.0064893] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 02/23/2013] [Accepted: 04/19/2013] [Indexed: 01/28/2023] Open
Abstract
A majority of E. coli proteins when overexpressed inhibit its growth, but the reasons behind overexpression toxicity of proteins remain unknown. Understanding the mechanism of overexpression toxicity is important from evolutionary, biotechnological and possibly clinical perspectives. Here we study sequence and functional features of cytosolic proteins of E. coli associated with overexpression toxicity to understand its mechanism. We find that the number of positively charged residues is significantly higher in proteins showing overexpression toxicity. Very long proteins also show high overexpression toxicity. Among the functional classes, transcription factors and regulatory proteins are enriched in toxic proteins, while catalytic proteins are depleted. Overexpression toxicity could be predicted with reasonable accuracy using these few properties. The importance of charged residues in overexpression toxicity indicates that nonspecific electrostatic interactions resulting from protein overexpression cause toxicity of these proteins and suggests ways to improve the expression level of native and foreign proteins in E. coli for basic research and biotechnology. These results might also be applicable to other bacterial species.
Affiliation(s)
- Gajinder Pal Singh
- G. N. Ramachandran Knowledge Center for Genome Informatics, Institute of Genomics and Integrative Biology (Council of Scientific and Industrial Research), Delhi, India.
20
Chaperone-interacting TPR proteins in Caenorhabditis elegans. J Mol Biol 2013; 425:2922-39. [PMID: 23727266 DOI: 10.1016/j.jmb.2013.05.019] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Received: 11/22/2012] [Revised: 04/30/2013] [Accepted: 05/22/2013] [Indexed: 11/21/2022]
Abstract
The ATP-hydrolyzing molecular chaperones Hsc70/Hsp70 and Hsp90 bind a diverse set of tetratricopeptide repeat (TPR)-containing cofactors via their C-terminal peptide motifs IEEVD and MEEVD. These cochaperones contribute to substrate turnover and confer specific activities to the chaperones. Higher eukaryotic genomes encode a large number of TPR-domain-containing proteins. The human proteome contains more than 200 TPR proteins, and that of Caenorhabditis elegans, about 80. It is unknown how many of them interact with Hsc70 or Hsp90. We systematically screened the C. elegans proteome for TPR-domain-containing proteins that likely interact with Hsc70 and Hsp90 and ranked them due to their similarity with known chaperone-interacting TPRs. We find C. elegans to encode many TPR proteins, which are not present in yeast. All of these have homologs in fruit fly or humans. Highly ranking uncharacterized open reading frames C33H5.8, C34B2.5 and ZK370.8 may encode weakly conserved homologs of the human proteins RPAP3, TTC1 and TOM70. C34B2.5 and ZK370.8 bind both Hsc70 and Hsp90 with low micromolar affinities. Mutation of amino acids involved in EEVD binding disrupts the interaction. In vivo, ZK370.8 is localized to mitochondria in tissues with known chaperone requirements, while C34B2.5 colocalizes with Hsc70 in intestinal cells. The highest-ranking open reading frame with non-conserved EEVD-interacting residues, F52H3.5, did not show any binding to Hsc70 or Hsp90, suggesting that only about 15 of the TPR-domain-containing proteins in C. elegans interact with chaperones, while the many others may have evolved to bind other ligands.
21
Current state and recent advances in biopharmaceutical production in Escherichia coli, yeasts and mammalian cells. J Ind Microbiol Biotechnol 2013; 40:257-74. [PMID: 23385853 DOI: 10.1007/s10295-013-1235-0] [Citation(s) in RCA: 139] [Impact Index Per Article: 12.6] [Received: 11/05/2012] [Accepted: 01/22/2013] [Indexed: 12/28/2022]
Abstract
Almost all of the 200 or so approved biopharmaceuticals have been produced in one of three host systems: the bacterium Escherichia coli, yeasts (Saccharomyces cerevisiae, Pichia pastoris) and mammalian cells. We describe the most widely used methods for the expression of recombinant proteins in the cytoplasm or periplasm of E. coli, as well as strategies for secreting the product to the growth medium. Recombinant expression in E. coli influences the cell physiology and triggers a stress response, which has to be considered in process development. Increased expression of a functional protein can be achieved by optimizing the gene, plasmid, host cell, and fermentation process. Relevant properties of two yeast expression systems, S. cerevisiae and P. pastoris, are summarized. Optimization of expression in S. cerevisiae has focused mainly on increasing the secretion, which is otherwise limiting. P. pastoris was recently approved as a host for biopharmaceutical production for the first time. It enables high-level protein production and secretion. Additionally, genetic engineering has resulted in its ability to produce recombinant proteins with humanized glycosylation patterns. Several mammalian cell lines of either rodent or human origin are also used in biopharmaceutical production. Optimization of their expression has focused on clonal selection, interference with epigenetic factors and genetic engineering. Systemic optimization approaches are applied to all cell expression systems. They feature parallel high-throughput techniques, such as DNA microarray, next-generation sequencing and proteomics, and enable simultaneous monitoring of multiple parameters. Systemic approaches, together with technological advances such as disposable bioreactors and microbioreactors, are expected to lead to increased quality and quantity of biopharmaceuticals, as well as to reduced product development times.
22
Santner AA, Croy CH, Vasanwala FH, Uversky VN, Van YYJ, Dunker AK. Sweeping away protein aggregation with entropic bristles: intrinsically disordered protein fusions enhance soluble expression. Biochemistry 2012; 51:7250-62. [PMID: 22924672 DOI: 10.1021/bi300653m] [Citation(s) in RCA: 88] [Impact Index Per Article: 7.3] [Indexed: 11/30/2022]
Abstract
Intrinsically disordered, highly charged protein sequences act as entropic bristles (EBs), which, when translationally fused to partner proteins, serve as effective solubilizers by creating both a large favorable surface area for water interactions and large excluded volumes around the partner. By extending away from the partner and sweeping out large molecules, EBs can allow the target protein to fold free from interference. Using both naturally occurring and artificial polypeptides, we demonstrate the successful implementation of intrinsically disordered fusions as protein solubilizers. The artificial fusions discussed herein have a low level of sequence complexity and a high net charge but are diversified by means of distinctive amino acid compositions and lengths. Using 6xHis fusions as controls, soluble protein expression enhancements from 65% (EB60A) to 100% (EB250) were observed for a 20-protein portfolio. Additionally, these EBs were able to more effectively solubilize targets compared to frequently used fusions such as maltose-binding protein, glutathione S-transferase, thioredoxin, and N utilization substance A. Finally, although these EBs possess very distinct physiochemical properties, they did not perturb the structure, conformational stability, or function of the green fluorescent protein or the glutathione S-transferase protein. This work thus illustrates the successful de novo design of intrinsically disordered fusions and presents a promising technology and complementary resource for researchers attempting to solubilize recalcitrant proteins.
Affiliation(s)
- Aaron A Santner
- Molecular Kinetics Inc., Indianapolis, Indiana 46268, United States
23
Tales on the Road to High-Throughput. Biotechniques 2012; 53:27-31. [DOI: 10.2144/000113892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022] Open
24
Smialowski P, Doose G, Torkler P, Kaufmann S, Frishman D. PROSO II--a new method for protein solubility prediction. FEBS J 2012; 279:2192-200. [PMID: 22536855 DOI: 10.1111/j.1742-4658.2012.08603.x] [Citation(s) in RCA: 129] [Impact Index Per Article: 10.8] [Indexed: 11/29/2022]
Abstract
Many fields of science and industry depend on efficient production of active protein using heterologous expression in Escherichia coli. The solubility of proteins upon expression is dependent on their amino acid sequence. Prediction of solubility from sequence is therefore highly valuable. We present a novel machine-learning-based model called PROSO II which makes use of new classification methods and growth in experimental data to improve coverage and accuracy of solubility predictions. The classification algorithm is organized as a two-layered structure in which the output of a primary Parzen window model for sequence similarity and a logistic regression classifier of amino acid k-mer composition serve as input for a second-level logistic regression classifier. Compared with previously published research, our model is trained on five times more data than used by any other method before (82 000 proteins). When tested on a separate holdout set not used at any point of method development, our server attained the best results in comparison with other currently available methods: accuracy 75.4%, Matthews correlation coefficient 0.39, sensitivity 0.731, specificity 0.759, gain (soluble) 2.263. In summary, due to utilization of cutting-edge machine learning technologies combined with the largest currently available experimental data set, the PROSO II server constitutes a substantial improvement in protein solubility predictions. PROSO II is available at http://mips.helmholtz-muenchen.de/prosoII.
Affiliation(s)
- Pawel Smialowski
- Department of Genome Oriented Bioinformatics, Technische Universität Muenchen, Freising, Germany.
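The figures quoted in the PROSO II abstract above (accuracy, Matthews correlation coefficient, sensitivity, specificity) are all standard quantities derived from a binary confusion matrix. The following is a minimal sketch of those definitions only; the function name `binary_metrics` and the counts are hypothetical illustrations and are not taken from the paper or its data.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Compute standard binary-classification metrics from confusion counts.

    Assumes both classes are present (tp + fn > 0 and tn + fp > 0).
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # true-positive rate (recall on "soluble")
    specificity = tn / (tn + fp)   # true-negative rate
    # Matthews correlation coefficient; defined as 0 when the denominator vanishes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
example = binary_metrics(tp=70, fp=25, tn=75, fn=30)
print(example)
```

Because the Matthews correlation coefficient uses all four cells of the confusion matrix, it stays informative when soluble and insoluble proteins are unevenly represented, which is one reason it is commonly reported alongside accuracy.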
25
Retallack DM, Jin H, Chew L. Reliable protein production in a Pseudomonas fluorescens expression system. Protein Expr Purif 2012; 81:157-65. [DOI: 10.1016/j.pep.2011.09.010] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Received: 09/02/2011] [Revised: 09/20/2011] [Accepted: 09/20/2011] [Indexed: 10/17/2022]
26
Expression pattern analysis of regulatory transcription factors in Caenorhabditis elegans. Methods Mol Biol 2012; 786:21-50. [PMID: 21938618 DOI: 10.1007/978-1-61779-292-2_2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 12/19/2022]
Abstract
Expression pattern data are fundamental to understanding transcriptional regulatory networks and the biological significance of such networks. For Caenorhabditis elegans, expression pattern analysis of transcription factor genes, with cellular resolution, typically involves generation of transcription factor gene/reporter gene fusions. This is followed by the creation of C. elegans strains transgenic for, and determination of expression patterns driven by, these fusions. Physiologically relevant regulatory relationships between transcription factors are both inferred from their expression patterns, in combination with protein-DNA interaction data, and evidenced from alterations of expression patterns when networks are disturbed.
27
Petersen LK, Stowers RS. A Gateway MultiSite recombination cloning toolkit. PLoS One 2011; 6:e24531. [PMID: 21931740 PMCID: PMC3170369 DOI: 10.1371/journal.pone.0024531] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.8] [Received: 06/01/2011] [Accepted: 08/11/2011] [Indexed: 02/02/2023] Open
Abstract
The generation of DNA constructs is often a rate-limiting step in conducting biological experiments. Recombination cloning of single DNA fragments using the Gateway system provided an advance over traditional restriction enzyme cloning due to increases in efficiency and reliability. Here we introduce a series of entry clones and a destination vector for use in two, three, and four fragment Gateway MultiSite recombination cloning whose advantages include increased flexibility and versatility. In contrast to Gateway single-fragment cloning approaches where variations are typically incorporated into model system-specific destination vectors, our Gateway MultiSite cloning strategy incorporates variations in easily generated entry clones that are model system-independent. In particular, we present entry clones containing insertions of GAL4, QF, UAS, QUAS, eGFP, and mCherry, among others, and demonstrate their in vivo functionality in Drosophila by using them to generate expression clones including GAL4 and QF drivers for various trp ion channel family members, UAS and QUAS excitatory and inhibitory light-gated ion channels, and QUAS red and green fluorescent synaptic vesicle markers. We thus establish a starter toolkit of modular Gateway MultiSite entry clones potentially adaptable to any model system. An inventory of entry clones and destination vectors for Gateway MultiSite cloning has also been established (www.gatewaymultisite.org).
Affiliation(s)
- Lena K. Petersen
- Department of Cell Biology and Neuroscience, Montana State University, Bozeman, Montana, United States of America
- R. Steven Stowers
- Department of Cell Biology and Neuroscience, Montana State University, Bozeman, Montana, United States of America
28
Ghasemi A, Salmanian AH, Sadeghifard N, Salarian AA, Gholi MK. Cloning, expression and purification of Pwo polymerase from Pyrococcus woesei. Iran J Microbiol 2011; 3:118-22. [PMID: 22347593 PMCID: PMC3279813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/04/2022]
Abstract
BACKGROUND AND OBJECTIVES: Pyrococcus woesei is a hyperthermophilic archaeon that produces a heat-stable polymerase (Pwo polymerase) with proofreading activity. MATERIALS AND METHODS: In this study, this microorganism was cultured, its DNA was extracted, and the pwo polymerase gene was cloned, expressed and purified. The DNA sequence of the cloned gene was verified by sequencing. The pwo polymerase gene consists of 2,328 bp (775 amino acids, with a molecular weight of about 90 kD). Cloning was done with the GATEWAY™ Cloning System and, for purification of the recombinant protein, a His6x-Tag was added to its C-terminus. RESULTS AND CONCLUSION: Pwo polymerase was purified on Ni-NTA resin. A PCR assay showed that Pwo polymerase activity is comparable to that of a commercial Pfu polymerase.
Affiliation(s)
- Amir Ghasemi
- Department of Pathobiology, Institute of Public Health, Tehran University of Medical Sciences, Tehran, Iran; Army University of Medical Sciences, Tehran, Iran. Corresponding author: Amir Ghasemi, MSc. Tel: +98-9123595610.
- Ali Hatef Salmanian
- National Institute of Genetic Engineering and Biotechnology (NIGEB), Shahrak-e-Pajoohesh, 15th Km, Tehran-Karaj Highway, Tehran, Iran.
- Nourkhoda Sadeghifard
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran.
- Mohammad Khalifeh Gholi
- Department of Pathobiology, Institute of Public Health, Tehran University of Medical Sciences, Tehran, Iran
29
Martínez-Turiño S, Hernández C. A membrane-associated movement protein of Pelargonium flower break virus shows RNA-binding activity and contains a biologically relevant leucine zipper-like motif. Virology 2011; 413:310-9. [PMID: 21444100 DOI: 10.1016/j.virol.2011.03.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Received: 01/27/2011] [Revised: 02/11/2011] [Accepted: 03/03/2011] [Indexed: 10/18/2022]
Abstract
Two small viral proteins (DGBp1 and DGBp2) have been proposed to act in a concerted manner to aid intra- and intercellular trafficking of carmoviruses though the distribution of functions and mode of action of each protein partner are not yet clear. Here we have confirmed the requirement of the DGBps of Pelargonium flower break virus (PFBV), p7 and p12, for pathogen movement. Studies focused on p12 have shown that it associates to cellular membranes, which is in accordance to its hydrophobic profile and to that reported for several homologs. However, peculiarities that distinguish p12 from other DGBps2 have been found. Firstly, it contains a leucine zipper-like motif which is essential for virus infectivity in plants. Secondly, it has an unusually long and basic N-terminal region that confers RNA binding activity. The results suggest that PFBV p12 may differ mechanistically from related proteins and possible roles of PFBV DGBps are discussed.
Affiliation(s)
- Sandra Martínez-Turiño
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universidad Politécnica de Valencia, Ciudad Politécnica de la Innovación, Ed. 8E. Camino de Vera s/n, 46022 Valencia, Spain
30
Kwon K, Hasseman J, Latham S, Grose C, Do Y, Fleischmann RD, Pieper R, Peterson SN. Recombinant expression and functional analysis of proteases from Streptococcus pneumoniae, Bacillus anthracis, and Yersinia pestis. BMC Biochem 2011; 12:17. [PMID: 21545736 PMCID: PMC3113736 DOI: 10.1186/1471-2091-12-17] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Received: 11/18/2010] [Accepted: 05/05/2011] [Indexed: 12/17/2022]
Abstract
Background: Uncharacterized proteases naturally expressed by bacterial pathogens represent an important topic in infectious disease research, because these enzymes may have critical roles in pathogenicity and cell physiology. It has been observed that cloning, expression and purification of proteases often fail due to their catalytic functions which, in turn, cause toxicity in the E. coli heterologous host. Results: In order to address this problem systematically, a modified pipeline of our high-throughput protein expression and purification platform was developed. This included the use of a specific E. coli strain, BL21(DE3) pLysS, to tightly control the expression of recombinant proteins, and various expression vectors encoding fusion proteins to enhance recombinant protein solubility. Proteases fused to large fusion protein domains, namely maltose-binding protein (MBP), SP-MBP (which contains a signal peptide at the N-terminus of MBP), disulfide oxidoreductase (DsbA) and glutathione S-transferase (GST), showed improved expression and solubility. Overall, 86.1% of selected protease genes, including hypothetical proteins, were expressed and purified using a combination of five different expression vectors. To detect novel proteolytic activities, zymography and fluorescence-based assays were performed, and the protease activities of more than 46% of purified proteases and 40% of hypothetical proteins that were predicted to be proteases were confirmed. Conclusions: Multiple expression vectors employing distinct fusion tags in a high-throughput pipeline increased overall success rates in expression, solubility and purification of proteases. The combinatorial functional analysis of the purified proteases using fluorescence assays and zymography confirmed their function.
Affiliation(s)
- Keehwan Kwon
- Pathogen Functional Genomics Resource Center, J. Craig Venter Institute, Rockville, Maryland 20850, USA.
31
Hirose S, Kawamura Y, Yokota K, Kuroita T, Natsume T, Komiya K, Tsutsumi T, Suwa Y, Isogai T, Goshima N, Noguchi T. Statistical analysis of features associated with protein expression/solubility in an in vivo Escherichia coli expression system and a wheat germ cell-free expression system. J Biochem 2011; 150:73-81. [DOI: 10.1093/jb/mvr042] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Indexed: 01/10/2023]
32
Jones MR, Lohn Z, Rose AM. Specialized chromosomes and their uses in Caenorhabditis elegans. Methods Cell Biol 2011; 106:23-64. [PMID: 22118273 DOI: 10.1016/b978-0-12-544172-8.00002-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 12/17/2022]
Abstract
Research on Caenorhabditis elegans involves the use of a wide range of genetic and molecular tools consisting of chromosomal material captured and modified for specific purposes. These "specialized chromosomes" come in many forms ranging from relatively simple gene deletions to complex rearrangements involving endogenous chromosomes as well as transgenic constructs. In this chapter, we describe the specialized chromosomes that are available in C. elegans, their origins, practical considerations, and methods for generation and evaluation. We will summarize their uses for biological studies, and their contribution to our knowledge about chromosome biology.
Affiliation(s)
- Martin R Jones
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
33
Farmani J, Safari M, Roohvand F, Razavi SH, Aghasadeghi MR, Noorbazargan H. Conjugated linoleic acid-producing enzymes: A bioinformatics study. Eur J Lipid Sci Technol 2010. [DOI: 10.1002/ejlt.201000360] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 11/05/2022]
34
Mansell TJ, Linderman SW, Fisher AC, DeLisa MP. A rapid protein folding assay for the bacterial periplasm. Protein Sci 2010; 19:1079-90. [PMID: 20440843 DOI: 10.1002/pro.388] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Indexed: 11/08/2022]
Abstract
An array of genetic screens and selections has been developed for reporting protein folding and solubility in the cytoplasm of living cells. However, there are currently no analogous folding assays for the bacterial periplasm, despite the significance of this compartment for the expression of recombinant proteins, especially those requiring important posttranslational modifications (e.g., disulfide bond formation). Here, we describe an engineered genetic selection for monitoring protein folding in the periplasmic compartment of Escherichia coli cells. In this approach, target proteins are sandwiched between an N-terminal signal recognition particle (SRP)-dependent signal peptide and a C-terminal selectable marker, TEM-1 beta-lactamase. The resulting chimeras are localized to the periplasmic space via the cotranslational SRP pathway. Using a panel of native and heterologous proteins, we demonstrate that the folding efficiency of various target proteins correlates directly with in vivo beta-lactamase activity and thus resistance to ampicillin. We also show that this reporter is useful for the discovery of extrinsic periplasmic factors (e.g., chaperones) that affect protein folding and for obtaining folding-enhanced proteins via directed evolution. Collectively, these data demonstrate that our periplasmic folding reporter is a powerful tool for screening and engineering protein folding in a manner that does not require any structural or functional information about the target protein.
Affiliation(s)
- Thomas J Mansell
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, New York 14853, USA
35
Chan WC, Liang PH, Shih YP, Yang UC, Lin WC, Hsu CN. Learning to predict expression efficacy of vectors in recombinant protein production. BMC Bioinformatics 2010; 11 Suppl 1:S21. [PMID: 20122193 PMCID: PMC3009492 DOI: 10.1186/1471-2105-11-s1-s21] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Indexed: 11/13/2022] Open
Abstract
Background Recombinant protein production is a useful biotechnology to produce a large quantity of highly soluble proteins. Currently, the most widely used production system is to fuse a target protein into different vectors in Escherichia coli (E. coli). However, the production efficacy of different vectors varies for different target proteins. Trial-and-error is still the common practice to find out the efficacy of a vector for a given target protein. Previous studies are limited in that they assumed that proteins would be over-expressed and focused only on the solubility of expressed proteins. In fact, many pairings of vectors and proteins result in no expression. Results In this study, we applied machine learning to train prediction models to predict whether a pairing of vector-protein will express or not express in E. coli. For expressed cases, the models further predict whether the expressed proteins would be soluble. We collected a set of real cases from the clients of our recombinant protein production core facility, where six different vectors were designed and studied. This set of cases is used in both training and evaluation of our models. We evaluate three different models based on the support vector machines (SVM) and their ensembles. Unlike many previous works, these models consider the sequence of the target protein as well as the sequence of the whole fusion vector as the features. We show that a model that classifies a case into one of the three classes (no expression, inclusion body and soluble) outperforms a model that considers the nested structure of the three classes, while a model that can take advantage of the hierarchical structure of the three classes performs slight worse but comparably to the best model. Meanwhile, compared to previous works, we show that the prediction accuracy of our best method still performs the best. 
Lastly, we briefly present two methods to use the trained model in the design of the recombinant protein production systems to improve the chance of high soluble protein production. Conclusion In this paper, we show that a machine learning approach to the prediction of the efficacy of a vector for a target protein in a recombinant protein production system is promising and may compliment traditional knowledge-driven study of the efficacy. We will release our program to share with other labs in the public domain when this paper is published.
Affiliation(s)
- Wen-Ching Chan
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan.
36
Kurotani A, Takagi T, Toyama M, Shirouzu M, Yokoyama S, Fukami Y, Tokmakov AA. Comprehensive bioinformatics analysis of cell-free protein synthesis: identification of multiple protein properties that correlate with successful expression. FASEB J 2009; 24:1095-104. [PMID: 19940260 DOI: 10.1096/fj.09-139527] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Indexed: 11/11/2022]
Abstract
High-throughput cell-free protein synthesis is being used increasingly in structural/functional genomics projects. However, the factors determining expression success are poorly understood. Here, we evaluated the expression of 3066 human proteins and their domains in a bacterial cell-free system and analyzed the correlation of protein expression with 39 physicochemical and structural properties of proteins. As a result of the bioinformatics analysis performed, we determined the 18 most influential features that affect protein amenability to cell-free expression. They include protein length; hydrophobicity; pI; content of charged, nonpolar, and aromatic residues; cysteine content; solvent accessibility; presence of coiled coil; content of intrinsically disordered and structured (alpha-helix and beta-sheet) sequence; number of disulfide bonds and functional domains; presence of transmembrane regions; PEST motifs; and signaling sequences. This study represents the first comprehensive bioinformatics analysis of heterologous protein synthesis in a cell-free system. The rules and correlations revealed here provide a plethora of important insights into rationalization of cell-free protein production and can be of practical use for protein engineering with the aim of increasing expression success.
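Several of the sequence-derived properties named in this abstract (length, hydrophobicity, and the content of charged, nonpolar, aromatic, and cysteine residues) can be computed directly from a primary sequence. The following is a minimal sketch, not the authors' pipeline: the function name `sequence_features` and the residue groupings are assumptions chosen for illustration (grouping conventions vary between tools), and hydrophobicity is taken here as the mean Kyte-Doolittle hydropathy (GRAVY).

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumed residue groupings (illustrative only; other tools group differently).
CHARGED = set("DEKR")
NONPOLAR = set("AVLIMFWPG")
AROMATIC = set("FWY")


def sequence_features(seq: str) -> dict:
    """Return simple composition-based features for a protein sequence."""
    seq = seq.upper()
    if not seq or any(aa not in KD for aa in seq):
        raise ValueError("sequence must be non-empty and use the 20 standard residues")
    counts = Counter(seq)
    n = len(seq)

    def frac(group):
        # Fraction of residues belonging to the given group.
        return sum(counts[aa] for aa in group) / n

    return {
        "length": n,
        "gravy": sum(KD[aa] for aa in seq) / n,  # mean hydropathy
        "charged_fraction": frac(CHARGED),
        "nonpolar_fraction": frac(NONPOLAR),
        "aromatic_fraction": frac(AROMATIC),
        "cysteine_fraction": counts["C"] / n,
    }


if __name__ == "__main__":
    # Toy sequence, not taken from any study listed here.
    print(sequence_features("MKTAYIAKQRCDEF"))
```

Features of this kind would typically be tabulated per protein and then correlated with the observed expression outcome; structural properties such as solvent accessibility or disorder require dedicated predictors and are not covered by this sketch.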
37
Quartley E, Alexandrov A, Mikucki M, Buckner FS, Hol WG, DeTitta GT, Phizicky EM, Grayhack EJ. Heterologous expression of L. major proteins in S. cerevisiae: a test of solubility, purity, and gene recoding. J Struct Funct Genomics 2009; 10:233-47. [PMID: 19701618 DOI: 10.1007/s10969-009-9068-9] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Received: 04/24/2009] [Accepted: 08/06/2009] [Indexed: 11/25/2022]
Abstract
High level expression of many eukaryotic proteins for structural analysis is likely to require a eukaryotic host since many proteins are either insoluble or lack essential post-translational modifications when expressed in E. coli. The well-studied eukaryote Saccharomyces cerevisiae possesses several attributes of a good expression host: it is simple and inexpensive to culture, has proven genetic tractability, and has excellent recombinant DNA tools. We demonstrate here that this yeast exhibits three additional characteristics that are desirable in a eukaryotic expression host. First, expression in yeast significantly improves the solubility of proteins that are expressed but insoluble in E. coli. The expression and solubility of 83 Leishmania major ORFs were compared in S. cerevisiae and in E. coli, with the result that 42 of the 64 ORFs with good expression and poor solubility in E. coli are highly soluble in S. cerevisiae. Second, the yield and purity of heterologous proteins expressed in yeast is sufficient for structural analysis, as demonstrated with both small scale purifications of 21 highly expressed proteins and large scale purifications of 2 proteins, which yield highly homogeneous preparations. Third, protein expression can be improved by altering codon usage, based on the observation that a codon-optimized construct of one ORF yields three-fold more protein. Thus, these results provide direct verification that high level expression and purification of heterologous proteins in S. cerevisiae is feasible and likely to improve expression of proteins whose solubility in E. coli is poor.
Affiliation(s)
- Erin Quartley
- Center for Pediatric Biomedical Research, University of Rochester Medical School, Rochester, NY 14642, USA
38
Screening colonies of pooled ORFeomes (SCOOP): a rapid and efficient strategy for expression screening ORFeomes in Escherichia coli. Protein Expr Purif 2009; 68:121-7. [PMID: 19635569 DOI: 10.1016/j.pep.2009.07.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 09/17/2008] [Revised: 07/18/2009] [Accepted: 07/20/2009] [Indexed: 11/22/2022]
Abstract
We have designed and evaluated a novel strategy for screening large gene collections available as GATEWAY-adapted ORFeomes for soluble recombinant overexpression in Escherichia coli, called "Screening Colonies of ORFeome Pools" (SCOOP). From a large gene collection we could, without expensive multi-well based cloning and expression screening, determine which targets were suitable for large-scale expression and purification. Normalized bacterial overnight cultures of an ORF collection of entry clones derived from the Kaposi's sarcoma associated herpesvirus (KSHV) were pooled and used for the isolation of plasmid DNA. The resulting ORF library was subcloned into a prokaryotic expression vector in a single recombination reaction and was subsequently screened with the colony filtration (CoFi) blot for soluble recombinant overexpression in E. coli. ORFs determined to express soluble recombinant proteins were identified by sequencing and analysed by small-scale IMAC and SDS-PAGE. As a reference, we subcloned all ORFs individually using a traditional multi-well based procedure and screened them for soluble expression. Our results show that the two processes have a similar efficiency as 23 and 25 out of 74 assessable clones were identified as soluble expressers using SCOOP and the traditional multi-well procedure, respectively. Because SCOOP minimises costs for cloning and expression screening, it constitutes an interesting alternative for establishing expression of large gene collections. SCOOP also allows affordable screening in alternative vectors, expression strains and physical conditions, which is challenging in large-scale protein production programs. With this strategy in hand success rates for future proteome-wide protein production efforts can be significantly increased.
39
Oh NS, Park JS, Jeon YJ, Oh JH, Jeong SY, Yang JO, Park YW, Yoo HS, Kim NS. Generation of expression clone set for functional proteomics of human gastric and liver cancers. Exp Biol Med (Maywood) 2009; 234:1220-9. [PMID: 19596826 DOI: 10.3181/0812-rm-371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]
Abstract
Two thousand sixty-eight multi-purpose expression clones for the 326 candidate genes related to gastric or liver cancers were constructed using the Gateway system. These clones can be expressed as His, Glutathione-S-transferase (GST) or Enhanced version of the green fluorescent protein (EGFP) fusion proteins in E. coli, insect cells or mammalian cells. For the 246 E. coli expression clones, the GST fusion proteins had greater expression efficiency and solubility than the His fusion proteins. Approximately 20% of the expressed proteins had unexpected molecular weights. A detailed sequence analysis of these clones revealed frameshift mutations resulting from insertion, deletion or substitution of nucleotides. The results indicate that these changes in the candidate genes may affect the occurrence of gastric or liver cancers. In addition, when 105 proteins, which were expressed in E. coli at very low or undetectable levels, were expressed in insect cells, 76% of the proteins were expressed very well and most were soluble. We also found that most of the 30 proteins prepared using EGFP mammalian expression clones were localized to cellular compartments expected by Gene ontology (GO) and this localization was unaffected if the EGFP-fusion was at the N-terminal or C-terminal region of the protein. Antibody production and subcellular localization analysis of the candidate genes as well as a screen of genes involved in carcinogenesis pathways are currently in progress using these expression clones. These studies provide a valuable resource for developing a better understanding of the molecular mechanism of carcinogenesis in both gastric and liver cancer and would be very helpful in diagnosis and therapeutic predictions.
Affiliation(s)
- Nang-Soo Oh
- Laboratory of Human Genomics, Genome Research Center, KRIBB, Daejeon 305-806, Korea
40
Foti L, Fonseca BDPFE, Nascimento LD, Marques CDFS, Silva EDD, Duarte CAB, Probst CM, Goldenberg S, Pinto AG, Krieger MA. Viability study of a multiplex diagnostic platform for Chagas disease. Mem Inst Oswaldo Cruz 2009; 104 Suppl 1:136-41. [DOI: 10.1590/s0074-02762009000900019] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Received: 05/19/2009] [Accepted: 06/16/2009] [Indexed: 01/20/2023]
41
Magnan CN, Randall A, Baldi P. SOLpro: accurate sequence-based prediction of protein solubility. Bioinformatics 2009; 25:2200-7. [PMID: 19549632 DOI: 10.1093/bioinformatics/btp386] [Citation(s) in RCA: 343] [Impact Index Per Article: 22.9] [Indexed: 11/13/2022]
Abstract
MOTIVATION Protein insolubility is a major obstacle for many experimental studies. A sequence-based prediction method able to accurately predict the propensity of a protein to be soluble on overexpression could be used, for instance, to prioritize targets in large-scale proteomics projects and to identify mutations likely to increase the solubility of insoluble proteins. RESULTS Here, we first curate a large, non-redundant and balanced training set of more than 17 000 proteins. Next, we extract and study 23 groups of features computed directly or predicted (e.g. secondary structure) from the primary sequence. The data and the features are used to train a two-stage support vector machine (SVM) architecture. The resulting predictor, SOLpro, is compared directly with existing methods and shows significant improvement according to standard evaluation metrics, with an overall accuracy of over 74% estimated using multiple runs of 10-fold cross-validation.
Affiliation(s)
- Christophe N Magnan
- Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California, Irvine, CA, USA
42
Possee RD, Hitchman RB, Richards KS, Mann SG, Siaterli E, Nixon CP, Irving H, Assenberg R, Alderton D, Owens RJ, King LA. Generation of baculovirus vectors for the high-throughput production of proteins in insect cells. Biotechnol Bioeng 2008; 101:1115-22. [PMID: 18781697 DOI: 10.1002/bit.22002] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Indexed: 11/06/2022]
Abstract
The baculovirus expression system is one of the most popular methods used for the production of recombinant proteins but has several complex steps which have proved inherently difficult to adapt to a multi-parallel process. We have developed a bacmid vector that does not require any form of selection pressure to separate recombinant virus from non-recombinant parental virus. The method relies on homologous recombination in insect cells between a transfer vector containing a gene to be expressed and a replication-deficient bacmid. The target gene replaces a bacterial replicon at the polyhedrin loci, simultaneously restoring a virus gene essential for replication. Therefore, only recombinant virus can replicate facilitating the rapid production of multiple recombinant viruses on automated platforms in a one-step procedure. Using this vector allowed us to automate the generation of multiple recombinant viruses with a robotic liquid handler and then rapidly screen infected insect cell supernatant for the presence of secreted proteins.
Affiliation(s)
- Robert D Possee
- National Environmental Research Council, Centre for Hydrology & Ecology, Oxford, UK
43
High-Throughput Self-Interaction Chromatography: Applications in Protein Formulation Prediction. Pharm Res 2008; 26:296-305. [DOI: 10.1007/s11095-008-9737-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Received: 07/08/2008] [Accepted: 09/24/2008] [Indexed: 10/21/2022]
44
Abstract
During the last 10 years, there has been a large increase in the number of genome sequences available for study, altering the way that the biology of organisms is studied. In particular, scientific attention has increasingly focused on the proteome, and specifically on the role of all the proteins encoded by the genome. We focus here on several aspects of this problem. We describe several technologies in widespread use to clone genes on a genome-wide scale, and to express and purify the proteins encoded by these genes. We also describe a number of methods that have been developed to analyze various biochemical properties of the proteins, with attention to the methodology and the limitations of the approaches, followed by a look at possible developments in the next decade.
Affiliation(s)
- Eric M Phizicky
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine, Rochester, NY 14642, USA.
45
Shafran H, Miyara I, Eshed R, Prusky D, Sherman A. Development of new tools for studying gene function in fungi based on the Gateway system. Fungal Genet Biol 2008; 45:1147-54. [PMID: 18550398 DOI: 10.1016/j.fgb.2008.04.011] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Received: 06/25/2007] [Revised: 03/24/2008] [Accepted: 04/06/2008] [Indexed: 10/22/2022]
Abstract
Genomic information of many fungi has been released but large scale functional genomic studies are still limited by a lack of high-throughput methods. The low rates of homologous recombination and low rates of transformation are limiting steps in filamentous fungi, but the molecular tools are also lagging behind. In this paper we describe two new high-throughput functional genomic tools for filamentous fungi that are based on the Gateway technology. One system is the Gateway RNAi vector for fungi that allows gene silencing in a high-throughput manner. The other system is a high-throughput deletion construct system. These systems were tested using the PAC1 gene of Colletotrichum gloeosporioides. Using these types of approaches, large scale functional genomics experiments can be performed in filamentous fungi.
Affiliation(s)
- Hadas Shafran
- Department of Genomics, Agricultural Research Organization, The Volcani Center, Bet Dagan 50250, Israel
46
Espargaró A, Castillo V, de Groot NS, Ventura S. The in vivo and in vitro aggregation properties of globular proteins correlate with their conformational stability: the SH3 case. J Mol Biol 2008; 378:1116-31. [PMID: 18423663 DOI: 10.1016/j.jmb.2008.03.020] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Received: 10/16/2007] [Revised: 03/13/2008] [Accepted: 03/13/2008] [Indexed: 11/24/2022]
Abstract
Protein misfolding and deposition underlie an increasing number of debilitating human disorders and constitute a problem of major concern in biotechnology. In the last years, in vitro studies have provided valuable insights into the physicochemical principles underlying protein aggregation. Nevertheless, information about the determinants of protein deposition within the cell is scarce and only a few systematic studies comparing in vitro and in vivo data have been reported. Here, we have used the SH3 domain of alpha-spectrin as a model globular protein in an attempt to understand the relationship between protein aggregation in the test-tube and in the more complex cellular environment. The investigation of the aggregation in Escherichia coli of this domain and a large set of mutants, together with the analysis of their sequential and conformational properties allowed us to evaluate the contribution of different polypeptidic factors to the cellular deposition of globular proteins. The data presented here suggest that the rules that govern in vitro protein aggregation are also valid in in vivo contexts. They also provide relevant insights into intracellular protein deposition in both conformational diseases and recombinant protein production.
Affiliation(s)
- Alba Espargaró
- Departament de Bioquímica i Biologia Molecular, Facultat de Biociències, Universitat Autònoma de Barcelona, E-08193 Bellaterra, Spain
47
Zaccai NR, Carter LG, Berrow NS, Sainsbury S, Nettleship JE, Walter TS, Harlos K, Owens RJ, Wilson KS, Stuart DI, Esnouf RM. Crystal structure of a 3-oxoacyl-(acylcarrier protein) reductase (BA3989) from Bacillus anthracis at 2.4-Å resolution. Proteins 2008; 70:562-7. [PMID: 17894349 DOI: 10.1002/prot.21624] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Indexed: 11/07/2022]
Affiliation(s)
- Nathan R Zaccai
- The Oxford Protein Production Facility, Division of Structural Biology, University of Oxford, Roosevelt Drive, Oxford, OX3 7BN, United Kingdom
48
Abstract
Pharmacological treatment in Alzheimer's disease (AD) accounts for 10-20% of direct costs, and fewer than 20% of AD patients are moderate responders to conventional drugs (donepezil, rivastigmine, galantamine, memantine), with doubtful cost-effectiveness. Both AD pathogenesis and drug metabolism are genetically regulated complex traits in which hundreds of genes cooperatively participate. Structural genomics studies demonstrated that more than 200 genes might be involved in AD pathogenesis regulating dysfunctional genetic networks leading to premature neuronal death. The AD population exhibits a higher genetic variation rate than the control population, with absolute and relative genetic variations of 40-60% and 0.85-1.89%, respectively. AD patients also differ in their genomic architecture from patients with other forms of dementia. Functional genomics studies in AD revealed that age of onset, brain atrophy, cerebrovascular hemodynamics, brain bioelectrical activity, cognitive decline, apoptosis, immune function, lipid metabolism dyshomeostasis, and amyloid deposition are associated with AD-related genes. Pioneering pharmacogenomics studies also demonstrated that the therapeutic response in AD is genotype-specific, with apolipoprotein E (APOE) 4/4 carriers the worst responders to conventional treatments. About 10-20% of Caucasians are carriers of defective cytochrome P450 (CYP) 2D6 polymorphic variants that alter the metabolism and effects of AD drugs and many psychotropic agents currently administered to patients with dementia. There is a moderate accumulation of AD-related genetic variants of risk in CYP2D6 poor metabolizers (PMs) and ultrarapid metabolizers (UMs), who are the worst responders to conventional drugs. The association of the APOE-4 allele with specific genetic variants of other genes (e.g., CYP2D6, angiotensin-converting enzyme [ACE]) negatively modulates the therapeutic response to multifactorial treatments affecting cognition, mood, and behavior. 
Pharmacogenetic and pharmacogenomic factors may account for 60-90% of drug variability in drug disposition and pharmacodynamics. The incorporation of pharmacogenetic/pharmacogenomic protocols to AD research and clinical practice can foster therapeutics optimization by helping to develop cost-effective pharmaceuticals and improving drug efficacy and safety.
Affiliation(s)
- Ramón Cacabelos
- EuroEspes Biomedical Research Center, Institute for CNS Disorders, Bergondo, Coruña, Spain
49
Kumar S, Chaudhary K, Foster JM, Novelli JF, Zhang Y, Wang S, Spiro D, Ghedin E, Carlow CKS. Mining predicted essential genes of Brugia malayi for nematode drug targets. PLoS One 2007; 2:e1189. [PMID: 18000556 PMCID: PMC2063515 DOI: 10.1371/journal.pone.0001189] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.8] [Received: 08/30/2007] [Accepted: 10/25/2007] [Indexed: 12/02/2022]
Abstract
We report results from the first genome-wide application of a rational drug target selection methodology to a metazoan pathogen genome, the completed draft sequence of Brugia malayi, a parasitic nematode responsible for human lymphatic filariasis. More than 1.5 billion people worldwide are at risk of contracting lymphatic filariasis and onchocerciasis, a related filarial disease. Drug treatments for filariasis have not changed significantly in over 20 years, and with the risk of resistance rising, there is an urgent need for the development of new anti-filarial drug therapies. The recent publication of the draft genomic sequence for B. malayi enables a genome-wide search for new drug targets. However, there is no functional genomics data in B. malayi to guide the selection of potential drug targets. To circumvent this problem, we have utilized the free-living model nematode Caenorhabditis elegans as a surrogate for B. malayi. Sequence comparisons between the two genomes allow us to map C. elegans orthologs to B. malayi genes. Using these orthology mappings and by incorporating the extensive genomic and functional genomic data, including genome-wide RNAi screens, that already exist for C. elegans, we identify potentially essential genes in B. malayi. Further incorporation of human host genome sequence data and a custom algorithm for prioritization enables us to collect and rank nearly 600 drug target candidates. Previously identified potential drug targets cluster near the top of our prioritized list, lending credibility to our methodology. Over-represented Gene Ontology terms, predicted InterPro domains, and RNAi phenotypes of C. elegans orthologs associated with the potential target pool are identified. By virtue of the selection procedure, the potential B. malayi drug targets highlight components of key processes in nematode biology such as central metabolism, molting and regulation of gene expression.
Affiliation(s)
- Sanjay Kumar
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Kshitiz Chaudhary
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Jeremy M. Foster
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Jacopo F. Novelli
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Yinhua Zhang
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- Shiliang Wang
- The Institute for Genomic Research, Rockville, Maryland, United States of America
- David Spiro
- The Institute for Genomic Research, Rockville, Maryland, United States of America
- Elodie Ghedin
- The Institute for Genomic Research, Rockville, Maryland, United States of America
- Division of Infectious Diseases, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America
- Clotilde K. S. Carlow
- Division of Parasitology, New England Biolabs, Inc., Ipswich, Massachusetts, United States of America
- * To whom correspondence should be addressed. E-mail:
50
Kwon K, Pieper R, Shallom S, Grose C, Kwon E, Do Y, Latham S, Alami H, Huang ST, Gatlin C, Papazisi L, Fleischmann R, Peterson S. A correlation analysis of protein characteristics associated with genome-wide high throughput expression and solubility of Streptococcus pneumoniae proteins. Protein Expr Purif 2007; 55:368-78. [PMID: 17703947 DOI: 10.1016/j.pep.2007.06.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Received: 03/23/2007] [Revised: 06/12/2007] [Accepted: 06/18/2007] [Indexed: 12/01/2022]
Abstract
We have developed and evaluated a highly parallel protein expression and purification system using ORFs derived from the pathogenic bacterium Streptococcus pneumoniae as a representative test case in conjunction with the Gateway cloning technology. Establishing high throughput protein production capability is essential for genome-wide characterization of protein function. In this study, we focused on protein expression and purification outcomes generated from an expression vector which encodes an NH(2)-terminal hexa-histidine tag and a COOH-terminal S-tag. Purified recombinant proteins were validated by SDS-PAGE, followed by in-gel digestion and identification by MALDI-TOF/TOF analysis. Starting with 1360 sequence-validated destination clones we examined correlation analyses of expression and solubility of a wide variety of recombinant proteins. In total, 428 purified proteins (31%) were recovered in soluble form. We describe a semi-quantitative scoring method using an S-tag assay to improve the throughput and efficiency of expression and solubility studies for recombinant proteins. Given a relatively large dataset derived from proteins representing all functional groups in a microbial genome we correlated various protein characteristics as they relate to protein expression outcomes.
Affiliation(s)
- Keehwan Kwon
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA