1
|
Ambrosini C, Destefanis E, Kheir E, Broso F, Alessandrini F, Longhi S, Battisti N, Pesce I, Dassi E, Petris G, Cereseto A, Quattrone A. Translational enhancement by base editing of the Kozak sequence rescues haploinsufficiency. Nucleic Acids Res 2022; 50:10756-10771. [PMID: 36165847 PMCID: PMC9561285 DOI: 10.1093/nar/gkac799] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 09/01/2022] [Accepted: 09/22/2022] [Indexed: 11/28/2022] Open
Abstract
A variety of single-gene human diseases are caused by haploinsufficiency, a genetic condition by which mutational inactivation of one allele leads to reduced protein levels and functional impairment. Translational enhancement of the spare allele could exert a therapeutic effect. Here we developed BOOST, a novel gene-editing approach to rescue haploinsufficiency loci by the change of specific single nucleotides in the Kozak sequence, which controls translation by regulating start codon recognition. We evaluated for translational strength 230 Kozak sequences of annotated human haploinsufficient genes and 4621 derived variants, which can be installed by base editing, by a high-throughput reporter assay. Of these variants, 149 increased the translation of 47 Kozak sequences, demonstrating that a substantial proportion of haploinsufficient genes are controlled by suboptimal Kozak sequences. Validation of 18 variants for 8 genes produced an average enhancement in an expression window compatible with the rescue of the genetic imbalance. Base editing of the NCF1 gene, whose monoallelic loss causes chronic granulomatous disease, resulted in the desired increase of NCF1 (p47phox) protein levels in a relevant cell model. We propose BOOST as a fine-tuned approach to modulate translation, applicable to the correction of dozens of haploinsufficient monogenic disorders independently of the causing mutation.
Collapse
Affiliation(s)
- Chiara Ambrosini
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Eliana Destefanis
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Eyemen Kheir
- Laboratory of Molecular Virology, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Francesca Broso
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Federica Alessandrini
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Sara Longhi
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Nicolò Battisti
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Isabella Pesce
- Cell Analysis and Separation Core Facility, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Erik Dassi
- Laboratory of RNA Regulatory Networks, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Gianluca Petris
- Medical Research Council Laboratory of Molecular Biology (MRC LMB), Cambridge CB2 0QH, UK
| | - Anna Cereseto
- Laboratory of Molecular Virology, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| | - Alessandro Quattrone
- Laboratory of Translational Genomics, Department of Cellular, Computational and Integrative Biology - CIBIO, University of Trento, Trento 38123, Italy
| |
Collapse
|
2
|
Galluccio M, Console L, Pochini L, Scalise M, Giangregorio N, Indiveri C. Strategies for Successful Over-Expression of Human Membrane Transport Systems Using Bacterial Hosts: Future Perspectives. Int J Mol Sci 2022; 23:ijms23073823. [PMID: 35409183 PMCID: PMC8998559 DOI: 10.3390/ijms23073823] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 03/28/2022] [Accepted: 03/28/2022] [Indexed: 02/06/2023] Open
Abstract
Ten percent of human genes encode for membrane transport systems, which are key components in maintaining cell homeostasis. They are involved in the transport of nutrients, catabolites, vitamins, and ions, allowing the absorption and distribution of these compounds to the various body regions. In addition, roughly 60% of FDA-approved drugs interact with membrane proteins, among which are transporters, often responsible for pharmacokinetics and side effects. Defects of membrane transport systems can cause diseases; however, knowledge of the structure/function relationships of transporters is still limited. Among the expression of hosts that produce human membrane transport systems, E. coli is one of the most favorable for its low cultivation costs, fast growth, handiness, and extensive knowledge of its genetics and molecular mechanisms. However, the expression in E. coli of human membrane proteins is often toxic due to the hydrophobicity of these proteins and the diversity in structure with respect to their bacterial counterparts. Moreover, differences in codon usage between humans and bacteria hamper translation. This review summarizes the many strategies exploited to achieve the expression of human transport systems in bacteria, providing a guide to help people who want to deal with this topic.
Collapse
Affiliation(s)
- Michele Galluccio
- Unit of Biochemistry and Molecular Biotechnology, Department of Biology, Ecology and Earth Sciences (DiBEST), University of Calabria, Via P. Bucci 4c, Arcavacata di Rende, 87036 Cosenza, Italy; (M.G.); (L.C.); (L.P.); (M.S.)
| | - Lara Console
- Unit of Biochemistry and Molecular Biotechnology, Department of Biology, Ecology and Earth Sciences (DiBEST), University of Calabria, Via P. Bucci 4c, Arcavacata di Rende, 87036 Cosenza, Italy; (M.G.); (L.C.); (L.P.); (M.S.)
| | - Lorena Pochini
- Unit of Biochemistry and Molecular Biotechnology, Department of Biology, Ecology and Earth Sciences (DiBEST), University of Calabria, Via P. Bucci 4c, Arcavacata di Rende, 87036 Cosenza, Italy; (M.G.); (L.C.); (L.P.); (M.S.)
| | - Mariafrancesca Scalise
- Unit of Biochemistry and Molecular Biotechnology, Department of Biology, Ecology and Earth Sciences (DiBEST), University of Calabria, Via P. Bucci 4c, Arcavacata di Rende, 87036 Cosenza, Italy; (M.G.); (L.C.); (L.P.); (M.S.)
| | - Nicola Giangregorio
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnology (IBIOM), National Research Council (CNR), Via Amendola 165/A, 70126 Bari, Italy;
| | - Cesare Indiveri
- Unit of Biochemistry and Molecular Biotechnology, Department of Biology, Ecology and Earth Sciences (DiBEST), University of Calabria, Via P. Bucci 4c, Arcavacata di Rende, 87036 Cosenza, Italy; (M.G.); (L.C.); (L.P.); (M.S.)
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnology (IBIOM), National Research Council (CNR), Via Amendola 165/A, 70126 Bari, Italy;
- Correspondence:
| |
Collapse
|
3
|
Abstract
The development of safe and effective vaccines against viruses is central to disease control. With advancements in DNA synthesis technology, the production of synthetic viral genomes has fueled many research efforts that aim to generate attenuated viruses by introducing synonymous mutations. Elucidation of the mechanisms underlying virus attenuation through synonymous mutagenesis is revealing interesting new biology that can be exploited for vaccine development. Here, we review recent advancements in this field of synthetic virology and focus on the molecular mechanisms of attenuation by genetic recoding of viruses. We highlight the action of the zinc finger antiviral protein (ZAP) and RNase L, two proteins involved in the inhibition of viruses enriched for CpG and UpA dinucleotides, that are often the products of virus recoding algorithms. Additionally, we discuss current challenges in the field as well as studies that may illuminate how other host functions, such as translation, are potentially involved in the attenuation of recoded viruses.
Collapse
|
4
|
Wang YJ, Vaidyanathan PP, Rojas-Duran MF, Udeshi ND, Bartoli KM, Carr SA, Gilbert WV. Lso2 is a conserved ribosome-bound protein required for translational recovery in yeast. PLoS Biol 2018; 16:e2005903. [PMID: 30208026 PMCID: PMC6135351 DOI: 10.1371/journal.pbio.2005903] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2018] [Accepted: 08/09/2018] [Indexed: 02/05/2023] Open
Abstract
Ribosome-binding proteins function broadly in protein synthesis, gene regulation, and cellular homeostasis, but the complete complement of functional ribosome-bound proteins remains unknown. Using quantitative mass spectrometry, we identified late-annotated short open reading frame 2 (Lso2) as a ribosome-associated protein that is broadly conserved in eukaryotes. Genome-wide crosslinking and immunoprecipitation of Lso2 and its human ortholog coiled-coil domain containing 124 (CCDC124) recovered 25S ribosomal RNA in a region near the A site that overlaps the GTPase activation center. Consistent with this location, Lso2 also crosslinked to most tRNAs. Ribosome profiling of yeast lacking LSO2 (lso2Δ) revealed global translation defects during recovery from stationary phase with translation of most genes reduced more than 4-fold. Ribosomes accumulated at start codons, were depleted from stop codons, and showed codon-specific changes in occupancy in lso2Δ. These defects, and the conservation of the specific ribosome-binding activity of Lso2/CCDC124, indicate broadly important functions in translation and physiology. Translation, or the production of protein from messenger RNA (mRNA), is catalyzed by a universally conserved macromolecular machine known as the ribosome. Ribosome-binding factors are also required for all substeps of translation, from initial recruitment of mRNA to peptide chain elongation to release of the mature polypeptide. However, many ribosome interactors have been identified whose effects on translation and physiology are unknown. Here, we show that the uncharacterized yeast protein late-annotated short open reading frame 2 (Lso2) crosslinks to a region of the ribosome that underlies accurate progression through all substeps of translation, the GTPase activation center. This specific binding activity is conserved in the human ortholog of Lso2, coiled-coil domain containing 124 (CCDC124). Null mutants of lso2 also show severe translation defects during recovery from extended starvation, including failure to initiate on most mRNAs and a general block to peptide chain elongation. We propose that these defects could arise from a function for Lso2 in modulating the activity or integrity of the ribosome GTPase activation center during challenging growth regimes.
Collapse
Affiliation(s)
- Yinuo J. Wang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Microbiology Graduate Program, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | | | - Maria F. Rojas-Duran
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - Namrata D. Udeshi
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Kristen M. Bartoli
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Steven A. Carr
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Wendy V. Gilbert
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
5
|
Abrahams L, Hurst LD. Adenine Enrichment at the Fourth CDS Residue in Bacterial Genes Is Consistent with Error Proofing for +1 Frameshifts. Mol Biol Evol 2018; 34:3064-3080. [PMID: 28961919 PMCID: PMC5850271 DOI: 10.1093/molbev/msx223] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Beyond selection for optimal protein functioning, coding sequences (CDSs) are under selection at the RNA and DNA levels. Here, we identify a possible signature of “dual-coding,” namely extensive adenine (A) enrichment at bacterial CDS fourth sites. In 99.07% of studied bacterial genomes, fourth site A use is greater than expected given genomic A-starting codon use. Arguing for nucleotide level selection, A-starting serine and arginine second codons are heavily utilized when compared with their non-A starting synonyms. Several models have the ability to explain some of this trend. In part, A-enrichment likely reduces 5′ mRNA stability, promoting translation initiation. However T/U, which may also reduce stability, is avoided. Further, +1 frameshifts on the initiating ATG encode a stop codon (TGA) provided A is the fourth residue, acting either as a frameshift “catch and destroy” or a frameshift stop and adjust mechanism and hence implicated in translation initiation. Consistent with both, genomes lacking TGA stop codons exhibit weaker fourth site A-enrichment. Sequences lacking a Shine–Dalgarno sequence and those without upstream leader genes, that may be more error prone during initiation, have greater utilization of A, again suggesting a role in initiation. The frameshift correction model is consistent with the notion that many genomic features are error-mitigation factors and provides the first evidence for site-specific out of frame stop codon selection. We conjecture that the NTG universal start codon may have evolved as a consequence of TGA being a stop codon and the ability of NTGA to rapidly terminate or adjust a ribosome.
Collapse
Affiliation(s)
- Liam Abrahams
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
6
|
Nakagawa S, Niimura Y, Gojobori T. Comparative genomic analysis of translation initiation mechanisms for genes lacking the Shine-Dalgarno sequence in prokaryotes. Nucleic Acids Res 2017; 45:3922-3931. [PMID: 28334743 PMCID: PMC5397173 DOI: 10.1093/nar/gkx124] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 02/11/2017] [Indexed: 02/01/2023] Open
Abstract
In prokaryotes, translation initiation is believed to occur through an interaction between the 3΄ tail of a 16S rRNA and a corresponding Shine–Dalgarno (SD) sequence in the 5΄ untranslated region (UTR) of an mRNA. However, some genes lack SD sequences (non-SD genes), and the fraction of non-SD genes in a genome varies depending on the prokaryotic species. To elucidate non-SD translation initiation mechanisms in prokaryotes from an evolutionary perspective, we statistically examined the nucleotide frequencies around the initiation codons in non-SD genes from 260 prokaryotes (235 bacteria and 25 archaea). We identified distinct nucleotide frequency biases upstream of the initiation codon in bacteria and archaea, likely because of the presence of leaderless mRNAs lacking a 5΄ UTR. Moreover, we observed overall similarities in the nucleotide patterns between upstream and downstream regions of the initiation codon in all examined phyla. Symmetric nucleotide frequency biases might facilitate translation initiation by preventing the formation of secondary structures around the initiation codon. These features are more prominent in species’ genomes that harbor large fractions of non-SD sequences, suggesting that a reduced stability around the initiation codon is important for efficient translation initiation in prokaryotes.
Collapse
Affiliation(s)
- So Nakagawa
- Department of Molecular Life Science, Tokai University School of Medicine, Isehara 259-1193, Japan.,Micro/Nano Technology Center, Tokai University, Hiratsuka 259-1292, Japan
| | - Yoshihito Niimura
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo 113-8657, Japan
| | - Takashi Gojobori
- King Abdullah University of Science and Technology, Computational Bioscience Research Center, Thuwal 23955-6900, Kingdom of Saudi Arabia
| |
Collapse
|
7
|
Nath Choudhury M, Uddin A, Chakraborty S. Codon usage bias and its influencing factors for Y-linked genes in human. Comput Biol Chem 2017; 69:77-86. [DOI: 10.1016/j.compbiolchem.2017.05.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 11/30/2022]
|
8
|
Porceddu A, Zenoni S, Camiolo S. The signatures of selection for translational accuracy in plant genes. Genome Biol Evol 2013; 5:1117-26. [PMID: 23695187 PMCID: PMC3698923 DOI: 10.1093/gbe/evt078] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Little is known about the natural selection of synonymous codons within the coding sequences of plant genes. We analyzed the distribution of synonymous codons within plant coding sequences and found that preferred codons tend to encode the more conserved and functionally important residues of plant proteins. This was consistent among several synonymous codon families and applied to genes with different expression profiles and functions. Most of the randomly chosen alternative sets of codons scored weaker associations than the actual sets of preferred codons, suggesting that codon position within plant genes and codon usage bias have coevolved to maximize translational accuracy. All these findings are consistent with the mistranslation-induced protein misfolding theory, which predicts the natural selection of highly preferred codons more frequently at sites where translation errors could compromise protein folding or functionality. Our results will provide an important insight in future studies of protein folding, molecular evolution, and transgene design for optimal expression.
Collapse
Affiliation(s)
- Andrea Porceddu
- Dipartimento di Agraria, Sezione di Agronomia e Coltivazione Erbacee Genetica-SACEG, Università degli studi di Sassari, Italy.
| | | | | |
Collapse
|
9
|
Indiveri C, Galluccio M, Scalise M, Pochini L. Strategies of bacterial over expression of membrane transporters relevant in human health: the successful case of the three members of OCTN subfamily. Mol Biotechnol 2013; 54:724-36. [PMID: 22843325 PMCID: PMC3636443 DOI: 10.1007/s12033-012-9586-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The OCTN subfamily includes OCTN1, 2, and 3 which are structurally and functionally related. These transporters are involved in maintenance of the carnitine homeostasis, which is essential in mammals for fatty acid β-oxidation, VLDL assembly, post-translational modifications, and other essential functions. Indeed, defects of these transporters lead to severe pathologies. OCTN1 and OCTN2 are expressed in many human tissues, while OCTN3 gene has been identified only in mouse and rat. The transporters mediate transport of carnitine and other substrates with different efficiencies and mechanisms. In order to over express the three proteins, a screening of many combinations of E. coli strains with plasmid constructs has been conducted. Only Rosetta(DE3) or Rosettagami2(DE3) gave significant expression. Higher protein amounts were firstly obtained with pET-41a(+) or pGEX-4T1 carrying fusion protein tags which required additional purification passages. Vectors carrying only a 6His tag, suitable for single passage purification, were preferred even though they lead to lower initial expression levels. Expressions were then increased optimizing several critical parameters. hOCTN1 was obtained with pH6EX3 in RosettaGami2(DE3)pLysS. hOCTN2 and mOCTN3 were obtained using pET-21a(+) in Rosetta(DE3). In particular, hOCTN2 was expressed only after codon bias, substituting the second triplet CGG with AAA (R2K mutant). The best growth conditions for hOCTN1 and mOCTN3 were 28 °C and 6 h of induction, while 4 h of induction for hOCTN2R2K. The proteins collected in the insoluble fraction of cell lysates, solubilized with sarkosyl, were purified by Ni-chelating chromatography. Final yield was 2.0, 3.0, or 3.5 mg/l of cell culture for mOCTN3, hOCTN1, or hOCTN2R2K. The data indicated that, in spite of the close evolutionary relations, several factors play different critical roles in bacterial expression of the three proteins, thus general criteria cannot be underlined. However, the strategy of dealing with related proteins revealed to be finally successful for over expressing all the three subfamily members.
Collapse
Affiliation(s)
- Cesare Indiveri
- Department of Cell Biology, University of Calabria, Arcavacata di Rende, Italy.
| | | | | | | |
Collapse
|
10
|
Galluccio M, Amelio L, Scalise M, Pochini L, Boles E, Indiveri C. Over-expression in E. coli and purification of the human OCTN2 transport protein. Mol Biotechnol 2012; 50:1-7. [PMID: 21487769 DOI: 10.1007/s12033-011-9406-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The OCTN2 cDNA amplified from human skin fibroblast was cloned in pET-41a(+) carrying the glutathione S-transferase (GST) gene. The construct pET-41a(+)-hOCTN2 was used to express the GST-hOCTN2 fusion protein in Escherichia coli Rosetta(DE3)pLysS. The best over-expression was obtained after 6 h of induction with IPTG at 28°C. The GST-hOCTN2 polypeptide was collected in the inclusion bodies and showed an apparent molecular mass on SDS-PAGE of 85 kDa. After solubilization with a buffer containing 0.8% sarkosyl and 3 M urea, the fusion protein was applied onto a Ni(2+)-chelating chromatography column. The purified GST-hOCTN2 was treated with thrombin, and the hOCTN2 was separated from the GST by size exclusion chromatography. After the whole procedure, a yield of about 0.2 mg purified protein per liter of cell culture was obtained. To improve the protein yield, hOCTN2 cDNA was subjected to codon bias. The second codon CGG was substituted with AAA; the substitution led to the mutation R2K in the hOCTN2 protein. hOCTN2(R2K) cDNA was cloned in pET-21a(+) carrying a C-terminal 6His tag. The resulting protein was expressed in E. coli Rosetta(DE3)pLysS and purified by Ni(2+)-chelating chromatography. A yield of about 3.5 mg purified protein per liter of cell culture was obtained with this procedure.
Collapse
Affiliation(s)
- Michele Galluccio
- Department of Cell Biology, University of Calabria, Via P. Bucci 4c, 87036, Arcavacata di Rende, Italy.
| | | | | | | | | | | |
Collapse
|
11
|
Porceddu A, Camiolo S. Spatial analyses of mono, di and trinucleotide trends in plant genes. PLoS One 2011; 6:e22855. [PMID: 21829660 PMCID: PMC3148226 DOI: 10.1371/journal.pone.0022855] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Accepted: 06/30/2011] [Indexed: 11/24/2022] Open
Abstract
Genomic DNA sequences display compositional heterogeneity on many scales. In this paper we analyzed tendencies and anomalies in the occurence of mono, di and trinucleotides in structural regions of plant genes. Representation of these trends as a function of position along genic sequences highlighted compositional features peculiar of either monocots or eudicots that were remarkably uniform within these two evolutionary clades. The most evident of these features appeared in the form of gradient of base content along the direction of transcription. The robustness of such a representation was validated in sequences sub-datasets generated considering structural and compositional features such as total length of cds, overall GC content and genic orientation in the genome. Piecewise regression analyses indicated that the gradients could be conveniently approximated to a two segmented model where a first region featuring a steep slope is followed by a second segment fitting a milder variation. In general, monocots species showed steeper segments than eudicots. The guanine gradient was the most distinctive feature between the two evolutionary clades, being moderately increasing in eudicots and firmly decreasing in monocots. Single gene investigation revealed that a high proportion of genes show compositional trends compatible with a segmented model suggesting that these features are essential attributes of gene organization. Dinucleotide and trinucleotide biases were referred to expectation based on a random union of the component elements. The average bias at dinucleotide level identified a significant undererpresentation of some dinucleotide and the overrepresention of others. The bias at trinucleotide level was on average low. Finally, the analysis of bryophyte coding sequences showed mononucleotide, dinucleotide and trinucleotide compositional trends resembling those of higher plants. This finding suggested that the emergenge of compositional bias is an ancient event in evolution which was already present at the time of land conquest by green plants.
Collapse
Affiliation(s)
- Andrea Porceddu
- Dipartimento di Scienze Agronomiche e Genetica Vegetale Agraria, Università degli Studi di Sassari, Sassari, Italy.
| | | |
Collapse
|
12
|
Tang SL, Chang BC, Halgamuge SK. Gene functionality's influence on the second codon: A large-scale survey of second codon composition in three domains. Genomics 2010; 96:92-101. [DOI: 10.1016/j.ygeno.2010.04.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2009] [Revised: 02/03/2010] [Accepted: 04/07/2010] [Indexed: 10/19/2022]
|
13
|
Dynamic evolution of translation initiation mechanisms in prokaryotes. Proc Natl Acad Sci U S A 2010; 107:6382-7. [PMID: 20308567 DOI: 10.1073/pnas.1002036107] [Citation(s) in RCA: 99] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
It is generally believed that prokaryotic translation is initiated by the interaction between the Shine-Dalgarno (SD) sequence in the 5' UTR of an mRNA and the anti-SD sequence in the 3' end of a 16S ribosomal RNA. However, there are two exceptional mechanisms, which do not require the SD sequence for translation initiation: one is mediated by a ribosomal protein S1 (RPS1) and the other used leaderless mRNA that lacks its 5' UTR. To understand the evolutionary changes of the mechanisms of translation initiation, we examined how universal the SD sequence is as an effective initiator for translation among prokaryotes. We identified the SD sequence from 277 species (249 eubacteria and 28 archaebacteria). We also devised an SD index that is a proportion of SD-containing genes in which the differences of GC contents are taken into account. We found that the SD indices varied among prokaryotic species, but were similar within each phylum. Although the anti-SD sequence is conserved among species, loss of the SD sequence seems to have occurred multiple times, independently, in different phyla. For those phyla, RPS1-mediated or leaderless mRNA-used mechanisms of translation initiation are considered to be working to a greater extent. Moreover, we also found that some species, such as Cyanobacteria, may acquire new mechanisms of translation initiation. Our findings indicate that, although translation initiation is indispensable for all protein-coding genes in the genome of every species, its mechanisms have dynamically changed during evolution.
Collapse
|
14
|
Volkova OA, Kochetov AV. Interrelations between the nucleotide context of human start AUG codon, N-end amino acids of the encoded protein and initiation of translation. J Biomol Struct Dyn 2010; 27:611-8. [PMID: 20085378 DOI: 10.1080/07391102.2010.10508575] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
It is known that the recognition of AUG triplet by eukaryotic ribosomes as a translation start site strongly depends on its nucleotide context. However, the relative significance of different context positions is not fully clear. In particular, it concerns the role of 3'-end part of the context located at the beginning of the protein-coding sequence. The significant bias observed in nucleotide frequencies in positions +4, +5, +6 (corresponding to the second codon of CDS) could result from different reasons and their contribution to start codon recognition and initiation of translation is under discussion. In this study, we conducted a comparative computational analysis of the human mRNA samples containing different nucleotides (adenine, guanine or pyrimidine) in the essential context position -3. It was found that the presence of G in position +4 could be important for the context variant GnnAUG but not for AnnAUG. Interestingly, the second position of proteins encoded by mRNAs with AnnAUG context variant was specifically and significantly enriched with serine whereas the presence of GnnAUG context also correlated with a higher occurrence of alanine and glycine. It is likely that the efficiency of translation initiation process can depend on the interplay between 5'-context part, 3'-context part and the type of amino acid in the second position of the encoded protein.
Collapse
Affiliation(s)
- Oxana A Volkova
- Institute of Cytology and Genetics, 10, Lavrentiev Ave, Novosibirsk 630090, Russia.
| | | |
Collapse
|
15
|
Matuda S, Arimura T, Kimura A, Takekura H, Ohta S, Nakano K. A novel protein found in the I bands of myofibrils is produced by alternative splicing of the DLST gene. Biochim Biophys Acta Gen Subj 2009; 1800:31-9. [PMID: 19819302 DOI: 10.1016/j.bbagen.2009.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2009] [Revised: 09/17/2009] [Accepted: 10/02/2009] [Indexed: 11/16/2022]
Abstract
BACKGROUND It is not known if the dihydrolipoamide succinyltransferase (DLST) gene, a mitochondrial protein, undergoes alternative splicing. We identified an uncharacterized protein reacting with an anti-DLST antibody in the I bands of myofibrils in rat skeletal muscle. METHODS Immunocytochemical staining with an anti-DLST antibody, the purification and amino acid sequence analysis of the protein, and the isolation and sequencing of the protein's cDNA were carried out to clarify the properties of the protein and its relationship to the DLST gene. RESULTS A pyrophosphate concentration >10 mM was necessary to extract the protein from myofibrils in the presence of salt with a higher concentration than 0.6 M, at an alkaline pH of 7.5-8.0. The protein corresponded to the amino acid sequence of the C-terminal side of DLST. The cDNAs for this protein were splicing variants of the DLST gene, with deletions of both exons 2 and 3, or only exon 2 or 3. These variants possessed an open reading frame from an initiation codon in exon 8 of the DLST gene to a termination codon in exon 15, generating a protein with a molecular weight of 30 kDa. CONCLUSIONS The DLST gene undergoes alternative splicing, generating the protein isolated from the I bands of myofibrils. GENERAL SIGNIFICANCE The DLST gene produces two different proteins with quite different functions via alternative splicing.
Collapse
Affiliation(s)
- Sadayuki Matuda
- Department of Biology and Health Science, Kanoya National Institute of Fitness and Sports, Kanoya, Kagoshima 891-2393, Japan
| | | | | | | | | | | |
Collapse
|
16
|
Kim DM, Ahn JH, Keum JW. ISES, an integrated strategy for tunable expression of recombinant proteins through combinatorial engineering of initial codons. J Biotechnol 2008. [DOI: 10.1016/j.jbiotec.2008.07.424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
17
|
Kochetov AV. Alternative translation start sites and hidden coding potential of eukaryotic mRNAs. Bioessays 2008; 30:683-91. [PMID: 18536038 DOI: 10.1002/bies.20771] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
It is widely suggested that a eukaryotic mRNA typically contains one translation start site and encodes a single functional protein product. However, according to current points of view on translation initiation mechanisms, eukaryotic ribosomes can recognize several alternative translation start sites and the number of experimentally verified examples of alternative translation is growing rapidly. Also, the frequent occurrence of alternative translation events and their functional significance are supported by the results of computational evaluations. The functional role of alternative translation and its contribution to eukaryotic proteome complexity are discussed.
Collapse
|
18
|
Palenchar PM. Amino Acid Biases in the N- and C-termini of Proteins are Evolutionarily Conserved and are Conserved Between Functionally Related Proteins. Protein J 2008; 27:283-91. [DOI: 10.1007/s10930-008-9136-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
19
|
Ahn JH, Keum JW, Kim DM. High-Throughput, Combinatorial Engineering of Initial Codons for Tunable Expression of Recombinant Proteins. J Proteome Res 2008; 7:2107-13. [DOI: 10.1021/pr700856s] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jin-Ho Ahn
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 151-742, Korea, School of Chemical and Biological Engineering, Seoul National University, Seoul 151-742, Korea, and Department of Fine Chemical Engineering and Chemistry, Chungnam National University, Daejeon 305-764, Korea
| | - Jung-Won Keum
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 151-742, Korea, School of Chemical and Biological Engineering, Seoul National University, Seoul 151-742, Korea, and Department of Fine Chemical Engineering and Chemistry, Chungnam National University, Daejeon 305-764, Korea
| | - Dong-Myung Kim
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 151-742, Korea, School of Chemical and Biological Engineering, Seoul National University, Seoul 151-742, Korea, and Department of Fine Chemical Engineering and Chemistry, Chungnam National University, Daejeon 305-764, Korea
| |
Collapse
|
20
|
Nakagawa S, Niimura Y, Gojobori T, Tanaka H, Miura KI. Diversity of preferred nucleotide sequences around the translation initiation codon in eukaryote genomes. Nucleic Acids Res 2007; 36:861-71. [PMID: 18086709 PMCID: PMC2241899 DOI: 10.1093/nar/gkm1102] [Citation(s) in RCA: 147] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Understanding regulatory mechanisms of protein synthesis in eukaryotes is essential for the accurate annotation of genome sequences. Kozak reported that the nucleotide sequence GCCGCC(A/G)CCAUGG (AUG is the initiation codon) was frequently observed in vertebrate genes and that this ‘consensus’ sequence enhanced translation initiation. However, later studies using invertebrate, fungal and plant genes reported different ‘consensus’ sequences. In this study, we conducted extensive comparative analyses of nucleotide sequences around the initiation codon by using genomic data from 47 eukaryote species including animals, fungi, plants and protists. The analyses revealed that preferred nucleotide sequences are quite diverse among different species, but differences between patterns of nucleotide bias roughly reflect the evolutionary relationships of the species. We also found strong biases of A/G at position −3, A/C at position −2 and C at position +5 that were commonly observed in all species examined. Genes with higher expression levels showed stronger signals, suggesting that these nucleotides are responsible for the regulation of translation initiation. The diversity of preferred nucleotide sequences around the initiation codon might be explained by differences in relative contributions from two distinct patterns, GCCGCCAUG and AAAAAAAUG, which implies the presence of multiple molecular mechanisms for controlling translation initiation.
Collapse
Affiliation(s)
- So Nakagawa
- Department of Systems Biology, School of Biomedical Science, Department of Bioinformatics, Medical Research Institute, Tokyo Medical and Dental University, Yushima, Tokyo, Japan
| | | | | | | | | |
Collapse
|
21
|
C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families. BMC Genomics 2007; 8:191. [PMID: 17594486 PMCID: PMC1929074 DOI: 10.1186/1471-2164-8-191] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2006] [Accepted: 06/26/2007] [Indexed: 12/28/2022] Open
Abstract
Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*), the ER-retention signal (K/HDEL*), the ER-retrieval signal for membrane bound proteins (KKxx*), the prenylation signal (CC*) and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists between species, among kingdoms and across eukaryotes. Motifs of note include a serine-acidic peptide (DSD*) as well as several lysine enriched motifs found in nearly all eukaryotic genomes examined. Conclusion We have successfully generated a high confidence representation of eukaryotic motifs anchored at the C-terminus. A high incidence of true-positives in our results suggests that several previously unidentified tripeptide patterns are strong candidates for representing novel peptide motifs of a widely employed nature in the C-terminal biology of eukaryotes. Our application of comparative genomics, statistical over-representation and the adjustment for protein family homology has generated several hypotheses concerning the C-terminal topology as it pertains to sorting and potential protein interaction signals. This approach to background reduction could be expanded for application to protein motif prediction in the protein interior. A parallel N-terminal analysis is presented as supplementary data.
Collapse
|
22
|
Li W, Zou H, Tao M. Sequences downstream of the start codon and their relations to G + C content and optimal growth temperature in prokaryotic genomes. Antonie van Leeuwenhoek 2007; 92:417-27. [PMID: 17562217 DOI: 10.1007/s10482-007-9170-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 03/30/2007] [Indexed: 11/29/2022]
Abstract
The mechanism of translation initiation is responsible for shaping the mRNA sequences downstream of the start codon. However, this region has not been systematically analyzed in prokaryotes. We used sequence logos and statistic methods to analyze the patterns of overrepresented sequences in this region for 125 species of bacteria and 23 species of archaea. The specific positions are compared to the first 33 amino acids in the proteins. At the 2nd amino acid position, Lys, Ser or Thr is highly overrepresented for 68% to 84% of the genomes examined and Ala is highly overrepresented for 57% of the genomes. Overrepresentation of Lys2 is negatively correlated with the G + C content and overrepresentation of Ser2 or Thr2 is positively correlated with the G + C content of genomes. Ile at the 4th to the 8th positions were found to be overrepresented for 91% of the genomes analyzed and this seemed to be conserved for both bacteria and archaea. Organisms growing at high temperatures have relatively low extent of nucleotides bias at 5' termini of open reading frames (ORFs). The extent of overrepresenting A and underrepresenting G at ORF 5' termini is reduced in thermophiles and hyperthermophiles for both archaea and bacteria.
Collapse
Affiliation(s)
- Wencheng Li
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
| | | | | |
Collapse
|
23
|
Kermode AR. Plants as factories for production of biopharmaceutical and bioindustrial proteins: lessons from cell biologyThis review is one of a selection of papers published in the Special Issue on Plant Cell Biology. ACTA ACUST UNITED AC 2006. [DOI: 10.1139/b06-069] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Transgenic plants, seeds, and cultured plant cells are potentially one of the most economical systems for large-scale production of recombinant proteins for industrial and pharmaceutical uses. Biochemical, technical, and economic concerns with current production systems have generated enormous interest in developing plants as alternative production systems. However, various challenges must be met before plant systems can fully emerge as suitable, viable alternatives to current animal-based systems for large-scale production of biopharmaceuticals and other products. Aside from regulatory issues and developing efficient methods for downstream processing of recombinant proteins, there are at least two areas of challenge: (1) Can we engineer plant cells to accumulate recombinant proteins to sufficient levels? (2) Can we engineer plant cells to post-translationally modify recombinant proteins so that they are structurally and functionally similar to the native proteins? Attempts to improve the accumulation of a recombinant protein in plant cells require an appreciation of the processes of gene transcription, mRNA stability, processing, and export, and translation initiation and efficiency. Likewise, many post-translational factors must be considered, including protein stability, protein function and activity, and protein targeting. Moreover, we need to understand how the various processes leading from the gene to the functional protein are interdependent and functionally linked. Manipulation of the post-translational processing machinery of plant cells, especially that for N-linked glycosylation and glycan processing, is a challenging and exciting area. The functions of N-glycan heterogeneity and microheterogeneity, especially with respect to protein function, stability, and transport, are poorly understood and this represents an important area of cell biology.
Collapse
Affiliation(s)
- Allison R. Kermode
- Department of Biological Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada (e-mail: )
| |
Collapse
|
24
|
Liu Q. Comparative analysis of base biases around the stop codons in six eukaryotes. Biosystems 2006; 81:281-9. [PMID: 15979780 DOI: 10.1016/j.biosystems.2005.05.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2005] [Revised: 05/11/2005] [Accepted: 05/14/2005] [Indexed: 11/17/2022]
Abstract
Using full-length cDNA sequences, a comparative analysis of sequence patterns around the stop codons in six eukaryotes was performed. Here, it was showed that the codon immediately before and after the stop codons (defined as -1 codon and +1 codon, respectively) were much more biased than other examined positions, especially at the second position of -1 codons and the first position of +1 codons which were rich in As/Us and purines, respectively, for most species. The author speculated that strongly biased sequence pattern from position -2 to +4 might act as an extended translation termination signal. Translation termination was catalyzed by release factors that recognized the stop codons. The multiple amino acid sequence alignment of eukaryotic release factor 1 (eRF1) of 20 species showed that there were 16 residue sites that were strictly conserved, especially the invariant amino acids Ile70 and Lys71. Accordingly, it could be inferred that those candidate amino acids might involve in the recognition process. Moreover, the possible stop signal recognition hypothesis was also discussed herein.
Collapse
Affiliation(s)
- Qingpo Liu
- Department of Agronomy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310029, China.
| |
Collapse
|
25
|
Volkova OA, Titov SE, Kochetov AV. Correlation between the contexts of the translation initiation signal and the N-terminal sequence of arabidopsis, yeast, mouse, and human proteins. Biophysics (Nagoya-shi) 2006. [DOI: 10.1134/s0006350906070037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
26
|
Abstract
Cinnamoyl-CoA reductase (CCR) is responsible for the CoA ester-->aldehyde conversion in monolignol biosynthesis, which can divert phenylpropanoid-derived metabolites into the biosynthesis of lignin. To gain a better understanding of lignin biosynthesis in wheat (Triticum aestivum L.), a cDNA encoding CCR was isolated and named Ta-CCR2. DNA hybridization analyses demonstrated that the Ta-CCR2 gene exists in three copies in the wheat genome. RNA blot hybridization indicated that Ta-CCR2 was expressed most abundantly in root and stem tissues that were in the process of lignification. The secondary and three-dimensional structures of Ta-CCR2 were analyzed by molecular modeling. Recombinant Ta-CCR2 protein purified from E. coli converted feruloyl CoA, 5-OH-feruloyl CoA, sinapoyl CoA and caffeoyl CoA with almost similar efficiency, suggesting that it is involved in both G and S lignin synthesis. Ta-CCR2 had a very low V max value for 4-coumaroyl CoA, which may serve as a mechanism to control metabolic flux to H lignin in vivo . Furthermore, the reaction mechanism of Ta-CCR2 was analyzed in relation to its possible three-dimensional structure. The activity of Ta-CCR2 in relation to lignin biosynthesis is discussed.
Collapse
Affiliation(s)
- Qing-Hu Ma
- Key Laboratory of Photosynthesis and Environmental Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, PR China.
| | | |
Collapse
|
27
|
Qin H, Wu WB, Comeron JM, Kreitman M, Li WH. Intragenic spatial patterns of codon usage bias in prokaryotic and eukaryotic genomes. Genetics 2005; 168:2245-60. [PMID: 15611189 PMCID: PMC1448744 DOI: 10.1534/genetics.104.030866] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To study the roles of translational accuracy, translational efficiency, and the Hill-Robertson effect in codon usage bias, we studied the intragenic spatial distribution of synonymous codon usage bias in four prokaryotic (Escherichia coli, Bacillus subtilis, Sulfolobus tokodaii, and Thermotoga maritima) and two eukaryotic (Saccharomyces cerevisiae and Drosophila melanogaster) genomes. We generated supersequences at each codon position across genes in a genome and computed the overall bias at each codon position. By quantitatively evaluating the trend of spatial patterns using isotonic regression, we show that in yeast and prokaryotic genomes, codon usage bias increases along translational direction, which is consistent with purifying selection against nonsense errors. Fruit fly genes show a nearly symmetric M-shaped spatial pattern of codon usage bias, with less bias in the middle and both ends. The low codon usage bias in the middle region is best explained by interference (the Hill-Robertson effect) between selections at different codon positions. In both yeast and fruit fly, spatial patterns of codon usage bias are characteristically different from patterns of GC-content variations. Effect of expression level on the strength of codon usage bias is more conspicuous than its effect on the shape of the spatial distribution.
Collapse
Affiliation(s)
- Hong Qin
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | |
Collapse
|
28
|
Ohashi Y, Yamashiro A, Washio T, Ishii N, Ohshima H, Michishita T, Tomita M, Itaya M. In silico diagnosis of inherently inhibited gene expression focusing on initial codon combinations. Gene 2005; 347:11-9. [PMID: 15716115 DOI: 10.1016/j.gene.2004.11.027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2004] [Revised: 10/19/2004] [Accepted: 11/17/2004] [Indexed: 11/21/2022]
Abstract
The translation start site, immediately downstream from the start codon, is a dominant factor for gene expression in Escherichia coli. At present, no method exists to improve the expression level of cloned genes, since it remains difficult to find the best codon combination within the region. We determined the expression parameters that correspond to all sense codons within the first four codons using GFPuv which encodes a derivative of green fluorescent protein. Using a genetic algorithm (GA)-based computer program, these parameters were incorporated in a simple, static model for the prediction of translation efficiency, and optimized to the expression level for 137 randomly isolated GFPuv genes. The calculated initial translation index (ITI), also proven for the DsRed2 gene that encodes a red fluorescent protein, should provide a solution to overcome the gene expression problem in cloned genes whose expression is often inherently blocked at the translation process. The proposed method facilitates heterologous protein production in E. coli, the most commonly used host in biological and industrial fields.
Collapse
Affiliation(s)
- Yoshiaki Ohashi
- Institute for Advanced Biosciences, Keio University, 403-1 Nipponkoku, Daihoji, Tsuruoka, Yamagata 997-0017, Japan
| | | | | | | | | | | | | | | |
Collapse
|