1
|
Farookhi H, Xia X. Differential Selection for Translation Efficiency Shapes Translation Machineries in Bacterial Species. Microorganisms 2024; 12:768. [PMID: 38674712 PMCID: PMC11052298 DOI: 10.3390/microorganisms12040768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/01/2024] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open
Abstract
Different bacterial species have dramatically different generation times, from 20-30 min in Escherichia coli to about two weeks in Mycobacterium leprae. The translation machinery in a cell needs to synthesize all proteins for a new cell in each generation. The three subprocesses of translation, i.e., initiation, elongation, and termination, are expected to be under stronger selection pressure to optimize in short-generation bacteria (SGB) such as Vibrio natriegens than in the long-generation Mycobacterium leprae. The initiation efficiency depends on the start codon decoded by the initiation tRNA, the optimal Shine-Dalgarno (SD) decoded by the anti-SD (aSD) sequence on small subunit rRNA, and the secondary structure that may embed the initiation signals and prevent them from being decoded. The elongation efficiency depends on the tRNA pool and codon usage. The termination efficiency in bacteria depends mainly on the nature of the stop codon and the nucleotide immediately downstream of the stop codon. By contrasting SGB with long-generation bacteria (LGB), we predict (1) SGB to have more ribosome RNA operons to produce ribosomes, and more tRNA genes for carrying amino acids to ribosomes, (2) SGB to have a higher percentage of genes using AUG as the start codon and UAA as the stop codon than LGB, (3) SGB to exhibit better codon and anticodon adaptation than LGB, and (4) SGB to have a weaker secondary structure near the translation initiation signals than LGB. These differences between SGB and LGB should be more pronounced in highly expressed genes than the rest of the genes. We present empirical evidence in support of these predictions.
Collapse
Affiliation(s)
- Heba Farookhi
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| |
Collapse
|
2
|
Molecular signatures for gene expression in Mycobacterium leprae: A bioinformatic analysis. GENE REPORTS 2023. [DOI: 10.1016/j.genrep.2023.101741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
3
|
Kohl MP, Kompatscher M, Clementi N, Holl L, Erlacher M. Initiation at AUGUG and GUGUG sequences can lead to translation of overlapping reading frames in E. coli. Nucleic Acids Res 2022; 51:271-289. [PMID: 36546769 PMCID: PMC9841429 DOI: 10.1093/nar/gkac1175] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 11/16/2022] [Accepted: 11/30/2022] [Indexed: 12/24/2022] Open
Abstract
During initiation, the ribosome is tasked to efficiently recognize open reading frames (ORFs) for accurate and fast translation of mRNAs. A critical step is start codon recognition, which is modulated by initiation factors, mRNA structure, a Shine Dalgarno (SD) sequence and the start codon itself. Within the Escherichia coli genome, we identified more than 50 annotated initiation sites harboring AUGUG or GUGUG sequence motifs that provide two canonical start codons, AUG and GUG, in immediate proximity. As these sites may challenge start codon recognition, we studied if and how the ribosome is accurately guided to the designated ORF, with a special focus on the SD sequence as well as adenine at the fourth coding sequence position (A4). By in vitro and in vivo experiments, we characterized key requirements for unambiguous start codon recognition, but also discovered initiation sites that lead to the translation of both overlapping reading frames. Our findings corroborate the existence of an ambiguous translation initiation mechanism, implicating a multitude of so far unrecognized ORFs and translation products in bacteria.
Collapse
Affiliation(s)
- Maximilian P Kohl
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Maria Kompatscher
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Nina Clementi
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Lena Holl
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Matthias D Erlacher
- To whom correspondence should be addressed. Tel: +43 512900370256; Fax: +43 512900373100;
| |
Collapse
|
4
|
Chen F, Cocaign-Bousquet M, Girbal L, Nouaille S. 5'UTR sequences influence protein levels in Escherichia coli by regulating translation initiation and mRNA stability. Front Microbiol 2022; 13:1088941. [PMID: 36620028 PMCID: PMC9810816 DOI: 10.3389/fmicb.2022.1088941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 12/02/2022] [Indexed: 12/24/2022] Open
Abstract
A set of 41 synthetic 5'UTRs with different theoretical translation initiation rates were generated to explore the role of 5'UTRs in the regulation of protein levels in Escherichia coli. The roles of the synthetic 5'UTRs in regulating the expression of different reporter genes were analyzed in vivo. Protein levels varied substantially between the different constructs but for most of the 5'UTRs, protein levels were not correlated with theoretical translation initiation rates. Large variations in mRNA concentrations were measured with the different 5'UTRs even though the same concentration of transcription inducer was used in each case. 5'UTRs were also found to strongly affect mRNA stability, and these changes in mRNA stability often contributed to observed differences in mRNA concentration. Unexpectedly, the effect of the 5'UTRs on mRNA half-lives was found to vary depending on the downstream reporter gene. These results clearly demonstrate that 5'UTRs contribute to gene expression regulation at the level of translation initiation and of mRNA stability, to an extent that depends on the nature of the downstream gene.
Collapse
|
5
|
Xi X, Sun W, Li H, Fan Q, Zhang X, Sun F. A comparative study and evaluation of anti-EGFR nanobodies expressed in Pichia pastoris and Escherichia coli as antitumor moieties. Protein Expr Purif 2021; 184:105888. [PMID: 33857600 DOI: 10.1016/j.pep.2021.105888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 02/23/2021] [Accepted: 04/07/2021] [Indexed: 11/17/2022]
Abstract
Anti-EGFR nanobodies have been successfully applied as antitumor moieties in the photodynamic therapy and drug delivery systems. But the yields of nanobodies were still limited due to the volumetric capacity of the periplasmic compartments and inclusion bodies of Escherichia coli. A comparative study of Pichia pastoris and Escherichia coli was done through characterizing their products. Nanobody 7D12 and 7D12-9G8 were successfully expressed in Pichia pastoris with 6-13.6-fold higher yield. Both two types of nanobodies had internalization ability to be developed as antitumor moieties.
Collapse
Affiliation(s)
- Xi Xi
- Department of Biopharmacy, School of Pharmaceutical Sciences, Jilin University, Changchun, 130021, Jilin, People's Republic of China
| | - Weihan Sun
- Department of Biopharmacy, School of Pharmaceutical Sciences, Jilin University, Changchun, 130021, Jilin, People's Republic of China
| | - Hongrui Li
- Institute of Frontier Medical Science, Jilin University, No. 1163 Xinmin Street, Changchun, 130021, Jilin, People's Republic of China
| | - Qingjie Fan
- Department of Biopharmacy, School of Pharmaceutical Sciences, Jilin University, Changchun, 130021, Jilin, People's Republic of China
| | - Xitian Zhang
- Changchun Intellicrown Pharmaceutical Co., Ltd, No. 1688 Jichang Road, Changchun, 130507, Jilin, People's Republic of China
| | - Fei Sun
- Institute of Frontier Medical Science, Jilin University, No. 1163 Xinmin Street, Changchun, 130021, Jilin, People's Republic of China.
| |
Collapse
|
6
|
De Santis M, Hahn J, Dubnau D. ComEB protein is dispensable for the transformation but must be translated for the optimal synthesis of comEC. Mol Microbiol 2021; 116:71-79. [PMID: 33527432 DOI: 10.1111/mmi.14690] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 01/26/2021] [Accepted: 01/27/2021] [Indexed: 01/05/2023]
Abstract
We show that the ComEB protein is not required for transformation in Bacillus subtilis, despite its expression from within the comE operon under competence control, nor is it required for the correct polar localization of ComGA. We show further that the synthesis of the putative channel protein ComEC is translationally coupled to the upstream comEB open reading frame, so that the translation of comEB and a suboptimal ribosomal-binding site embedded in its sequence are needed for proper comEC expression. Translational coupling appears to be a common mechanism in three major competence operons for the adjustment of protein amounts independent of transcriptional control, probably ensuring the correct stoichiometries for assembly of the transformation machinery. comEB and comFC, respectively, encode cytidine deaminase and a protein resembling type 1 phosphoribosyl transferases and we speculate that nucleotide scavenging proteins are produced under competence control for efficient reutilization of the products of degradation of the non-transforming strand during DNA uptake.
Collapse
Affiliation(s)
- Micaela De Santis
- Public Health Research Institute Center, New Jersey Medical School, Rutgers University, Newark, NJ, USA
| | - Jeanette Hahn
- Public Health Research Institute Center, New Jersey Medical School, Rutgers University, Newark, NJ, USA
| | - David Dubnau
- Public Health Research Institute Center, New Jersey Medical School, Rutgers University, Newark, NJ, USA
| |
Collapse
|
7
|
Ge Z, Li X, Cao X, Wang R, Hu W, Gen L, Han S, Shang Y, Liu Y, Zhou JH. Viral adaption of staphylococcal phage: A genome-based analysis of the selective preference based on codon usage Bias. Genomics 2020; 112:4657-4665. [PMID: 32818632 DOI: 10.1016/j.ygeno.2020.08.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 07/19/2020] [Accepted: 08/11/2020] [Indexed: 12/09/2022]
Abstract
Given the high therapeutic value of the staphylococcal phage, the genome co-evolution of the phage and the host has gained great attention. Though the genome-wide AT richness in staphylococcal phages has been well-studied with nucleotide usage bias, here we proved that host factor, lifestyle and taxonomy are also important factors in understanding the phage nucleotide usages bias using information entropy formula. Such correlation is especially prominent when it comes to the synonymous codon usages of staphylococcal phages, despite the overall scattered codon usage pattern represented by principal component analysis. This strong relationship is explained by nucleotide skew which testified that the usage biases of nucleotide at different codon positions are acting on synonymous codons. Therefore, our study reveals a hidden relationship of genome evolution with host limitation and phagic phenotype, providing new insight into phage genome evolution at genetic level.
Collapse
Affiliation(s)
- Zhiyi Ge
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Xuerui Li
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Xiaoan Cao
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Rui Wang
- Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089, United States of America
| | - Wen Hu
- Gansu Police Vocational College, Lanzhou 730046, Gansu, PR China
| | - Ling Gen
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Shengyi Han
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China; The College of Veterinary Medicine, Gansu Agricultural University, Lanzhou 730070, Gansu Province, PR China
| | - Youjun Shang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Yongsheng Liu
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Jian-Hua Zhou
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China.
| |
Collapse
|
8
|
Wei Y, Xia X. Unique Shine-Dalgarno Sequences in Cyanobacteria and Chloroplasts Reveal Evolutionary Differences in Their Translation Initiation. Genome Biol Evol 2020; 11:3194-3206. [PMID: 31621842 PMCID: PMC6847405 DOI: 10.1093/gbe/evz227] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/15/2019] [Indexed: 12/12/2022] Open
Abstract
Microorganisms require efficient translation to grow and replicate rapidly, and translation is often rate-limited by initiation. A prominent feature that facilitates translation initiation in bacteria is the Shine-Dalgarno (SD) sequence. However, there is much debate over its conservation in Cyanobacteria and in chloroplasts which presumably originated from endosymbiosis of ancient Cyanobacteria. Elucidating the utilization of SD sequences in Cyanobacteria and in chloroplasts is therefore important to understand whether 1) SD role in Cyanobacterial translation has been reduced prior to chloroplast endosymbiosis or 2) translation in Cyanobacteria and in plastid has been subjected to different evolutionary pressures. To test these alternatives, we employed genomic, proteomic, and transcriptomic data to trace differences in SD usage among Synechocystis species, Microcystis aeruginosa, cyanophages, Nicotiana tabacum chloroplast, and Arabidopsis thaliana chloroplast. We corrected their mis-annotated 16S rRNA 3' terminus using an RNA-Seq-based approach to determine their SD/anti-SD locational constraints using an improved measurement DtoStart. We found that cyanophages well-mimic Cyanobacteria in SD usage because both have been under the same selection pressure for SD-mediated initiation. Whereas chloroplasts lost this similarity because the need for SD-facilitated initiation has been reduced in plastids having much reduced genome size and different ribosomal proteins as a result of host-symbiont coevolution. Consequently, SD sequence significantly increases protein expression in Cyanobacteria but not in chloroplasts, and only Cyanobacterial genes compensate for a lack of SD sequence by having weaker secondary structures at the 5' UTR. Our results suggest different evolutionary pressures operate on translation initiation in Cyanobacteria and in chloroplast.
Collapse
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, Ontario, Canada
| | - Xuhua Xia
- Department of Biology, University of Ottawa, Ontario, Canada.,Ottawa Institute of Systems Biology, University of Ottawa, Ontario, Canada
| |
Collapse
|
9
|
Dissimilation of synonymous codon usage bias in virus-host coevolution due to translational selection. Nat Ecol Evol 2020; 4:589-600. [PMID: 32123323 DOI: 10.1038/s41559-020-1124-7] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2019] [Accepted: 01/21/2020] [Indexed: 12/13/2022]
Abstract
Eighteen of the 20 amino acids are each encoded by more than one synonymous codon. Due to differential transfer RNA supply within the cell, synonymous codons are not used with equal frequency, a phenomenon termed codon usage bias (CUB). Previous studies have demonstrated that CUB of endogenous genes trans-regulates the translational efficiency of other genes. We hypothesized similar effects for CUB of exogenous genes on host translation, and tested it in the case of viral infection, a common form of naturally occurring exogenous gene translation. We analysed public Ribo-Seq datasets from virus-infected yeast and human cells and showed that virus CUB trans-regulated tRNA availability, and therefore the relative decoding time of codons. Manipulative experiments in yeast using 37 synonymous fluorescent proteins confirmed that an exogenous gene with CUB more similar to that of the host would apply decreased translational load on the host per unit of expression, whereas expression of the exogenous gene was elevated. The combination of these two effects was that exogenous genes with CUB overly similar to that of the host severely impeded host translation. Finally, using a manually curated list of viruses and natural and symptomatic hosts, we found that virus CUB tended to be more similar to that of symptomatic hosts than that of natural hosts, supporting a general deleterious effect of excessive CUB similarity between virus and host. Our work revealed repulsion between virus and host CUBs when they are overly similar, a previously unrecognized complexity in the coevolution of virus and host.
Collapse
|
10
|
Weiner I, Shahar N, Marco P, Yacoby I, Tuller T. Solving the Riddle of the Evolution of Shine-Dalgarno Based Translation in Chloroplasts. Mol Biol Evol 2019; 36:2854-2860. [DOI: 10.1093/molbev/msz210] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
AbstractChloroplasts originated from an ancient cyanobacterium and still harbor a bacterial-like genome. However, the centrality of Shine–Dalgarno ribosome binding, which predominantly regulates proteobacterial translation initiation, is significantly decreased in chloroplasts. As plastid ribosomal RNA anti-Shine–Dalgarno elements are similar to their bacterial counterparts, these sites alone cannot explain this decline. By computational simulation we show that upstream point mutations modulate the local structure of ribosomal RNA in chloroplasts, creating significantly tighter structures around the anti-Shine–Dalgarno locus, which in-turn reduce the probability of ribosome binding. To validate our model, we expressed two reporter genes (mCherry, hydrogenase) harboring a Shine–Dalgarno motif in the Chlamydomonas reinhardtii chloroplast. Coexpressing them with a 16S ribosomal RNA, modified according to our model, significantly enhances mCherry and hydrogenase expression compared with coexpression with an endogenous 16S gene.
Collapse
Affiliation(s)
- Iddo Weiner
- Department of Biomedical Engineering, The Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv University, Tel Aviv, Israel
- School of Plant Sciences and Food Security, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Tel Aviv, Israel
| | - Noam Shahar
- School of Plant Sciences and Food Security, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Tel Aviv, Israel
| | - Pini Marco
- School of Plant Sciences and Food Security, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Tel Aviv, Israel
| | - Iftach Yacoby
- School of Plant Sciences and Food Security, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Tel Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, The Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
11
|
An improved estimation of tRNA expression to better elucidate the coevolution between tRNA abundance and codon usage in bacteria. Sci Rep 2019; 9:3184. [PMID: 30816249 PMCID: PMC6395768 DOI: 10.1038/s41598-019-39369-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 01/23/2019] [Indexed: 12/16/2022] Open
Abstract
The degree to which codon usage can be explained by tRNA abundance in bacterial species is often inadequate, partly because differential tRNA abundance is often approximated by tRNA copy numbers. To better understand the coevolution between tRNA abundance and codon usage, we provide a better estimate of tRNA abundance by profiling tRNA mapped reads (tRNA tpm) using publicly available RNA Sequencing data. To emphasize the feasibility of our approach, we demonstrate that tRNA tpm is consistent with tRNA abundances derived from RNA fingerprinting experiments in Escherichia coli, Bacillus subtilis, and Salmonella enterica. Furthermore, we do not observe an appreciable reduction in tRNA sequencing efficiency due to post-transcriptional methylations in the seven bacteria studied. To determine optimal codons, we calculate codon usage in highly and lowly expressed genes determined by protein per transcript. We found that tRNA tpm is sensitive to identify more translationally optimal codons than gene copy number and early tRNA fingerprinting abundances. Additionally, tRNA tpm improves the predictive power of tRNA adaptation index over codon preference. Our results suggest that dependence of codon usage on tRNA availability is not always associated with species growth-rate. Conversely, tRNA availability is better optimized to codon usage in fast-growing than slow-growing species.
Collapse
|
12
|
RNA-Seq-Based Analysis Reveals Heterogeneity in Mature 16S rRNA 3' Termini and Extended Anti-Shine-Dalgarno Motifs in Bacterial Species. G3-GENES GENOMES GENETICS 2018; 8:3973-3979. [PMID: 30355764 PMCID: PMC6288834 DOI: 10.1534/g3.118.200729] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
We present an RNA-Seq based approach to map 3′ end sequences of mature 16S rRNA (3′ TAIL) in bacteria with single-base specificity. Our results show that 3′ TAILs are heterogeneous among species; they contain the core CCUCC anti-Shine-Dalgarno motif, but vary in downstream lengths. Importantly, our findings rectify the mis-annotated 16S rRNAs in 11 out of 13 bacterial species studied herein (covering Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, Tenericutes, and Spirochaetes). Furthermore, our results show that species-specific 3′ TAIL boundaries are retained due to their high complementarity with preferred Shine-Dalgarno sequences, suggesting that 3′ TAIL bases downstream of the canonical CCUCC motif play a more important role in translation initiation than previously reported.
Collapse
|
13
|
Abstract
Codon usage depends on mutation bias, tRNA-mediated selection, and the need for high efficiency and accuracy in translation. One codon in a synonymous codon family is often strongly over-used, especially in highly expressed genes, which often leads to a high dN/dS ratio because dS is very small. Many different codon usage indices have been proposed to measure codon usage and codon adaptation. Sense codon could be misread by release factors and stop codons misread by tRNAs, which also contribute to codon usage in rare cases. This chapter outlines the conceptual framework on codon evolution, illustrates codon-specific and gene-specific codon usage indices, and presents their applications. A new index for codon adaptation that accounts for background mutation bias (Index of Translation Elongation) is presented and contrasted with codon adaptation index (CAI) which does not consider background mutation bias. They are used to re-analyze data from a recent paper claiming that translation elongation efficiency matters little in protein production. The reanalysis disproves the claim.
Collapse
|
14
|
Wei Y, Silke JR, Xia X. Elucidating the 16S rRNA 3' boundaries and defining optimal SD/aSD pairing in Escherichia coli and Bacillus subtilis using RNA-Seq data. Sci Rep 2017; 7:17639. [PMID: 29247194 PMCID: PMC5732282 DOI: 10.1038/s41598-017-17918-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 12/04/2017] [Indexed: 11/09/2022] Open
Abstract
Bacterial translation initiation is influenced by base pairing between the Shine-Dalgarno (SD) sequence in the 5' UTR of mRNA and the anti-SD (aSD) sequence at the free 3' end of the 16S rRNA (3' TAIL) due to: 1) the SD/aSD sequence binding location and 2) SD/aSD binding affinity. In order to understand what makes an SD/aSD interaction optimal, we must define: 1) terminus of the 3' TAIL and 2) extent of the core aSD sequence within the 3' TAIL. Our approach to characterize these components in Escherichia coli and Bacillus subtilis involves 1) mapping the 3' boundary of the mature 16S rRNA using high-throughput RNA sequencing (RNA-Seq), and 2) identifying the segment within the 3' TAIL that is strongly preferred in SD/aSD pairing. Using RNA-Seq data, we resolve previous discrepancies in the reported 3' TAIL in B. subtilis and recovered the established 3' TAIL in E. coli. Furthermore, we extend previous studies to suggest that both highly and lowly expressed genes favor SD sequences with intermediate binding affinity, but this trend is exclusive to SD sequences that complement the core aSD sequences defined herein.
Collapse
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, 30 Marie Curie, P.O. Box 450, Station A, Ottawa, Ontario, Canada
| | - Jordan R Silke
- Department of Biology, University of Ottawa, 30 Marie Curie, P.O. Box 450, Station A, Ottawa, Ontario, Canada
| | - Xuhua Xia
- Department of Biology, University of Ottawa, 30 Marie Curie, P.O. Box 450, Station A, Ottawa, Ontario, Canada. .,Ottawa Institute of Systems Biology, Ottawa, Ontario, K1H 8M5, Canada.
| |
Collapse
|
15
|
Xia X. DAMBE6: New Tools for Microbial Genomics, Phylogenetics, and Molecular Evolution. J Hered 2017; 108:431-437. [PMID: 28379490 PMCID: PMC5434544 DOI: 10.1093/jhered/esx033] [Citation(s) in RCA: 230] [Impact Index Per Article: 32.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Accepted: 04/01/2017] [Indexed: 12/21/2022] Open
Abstract
DAMBE is a comprehensive software workbench for data analysis in molecular biology, phylogenetics, and evolution. Several important new functions have been added since version 5 of DAMBE: 1) comprehensive genomic profiling of translation initiation efficiency of different genes in different prokaryotic species, 2) a new index of translation elongation (ITE) that takes into account both tRNA-mediated selection and background mutation on codon–anticodon adaptation, 3) a new and accurate phylogenetic approach based on pairwise alignment only, which is useful for highly divergent sequences from which a reliable multiple sequence alignment is difficult to obtain. Many other functions have been updated and improved including PWM for motif characterization, Gibbs sampler for de novo motif discovery, hidden Markov models for protein secondary structure prediction, self-organizing map for nonlinear clustering of transcriptomic data, comprehensive sequence alignment, and phylogenetic functions. DAMBE features a graphic, user-friendly and intuitive interface, and is freely available from http://dambe.bio.uottawa.ca.
Collapse
Affiliation(s)
- Xuhua Xia
- From the Department of Biology and Center for Advanced Research in Environmental Genomics, University of Ottawa, 30 Marie Curie, PO Box 450, Station A, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
16
|
How Changes in Anti-SD Sequences Would Affect SD Sequences in Escherichia coli and Bacillus subtilis. G3-GENES GENOMES GENETICS 2017; 7:1607-1615. [PMID: 28364038 PMCID: PMC5427494 DOI: 10.1534/g3.117.039305] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The 3' end of the small ribosomal RNAs (ssu rRNA) in bacteria is directly involved in the selection and binding of mRNA transcripts during translation initiation via well-documented interactions between a Shine-Dalgarno (SD) sequence located upstream of the initiation codon and an anti-SD (aSD) sequence at the 3' end of the ssu rRNA. Consequently, the 3' end of ssu rRNA (3'TAIL) is strongly conserved among bacterial species because a change in the region may impact the translation of many protein-coding genes. Escherichia coli and Bacillus subtilis differ in their 3' ends of ssu rRNA, being GAUCACCUCCUUA3' in E. coli and GAUCACCUCCUUUCU3' or GAUCACCUCCUUUCUA3' in B. subtilis Such differences in 3'TAIL lead to species-specific SDs (designated SDEc for E. coli and SDBs for B. subtilis) that can form strong and well-positioned SD/aSD pairing in one species but not in the other. Selection mediated by the species-specific 3'TAIL is expected to favor SDBs against SDEc in B. subtilis, but favor SDEc against SDBs in E. coli Among well-positioned SDs, SDEc is used more in E. coli than in B. subtilis, and SDBs more in B. subtilis than in E. coli Highly expressed genes and genes of high translation efficiency tend to have longer SDs than lowly expressed genes and genes with low translation efficiency in both species, but more so in B. subtilis than in E. coli Both species overuse SDs matching the bolded part of the 3'TAIL shown above. The 3'TAIL difference contributes to the host specificity of phages.
Collapse
|
17
|
Devi M, Lyngdoh RD. Favored and less favored codon–anticodon duplexes arising from the GC codon family box encoding for alanine: some computational perspectives. J Biomol Struct Dyn 2017; 36:1029-1049. [DOI: 10.1080/07391102.2017.1308886] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Martina Devi
- Department of Chemistry, North-Eastern Hill University, Shillong 793022, India
| | - R.H. Duncan Lyngdoh
- Department of Chemistry, North-Eastern Hill University, Shillong 793022, India
| |
Collapse
|
18
|
Abstract
Bioinformatic analysis can not only accelerate drug target identification and drug candidate screening and refinement, but also facilitate characterization of side effects and predict drug resistance. High-throughput data such as genomic, epigenetic, genome architecture, cistromic, transcriptomic, proteomic, and ribosome profiling data have all made significant contribution to mechanismbased drug discovery and drug repurposing. Accumulation of protein and RNA structures, as well as development of homology modeling and protein structure simulation, coupled with large structure databases of small molecules and metabolites, paved the way for more realistic protein-ligand docking experiments and more informative virtual screening. I present the conceptual framework that drives the collection of these high-throughput data, summarize the utility and potential of mining these data in drug discovery, outline a few inherent limitations in data and software mining these data, point out news ways to refine analysis of these diverse types of data, and highlight commonly used software and databases relevant to drug discovery.
Collapse
Affiliation(s)
- Xuhua Xia
- Department of Biology, Faculty of Science, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
- Ottawa Institute of Systems Biology, Ottawa K1H 8M5, Canada
| |
Collapse
|
19
|
Wei Y, Wang J, Xia X. Coevolution between Stop Codon Usage and Release Factors in Bacterial Species. Mol Biol Evol 2016; 33:2357-67. [PMID: 27297468 PMCID: PMC4989110 DOI: 10.1093/molbev/msw107] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Three stop codons in bacteria represent different translation termination signals, and their usage is expected to depend on their differences in translation termination efficiency, mutation bias, and relative abundance of release factors (RF1 decoding UAA and UAG, and RF2 decoding UAA and UGA). In 14 bacterial species (covering Proteobacteria, Firmicutes, Cyanobacteria, Actinobacteria and Spirochetes) with cellular RF1 and RF2 quantified, UAA is consistently over-represented in highly expressed genes (HEGs) relative to lowly expressed genes (LEGs), whereas UGA usage is the opposite even in species where RF2 is far more abundant than RF1. UGA usage relative to UAG increases significantly with PRF2 [=RF2/(RF1 + RF2)] as expected from adaptation between stop codons and their decoders. PRF2 is > 0.5 over a wide range of AT content (measured by PAT3 as the proportion of AT at third codon sites), but decreases rapidly toward zero at the high range of PAT3. This explains why bacterial lineages with high PAT3 often have UGA reassigned because of low RF2. There is no indication that UAG is a minor stop codon in bacteria as claimed in a recent publication. The claim is invalid because of the failure to apply the two key criteria in identifying a minor codon: (1) it is least preferred by HEGs (or most preferred by LEGs) and (2) it corresponds to the least abundant decoder. Our results suggest a more plausible explanation for why UAA usage increases, and UGA usage decreases, with PAT3, but UAG usage remains low over the entire PAT3 range.
Collapse
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
| | - Juan Wang
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
| | - Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON, Canada Ottawa Institute of Systems Biology, Ottawa, ON, Canada
| |
Collapse
|