1
|
Yu P, Zhou S, Gao Y, Liang Y, Guo W, Wang DO, Ding S, Lin S, Wang J, Cun Y. Dynamic Landscapes of tRNA Transcriptomes and Translatomes in Diverse Mouse Tissues. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:834-849. [PMID: 35952936 PMCID: PMC10787195 DOI: 10.1016/j.gpb.2022.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 07/02/2022] [Accepted: 07/26/2022] [Indexed: 06/15/2023]
Abstract
Although the function of tRNAs in the translational process is well established, it remains controversial whether tRNA abundance is tightly associated with translational efficiency (TE) in mammals. Moreover, how critically the expression of tRNAs contributes to the establishment of tissue-specific proteomes in mammals has not been well addressed. Here, we measured both tRNA expression using demethylase-tRNA sequencing (DM-tRNA-seq) and TE of mRNAs using ribosome-tagging sequencing (RiboTag-seq) in the brain, heart, and testis of mice. Remarkable variation in the expression of tRNA isodecoders was observed among different tissues. When the statistical effect of isodecoder-grouping on reducing variations is considered through permutating the anticodons, we observed an expected reduction in the variation of anticodon expression across all samples, an unexpected smaller variation of anticodon usage bias, and an unexpected larger variation of tRNA isotype expression at amino acid level. Regardless of whether or not they share the same anticodons, the isodecoders encoding the same amino acids are co-expressed across different tissues. Based on the expression of tRNAs and the TE of mRNAs, we find that the tRNA adaptation index (tAI) and TE are significantly correlated in the same tissues but not between tissues; and tRNA expression and the amino acid composition of translating peptides are positively correlated in the same tissues but not between tissues. We therefore hypothesize that the tissue-specific expression of tRNAs might be due to post-transcriptional mechanisms. This study provides a resource for tRNA and translation studies, as well as novel insights into the dynamics of tRNAs and their roles in translational regulation.
Collapse
Affiliation(s)
- Peng Yu
- Center for Translational Medicine, Precision Medicine Institute, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, China; Department of Radiation Oncology, Affiliated Cancer Hospital & Institute of Guangzhou Medical University, Guangzhou 510080, China
| | - Siting Zhou
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Center for Stem Cell Biology and Tissue Engineering, MOE Key Laboratory for Stem Cells and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Yan Gao
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Center for Stem Cell Biology and Tissue Engineering, MOE Key Laboratory for Stem Cells and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Yu Liang
- Center for Translational Medicine, Precision Medicine Institute, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, China
| | - Wenbing Guo
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Center for Stem Cell Biology and Tissue Engineering, MOE Key Laboratory for Stem Cells and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Dan Ohtan Wang
- RIKEN Center for Biosystems Dynamics Research, Kobe 650-0047, Japan; Graduate School of Biostudies, Kyoto University, Kyoto 606-8501, Japan; Wuya College of Innovation, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Shuaiwen Ding
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China
| | - Shuibin Lin
- Center for Translational Medicine, Precision Medicine Institute, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, China.
| | - Jinkai Wang
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Center for Stem Cell Biology and Tissue Engineering, MOE Key Laboratory for Stem Cells and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China; RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510080, China.
| | - Yixian Cun
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Center for Stem Cell Biology and Tissue Engineering, MOE Key Laboratory for Stem Cells and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China.
| |
Collapse
|
2
|
Benisty H, Hernandez-Alias X, Weber M, Anglada-Girotto M, Mantica F, Radusky L, Senger G, Calvet F, Weghorn D, Irimia M, Schaefer MH, Serrano L. Genes enriched in A/T-ending codons are co-regulated and conserved across mammals. Cell Syst 2023; 14:312-323.e3. [PMID: 36889307 DOI: 10.1016/j.cels.2023.02.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 07/11/2022] [Accepted: 02/09/2023] [Indexed: 03/09/2023]
Abstract
Codon usage influences gene expression distinctly depending on the cell context. Yet, the importance of codon bias in the simultaneous turnover of specific groups of protein-coding genes remains to be investigated. Here, we find that genes enriched in A/T-ending codons are expressed more coordinately in general and across tissues and development than those enriched in G/C-ending codons. tRNA abundance measurements indicate that this coordination is linked to the expression changes of tRNA isoacceptors reading A/T-ending codons. Genes with similar codon composition are more likely to be part of the same protein complex, especially for genes with A/T-ending codons. The codon preferences of genes with A/T-ending codons are conserved among mammals and other vertebrates. We suggest that this orchestration contributes to tissue-specific and ontogenetic-specific expression, which can facilitate, for instance, timely protein complex formation.
Collapse
Affiliation(s)
- Hannah Benisty
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain.
| | - Xavier Hernandez-Alias
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Marc Weber
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Miquel Anglada-Girotto
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Federica Mantica
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Leandro Radusky
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Gökçe Senger
- Department of Experimental Oncology, European Institute of Oncology (IEO) IRCCS, Via Adamello 16, Milan 20139, Italy
| | - Ferriol Calvet
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Donate Weghorn
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Manuel Irimia
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain
| | - Martin H Schaefer
- Department of Experimental Oncology, European Institute of Oncology (IEO) IRCCS, Via Adamello 16, Milan 20139, Italy
| | - Luis Serrano
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; Universitat Pompeu Fabra (UPF), Barcelona 08003, Spain; ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain.
| |
Collapse
|
3
|
Zhou F, Aroua N, Liu Y, Rohde C, Cheng J, Wirth AK, Fijalkowska D, Göllner S, Lotze M, Yun H, Yu X, Pabst C, Sauer T, Oellerich T, Serve H, Röllig C, Bornhäuser M, Thiede C, Baldus C, Frye M, Raffel S, Krijgsveld J, Jeremias I, Beckmann R, Trumpp A, Müller-Tidow C. A Dynamic rRNA Ribomethylome Drives Stemness in Acute Myeloid Leukemia. Cancer Discov 2023; 13:332-347. [PMID: 36259929 PMCID: PMC9900322 DOI: 10.1158/2159-8290.cd-22-0210] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 09/12/2022] [Accepted: 10/14/2022] [Indexed: 02/07/2023]
Abstract
The development and regulation of malignant self-renewal remain unresolved issues. Here, we provide biochemical, genetic, and functional evidence that dynamics in ribosomal RNA (rRNA) 2'-O-methylation regulate leukemia stem cell (LSC) activity in vivo. A comprehensive analysis of the rRNA 2'-O-methylation landscape of 94 patients with acute myeloid leukemia (AML) revealed dynamic 2'-O-methylation specifically at exterior sites of ribosomes. The rRNA 2'-O-methylation pattern is closely associated with AML development stage and LSC gene expression signature. Forced expression of the 2'-O-methyltransferase fibrillarin (FBL) induced an AML stem cell phenotype and enabled engraftment of non-LSC leukemia cells in NSG mice. Enhanced 2'-O-methylation redirected the ribosome translation program toward amino acid transporter mRNAs enriched in optimal codons and subsequently increased intracellular amino acid levels. Methylation at the single site 18S-guanosine 1447 was instrumental for LSC activity. Collectively, our work demonstrates that dynamic 2'-O-methylation at specific sites on rRNAs shifts translational preferences and controls AML LSC self-renewal. SIGNIFICANCE We establish the complete rRNA 2'-O-methylation landscape in human AML. Plasticity of rRNA 2'-O-methylation shifts protein translation toward an LSC phenotype. This dynamic process constitutes a novel concept of how cancers reprogram cell fate and function. This article is highlighted in the In This Issue feature, p. 247.
Collapse
Affiliation(s)
- Fengbiao Zhou
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| | - Nesrine Aroua
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), Heidelberg, Germany
| | - Yi Liu
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
| | - Christian Rohde
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
| | - Jingdong Cheng
- Gene Center, Department of Biochemistry, University of Munich, Munich, Germany
| | - Anna-Katharina Wirth
- Research Unit Apoptosis in Hematopoietic Stem Cells (AHS), Helmholtz Center Munich, German Center for Environmental Health, Munich, Germany
| | - Daria Fijalkowska
- Division of Proteomics of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefanie Göllner
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Michelle Lotze
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Haiyang Yun
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Xiaobing Yu
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Caroline Pabst
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Tim Sauer
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Thomas Oellerich
- Department of Medicine II, Hematology/Oncology, Goethe University, Frankfurt Am Main, Germany
| | - Hubert Serve
- Department of Medicine II, Hematology/Oncology, Goethe University, Frankfurt Am Main, Germany
| | - Christoph Röllig
- Medical Department 1, University Hospital Dresden, Dresden, Germany
| | | | - Christian Thiede
- Medical Department 1, University Hospital Dresden, Dresden, Germany
| | - Claudia Baldus
- Department of Medicine II, Hematology and Oncology, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Michaela Frye
- Division of Mechanisms Regulating Gene Expression, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Simon Raffel
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Jeroen Krijgsveld
- Division of Proteomics of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Irmela Jeremias
- Research Unit Apoptosis in Hematopoietic Stem Cells (AHS), Helmholtz Center Munich, German Center for Environmental Health, Munich, Germany
- German Cancer Consortium (DKTK), Partner Site Munich, Munich, Germany
| | - Roland Beckmann
- Gene Center, Department of Biochemistry, University of Munich, Munich, Germany
| | - Andreas Trumpp
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), Heidelberg, Germany
- National Center for Tumor Diseases, NCT Heidelberg, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| | - Carsten Müller-Tidow
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
- National Center for Tumor Diseases, NCT Heidelberg, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| |
Collapse
|
4
|
Panda A, Tuller T. Determinants of associations between codon and amino acid usage patterns of microbial communities and the environment inferred based on a cross-biome metagenomic analysis. NPJ Biofilms Microbiomes 2023; 9:5. [PMID: 36693851 PMCID: PMC9873608 DOI: 10.1038/s41522-023-00372-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 01/11/2023] [Indexed: 01/25/2023] Open
Abstract
Codon and amino acid usage were associated with almost every aspect of microbial life. However, how the environment may impact the codon and amino acid choice of microbial communities at the habitat level is not clearly understood. Therefore, in this study, we analyzed codon and amino acid usage patterns of a large number of environmental samples collected from diverse ecological niches. Our results suggested that samples derived from similar environmental niches, in general, show overall similar codon and amino acid distribution as compared to samples from other habitats. To substantiate the relative impact of the environment, we considered several factors, such as their similarity in GC content, or in functional or taxonomic abundance. Our analysis demonstrated that none of these factors can fully explain the trends that we observed at the codon or amino acid level implying a direct environmental influence on them. Further, our analysis demonstrated different levels of selection on codon bias in different microbial communities with the highest bias in host-associated environments such as the digestive system or oral samples and the lowest level of selection in soil and water samples. Considering a large number of metagenomic samples here we showed that microorganisms collected from similar environmental backgrounds exhibit similar patterns of codon and amino acid usage irrespective of the location or time from where the samples were collected. Thus our study suggested a direct impact of the environment on codon and amino usage of microorganisms that cannot be explained considering the influence of other factors.
Collapse
Affiliation(s)
- Arup Panda
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel.
| |
Collapse
|
5
|
Gillen SL, Waldron JA, Bushell M. Codon optimality in cancer. Oncogene 2021; 40:6309-6320. [PMID: 34584217 PMCID: PMC8585667 DOI: 10.1038/s41388-021-02022-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/24/2021] [Accepted: 09/10/2021] [Indexed: 12/14/2022]
Abstract
A key characteristic of cancer cells is their increased proliferative capacity, which requires elevated levels of protein synthesis. The process of protein synthesis involves the translation of codons within the mRNA coding sequence into a string of amino acids to form a polypeptide chain. As most amino acids are encoded by multiple codons, the nucleotide sequence of a coding region can vary dramatically without altering the polypeptide sequence of the encoded protein. Although mutations that do not alter the final amino acid sequence are often thought of as silent/synonymous, these can still have dramatic effects on protein output. Because each codon has a distinct translation elongation rate and can differentially impact mRNA stability, each codon has a different degree of 'optimality' for protein synthesis. Recent data demonstrates that the codon preference of a transcriptome matches the abundance of tRNAs within the cell and that this supply and demand between tRNAs and mRNAs varies between different cell types. The largest observed distinction is between mRNAs encoding proteins associated with proliferation or differentiation. Nevertheless, precisely how codon optimality and tRNA expression levels regulate cell fate decisions and their role in malignancy is not fully understood. This review describes the current mechanistic understanding on codon optimality, its role in malignancy and discusses the potential to target codon optimality therapeutically in the context of cancer.
Collapse
Affiliation(s)
- Sarah L Gillen
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
| | - Joseph A Waldron
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK
| | - Martin Bushell
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
- Institute of Cancer Sciences, University of Glasgow, Glasgow, UK, G61 1QH.
| |
Collapse
|
6
|
Analysis of synonymous codon usage bias in human monocytes, B, and T lymphocytes based on transcriptome data. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2021.101034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
7
|
Co-evolution between codon usage and protein-protein interaction in bacteria. Gene 2021; 778:145475. [PMID: 33549710 DOI: 10.1016/j.gene.2021.145475] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Revised: 12/20/2020] [Accepted: 01/28/2021] [Indexed: 11/23/2022]
Abstract
We study the correlation between the codon usage bias of genetic sequences and the network features of protein-protein interaction (PPI) in bacterial species. We use PCA techniques in the space of codon bias indices to show that genes with similar patterns of codon usage have a significantly higher probability that their encoded proteins are functionally connected and interacting. Importantly, this signal emerges when multiple aspects of codon bias are taken into account at the same time. The present study extends our previous observations on E. coli over a wide set of 34 bacteria. These findings could allow for future investigations on the possible effects of codon bias on the topology of the PPI network, with the aim of improving existing bioinformatics methods for predicting protein interactions.
Collapse
|
8
|
Hernandez‐Alias X, Benisty H, Schaefer MH, Serrano L. Translational efficiency across healthy and tumor tissues is proliferation-related. Mol Syst Biol 2020; 16:e9275. [PMID: 32149479 PMCID: PMC7061310 DOI: 10.15252/msb.20199275] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 02/11/2020] [Accepted: 02/11/2020] [Indexed: 12/20/2022] Open
Abstract
Different tissues express genes with particular codon usage and anticodon tRNA repertoires. However, the codon-anticodon co-adaptation in humans is not completely understood, nor is its effect on tissue-specific protein levels. Here, we first validated the accuracy of small RNA-seq for tRNA quantification across five human cell lines. We then analyzed the tRNA abundance of more than 8,000 tumor samples from TCGA, together with their paired mRNA-seq and proteomics data, to determine the Supply-to-Demand Adaptation. We thereby elucidate that the dynamic adaptation of the tRNA pool is largely related to the proliferative state across tissues. The distribution of such tRNA pools over the whole cellular translatome affects the subsequent translational efficiency, which functionally determines a condition-specific expression program both in healthy and tumor states. Furthermore, the aberrant translational efficiency of some codons in cancer, exemplified by ProCCA and GlyGGT, is associated with poor patient survival. The regulation of these tRNA profiles is partly explained by the tRNA gene copy numbers and their promoter DNA methylation.
Collapse
Affiliation(s)
- Xavier Hernandez‐Alias
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Hannah Benisty
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Martin H Schaefer
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Department of Experimental OncologyIEO, European Institute of Oncology IRCCSMilanItaly
| | - Luis Serrano
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- ICREABarcelonaSpain
| |
Collapse
|
9
|
Kames J, Alexaki A, Holcomb DD, Santana-Quintero LV, Athey JC, Hamasaki-Katagiri N, Katneni U, Golikov A, Ibla JC, Bar H, Kimchi-Sarfaty C. TissueCoCoPUTs: Novel Human Tissue-Specific Codon and Codon-Pair Usage Tables Based on Differential Tissue Gene Expression. J Mol Biol 2020; 432:3369-3378. [PMID: 31982380 DOI: 10.1016/j.jmb.2020.01.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 12/19/2019] [Accepted: 01/08/2020] [Indexed: 02/05/2023]
Abstract
Protein expression in multicellular organisms varies widely across tissues. Codon usage in the transcriptome of each tissue is derived from genomic codon usage and the relative expression level of each gene. We created a comprehensive computational resource that houses tissue-specific codon, codon-pair, and dinucleotide usage data for 51 Homo sapiens tissues (TissueCoCoPUTs: https://hive.biochemistry.gwu.edu/review/tissue_codon), using transcriptome data from the Broad Institute Genotype-Tissue Expression (GTEx) portal. Distances between tissue-specific codon and codon-pair frequencies were used to generate a dendrogram based on the unique patterns of codon and codon-pair usage in each tissue that are clearly distinct from the genomic distribution. This novel resource may be useful in unraveling the relationship between codon usage and tRNA abundance, which could be critical in determining translation kinetics and efficiency across tissues. Areas of investigation such as biotherapeutic development, tissue-specific genetic engineering, and genetic disease prediction will greatly benefit from this resource.
Collapse
Affiliation(s)
- Jacob Kames
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Aikaterini Alexaki
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - David D Holcomb
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Luis V Santana-Quintero
- High Performance Integrated Environment, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - John C Athey
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Nobuko Hamasaki-Katagiri
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Upendra Katneni
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Anton Golikov
- High Performance Integrated Environment, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA
| | - Juan C Ibla
- Department of Anesthesiology, Critical Care and Pain Medicine, Boston Children's Hospital and Harvard Medical School, Boston, MA, 02115, USA
| | - Haim Bar
- Department of Statistics, University of Connecticut, Storrs, CT, 06268, USA
| | - Chava Kimchi-Sarfaty
- Division of Plasma Protein Therapeutics, Office of Tissue and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, 20993, USA.
| |
Collapse
|
10
|
Bucca G, Pothi R, Hesketh A, Möller-Levet C, Hodgson DA, Laing EE, Stewart GR, Smith CP. Translational control plays an important role in the adaptive heat-shock response of Streptomyces coelicolor. Nucleic Acids Res 2019; 46:5692-5703. [PMID: 29746664 PMCID: PMC6009599 DOI: 10.1093/nar/gky335] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 04/18/2018] [Indexed: 12/29/2022] Open
Abstract
Stress-induced adaptations require multiple levels of regulation in all organisms to repair cellular damage. In the present study we evaluated the genome-wide transcriptional and translational changes following heat stress exposure in the soil-dwelling model actinomycete bacterium, Streptomyces coelicolor. The combined analysis revealed an unprecedented level of translational control of gene expression, deduced through polysome profiling, in addition to transcriptional changes. Our data show little correlation between the transcriptome and ‘translatome’; while an obvious downward trend in genome wide transcription was observed, polysome associated transcripts following heat-shock showed an opposite upward trend. A handful of key protein players, including the major molecular chaperones and proteases were highly induced at both the transcriptional and translational level following heat-shock, a phenomenon known as ‘potentiation’. Many other transcripts encoding cold-shock proteins, ABC-transporter systems, multiple transcription factors were more highly polysome-associated following heat stress; interestingly, these protein families were not induced at the transcriptional level and therefore were not previously identified as part of the stress response. Thus, stress coping mechanisms at the level of gene expression in this bacterium go well beyond the induction of a relatively small number of molecular chaperones and proteases in order to ensure cellular survival at non-physiological temperatures.
Collapse
Affiliation(s)
- Giselda Bucca
- School of Pharmacy and Biomolecular Sciences, University of Brighton, UK
| | - Radhika Pothi
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, UK
| | - Andrew Hesketh
- School of Pharmacy and Biomolecular Sciences, University of Brighton, UK
| | - Carla Möller-Levet
- Bioinformatics facility, Faculty of Health and Medical Sciences, University of Surrey, UK
| | | | - Emma E Laing
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, UK
| | - Graham R Stewart
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, UK
| | - Colin P Smith
- School of Pharmacy and Biomolecular Sciences, University of Brighton, UK
| |
Collapse
|
11
|
Codon Usage in Trypanosomatids: The Bias of Expression. Trends Parasitol 2018; 34:635-637. [PMID: 29910092 DOI: 10.1016/j.pt.2018.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 06/01/2018] [Indexed: 10/14/2022]
Abstract
Translation and RNA decay, two processes in which all mRNAs are engaged, are intimately related processes. Two new studies demonstrate that, in trypanosomatids, codon usage largely shapes mRNA abundance in a translation-dependent manner. The findings indicate that mRNA decay control by codon choice is an ancient and conserved mechanism.
Collapse
|
12
|
Hamed G, Marey M, Amin SES, Tolba MF. Hybrid, randomized and high capacity conservative mutations DNA-based steganography for large sized data. Biosystems 2018; 167:47-61. [PMID: 29608931 DOI: 10.1016/j.biosystems.2018.03.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Revised: 03/20/2018] [Accepted: 03/22/2018] [Indexed: 11/16/2022]
Abstract
In this paper, a well secured, high capacity, preserved algorithm is proposed through integrating the cryptography and steganography concepts with the molecular biology concepts. We achieved this by first encrypting the confidential data using the DNA Playfair cipher to avoid extra information sent to the receiver and it consequently acts as a trap for an attacker. Second, it achieves a randomized steganography process by exploiting the DNA conservative mutations. The DNA conservative mutations are utilized in a way that allows a DNA base to be substituted by another base to allow carrying two bits. Consequently, a high capacity feature is obtained with no payload for the used sequence. There are three main achieved contributions in this work. First, is hiding high capacity of data within DNA by exploiting each codon to hide two bits whilst preserving the sequence properties of protein after the steganography process, which is a trade off in the field. Secondly, using the conservative mutation with all its valid biological permutations, leads to the lowest cracking probability achieved and published till now, as proven in the security analysis section. Finally, a comparison is conducted between the proposed algorithm and five recent substitution based algorithms using large sized data up to three megabytes, to prove the algorithm's scalability.
Collapse
Affiliation(s)
- Ghada Hamed
- Department of Scientific Computing, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt.
| | - Mohammed Marey
- Department of Scientific Computing, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| | - Safaa El-Sayed Amin
- Department of Scientific Computing, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| | - Mohamed Fahmy Tolba
- Department of Scientific Computing, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| |
Collapse
|
13
|
Goodarzi H. Charting the "unknown unknowns" of cancer progression. Sci Transl Med 2018; 9:9/400/eaao0959. [PMID: 28747512 DOI: 10.1126/scitranslmed.aao0959] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 07/07/2017] [Indexed: 11/02/2022]
Abstract
Integrated computational and experimental strategies reveal previously unknown regulatory programs underlying metastatic disease.
Collapse
Affiliation(s)
- Hani Goodarzi
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA 94158, USA.,Department of Urology, University of California, San Francisco, San Francisco, CA 94158, USA.,Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
14
|
Hanson G, Coller J. Codon optimality, bias and usage in translation and mRNA decay. Nat Rev Mol Cell Biol 2017; 19:20-30. [PMID: 29018283 DOI: 10.1038/nrm.2017.91] [Citation(s) in RCA: 424] [Impact Index Per Article: 60.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The advent of ribosome profiling and other tools to probe mRNA translation has revealed that codon bias - the uneven use of synonymous codons in the transcriptome - serves as a secondary genetic code: a code that guides the efficiency of protein production, the fidelity of translation and the metabolism of mRNAs. Recent advancements in our understanding of mRNA decay have revealed a tight coupling between ribosome dynamics and the stability of mRNA transcripts; this coupling integrates codon bias into the concept of codon optimality, or the effects that specific codons and tRNA concentrations have on the efficiency and fidelity of the translation machinery. In this Review, we first discuss the evidence for codon-dependent effects on translation, beginning with the basic mechanisms through which translation perturbation can affect translation efficiency, protein folding and transcript stability. We then discuss how codon effects are leveraged by the cell to tailor the proteome to maintain homeostasis, execute specific gene expression programmes of growth or differentiation and optimize the efficiency of protein production.
Collapse
Affiliation(s)
- Gavin Hanson
- Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Jeff Coller
- Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA
| |
Collapse
|
15
|
Villada JC, Brustolini OJB, Batista da Silveira W. Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design. DNA Res 2017; 24:419-434. [PMID: 28449100 PMCID: PMC5737324 DOI: 10.1093/dnares/dsx014] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Revised: 03/22/2017] [Accepted: 03/23/2017] [Indexed: 01/21/2023] Open
Abstract
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation.
Collapse
Affiliation(s)
- Juan C. Villada
- Department of Microbiology, Universidade Federal de Viçosa, Viçosa 36570-900, Brazil
| | | | | |
Collapse
|
16
|
Wen Y, Zou Z, Li H, Xiang Z, He N. Analysis of codon usage patterns in Morus notabilis based on genome and transcriptome data. Genome 2017; 60:473-484. [PMID: 28177830 DOI: 10.1139/gen-2016-0129] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Codons play important roles in regulating gene expression levels and mRNA half-lives. However, codon usage and related studies in multicellular organisms still lag far behind those in unicellular organisms. In this study, we describe for the first time genome-wide patterns of codon bias in Morus notabilis (mulberry tree), and analyze genome-wide codon usage in 12 other species within the order Rosales. The codon usage of M. notabilis was affected by nucleotide composition, mutation pressure, nature selection, and gene expression level. Translational selection optimal codons were identified and highly expressed genes of M. notabilis tended to use the optimal codons. Genes with higher expression levels have shorter coding region and lower amino acid complexity. Housekeeping genes showed stronger translational selection, which, notably, was not caused by the large differences between the expression level of housekeeping genes and other genes.
Collapse
Affiliation(s)
- Yan Wen
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China
| | - Ziliang Zou
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China
| | - Hongshun Li
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China
| | - Zhonghuai Xiang
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China
| | - Ningjia He
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Tiansheng Road, Beibei, Chongqing 400715, China
| |
Collapse
|
17
|
Trotta E. Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage. BMC Genomics 2016; 17:366. [PMID: 27188984 PMCID: PMC4869280 DOI: 10.1186/s12864-016-2692-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 05/05/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. RESULTS In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. CONCLUSIONS In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
Collapse
Affiliation(s)
- Edoardo Trotta
- Institute of Translational Pharmacology, Consiglio Nazionale delle Ricerche (CNR), Rome, 00133, Italy.
| |
Collapse
|
18
|
Camiolo S, Melito S, Porceddu A. New insights into the interplay between codon bias determinants in plants. DNA Res 2015; 22:461-70. [PMID: 26546225 PMCID: PMC4675714 DOI: 10.1093/dnares/dsv027] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 10/01/2015] [Indexed: 12/28/2022] Open
Abstract
Codon bias is the non-random use of synonymous codons, a phenomenon that has been observed in species as diverse as bacteria, plants and mammals. The preferential use of particular synonymous codons may reflect neutral mechanisms (e.g. mutational bias, G|C-biased gene conversion, genetic drift) and/or selection for mRNA stability, translational efficiency and accuracy. The extent to which these different factors influence codon usage is unknown, so we dissected the contribution of mutational bias and selection towards codon bias in genes from 15 eudicots, 4 monocots and 2 mosses. We analysed the frequency of mononucleotides, dinucleotides and trinucleotides and investigated whether the compositional genomic background could account for the observed codon usage profiles. Neutral forces such as mutational pressure and G|C-biased gene conversion appeared to underlie most of the observed codon bias, although there was also evidence for the selection of optimal translational efficiency and mRNA folding. Our data confirmed the compositional differences between monocots and dicots, with the former featuring in general a lower background compositional bias but a higher overall codon bias.
Collapse
Affiliation(s)
- S Camiolo
- Dipartimento di Agraria, SACEG, Università degli Studi di Sassari, Sassari, Italy
| | - S Melito
- Dipartimento di Agraria, SACEG, Università degli Studi di Sassari, Sassari, Italy
| | - A Porceddu
- Dipartimento di Agraria, SACEG, Università degli Studi di Sassari, Sassari, Italy
| |
Collapse
|
19
|
Supek F. The Code of Silence: Widespread Associations Between Synonymous Codon Biases and Gene Function. J Mol Evol 2015; 82:65-73. [PMID: 26538122 DOI: 10.1007/s00239-015-9714-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 10/30/2015] [Indexed: 02/07/2023]
Abstract
Some mutations in gene coding regions exchange one synonymous codon for another, and thus do not alter the amino acid sequence of the encoded protein. Even though they are often called 'silent,' these mutations may exhibit a plethora of effects on the living cell. Therefore, they are often selected during evolution, causing synonymous codon usage biases in genomes. Comparative analyses of bacterial, archaeal, fungal, and human cancer genomes have found many links between a gene's biological role and the accrual of synonymous mutations during evolution. In particular, highly expressed genes in certain functional categories are enriched with optimal codons, which are decoded by the abundant tRNAs, thus enhancing the speed and accuracy of the translating ribosome. The set of genes exhibiting codon adaptation differs between genomes, and these differences show robust associations to organismal phenotypes. In addition to selection for translation efficiency, other distinct codon bias patterns have been found in: amino acid starvation genes, cyclically expressed genes, tissue-specific genes in animals and plants, oxidative stress response genes, cellular differentiation genes, and oncogenes. In addition, genomes of organisms harboring tRNA modifications exhibit particular codon preferences. The evolutionary trace of codon bias patterns across orthologous genes may be examined to learn about a gene's relevance to various phenotypes, or, more generally, its function in the cell.
Collapse
Affiliation(s)
- Fran Supek
- Division of electronics, Rudjer Boskovic Institute, 10000, Zagreb, Croatia.
- EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), 08003, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain.
| |
Collapse
|
20
|
Emamjomeh A, Goliaei B, Torkamani A, Ebrahimpour R, Mohammadi N, Parsian A. Protein-protein interaction prediction by combined analysis of genomic and conservation information. Genes Genet Syst 2015; 89:259-72. [PMID: 25948120 DOI: 10.1266/ggs.89.259] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Protein-protein interactions (PPIs) are highly important because of their main role in cellular processes and biochemical pathways; therefore, PPI can be very useful in the prediction of protein functions. Experimental techniques of PPI detection have certain drawbacks; hence computational methods can be used to complement wet lab techniques. Such methods can be applied to PPI prediction as well as validation of experimental results. Computational algorithms can lead to many false PPI predictions, which in turn result in non-adequate performance. We have developed a novel method based on combined analysis, entitled PPIccc. Three different descriptors for PPIccc included gene co-expression values, codon usage similarity and conservation of surface residues between protein products of a gene pair, which combined to predict PPI. Validation of results based on Human Protein Reference Database (HPRD) indicated improvement of performance in our proposed method. The results also revealed that conservation of surface residues between proteins in combination with codon usage similarity of their related genes increase the performance of PPI prediction. This means that codon usage similarity and surface residues between proteins (only sequence-based features) can predict PPIs as good as PPIccc.
Collapse
|
21
|
Gorochowski TE, Ignatova Z, Bovenberg RAL, Roubos JA. Trade-offs between tRNA abundance and mRNA secondary structure support smoothing of translation elongation rate. Nucleic Acids Res 2015; 43:3022-32. [PMID: 25765653 PMCID: PMC4381083 DOI: 10.1093/nar/gkv199] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 02/26/2015] [Indexed: 01/28/2023] Open
Abstract
Translation of protein from mRNA is a complex multi-step process that occurs at a non-uniform rate. Variability in ribosome speed along an mRNA enables refinement of the proteome and plays a critical role in protein biogenesis. Detailed single protein studies have found both tRNA abundance and mRNA secondary structure as key modulators of translation elongation rate, but recent genome-wide ribosome profiling experiments have not observed significant influence of either on translation efficiency. Here we provide evidence that this results from an inherent trade-off between these factors. We find codons pairing to high-abundance tRNAs are preferentially used in regions of high secondary structure content, while codons read by significantly less abundant tRNAs are located in lowly structured regions. By considering long stretches of high and low mRNA secondary structure in Saccharomyces cerevisiae and Escherichia coli and comparing them to randomized-gene models and experimental expression data, we were able to distinguish clear selective pressures and increased protein expression for specific codon choices. The trade-off between secondary structure and tRNA-concentration based codon choice allows for compensation of their independent effects on translation, helping to smooth overall translational speed and reducing the chance of potentially detrimental points of excessively slow or fast ribosome movement.
Collapse
Affiliation(s)
| | - Zoya Ignatova
- Department of Biochemistry, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam-Golm, Germany Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, 20146 Hamburg, Germany
| | | | - Johannes A Roubos
- DSM Biotechnology Center, P.O. Box 1, 2600 MA Delft, The Netherlands
| |
Collapse
|
22
|
Foroughmand-Araabi MH, Goliaei B, Alishahi K, Sadeghi M, Goliaei S. Codon usage and protein sequence pattern dependency in different organisms: A Bioinformatics approach. J Bioinform Comput Biol 2014; 13:1550002. [PMID: 25409941 DOI: 10.1142/s021972001550002x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Although it is known that synonymous codons are not chosen randomly, the role of the codon usage in gene regulation is not clearly understood, yet. Researchers have investigated the relation between the codon usage and various properties, such as gene regulation, translation rate, translation efficiency, mRNA stability, splicing, and protein domains. Recently, a universal codon usage based mechanism for gene regulation is proposed. We studied the role of protein sequence patterns on the codons usage by related genes. Considering a subsequence of a protein that matches to a pattern or motif, we showed that, parts of the genes, which are translated to this subsequence, use specific ratios of synonymous codons. Also, we built a multinomial logistic regression statistical model for codon usage, which considers the effect of patterns on codon usage. This model justifies the observed codon usage preference better than the classic organism dependent codon usage. Our results showed that the codon usage plays a role in controlling protein levels, for genes that participate in a specific biological function. This is the first time that this phenomenon is reported.
Collapse
|
23
|
Zahiri J, Mohammad-Noori M, Ebrahimpour R, Saadat S, Bozorgmehr JH, Goldberg T, Masoudi-Nejad A. LocFuse: human protein-protein interaction prediction via classifier fusion using protein localization information. Genomics 2014; 104:496-503. [PMID: 25458812 DOI: 10.1016/j.ygeno.2014.10.006] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2014] [Revised: 09/28/2014] [Accepted: 10/02/2014] [Indexed: 12/20/2022]
Abstract
UNLABELLED Protein-protein interaction (PPI) detection is one of the central goals of functional genomics and systems biology. Knowledge about the nature of PPIs can help fill the widening gap between sequence information and functional annotations. Although experimental methods have produced valuable PPI data, they also suffer from significant limitations. Computational PPI prediction methods have attracted tremendous attentions. Despite considerable efforts, PPI prediction is still in its infancy in complex multicellular organisms such as humans. Here, we propose a novel ensemble learning method, LocFuse, which is useful in human PPI prediction. This method uses eight different genomic and proteomic features along with four types of different classifiers. The prediction performance of this classifier selection method was found to be considerably better than methods employed hitherto. This confirms the complex nature of the PPI prediction problem and also the necessity of using biological information for classifier fusion. The LocFuse is available at: http://lbb.ut.ac.ir/Download/LBBsoft/LocFuse. BIOLOGICAL SIGNIFICANCE The results revealed that if we divide proteome space according to the cellular localization of proteins, then the utility of some classifiers in PPI prediction can be improved. Therefore, to predict the interaction for any given protein pair, we can select the most accurate classifier with regard to the cellular localization information. Based on the results, we can say that the importance of different features for PPI prediction varies between differently localized proteins; however in general, our novel features, which were extracted from position-specific scoring matrices (PSSMs), are the most important ones and the Random Forest (RF) classifier performs best in most cases. LocFuse was developed with a user-friendly graphic interface and it is freely available for Linux, Mac OSX and MS Windows operating systems.
Collapse
Affiliation(s)
- Javad Zahiri
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran; Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
| | - Morteza Mohammad-Noori
- School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran
| | - Reza Ebrahimpour
- Brain and Intelligent Systems Research Lab, Department of Electrical and Computer Engineering, Shahid Rajaee Teacher Training University, Tehran, Iran
| | - Samaneh Saadat
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Joseph H Bozorgmehr
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Tatyana Goldberg
- Department for Bioinformatics and Computational Biology, Faculty of Informatics, TUM, Garching 85748, Germany
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| |
Collapse
|
24
|
Refining the balance of attenuation and immunogenicity of respiratory syncytial virus by targeted codon deoptimization of virulence genes. mBio 2014; 5:e01704-14. [PMID: 25249281 PMCID: PMC4173764 DOI: 10.1128/mbio.01704-14] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Respiratory syncytial virus (RSV) is the most important pathogen for lower respiratory tract illness in children for which there is no licensed vaccine. Live-attenuated RSV vaccines are the most clinically advanced in children, but achieving an optimal balance of attenuation and immunogenicity is challenging. One way to potentially retain or enhance immunogenicity of attenuated virus is to mutate virulence genes that suppress host immune responses. The NS1 and NS2 virulence genes of the RSV A2 strain were codon deoptimized according to either human or virus codon usage bias, and the resulting recombinant viruses (dNSh and dNSv, respectively) were rescued by reverse genetics. RSV dNSh exhibited the desired phenotype of reduced NS1 and NS2 expression. RSV dNSh was attenuated in BEAS-2B and primary differentiated airway epithelial cells but not in HEp-2 or Vero cells. In BALB/c mice, RSV dNSh exhibited a lower viral load than did A2, and yet it induced slightly higher levels of RSV-neutralizing antibodies than did A2. RSV A2 and RSV dNSh induced equivalent protection against challenge strains A/1997/12-35 and A2-line19F. RSV dNSh caused less STAT2 degradation and less NF-κB activation than did A2 in vitro. Serial passage of RSV dNSh in BEAS-2B cells did not result in mutations in the deoptimized sequences. Taken together, RSV dNSh was moderately attenuated, more immunogenic, and equally protective compared to wild-type RSV and genetically stable. Respiratory syncytial virus (RSV) is the leading cause of infant viral death in the United States and worldwide, and no vaccine is available. Live-attenuated RSV vaccines are the most studied in children but have suffered from genetic instability and low immunogenicity. In order to address both obstacles, we selectively changed the codon usage of the RSV nonstructural (NS) virulence genes NS1 and NS2 to the least-used codons in the human genome (deoptimization). Compared to parental RSV, the codon-deoptimized NS1/NS2 RSV was attenuated in vitro and in mice but induced higher levels of neutralizing antibodies and equivalent protection against challenge. We identified a new attenuating module that retains immunogenicity and is genetically stable, achieved through specific targeting of nonessential virulence genes by codon usage deoptimization.
Collapse
|
25
|
Ma L, Cui P, Zhu J, Zhang Z, Zhang Z. Translational selection in human: more pronounced in housekeeping genes. Biol Direct 2014; 9:17. [PMID: 25011537 PMCID: PMC4100034 DOI: 10.1186/1745-6150-9-17] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 07/02/2014] [Indexed: 02/17/2023] Open
Abstract
BACKGROUND Translational selection is a ubiquitous and significant mechanism to regulate protein expression in prokaryotes and unicellular eukaryotes. Recent evidence has shown that translational selection is weakly operative in highly expressed genes in human and other vertebrates. However, it remains unclear whether translational selection acts differentially on human genes depending on their expression patterns. RESULTS Here we report that human housekeeping (HK) genes that are strictly defined as genes that are expressed ubiquitously and consistently in most or all tissues, are under stronger translational selection. CONCLUSIONS These observations clearly show that translational selection is also closely associated with expression pattern. Our results suggest that human HK genes are more efficiently and/or accurately translated into proteins, which will inevitably open up a new understanding of HK genes and the regulation of gene expression. REVIEWERS This article was reviewed by Yuan Yuan, Baylor College of Medicine; Han Liang, University of Texas MD Anderson Cancer Center (nominated by Dr Laura Landweber) Eugene Koonin, NCBI, NLM, NIH, United States of America Sandor Pongor, International Centre for Genetic Engineering and biotechnology (ICGEB), Italy.
Collapse
Affiliation(s)
| | | | | | | | - Zhang Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, No,1 Beichen West Road, Chaoyang District, Beijing 100101, China.
| |
Collapse
|
26
|
Bicluster pattern of codon context usages between flavivirus and vector mosquito Aedes aegypti: relevance to infection and transcriptional response of mosquito genes. Mol Genet Genomics 2014; 289:885-94. [PMID: 24838953 DOI: 10.1007/s00438-014-0857-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2014] [Accepted: 04/12/2014] [Indexed: 01/18/2023]
Abstract
The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias in usages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis-driven tests to examine the role of codon context bias in evolution of vector-virus interactions at the molecular level.
Collapse
|
27
|
A comparative analysis of synonymous codon usage bias pattern in human albumin superfamily. ScientificWorldJournal 2014; 2014:639682. [PMID: 24707212 PMCID: PMC3951064 DOI: 10.1155/2014/639682] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Accepted: 01/11/2014] [Indexed: 11/29/2022] Open
Abstract
Synonymous codon usage bias is an inevitable phenomenon in organismic taxa across the three domains of life. Though the frequency of codon usage is not equal across species and within genome in the same species, the phenomenon is non random and is tissue-specific. Several factors such as GC content, nucleotide distribution, protein hydropathy, protein secondary structure, and translational selection are reported to contribute to codon usage preference. The synonymous codon usage patterns can be helpful in revealing the expression pattern of genes as well as the evolutionary relationship between the sequences. In this study, synonymous codon usage bias patterns were determined for the evolutionarily close proteins of albumin superfamily, namely, albumin, α-fetoprotein, afamin, and vitamin D-binding protein. Our study demonstrated that the genes of the four albumin superfamily members have low GC content and high values of effective number of codons (ENC) suggesting high expressivity of these genes and less bias in codon usage preferences. This study also provided evidence that the albumin superfamily members are not subjected to mutational selection pressure.
Collapse
|
28
|
Foroughmand-Araabi MH, Goliaei B, Alishahi K, Sadeghi M. Dependency of codon usage on protein sequence patterns: a statistical study. Theor Biol Med Model 2014; 11:2. [PMID: 24410898 PMCID: PMC3896713 DOI: 10.1186/1742-4682-11-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2013] [Accepted: 01/03/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon degeneracy and codon usage by organisms is an interesting and challenging problem. Researchers demonstrated the relation between codon usage and various functions or properties of genes and proteins, such as gene regulation, translation rate, translation efficiency, mRNA stability, splicing, and protein domains. Researchers usually represent segments of proteins responsible for specific functions or structures in a family of proteins as sequence patterns or motifs. We asked the question if organisms use the same codons in pattern segments as compared to the rest of the sequence. METHODS We used the likelihood ratio test, Pearson's chi-squared test, and mutual information to compare these two codon usages. RESULTS We showed that codon usage, in segments of genes that code for a given pattern or motif in a group of proteins, varied from the rest of the gene. The codon usage in these segments was not random. Amino acids with larger number of codons used more specific codon ratios in these segments. We studied the number of amino acids in the pattern (pattern length). As patterns got longer, there was a slight decrease in the fraction of patterns with significant different codon usage in the pattern region as compared to codon usage in the gene region. We defined a measure of specificity of protein patterns, and studied its relation to the codon usage. The difference in the codon usage between pattern region and gene region, was less for the patterns with higher specificity. CONCLUSIONS We provided a hypothesis that there are segments on genes that affect the codon usage and thus influence protein translation speed, and these regions are the regions that code protein pattern regions.
Collapse
Affiliation(s)
| | - Bahram Goliaei
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| | | | | |
Collapse
|
29
|
Obayashi T, Okamura Y, Ito S, Tadaka S, Aoki Y, Shirota M, Kinoshita K. ATTED-II in 2014: evaluation of gene coexpression in agriculturally important plants. PLANT & CELL PHYSIOLOGY 2014; 55:e6. [PMID: 24334350 PMCID: PMC3894708 DOI: 10.1093/pcp/pct178] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
ATTED-II (http://atted.jp) is a database of coexpressed genes that was originally developed to identify functionally related genes in Arabidopsis and rice. Herein, we describe an updated version of ATTED-II, which expands this resource to include additional agriculturally important plants. To improve the quality of the coexpression data for Arabidopsis and rice, we included more gene expression data from microarray and RNA sequencing studies. The RNA sequencing-based coexpression data now cover 94% of the Arabidopsis protein-encoding genes, representing a substantial increase from previously available microarray-based coexpression data (76% coverage). We also generated coexpression data for four dicots (soybean, poplar, grape and alfalfa) and one monocot (maize). As both the quantity and quality of expression data for the non-model species are generally poorer than for the model species, we verified coexpression data associated with these new species using multiple methods. First, the overall performance of the coexpression data was evaluated using gene ontology annotations and the coincidence of a genomic feature. Secondly, the reliability of each guide gene was determined by comparing coexpressed gene lists between platforms. With the expanded and newly evaluated coexpression data, ATTED-II represents an important resource for identifying functionally related genes in agriculturally important plants.
Collapse
Affiliation(s)
- Takeshi Obayashi
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
- *Corresponding author: E-mail, ; Fax, +81-22-795-7179
| | - Yasunobu Okamura
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Satoshi Ito
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Shu Tadaka
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Yuichi Aoki
- Graduate School of Engineering, Tohoku University, 6-6-04, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8579 Japan
| | - Matsuyuki Shirota
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Kengo Kinoshita
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
- Institute of Development, Aging, and Cancer, Tohoku University, Sendai, 980-8575 Japan
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, 980-8573 Japan
| |
Collapse
|
30
|
Wohlgemuth SE, Gorochowski TE, Roubos JA. Translational sensitivity of the Escherichia coli genome to fluctuating tRNA availability. Nucleic Acids Res 2013; 41:8021-33. [PMID: 23842674 PMCID: PMC3783181 DOI: 10.1093/nar/gkt602] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
The synthesis of protein from messenger RNA during translation is a highly dynamic process that plays a key role in controlling the efficiency and fidelity of genome-wide protein expression. The availability of aminoacylated transfer RNA (tRNA) is a major factor influencing the speed of ribosomal movement, which depending on codon choices, varies considerably along a transcript. Furthermore, it has been shown experimentally that tRNA availability can vary significantly under different growth and stress conditions, offering the cell a way to adapt translational dynamics across the genome. Existing models of translation have neglected fluctuations of tRNA pools, instead assuming fixed tRNA availabilities over time. This has lead to an incomplete understanding of this process. Here, we show for the entire Escherichia coli genome how and to what extent translational speed profiles, which capture local aspects of translational elongation, respond to measured shifts in tRNA availability. We find that translational profiles across the genome are affected to differing degrees, with genes that are essential or related to fundamental processes such as translation, being more robust than those linked to regulation. Furthermore, we reveal how fluctuating tRNA availability influences profiles of specific sequences known to play a significant role in translational control of gene expression.
Collapse
|
31
|
Shabalina SA, Spiridonov NA, Kashina A. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity. Nucleic Acids Res 2013; 41:2073-94. [PMID: 23293005 PMCID: PMC3575835 DOI: 10.1093/nar/gks1205] [Citation(s) in RCA: 187] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions.
Collapse
Affiliation(s)
- Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20984, USA.
| | | | | |
Collapse
|
32
|
Li M, Zhao Z, Chen J, Wang B, Li Z, Li J, Cai M. Characterization of synonymous codon usage bias in the pseudorabies virus US1 gene. Virol Sin 2012; 27:303-15. [PMID: 23055006 DOI: 10.1007/s12250-012-3270-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2012] [Accepted: 09/12/2012] [Indexed: 12/11/2022] Open
Abstract
In the present study, we examined the codon usage bias between pseudorabies virus (PRV) US1 gene and the US1-like genes of 20 reference alphaherpesviruses. Comparative analysis showed noticeable disparities of the synonymous codon usage bias in the 21 alphaherpesviruses, indicated by codon adaptation index, effective number of codons (ENc) and GC3s value. The codon usage pattern of PRV US1 gene was phylogenetically conserved and similar to that of the US1-like genes of the genus Varicellovirus of alphaherpesvirus, with a strong bias towards the codons with C and G at the third codon position. Cluster analysis of codon usage pattern of PRV US1 gene with its reference alphaherpesviruses demonstrated that the codon usage bias of US1-like genes of 21 alphaherpesviruses had a very close relation with their gene functions. ENc-plot revealed that the genetic heterogeneity in PRV US1 gene and the 20 reference alphaherpesviruses was constrained by G+C content, as well as the gene length. In addition, comparison of codon preferences in the US1 gene of PRV with those of E. coli, yeast and human revealed that there were 50 codons showing distinct usage differences between PRV and yeast, 49 between PRV and human, but 48 between PRV and E. coli. Although there were slightly fewer differences in codon usages between E.coli and PRV, the difference is unlikely to be statistically significant, and experimental studies are necessary to establish the most suitable expression system for PRV US1. In conclusion, these results may improve our understanding of the evolution, pathogenesis and functional studies of PRV, as well as contributing to the area of herpesvirus research or even studies with other viruses.
Collapse
Affiliation(s)
- Meili Li
- Department of Pathogenic Biology and Immunology, Guangzhou Medical University, Guangzhou 510182, China
| | | | | | | | | | | | | |
Collapse
|
33
|
β-tubulin paralogue tubC is frequently misidentified as the benA gene in Aspergillus section Nigri taxonomy: primer specificity testing and taxonomic consequences. Persoonia - Molecular Phylogeny and Evolution of Fungi 2012; 29:1-10. [PMID: 23606761 PMCID: PMC3589786 DOI: 10.3767/003158512x658123] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2012] [Accepted: 08/06/2012] [Indexed: 11/25/2022]
Abstract
β-tubulin (benA, tub-2) and calmodulin (caM) are crucial genes in the taxonomy of Aspergillus section Nigri. Widely used β-tubulin primers are not specific for the benA gene for some taxa and preferentially amplify the tubC paralogue. Sequences of the tubC paralogue are widely combined with benA sequences in recent taxonomical works as well as other works, resulting in incongruent trees. In this study we newly provide benA sequences for several ex-type strains, which were characterised using the tubC gene only. We designed a highly specific forward primer to benA designated Ben2f for use in Aspergillus section Nigri, and tested specificity of numerous primer combinations to β-tubulin paralogs. The primer pairs with the highest specificity to the benA gene and functional across species in section Nigri includes Ben2f/Bt2b, Ben2f/T22 and T10/T22. We also provide tools based on codon usage bias analysis that reliably distinguish both paralogues. Exon/intron arrangement is the next distinctive characteristic, although this tool is not valid outside section Nigri. The species identity of taxa from the A. aculeatus clade used in previous molecular studies was revised using combined molecular data (ITS, benA, caM). These data together with two different PCR-fingerprinting methods indicated that A. japonicus should be treated as a synonym of A. violaceofuscus. Similarly, A. fijiensis is reduced to synonymy with A. brunneoviolaceus.
Collapse
|
34
|
Fasoli M, Dal Santo S, Zenoni S, Tornielli GB, Farina L, Zamboni A, Porceddu A, Venturini L, Bicego M, Murino V, Ferrarini A, Delledonne M, Pezzotti M. The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program. THE PLANT CELL 2012; 24:3489-505. [PMID: 22948079 PMCID: PMC3480284 DOI: 10.1105/tpc.112.100230] [Citation(s) in RCA: 147] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Revised: 07/06/2012] [Accepted: 08/20/2012] [Indexed: 05/18/2023]
Abstract
We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.
Collapse
Affiliation(s)
- Marianna Fasoli
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Silvia Dal Santo
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Sara Zenoni
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | | | - Lorenzo Farina
- Dipartimento di Informatica e Sistemistica Antonio Ruberti, Università degli Studi di Roma La Sapienza, 00185 Rome, Italy
| | - Anita Zamboni
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Andrea Porceddu
- Dipartimento di Scienze Agronomiche e Genetica Vegetale Agraria, Università degli Studi di Sassari, 07100 Sassari, Italy
| | - Luca Venturini
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Manuele Bicego
- Dipartimento di Informatica, Università degli Studi di Verona, 37134 Verona, Italy
| | | | - Alberto Ferrarini
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Massimo Delledonne
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Mario Pezzotti
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
- Address correspondence to
| |
Collapse
|
35
|
Abstract
The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression.
Collapse
|
36
|
Bioinformatic study of the relationship between protein regulation and sequence properties. Genomics 2012; 100:240-4. [PMID: 22800766 DOI: 10.1016/j.ygeno.2012.07.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2012] [Revised: 05/21/2012] [Accepted: 07/03/2012] [Indexed: 11/21/2022]
Abstract
Although protein expression and regulation have been intensively studied, a complete picture of its mechanisms is still to be drawn. Analysis of high-throughput quantitative proteomics data provides a way to better understand protein regulation. Here, we introduce a bioinformatic analysis method to correlate protein regulation with individual amino acid patterns. We compare the amino acid composition between groups of regulated and unregulated proteins and investigate the correlation between codon usage patterns and protein regulation levels in two Sulfolobus species in "biofilm vs planktonic" experiments. The identified amino acids can then be associated with the regulation of specific gene functions. Strikingly, our analysis shows that functional categories of regulated proteins with similar composition and codon usage pattern of specific amino acids behave similarly. This finding can contribute to a better understanding of protein and gene expression regulation and could find applications in gene optimisation.
Collapse
|
37
|
Zhou Y, Zhou YS, He F, Song J, Zhang Z. Can simple codon pair usage predict protein-protein interaction? MOLECULAR BIOSYSTEMS 2012; 8:1396-404. [PMID: 22392100 DOI: 10.1039/c2mb05427b] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Deciphering functional interactions between proteins is one of the great challenges in biology. Sequence-based homology-free encoding schemes have been increasingly applied to develop promising protein-protein interaction (PPI) predictors by means of statistical or machine learning methods. Here we analyze the relationship between codon pair usage and PPIs in yeast. We show that codon pair usage of interacting protein pairs differs significantly from randomly expected. This motivates the development of a novel approach for predicting PPIs, with codon pair frequency difference as input to a Support Vector Machine predictor, termed as CCPPI. 10-fold cross-validation tests based on yeast PPI datasets with balanced positive-to-negative ratios indicate that CCPPI performs better than other sequence-based encoding schemes. Moreover, it ranks the best when tested on an unbalanced large-scale dataset. Although CCPPI is subjected to high false positive rates like many PPI predictors, statistical analyses of the predicted true positives confirm that the success of CCPPI is partly ascribed to its capability to capture proteomic co-expression and functional similarities between interacting protein pairs. Our findings suggest that codon pairs of interacting protein pairs evolve in a coordinated manner and consequently they provide additional information beyond amino acids-based encoding schemes. CCPPI has been made freely available at: http://protein.cau.edu.cn/ccppi.
Collapse
Affiliation(s)
- Yuan Zhou
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | | | | | | | | |
Collapse
|
38
|
Olivares-Hernández R, Bordel S, Nielsen J. Codon usage variability determines the correlation between proteome and transcriptome fold changes. BMC SYSTEMS BIOLOGY 2011; 5:33. [PMID: 21352515 PMCID: PMC3058016 DOI: 10.1186/1752-0509-5-33] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Accepted: 02/25/2011] [Indexed: 12/02/2022]
Abstract
Background The availability of high throughput experimental methods has made possible to observe the relationships between proteome and transcirptome. The protein abundances show a positive but weak correlation with the concentrations of their cognate mRNAs. This weak correlation implies that there are other crucial effects involved in the regulation of protein translation, different from the sole availability of mRNA. It is well known that ribosome and tRNA concentrations are sources of variation in protein levels. Thus, by using integrated analysis of omics data, genomic information, transcriptome and proteome, we aim to unravel important variables affecting translation. Results We identified how much of the variability in the correlation between protein and mRNA concentrations can be attributed to the gene codon frequencies. We propose the hypothesis that the influence of codon frequency is due to the competition of cognate and near-cognate tRNA binding; which in turn is a function of the tRNA concentrations. Transcriptome and proteome data were combined in two analytical steps; first, we used Self-Organizing Maps (SOM) to identify similarities among genes, based on their codon frequencies, grouping them into different clusters; and second, we calculated the variance in the protein mRNA correlation in the sampled genes from each cluster. This procedure is justified within a mathematical framework. Conclusions With the proposed method we observed that in all the six studied cases most of the variability in the relation protein-transcript could be explained by the variation in codon composition.
Collapse
Affiliation(s)
- Roberto Olivares-Hernández
- Systems Biology, Department of Chemical and Biological Engineering, Chalmers University of Technology, Kemivägen 10, Gothenburg, Sweden
| | | | | |
Collapse
|
39
|
Olivares-Hernández R, Usaite R, Nielsen J. Integrative analysis using proteome and transcriptome data from yeast to unravel regulatory patterns at post-transcriptional level. Biotechnol Bioeng 2010; 107:865-75. [DOI: 10.1002/bit.22868] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
40
|
Czech A, Fedyunin I, Zhang G, Ignatova Z. Silent mutations in sight: co-variations in tRNA abundance as a key to unravel consequences of silent mutations. MOLECULAR BIOSYSTEMS 2010; 6:1767-72. [PMID: 20617253 DOI: 10.1039/c004796c] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Mutations that alter the amino acid sequence are known to potentially exert deleterious effects on protein function, whereas substitutions of nucleotides without amino acid change are assumed to be neutral for the protein's functionality. However, cumulative evidence suggests that synonymous substitutions might also induce phenotypic variability by affecting splicing accuracy, translation fidelity, and conformation and function of proteins. tRNA isoacceptors mediate the translation of codons to amino acids, and asymmetric tRNA abundance causes variations in the rate of translation of each single triplet. Consequently, the effect of a silent point mutation in the coding region could be significant due to differential abundances of the cognate tRNA(s), emphasizing the importance of precise assessment of tRNA composition. Here, we provide an overview of the methods used to quantitatively determine the concentrations of tRNA species and discuss synonymous mutations in the context of tRNA composition of the cell, thus providing a new twist on the detrimental impact of the silent mutations.
Collapse
Affiliation(s)
- Andreas Czech
- Biochemistry, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str 24-25, 14467 Potsdam-Golm, Germany
| | | | | | | |
Collapse
|
41
|
Saunders R, Deane CM. Synonymous codon usage influences the local protein structure observed. Nucleic Acids Res 2010; 38:6719-28. [PMID: 20530529 PMCID: PMC2965230 DOI: 10.1093/nar/gkq495] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Translation of mRNA into protein is a unidirectional information flow process. Analysing the input (mRNA) and output (protein) of translation, we find that local protein structure information is encoded in the mRNA nucleotide sequence. The Coding Sequence and Structure (CSandS) database developed in this work provides a detailed mapping between over 4000 solved protein structures and their mRNA. CSandS facilitates a comprehensive analysis of codon usage over many organisms. In assigning translation speed, we find that relative codon usage is less informative than tRNA concentration. For all speed measures, no evidence was found that domain boundaries are enriched with slow codons. In fact, genes seemingly avoid slow codons around structurally defined domain boundaries. Translation speed, however, does decrease at the transition into secondary structure. Codons are identified that have structural preferences significantly different from the amino acid they encode. However, each organism has its own set of ‘significant codons’. Our results support the premise that codons encode more information than merely amino acids and give insight into the role of translation in protein folding.
Collapse
Affiliation(s)
- Rhodri Saunders
- Department of Statistics, Oxford University, 1 South Parks Road, Oxford OX1 3TG, UK.
| | | |
Collapse
|
42
|
Salavati R, Najafabadi HS. Sequence-based functional annotation: what if most of the genes are unique to a genome? Trends Parasitol 2010; 26:225-9. [DOI: 10.1016/j.pt.2010.02.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2009] [Revised: 12/08/2009] [Accepted: 02/04/2010] [Indexed: 11/30/2022]
|