1
|
Siddika MA, Ahmed KA, Alam MS, Bushra J, Begum RA. Complete mitogenome and intra-family comparative mitogenomics showed distinct position of Pama Croaker Otolithoides pama. Sci Rep 2024; 14:13820. [PMID: 38879694 PMCID: PMC11180200 DOI: 10.1038/s41598-024-64791-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 06/13/2024] [Indexed: 06/19/2024] Open
Abstract
The Pama Croaker, Otolithoides pama, is an economically important fish species in Bangladesh. Intra-family similarities in morphology and typical barcode sequences of cox1 create ambiguities in its identification. Therefore, morphology and the complete mitochondrial genome of O. pama, and comparative mitogenomics within the family Sciaenidae have been studied. Extracted genomic DNA was subjected to Illumina-based short read sequencing for De-Novo mitogenome assembly. The complete mitogenome of O. pama (Accession: OQ784575.1) was 16,513 bp, with strong AC biasness and strand asymmetry. Relative synonymous codon usage (RSCU) among 13 protein-coding genes (PCGs) of O. pama was also analyzed. The studied mitogenomes including O. pama exhibited consistent sizes and gene orders, except for the genus Johnius which possessed notably longer mitogenomes with unique gene rearrangements. Different genetic distance metrics across 30 species of Sciaenidae family demonstrated 12S rRNA and the control region (CR) as the most conserved and variable regions, respectively, while most of the PCGs undergone a purifying selection. Different phylogenetic trees were congruent with one another, where O. pama was distinctly placed. This study would contribute to distinguishing closely related fish species of Sciaenidae family and can be instrumental in conserving the genetic diversity of O. pama.
Collapse
Affiliation(s)
- Most Ayesha Siddika
- Genetics and Molecular Biology Laboratory, Department of Zoology, University of Dhaka, Dhaka, 1000, Bangladesh
| | | | - Mohammad Shamimul Alam
- Genetics and Molecular Biology Laboratory, Department of Zoology, University of Dhaka, Dhaka, 1000, Bangladesh.
| | - Jannatul Bushra
- Genetics and Molecular Biology Laboratory, Department of Zoology, University of Dhaka, Dhaka, 1000, Bangladesh
| | - Rowshan Ara Begum
- Genetics and Molecular Biology Laboratory, Department of Zoology, University of Dhaka, Dhaka, 1000, Bangladesh
| |
Collapse
|
2
|
Ke Z, Zhou K, Hou M, Luo H, Li Z, Pan X, Zhou J, Jing T, Ye H. Characterization of the Complete Mitochondrial Genome of the Elongate Loach and Its Phylogenetic Implications in Cobitidae. Animals (Basel) 2023; 13:3841. [PMID: 38136877 PMCID: PMC10740543 DOI: 10.3390/ani13243841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 12/01/2023] [Accepted: 12/02/2023] [Indexed: 12/24/2023] Open
Abstract
The elongate loach is an endemic fish in China. Previous studies have provided some insights into the mitochondrial genome composition and the phylogenetic relationships of the elongate loach inferred using protein-coding genes (PCGs), yet detailed information about it remains limited. Therefore, in this study we sequenced the complete mitochondrial genome of the elongate loach and analyzed its structural characteristics. The PCGs and mitochondrial genome were used for selective stress analysis and genomic comparative analysis. The complete mitochondrial genome of the elongate loach, together with those of 35 Cyprinidae species, was used to infer the phylogenetic relationships of the Cobitidae family through maximum likelihood (ML) reconstruction. The results showed that the genome sequence has a full length of 16,591 bp, which includes 13 PCGs, 22 transfer RNA genes (tRNA), 2 ribosomal RNA genes (rRNA), and 2 non-coding regions (CR D-loop and light chain sub-chain replication origin OL). Overall, the elongate loach shared the same gene arrangement and composition of the mitochondrial genes with other teleost fishes. The Ka/Ks ratios of all mitochondrial PCGs were less than 1, indicating that all of the PCGs were evolving under purifying selection. Genome comparison analyses showed a significant sequence homology of species of Leptobotia. A significant identity between L. elongata and the other five Leptobotia species was observed in the visualization result, except for L. mantschurica, which lacked the tRNA-Arg gene and had a shorter tRNA-Asp gene. The phylogenetic tree revealed that the Cobitidae species examined here can be grouped into two clades, with the elongate loach forming a sister relationship with L. microphthalma. This study could provide additional inferences for a better understanding of the phylogenetic relationships among Cobitidae species.
Collapse
Affiliation(s)
- Zhenlin Ke
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Key Laboratory of Aquatic Science of Chongqing, College of Fisheries, Southwest University, Chongqing 402460, China; (Z.K.); (M.H.); (H.L.); (T.J.)
- Key Laboratory of Aquatic Science of Chongqing, Chongqing 400175, China
| | - Kangqi Zhou
- Guangxi Key Laboratory of Aquatic Genetic Breeding and Healthy Aquaculture, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (K.Z.); (Z.L.); (X.P.)
| | - Mengdan Hou
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Key Laboratory of Aquatic Science of Chongqing, College of Fisheries, Southwest University, Chongqing 402460, China; (Z.K.); (M.H.); (H.L.); (T.J.)
- Key Laboratory of Aquatic Science of Chongqing, Chongqing 400175, China
| | - Hui Luo
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Key Laboratory of Aquatic Science of Chongqing, College of Fisheries, Southwest University, Chongqing 402460, China; (Z.K.); (M.H.); (H.L.); (T.J.)
- Key Laboratory of Aquatic Science of Chongqing, Chongqing 400175, China
| | - Zhe Li
- Guangxi Key Laboratory of Aquatic Genetic Breeding and Healthy Aquaculture, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (K.Z.); (Z.L.); (X.P.)
| | - Xianhui Pan
- Guangxi Key Laboratory of Aquatic Genetic Breeding and Healthy Aquaculture, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (K.Z.); (Z.L.); (X.P.)
| | - Jian Zhou
- Fisheries Institute, Sichuan Academy of Agricultural Sciences, Chengdu 611731, China
| | - Tingsen Jing
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Key Laboratory of Aquatic Science of Chongqing, College of Fisheries, Southwest University, Chongqing 402460, China; (Z.K.); (M.H.); (H.L.); (T.J.)
- Key Laboratory of Aquatic Science of Chongqing, Chongqing 400175, China
| | - Hua Ye
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Key Laboratory of Aquatic Science of Chongqing, College of Fisheries, Southwest University, Chongqing 402460, China; (Z.K.); (M.H.); (H.L.); (T.J.)
- Key Laboratory of Aquatic Science of Chongqing, Chongqing 400175, China
| |
Collapse
|
3
|
Zhou F, Aroua N, Liu Y, Rohde C, Cheng J, Wirth AK, Fijalkowska D, Göllner S, Lotze M, Yun H, Yu X, Pabst C, Sauer T, Oellerich T, Serve H, Röllig C, Bornhäuser M, Thiede C, Baldus C, Frye M, Raffel S, Krijgsveld J, Jeremias I, Beckmann R, Trumpp A, Müller-Tidow C. A Dynamic rRNA Ribomethylome Drives Stemness in Acute Myeloid Leukemia. Cancer Discov 2023; 13:332-347. [PMID: 36259929 PMCID: PMC9900322 DOI: 10.1158/2159-8290.cd-22-0210] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 09/12/2022] [Accepted: 10/14/2022] [Indexed: 02/07/2023]
Abstract
The development and regulation of malignant self-renewal remain unresolved issues. Here, we provide biochemical, genetic, and functional evidence that dynamics in ribosomal RNA (rRNA) 2'-O-methylation regulate leukemia stem cell (LSC) activity in vivo. A comprehensive analysis of the rRNA 2'-O-methylation landscape of 94 patients with acute myeloid leukemia (AML) revealed dynamic 2'-O-methylation specifically at exterior sites of ribosomes. The rRNA 2'-O-methylation pattern is closely associated with AML development stage and LSC gene expression signature. Forced expression of the 2'-O-methyltransferase fibrillarin (FBL) induced an AML stem cell phenotype and enabled engraftment of non-LSC leukemia cells in NSG mice. Enhanced 2'-O-methylation redirected the ribosome translation program toward amino acid transporter mRNAs enriched in optimal codons and subsequently increased intracellular amino acid levels. Methylation at the single site 18S-guanosine 1447 was instrumental for LSC activity. Collectively, our work demonstrates that dynamic 2'-O-methylation at specific sites on rRNAs shifts translational preferences and controls AML LSC self-renewal. SIGNIFICANCE We establish the complete rRNA 2'-O-methylation landscape in human AML. Plasticity of rRNA 2'-O-methylation shifts protein translation toward an LSC phenotype. This dynamic process constitutes a novel concept of how cancers reprogram cell fate and function. This article is highlighted in the In This Issue feature, p. 247.
Collapse
Affiliation(s)
- Fengbiao Zhou
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| | - Nesrine Aroua
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), Heidelberg, Germany
| | - Yi Liu
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
| | - Christian Rohde
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
| | - Jingdong Cheng
- Gene Center, Department of Biochemistry, University of Munich, Munich, Germany
| | - Anna-Katharina Wirth
- Research Unit Apoptosis in Hematopoietic Stem Cells (AHS), Helmholtz Center Munich, German Center for Environmental Health, Munich, Germany
| | - Daria Fijalkowska
- Division of Proteomics of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefanie Göllner
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Michelle Lotze
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Haiyang Yun
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Xiaobing Yu
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Caroline Pabst
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Tim Sauer
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Thomas Oellerich
- Department of Medicine II, Hematology/Oncology, Goethe University, Frankfurt Am Main, Germany
| | - Hubert Serve
- Department of Medicine II, Hematology/Oncology, Goethe University, Frankfurt Am Main, Germany
| | - Christoph Röllig
- Medical Department 1, University Hospital Dresden, Dresden, Germany
| | | | - Christian Thiede
- Medical Department 1, University Hospital Dresden, Dresden, Germany
| | - Claudia Baldus
- Department of Medicine II, Hematology and Oncology, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Michaela Frye
- Division of Mechanisms Regulating Gene Expression, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Simon Raffel
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
| | - Jeroen Krijgsveld
- Division of Proteomics of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Irmela Jeremias
- Research Unit Apoptosis in Hematopoietic Stem Cells (AHS), Helmholtz Center Munich, German Center for Environmental Health, Munich, Germany
- German Cancer Consortium (DKTK), Partner Site Munich, Munich, Germany
| | - Roland Beckmann
- Gene Center, Department of Biochemistry, University of Munich, Munich, Germany
| | - Andreas Trumpp
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), Heidelberg, Germany
- National Center for Tumor Diseases, NCT Heidelberg, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| | - Carsten Müller-Tidow
- Department of Internal Medicine V, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Medicine Partnership Unit EMBL-UKHD, Heidelberg, Germany
- National Center for Tumor Diseases, NCT Heidelberg, Heidelberg, Germany
- Corresponding Authors: Carsten Müller-Tidow, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-2215-68000; E-mail: ; Fengbiao Zhou, Department of Internal Medicine V, Heidelberg University Hospital, 69120 Heidelberg, Germany. Phone: 4906-221-563-7487; E-mail: ; and Andreas Trumpp, Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany. Phone: 4906-2214-23901; E-mail:
| |
Collapse
|
4
|
Panda A, Tuller T. Determinants of associations between codon and amino acid usage patterns of microbial communities and the environment inferred based on a cross-biome metagenomic analysis. NPJ Biofilms Microbiomes 2023; 9:5. [PMID: 36693851 PMCID: PMC9873608 DOI: 10.1038/s41522-023-00372-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 01/11/2023] [Indexed: 01/25/2023] Open
Abstract
Codon and amino acid usage were associated with almost every aspect of microbial life. However, how the environment may impact the codon and amino acid choice of microbial communities at the habitat level is not clearly understood. Therefore, in this study, we analyzed codon and amino acid usage patterns of a large number of environmental samples collected from diverse ecological niches. Our results suggested that samples derived from similar environmental niches, in general, show overall similar codon and amino acid distribution as compared to samples from other habitats. To substantiate the relative impact of the environment, we considered several factors, such as their similarity in GC content, or in functional or taxonomic abundance. Our analysis demonstrated that none of these factors can fully explain the trends that we observed at the codon or amino acid level implying a direct environmental influence on them. Further, our analysis demonstrated different levels of selection on codon bias in different microbial communities with the highest bias in host-associated environments such as the digestive system or oral samples and the lowest level of selection in soil and water samples. Considering a large number of metagenomic samples here we showed that microorganisms collected from similar environmental backgrounds exhibit similar patterns of codon and amino acid usage irrespective of the location or time from where the samples were collected. Thus our study suggested a direct impact of the environment on codon and amino usage of microorganisms that cannot be explained considering the influence of other factors.
Collapse
Affiliation(s)
- Arup Panda
- grid.12136.370000 0004 1937 0546Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978 Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel.
| |
Collapse
|
5
|
Characterization of mitochondrial genome of Indian Ocean blue-spotted maskray, Neotrygon indica and its phylogenetic relationship within Dasyatidae Family. Int J Biol Macromol 2022; 223:458-467. [PMID: 36347369 DOI: 10.1016/j.ijbiomac.2022.10.277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 10/11/2022] [Accepted: 10/28/2022] [Indexed: 11/06/2022]
Abstract
The present study characterized complete mitochondrial genome of Blue-spotted maskray, Neotrygon indica and studied the evolutionary relationship of the species within the Dasyatidae family. The total length of the mitogenome was 17,974 bp including 37 genes and a non-coding control region. The average frequency of nucleotides in protein-coding genes was A: 29.1 %, T: 30.2 %, G: 13.0 % and C: 27.7 % with AT content of 59.3 %. The values of AT and GC skewness were -0.018 and -0.338, respectively. Comparative analyses showed a large number of average synonymous substitutions per synonymous site (Ks) in gene NADH4 (5.07) followed by NADH5 (4.72). High values of average number of non-synonymous substitutions per non-synonymous site (Ka) were observed in genes ATPase8 (0.54) and NADH2 (0.44). Genes NADH4L and NADH2 showed high interspecific genetic distance values of 0.224 ± 0.001 and 0.213 ± 0.002, respectively. Heat map analysis showed variation in codon usage among different species of the Dasyatidae family. The phylogenetic tree showed a sister relationship between the Dasyatinae and the Neotrygoninae subfamilies. Neotrygon indica formed as a sister species to the clade consisting of N. varidens and N. orientalis. Based on the present results, Neotrygon indica could have diverged from the common ancestor of the two latter in the Plio-Pleistocene. The present study showed distinct characteristics of N. indica from its congeners through comparative mitogenomics.
Collapse
|
6
|
Shibai A, Kotani H, Sakata N, Furusawa C, Tsuru S. Purifying selection enduringly acts on the sequence evolution of highly expressed proteins in Escherichia coli. G3 GENES|GENOMES|GENETICS 2022; 12:6694045. [PMID: 36073932 PMCID: PMC9635659 DOI: 10.1093/g3journal/jkac235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 08/27/2022] [Indexed: 11/17/2022]
Abstract
The evolutionary speed of a protein sequence is constrained by its expression level, with highly expressed proteins evolving relatively slowly. This negative correlation between expression levels and evolutionary rates (known as the E–R anticorrelation) has already been widely observed in past macroevolution between species from bacteria to animals. However, it remains unclear whether this seemingly general law also governs recent evolution, including past and de novo, within a species. However, the advent of genomic sequencing and high-throughput phenotyping, particularly for bacteria, has revealed fundamental gaps between the 2 evolutionary processes and has provided empirical data opposing the possible underlying mechanisms which are widely believed. These conflicts raise questions about the generalization of the E–R anticorrelation and the relevance of plausible mechanisms. To explore the ubiquitous impact of expression levels on molecular evolution and test the relevance of the possible underlying mechanisms, we analyzed the genome sequences of 99 strains of Escherichia coli for evolution within species in nature. We also analyzed genomic mutations accumulated under laboratory conditions as a model of de novo evolution within species. Here, we show that E–R anticorrelation is significant in both past and de novo evolution within species in E. coli. Our data also confirmed ongoing purifying selection on highly expressed genes. Ongoing selection included codon-level purifying selection, supporting the relevance of the underlying mechanisms. However, the impact of codon-level purifying selection on the constraints in evolution within species might be smaller than previously expected from evolution between species.
Collapse
Affiliation(s)
- Atsushi Shibai
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Hazuki Kotani
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Natsue Sakata
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Chikara Furusawa
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| | - Saburo Tsuru
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| |
Collapse
|
7
|
Ahmed W, Gupta S, Singh D, Singh R. Insight of genetic features prevalent in three Echinoderm species (Apostichopus japonicus, Heliocedaris erythrogramma and Asterias rubens) and their evolutionary association using comparative codon pattern analysis. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
8
|
Variables Influencing Differences in Sequence Conservation in the Fission Yeast Schizosaccharomyces pombe. J Mol Evol 2021; 89:601-610. [PMID: 34436628 PMCID: PMC8599406 DOI: 10.1007/s00239-021-10028-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Accepted: 08/17/2021] [Indexed: 11/17/2022]
Abstract
Which variables determine the constraints on gene sequence evolution is one of the most central questions in molecular evolution. In the fission yeast Schizosaccharomyces pombe, an important model organism, the variables influencing the rate of sequence evolution have yet to be determined. Previous studies in other single celled organisms have generally found gene expression levels to be most significant, with numerous other variables such as gene length and functional importance identified as having a smaller impact. Using publicly available data, we used partial least squares regression, principal components regression, and partial correlations to determine the variables most strongly associated with sequence evolution constraints. We identify centrality in the protein–protein interactions network, amino acid composition, and cellular location as the most important determinants of sequence conservation. However, each factor only explains a small amount of variance, and there are numerous variables having a significant or heterogeneous influence. Our models explain more than half of the variance in dN, raising the possibility that future refined models could quantify the role of stochastics in evolutionary rate variation.
Collapse
|
9
|
Santoni D. The impact of codon choice on translation process in Saccharomyces cerevisiae: folding class, protein function and secondary structure. J Theor Biol 2021; 526:110806. [PMID: 34111456 DOI: 10.1016/j.jtbi.2021.110806] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 05/26/2021] [Accepted: 06/03/2021] [Indexed: 11/28/2022]
Abstract
The genetic code consists in a set of rules used by living organisms to translate genomic information, contained in genes, into proteins; every amino acid is coded by a set of nucleotide triplets or codons. We refer to codon choice as the choice of a given codon, among the synonymous available ones, to code a given amino acid occurrence. The aim of this work is to shed light on the pivotal role that codon choice plays in regulating the timing of translation process, through patterns of low and high translation efficiency codons. A translation efficiency value, namely codon score, was associated to each codon through a formula based on the number of tRNAs gene copies able to translate the given codon. By using codon scores, those k-mers of the proteome of Saccharomyces cerevisiae, showing low and high average scores associated to the correspondent codons, were computed. The analysis of distribution of both low and high average score k-mers clearly showed that, in particular for higher k-mer size, they occur much more than expected, strongly suggesting a functional role. Moreover performed analysis highlighted that significant k-mers preferentially occur in some protein folding classes, such as those containing alpha helices, and in some functional classes mainly involved in transcription process while codon choice seems to have a very low impact in proteins associated to energy production and metabolism. The relationship between secondary structures and significant k-mers was investigated, revealing that low score k-mers tend to preferentially occur in coil or close to coil regions and almost never in beta sheets, while high score k-mers preferentially occur in alpha helices, avoiding beta sheets, and close to coil regions for high k-mer sizes. Finally the analysis of distribution of significant codon patterns along the proteins highlighted a relevant enrichment of low average score k-mers at the 5' end of protein-coding sequences in the region from 5th to 25th amino acid.
Collapse
Affiliation(s)
- Daniele Santoni
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council of Italy, Via dei Taurini 19, Rome 00185, Italy.
| |
Collapse
|
10
|
The Adenine/Thymine Deleterious Selection Model for GC Content Evolution at the Third Codon Position of the Histone Genes in Drosophila. Genes (Basel) 2021; 12:genes12050721. [PMID: 34065869 PMCID: PMC8150595 DOI: 10.3390/genes12050721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 05/07/2021] [Accepted: 05/07/2021] [Indexed: 12/02/2022] Open
Abstract
The evolution of the GC (guanine cytosine) content at the third codon position of the histone genes (H1, H2A, H2B, H3, H4, H2AvD, H3.3A, H3.3B, and H4r) in 12 or more Drosophila species is reviewed. For explaining the evolution of the GC content at the third codon position of the genes, a model assuming selection with a deleterious effect for adenine/thymine and a size effect is presented. The applicability of the model to whole-genome genes is also discussed.
Collapse
|
11
|
Dubreuil B, Levy ED. Abundance Imparts Evolutionary Constraints of Similar Magnitude on the Buried, Surface, and Disordered Regions of Proteins. Front Mol Biosci 2021; 8:626729. [PMID: 33996892 PMCID: PMC8119896 DOI: 10.3389/fmolb.2021.626729] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
An understanding of the forces shaping protein conservation is key, both for the fundamental knowledge it represents and to allow for optimal use of evolutionary information in practical applications. Sequence conservation is typically examined at one of two levels. The first is a residue-level, where intra-protein differences are analyzed and the second is a protein-level, where inter-protein differences are studied. At a residue level, we know that solvent-accessibility is a prime determinant of conservation. By inverting this logic, we inferred that disordered regions are slightly more solvent-accessible on average than the most exposed surface residues in domains. By integrating abundance information with evolutionary data within and across proteins, we confirmed a previously reported strong surface-core association in the evolution of structured regions, but we found a comparatively weak association between disordered and structured regions. The facts that disordered and structured regions experience different structural constraints and evolve independently provide a unique setup to examine an outstanding question: why is a protein’s abundance the main determinant of its sequence conservation? Indeed, any structural or biophysical property linked to the abundance-conservation relationship should increase the relative conservation of regions concerned with that property (e.g., disordered residues with mis-interactions, domain residues with misfolding). Surprisingly, however, we found the conservation of disordered and structured regions to increase in equal proportion with abundance. This observation implies that either abundance-related constraints are structure-independent, or multiple constraints apply to different regions and perfectly balance each other.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
12
|
Nowak K, Błażej P, Wnetrzak M, Mackiewicz D, Mackiewicz P. Some theoretical aspects of reprogramming the standard genetic code. Genetics 2021; 218:6169163. [PMID: 33711098 PMCID: PMC8128387 DOI: 10.1093/genetics/iyab040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 02/11/2021] [Indexed: 11/12/2022] Open
Abstract
Reprogramming of the standard genetic code to include non-canonical amino acids (ncAAs) opens new prospects for medicine, industry, and biotechnology. There are several methods of code engineering, which allow us for storing new genetic information in DNA sequences and producing proteins with new properties. Here, we provided a theoretical background for the optimal genetic code expansion, which may find application in the experimental design of the genetic code. We assumed that the expanded genetic code includes both canonical and non-canonical information stored in 64 classical codons. What is more, the new coding system is robust to point mutations and minimizes the possibility of reversion from the new to old information. In order to find such codes, we applied graph theory to analyze the properties of optimal codon sets. We presented the formal procedure in finding the optimal codes with various number of vacant codons that could be assigned to new amino acids. Finally, we discussed the optimal number of the newly incorporated ncAAs and also the optimal size of codon groups that can be assigned to ncAAs.
Collapse
Affiliation(s)
- Kuba Nowak
- Faculty of Mathematics and Computer Science, University of Wrocław, ul. F. Joliot-Curie 15, 50-383 Wrocław, Poland
| | - Paweł Błażej
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| | - Małgorzata Wnetrzak
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| |
Collapse
|
13
|
Maldonado LL, Bertelli AM, Kamenetzky L. Molecular features similarities between SARS-CoV-2, SARS, MERS and key human genes could favour the viral infections and trigger collateral effects. Sci Rep 2021; 11:4108. [PMID: 33602998 PMCID: PMC7893037 DOI: 10.1038/s41598-021-83595-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 01/26/2021] [Indexed: 01/31/2023] Open
Abstract
In December 2019, rising pneumonia cases caused by a novel β-coronavirus (SARS-CoV-2) occurred in Wuhan, China, which has rapidly spread worldwide, causing thousands of deaths. The WHO declared the SARS-CoV-2 outbreak as a public health emergency of international concern, since then several scientists are dedicated to its study. It has been observed that many human viruses have codon usage biases that match highly expressed proteins in the tissues they infect and depend on the host cell machinery for the replication and co-evolution. In this work, we analysed 91 molecular features and codon usage patterns for 339 viral genes and 463 human genes that consisted of 677,873 codon positions. Hereby, we selected the highly expressed genes from human lung tissue to perform computational studies that permit to compare their molecular features with those of SARS, SARS-CoV-2 and MERS genes. The integrated analysis of all the features revealed that certain viral genes and overexpressed human genes have similar codon usage patterns. The main pattern was the A/T bias that together with other features could propitiate the viral infection, enhanced by a host dependant specialization of the translation machinery of only some of the overexpressed genes. The envelope protein E, the membrane glycoprotein M and ORF7 could be further benefited. This could be the key for a facilitated translation and viral replication conducting to different comorbidities depending on the genetic variability of population due to the host translation machinery. This is the first codon usage approach that reveals which human genes could be potentially deregulated due to the codon usage similarities between the host and the viral genes when the virus is already inside the human cells of the lung tissues. Our work leaded to the identification of additional highly expressed human genes which are not the usual suspects but might play a role in the viral infection and settle the basis for further research in the field of human genetics associated with new viral infections. To identify the genes that could be deregulated under a viral infection is important to predict the collateral effects and determine which individuals would be more susceptible based on their genetic features and comorbidities associated.
Collapse
Affiliation(s)
- Lucas L Maldonado
- IMPaM, CONICET, Facultad de Medicina, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires, Argentina.
| | | | - Laura Kamenetzky
- IMPaM, CONICET, Facultad de Medicina, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires, Argentina
- iB3 | Instituto de Biociencias, Biotecnología y Biología traslacional, Departamento de Fisiologia y Biologia Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires, Argentina
| |
Collapse
|
14
|
Evans P, Cox NJ, Gamazon ER. The regulatory genome constrains protein sequence evolution: implications for the search for disease-associated genes. PeerJ 2020; 8:e9554. [PMID: 32765967 PMCID: PMC7380284 DOI: 10.7717/peerj.9554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 06/24/2020] [Indexed: 11/20/2022] Open
Abstract
The development of explanatory models of protein sequence evolution has broad implications for our understanding of cellular biology, population history, and disease etiology. Here we analyze the GTEx transcriptome resource to quantify the effect of the transcriptome on protein sequence evolution in a multi-tissue framework. We find substantial variation among the central nervous system tissues in the effect of expression variance on evolutionary rate, with highly variable genes in the cortex showing significantly greater purifying selection than highly variable genes in subcortical regions (Mann-Whitney U p = 1.4 × 10-4). The remaining tissues cluster in observed expression correlation with evolutionary rate, enabling evolutionary analysis of genes in diverse physiological systems, including digestive, reproductive, and immune systems. Importantly, the tissue in which a gene attains its maximum expression variance significantly varies (p = 5.55 × 10-284) with evolutionary rate, suggesting a tissue-anchored model of protein sequence evolution. Using a large-scale reference resource, we show that the tissue-anchored model provides a transcriptome-based approach to predicting the primary affected tissue of developmental disorders. Using gradient boosted regression trees to model evolutionary rate under a range of model parameters, selected features explain up to 62% of the variation in evolutionary rate and provide additional support for the tissue model. Finally, we investigate several methodological implications, including the importance of evolutionary-rate-aware gene expression imputation models using genetic data for improved search for disease-associated genes in transcriptome-wide association studies. Collectively, this study presents a comprehensive transcriptome-based analysis of a range of factors that may constrain molecular evolution and proposes a novel framework for the study of gene function and disease mechanism.
Collapse
Affiliation(s)
- Patrick Evans
- Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, United States of America
| | - Nancy J Cox
- Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, United States of America
| | - Eric R Gamazon
- Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, United States of America.,Clare Hall, University of Cambridge, Cambridge, United Kingdom.,MRC Epidemiology Unit, University of Cambridge, Cambridge, United Kingdom.,Data Science Institute, Vanderbilt University, Nashville, TN, United States of America
| |
Collapse
|
15
|
Abstract
Darwin's theory of evolution emphasized that positive selection of functional proficiency provides the fitness that ultimately determines the structure of life, a view that has dominated biochemical thinking of enzymes as perfectly optimized for their specific functions. The 20th-century modern synthesis, structural biology, and the central dogma explained the machinery of evolution, and nearly neutral theory explained how selection competes with random fixation dynamics that produce molecular clocks essential e.g. for dating evolutionary histories. However, quantitative proteomics revealed that selection pressures not relating to optimal function play much larger roles than previously thought, acting perhaps most importantly via protein expression levels. This paper first summarizes recent progress in the 21st century toward recovering this universal selection pressure. Then, the paper argues that proteome cost minimization is the dominant, underlying 'non-function' selection pressure controlling most of the evolution of already functionally adapted living systems. A theory of proteome cost minimization is described and argued to have consequences for understanding evolutionary trade-offs, aging, cancer, and neurodegenerative protein-misfolding diseases.
Collapse
|
16
|
Abstract
Adaptive mutations play an important role in molecular evolution. However, the frequency and nature of these mutations at the intramolecular level are poorly understood. To address this, we analyzed the impact of protein architecture on the rate of adaptive substitutions, aiming to understand how protein biophysics influences fitness and adaptation. Using Drosophila melanogaster and Arabidopsis thaliana population genomics data, we fitted models of distribution of fitness effects and estimated the rate of adaptive amino-acid substitutions both at the protein and amino-acid residue level. We performed a comprehensive analysis covering genome, gene, and protein structure, by exploring a multitude of factors with a plausible impact on the rate of adaptive evolution, such as intron number, protein length, secondary structure, relative solvent accessibility, intrinsic protein disorder, chaperone affinity, gene expression, protein function, and protein-protein interactions. We found that the relative solvent accessibility is a major determinant of adaptive evolution, with most adaptive mutations occurring at the surface of proteins. Moreover, we observe that the rate of adaptive substitutions differs between protein functional classes, with genes encoding for protein biosynthesis and degradation signaling exhibiting the fastest rates of protein adaptation. Overall, our results suggest that adaptive evolution in proteins is mainly driven by intermolecular interactions, with host-pathogen coevolution likely playing a major role.
Collapse
Affiliation(s)
- Ana Filipa Moutinho
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Fernanda Fontes Trancoso
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Julien Yann Dutheil
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Unité Mixte de Recherche 5554 Institut des Sciences de l'Evolution, CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| |
Collapse
|
17
|
Victor MP, Acharya D, Begum T, Ghosh TC. The optimization of mRNA expression level by its intrinsic properties—Insights from codon usage pattern and structural stability of mRNA. Genomics 2019; 111:1292-1297. [DOI: 10.1016/j.ygeno.2018.08.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Revised: 08/14/2018] [Accepted: 08/24/2018] [Indexed: 11/17/2022]
|
18
|
Karami K, Zerehdaran S, Javadmanesh A, Shariati MM, Fallahi H. Characterization of bovine (Bos taurus) imprinted genes from genomic to amino acid attributes by data mining approaches. PLoS One 2019; 14:e0217813. [PMID: 31170205 PMCID: PMC6553745 DOI: 10.1371/journal.pone.0217813] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2018] [Accepted: 05/21/2019] [Indexed: 01/05/2023] Open
Abstract
Genomic imprinting results in monoallelic expression of genes in mammals and flowering plants. Understanding the function of imprinted genes improves our knowledge of the regulatory processes in the genome. In this study, we have employed classification and clustering algorithms with attribute weighting to specify the unique attributes of both imprinted (monoallelic) and biallelic expressed genes. We have obtained characteristics of 22 known monoallelically expressed (imprinted) and 8 biallelic expressed genes that have been experimentally validated alongside 208 randomly selected genes in bovine (Bos taurus). Attribute weighting methods and various supervised and unsupervised algorithms in machine learning were applied. Unique characteristics were discovered and used to distinguish mono and biallelic expressed genes from each other in bovine. To obtain the accuracy of classification, 10-fold cross-validation with concerning each combination of attribute weighting (feature selection) and machine learning algorithms, was used. Our approach was able to accurately predict mono and biallelic genes using the genomics and proteomics attributes.
Collapse
Affiliation(s)
- Keyvan Karami
- Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Saeed Zerehdaran
- Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Ali Javadmanesh
- Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Mohammad Mahdi Shariati
- Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Hossein Fallahi
- Department of Biology, School of Sciences, Razi University, Kermanshah, Iran
| |
Collapse
|
19
|
Seike T, Kobayashi Y, Sahara T, Ohgiya S, Kamagata Y, Fujimori KE. Molecular evolutionary engineering of xylose isomerase to improve its catalytic activity and performance of micro-aerobic glucose/xylose co-fermentation in Saccharomyces cerevisiae. BIOTECHNOLOGY FOR BIOFUELS 2019; 12:139. [PMID: 31178927 PMCID: PMC6551904 DOI: 10.1186/s13068-019-1474-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Accepted: 05/23/2019] [Indexed: 06/09/2023]
Abstract
BACKGROUND Expression of d-xylose isomerase having high catalytic activity in Saccharomyces cerevisiae (S. cerevisiae) is a prerequisite for efficient and economical production of bioethanol from cellulosic biomass. Although previous studies demonstrated functional expression of several xylose isomerases (XI) in S. cerevisiae, identification of XIs having higher catalytic activity is needed. Here, we report a new strategy to improve xylose fermentation in the S. cerevisiae strain IR-2 that involves an evolutionary engineering to select top-performing XIs from eight previously reported XIs derived from various species. RESULTS Eight XI genes shown to have good expression in S. cerevisiae were introduced into the strain IR-2 having a deletion of GRE3 and XKS1 overexpression that allows use of d-xylose as a carbon source. Each transformant was evaluated under aerobic and micro-aerobic culture conditions. The strain expressing XI from Lachnoclostridium phytofermentans ISDg (LpXI) had the highest d-xylose consumption rate after 72 h of micro-aerobic fermentation on d-glucose and d-xylose mixed medium. To enhance LpXI catalytic activity, we performed random mutagenesis using error-prone polymerase chain reaction (PCR), which yielded two LpXI candidates, SS82 and SS92, that showed markedly improved fermentation performance. The LpXI genes in these clones carried either T63I or V162A/N303T point mutations. The SS120 strain expressing LpXI with the double mutation of T63I/V162A assimilated nearly 85 g/L d-glucose and 35 g/L d-xylose to produce 53.3 g/L ethanol in 72 h with an ethanol yield of approximately 0.44 (g/g-input sugars). An in vitro enzyme assay showed that, compared to wild-type, the LpXI double mutant in SS120 had a considerably higher V max (0.107 µmol/mg protein/min) and lower K m (37.1 mM). CONCLUSIONS This study demonstrated that LpXI has the highest d-xylose consumption rate among the XIs expressed in IR-2 under micro-aerobic co-fermentation conditions. A combination of novel mutations (T63I and V162A) significantly improved the enzymatic activity of LpXI, indicating that LpXI-T63I/V162A would be a potential construct for highly efficient production of cellulosic ethanol.
Collapse
Affiliation(s)
- Taisuke Seike
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566 Japan
- Present Address: Center for Biosystems Dynamics Research (BDR), RIKEN, 6-2-3 Furuedai, Suita, Osaka 565-0874 Japan
| | - Yosuke Kobayashi
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566 Japan
- Present Address: Biomaterial in Tokyo Company Limited, 4-7 Kashiwa-Inter-Minami, Kashiwa, Chiba 277-0872 Japan
| | - Takehiko Sahara
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566 Japan
| | - Satoru Ohgiya
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-higashi, Toyohira, Sapporo, Hokkaido 062-8517 Japan
| | - Yoichi Kamagata
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566 Japan
| | - Kazuhiro E. Fujimori
- Bioproduction Research Institute (BPRI), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566 Japan
| |
Collapse
|
20
|
Guillén Y, Casillas S, Ruiz A. Genome-Wide Patterns of Sequence Divergence of Protein-Coding Genes Between Drosophila buzzatii and D. mojavensis. J Hered 2019; 110:92-101. [PMID: 30124907 DOI: 10.1093/jhered/esy041] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 08/14/2018] [Indexed: 12/15/2022] Open
Abstract
Evolutionary rates for protein-coding genes are determined not only by natural selection but also by multiple genomic factors including mutation rates, recombination, gene expression levels, and chromosomal location. To investigate the joint effects of different genomic determinants on protein evolution, we compared the coding sequences of 9017 single-copy orthologs between 2 cactophilic species from the Drosophila subgenus, Drosophila mojavensis and D. buzzatii, whose genomes have been previously sequenced. We assessed the impact of 7 genomic determinants, that is, chromosome type, recombination, chromosomal inversions, expression breadth, expression level, gene length, and the number of exons, on divergence rates of protein-coding genes to understand patterns of evolutionary variation. Integrative analysis of these factors revealed that 1) X-linked and autosomal genes evolve at significantly different rates in agreement with the faster-X hypothesis, 2) genes located on the dot chromosome and pericentromeric regions have higher divergence rates, 3) genes located at chromosomes with more fixed inversions have higher pairwise divergence than those located at nearly collinear chromosomes, and 4) gene expression patterns can be considered the strongest determinant of protein evolution. In addition, the number of exons and protein length had a significant effect on pairwise divergence at synonymous sites. All in all, our results show the relative importance of each genomic factor on the rates of protein evolution and functional constraint in these 2 cactophilic Drosophila species.
Collapse
Affiliation(s)
- Yolanda Guillén
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Sònia Casillas
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain.,The Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Alfredo Ruiz
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| |
Collapse
|
21
|
Maldonado LL, Stegmayer G, Milone DH, Oliveira G, Rosenzvit M, Kamenetzky L. Whole genome analysis of codon usage in Echinococcus. Mol Biochem Parasitol 2018; 225:54-66. [DOI: 10.1016/j.molbiopara.2018.08.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2018] [Revised: 07/20/2018] [Accepted: 08/01/2018] [Indexed: 01/15/2023]
|
22
|
Wei X, Zhang J. On the Origin of Compositional Features of Ribosomes. Genome Biol Evol 2018; 10:2010-2016. [PMID: 30059996 PMCID: PMC6097593 DOI: 10.1093/gbe/evy169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/28/2018] [Indexed: 01/08/2023] Open
Abstract
Ribosomes are highly abundant in cells and comprise, besides RNAs of varying lengths, 55–80 similarly sized, short proteins. This seemingly unusual composition is thought to have resulted from selection for rapid autocatalytic ribosome production. Here, we demonstrate that ribosomal protein-splitting mutations cannot accelerate ribosome production. The autocatalytic explanation is also unnecessary, because protein lengths generally decline with expression levels. Although ribosomal proteins are shorter than expected from their expression levels, they are not outliers among members of large protein complexes in mean protein length or coefficient of variation. These observations are explainable because 1) shortening proteins lowers their synthetic cost and reduces the waste from mistranslation-induced protein dysfunction and degradation, 2) such benefits rise with expression levels, and 3) members of large complexes participate in more protein–protein interactions so are less tolerant to mistranslation. These and other considerations suggest that the compositional features of ribosomes originate from cellular energy economics.
Collapse
Affiliation(s)
- Xinzhu Wei
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| |
Collapse
|
23
|
Codon usage of highly expressed genes affects proteome-wide translation efficiency. Proc Natl Acad Sci U S A 2018; 115:E4940-E4949. [PMID: 29735666 DOI: 10.1073/pnas.1719375115] [Citation(s) in RCA: 114] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although the genetic code is redundant, synonymous codons for the same amino acid are not used with equal frequencies in genomes, a phenomenon termed "codon usage bias." Previous studies have demonstrated that synonymous changes in a coding sequence can exert significant cis effects on the gene's expression level. However, whether the codon composition of a gene can also affect the translation efficiency of other genes has not been thoroughly explored. To study how codon usage bias influences the cellular economy of translation, we massively converted abundant codons to their rare synonymous counterpart in several highly expressed genes in Escherichia coli This perturbation reduces both the cellular fitness and the translation efficiency of genes that have high initiation rates and are naturally enriched with the manipulated codon, in agreement with theoretical predictions. Interestingly, we could alleviate the observed phenotypes by increasing the supply of the tRNA for the highly demanded codon, thus demonstrating that the codon usage of highly expressed genes was selected in evolution to maintain the efficiency of global protein translation.
Collapse
|
24
|
Espinar L, Schikora Tamarit MÀ, Domingo J, Carey LB. Promoter architecture determines cotranslational regulation of mRNA. Genome Res 2018; 28:509-518. [PMID: 29567675 PMCID: PMC5880241 DOI: 10.1101/gr.230458.117] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Accepted: 02/27/2018] [Indexed: 01/08/2023]
Abstract
Information that regulates gene expression is encoded throughout each gene but if different regulatory regions can be understood in isolation, or if they interact, is unknown. Here we measure mRNA levels for 10,000 open reading frames (ORFs) transcribed from either an inducible or constitutive promoter. We find that the strength of cotranslational regulation on mRNA levels is determined by promoter architecture. By using a novel computational genetic screen of 6402 RNA-seq experiments, we identify the RNA helicase Dbp2 as the mechanism by which cotranslational regulation is reduced specifically for inducible promoters. Finally, we find that for constitutive genes, but not inducible genes, most of the information encoding regulation of mRNA levels in response to changes in growth rate is encoded in the ORF and not in the promoter. Thus, the ORF sequence is a major regulator of gene expression, and a nonlinear interaction between promoters and ORFs determines mRNA levels.
Collapse
Affiliation(s)
- Lorena Espinar
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | | | - Júlia Domingo
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain.,EMBL-CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain
| | - Lucas B Carey
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| |
Collapse
|
25
|
Dissimilar substitution rates between two strands of DNA influence codon usage pattern in some human genes. Gene 2018; 645:179-187. [PMID: 29229516 DOI: 10.1016/j.gene.2017.12.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/05/2017] [Accepted: 12/07/2017] [Indexed: 11/23/2022]
Abstract
We illustrated the descriptive aspects of codon usage of some important human genes and their expression potential in E. coli. By comparing the results of various codon usage parameters, effects that are due to selection and mutational pressures have been deciphered. The variation in GC3s explains a significant proportion of the variation in codon usage patterns. The codons CGC, CGG, CTG and GCG showed strong positive correlation with GC3, which suggested that codon usage had been influenced by GC bias. We also found that ACC (Thr, RSCU-1.77), GCC (Ala, RSCU-1.67), CCC (Pro, RSCU-1.54), TCC (Ser, RSCU-1.47) were frequently used which signified that C was common at 2nd and 3rd codon positions. Correspondence analysis revealed that F1 axis had significant correlation with various GC contents suggesting that compositional properties under mutation pressure might affect codon usage bias. Nc-GC3 plot analysis suggested that both mutation pressure and natural selection might affect the codon usage bias which is also supported by neutrality plot analysis. The dinucleotide CT, TG and AG were significantly over-represented and CG, TA, AT, TT, and GT were underrepresented due to high rate of spontaneous mutation resulting from cytosine deamination.
Collapse
|
26
|
Rogers DW, McConnell E, Miller EL, Greig D. Diminishing Returns on Intragenic Repeat Number Expansion in the Production of Signaling Peptides. Mol Biol Evol 2017; 34:3176-3185. [PMID: 28961820 PMCID: PMC5850478 DOI: 10.1093/molbev/msx243] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Signaling peptides enable communication between cells, both within and between individuals, and are therefore key to the control of complex physiological and behavioral responses. Since their small sizes prevent direct transmission to secretory pathways, these peptides are often produced as part of a larger polyprotein comprising precursors for multiple related or identical peptides; the physiological and behavioral consequences of this unusual gene structure are not understood. Here, we show that the number of mature-pheromone-encoding repeats in the yeast α-mating-factor gene MFα1 varies considerably between closely related isolates of both Saccharomyces cerevisiae and its sister species Saccharomyces paradoxus. Variation in repeat number has important phenotypic consequences: Increasing repeat number caused higher pheromone production and greater competitive mating success. However, the magnitude of the improvement decreased with increasing repeat number such that repeat amplification beyond that observed in natural isolates failed to generate more pheromone, and could actually reduce sexual fitness. We investigate multiple explanations for this pattern of diminishing returns and find that our results are most consistent with a translational trade-off: Increasing the number of encoded repeats results in more mature pheromone per translation event, but also generates longer transcripts thereby reducing the rate of translation—a phenomenon known as length-dependent translation. Length-dependent translation may be a powerful constraint on the evolution of genes encoding repetitive or modular proteins, with important physiological and behavioral consequences across eukaryotes.
Collapse
Affiliation(s)
- David W Rogers
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Ellen McConnell
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Eric L Miller
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Veterinary Medicine, Cambridge Veterinary School, University of Cambridge, Cambridge, United Kingdom
| | - Duncan Greig
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| |
Collapse
|
27
|
Mazumdar P, Binti Othman R, Mebus K, Ramakrishnan N, Ann Harikrishna J. Codon usage and codon pair patterns in non-grass monocot genomes. ANNALS OF BOTANY 2017; 120:893-909. [PMID: 29155926 PMCID: PMC5710610 DOI: 10.1093/aob/mcx112] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 09/19/2017] [Indexed: 05/19/2023]
Abstract
BACKGROUND AND AIMS Studies on codon usage in monocots have focused on grasses, and observed patterns of this taxon were generalized to all monocot species. Here, non-grass monocot species were analysed to investigate the differences between grass and non-grass monocots. METHODS First, studies of codon usage in monocots were reviewed. The current information was then extended regarding codon usage, as well as codon-pair context bias, using four completely sequenced non-grass monocot genomes (Musa acuminata, Musa balbisiana, Phoenix dactylifera and Spirodela polyrhiza) for which comparable transcriptome datasets are available. Measurements were taken regarding relative synonymous codon usage, effective number of codons, derived optimal codon and GC content and then the relationships investigated to infer the underlying evolutionary forces. KEY RESULTS The research identified optimal codons, rare codons and preferred codon-pair context in the non-grass monocot species studied. In contrast to the bimodal distribution of GC3 (GC content in third codon position) in grasses, non-grass monocots showed a unimodal distribution. Disproportionate use of G and C (and of A and T) in two- and four-codon amino acids detected in the analysis rules out the mutational bias hypothesis as an explanation of genomic variation in GC content. There was found to be a positive relationship between CAI (codon adaptation index; predicts the level of expression of a gene) and GC3. In addition, a strong correlation was observed between coding and genomic GC content and negative correlation of GC3 with gene length, indicating a strong impact of GC-biased gene conversion (gBGC) in shaping codon usage and nucleotide composition in non-grass monocots. CONCLUSION Optimal codons in these non-grass monocots show a preference for G/C in the third codon position. These results support the concept that codon usage and nucleotide composition in non-grass monocots are mainly driven by gBGC.
Collapse
Affiliation(s)
- Purabi Mazumdar
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
| | - RofinaYasmin Binti Othman
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Katharina Mebus
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
| | - N Ramakrishnan
- Electrical and Computer System Engineering, School of Engineering, Monash University Malaysia, Bandar Sunway, Malaysia
| | - Jennifer Ann Harikrishna
- Centre for Research in Biotechnology for Agriculture, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
- For correspondence. E-mail:
| |
Collapse
|
28
|
Higgs PG, Hao W, Golding GB. Identification of Conflicting Selective Effects on Highly Expressed Genes. Evol Bioinform Online 2017. [DOI: 10.1177/117693430700300015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike's Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage.
Collapse
Affiliation(s)
- Paul G. Higgs
- Department of Physics and Astronomy, McMaster University, Hamilton, Ontario L8S 4M1
| | - Weilong Hao
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1
| | - G. Brian Golding
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1
| |
Collapse
|
29
|
Cuperus JT, Groves B, Kuchina A, Rosenberg AB, Jojic N, Fields S, Seelig G. Deep learning of the regulatory grammar of yeast 5' untranslated regions from 500,000 random sequences. Genome Res 2017; 27:2015-2024. [PMID: 29097404 PMCID: PMC5741052 DOI: 10.1101/gr.224964.117] [Citation(s) in RCA: 102] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 10/18/2017] [Indexed: 11/25/2022]
Abstract
Our ability to predict protein expression from DNA sequence alone remains poor, reflecting our limited understanding of cis-regulatory grammar and hampering the design of engineered genes for synthetic biology applications. Here, we generate a model that predicts the protein expression of the 5′ untranslated region (UTR) of mRNAs in the yeast Saccharomyces cerevisiae. We constructed a library of half a million 50-nucleotide-long random 5′ UTRs and assayed their activity in a massively parallel growth selection experiment. The resulting data allow us to quantify the impact on protein expression of Kozak sequence composition, upstream open reading frames (uORFs), and secondary structure. We trained a convolutional neural network (CNN) on the random library and showed that it performs well at predicting the protein expression of both a held-out set of the random 5′ UTRs as well as native S. cerevisiae 5′ UTRs. The model additionally was used to computationally evolve highly active 5′ UTRs. We confirmed experimentally that the great majority of the evolved sequences led to higher protein expression rates than the starting sequences, demonstrating the predictive power of this model.
Collapse
Affiliation(s)
- Josh T Cuperus
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
| | - Benjamin Groves
- Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA
| | - Anna Kuchina
- Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA
| | - Alexander B Rosenberg
- Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA
| | | | - Stanley Fields
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA.,Department of Medicine, University of Washington, Seattle, Washington 98195, USA
| | - Georg Seelig
- Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA.,Department of Computer Science & Engineering, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
30
|
Codon usage and amino acid usage influence genes expression level. Genetica 2017; 146:53-63. [DOI: 10.1007/s10709-017-9996-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2017] [Accepted: 10/09/2017] [Indexed: 11/30/2022]
|
31
|
Hanson G, Coller J. Codon optimality, bias and usage in translation and mRNA decay. Nat Rev Mol Cell Biol 2017; 19:20-30. [PMID: 29018283 DOI: 10.1038/nrm.2017.91] [Citation(s) in RCA: 409] [Impact Index Per Article: 58.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The advent of ribosome profiling and other tools to probe mRNA translation has revealed that codon bias - the uneven use of synonymous codons in the transcriptome - serves as a secondary genetic code: a code that guides the efficiency of protein production, the fidelity of translation and the metabolism of mRNAs. Recent advancements in our understanding of mRNA decay have revealed a tight coupling between ribosome dynamics and the stability of mRNA transcripts; this coupling integrates codon bias into the concept of codon optimality, or the effects that specific codons and tRNA concentrations have on the efficiency and fidelity of the translation machinery. In this Review, we first discuss the evidence for codon-dependent effects on translation, beginning with the basic mechanisms through which translation perturbation can affect translation efficiency, protein folding and transcript stability. We then discuss how codon effects are leveraged by the cell to tailor the proteome to maintain homeostasis, execute specific gene expression programmes of growth or differentiation and optimize the efficiency of protein production.
Collapse
Affiliation(s)
- Gavin Hanson
- Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Jeff Coller
- Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA
| |
Collapse
|
32
|
Mahato NK, Gupta V, Singh P, Kumari R, Verma H, Tripathi C, Rani P, Sharma A, Singhvi N, Sood U, Hira P, Kohli P, Nayyar N, Puri A, Bajaj A, Kumar R, Negi V, Talwar C, Khurana H, Nagar S, Sharma M, Mishra H, Singh AK, Dhingra G, Negi RK, Shakarad M, Singh Y, Lal R. Microbial taxonomy in the era of OMICS: application of DNA sequences, computational tools and techniques. Antonie van Leeuwenhoek 2017; 110:1357-1371. [PMID: 28831610 DOI: 10.1007/s10482-017-0928-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 08/10/2017] [Indexed: 02/06/2023]
Abstract
The current prokaryotic taxonomy classifies phenotypically and genotypically diverse microorganisms using a polyphasic approach. With advances in the next-generation sequencing technologies and computational tools for analysis of genomes, the traditional polyphasic method is complemented with genomic data to delineate and classify bacterial genera and species as an alternative to cumbersome and error-prone laboratory tests. This review discusses the applications of sequence-based tools and techniques for bacterial classification and provides a scheme for more robust and reproducible bacterial classification based on genomic data. The present review highlights promising tools and techniques such as ortho-Average Nucleotide Identity, Genome to Genome Distance Calculator and Multi Locus Sequence Analysis, which can be validly employed for characterizing novel microorganisms and assessing phylogenetic relationships. In addition, the review discusses the possibility of employing metagenomic data to assess the phylogenetic associations of uncultured microorganisms. Through this article, we present a review of genomic approaches that can be included in the scheme of taxonomy of bacteria and archaea based on computational and in silico advances to boost the credibility of taxonomic classification in this genomic era.
Collapse
Affiliation(s)
| | - Vipin Gupta
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Priya Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Rashmi Kumari
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | | | - Charu Tripathi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Pooja Rani
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Anukriti Sharma
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Nirjara Singhvi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Utkarsh Sood
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Princy Hira
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Puneet Kohli
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Namita Nayyar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Akshita Puri
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Abhay Bajaj
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Roshan Kumar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Vivek Negi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Chandni Talwar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Himani Khurana
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Shekhar Nagar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Monika Sharma
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Harshita Mishra
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Amit Kumar Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Gauri Dhingra
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Ram Krishan Negi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | | | - Yogendra Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Rup Lal
- Department of Zoology, University of Delhi, Delhi, 110007, India.
| |
Collapse
|
33
|
Seligmann H, Warthi G. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes. Comput Struct Biotechnol J 2017; 15:412-424. [PMID: 28924459 PMCID: PMC5591391 DOI: 10.1016/j.csbj.2017.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Revised: 07/20/2017] [Accepted: 08/05/2017] [Indexed: 12/14/2022] Open
Abstract
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Collapse
Affiliation(s)
- Hervé Seligmann
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
- Dept. Ecol Evol Behav, Alexander Silberman Inst Life Sci, The Hebrew University of Jerusalem, IL-91904 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
| |
Collapse
|
34
|
Abstract
Males and females exhibit highly dimorphic phenotypes, particularly in their gonads, which is believed to be driven largely by differential gene expression. Typically, the protein sequences of genes upregulated in males, or male-biased genes, evolve rapidly as compared to female-biased and unbiased genes. To date, the specific study of gonad-biased genes remains uncommon in metazoans. Here, we identified and studied a total of 2927, 2013, and 4449 coding sequences (CDS) with ovary-biased, testis-biased, and unbiased expression, respectively, in the yellow fever mosquito Aedes aegypti The results showed that ovary-biased and unbiased CDS had higher nonsynonymous to synonymous substitution rates (dN/dS) and lower optimal codon usage (those codons that promote efficient translation) than testis-biased genes. Further, we observed higher dN/dS in ovary-biased genes than in testis-biased genes, even for genes coexpressed in nonsexual (embryo) tissues. Ovary-specific genes evolved exceptionally fast, as compared to testis- or embryo-specific genes, and exhibited higher frequency of positive selection. Genes with ovary expression were preferentially involved in olfactory binding and reception. We hypothesize that at least two potential mechanisms could explain rapid evolution of ovary-biased genes in this mosquito: (1) the evolutionary rate of ovary-biased genes may be accelerated by sexual selection (including female-female competition or male-mate choice) affecting olfactory genes during female swarming by males, and/or by adaptive evolution of olfactory signaling within the female reproductive system (e.g., sperm-ovary signaling); and/or (2) testis-biased genes may exhibit decelerated evolutionary rates due to the formation of mating plugs in the female after copulation, which limits male-male sperm competition.
Collapse
|
35
|
The Impact of Selection at the Amino Acid Level on the Usage of Synonymous Codons. G3-GENES GENOMES GENETICS 2017; 7:967-981. [PMID: 28122952 PMCID: PMC5345726 DOI: 10.1534/g3.116.038125] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
There are two main forces that affect usage of synonymous codons: directional mutational pressure and selection. The effectiveness of protein translation is usually considered as the main selectional factor. However, biased codon usage can also be a byproduct of a general selection at the amino acid level interacting with nucleotide replacements. To evaluate the validity and strength of such an effect, we superimposed >3.5 billion unrestricted mutational processes on the selection of nonsynonymous substitutions based on the differences in physicochemical properties of the coded amino acids. Using a modified evolutionary optimization algorithm, we determined the conditions in which the effect on the relative codon usage is maximized. We found that the effect is enhanced by mutational processes generating more adenine and thymine than guanine and cytosine, as well as more purines than pyrimidines. Interestingly, this effect is observed only under an unrestricted model of nucleotide substitution, and disappears when the mutational process is time-reversible. Comparison of the simulation results with data for real protein coding sequences indicates that the impact of selection at the amino acid level on synonymous codon usage cannot be neglected. Furthermore, it can considerably interfere, especially in AT-rich genomes, with other selections on codon usage, e.g., translational efficiency. It may also lead to difficulties in the recognition of other effects influencing codon bias, and an overestimation of protein coding sequences whose codon usage is subjected to adaptational selection.
Collapse
|
36
|
Sen A, Hsieh WC, Aguilar RC. The Information Content of Glutamine-Rich Sequences Define Protein Functional Characteristics. PROCEEDINGS OF THE IEEE. INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS 2017; 105:385-393. [PMID: 32963411 PMCID: PMC7505158 DOI: 10.1109/jproc.2016.2613076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The presence of abnormally expanded glutamine (Q) repeats within specific proteins (e.g., huntingtin) are the well-established cause of several neurogenerative diseases, including Huntington disease and spinocerebellar ataxias. However, the impact of "expanded Q" stretches on the protein function is not well-understood, mostly due to lack of knowledge about the physiological role of Q repeats and the mechanism by which these repeats achieve functional-specificity. Indeed, is intriguing that regions with such low complexity (low information content) can display exquisite functional specificity. Prompting the question: where is this information stored? Applying biochemical/structural constraints and statistical analysis of protein composition we identified Q-rich (QR) regions present in coiled coils of yeast transcription factors and endocytic proteins. Our analysis indicated the existence of non-Q amino-acids differentially enriched or excluded from QR regions in one protein group versus the other. Importantly, when the non-Q amino-acids from an endocytic protein were exchanged by the ones enriched in QR from transcription factors, the resulting protein was unable to localize to the plasma membrane and was instead found in the nucleus. These results indicate that while QR repeats can efficiently engage in binding, the non-Q amino-acids provide essential specificity information. We speculate that coupling low complexity regions with information-intensive determinants might be a strategy used in many protein systems involved in different biological processes.
Collapse
Affiliation(s)
- Arpita Sen
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Current address, Dept. of Molecular & Cell Biology, University of California, Berkeley
| | - Wen-Chieh Hsieh
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - R. Claudio Aguilar
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
37
|
Thompson DA, Cubillos FA. Natural gene expression variation studies in yeast. Yeast 2016; 34:3-17. [PMID: 27668700 DOI: 10.1002/yea.3210] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Revised: 09/16/2016] [Accepted: 09/18/2016] [Indexed: 11/06/2022] Open
Abstract
The rise of sequence information across different yeast species and strains is driving an increasing number of studies in the emerging field of genomics to associate polymorphic variants, mRNA abundance and phenotypic differences between individuals. Here, we gathered evidence from recent studies covering several layers that define the genotype-phenotype gap, such as mRNA abundance, allele-specific expression and translation efficiency to demonstrate how genetic variants co-evolve and define an individual's genome. Moreover, we exposed several antecedents where inter- and intra-specific studies led to opposite conclusions, probably owing to genetic divergence. Future studies in this area will benefit from the access to a massive array of well-annotated genomes and new sequencing technologies, which will allow the fine breakdown of the complex layers that delineate the genotype-phenotype map. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
| | - Francisco A Cubillos
- Centro de Estudios en Ciencia y Tecnología de Alimentos, Universidad de Santiago de Chile, Santiago, Chile.,Millennium Nucleus for Fungal Integrative and Synthetic Biology.,Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
| |
Collapse
|
38
|
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes. Mar Genomics 2016; 32:31-39. [PMID: 27733306 DOI: 10.1016/j.margen.2016.10.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 09/11/2016] [Accepted: 10/03/2016] [Indexed: 11/20/2022]
Abstract
With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes.
Collapse
|
39
|
Cakiroglu SA, Zaugg JB, Luscombe NM. Backmasking in the yeast genome: encoding overlapping information for protein-coding and RNA degradation. Nucleic Acids Res 2016; 44:8065-72. [PMID: 27492286 PMCID: PMC5041482 DOI: 10.1093/nar/gkw683] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Revised: 07/20/2016] [Accepted: 07/24/2016] [Indexed: 01/04/2023] Open
Abstract
Backmasking is a recording technique used to hide a sound or message in a music track in reverse, meaning that it is only audible when the record is played backwards. Analogously, the compact yeast genome encodes for diverse sources of information such as overlapping coding and non-coding transcripts, and protein-binding sites on the two complementary DNA strands. Examples are the consensus binding site sequences of the RNA-binding proteins Nrd1 and Nab3 that target non-coding transcripts for degradation. Here, by examining the overlap of stable (SUTs, stable unannotated transcripts) and unstable (CUTs, cryptic unstable transcripts) transcripts with protein-coding genes, we show that the predicted Nrd1 and Nab3-binding site sequences occur at differing frequencies. They are always depleted in the sense direction of protein-coding genes, thus avoiding degradation of the transcript. However in the antisense direction, predicted binding sites occur at high frequencies in genes with overlapping unstable ncRNAs (CUTs), so limiting the availability of non-functional transcripts. In contrast they are depleted in genes with overlapping stable ncRNAs (SUTs), presumably to avoid degrading the non-coding transcript. The protein-coding genes maintain similar amino-acid contents, but they display distinct codon usages so that Nrd1 and Nab3-binding sites can arise at differing frequencies in antisense depending on the overlapping transcript type. Our study demonstrates how yeast has evolved to encode multiple layers of information-protein-coding genes in one strand and the relative chance of degrading antisense RNA in the other strand-in the same regions of a compact genome.
Collapse
Affiliation(s)
- S Aylin Cakiroglu
- The Francis Crick Institute, 44 Lincoln's Inn Fields Laboratory, London WC2A 3LY, UK
| | - Judith B Zaugg
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Nicholas M Luscombe
- The Francis Crick Institute, 44 Lincoln's Inn Fields Laboratory, London WC2A 3LY, UK UCL Genetics Institute, University College London, London WC1E 6BT, UK Okinawa Institute of Science & Technology Graduate University, Okinawa 904-0495, Japan
| |
Collapse
|
40
|
Whittle CA, Extavour CG. Expression-Linked Patterns of Codon Usage, Amino Acid Frequency, and Protein Length in the Basally Branching Arthropod Parasteatoda tepidariorum. Genome Biol Evol 2016; 8:2722-36. [PMID: 27017527 PMCID: PMC5630913 DOI: 10.1093/gbe/evw068] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model.
Collapse
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University
| | - Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University Department of Molecular and Cellular Biology, Harvard University
| |
Collapse
|
41
|
Hoek TA, Axelrod K, Biancalani T, Yurtsev EA, Liu J, Gore J. Resource Availability Modulates the Cooperative and Competitive Nature of a Microbial Cross-Feeding Mutualism. PLoS Biol 2016; 14:e1002540. [PMID: 27557335 PMCID: PMC4996419 DOI: 10.1371/journal.pbio.1002540] [Citation(s) in RCA: 136] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 07/28/2016] [Indexed: 12/19/2022] Open
Abstract
Mutualisms between species play an important role in ecosystem function and stability. However, in some environments, the competitive aspects of an interaction may dominate the mutualistic aspects. Although these transitions could have far-reaching implications, it has been difficult to study the causes and consequences of this mutualistic–competitive transition in experimentally tractable systems. Here, we study a microbial cross-feeding mutualism in which each yeast strain supplies an essential amino acid for its partner strain. We find that, depending upon the amount of freely available amino acid in the environment, this pair of strains can exhibit an obligatory mutualism, facultative mutualism, competition, parasitism, competitive exclusion, or failed mutualism leading to extinction of the population. A simple model capturing the essential features of this interaction explains how resource availability modulates the interaction and predicts that changes in the dynamics of the mutualism in deteriorating environments can provide advance warning that collapse of the mutualism is imminent. We confirm this prediction experimentally by showing that, in the high nutrient competitive regime, the strains rapidly reach a common carrying capacity before slowly reaching the equilibrium ratio between the strains. However, in the low nutrient regime, before collapse of the obligate mutualism, we find that the ratio rapidly reaches its equilibrium and it is the total abundance that is slow to reach equilibrium. Our results provide a general framework for how mutualisms may transition between qualitatively different regimes of interaction in response to changes in nutrient availability in the environment. A combination of computational modeling and experiments reveals the striking effects of changing resource availability on the population dynamics observed between two cross-feeding yeast strains. Species often engage in mutualistic interactions that are beneficial for both partners. However, there is also a cost associated with cooperation, for example, in the form of energy required to make nutrients for a partner. When environments change, the costs and benefits of cooperating can change as well, and this can cause the mutualistic interaction to break down into other interaction types, such as parasitism. In this study, we varied nutrient availability to examine how changing environments can affect the interaction between two cross-feeding yeast strains. Lower nutrient concentrations made each strain more dependent on the nutrients provided by its partner strain and thus favored cooperation. Using both experiments and mathematic models, we found that in different environments, these yeast strains can interact in at least seven different qualitatively different ways, including obligate mutualism, facultative mutualism, parasitism, and competition. We also found that the dynamics of how the two strains influence each other change drastically in different nutrient concentrations. Examining the population dynamics could therefore potentially be used to predict the stability or collapse of a community.
Collapse
Affiliation(s)
- Tim A. Hoek
- Hubrecht Institute, The Royal Netherlands Academy of Arts and Sciences (KNAW) and University Medical Center Utrecht, Utrecht, the Netherlands
| | - Kevin Axelrod
- Biophysics PhD Program, Harvard University, Cambridge, Massachusetts, United States of America
| | - Tommaso Biancalani
- Physics of Living Systems, Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Eugene A. Yurtsev
- Physics of Living Systems, Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Jinghui Liu
- Physics of Living Systems, Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Jeff Gore
- Physics of Living Systems, Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
42
|
Bazzini AA, Del Viso F, Moreno-Mateos MA, Johnstone TG, Vejnar CE, Qin Y, Yao J, Khokha MK, Giraldez AJ. Codon identity regulates mRNA stability and translation efficiency during the maternal-to-zygotic transition. EMBO J 2016; 35:2087-2103. [PMID: 27436874 DOI: 10.15252/embj.201694699] [Citation(s) in RCA: 187] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Accepted: 06/16/2016] [Indexed: 12/26/2022] Open
Abstract
Cellular transitions require dramatic changes in gene expression that are supported by regulated mRNA decay and new transcription. The maternal-to-zygotic transition is a conserved developmental progression during which thousands of maternal mRNAs are cleared by post-transcriptional mechanisms. Although some maternal mRNAs are targeted for degradation by microRNAs, this pathway does not fully explain mRNA clearance. We investigated how codon identity and translation affect mRNA stability during development and homeostasis. We show that the codon triplet contains translation-dependent regulatory information that influences transcript decay. Codon composition shapes maternal mRNA clearance during the maternal-to-zygotic transition in zebrafish, Xenopus, mouse, and Drosophila, and gene expression during homeostasis across human tissues. Some synonymous codons show consistent stabilizing or destabilizing effects, suggesting that amino acid composition influences mRNA stability. Codon composition affects both polyadenylation status and translation efficiency. Thus, the ribosome interprets two codes within the mRNA: the genetic code which specifies the amino acid sequence and a conserved "codon optimality code" that shapes mRNA stability and translation efficiency across vertebrates.
Collapse
Affiliation(s)
- Ariel A Bazzini
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Florencia Del Viso
- Departments of Pediatrics, Yale University School of Medicine, New Haven, CT, USA
| | | | - Timothy G Johnstone
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
| | - Charles E Vejnar
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
| | - Yidan Qin
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, USA
| | - Jun Yao
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, USA
| | - Mustafa K Khokha
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA Departments of Pediatrics, Yale University School of Medicine, New Haven, CT, USA
| | - Antonio J Giraldez
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA Yale Stem Cell Center, Yale University School of Medicine, New Haven, CT, USA Yale Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
43
|
Satapathy SS, Powdel BR, Buragohain AK, Ray SK. Discrepancy among the synonymous codons with respect to their selection as optimal codon in bacteria. DNA Res 2016; 23:441-449. [PMID: 27426467 PMCID: PMC5066170 DOI: 10.1093/dnares/dsw027] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Accepted: 05/19/2016] [Indexed: 01/05/2023] Open
Abstract
The different triplets encoding the same amino acid, termed as synonymous codons, are not equally abundant in a genome. Factors such as G + C% and tRNA are known to influence their abundance in a genome. However, the order of the nucleotide in each codon per se might also be another factor impacting on its abundance values. Of the synonymous codons for specific amino acids, some are preferentially used in the high expression genes that are referred to as the 'optimal codons' (OCs). In this study, we compared OCs of the 18 amino acids in 221 species of bacteria. It is observed that there is amino acid specific influence for the selection of OCs. There is also influence of phylogeny in the choice of OCs for some amino acids such as Glu, Gln, Lys and Leu. The phenomenon of codon bias is also supported by the comparative studies of the abundance values of the synonymous codons with same G + C. It is likely that the order of the nucleotides in the triplet codon is also perhaps involved in the phenomenon of codon usage bias in organisms.
Collapse
Affiliation(s)
| | - Bhesh Raj Powdel
- Department of Statistics, Darrang College, Tezpur 784001, Assam, India
| | - Alak Kumar Buragohain
- Department of Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur 784028, Assam, India.,Office of the Vice-Chancellor, Dibrugarh University, Dibrugarh 786004, Assam, India
| | - Suvendra Kumar Ray
- Department of Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur 784028, Assam, India
| |
Collapse
|
44
|
Wei Y, Wang J, Xia X. Coevolution between Stop Codon Usage and Release Factors in Bacterial Species. Mol Biol Evol 2016; 33:2357-67. [PMID: 27297468 PMCID: PMC4989110 DOI: 10.1093/molbev/msw107] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Three stop codons in bacteria represent different translation termination signals, and their usage is expected to depend on their differences in translation termination efficiency, mutation bias, and relative abundance of release factors (RF1 decoding UAA and UAG, and RF2 decoding UAA and UGA). In 14 bacterial species (covering Proteobacteria, Firmicutes, Cyanobacteria, Actinobacteria and Spirochetes) with cellular RF1 and RF2 quantified, UAA is consistently over-represented in highly expressed genes (HEGs) relative to lowly expressed genes (LEGs), whereas UGA usage is the opposite even in species where RF2 is far more abundant than RF1. UGA usage relative to UAG increases significantly with PRF2 [=RF2/(RF1 + RF2)] as expected from adaptation between stop codons and their decoders. PRF2 is > 0.5 over a wide range of AT content (measured by PAT3 as the proportion of AT at third codon sites), but decreases rapidly toward zero at the high range of PAT3. This explains why bacterial lineages with high PAT3 often have UGA reassigned because of low RF2. There is no indication that UAG is a minor stop codon in bacteria as claimed in a recent publication. The claim is invalid because of the failure to apply the two key criteria in identifying a minor codon: (1) it is least preferred by HEGs (or most preferred by LEGs) and (2) it corresponds to the least abundant decoder. Our results suggest a more plausible explanation for why UAA usage increases, and UGA usage decreases, with PAT3, but UAG usage remains low over the entire PAT3 range.
Collapse
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
| | - Juan Wang
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
| | - Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON, Canada Ottawa Institute of Systems Biology, Ottawa, ON, Canada
| |
Collapse
|
45
|
Yin H, Ma L, Wang G, Li M, Zhang Z. Old genes experience stronger translational selection than young genes. Gene 2016; 590:29-34. [PMID: 27259662 DOI: 10.1016/j.gene.2016.05.041] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Revised: 05/27/2016] [Accepted: 05/29/2016] [Indexed: 12/12/2022]
Abstract
Selection on synonymous codon usage for translation efficiency and/or accuracy has been identified as a widespread mechanism in many living organisms. However, it remains unknown whether translational selection associates closely with gene age and acts differentially on genes with different evolutionary ages. To address this issue, here we investigate the strength of translational selection acting on different aged genes in human. Our results show that old genes present stronger translational selection than young genes, demonstrating that translational selection correlates positively with gene age. We further explore the difference of translational selection in duplicates vs. singletons and in housekeeping vs. tissue-specific genes. We find that translational selection acts comparably in old singletons and old duplicates and stronger translational selection in old genes is contributed primarily by housekeeping genes. For young genes, contrastingly, singletons experience stronger translational selection than duplicates, presumably due to redundant function of duplicated genes during their early evolutionary stage. Taken together, our results indicate that translational selection acting on a gene would not be constant during all stages of evolution, associating closely with gene age.
Collapse
Affiliation(s)
- Hongyan Yin
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lina Ma
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing 100101, China
| | - Guangyu Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mengwei Li
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhang Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
46
|
Ho WW, Smith SD. Molecular evolution of anthocyanin pigmentation genes following losses of flower color. BMC Evol Biol 2016; 16:98. [PMID: 27161359 PMCID: PMC4862180 DOI: 10.1186/s12862-016-0675-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Accepted: 04/29/2016] [Indexed: 11/27/2022] Open
Abstract
Background Phenotypic transitions, such as trait gain or loss, are predicted to carry evolutionary consequences for the genes that control their development. For example, trait losses can result in molecular decay of the pathways underlying the trait. Focusing on the Iochrominae clade (Solanaceae), we examine how repeated losses of floral anthocyanin pigmentation associated with flower color transitions have affected the molecular evolution of three anthocyanin pathway genes (Chi, F3h, and Dfr). Results We recovered intact coding regions for the three genes in all of the lineages that have lost floral pigmentation, suggesting that molecular decay is not associated with these flower color transitions. However, two of the three genes (Chi, F3h) show significantly elevated dN/dS ratios in lineages without floral pigmentation. Maximum likelihood analyses suggest that this increase is due to relaxed constraint on anthocyanin genes in the unpigmented lineages as opposed to positive selection. Despite the increase, the values for dN/dS in both pigmented and unpigmented lineages were consistent overall with purifying selection acting on these loci. Conclusions The broad conservation of anthocyanin pathway genes across lineages with and without floral anthocyanins is consistent with the growing consensus that losses of pigmentation are largely achieved by changes in gene expression as opposed to structural mutations. Moreover, this conservation maintains the potential for regain of flower color, and indicates that evolutionary losses of floral pigmentation may be readily reversible. Electronic supplementary material The online version of this article (doi:10.1186/s12862-016-0675-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Winnie W Ho
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, USA
| | - Stacey D Smith
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, USA.
| |
Collapse
|
47
|
Hajjari M, Sadeghi I, Salavaty A, Nasiri H, Birgani MT. Tissue Specific Expression Levels of Apoptosis Involved Genes Have Correlations with Codon and Amino Acid Usage. Genomics Inform 2016; 14:234-240. [PMID: 28154517 PMCID: PMC5287130 DOI: 10.5808/gi.2016.14.4.234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Revised: 11/25/2016] [Accepted: 12/01/2016] [Indexed: 11/20/2022] Open
Affiliation(s)
- Mohammadreza Hajjari
- Department of Genetics, Faculty of Sciences, Shahid Chamran University of Ahvaz, Ahvaz 61357-83151, Iran
| | - Iman Sadeghi
- Department of Molecular Genetics, Faculty of Biosciences, Tarbiat Modares University of Tehran, Tehran 14115116, Iran
| | - Abbas Salavaty
- Department of Genetics, Faculty of Sciences, Shahid Chamran University of Ahvaz, Ahvaz 61357-83151, Iran
| | - Habib Nasiri
- Department of Medical Genetics, Nika Center of Preventive Medicine and Health Promotion, Tehran 1418944711, Iran
| | - Maryam Tahmasebi Birgani
- Department of Medical Genetics, School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz 61357-15794, Iran
| |
Collapse
|
48
|
Adrion JR, White PS, Montooth KL. The Roles of Compensatory Evolution and Constraint in Aminoacyl tRNA Synthetase Evolution. Mol Biol Evol 2015; 33:152-61. [PMID: 26416980 PMCID: PMC4693975 DOI: 10.1093/molbev/msv206] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Mitochondrial protein translation requires interactions between transfer RNAs encoded by the mitochondrial genome (mt-tRNAs) and mitochondrial aminoacyl tRNA synthetase proteins (mt-aaRS) encoded by the nuclear genome. It has been argued that animal mt-tRNAs have higher deleterious substitution rates relative to their nuclear-encoded counterparts, the cytoplasmic tRNAs (cyt-tRNAs). This dynamic predicts elevated rates of compensatory evolution of mt-aaRS that interact with mt-tRNAs, relative to aaRS that interact with cyt-tRNAs (cyt-aaRS). We find that mt-aaRS do evolve at significantly higher rates (exemplified by higher dN and dN/dS) relative to cyt-aaRS, across mammals, birds, and Drosophila. While this pattern supports a model of compensatory evolution, the level at which a gene is expressed is a more general predictor of protein evolutionary rate. We find that gene expression level explains 10–56% of the variance in aaRS dN/dS, and that cyt-aaRS are more highly expressed in addition to having lower dN/dS values relative to mt-aaRS, consistent with more highly expressed genes being more evolutionarily constrained. Furthermore, we find no evidence of positive selection acting on either class of aaRS protein, as would be expected under a model of compensatory evolution. Nevertheless, the signature of faster mt-aaRS evolution persists in mammalian, but not bird or Drosophila, lineages after controlling for gene expression, suggesting some additional effect of compensatory evolution for mammalian mt-aaRS. We conclude that gene expression is the strongest factor governing differential amino acid substitution rates in proteins interacting with mitochondrial versus cytoplasmic factors, with important differences in mt-aaRS molecular evolution among taxonomic groups.
Collapse
Affiliation(s)
| | - P Signe White
- Department of Biology, Indiana University, Bloomington
| | | |
Collapse
|
49
|
Whittle CA, Extavour CG. Codon and Amino Acid Usage Are Shaped by Selection Across Divergent Model Organisms of the Pancrustacea. G3 (BETHESDA, MD.) 2015; 5:2307-21. [PMID: 26384771 PMCID: PMC4632051 DOI: 10.1534/g3.115.021402] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 08/28/2015] [Indexed: 01/24/2023]
Abstract
In protein-coding genes, synonymous codon usage and amino acid composition correlate to expression in some eukaryotes, and may result from translational selection. Here, we studied large-scale RNA-seq data from three divergent arthropod models, including cricket (Gryllus bimaculatus), milkweed bug (Oncopeltus fasciatus), and the amphipod crustacean Parhyale hawaiensis, and tested for optimization of codon and amino acid usage relative to expression level. We report strong signals of AT3 optimal codons (those favored in highly expressed genes) in G. bimaculatus and O. fasciatus, whereas weaker signs of GC3 optimal codons were found in P. hawaiensis, suggesting selection on codon usage in all three organisms. Further, in G. bimaculatus and O. fasciatus, high expression was associated with lowered frequency of amino acids with large size/complexity (S/C) scores in favor of those with intermediate S/C values; thus, selection may favor smaller amino acids while retaining those of moderate size for protein stability or conformation. In P. hawaiensis, highly transcribed genes had elevated frequency of amino acids with large and small S/C scores, suggesting a complex dynamic in this crustacean. In all species, the highly transcribed genes appeared to favor short proteins, high optimal codon usage, specific amino acids, and were preferentially involved in cell-cycling and protein synthesis. Together, based on examination of 1,680,067, 1,667,783, and 1,326,896 codon sites in G. bimaculatus, O. fasciatus, and P. hawaiensis, respectively, we conclude that translational selection shapes codon and amino acid usage in these three Pancrustacean arthropods.
Collapse
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138
| | - Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138 Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138
| |
Collapse
|
50
|
Mohanta TK, Mohanta N, Mohanta YK, Parida P, Bae H. Genome-wide identification of Calcineurin B-Like (CBL) gene family of plants reveals novel conserved motifs and evolutionary aspects in calcium signaling events. BMC PLANT BIOLOGY 2015; 15:189. [PMID: 26245459 PMCID: PMC4527274 DOI: 10.1186/s12870-015-0543-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 06/09/2015] [Indexed: 05/21/2023]
Abstract
BACKGROUND Calcium ions, the most versatile secondary messenger found in plants, are involved in the regulation of diverse arrays of plant growth and development, as well as biotic and abiotic stress responses. The calcineurin B-like proteins are one of the most important genes that act as calcium sensors. RESULTS In this study, we identified calcineurin B-like gene family members from 38 different plant species and assigned a unique nomenclature to each of them. Sequence analysis showed that, the CBL proteins contain three calcium binding EF-hand domain that contains several conserved Asp and Glu amino acid residues. The third EF-hand of the CBL protein was found to posses the D/E-x-D calcium binding sensor motif. Phylogenetic analysis showed that, the CBL genes fall into six different groups. Additionally, except group B CBLs, all the CBL proteins were found to contain N-terminal palmitoylation and myristoylation sites. An evolutionary study showed that, CBL genes are evolved from a common ancestor and subsequently diverged during the course of evolution of land plants. Tajima's neutrality test showed that, CBL genes are highly polymorphic and evolved via decreasing population size due to balanced selection. Differential expression analysis with cold and heat stress treatment led to differential modulation of OsCBL genes. CONCLUSIONS The basic architecture of plant CBL genes is conserved throughout the plant kingdom. Evolutionary analysis showed that, these genes are evolved from a common ancestor of lower eukaryotic plant lineage and led to broadening of the calcium signaling events in higher eukaryotic organisms.
Collapse
Affiliation(s)
- Tapan Kumar Mohanta
- School of Biotechnology, Yeungnam University Gyeongsan, Gyeongbook, 712-749, Republic of Korea.
| | - Nibedita Mohanta
- Department of Biotechnology, North Orissa University, Sri Ramchandra Vihar, Takatpur, Baripada, Mayurbhanj, Orissa, 757003, India.
| | - Yugal Kishore Mohanta
- Department of Botany, North Orissa University, Sri Ramchandra Vihar, Takatpur, Baripada, Mayurbhanj, Orissa, 757003, India.
| | - Pratap Parida
- Center for studies in Biotechnology, Dibrugarh University, Dibrugarh, 786004, Assam, India.
| | - Hanhong Bae
- School of Biotechnology, Yeungnam University Gyeongsan, Gyeongbook, 712-749, Republic of Korea.
| |
Collapse
|