1
Zhu X, Ricci-Tam C, Hager ER, Sgro AE. Self-cleaving peptides for expression of multiple genes in Dictyostelium discoideum. PLoS One 2023; 18:e0281211. [PMID: 36862626 PMCID: PMC9980757 DOI: 10.1371/journal.pone.0281211] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/19/2022] [Accepted: 01/18/2023] [Indexed: 03/03/2023]
Abstract
The social amoeba Dictyostelium discoideum is a model for a wide range of biological processes including chemotaxis, cell-cell communication, phagocytosis, and development. Interrogating these processes with modern genetic tools often requires the expression of multiple transgenes. While it is possible to transfect multiple transcriptional units, the use of separate promoters and terminators for each gene leads to large plasmid sizes and possible interference between units. In many eukaryotic systems this challenge has been addressed through polycistronic expression mediated by 2A viral peptides, permitting efficient, co-regulated gene expression. Here, we screen the most commonly used 2A peptides, porcine teschovirus-1 2A (P2A), Thosea asigna virus 2A (T2A), equine rhinitis A virus 2A (E2A), and foot-and-mouth disease virus 2A (F2A), for activity in D. discoideum and find that all the screened 2A sequences are effective. However, combining the coding sequences of two proteins into a single transcript leads to notable strain-dependent decreases in expression level, suggesting additional factors regulate gene expression in D. discoideum that merit further investigation. Our results show that P2A is the optimal sequence for polycistronic expression in D. discoideum, opening up new possibilities for genetic engineering in this model system.
Affiliation(s)
- Xinwen Zhu
  - Department of Biomedical Engineering, Boston University, Boston, MA, United States of America
  - Biological Design Center, Boston University, Boston, MA, United States of America
- Chiara Ricci-Tam
  - Department of Biomedical Engineering, Boston University, Boston, MA, United States of America
  - Biological Design Center, Boston University, Boston, MA, United States of America
- Emily R. Hager
  - Department of Biomedical Engineering, Boston University, Boston, MA, United States of America
  - Biological Design Center, Boston University, Boston, MA, United States of America
- Allyson E. Sgro
  - Department of Biomedical Engineering, Boston University, Boston, MA, United States of America
  - Biological Design Center, Boston University, Boston, MA, United States of America
2
Tamjid N, Eskandari S, Karimi Z, Nezafat N, Negahdaripour M. Vaccinomics strategy to design an epitope peptide vaccine against Helicobacter pylori. Process Biochem 2022. [DOI: 10.1016/j.procbio.2022.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022]
3
Chandra S, Gupta K, Khare S, Kohli P, Asok A, Mohan SV, Gowda H, Varadarajan R. The High Mutational Sensitivity of ccdA Antitoxin Is Linked to Codon Optimality. Mol Biol Evol 2022; 39:6693774. [PMID: 36069948 PMCID: PMC9555053 DOI: 10.1093/molbev/msac187] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Indexed: 12/15/2022]
Abstract
Deep mutational scanning studies suggest that synonymous mutations are typically silent and that most exposed, nonactive-site residues are tolerant to mutations. Here, we show that the ccdA antitoxin component of the Escherichia coli ccdAB toxin-antitoxin system is unusually sensitive to mutations when studied in the operonic context. A large fraction (∼80%) of single-codon mutations in the ccdA gene, including many synonymous mutations, show an inactive phenotype, but they retain native-like binding affinity towards the cognate toxin, CcdB. Therefore, the observed phenotypic effects are largely not due to alterations in protein structure/stability, consistent with a large region of CcdA being intrinsically disordered. E. coli codon preference and the strength of ribosome binding associated with translation of the downstream ccdB gene are found to be major contributors to the observed ccdA mutant phenotypes. In select cases, proteomics studies reveal altered ratios of CcdA:CcdB protein levels in vivo, suggesting that the ccdA mutations likely alter relative translation efficiencies of the two genes in the operon. We extend these results by studying single-site synonymous mutations that lead to loss-of-function phenotypes in the relBE operon upon introduction of rarer codons. Thus, in their operonic context, genes are likely to be more sensitive to both synonymous and nonsynonymous point mutations than inferred previously.
Affiliation(s)
- Shruti Khare
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
- Pehu Kohli
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
- Aparna Asok
  - Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
- Harsha Gowda
  - Institute of Bioinformatics, Bangalore 560100, India
4
Arwansyah A, Arif AR, Kade A, Taiyeb M, Ramli I, Santoso T, Ningsih P, Natsir H, Tahril T, Uday Kumar K. Molecular modelling on multiepitope-based vaccine against SARS-CoV-2 using immunoinformatics, molecular docking, and molecular dynamics simulation. SAR and QSAR in Environmental Research 2022; 33:649-675. [PMID: 36083166 DOI: 10.1080/1062936x.2022.2117846] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/12/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
The COVID-19 pandemic caused by SARS-CoV-2 has created a worldwide health emergency. Although current vaccines are readily available, several SARS-CoV-2 variants make the existing vaccines less effective because of mutations in the structural proteins. Furthermore, the appearance of new variants cannot be easily predicted. Therefore, attempts to construct new vaccines or to modify the current vaccines remain pivotal for preventing the spread of the virus. In the present investigation, computational analysis through immunoinformatics, molecular docking, and molecular dynamics (MD) simulation is employed to construct an effective vaccine against SARS-CoV-2. The structural proteins of SARS-CoV-2 are utilized to create a multiepitope-based vaccine (MEV). According to our findings, obtained through the systematic procedures of the current investigation, the MEV construct may be able to trigger a strong immunological response against the virus. Therefore, the designed MEV could be a potential vaccine candidate against SARS-CoV-2, and it is also expected to be effective against other variants.
Affiliation(s)
- A Arwansyah
  - Department of Chemistry Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia
- A R Arif
  - Department of Chemistry, Faculty of Mathematics and Natural Sciences, Hasanuddin University, Makassar, Indonesia
- A Kade
  - Department of Physics Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia
- M Taiyeb
  - Department of Biology, Faculty of Mathematics and Natural Sciences, Makassar State University, Makassar, Indonesia
- I Ramli
  - Department of Physics, Faculty of Science, Universitas Cokroaminoto Palopo, Palopo, Indonesia
- T Santoso
  - Department of Chemistry Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia
- P Ningsih
  - Department of Chemistry Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia
- H Natsir
  - Department of Chemistry, Faculty of Mathematics and Natural Sciences, Hasanuddin University, Makassar, Indonesia
- T Tahril
  - Department of Chemistry Education, Faculty of Teacher Training and Education, Tadulako University, Palu, Indonesia
- K Uday Kumar
  - Department of Radiology, Toxicology and Population Protection, Faculty of Health and Social Studies, University of South Bohemia České Budějovice, Czech Republic
5
Andargie M, Congyi Z. Genome-wide analysis of codon usage in sesame (Sesamum indicum L.). Heliyon 2022; 8:e08687. [PMID: 35106386 PMCID: PMC8789531 DOI: 10.1016/j.heliyon.2021.e08687] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 10/15/2021] [Revised: 11/20/2021] [Accepted: 12/24/2021] [Indexed: 10/28/2022]
Abstract
Sesamum indicum is an ancient oil crop grown in tropical and subtropical areas of the world. We have analyzed 23,538 coding sequences (CDS) of S. indicum to understand the factors shaping codon usage in this important oil crop plant. We identified eleven highly preferred codons in S. indicum that have AT-endings. The slope of the neutrality plot was less than one, while the effective number of codons (ENC) plot showed a distribution above and below the standard curve. There is a significant relationship between protein length and relative synonymous codon usage (RSCU) at the primary axis, while there is a weak correlation between protein length and Nc values. Correspondence analysis conducted on RSCU values differentiated CDS based on their GC content and their characteristic feature and showed a discrete distribution. Moreover, by determining codon usage, we found that the majority of the lignan biosynthesis-related genes showed a weaker codon usage bias. These results provide insights into understanding codon evolution in sesame.
Affiliation(s)
- Mebeaselassie Andargie
  - University of Goettingen, Molecular Phytopathology and Mycotoxin Research, Grisebachstrasse 6, 37077 Goettingen, Germany
- Zhu Congyi
  - Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization (MOA), Guangdong Province Key Laboratory of Tropical and Subtropical Fruit Tree Research, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
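The relative synonymous codon usage (RSCU) measure used in the sesame abstract above can be illustrated with a minimal sketch. It assumes the standard genetic code and the usual definition (observed codon count divided by the count expected if all synonymous codons of that amino acid were used equally). The function name `rscu` and the table construction are illustrative choices, not taken from the cited paper.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds):
    """Return {codon: RSCU} for one in-frame coding sequence (hypothetical helper)."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group sense codons into synonymous families by amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for synonyms in families.values():
        total = sum(counts[c] for c in synonyms)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in synonyms:
            # observed count / (family total / number of synonyms)
            values[c] = counts[c] * len(synonyms) / total
    return values
```

For example, `rscu("GCTGCTGCTGCC")["GCT"]` evaluates to 3.0: alanine has four synonymous codons, and GCT accounts for three of the four alanine codons observed. Values above 1 indicate a codon used more often than expected under equal usage.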
6
Analyses of Lysin-motif Receptor-like Kinase (LysM-RLK) Gene Family in Allotetraploid Brassica napus L. and Its Progenitor Species: An In Silico Study. Cells 2021; 11:cells11010037. [PMID: 35011598 PMCID: PMC8750388 DOI: 10.3390/cells11010037] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 11/02/2021] [Revised: 12/10/2021] [Accepted: 12/20/2021] [Indexed: 12/11/2022]
Abstract
The LysM receptor-like kinases (LysM-RLKs) play a crucial role in plant symbiosis and response to environmental stresses. Brassica napus, B. rapa, and B. oleracea are utilized as valuable vegetables. Different biotic and abiotic stressors affect these crops, resulting in yield losses. Therefore, genome-wide analysis of the LysM-RLK gene family was conducted. From the genomes of the examined species, 33 LysM-RLKs were found. The conserved domains of Brassica LysM-RLKs were divided into three groups: LYK, LYP, and LysMn. In the Brassica LysM-RLK gene family, only segmental duplication has occurred. The Ka/Ks ratio for the duplicated pair of genes was less than one, indicating that the genes' function had not changed over time. The Brassica LysM-RLKs contain 70 cis-elements, indicating that they are involved in stress response. 39 miRNA molecules were responsible for the post-transcriptional regulation of 12 Brassica LysM-RLKs. A total of 22 SSR loci were discovered in 16 Brassica LysM-RLKs. According to RNA-seq data, the highest expression in response to biotic stresses was related to BnLYP6. According to the docking simulations, several residues in the active sites of BnLYP6 are in direct contact with the docked chitin and could be useful in future studies to develop pathogen-resistant B. napus. This research reveals comprehensive information that could lead to the identification of potential genes for Brassica species genetic manipulation.
7
Zhang P, Xu W, Lu X, Wang L. Analysis of codon usage bias of chloroplast genomes in Gynostemma species. Physiology and Molecular Biology of Plants: An International Journal of Functional Plant Biology 2021; 27:2727-2737. [PMID: 35035132 PMCID: PMC8720125 DOI: 10.1007/s12298-021-01105-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 08/04/2021] [Revised: 11/13/2021] [Accepted: 11/20/2021] [Indexed: 05/05/2023]
Abstract
Gynostemma plants are important Chinese medicinal materials and economic crops. Codon usage analysis is a good way to understand organism evolution and phylogeny. There is no report yet on the codon usage bias of chloroplast genomes in Gynostemma species. In this study, the chloroplast genomes of nine Gynostemma species were analyzed systematically to explore the factors affecting the formation of codon usage bias. The codon usage indicators were analyzed. Multivariate statistical analyses, including the neutrality plot, the effective number of codons plot, the parity rule 2 plot and correspondence analysis, were performed. Composition analysis of codons showed that the frequency of GC in chloroplast genes of all nine Gynostemma species was less than 50%, and the protein-coding sequences of chloroplast genes preferred to end with A/T at the third codon position. The chloroplast genes had an overall weak codon usage bias. A total of 29 high-frequency codons and 12 optimal codons were identified. These could provide useful information for optimizing and modifying codons and thus improving gene expression in Gynostemma species. The results of multivariate analysis showed that the codon usage patterns were affected not by a single factor but by multiple factors. Mutation pressure, natural selection and base composition might have an influence on the codon usage patterns, while natural selection might be the main determinant. The study could provide a reference for organism evolution and phylogeny of Gynostemma species and help to understand the patterns of codons in chloroplast genomes in other plant species. Supplementary information: The online version contains supplementary material available at 10.1007/s12298-021-01105-z.
Affiliation(s)
- Peipei Zhang
  - School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 211198 Jiangsu China
- Wenbo Xu
  - School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 211198 Jiangsu China
- Xu Lu
  - School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 211198 Jiangsu China
- Long Wang
  - School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 211198 Jiangsu China
8
Das JK, Roy S. Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns. Genomics 2021; 113:2177-2188. [PMID: 34019999 PMCID: PMC8131179 DOI: 10.1016/j.ygeno.2021.05.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 07/19/2020] [Revised: 05/09/2021] [Accepted: 05/14/2021] [Indexed: 01/04/2023]
Abstract
The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. It is essential to understand the survival ability of the viruses, their adaptation to hosts, and their evolution. The current analysis revealed a high abundance of the relative dinucleotide (odds ratio), GC and CT pairs in the first and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability of GC content in the third position of the codon among the seven coronaviruses was observed. Codons with high RSCU values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. In order to elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. The severe and mild categories of CoVs were positioned in different clades. A comparative phylogenetic study with other coronaviruses depicted that SARS-CoV-2 is close to the CoV isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of the effective number of codon (ENC) usage bias showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The ENC plot against GC3 suggested that the mutational bias might have a role in determining the codon usage variation among candidate viruses. A codon adaptability study on a few human host parasites (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV.
Affiliation(s)
- Jayanta Kumar Das
  - Department of Pediatrics, Johns Hopkins University School of Medicine, MD, USA.
- Swarup Roy
  - Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India.
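The "ENC plot against GC3" mentioned in the abstract above compares each gene's observed effective number of codons with the value expected when codon usage is shaped by base composition alone. A minimal sketch of that reference curve follows, assuming Wright's (1990) approximation ENC = 2 + s + 29 / (s² + (1 − s)²), where s is the GC content at synonymous third codon positions. The function names `expected_enc` and `enc_ratio` are illustrative and do not come from the cited study.

```python
def expected_enc(gc3s):
    """Expected ENC under composition-only constraints (Wright's approximation).

    gc3s is the GC fraction at synonymous third codon positions, in [0, 1].
    """
    if not 0.0 <= gc3s <= 1.0:
        raise ValueError("gc3s must be a fraction between 0 and 1")
    return 2.0 + gc3s + 29.0 / (gc3s ** 2 + (1.0 - gc3s) ** 2)


def enc_ratio(observed_enc, gc3s):
    """Observed/expected ENC; values below 1 put a gene under the standard curve."""
    return observed_enc / expected_enc(gc3s)
```

With these definitions, `expected_enc(0.5)` is 60.5, the top of the curve, and the expectation falls to 31 and 32 at the extremes `gc3s = 0` and `gc3s = 1`. Genes lying well below the curve are usually read as carrying codon usage bias beyond what base composition explains.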
9
de Oliveira JL, Morales AC, Hurst LD, Urrutia AO, Thompson CRL, Wolf JB. Inferring Adaptive Codon Preference to Understand Sources of Selection Shaping Codon Usage Bias. Mol Biol Evol 2021; 38:3247-3266. [PMID: 33871580 PMCID: PMC8321536 DOI: 10.1093/molbev/msab099] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 01/09/2023]
Abstract
Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify "adaptive codon preference," a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated "preference" largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.
Affiliation(s)
- Janaina Lima de Oliveira
  - Instituto de Biologia, Universidade Federal da Bahia, Salvador, Bahia, 40170-115, Brazil; Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Atahualpa Castillo Morales
  - Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Laurence D Hurst
  - Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Araxi O Urrutia
  - Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK; Instituto de Ecologia, UNAM, Ciudad de Mexico 04510, Mexico
- Christopher R L Thompson
  - Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London, WC1E 6BT, UK
- Jason B Wolf
  - Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
10
Construction of anti-codon table of the plant kingdom and evolution of tRNA selenocysteine (tRNA Sec). BMC Genomics 2020; 21:804. [PMID: 33213362 PMCID: PMC7678280 DOI: 10.1186/s12864-020-07216-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 07/15/2020] [Accepted: 11/08/2020] [Indexed: 12/26/2022]
Abstract
Background: The tRNAs act as a bridge between the coding mRNA and incoming amino acids during protein translation. The anti-codon of the tRNA recognizes the codon of the mRNA and delivers the amino acid into the protein translation chain. However, the exact abundance of anti-codons in the genome was not known, nor whether the frequency of abundance remains the same across the plant lineage. Results: We therefore analysed the tRNAnome of 128 plant species and report an anti-codon table of the plant kingdom. We found that the CAU anti-codon of tRNAMet has the highest (5.039%) whereas the GCG anti-codon of tRNAArg has the lowest (0.004%) abundance. However, when we compared the anti-codon frequencies according to the tRNA isotypes, we found that tRNALeu (7.808%) has the highest abundance, followed by tRNASer (7.668%) and tRNAGly (7.523%). Similarly, suppressor tRNA (0.036%) has the lowest abundance, followed by tRNASec (0.066%) and tRNAHis (2.109%). The genomes of Ipomoea nil, Papaver somniferum, and Zea mays encoded the highest number of anti-codons (isoacceptors) at 59 each, whereas the genome of Ostreococcus tauri was found to encode only 18 isoacceptors. The tRNASec genes have undergone losses more frequently than duplications, and we found that tRNASec showed anti-codon switching during the course of evolution. Conclusion: The anti-codon table of the plant tRNA will enable us to understand the synonymous codon usage of the plant kingdom and can be very helpful for understanding which codon is preferred over others during translation. Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-020-07216-3.
11
12
Komar AA. [Synonymous Codon Usage-a Guide for Co-Translational Protein Folding in the Cell]. Mol Biol (Mosk) 2019; 53:883-898. [PMID: 31876270 PMCID: PMC8462064 DOI: 10.1134/s0026898419060090] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 04/21/2019] [Accepted: 05/14/2019] [Indexed: 06/10/2023]
Abstract
In the cell, protein folding begins during protein synthesis/translation and thus is a co-translational process. Co-translational protein folding is tightly linked to translation elongation, which is not a uniform process. While there are many reasons for translation non-uniformity, it is generally believed that non-uniform synonymous codon usage is one of the key factors modulating translation elongation rates. Frequent/optimal codons as a rule are translated more rapidly than infrequently used ones and vice versa. Over 30 years ago, it was hypothesized that changes in synonymous codon usage affecting translation elongation rates could impinge on co-translational protein folding and that many synonymous codons are strategically placed within mRNA to ensure particular translation kinetics facilitating productive step-by-step co-translational folding of proteins. It was suggested that these particular translation kinetics (and, specifically, translation pause sites) may define the window of opportunity for the protein parts to fold locally, particularly at the critical points where folding is far from equilibrium. It was thus hypothesized that synonymous codons may provide a secondary code for protein folding in the cell. Although mostly accepted now, this hypothesis appeared to be difficult to prove, and many convincing results were obtained only relatively recently. Here, I review the progress in the field and explain why this simple idea appeared to be so challenging to prove.
Affiliation(s)
- A A Komar
  - Center for Gene Regulation in Health and Disease and Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, Ohio, 44115 USA
  - Department of Biochemistry and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio, 44106 USA
  - Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, 44195 USA
  - DAPCEL, Inc., Cleveland, Ohio, 44106 USA
13
Canale AS, Venev SV, Whitfield TW, Caffrey DR, Marasco WA, Schiffer CA, Kowalik TF, Jensen JD, Finberg RW, Zeldovich KB, Wang JP, Bolon DNA. Synonymous Mutations at the Beginning of the Influenza A Virus Hemagglutinin Gene Impact Experimental Fitness. J Mol Biol 2018; 430:1098-1115. [PMID: 29466705 DOI: 10.1016/j.jmb.2018.02.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Received: 10/20/2017] [Revised: 01/19/2018] [Accepted: 02/05/2018] [Indexed: 01/15/2023]
Abstract
The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses.
Affiliation(s)
- Aneth S Canale
  - Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Sergey V Venev
  - Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Troy W Whitfield
  - Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA; Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Daniel R Caffrey
  - Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Wayne A Marasco
  - Department of Cancer Immunology & Virology, Dana-Farber Cancer Institute, Harvard Medical School, 450 Brookline Avenue, Boston, MA 02215, USA
- Celia A Schiffer
  - Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Timothy F Kowalik
  - Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Jeffrey D Jensen
  - School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
- Robert W Finberg
  - Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Konstantin B Zeldovich
  - Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
- Jennifer P Wang
  - Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01655, USA.
- Daniel N A Bolon
  - Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01655, USA.
14
Lin D, Li L, Xie T, Yin Q, Saksena N, Wu R, Li W, Dai G, Ma J, Zhou X, Chen X. Codon usage variation of Zika virus: The potential roles of NS2B and NS4A in its global pandemic. Virus Res 2018; 247:71-83. [PMID: 29428601 DOI: 10.1016/j.virusres.2018.01.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 08/04/2017] [Revised: 01/26/2018] [Accepted: 01/29/2018] [Indexed: 12/18/2022]
Abstract
A comprehensive demonstration of Zika virus (ZIKV) molecular evolution is essential for understanding its adaptation and expansion in its recent pandemics. Despite several studies on mutations and codon usage in ZIKVs, the variations in codon usage patterns across individual genes and their biological implications remain unclear. Here, we performed a gene-by-gene comparison of the codon usage variation in ZIKVs of the African and Asian lineages. We found that, besides the evidence of positive selection (Ka/Ks >1) in the Asian lineage of the ZIKV genome, codon usage patterns were gene-specific and codon usage variation of ZIKV genes was possibly constrained by their individual functional features, such as transmembrane domains or antigenicity. In particular, the NS2B and NS4A genes showed distinct codon usage patterns, clearly separating them from the clusters of other genes in the correspondence analysis (CA). In the Asian lineage, the NS2B and NS4A genes showed the highest codon usage bias (ENC values: 51.01 ± 0.72 and 48.89 ± 0.99 respectively), and were subjected to the highest translation selection (ENCobs/ENCexp ratio: 0.847 ± 0.0297 and 0.828 ± 0.0233 respectively) in comparison to the African lineages of ZIKV. The CpG frequency of NS2B showed a gradual ascending trend in the Asian ZIKV lineages, while in NS4A it was constrained along with the expansion of the Asian lineage. Furthermore, between the African and Asian lineages, differentiated and specific over-represented codons were more prominent in NS2B and NS4A. Together, our study implies that ZIKVs are in the process of evolutionarily fine-tuning their codon usage, as seen in the recent pandemics, and NS2B and NS4A could have played a potential role in the molecular evolution of the Asian lineage and its establishment.
Affiliation(s)
- Dechun Lin
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, Guangdong, China; BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
- Liqiang Li
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
- Tian Xie
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Qingqing Yin
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Nitin Saksena
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China.
- Rangke Wu
- The School of Foreign Studies, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Wanyu Li
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Geyang Dai
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Jinmin Ma
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China; China National Genebank, BGI-Shenzhen, Shenzhen 518083, Guangdong, China; Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Xiaohong Zhou
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
- Xiaoguang Chen
- Department of Pathogen Biology, Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou 510515, Guangdong, China.
15
Codon usage and amino acid usage influence genes expression level. Genetica 2017; 146:53-63. [DOI: 10.1007/s10709-017-9996-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 04/08/2017] [Accepted: 10/09/2017] [Indexed: 11/30/2022]
16
Yi S, Li Y, Wang W. Selection shapes the patterns of codon usage in three closely related species of genus Misgurnus. Genomics 2017; 110:134-142. [PMID: 28911975 DOI: 10.1016/j.ygeno.2017.09.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Received: 06/20/2017] [Revised: 08/29/2017] [Accepted: 09/07/2017] [Indexed: 11/17/2022]
Abstract
Neutrality plots revealed that selection probably dominates codon bias, whereas mutation plays only a minor role, in shaping the codon bias in three loaches, Misgurnus anguillicaudatus, M. mohoity, and M. bipartitus. These three species also clearly showed similar tendencies in the preferential usage of codons. Nineteen, nine, and 14 preferred codon pairs and 179, 182, and 173 avoided codon pairs were also detected in M. anguillicaudatus, M. bipartitus, and M. mohoity, respectively, and the most frequently avoided type of cP3-cA1 dinucleotide in these species was nnUAnn. The expression-linked patterns of codon usage revealed that higher expression was associated with higher GC3, lower ENC, and a smaller proportion of amino acids with high size/complexity (S/C) scores in these three species. These results elucidate selectively driven codon bias in Misgurnus species, and reveal the potential importance of expression-mediated selection in shaping the genome evolution of fish.
Affiliation(s)
- Shaokui Yi
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan 430070, China
- Yanhe Li
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan 430070, China.
- Weimin Wang
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan 430070, China.
17
Szitenberg A, Cha S, Opperman CH, Bird DM, Blaxter ML, Lunt DH. Genetic Drift, Not Life History or RNAi, Determine Long-Term Evolution of Transposable Elements. Genome Biol Evol 2016; 8:2964-2978. [PMID: 27566762 PMCID: PMC5635653 DOI: 10.1093/gbe/evw208] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Accepted: 08/20/2016] [Indexed: 12/11/2022] Open
Abstract
Transposable elements (TEs) are a major source of genome variation across the branches of life. Although TEs may play an adaptive role in their host's genome, they are more often deleterious, and purifying selection is an important factor controlling their genomic loads. In contrast, life history, mating system, GC content, and RNAi pathways have been suggested to account for the disparity of TE loads in different species. Previous studies of fungal, plant, and animal genomes have reported conflicting results regarding the direction in which these genomic features drive TE evolution. Many of these studies have had limited power, however, because they studied taxonomically narrow systems, comparing only a limited number of phylogenetically independent contrasts, and did not address long-term effects on TE evolution. Here, we test the long-term determinants of TE evolution by comparing 42 nematode genomes spanning over 500 million years of diversification. This analysis includes numerous transitions between life history states and RNAi pathways, and evaluates whether these forces are sufficiently persistent to affect the long-term evolution of TE loads in eukaryotic genomes. Although we demonstrate statistical power to detect selection, we find no evidence that variation in these factors influences genomic TE loads across extended periods of time. In contrast, the effects of genetic drift appear to persist and control TE variation among species. We suggest that variation in the tested factors is largely inconsequential to the large differences in TE content observed between genomes, and only by these large-scale comparisons can we distinguish long-term and persistent effects from transient or random changes.
Affiliation(s)
- Amir Szitenberg
- Evolutionary Biology Group, School of Environmental Sciences, University of Hull, England, United Kingdom; The Dead Sea and Arava Science Center, Israel
- Soyeon Cha
- Department of Plant Pathology, North Carolina State University
- David M Bird
- Department of Plant Pathology, North Carolina State University
- Mark L Blaxter
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Scotland
- David H Lunt
- Evolutionary Biology Group, School of Environmental Sciences, University of Hull, England, United Kingdom
18
Karumathil S, Dirisala VR, Srinadh U, Nikhil V, Kumar NSS, Nair RR. Evolution of Synonymous Codon Usage in the Mitogenomes of Certain Species of Bilaterian Lineage with Special Reference to Chaetognatha. Bioinform Biol Insights 2016; 10:167-84. [PMID: 27688709 PMCID: PMC5034883 DOI: 10.4137/bbi.s38192] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 06/15/2016] [Revised: 08/17/2016] [Accepted: 08/28/2016] [Indexed: 11/20/2022] Open
Abstract
Chaetognatha is a minor phylum, comprising transparent marine invertebrates varying in size from 0.5 to 12 cm. The exact phylogenetic position of Chaetognatha in Metazoa has not been deciphered as some embryological characteristics place chaetognaths among deuterostomes and some morphological characteristics place these among protostomes. In this study, the major factors that drive synonymous codon usage bias (SCUB) in the mitogenomes of representative species of Chaetognatha and chosen species of other closely related phyla were analyzed. Spearman’s rank correlation analyses of nucleotide contents suggested that mutational pressure and selection were acting in all examined mitogenomes but with varying intensities. The quantification of SCUB using effective number of codons vs. GC composition at the third codon position (GC3) plot suggested that mutational pressure due to GC compositional constraints might be one of the major influencing forces driving the SCUB in all chaetognaths except Sagitta enflata. However, neutrality plots revealed no significant correlation between GC3 and cumulative GC content at first and second codon positions (GC12) in all other species, except in Daphnia pulex. The parity rule 2 bias plot showed that significant compositional differences existed between C and G, as well as between A and T, contents in most of the protein-coding genes (PCGs) and, comparatively, A and T contents were used more proportionally than C and G contents in all chosen mitogenomes. Chi-square analysis revealed the presence of putative optimal codons in all species, except in S. enflata. The correspondence analysis identified that mutational pressure and selection act on the mitogenomes of the selected chaetognaths and other phyla with varying intensities. The cluster analysis based on relative synonymous codon usage (RSCU) values revealed that RSCU variations in the PCGs of mitogenomes of chaetognaths are more comparable with those of protostomes. 
Apart from mutational pressure and selection, certain unknown selective forces might be acting on the PCGs in the analyzed mitogenomes as the phenomenon of SCUB could not be explained by mutational pressure, by selection, or by both.
Affiliation(s)
- Sudeesh Karumathil
- Aushmath Biosciences, Administrative office, Devaraj Corner, Vadavalli Post, Coimbatore, Tamil Nadu, India
- Vijaya R Dirisala
- Department of Biotechnology, Vignan's University (Vignan's Foundation for Science, Technology and Research University), Guntur, Andhra Pradesh, India
- Uthpala Srinadh
- Department of Biotechnology, Vignan's University (Vignan's Foundation for Science, Technology and Research University), Guntur, Andhra Pradesh, India
- Valaboju Nikhil
- Department of Biotechnology, Vignan's University (Vignan's Foundation for Science, Technology and Research University), Guntur, Andhra Pradesh, India
- N Satya Sampath Kumar
- Department of Biotechnology, Vignan's University (Vignan's Foundation for Science, Technology and Research University), Guntur, Andhra Pradesh, India
- Rahul R Nair
- Aushmath Biosciences, Administrative office, Devaraj Corner, Vadavalli Post, Coimbatore, Tamil Nadu, India
19
Whittle CA, Extavour CG. Expression-Linked Patterns of Codon Usage, Amino Acid Frequency, and Protein Length in the Basally Branching Arthropod Parasteatoda tepidariorum. Genome Biol Evol 2016; 8:2722-36. [PMID: 27017527 PMCID: PMC5630913 DOI: 10.1093/gbe/evw068] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Indexed: 12/28/2022] Open
Abstract
Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model.
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University
- Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University; Department of Molecular and Cellular Biology, Harvard University
20
Meyer MM. The role of mRNA structure in bacterial translational regulation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27301829 DOI: 10.1002/wrna.1370] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Received: 02/01/2016] [Revised: 05/12/2016] [Accepted: 05/16/2016] [Indexed: 01/08/2023]
Abstract
The characteristics of bacterial messenger RNAs (mRNAs) that influence translation efficiency provide many convenient handles for regulation of gene expression, especially when coupled with the processes of transcription termination and mRNA degradation. An mRNA's structure, especially near the site of initiation, has profound consequences for how readily it is translated. This property allows bacterial gene expression to be altered by changes to mRNA structure induced by temperature, or interactions with a wide variety of cellular components including small molecules, other RNAs (such as sRNAs and tRNAs), and RNA-binding proteins. This review discusses the links between mRNA structure and translation efficiency, and how mRNA structure is manipulated by conditions and signals within the cell to regulate gene expression. The range of RNA regulators discussed follows a continuum from very complex tertiary structures such as riboswitch aptamers and ribosomal protein-binding sites to thermosensors and mRNA:sRNA interactions that involve only base-pairing interactions. Furthermore, the high degrees of diversity observed for both mRNA structures and the mechanisms by which inhibition of translation occurs have significant consequences for understanding the evolution of bacterial translational regulation. WIREs RNA 2017, 8:e1370. doi: 10.1002/wrna.1370
21
Hauber DJ, Grogan DW, DeBry RW. Mutations to Less-Preferred Synonymous Codons in a Highly Expressed Gene of Escherichia coli: Fitness and Epistatic Interactions. PLoS One 2016; 11:e0146375. [PMID: 26727272 PMCID: PMC4699635 DOI: 10.1371/journal.pone.0146375] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 07/14/2015] [Accepted: 12/16/2015] [Indexed: 01/11/2023] Open
Abstract
Codon-tRNA coevolution to maximize protein production has been, until recently, the dominant hypothesis to explain codon-usage bias in highly expressed bacterial genes. Two predictions of this hypothesis are 1) selection is weak; and 2) similar silent replacements at different codons should have similar fitness consequence. We used an allele-replacement strategy to change five specific 3rd-codon-position (silent) sites in the highly expressed Escherichia coli ribosomal protein gene rplQ from the wild type to a less-preferred alternative. We introduced the five mutations within a 10-codon region. Four of the silent sites were chosen to test the second prediction, with a CTG to CTA mutation being introduced at two closely linked leucine codons and an AAA to AAG mutation being introduced at two closely linked lysine codons. We also introduced a fifth silent mutation, a GTG to GTA mutation at a valine codon in the same genic region. We measured the fitness effect of the individual mutations by competing each single-mutant strain against the parental wild-type strain, using a disrupted form of the araA gene as a selectively neutral phenotypic marker to distinguish between strains in direct competition experiments. Three of the silent mutations had a fitness effect of |s| > 0.02, which is contradictory to the prediction that selection will be weak. The two leucine mutations had significantly different fitness effects, as did the two lysine mutations, contradictory to the prediction that similar mutations at different codons should have similar fitness effects. We also constructed a strain carrying all five silent mutations in combination. Its fitness effect was greater than that predicted from the individual fitness values, suggesting that negative synergistic epistasis acts on the combination allele.
Affiliation(s)
- David J. Hauber
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
- Dennis W. Grogan
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
- Ronald W. DeBry
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
22
Wang YY, Chang XL, Tao ZY, Wang XL, Jiao YM, Chen Y, Qi WJ, Xia H, Yang XD, Sun X, Shen JL, Fang Q. Optimized codon usage enhances the expression and immunogenicity of DNA vaccine encoding Taenia solium oncosphere TSOL18 gene. Mol Med Rep 2015; 12:281-8. [PMID: 25738605 DOI: 10.3892/mmr.2015.3387] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 03/20/2014] [Accepted: 01/21/2015] [Indexed: 11/06/2022] Open
Abstract
Cysticercosis due to larval cysts of Taenia solium is a serious public health problem affecting humans in numerous regions worldwide. The oncospheral stage-specific TSOL18 antigen is a promising candidate for an anti-cysticercosis vaccine. It has been reported that the immunogenicity of the DNA vaccine may be enhanced through codon optimization of candidate genes. The aim of the present study was to further increase the efficacy of the cysticercosis DNA vaccine; therefore, a codon-optimized recombinant expression plasmid pVAX1/TSOL18 was developed in order to enhance expression and immunogenicity of TSOL18. The gene encoding TSOL18 of Taenia solium was optimized, and the resulting opt-TSOL18 gene was amplified and expressed. The results of the present study showed that the codon-optimized TSOL18 gene was successfully expressed in CHO-K1 cells, and mice vaccinated with opt-TSOL18 recombinant expression plasmids demonstrated opt-TSOL18 expression in muscle fibers, as determined by immunohistochemistry. In addition, the codon-optimized TSOL18 gene produced a significantly greater effect compared with that of TSOL18, and active spleen cells were markedly stimulated in vaccinated mice. 3H-thymidine incorporation was significantly greater in the opt-TSOL18 group compared with that of the TSOL18, pVAX and blank control groups (P<0.01). In conclusion, the eukaryotic expression vector containing the codon-optimized TSOL18 gene was successfully constructed and was confirmed to be expressed in vivo and in vitro. The expression and immunogenicity of the codon-optimized TSOL18 gene were markedly greater compared with that of the un-optimized gene. Therefore, these results may provide the basis for an optimized TSOL18 gene vaccine against cysticercosis.
Affiliation(s)
- Yuan-Yuan Wang
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Xue-Lian Chang
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Zhi-Yong Tao
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Xiao-Li Wang
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Yu-Meng Jiao
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Yong Chen
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Wen-Juan Qi
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Hui Xia
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Xiao-Di Yang
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Xin Sun
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
- Ji-Long Shen
- Department of Pathobiology, Anhui Medical University, Hefei, Anhui 230032, P.R. China
- Qiang Fang
- Department of Microbiology and Parasitology, Bengbu Medical College, Bengbu, Anhui 233030, P.R. China
23
Wei L, He J, Jia X, Qi Q, Liang Z, Zheng H, Ping Y, Liu S, Sun J. Analysis of codon usage bias of mitochondrial genome in Bombyx mori and its relation to evolution. BMC Evol Biol 2014; 14:262. [PMID: 25515024 PMCID: PMC4276022 DOI: 10.1186/s12862-014-0262-4] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Received: 08/20/2014] [Accepted: 12/09/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Synonymous codon usage bias (SCUB) is an inevitable phenomenon in organismic taxa, generally referring to differences in the occurrence frequency of codons across different species or within the genome of the same species. SCUB arises to varying degrees under pressure from natural selection, mutation bias, and other factors. It is also of great significance to gene expression and species evolution; however, a systematic investigation of codon usage in Bombyx mori (B. mori) has not yet been reported. Moreover, the reasons contributing to the bias and the relationship between the bias and the evolution of B. mori remain unclear. RESULTS The comparison of the codon usage pattern between the genomic DNA (gDNA) and the mitochondrial DNA (mtDNA) from B. mori suggests that mtDNA has a higher level of codon bias. Furthermore, the correspondence analysis suggests that natural selection, such as gene length, gene function and translational selection, dominates the codon preference of mtDNA, while the composition constraints for mutation bias play only a minor role. Additionally, the clustering results of the silkworm superfamily suggest a lack of explicitness in the relationship between the codon usage of the mitogenome and species evolution. CONCLUSIONS Among the complicated factors leading to codon bias, natural selection is found to play a major role in shaping the high bias in the mtDNA of B. mori from our current data. Although the cluster analysis reveals that codon bias correlates little with species evolution, a detailed analysis of codon usage of the mitogenome provides better insight into the evolutionary relationships in Lepidoptera. However, more new methods and data are needed to investigate the relationship between the mtDNA bias and evolution.
Affiliation(s)
- Lei Wei
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Jian He
- Guangzhou East Campus Lab Center, Sun Yat-sen University, Guangzhou, 510006, China.
- Xian Jia
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Qi Qi
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Zhisheng Liang
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Hao Zheng
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Yao Ping
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
- Shuyu Liu
- Guangzhou East Campus Lab Center, Sun Yat-sen University, Guangzhou, 510006, China.
- Jingchen Sun
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
24
Zhou HQ, Ning LW, Zhang HX, Guo FB. Analysis of the relationship between genomic GC Content and patterns of base usage, codon usage and amino acid usage in prokaryotes: similar GC content adopts similar compositional frequencies regardless of the phylogenetic lineages. PLoS One 2014; 9:e107319. [PMID: 25255224 PMCID: PMC4177787 DOI: 10.1371/journal.pone.0107319] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Received: 05/09/2014] [Accepted: 08/08/2014] [Indexed: 11/19/2022] Open
Abstract
The GC contents of 2670 prokaryotic genomes that belong to diverse phylogenetic lineages were analyzed in this paper. These genomes had GC contents that ranged from 13.5% to 74.9%. We analyzed the distance of base frequencies at the three codon positions, codon frequencies, and amino acid compositions across genomes with respect to the differences in the GC content of these prokaryotic species. We found that although the phylogenetic lineages were remote among some species, a similar genomic GC content forced them to adopt similar base usage patterns at the three codon positions, codon usage patterns, and amino acid usage patterns. Our work demonstrates that in prokaryotic genomes: a) base usage, codon usage, and amino acid usage change with GC content with a linear correlation; b) the distance of each usage has a linear correlation with the GC content difference; and c) GC content is more essential than phylogenetic lineage in determining base usage, codon usage, and amino acid usage. This work is exceptional in that we adopted intuitively graphic methods for all analyses, and we used these analyses to examine as many as 2670 prokaryotes. We hope that this work is helpful for understanding common features in the organization of microbial genomes.
Affiliation(s)
- Hui-Qi Zhou
- Center of Bioinformatics and Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu, China
- Lu-Wen Ning
- Center of Bioinformatics and Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu, China
- Hui-Xiong Zhang
- Center of Bioinformatics and Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu, China
- Feng-Biao Guo
- Center of Bioinformatics and Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu, China
25
Evidence that natural selection on codon usage in Drosophila pseudoobscura varies across codons. G3-GENES GENOMES GENETICS 2014; 4:681-92. [PMID: 24531731 PMCID: PMC4059240 DOI: 10.1534/g3.114.010488] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 12/31/2022]
Abstract
Like other species of Drosophila, Drosophila pseudoobscura has a distinct bias toward the usage of C- and G-ending codons. Previous studies have indicated that this bias is due, at least in part, to natural selection. Codon bias clearly differs among amino acids (and other codon classes) in Drosophila, which may reflect differences in the intensity of selection on codon usage. Ongoing natural selection on synonymous codon usage should be reflected in the shapes of the site frequency spectra of derived states at polymorphic positions. Specifically, regardless of other demographic effects on the spectrum, it should be shifted toward higher values for changes from less-preferred to more-preferred codons, and toward lower values for the converse. If the intensity of natural selection is increased, shifts in the site frequency spectra should be more pronounced. A total of 33,729 synonymous polymorphic sites on Chromosome 2 in D. pseudoobscura were analyzed. Shifts in the site frequency spectra are consistent with differential intensity of natural selection on codon usage, with stronger shifts associated with higher codon bias. The shifts, in general, are greater for polymorphic synonymous sites than for polymorphic intron sites, also consistent with natural selection. However, unlike observations in D. melanogaster, codon bias is not reduced in areas of low recombination in D. pseudoobscura; the site frequency spectrum signal for selection on codon usage remains strong in these regions. However, diversity is reduced, as expected. It is possible that estimates of low recombination reflect a recent change in recombination rate.
|
26
|
Strong purifying selection at synonymous sites in D. melanogaster. PLoS Genet 2013; 9:e1003527. [PMID: 23737754 PMCID: PMC3667748 DOI: 10.1371/journal.pgen.1003527] [Citation(s) in RCA: 144] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 04/08/2013] [Indexed: 11/19/2022] Open
Abstract
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated.
|
27
|
A comparative analysis on the synonymous codon usage pattern in viral functional genes and their translational initiation region of ASFV. Virus Genes 2012; 46:271-9. [PMID: 23161403 DOI: 10.1007/s11262-012-0847-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2012] [Accepted: 11/01/2012] [Indexed: 01/21/2023]
Abstract
The synonymous codon usage pattern of African swine fever virus (ASFV), the degree of similarity in synonymous codon usage between this virus and some organisms, and the synonymous codon usage bias in the translation initiation region of viral functional genes across the whole ASFV genome were investigated by simple statistical analyses. Although both GC12% (the GC content at the first and second codon positions) and GC3% (the GC content at the third codon position) of viral functional genes fluctuate widely, the significant correlations between GC12% and GC3%, and between GC3% and the first principal axis of a principal component analysis of the relative synonymous codon usage of the viral functional genes, imply that mutation pressure plays an important role in the synonymous codon usage pattern of ASFV. Turning to the synonymous codon usage of this virus, codons ending in U/A predominate within the synonymous codon family for each amino acid, and a weak codon usage bias in both leading and lagging strands suggests that strand compositional asymmetry does not take part in shaping codon usage in ASFV. The interaction between the absolute codon usage bias and GC3% suggests that selective forces other than mutation pressure also take part in shaping codon usage. Notably, the degree of similarity in codon usage between ASFV and the soft tick is higher than that between the virus and the pig, suggesting that the soft tick plays a more important role than the pig in the codon usage pattern of ASFV. The translational initiation regions of the viral functional genes generally show a strong tendency to select synonymous codons with low GC content, suggesting that the synonymous codon usage bias caused by translational selection from the host takes part in modulating the translation initiation efficiency of ASFV functional genes.
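The two composition measures defined in this abstract, GC12% and GC3%, can be computed for a single coding sequence as in the sketch below. The function name and the toy sequence are assumptions made for illustration. Some studies exclude start, stop, or single-codon amino acids before counting; this sketch counts every codon.

```python
def gc_by_codon_position(cds):
    """Return (GC12%, GC3%) for an in-frame coding sequence.

    GC12% is the GC content at first and second codon positions combined;
    GC3% is the GC content at third codon positions.
    """
    cds = cds.upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("sequence must be non-empty and a multiple of 3 long")
    n_codons = len(cds) // 3
    gc = [0, 0, 0]  # G/C counts at codon positions 1, 2 and 3
    for i, base in enumerate(cds):
        if base in "GC":
            gc[i % 3] += 1
    gc12 = 100.0 * (gc[0] + gc[1]) / (2 * n_codons)
    gc3 = 100.0 * gc[2] / n_codons
    return gc12, gc3


# Hypothetical four-codon sequence: ATG GCC AAA TAA
gc12, gc3 = gc_by_codon_position("ATGGCCAAATAA")
print(f"GC12% = {gc12:.1f}, GC3% = {gc3:.1f}")
```

Computing the pair for every gene and correlating GC12% with GC3% across genes gives the kind of relationship the abstract interprets as evidence for mutation pressure.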
|
28
|
β-tubulin paralogue tubC is frequently misidentified as the benA gene in Aspergillus section Nigri taxonomy: primer specificity testing and taxonomic consequences. Persoonia - Molecular Phylogeny and Evolution of Fungi 2012; 29:1-10. [PMID: 23606761 PMCID: PMC3589786 DOI: 10.3767/003158512x658123] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2012] [Accepted: 08/06/2012] [Indexed: 11/25/2022]
Abstract
β-tubulin (benA, tub-2) and calmodulin (caM) are crucial genes in the taxonomy of Aspergillus section Nigri. Widely used β-tubulin primers are not specific for the benA gene for some taxa and preferentially amplify the tubC paralogue. Sequences of the tubC paralogue are widely combined with benA sequences in recent taxonomical works as well as other works, resulting in incongruent trees. In this study we newly provide benA sequences for several ex-type strains, which were characterised using the tubC gene only. We designed a highly specific forward primer to benA designated Ben2f for use in Aspergillus section Nigri, and tested specificity of numerous primer combinations to β-tubulin paralogues. The primer pairs with the highest specificity to the benA gene and functional across species in section Nigri include Ben2f/Bt2b, Ben2f/T22 and T10/T22. We also provide tools based on codon usage bias analysis that reliably distinguish both paralogues. Exon/intron arrangement is the next distinctive characteristic, although this tool is not valid outside section Nigri. The species identity of taxa from the A. aculeatus clade used in previous molecular studies was revised using combined molecular data (ITS, benA, caM). These data together with two different PCR-fingerprinting methods indicated that A. japonicus should be treated as a synonym of A. violaceofuscus. Similarly, A. fijiensis is reduced to synonymy with A. brunneoviolaceus.
|
29
|
Wu H, Zhang Z, Hu S, Yu J. On the molecular mechanism of GC content variation among eubacterial genomes. Biol Direct 2012; 7:2. [PMID: 22230424 PMCID: PMC3274465 DOI: 10.1186/1745-6150-7-2] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2011] [Accepted: 01/10/2012] [Indexed: 12/02/2022] Open
Abstract
Background As a key parameter of genome sequence variation, the GC content of bacterial genomes has been investigated for over half a century, and many hypotheses have been put forward to explain this GC content variation and its relationship to other fundamental processes. Previously, we classified eubacteria into dnaE-based groups (the dimeric combination of DNA polymerase III alpha subunits), according to a hypothesis where GC content variation is essentially governed by genome replication and DNA repair mechanisms. Further investigation led to the discovery that two major mutator genes, polC and dnaE2, may be responsible for genomic GC content variation. Consequently, an in-depth analysis was conducted to evaluate various potential intrinsic and extrinsic factors in association with GC content variation among eubacterial genomes. Results Mutator genes, especially those with dominant effects on the mutation spectra, are biased towards either GC or AT richness, and they alter genomic GC content in the two opposite directions. Increased bacterial genome size (or gene number) appears to rely on increased genomic GC content; however, it is unclear whether the changes are directly related to certain environmental pressures. Certain environmental and bacteriological features are related to GC content variation, but their trends are more obvious when analyzed under the dnaE-based grouping scheme. Most terrestrial, plant-associated, and nitrogen-fixing bacteria are members of the dnaE1|dnaE2 group, whereas most pathogenic or symbiotic bacteria in insects, and those dwelling in aquatic environments, are largely members of the dnaE1|polV group. Conclusion Our studies provide several lines of evidence indicating that DNA polymerase III α subunit and its isoforms participating in either replication (such as polC) or SOS mutagenesis/translesion synthesis (such as dnaE2), play dominant roles in determining GC variability. 
Other environmental or bacteriological factors, such as genome size, temperature, oxygen requirement, and habitat, either play subsidiary roles or rely indirectly on different mutator genes to fine-tune the GC content. These results provide a comprehensive insight into mechanisms of GC content variation and the robustness of eubacterial genomes in adapting to their ever-changing environments over billions of years. Reviewers This paper was reviewed by Nicolas Galtier, Adam Eyre-Walker, and Eugene Koonin.
Affiliation(s)
- Hao Wu
- James D Watson Institute of Genome Sciences, Zhejiang University, Hangzhou 310007, China
|
30
|
Ma F, Zhuang Y, Li Y, Xu X, Chen X. Usage Patterns of Codons Versus Complementary Codons Among Cellular Organisms and Organelles. J BIOL SYST 2011. [DOI: 10.1142/s0218339003000944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The genetic code is one of the most important biological languages in communication between DNA and protein, so researchers have paid great attention to the usage bias of synonymous codons. Based on Grosjean and Ikemura's "optimum combination of codon-anticodon complex" and "translation efficiency" hypotheses, in this paper we propose that a biased codon usage is identical to the usage preference of its corresponding complementary codon. To test the hypothesis and reveal usage patterns between codons and their corresponding complementary codons among different cellular organisms and organelles, the usage data of both codons and corresponding complementary codons from 28 cellular organisms and 20 organelles were analyzed. The results showed that: (1) there is a significantly positive correlation between codons and their complementary ones in most cellular organisms, chloroplasts and mitochondria; (2) all 32 single codon versus complementary codon pairs shared similar usage correlation patterns, with the numbers of significantly positive, unrelated and significantly negative pairs being 18, 12 and 2 within 28 cellular organisms and 11, 17 and 4 within 20 organelles, respectively, and some usage patterns of the 32 single pairs in cellular organisms are highly consistent with those of the two kinds of organelles, which strongly implies that their codon usage has undergone similar evolutionary selection in wobbling and modification; (3) the codon-frequency tree agreed fairly well with the traditional one. These results demonstrated the validity of our hypothesis, and indicated the usefulness of the correlation between codon and complementary codon in elucidating molecular evolutionary mechanisms.
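The pairing of codons with their complementary codons can be sketched as follows. The abstract does not say whether "complementary" means the reverse complement or the position-by-position complement, so this sketch assumes the reverse complement; both readings give 32 pairs. All function names and the usage table format (a dict of DNA codon to frequency) are hypothetical.

```python
from itertools import product
from math import sqrt

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(codon):
    """Reverse complement of a DNA codon (assumed meaning of 'complementary')."""
    return "".join(COMPLEMENT[base] for base in reversed(codon))


def codon_pairs():
    """All unordered (codon, reverse-complement) pairs; no codon pairs with itself."""
    pairs = set()
    for bases in product("ACGT", repeat=3):
        codon = "".join(bases)
        pairs.add(tuple(sorted((codon, reverse_complement(codon)))))
    return sorted(pairs)


def pearson(xs, ys):
    """Plain Pearson correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)


def complementary_usage_correlation(usage):
    """Correlate each codon's usage with its reverse complement's usage."""
    codons = ["".join(bases) for bases in product("ACGT", repeat=3)]
    xs = [usage.get(c, 0.0) for c in codons]
    ys = [usage.get(reverse_complement(c), 0.0) for c in codons]
    return pearson(xs, ys)


print(len(codon_pairs()))  # number of codon / complementary-codon pairs
```

Passing one organism's codon frequency table to `complementary_usage_correlation` gives a single within-genome coefficient, loosely corresponding to result (1) of the abstract.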
Affiliation(s)
- Fei Ma
- Institute of Bioinformatics, Tsinghua University, Beijing 100084, China
- Yonglong Zhuang
- Institute of Bioinformatics, Tsinghua University, Beijing 100084, China
- Yanda Li
- Institute of Bioinformatics, Tsinghua University, Beijing 100084, China
- Xiaofeng Xu
- Life Science College, Nanjing Normal University, Nanjing 210097, China
- Xueping Chen
- College of Economics and Technology, University of Science and Technology of China, Hefei 230052, China
|
31
|
Explaining complex codon usage patterns with selection for translational efficiency, mutation bias, and genetic drift. Proc Natl Acad Sci U S A 2011; 108:10231-6. [PMID: 21646514 DOI: 10.1073/pnas.1016719108] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The genetic code is redundant with most amino acids using multiple codons. In many organisms, codon usage is biased toward particular codons. Understanding the adaptive and nonadaptive forces driving the evolution of codon usage bias (CUB) has been an area of intense focus and debate in the fields of molecular and evolutionary biology. However, their relative importance in shaping genomic patterns of CUB remains unsolved. Using a nested model of protein translation and population genetics, we show that observed gene level variation of CUB in Saccharomyces cerevisiae can be explained almost entirely by selection for efficient ribosomal usage, genetic drift, and biased mutation. The correlation between observed codon counts within individual genes and our model predictions is 0.96. Although a variety of factors shape patterns of CUB at the level of individual sites within genes, our results suggest that selection for efficient ribosome usage is a central force in shaping codon usage at the genomic scale. In addition, our model allows direct estimation of codon-specific mutation rates and elongation times and can be readily applied to any organism with high-throughput expression datasets. More generally, we have developed a natural framework for integrating models of molecular processes with population genetics models to quantitatively estimate parameters underlying fundamental biological processes, such as protein translation.
|
32
|
Davis JJ, Olsen GJ. Characterizing the native codon usages of a genome: an axis projection approach. Mol Biol Evol 2010; 28:211-21. [PMID: 20679093 PMCID: PMC3002238 DOI: 10.1093/molbev/msq185] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Codon usage can provide insights into the nature of the genes in a genome. Genes that are “native” to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias “typical” usage to a more biased “high-expression” usage characteristic of genes encoding abundant proteins. Genes that differ from these native codon usages are candidates for foreign genes that have been recently acquired by horizontal gene transfer. In this study, we present a method for characterizing the codon usages of native genes—both typical and highly expressed—within a genome. Each gene is evaluated relative to a half line (or axis) in a 59D space of codon usage. The axis begins at the modal codon usage, the usage that matches the largest number of genes in the genome, and it passes through a point representing the codon usage of a set of genes with expression-related bias. A gene whose codon usage matches (does not significantly differ from) a point on this axis is a candidate native gene, and the location of its projection onto the axis provides a general estimate of its expression level. A gene that differs significantly from all points on the axis is a candidate foreign gene. This automated approach offers significant improvements over existing methods. We illustrate this by analyzing the genomes of Pseudomonas aeruginosa PAO1 and Bacillus anthracis A0248, which can be difficult to analyze with commonly used methods due to their biased base compositions. Finally, we use this approach to measure the proportion of candidate foreign genes in 923 bacterial and archaeal genomes. The organisms with the most homogeneous genomes (containing the fewest candidate foreign genes) are mostly endosymbionts and parasites, though with exceptions that include Pelagibacter ubique and Beutenbergia cavernae. 
The organisms with the most heterogeneous genomes (containing the most candidate foreign genes) include members of the genera Bacteroides, Corynebacterium, Desulfotalea, Neisseria, Xylella, and Thermobaculum.
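The geometric idea of evaluating a gene against a half line in codon usage space can be sketched as below. This is a plain Euclidean illustration and is an assumption: the abstract describes matching by statistical significance, which this sketch does not attempt to reproduce. The function name, argument names, and the two-dimensional toy points are hypothetical.

```python
from math import sqrt


def project_onto_half_line(point, origin, through):
    """Project `point` onto the half line starting at `origin` and passing
    through `through`.

    Returns (t, distance): t >= 0 is the position along the axis in units of
    the origin-to-through length, and distance is the Euclidean distance from
    the point to its projection.
    """
    direction = [b - a for a, b in zip(origin, through)]
    offset = [p - a for a, p in zip(origin, point)]
    length_sq = sum(d * d for d in direction)
    if length_sq == 0:
        raise ValueError("origin and through must differ")
    # Clamp at zero: the axis is a half line, so it does not extend behind
    # its starting point.
    t = max(0.0, sum(o * d for o, d in zip(offset, direction)) / length_sq)
    foot = [a + t * d for a, d in zip(origin, direction)]
    distance = sqrt(sum((p - f) ** 2 for p, f in zip(point, foot)))
    return t, distance


# Toy 2D stand-in for the 59D space: modal usage at the origin,
# high-expression usage at (1, 0).
t, distance = project_onto_half_line([0.5, 0.2], [0.0, 0.0], [1.0, 0.0])
print(t, distance)
```

In this picture, `t` plays the role of the expression-related position along the axis and `distance` the role of the departure from native usage, with a large departure flagging a candidate foreign gene.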
Affiliation(s)
- James J Davis
- Department of Microbiology, University of Illinois at Urbana-Champaign
|
33
|
Synonymous codon usage analysis of thirty two mycobacteriophage genomes. Adv Bioinformatics 2010:316936. [PMID: 20150956 PMCID: PMC2817497 DOI: 10.1155/2009/316936] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2009] [Accepted: 10/27/2009] [Indexed: 11/17/2022] Open
Abstract
Synonymous codon usage of the protein-coding genes of thirty-two completely sequenced mycobacteriophage genomes was studied using multivariate statistical analysis. Compositional bias is identified as one of the major factors influencing codon usage. Codons ending in either C or G are preferred in highly expressed genes, and among these C-ending codons are strongly preferred over G-ending codons. A strong negative correlation between the effective number of codons (Nc) and GC3s content was also observed, showing that codon usage is affected by gene nucleotide composition. Translational selection, operating at the level of translational accuracy, is also identified as playing a role in shaping codon usage. A high level of heterogeneity is seen among and between the genomes. Gene length is also identified as influencing codon usage in 11 of the 32 phage genomes. Mycobacteriophage Cooper is identified as a highly biased genome with better translation efficiency that compares well with the host-specific tRNA genes.
|
34
|
Raymond A, Lovell S, Lorimer D, Walchli J, Mixon M, Wallace E, Thompkins K, Archer K, Burgin A, Stewart L. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer. BMC Biotechnol 2009; 9:37. [PMID: 19383143 PMCID: PMC2680836 DOI: 10.1186/1472-6750-9-37] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2008] [Accepted: 04/21/2009] [Indexed: 01/29/2023] Open
Abstract
Background With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results In this report, we compare heterologous protein expression levels from native sequences with those of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α), a viral polymerase (HCV NS5B), and a bacterial structural protein (FtsZ) was expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.
Affiliation(s)
- Amy Raymond
- deCODE biostructures Inc, 7869 NE Day Road West, Bainbridge Island, WA 98110, USA.
|
35
|
Zhou M, Long W, Li X. Patterns of synonymous codon usage bias in chloroplast genomes of seed plants. ACTA ACUST UNITED AC 2008. [DOI: 10.1007/s11632-008-0047-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
36
|
Llopart A, Mabillé A, Peters-Hall JR, Comeron JM, Kliman RM. A new test for selection applied to codon usage in Drosophila simulans and D. mauritiana. J Mol Evol 2008; 66:224-31. [PMID: 18288436 DOI: 10.1007/s00239-008-9072-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2007] [Revised: 12/04/2007] [Accepted: 01/25/2008] [Indexed: 11/26/2022]
Abstract
In many organisms, synonymous codon usage is biased by a history of natural selection. However, codon bias, itself, does not indicate that selection is ongoing; it may be a vestige of past selection. Simple statistical tests have been devised to infer ongoing selection on codon usage by comparing the derived state frequency spectra at polymorphic sites segregating either derived preferred codons or derived unpreferred codons; if selection is effective, the frequency of derived states should be higher in the former. We propose a new test that uses the inferred degree of preference, essentially calculating the correlation of derived state frequency and the difference in preference between the derived and the ancestral states; the correlation should be positive if selection is effective. When implementing the test, derived and ancestral states can be assigned by parsimony or on the basis of relative probability. In either case, statistical significance is estimated by a simple permutation test. We explored the statistical power of the test by sampling polymorphism data from 14 loci in 16 strains of D. simulans, finding that the test retains 80% power even when quite a few of the data are discarded. The power of the test likely reflects better use of multiple features of the data, combining population frequencies of polymorphic variants and quantitative estimates of codon preferences. We also applied this novel test to 14 newly sequenced loci in five strains of D. mauritiana, showing for the first time ongoing selection on codon usage in this species.
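The test described in this abstract, a correlation between derived-state frequency and the preference difference between derived and ancestral codons with significance from permutation, can be sketched as follows. The choice of Pearson correlation, the one-sided p-value, the function names, and the toy data are all assumptions; the abstract does not specify these details.

```python
import random
from math import sqrt


def pearson(xs, ys):
    """Plain Pearson correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)


def permutation_test(derived_freqs, preference_diffs, n_perm=2000, seed=0):
    """One-sided permutation test for a positive correlation.

    preference_diffs holds (preference of derived codon) minus
    (preference of ancestral codon) for each polymorphic site.
    """
    rng = random.Random(seed)
    observed = pearson(derived_freqs, preference_diffs)
    shuffled = list(preference_diffs)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any link between frequency and preference
        if pearson(derived_freqs, shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    p_value = (hits + 1) / (n_perm + 1)
    return observed, p_value


# Hypothetical toy data for six polymorphic sites.
freqs = [0.1, 0.2, 0.3, 0.5, 0.7, 0.9]
diffs = [-0.8, -0.5, -0.1, 0.2, 0.6, 0.9]
r, p = permutation_test(freqs, diffs)
print(f"r = {r:.3f}, one-sided p = {p:.4f}")
```

A positive correlation with a small p-value is the pattern the abstract associates with effective ongoing selection; a rank-based coefficient could be substituted without changing the permutation logic.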
Affiliation(s)
- Ana Llopart
- Department of Biological Sciences, University of Iowa, Iowa City, IA 52242, USA
|
37
|
Sadovsky MG, Putintseva YA. On the correlation between the synonymy of codon usage and taxonomy. DOKL BIOCHEM BIOPHYS 2008; 416:243-4. [PMID: 18064822 DOI: 10.1134/s1607672907050043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Affiliation(s)
- M G Sadovsky
- Institute of Computational Modeling, Siberian Branch of the Russian Academy of Sciences, Akademgorodok, Krasnoyarsk, 660036, Russia
|
38
|
Sahdev S, Khattar SK, Saini KS. Production of active eukaryotic proteins through bacterial expression systems: a review of the existing biotechnology strategies. Mol Cell Biochem 2007. [PMID: 17874175 DOI: 10.1007/s11010-007-9603-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Among the various expression systems employed for the over-production of proteins, bacteria remain the favorite choice of protein biochemists. However, even today, due to the lack of post-translational modification machinery in bacteria, recombinant eukaryotic protein production poses an immense challenge, which invariably leads to the production of biologically inactive protein in this host. A number of techniques are cited in the literature, which describe the conversion of inactive protein, expressed as an insoluble fraction, into a soluble and active form. Overall, we have divided these methods into three major groups: Group-I, where the factors influencing the formation of the insoluble fraction are modified through a stringent control of the cellular milieu, thereby leading to the expression of recombinant protein as a soluble moiety; Group-II, where protein is refolded from the inclusion bodies and thereby target protein modification is avoided; Group-III, where the target protein is engineered to achieve soluble expression through fusion protein technology. Even within the same family of proteins (e.g., tyrosine kinases), optimization of the standard operating protocol (SOP) may still be required for each protein's over-production at a pilot scale in Escherichia coli. However, once standardized, this procedure can be made amenable to the industrial production of that particular protein with minimum alterations.
Affiliation(s)
- Sudhir Sahdev
- Department of Biotechnology & Bioinformatics, New Drug Discovery Research, Ranbaxy Research Laboratories-R&D-3, 20-Sector 18 Udyog Vihar, Gurgaon, India.
|
39
|
Sahdev S, Khattar SK, Saini KS. Production of active eukaryotic proteins through bacterial expression systems: a review of the existing biotechnology strategies. Mol Cell Biochem 2007; 307:249-64. [PMID: 17874175 DOI: 10.1007/s11010-007-9603-6] [Citation(s) in RCA: 255] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2007] [Accepted: 08/27/2007] [Indexed: 12/13/2022]
|
40
|
Mukhopadhyay P, Basak S, Ghosh TC. Nature of selective constraints on synonymous codon usage of rice differs in GC-poor and GC-rich genes. Gene 2007; 400:71-81. [PMID: 17629420 DOI: 10.1016/j.gene.2007.05.027] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2007] [Revised: 04/28/2007] [Accepted: 05/31/2007] [Indexed: 10/23/2022]
Abstract
Synonymous codon usage and cellular tRNA abundance are thought to have co-evolved to optimize translational efficiency in highly expressed genes. In this communication, taking advantage of publicly available gene expression data for rice and Arabidopsis, we demonstrate that tRNA gene copy number is not the only driving force favoring translational selection in all highly expressed genes of rice. We found that the forces favoring translational selection differ between GC-rich and GC-poor classes of genes. Supporting our results, we also showed that in highly expressed genes of the GC-poor class there is a perfect correspondence between the majority of preferred codons and tRNA gene copy number, which confers translational efficiency on this group of genes. However, tRNA gene copy number is not fully consistent with models of translational selection in the GC-rich group of genes, where constraints on mRNA secondary structure play a role in optimizing codon usage in highly expressed genes.
Affiliation(s)
- Pamela Mukhopadhyay
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata-700 054, India
|
41
|
Kliman RM, Bernal CA. Unusual usage of AGG and TTG codons in humans and their viruses. Gene 2005; 352:92-9. [PMID: 15922516 DOI: 10.1016/j.gene.2005.04.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2004] [Revised: 12/28/2004] [Accepted: 04/01/2005] [Indexed: 11/22/2022]
Abstract
Prior analysis of human protein-coding DNA sequences has identified local base composition as the primary predictor of synonymous codon usage. However, in many organisms, codon usage is influenced by natural selection, particularly for efficient expression of functional gene products. Because viruses are expected to evolve codon usage in the context of their host's molecular machinery, their genomes provide another window into the forces that guide their host's molecular evolution. Factor analysis was performed on codon usage of 16,654 genes annotated in Build 34 of the human genome, and the primary factor was correlated strongly with local base composition. However, two codons, AGG and TTG, rose in frequency as all other C- and G-ending codons decreased in frequency. These two codons were the only C- or G-ending codons with usages that negatively correlated with gene expression. Variation among viruses in codon usage also strongly reflects variation in base composition and, again, AGG and TTG decrease in frequency as all other C- and G-ending codons increase in frequency. It appears that usages of these two codons cannot be explained by local compositional biases, implying a more direct role of natural selection on codon usage in humans.
Affiliation(s)
- Richard M Kliman
- Department of Biological Sciences, Cedar Crest College, 100 College Drive, Allentown, PA, USA.
|
42
|
Chanda I, Pan A, Dutta C. Proteome composition in Plasmodium falciparum: higher usage of GC-rich nonsynonymous codons in highly expressed genes. J Mol Evol 2005; 61:513-23. [PMID: 16044241 DOI: 10.1007/s00239-005-0023-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2005] [Accepted: 04/19/2005] [Indexed: 10/25/2022]
Abstract
The parasite Plasmodium falciparum, responsible for the most deadly form of human malaria, has one of the most AT-rich genomes sequenced so far and is known to possess many atypical characteristics. Using multivariate statistical approaches, the present study analyzes the amino acid usage pattern in 5038 annotated protein-coding sequences in P. falciparum clone 3D7. The amino acid composition of individual proteins, though dominated by the directional mutational pressure, exhibits wide variation across the proteome. The Asn content, expression level, mean molecular weight, hydropathy, and aromaticity are found to be the major sources of variation in amino acid usage. At all stages of development, frequencies of residues encoded by GC-rich codons such as Gly, Ala, Arg, and Pro increase significantly in the products of the highly expressed genes. Investigation of nucleotide substitution patterns in P. falciparum and other Plasmodium species reveals that the nonsynonymous sites of highly expressed genes are more conserved than those of the lowly expressed ones, though for synonymous sites, the reverse is true. The highly expressed genes are, therefore, expected to be closer to their putative ancestral state in amino acid composition, and a plausible reason for their sequences being GC-rich at nonsynonymous codon positions could be that their ancestral state was less AT-biased. Negative correlation of the expression level of proteins with respective molecular weights supports the notion that P. falciparum, in spite of its intracellular parasitic lifestyle, follows the principle of cost minimization.
Affiliation(s)
- Ipsita Chanda
- Human Genetics & Genomics Group, Indian Institute of Chemical Biology, Kolkata 700032, India
|
43
|
Fadiel A, Lithwick S, Naftolin F. The influence of environmental adaptation on bacterial genome structure. Lett Appl Microbiol 2005; 40:12-8. [PMID: 15612996 DOI: 10.1111/j.1472-765x.2004.01619.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/29/2022]
Abstract
AIMS: Researchers have long been puzzled by the diversity of life. Now that the complete genomic sequence of many organisms has been determined, it is possible to evaluate the impact of organismal variation on sequence structure or vice versa. The aim of this investigation was to explore genomic changes mandated by organismal adaptation to its ecological niches. METHODS AND RESULTS: Coding sequences from three phylogenetically related bacterial species, namely Mycoplasma genitalium, M. pneumoniae and Ureaplasma urealyticum, were subjected to in-depth sequence analyses. M. genitalium and M. pneumoniae both belong to the genus Mycoplasma while U. urealyticum is a member of the genus Ureaplasma. However, M. genitalium and U. urealyticum are urogenital pathogens while M. pneumoniae is a respiratory pathogen. Complete transcriptomes were downloaded from NCBI for each species and were subjected to in silico investigation using in-house software and public sequence analysis tools. Clear similarities in transcriptome structure were identified among the functionally similar species M. genitalium and U. urealyticum while no such relationship was identified among the phylogenetically related species M. genitalium and M. pneumoniae. CONCLUSIONS: It is plausible to conclude that, in these bacterial species, environmental stimuli might be more influential in shaping sequence signatures than phylogenetic relationships. SIGNIFICANCE AND IMPACT OF THE STUDY: This study suggests that molecular signatures within the transcriptomes of the species examined are likely to be a product of evolutionary adaptation to diverse environmental ecological stimuli, and not a result of common phylogeny.
Affiliation(s)
- A Fadiel
- The Bioinformatics Supercomputing Centre, The Hospital for Sick Children, Toronto, ON, Canada.
|
44
|
Peixoto L, Fernández V, Musto H. The effect of expression levels on codon usage in Plasmodium falciparum. Parasitology 2004; 128:245-51. [PMID: 15074874 DOI: 10.1017/s0031182003004517] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.6] [Indexed: 11/05/2022]
Abstract
The usage of alternative synonymous codons in the completely sequenced, extremely A+T-rich parasite Plasmodium falciparum was studied. Confirming previous studies obtained with less than 3% of the total genes recently described, we found that A- and U-ending triplets predominate but translational selection increases the frequency of a subset of codons in highly expressed genes. However, some new results come from the analysis of the complete sequence. First, there is more variation in GC3 than previously described; second, the effect of natural selection acting at the level of translation has been analysed with real expression data at 4 different stages and third, we found that highly expressed proteins increment the frequency of energetically less expensive amino acids. The implications of these results are discussed.
Affiliation(s)
- L Peixoto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Iguá 4225, Montevideo 11400, Uruguay
|
45
|
Peixoto L, Zavala A, Romero H, Musto H. The strength of translational selection for codon usage varies in the three replicons of Sinorhizobium meliloti. Gene 2004; 320:109-16. [PMID: 14597394 DOI: 10.1016/s0378-1119(03)00815-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Indexed: 11/17/2022]
Abstract
The genome of the nitrogen-fixing bacterium Sinorhizobium meliloti is composed of three replicons of 3.65 (chromosome), 1.35 (pSymA) and 1.68 Mb (pSymB), respectively. While the chromosome encodes most of the housekeeping functions, all three elements may contribute to symbiosis, though pSymA is absolutely necessary for nodulation and nitrogen fixation, since it harbours all the characterized nodulation and symbiotic fixation genes. On the other hand, the majority of the sequences located in this megaplasmid are probably not expressed during the free-living stage of the organism. Since most of the sequences located in pSymA are transcribed only at the bacteroid stage, when the fate of the bacterium is most probably to die, the mutations occurring at this stage will not be fixed in the population. Therefore, if natural selection contributes to the codon usage pattern in this species, its effect will be much weaker for the genes placed in pSymA. A codon usage analysis of the genes comprising the three replicons is consistent with the conclusion that selection for translational speed shapes the codon usage of the two replicons which are important for competitive cell growth while the codon usage of the third replicon reflects primarily the mutational bias.
Affiliation(s)
- Lucía Peixoto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Iguá 4225, Montevideo 11400, Uruguay
|
46
|
Kliman RM, Irving N, Santiago M. Selection conflicts, gene expression, and codon usage trends in yeast. J Mol Evol 2003; 57:98-109. [PMID: 12962310 DOI: 10.1007/s00239-003-2459-9] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.9] [Indexed: 11/30/2022]
Abstract
Synonymous codon usage in yeast appears to be influenced by natural selection on gene expression, as well as regional variation in compositional bias. Because of the large number of potential targets of selection (i.e., most of the codons in the genome) and presumed small selection coefficients, codon usage is an excellent model for studying factors that limit the effectiveness of selection. We use factor analysis to identify major trends in codon usage for 5836 genes in Saccharomyces cerevisiae. The primary factor is strongly correlated with gene expression, consistent with the model that a subset of codons allows for more efficient translation. The secondary factor is very strongly correlated with third codon position GC content and probably reflects regional variation in compositional bias. We find that preferred codon usage decreases in the face of three potential limitations on the effectiveness of selection: reduced recombination rate, increased gene length, and reduced intergenic spacing. All three patterns are consistent with the Hill-Robertson effect (reduced effectiveness of selection among linked targets). A reduction in gene expression in closely spaced genes may also reflect selection conflicts due to antagonistic pleiotropy.
Affiliation(s)
- Richard M Kliman
- Department of Biological Sciences, Kean University, 1000 Morris Avenue, Union, NJ 08830, USA.
|
47
|
Chen LL, Zhang CT. Seven GC-rich microbial genomes adopt similar codon usage patterns regardless of their phylogenetic lineages. Biochem Biophys Res Commun 2003; 306:310-7. [PMID: 12788106 DOI: 10.1016/s0006-291x(03)00973-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Indexed: 11/29/2022]
Abstract
Seven GC-rich (group I) and three AT-rich (group II) microbial genomes are analyzed in this paper. The seven microbes in group I belong to different phylogenetic lineages, even different domains of life. The common feature is that they are highly GC-rich organisms, with more than 60% genomic GC content. Group II includes three bacteria, which belong to the same subdivision as Pseudomonas aeruginosa in group I. The genomic GC content of the three bacteria is in the range of 26-50%. It is shown that although the phylogenetic lineages of the organisms in group I are remote, the common feature of high genomic GC content forces them to adopt similar codon usage patterns, which constitutes the basis of an algorithm using a set of universal parameters to recognize known genes in the seven genomes. The common codon usage pattern of genes of known function in the seven genomes is the GḠS type, where G, Ḡ, and S denote the bases G, non-G, and G/C, respectively. In contrast, although the phylogenetic lineages of the three bacteria in group II are quite close, the codon usage patterns of genes of known function in these genomes are clearly distinct. There are no universal parameters to identify known genes in the three genomes in group II. It can be deduced that the genomic GC content is more important than phylogenetic lineage in gene recognition programs. We hope that the work might be useful for understanding the common characteristics in the organization of microbial genomes.
Affiliation(s)
- Ling-Ling Chen
- Department of Physics, Tianjin University, 300072, Tianjin, China
|
48
|
Myre MA, O'Day DH. Nucleomorphin. A novel, acidic, nuclear calmodulin-binding protein from Dictyostelium that regulates nuclear number. J Biol Chem 2002; 277:19735-44. [PMID: 11919178 DOI: 10.1074/jbc.m109717200] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.6] [Indexed: 11/06/2022] Open
Abstract
Probing of Dictyostelium discoideum cell extracts after SDS-PAGE using ³⁵S-recombinant calmodulin (CaM) as a probe has revealed approximately three dozen Ca²⁺-dependent calmodulin-binding proteins. Here, we report the molecular cloning, expression, and subcellular localization of a gene from D. discoideum encoding a novel calmodulin-binding protein (CaMBP), which we have called nucleomorphin. A λZAP cDNA expression library of cells from multicellular development was screened using a recombinant calmodulin probe (³⁵S-VU1-CaM). The open reading frame of 1119 nucleotides encodes a polypeptide of 340 amino acids with a calculated molecular mass of 38.7 kDa and is constitutively expressed throughout the Dictyostelium life cycle. Nucleomorphin contains a highly acidic glutamic/aspartic acid inverted repeat (DEED) with significant similarity to the conserved nucleoplasmin domain and a putative transmembrane domain in the carboxyl-terminal region. Southern blotting reveals that nucleomorphin exists as a single copy gene. Using gel overlay assays and CaM-agarose we show that bacterially expressed nucleomorphin binds to bovine CaM in a Ca²⁺-dependent manner. Amino-terminal fusion to the green fluorescent protein (GFP) showed that GFP-NumA localized to the nucleus as distinct arc-like patterns similar to heterochromatin regions. GFP-NumA lacking the acidic DEED repeat still showed arc-like accumulations at the nuclear periphery, but the number of nuclei in these cells was increased markedly compared with control cells. In cells expressing GFP-NumA lacking the transmembrane domain, the protein localized to the nuclear periphery but did not affect nuclear number or gross morphology. Nucleomorphin is the first nuclear CaMBP to be identified in Dictyostelium. Furthermore, these data present the first identification of a member of the nucleoplasmin family as a calmodulin-binding protein and suggest nucleomorphin has a role in nuclear structure in Dictyostelium.
Affiliation(s)
- Michael A Myre
- Department of Zoology, University of Toronto at Mississauga, Mississauga, Ontario L5L 1C6, Canada
|
49
|
Likhoshvai VA, Matushkin YG. Differentiation of single-cell organisms according to elongation stages crucial for gene expression efficacy. FEBS Lett 2002; 516:87-92. [PMID: 11959109 DOI: 10.1016/s0014-5793(02)02507-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Indexed: 11/23/2022]
Abstract
We analyzed the interrelation between the efficiency of gene expression and the nucleotide composition of all protein-coding sequences in 38 unicellular organisms whose complete genomic sequences are known. These organisms comprise 37 prokaryotic (29 eubacteria and eight archaebacteria) and one eukaryotic (yeast) species. We demonstrated that frequency analysis of gene codon composition fails to reflect adequately the gene expression efficiency of all these organisms. We constructed a measure, the elongation efficiency index, that considers simultaneously the information on codon frequencies and the degree of mRNA local self-complementarity. This measure recognizes the ribosome-coding genes as highly expressed in all the unicellular organisms studied. According to our analysis, these species fall into five groups differentiated by the process that makes the key contribution to the elongation rate.
Affiliation(s)
- Vitali A Likhoshvai
- Laboratory of Molecular Evolution, Institute of Cytology and Genetics, Prospekt Lavrentieva 10, 630090, Novosibirsk, Russia.
|
50
|
Hey J, Kliman RM. Interactions between natural selection, recombination and gene density in the genes of Drosophila. Genetics 2002; 160:595-608. [PMID: 11861564 PMCID: PMC1461979 DOI: 10.1093/genetics/160.2.595] [Citation(s) in RCA: 143] [Impact Index Per Article: 6.5] [Indexed: 11/12/2022] Open
Abstract
In Drosophila, as in many organisms, natural selection leads to high levels of codon bias in genes that are highly expressed. Thus codon bias is an indicator of the intensity of one kind of selection that is experienced by genes and can be used to assess the impact of other genomic factors on natural selection. Among 13,000 genes in the Drosophila genome, codon bias has a slight positive, and strongly significant, association with recombination--as expected if recombination allows natural selection to act more efficiently when multiple linked sites segregate functional variation. The same reasoning leads to the expectation that the efficiency of selection, and thus average codon bias, should decline with gene density. However, this prediction is not confirmed. Levels of codon bias and gene expression are highest for those genes in an intermediate range of gene density, a pattern that may be the result of a tradeoff between the advantages for gene expression of close gene spacing and disadvantages arising from regulatory conflicts among tightly packed genes. These factors appear to overlay the more subtle effect of linkage among selected sites that gives rise to the association between recombination rate and codon bias.
Affiliation(s)
- Jody Hey
- Department of Genetics, Rutgers University, Piscataway, New Jersey 08854-8082, USA.
|