1
|
Castellano LA, McNamara RJ, Pallarés HM, Gamarnik AV, Alvarez DE, Bazzini AA. Dengue virus preferentially uses human and mosquito non-optimal codons. Mol Syst Biol 2024:10.1038/s44320-024-00052-7. [PMID: 39039212 DOI: 10.1038/s44320-024-00052-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 06/25/2024] [Accepted: 06/26/2024] [Indexed: 07/24/2024] Open
Abstract
Codon optimality refers to the effect that codon composition has on messenger RNA (mRNA) stability and translation level and implies that synonymous codons are not silent from a regulatory point of view. Here, we investigated the adaptation of virus genomes to the host optimality code using mosquito-borne dengue virus (DENV) as a model. We demonstrated that codon optimality exists in mosquito cells and showed that DENV preferentially uses nonoptimal (destabilizing) codons and avoids codons that are defined as optimal (stabilizing) in either human or mosquito cells. Human genes enriched in the codons preferentially and frequently used by DENV are upregulated during infection, and so is the tRNA decoding the nonoptimal and DENV preferentially used codon for arginine. We found that adaptation during single-host passaging in human or mosquito cells results in the selection of synonymous mutations towards DENV's preferred nonoptimal codons that increase virus fitness. Finally, our analyses revealed that hundreds of viruses preferentially use nonoptimal codons, with those infecting a single host displaying an even stronger bias, suggesting that host-pathogen interaction shapes virus-synonymous codon choice.
Collapse
Affiliation(s)
- Luciana A Castellano
- Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
| | - Ryan J McNamara
- Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
| | - Horacio M Pallarés
- Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
- Fundación Instituto Leloir, Instituto de Investigaciones Bioquímicas de Buenos Aires IIBBA-CONICET, Ciudad Autónoma de Buenos Aires, Argentina
| | - Andrea V Gamarnik
- Fundación Instituto Leloir, Instituto de Investigaciones Bioquímicas de Buenos Aires IIBBA-CONICET, Ciudad Autónoma de Buenos Aires, Argentina
| | - Diego E Alvarez
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín-CONICET, San Martín B1650, Argentina
| | - Ariel A Bazzini
- Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA.
- Department of Molecular and Integrative Physiology, University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS, 66160, USA.
| |
Collapse
|
2
|
Sharma D, Chakraborty S. RNA editing sites and triplet usage in exomes of bat RNA virus genomes of the family Paramyxoviridae. Microb Pathog 2024; 194:106796. [PMID: 39025379 DOI: 10.1016/j.micpath.2024.106796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 07/09/2024] [Accepted: 07/15/2024] [Indexed: 07/20/2024]
Abstract
Bats contain a diverse spectrum of viral species in their bodies. The RNA virus family Paramyxoviridae tends to infect several vertebrate species, which are accountable for a variety of devastating infections in both humans and animals. Viruses of this kind include measles, mumps, and Hendra. Some synonymous codons are favoured over others in mRNAs during gene-to-protein synthesis process. Such phenomenon is termed as codon usage bias (CUB). Our research emphasized many aspects that shape the CUB of genes in the Paramyxoviridae family found in bats. Here, the nitrogenous base A occurred the most. AT was found to be abundant in the coding sequences of the Paramyxoviridae family. RSCU data revealed that A or T ending codons occurred more frequently than predicted. Furthermore, 3 overrepresented codons (CAT, AGA, and GCA) and 7 underrepresented codons (CCG, TCG, CGC, CGG, CGT, GCG and ACG) were detected in the viral genomes. Correspondence analysis, neutrality plot, and parity plots highlight the combined impact of mutational pressure and natural selection on CUB. The neutrality plot of GC12 against GC3 yielded a regression coefficient value of 0.366, indicating that natural selection had a significant (63.4 %) impact. Moreover, RNA editing analysis was done, which revealed the highest frequency of C to T mutations. The results of our research revealed the pattern of codon usage and RNA editing sites in Paramyxoviridae genomes.
Collapse
Affiliation(s)
- Deepika Sharma
- Department of Biotechnology, Assam University, Silchar, 788011, Assam, India.
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, 788011, Assam, India.
| |
Collapse
|
3
|
Gao W, Chen X, He J, Sha A, Luo Y, Xiao W, Xiong Z, Li Q. Intraspecific and interspecific variations in the synonymous codon usage in mitochondrial genomes of 8 pleurotus strains. BMC Genomics 2024; 25:456. [PMID: 38730418 PMCID: PMC11084086 DOI: 10.1186/s12864-024-10374-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 05/03/2024] [Indexed: 05/12/2024] Open
Abstract
In this study, we investigated the codon bias of twelve mitochondrial core protein coding genes (PCGs) in eight Pleurotus strains, two of which are from the same species. The results revealed that the codons of all Pleurotus strains had a preference for ending in A/T. Furthermore, the correlation between codon base compositions and codon adaptation index (CAI), codon bias index (CBI) and frequency of optimal codons (FOP) indices was also detected, implying the influence of base composition on codon bias. The two P. ostreatus species were found to have differences in various base bias indicators. The average effective number of codons (ENC) of mitochondrial core PCGs of Pleurotus was found to be less than 35, indicating strong codon preference of mitochondrial core PCGs of Pleurotus. The neutrality plot analysis and PR2-Bias plot analysis further suggested that natural selection plays an important role in Pleurotus codon bias. Additionally, six to ten optimal codons (ΔRSCU > 0.08 and RSCU > 1) were identified in eight Pleurotus strains, with UGU and ACU being the most widely used optimal codons in Pleurotus. Finally, based on the combined mitochondrial sequence and RSCU value, the genetic relationship between different Pleurotus strains was deduced, showing large variations between them. This research has improved our understanding of synonymous codon usage characteristics and evolution of this important fungal group.
Collapse
Affiliation(s)
- Wei Gao
- Clinical Medical College & Affiliated Hospital of Chengdu University, Chengdu University, Chengdu, Sichuan, China
| | - Xiaodie Chen
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Jing He
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Ajia Sha
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Yingyong Luo
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Wenqi Xiao
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Zhuang Xiong
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China
| | - Qiang Li
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Food and Biological Engineering, Chengdu University, Chengdu, Sichuan, China.
- School of Food and Biological Engineering, Chengdu University, 2025 # Chengluo Avenue, Longquanyi District, Chengdu, Sichuan, 610106, China.
| |
Collapse
|
4
|
Shan KJ, Wu C, Tang X, Lu R, Hu Y, Tan W, Lu J. Molecular Evolution of Protein Sequences and Codon Usage in Monkeypox Viruses. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzad003. [PMID: 38862422 DOI: 10.1093/gpbjnl/qzad003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 10/06/2023] [Accepted: 10/11/2023] [Indexed: 06/13/2024]
Abstract
The monkeypox virus (mpox virus, MPXV) epidemic in 2022 has posed a significant public health risk. Yet, the evolutionary principles of MPXV remain largely unknown. Here, we examined the evolutionary patterns of protein sequences and codon usage in MPXV. We first demonstrated the signal of positive selection in OPG027, specifically in the Clade I lineage of MPXV. Subsequently, we discovered accelerated protein sequence evolution over time in the variants responsible for the 2022 outbreak. Furthermore, we showed strong epistasis between amino acid substitutions located in different genes. The codon adaptation index (CAI) analysis revealed that MPXV genes tended to use more non-preferred codons compared to human genes, and the CAI decreased over time and diverged between clades, with Clade I > IIa and IIb-A > IIb-B. While the decrease in fatality rate among the three groups aligned with the CAI pattern, it remains unclear whether this correlation was coincidental or if the deoptimization of codon usage in MPXV led to a reduction in fatality rates. This study sheds new light on the mechanisms that govern the evolution of MPXV in human populations.
Collapse
Affiliation(s)
- Ke-Jia Shan
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing 100871, China
- Sinovac Biotech Ltd., Beijing 100085, China
| | - Changcheng Wu
- NHC Key Laboratory of Biosafety, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 100052, China
| | - Xiaolu Tang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing 100871, China
| | - Roujian Lu
- NHC Key Laboratory of Biosafety, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 100052, China
| | - Yaling Hu
- Sinovac Biotech Ltd., Beijing 100085, China
| | - Wenjie Tan
- NHC Key Laboratory of Biosafety, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 100052, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
5
|
Aktürk Dizman Y. Analysis of codon usage bias of exonuclease genes in invertebrate iridescent viruses. Virology 2024; 593:110030. [PMID: 38402641 DOI: 10.1016/j.virol.2024.110030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 02/04/2024] [Accepted: 02/13/2024] [Indexed: 02/27/2024]
Abstract
Invertebrate iridescent viruses (IIVs) are double-stranded DNA viruses that belong to the Iridoviridae family. IIVs result diseases that vary in severity from subclinical to lethal in invertebrate hosts. Codon usage bias (CUB) analysis is a versatile method for comprehending the genetic and evolutionary aspects of species. In this study, we analyzed the CUB in 10 invertebrate iridescent viruses exonuclease genes by calculating and comparing the nucleotide contents, effective number of codons (ENC), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and others. The results revealed that IIVs exonuclease genes are rich in A/T. The ENC analysis displayed a low codon usage bias in IIVs exonuclease genes. ENC-plot, neutrality plot, and parity rule 2 plot demonstrated that besides mutational pressure, other factors like natural selection, dinucleotide content, and aromaticity also contributed to CUB. The findings could enhance our understanding of the evolution of IIVs exonuclease genes.
Collapse
Affiliation(s)
- Yeşim Aktürk Dizman
- Department of Biology, Faculty of Arts and Sciences, Recep Tayyip Erdogan University, 53100, Rize, Türkiye.
| |
Collapse
|
6
|
Baeza JA, Stephens NC, Baker A, Lyons A, Franks B, Pirro S, Feldheim KA. Insights into the nuclear and mitochondrial genome of the Lemon shark Negaprion brevirostris using low-coverage sequencing: Genome size, repetitive elements, mitochondrial genome, and phylogenetic placement. Gene 2024; 894:147939. [PMID: 38572145 PMCID: PMC10990291 DOI: 10.1016/j.gene.2023.147939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2024]
Abstract
The Lemon shark Negaprion brevirostris is an important species experiencing conservation issues that is in need of genomic resources. Herein, we conducted a genome survey sequencing in N. brevirostris and determined genome size, explored repetitive elements, assembled and annotated the 45S rRNA DNA operon, and assembled and described in detail the mitochondrial genome. Lastly, the phylogenetic position of N. brevirostris in the family Carcharhinidae was examined using translated protein coding genes. The estimated haploid genome size ranged between 2.29 and 2.58 Gbp using a k-mer analysis, which is slightly below the genome size estimated for other sharks belonging to the family Carcharhinidae. Using a k-mer analysis, approx. 64-71 % of the genome of N. brevirostris was composed of repetitive elements. A relatively large proportion of the 'repeatome' could not be annotated. Taking into account only annotated repetitive elements, Class I - Long Interspersed Nuclear Element (LINE) were the most abundant repetitive elements followed by Class I - Penelope and Satellite DNA. The nuclear ribosomal operon was fully assembled. The AT-rich complete mitochondrial genome was 16,703 bp long and encoded 13 protein coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes. Negaprion brevirostris is closely related to the genera Carcharhinus, Glyphis and Lamiopsis in the family Carcharinidae. This new genomic resources will aid with the development of conservation plans for this large coastal shark.
Collapse
Affiliation(s)
- J. Antonio Baeza
- Department of Biological Sciences, Clemson University, Clemson, SC, USA
- Smithsonian Marine Station at Fort Pierce, Smithsonian Institution, Fort Pierce, FL, USA
- Departamento de Biología Marina, Universidad Catolica del Norte, Coquimbo, Chile
| | | | - Alyssa Baker
- Department of Biological Sciences, Clemson University, Clemson, SC, USA
| | - Andrew Lyons
- Marine Science Research Institute, Jacksonville University, Florida, USA
| | - Bryan Franks
- Marine Science Research Institute, Jacksonville University, Florida, USA
| | | | - Kevin A. Feldheim
- Pritzker Laboratory for Molecular Systematics and Evolution, Field Museum of Natural History, Chicago, IL, USA
| |
Collapse
|
7
|
Xiao M, Hu X, Li Y, Liu Q, Shen S, Jiang T, Zhang L, Zhou Y, Li Y, Luo X, Bai L, Yan W. Comparative analysis of codon usage patterns in the chloroplast genomes of nine forage legumes. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2024; 30:153-166. [PMID: 38623162 PMCID: PMC11016040 DOI: 10.1007/s12298-024-01421-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 02/19/2024] [Accepted: 02/23/2024] [Indexed: 04/17/2024]
Abstract
Leguminosae is one of the three largest families of angiosperms after Compositae and Orchidaceae. It is widely distributed and grows in a variety of environments, including plains, mountains, deserts, forests, grasslands, and even waters where almost all legumes can be found. It is one of the most important sources of starch, protein and oil in the food of mankind and also an important source of high-quality forage material for animals, which has important economic significance. In our study, the codon usage patterns and variation sources of the chloroplast genome of nine important forage legumes were systematically analyzed. Meanwhile, we also constructed a phylogenetic tree based on the whole chloroplast genomes and protein coding sequences of these nine forage legumes. Our results showed that the chloroplast genomes of nine forage legumes end with A/T bases, and seven identical high-frequency (HF) codons were detected among the nine forage legumes. ENC-GC3s mapping, PR2 analysis, and neutral analysis showed that the codon bias of nine forage legumes was influenced by many factors, among which natural selection was the main influencing factor. The codon usage frequency showed that the Nicotiana tabacum and Saccharomyces cerevisiae can be considered as receptors for the exogenous expression of chloroplast genes of these nine forage legumes. The phylogenetic relationships of the chloroplast genomes and protein coding genes were highly similar, and the nine forage legumes were divided into three major clades. Among the clades Melilotus officinalis was more closely related to Medicago sativa, and Galega officinalis was more closely related to Galega orientalis. This study provides a scientific basis for the molecular markers research, species identification and phylogenetic studies of forage legumes. Supplementary Information The online version contains supplementary material available at 10.1007/s12298-024-01421-0.
Collapse
Affiliation(s)
- Mingkun Xiao
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Xiang Hu
- Tropical Eco-agricultural Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan China
| | - Yaqi Li
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Qian Liu
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Shaobin Shen
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Tailing Jiang
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Linhui Zhang
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Yingchun Zhou
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Yuexian Li
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Xin Luo
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Lina Bai
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| | - Wei Yan
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan China
| |
Collapse
|
8
|
Wang Y, Chi C, Zhang J, Zhang K, Deng D, Zheng W, Chen N, Meurens F, Zhu J. Systematic analysis of the codon usage patterns of African swine fever virus genome coding sequences reveals its host adaptation phenotype. Microb Genom 2024; 10:001186. [PMID: 38270515 PMCID: PMC10868601 DOI: 10.1099/mgen.0.001186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 01/09/2024] [Indexed: 01/26/2024] Open
Abstract
African swine fever (ASF) is a severe haemorrhagic disease caused by the African swine fever virus (ASFV), transmitted by ticks, resulting in high mortality among domestic pigs and wild boars. The global spread of ASFV poses significant economic threats to the swine industry. This study employs diverse analytical methods to explore ASFV's evolution and host adaptation, focusing on codon usage patterns and associated factors. Utilizing phylogenetic analysis methods including neighbour-joining and maximum-likelihood, 64 ASFV strains were categorized into four clades. Codon usage bias (CUB) is modest in ASFV coding sequences. This research identifies multiple factors - such as nucleotide composition, mutational pressures, natural selection and geographical diversity - contributing to the formation of CUB in ASFV. Analysis of relative synonymous codon usage reveals CUB variations within clades and among ASFVs and their hosts. Both Codon Adaptation Index and Similarity Index analyses confirm that ASFV strains are highly adapted to soft ticks (Ornithodoros moubata) but less so to domestic pigs, which could be a result of the long-term co-evolution of ASFV with ticks. This study sheds light on the factors influencing ASFV's codon usage and fitness dynamics, enriching our understanding of its evolution, adaptation and host interactions.
Collapse
Affiliation(s)
- Yuening Wang
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Chenglin Chi
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Jiajia Zhang
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Kaili Zhang
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Dafu Deng
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Wanglong Zheng
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - Nanhua Chen
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| | - François Meurens
- Swine and Poultry Infectious Diseases Research Center, Faculty of Veterinary Medicine, University of Montreal, St. Hyacinthe, QC, J2S 2M2, Canada
- Department of Veterinary Microbiology and Immunology, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, SK, S7N 5E2, Canada
| | - Jianzhong Zhu
- College of Veterinary Medicine, Yangzhou University, Yangzhou, 225009, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou, 225009, PR China
- Comparative Medicine Research Institute, Yangzhou University, Yangzhou, 225009, PR China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, 225009, PR China
| |
Collapse
|
9
|
Yang Q, Xin C, Xiao QS, Lin YT, Li L, Zhao JL. Codon usage bias in chloroplast genes implicate adaptive evolution of four ginger species. FRONTIERS IN PLANT SCIENCE 2023; 14:1304264. [PMID: 38169692 PMCID: PMC10758403 DOI: 10.3389/fpls.2023.1304264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/01/2023] [Indexed: 01/05/2024]
Abstract
Codon usage bias (CUB) refers to different codons exhibiting varying frequencies of usage in the genome. Studying CUB is crucial for understanding genome structure, function, and evolutionary processes. Herein, we investigated the codon usage patterns and influencing factors of protein-coding genes in the chloroplast genomes of four sister genera (monophyletic Roscoea and Cautleya, and monophyletic Pommereschea and Rhynchanthus) from the Zingiberaceae family with contrasting habitats in southwestern China. These genera exhibit distinct habitats, providing a unique opportunity to explore the adaptive evolution of codon usage. We conducted a comprehensive analysis of nucleotide composition and codon usage on protein-coding genes in the chloroplast genomes. The study focused on understanding the relationship between codon usage and environmental adaptation, with a particular emphasis on genes associated with photosynthesis. Nucleotide composition analysis revealed that the overall G/C content of the coding genes was ˂ 48%, indicating an enrichment of A/T bases. Additionally, synonymous and optimal codons were biased toward ending with A/U bases. Natural selection is the primary factor influencing CUB characteristics, particularly photosynthesis-associated genes. We observed differential gene expressions related to light adaptation among sister genera inhabiting different environments. Certain codons were favored under specific conditions, possibly contributing to gene expression regulation in particular environments. This study provides insights into the adaptive evolution of these sister genera by analyzing CUB and offers theoretical assistance for understanding gene expression and regulation. In addition, the data support the relationship between RNA editing and CUB, and the findings shed light on potential research directions for investigating adaptive evolution.
Collapse
Affiliation(s)
- Qian Yang
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, China
| | - Cheng Xin
- School of Life Sciences, Zhengzhou University, Zhengzhou, China
| | - Qing-Song Xiao
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, China
| | - Ya-Ting Lin
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, China
| | - Li Li
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, China
| | - Jian-Li Zhao
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology and Centre for Invasion Biology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, China
| |
Collapse
|
10
|
Wu X, Shan K, Zan F, Tang X, Qian Z, Lu J. Optimization and Deoptimization of Codons in SARS-CoV-2 and Related Implications for Vaccine Development. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2205445. [PMID: 37267926 PMCID: PMC10427376 DOI: 10.1002/advs.202205445] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 04/08/2023] [Indexed: 06/04/2023]
Abstract
The spread of coronavirus disease 2019 (COVID-19), caused by severe respiratory syndrome coronavirus 2 (SARS-CoV-2), has progressed into a global pandemic. To date, thousands of genetic variants have been identified among SARS-CoV-2 isolates collected from patients. Sequence analysis reveals that the codon adaptation index (CAI) values of viral sequences have decreased over time but with occasional fluctuations. Through evolution modeling, it is found that this phenomenon may result from the virus's mutation preference during transmission. Using dual-luciferase assays, it is further discovered that the deoptimization of codons in the viral sequence may weaken protein expression during virus evolution, indicating that codon usage may play an important role in virus fitness. Finally, given the importance of codon usage in protein expression and particularly for mRNA vaccines, it is designed several codon-optimized Omicron BA.2.12.1, BA.4/5, and XBB.1.5 spike mRNA vaccine candidates and experimentally validated their high levels of expression. This study highlights the importance of codon usage in virus evolution and provides guidelines for codon optimization in mRNA and DNA vaccine development.
Collapse
Affiliation(s)
- Xinkai Wu
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Ke‐jia Shan
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Fuwen Zan
- NHC Key Laboratory of Systems Biology of PathogensInstitute of Pathogen BiologyChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijing100176China
| | - Xiaolu Tang
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Zhaohui Qian
- NHC Key Laboratory of Systems Biology of PathogensInstitute of Pathogen BiologyChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijing100176China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| |
Collapse
|
11
|
Burk RD, Mirabello L, DeSalle R. Distinguishing Genetic Drift from Selection in Papillomavirus Evolution. Viruses 2023; 15:1631. [PMID: 37631973 PMCID: PMC10458755 DOI: 10.3390/v15081631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 07/20/2023] [Accepted: 07/21/2023] [Indexed: 08/27/2023] Open
Abstract
Pervasive purifying selection on non-synonymous substitutions is a hallmark of papillomavirus genome history, but the role of selection on and the drift of non-coding DNA motifs on HPV diversification is poorly understood. In this study, more than a thousand complete genomes representing Alphapapillomavirus types, lineages, and SNP variants were examined phylogenetically and interrogated for the number and position of non-coding DNA sequence motifs using Principal Components Analyses, Ancestral State Reconstructions, and Phylogenetic Independent Contrasts. For anciently diverged Alphapapillomavirus types, composition of the four nucleotides (A, C, G, T), codon usage, trimer usage, and 13 established non-coding DNA sequence motifs revealed phylogenetic clusters consistent with genetic drift. Ancestral state reconstruction and Phylogenetic Independent Contrasts revealed ancient genome alterations, particularly for the CpG and APOBEC3 motifs. Each evolutionary analytical method we performed supports the unanticipated conclusion that genetic drift and different evolutionary drivers have structured Alphapapillomavirus genomes in distinct ways during successive epochs, even extending to differences in more recently formed variant lineages.
Collapse
Affiliation(s)
- Robert D. Burk
- Departments of Pediatrics, Microbiology & Immunology, Epidemiology & Population Health, Obstetrics, Gynecology and Woman’s Health, and Albert Einstein Cancer Center, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | - Robert DeSalle
- Sackler Institute of Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| |
Collapse
|
12
|
Alonso AM, Diambra L. Dicodon-based measures for modeling gene expression. Bioinformatics 2023; 39:btad380. [PMID: 37307098 PMCID: PMC10287933 DOI: 10.1093/bioinformatics/btad380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 05/20/2023] [Accepted: 06/09/2023] [Indexed: 06/14/2023] Open
Abstract
MOTIVATION Codon usage preference patterns have been associated with modulation of translation efficiency, protein folding, and mRNA decay. However, new studies support that codon pair usage has also a remarkable effect at the gene expression level. Here, we expand the concept of CAI to answer if codon pair usage patterns can be understood in terms of codon usage bias, or if they offer new information regarding coding translation efficiency. RESULTS Through the implementation of a weighting strategy to consider the dicodon contributions, we observe that the dicodon-based measure has greater correlations with gene expression level than CAI. Interestingly, we have noted that dicodons associated with a low value of adaptiveness are related to dicodons which mediate strong translational inhibition in yeast. We have also noticed that some codon-pairs have a smaller dicodon contribution than estimated by the product of the respective codon contributions. AVAILABILITY AND IMPLEMENTATION Scripts, implemented in Python, are freely available for download at https://zenodo.org/record/7738276#.ZBIDBtLMIdU.
Collapse
Affiliation(s)
- Andres M Alonso
- Instituto Tecnológico Chascomús (INTECH), CONICET-UNSAM, Intendente Marino km 8.2, Chascomús, 7130 Provincia de Buenos Aires, Argentina
- CCT-La Plata, CONICET, Calle 8 Nº 1467, La Plata, B1904CMC Provincia de Buenos Aires, Argentina
| | - Luis Diambra
- CCT-La Plata, CONICET, Calle 8 Nº 1467, La Plata, B1904CMC Provincia de Buenos Aires, Argentina
- Centro Regional de Estudios Genómicos, FCE-UNLP, Blvd 120 N∘ 1461, La Plata, 1900 Provincia de Buenos Aires, Argentina
| |
Collapse
|
13
|
Antonio Baeza J, Rajapakse D, Pearson L, Kreiser BR. Low coverage sequencing provides insights into the key features of the nuclear and mitochondrial genomes of the Alligator Snapping Turtle Macrochelys temminckii. Gene 2023; 873:147478. [PMID: 37182558 DOI: 10.1016/j.gene.2023.147478] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Revised: 04/24/2023] [Accepted: 05/08/2023] [Indexed: 05/16/2023]
Abstract
The alligator snapping turtle Macrochelys temminckii is a culturally, ecologically, and evolutionary relevant species of conservation concern. In this study, we conducted a genome survey of M. temminckii. Using a low-coverage short read sequencing strategy, this study estimated the genome size, repetitive genome content, annotated and quantified repetitive elements, assembled the 45S rRNA DNA operon, and characterized in detail the mitochondrial genome of M. temminckii. Using a k-mer strategy, the estimated haploid genome size varied between 3.77 and 3.19 Gbp, which is within the range previously reported for other representatives of the family Chelydridae. Repetitive genome content estimates using different k-mers (21 to 51) indicated that more than 75% of the genome of M. temminckii comprised repetitive elements. Taking into account only annotated repetitive elements, the most common repetitive elements were classified as Class I - Long Interspersed Nuclear Element (LINE) which were more abundant than Class I - Penelope and Class I - Long Terminal Repeat (LTR) Ty3-gypsy mobile elements. Less abundant repeat element families in the nuclear genome of M. temminckii included Class I - DIRS mobile elements and Satellite DNA. The nuclear ribosomal operon was partially assembled into two contigs, one encoding the complete ssrDNA and a second comprising the full lsrDNA. The AT-rich complete mitochondrial genome was 16,570 bp long. These new genomic resources are of utmost importance to aid in the development of conservation plans for this freshwater turtle.
Collapse
Affiliation(s)
- J Antonio Baeza
- Department of Biological Sciences, Clemson University, Clemson, SC, USA; Departamento de Biología Marina, Universidad Catolica del Norte, Coquimbo, Chile; Smithsonian Marine Station at Fort Pierce, Smithsonian Institution, Fort Pierce, FL, USA.
| | - Dilani Rajapakse
- Department of Biological Sciences, Clemson University, Clemson, SC, USA
| | - Luke Pearson
- U.S. Fish and Wildlife Service, Mississippi Ecological Services Field Office, 6578 Dogwood View Parkway, Jackson, MS 39213
| | - Brian R Kreiser
- School of Biological, Environmental and Earth Sciences, University of Southern Mississippi, Hattiesburg, Mississippi 39406 USA
| |
Collapse
|
14
|
Wang H, Liu S, Lv Y, Wei W. Codon usage bias of Venezuelan equine encephalitis virus and its host adaption. Virus Res 2023; 328:199081. [PMID: 36854361 DOI: 10.1016/j.virusres.2023.199081] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 09/14/2022] [Accepted: 02/24/2023] [Indexed: 03/02/2023]
Abstract
Venezuelan equine encephalitis virus (VEEV) is an emerging zoonotic virus in the alphavirus genus. It can be transmitted to humans due to spillover from equid-mosquito cycles. The symptoms caused by VEEV include fever, headache, myalgia, nausea, and vomiting. It can also cause encephalitis in severe cases. The evolutionary features of VEEV are largely unknown. In this study, we comprehensively analyzed the codon usage pattern of VEEV by computing a variety of indicators, such as effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), on 130 VEEV coding sequences retrieved from GenBank. The results showed that the codon usage bias of VEEV is relatively low. ENc-GC3s plot, neutrality plot, and CAI-ENc correlation analyses supported that translational selection plays an important role in shaping the codon usage pattern of VEEV whereas the mutation pressure has a minor influence. Analysis of RSCU values showed that most of the preferred codons in VEEV are C/G-ended. Analysis of dinucleotide composition found that all CG- and UA-containing codons are not preferentially used. Phylogenetic analysis showed that VEEV isolates can be clustered into three genera and evolutionary force affects the codon usage pattern. Furthermore, a correspondence analysis (COA) showed that aromaticity and hydrophobicity as well as geographical distribution also have certain effects on the codon usage variation of VEEV, suggesting the possible involvement of translational selection. Overall, the codon usage of VEEV is comparatively slight and translational selection might be the main factor that shapes the codon usage pattern of VEEV. This study will promote our understanding about the evolution of VEEV and its host adaption, and might provide some clues for preventing the cross-species transmission of VEEV.
Collapse
Affiliation(s)
- Hongju Wang
- Kaifeng Key Laboratory of Infection and Biological Safety, School of Basic Medical Sciences, Henan University, Kaifeng, China
| | - Shijie Liu
- School of Clinical Medicine, Henan University, Kaifeng, China
| | - Yao Lv
- School of Clinical Medicine, Henan University, Kaifeng, China
| | - Wenqiang Wei
- Kaifeng Key Laboratory of Infection and Biological Safety, School of Basic Medical Sciences, Henan University, Kaifeng, China.
| |
Collapse
|
15
|
Xiong B, Wang T, Huang S, Liao L, Wang X, Deng H, Zhang M, He J, Sun G, He S, Wang Z. Analysis of Codon Usage Bias in Xyloglucan Endotransglycosylase (XET) Genes. Int J Mol Sci 2023; 24:ijms24076108. [PMID: 37047091 PMCID: PMC10094191 DOI: 10.3390/ijms24076108] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 03/17/2023] [Accepted: 03/21/2023] [Indexed: 04/14/2023] Open
Abstract
Xyloglucan endotransglycosylase (XET) genes are widely distributed in most plants, but the codon usage bias of XET genes has remained uncharacterized. Thus, we analyzed the codon usage bias using 4500 codons of 20 XET genes to elucidate the genetic and evolutionary patterns. Phylogenetic and hierarchical cluster analyses revealed that the 20 XET genes belonged to two groups. The closer the genetic distance, the more similar the codon usage preference. The codon usage bias of most XET genes was weak, but there was also some codon usage bias. AGA, AGG, AUC, and GUG were the top four codons (RSCU > 1.5) in the 20 XET genes. CitXET had a stronger codon usage bias, and there were eight optimal codons of CitXET (i.e., AGA, AUU, UCU, CUU, CCA, GCU, GUU, and AAA). The RSCU values underwent a correspondence analysis. The two main factors affecting codon usage bias (i.e., Axes 1 and 2) accounted for 54.8% and 17.6% of the total variation, respectively. Multiple correspondence analysis revealed that XET genes were widely distributed, with Group 1 genes being closer to Axis 1 than Group 2 genes, which were closer to Axis 2. Codons with A/U at the third codon position were distributed closer to Axis 1 than codons with G/C at the third codon position. PgXET, ZmXET, VlXET, VrXET, and PcXET were biased toward codons ending with G/C. In contrast, CitXET, DpXET, and BrpXET were strongly biased toward codons ending with A/U, indicating that these XET genes have a strong codon usage bias. Translational selection and base composition (especially A and U at the third codon position), followed by mutation pressure and natural selection, may be the most important factors affecting codon usage of 20 XET genes. These results may be useful in clarifying the codon usage bias of XET genes and the relevant evolutionary characteristics.
Collapse
Affiliation(s)
- Bo Xiong
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Tie Wang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Shengjia Huang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Ling Liao
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Xun Wang
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Honghong Deng
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Mingfei Zhang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Jiaxian He
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Guochao Sun
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Siya He
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Zhihui Wang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| |
Collapse
|
16
|
Khandia R, Khan AA, Karuvantevida N, Gurjar P, Rzhepakovsky IV, Legaz I. Insights into Synonymous Codon Usage Bias in Hepatitis C Virus and Its Adaptation to Hosts. Pathogens 2023; 12:pathogens12020325. [PMID: 36839597 PMCID: PMC9961758 DOI: 10.3390/pathogens12020325] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 01/25/2023] [Accepted: 02/02/2023] [Indexed: 02/17/2023] Open
Abstract
Hepatitis C virus (HCV) is enveloped RNA virus, encoding for a polyprotein that is processed by cellular proteases. The virus is responsible for liver cirrhosis, allograft rejection, and human hepatocellular carcinoma. Based on studies including compositional analysis, odds ratio analysis, parity analysis, skew analysis, relative synonymous codon usage, codon bias, and protein properties, it was evident that codon usage bias in HCV is dependent upon the nucleotide composition. Codon context analysis revealed CTC-CTG as a preferred codon pair. While CGA and CGT codons were rare, none of the codons were rare in HCV-like viruses envisaged in the present study. Many of the preferred codon pairs were valine amino acid-initiated, which possibly infers viral infectivity; hence the role of selection forces appears to act on the HCV genome, which was further validated by neutrality analysis where selection accounted for 87.28%, while mutation accounted for 12.72% force shaping codon usage. Furthermore, codon usage was correlated with the length of the genome. HCV viruses prefer valine-initiated codon pairs, while HCV-like viruses prefer alanine-initiated codon pairs. The HCV host range is very narrow and is confined to only humans and chimpanzees. Based on indices including codon usage correlation analysis, similarity index, and relative codon deoptimization index, it is evident in the study that the chimpanzee is the primary host of the virus. The present study helped elucidate the preferred host for HCV. The information presented in the study paved the way for generating an attenuated vaccine candidate through viral recoding, with finely tuned nucleotide composition and a perfect balance of preferred and rare codons.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal 462026, India
- Correspondence: (R.K.); (I.L.)
| | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia
| | - Noushad Karuvantevida
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Pankaj Gurjar
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, NSW 2770, Australia
| | | | - Isabel Legaz
- Department of Legal and Forensic Medicine, Biomedical Research Institute (IMIB), Regional Campus of International Excellence “Campus Mare Nostrum”, Faculty of Medicine, University of Murcia, 30120 Murcia, Spain
- Correspondence: (R.K.); (I.L.)
| |
Collapse
|
17
|
Pu F, Wang R, Yang X, Hu X, Wang J, Zhang L, Zhao Y, Zhang D, Liu Z, Liu J. Nucleotide and codon usage biases involved in the evolution of African swine fever virus: A comparative genomics analysis. J Basic Microbiol 2023; 63:499-518. [PMID: 36782108 DOI: 10.1002/jobm.202200624] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/05/2023] [Accepted: 01/21/2023] [Indexed: 02/15/2023]
Abstract
Since African swine fever virus (ASFV) replication is closely related to its host's machinery, codon usage of viral genome can be subject to selection pressures. A better understanding of codon usage can give new insights into viral evolution. We implemented information entropy and revealed that the nucleotide usage pattern of ASFV is significantly associated with viral isolation factors (region and time), especially the usages of thymine and cytosine. Despite the domination of adenine and thymine in the viral genome, we found that mutation pressure alters the overall codon usage pattern of ASFV, followed by selective forces from natural selection. Moreover, the nucleotide skew index at the gene level indicates that nucleotide usages influencing synonymous codon bias of ASFV are significantly correlated with viral protein hydropathy. Finally, evolutionary plasticity is proved to contribute to the weakness in synonymous codons with A- or T-end serving as optimal codons of ASFV, suggesting that fine-tuning translation selection plays a role in synonymous codon usages of ASFV for adapting host. Taken together, ASFV is subject to evolutionary dynamics on nucleotide selections and synonymous codon usage, and our detailed analysis offers deeper insights into the genetic characteristics of this newly emerging virus around the world.
Collapse
Affiliation(s)
- Feiyang Pu
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Rui Wang
- Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Xuanye Yang
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Xinyan Hu
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Jinqian Wang
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Lijuan Zhang
- College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Yongqing Zhao
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Derong Zhang
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Zewen Liu
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| | - Junlin Liu
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,College of Life Science and Engineering, Northwest Minzu University, Lanzhou, Gansu, China
| |
Collapse
|
18
|
Abstract
Phylogenetic analysis of monkeypox virus genomes showed statistically significant divergence and nascent subclades during the 2022 mpox outbreak. Frequency of G>A/C>T transitions has increased in recent years, probably resulting from apolipoprotein B mRNA editing enzyme catalytic polypeptide 3G (APOBEC3) deaminase editing. This microevolutionary pattern most likely reflects community spread of the virus and adaptation to humans.
Collapse
|
19
|
Lewitus E, Bai H, Rolland M. Design of a pan-betacoronavirus vaccine candidate through a phylogenetically informed approach. SCIENCE ADVANCES 2023; 9:eabq4149. [PMID: 36652518 PMCID: PMC9848278 DOI: 10.1126/sciadv.abq4149] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 12/16/2022] [Indexed: 06/01/2023]
Abstract
Coronaviruses are a diverse family of viruses that crossed over into humans at least seven times, precipitating mild to catastrophic outcomes. The severe acute respiratory syndrome coronavirus 2 pandemic renewed efforts to identify strains with zoonotic potential and to develop pan-coronavirus vaccines. The analysis of 2181 coronavirus genomes (from 102 host species) confirmed the limited sequence conservation across genera (alpha-, beta-, delta-, and gammacoronavirus) and proteins. A phylogenetically informed pan-coronavirus vaccine was not feasible because of high genetic heterogeneity across genera. We focused on betacoronaviruses and identified nonhuman-infecting receptor binding domain (RBD) sequences that were more genetically similar to human coronaviruses than expected given their phylogenetic divergence. These human-like RBDs defined three phylogenetic clusters. A vaccine candidate based on a representative sequence for each cluster covers the diversity estimated to protect against existing and future human-infecting betacoronaviruses. Our findings emphasize the potential value of conceptualizing prophylaxis against zoonoses in terms of genetic, rather than species, diversity.
Collapse
Affiliation(s)
- Eric Lewitus
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine Inc., Bethesda, MD, USA
| | - Hongjun Bai
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine Inc., Bethesda, MD, USA
| | - Morgane Rolland
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine Inc., Bethesda, MD, USA
| |
Collapse
|
20
|
Zhou J, Xing Y, Zhou Z, Wang S. A comprehensive analysis of Usutu virus (USUV) genomes revealed lineage-specific codon usage patterns and host adaptations. Front Microbiol 2023; 13:967999. [PMID: 36713228 PMCID: PMC9878346 DOI: 10.3389/fmicb.2022.967999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 12/19/2022] [Indexed: 01/15/2023] Open
Abstract
The Usutu virus (USUV) is an emerging arbovirus virus maintained in the environment of Afro-Eurasia via a bird-mosquito-bird enzootic cycle and sporadically infected other vertebrates. Despite primarily asymptomatic or mild symptoms, humans infected by USUV can develop severe neurological diseases such as meningoencephalitis. However, no detailed study has yet been conducted to investigate its evolution from the perspective of codon usage patterns. Codon usage choice of viruses reflects the genetic variations that enable them to reconcile their viability and fitness toward the external environment and new hosts. This study performed a comprehensive evolution and codon usage analysis of USUVs. Our reconstructed phylogenetic tree confirmed that the circulation viruses belong to eight distinct lineages, reaffirmed by principal component analysis based on codon usage patterns. We also found a relatively small codon usage bias and that natural selection, mutation pressure, dinucleotide abundance, and evolutionary processes collectively shaped the codon usage of the USUV, with natural selection predominating over the others. Additionally, a complex interaction of codon usage between the USUV and its host was observed. This process could have enabled USUV to adapt to various hosts and vectors, including humans. Therefore, the USUV may possess a potential risk of cross-species transmission and subsequent outbreaks. In this respect, further epidemiologic surveys, diversity monitoring, and pathogenetic research are warranted.
Collapse
|
21
|
Identification and analysis of putative tRNA genes in baculovirus genomes. Virus Res 2022; 322:198949. [PMID: 36181979 DOI: 10.1016/j.virusres.2022.198949] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 09/26/2022] [Accepted: 09/27/2022] [Indexed: 12/24/2022]
Abstract
Transfer RNAs (tRNAs) genes are both coded for and arranged along some viral genomes representing the entire virosphere and seem to play different biological functions during infection, other than transferring the correct amino acid to a growing peptide chain. Baculovirus genome description and annotation has focused mostly on protein-coding genes, microRNA, and homologous regions. Here we carried out a large-scale in silico search for putative tRNA genes in baculovirus genomes. Ninety-six of 257 baculovirus genomes analyzed was found to contain at least one putative tRNA gene. We found great diversity in primary and secondary structure, in location within the genome, in intron presence and size, and in anti-codon identity. In some cases, genes of tRNA-containing genomes were found to have a bias for the codons specified by the tRNAs present in such genomes. Moreover, analysis revealed that most of the putative tRNA genes possessed conserved motifs for tRNA type 2 promoters, including the A-box and B-box motifs with few mismatches from the eukaryotic canonical motifs. From publicly available small RNA deep sequencing datasets of baculovirus-infected insect cells, we found evidence that a putative Autographa californica multiple nucleopolyhedrovirus Gln-tRNA gene was transcribed and modified with the addition of the non-templated 3'-CCA tail found at the end of all tRNAs. Further research is needed to determine the expression and functionality of these viral tRNAs.
Collapse
|
22
|
King KM, Rajadhyaksha EV, Tobey IG, Van Doorslaer K. Synonymous nucleotide changes drive papillomavirus evolution. Tumour Virus Res 2022; 14:200248. [PMID: 36265836 PMCID: PMC9589209 DOI: 10.1016/j.tvr.2022.200248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 10/11/2022] [Accepted: 10/12/2022] [Indexed: 11/06/2022] Open
Abstract
Papillomaviruses have been evolving alongside their hosts for at least 450 million years. This review will discuss some of the insights gained into the evolution of this diverse family of viruses. Papillomavirus evolution is constrained by pervasive purifying selection to maximize viral fitness. Yet these viruses need to adapt to changes in their environment, e.g., the host immune system. It has long been known that these viruses evolved a codon usage that doesn't match the infected host. Here we discuss how papillomavirus genomes evolve by acquiring synonymous changes that allow the virus to avoid detection by the host innate immune system without changing the encoded proteins and associated fitness loss. We discuss the implications of studying viral evolution, lifecycle, and cancer progression.
Collapse
Affiliation(s)
- Kelly M King
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA
| | - Esha Vikram Rajadhyaksha
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA; Department of Physiology and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Isabelle G Tobey
- Cancer Biology Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ, USA
| | - Koenraad Van Doorslaer
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA; Cancer Biology Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ, USA; The BIO5 Institute, The Department of Immunobiology, Genetics Graduate Interdisciplinary Program, UA Cancer Center, University of Arizona Tucson, Arizona, USA.
| |
Collapse
|
23
|
Comparative Analysis on the Codon Usage Pattern of the Chloroplast Genomes in Malus Species. Biochem Genet 2022; 61:1050-1064. [DOI: 10.1007/s10528-022-10302-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 11/08/2022] [Indexed: 11/23/2022]
|
24
|
Jiang L, Zhang Q, Xiao S, Si F. Deep decoding of codon usage strategies and host adaption preferences of soybean mosaic virus. Int J Biol Macromol 2022; 222:803-817. [PMID: 36167098 DOI: 10.1016/j.ijbiomac.2022.09.179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 09/13/2022] [Accepted: 09/17/2022] [Indexed: 11/05/2022]
Abstract
Soybean mosaic virus (SMV) has threatened the global yield of Leguminosae crops, but the mechanism of its infection, spread, and evolution remains unknown. A systemic analysis of 107 SMV strains was performed to explore the genome-wide codon usage profile and the various factors influencing the codon usage patterns of SMV, which provides insight into its molecular evolution and elucidates its unknown host adaptation pattern. The overall nucleotide composition and correlation analysis revealed that the preferred synonymous codons mostly end with A/U. Clustering by RSCU value of each strain and phylogenetic tree analysis showed that the SMV isolates studied were divided into four clades, with a low overall extent of codon usage bias (CUB) in SMV. According to the ENC, PR2, neutrality plot, and correspondence analysis, natural selection of geographical diversity may play a critical role in the CUB. Higher adaptability was shown in Glycine with SMV and more pressure was received by clade III. These findings could not only provide valuable information about the overall codon usage pattern of the SMV genome, but could also aid in the clarification of the involved mechanisms that dominate the codon usage patterns and genetic evolution of the SMV genome.
Collapse
Affiliation(s)
- Li Jiang
- Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
| | - Qiang Zhang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou 450009, China
| | - Shimin Xiao
- Shanwei Marine Industry Institute, Shanwei Institute of Technology, Shanwei 516600, China.
| | - Fusheng Si
- Institute of Animal Science and Veterinary Medicine, Shanghai Academy of Agricultural Sciences, Shanghai Key Laboratory of Agricultural Genetics and Breeding, Shanghai Engineering Research Center of Breeding Pig, Shanghai 201106, China.
| |
Collapse
|
25
|
Complete Chloroplast Genome of Gladiolus gandavensis (Gladiolus) and Genetic Evolutionary Analysis. Genes (Basel) 2022; 13:genes13091599. [PMID: 36140767 PMCID: PMC9498597 DOI: 10.3390/genes13091599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 08/26/2022] [Accepted: 09/04/2022] [Indexed: 11/25/2022] Open
Abstract
Gladiolus is an important ornamental plant that is one of the world’s four most-grown cut flowers. Gladiolus gandavensis has only been found in the Cangnan County (Zhejiang Province) of China, which is recorded in the “Botanical”. To explore the origin of G. gandavensis, chloroplast genome sequencing was conducted. The results indicated that a total of 151,654 bp of circular DNA was obtained. The chloroplast genome of G. gandavensis has a quadripartite structure (contains a large single-copy (LSC) region (81,547 bp), a small single-copy region (SSC) (17,895 bp), and two inverted repeats (IRs) (IRa and IRb, 52,212 bp)), similar to that of other species. In addition, a total of 84 protein-coding genes, 8 rRNA-encoding genes, and 38 tRNA-encoding genes were present in the chloroplast genome. To further study the structural characteristics of the chloroplast genome in G. gandavensis, a comparative analysis of eight species of the Iridaceae family was conducted, and the results revealed higher similarity in the IR regions than in the LSC and SSC regions. In addition, 265 simple sequence repeats (SSRs) were detected in this study. The results of the phylogenetic analysis indicated that the chloroplast genome of G. gandavensis has high homology with the Crocus cartwrightianus and Crocus sativus chloroplast genomes. Genetic analysis based on the rbcl sequence among 49 Gladiolus species showed that samples 42, 49, 50, and 54 had high homology with the three samples from China (64, 65, and 66), which might be caused by chance similarity in genotypes. These results suggest that G. gandavensis may have originated from South Africa.
Collapse
|
26
|
Subramanian K, Payne B, Feyertag F, Alvarez-Ponce D. The Codon Statistics Database: a Database of Codon Usage Bias. Mol Biol Evol 2022; 39:6647594. [PMID: 35859338 PMCID: PMC9372565 DOI: 10.1093/molbev/msac157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
We present the Codon Statistics Database, an online database that contains codon usage statistics for all the species with reference or representative genomes in RefSeq (over 15,000). The user can search for any species and access two sets of tables. One set lists, for each codon, the frequency, the Relative Synonymous Codon Usage, and whether the codon is preferred. Another set of tables lists, for each gene, its GC content, Effective Number of Codons, Codon Adaptation Index, and frequency of optimal codons. Equivalent tables can be accessed for (1) all nuclear genes, (2) nuclear genes encoding ribosomal proteins, (3) mitochondrial genes, and (4) chloroplast genes (if available in the relevant assembly). The user can also search for any taxonomic group (e.g., “primates”) and obtain a table comparing all the species in the group. The database is free to access without registration at http://codonstatsdb.unr.edu.
Collapse
Affiliation(s)
| | - Bryan Payne
- Biology Department, University of Nevada, Reno, Reno, NV, 89557
| | - Felix Feyertag
- Biology Department, University of Nevada, Reno, Reno, NV, 89557
| | | |
Collapse
|
27
|
Wu H, Li B, Miao Z, Hu L, Zhou L, Lu Y. Codon usage of host-specific P genotypes (VP4) in group A rotavirus. BMC Genomics 2022; 23:518. [PMID: 35842571 PMCID: PMC9288207 DOI: 10.1186/s12864-022-08730-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 06/30/2022] [Indexed: 11/26/2022] Open
Abstract
Background Group A rotavirus (RVA) is a common causative agent of acute gastroenteritis in infants and young children worldwide. RVA P genotypes, determined by VP4 sequences, have been confirmed to infect humans and animals. However, their codon usage patterns that are essential to obtain insights into the viral evolution, host adaptability, and genetic characterization remained unclear, especially across animal hosts. Results We performed a comprehensive codon usage analysis of eight host-specific RVA P genotypes, including human RVA (P[4] and P[8]), porcine RVA (P[13] and P[23]), and zoonotic RVA (P[1], P[6], P[7] and P[19]), based on 233 VP4 complete coding sequences. Nucleotide composition, relative synonymous codon usage (RSCU), and effective number of codons (ENC) were calculated. Principal component analysis (PCA) based on RSCU values was used to explore the codon usage patterns of different RVA P genotypes. In addition, mutation pressure and natural selection were identified by using ENC-plot, parity rule 2 plot, and neutrality plot analyses. All VP4 sequences preferred using A/U nucleotides (A: 0.354-0.377, U: 0.267-0.314) than G/C nucleotides across genotypes. Similarly, majority of commonly used synonymous codons were likely to end with A/U nucleotides (A: 9/18-12/18, U: 6/18-9/18). In PCA, human, porcine, and zoonotic genotypes clustered separately in terms of RSCU values, indicating the host-specific codon usage patterns; however, porcine and zoonotic genotypes were partly overlapped. Human genotypes, P[4] and P[8], had stronger codon usage bias, as indicated by more over-represented codons and lower ENC, compared to porcine and zoonotic genotypes. Moreover, natural selection was determined to be a predominant driver in shaping the codon usage bias across the eight P genotypes. In addition, mutation pressure contributed to the codon usage bias of human genotypes. Conclusions Our study identified a strong codon usage bias of human RVA P genotypes attributable to both natural selection and mutation pressure, whereas similar codon usage bias between porcine and zoonotic genotypes predominantly attributable to natural selection. It further suggests possible cross-species transmission. Therefore, it warrants further surveillance of RVA P genotypes for early identification of zoonotic infection. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08730-2.
Collapse
Affiliation(s)
- Han Wu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
| | - Bingzhe Li
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
| | - Ziping Miao
- Institute of Communicable Diseases Prevention and Control, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, 310052, Zhejiang, China
| | - Linjie Hu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
| | - Lu Zhou
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
| | - Yihan Lu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
28
|
Ahmed W, Gupta S, Singh D, Singh R. Insight of genetic features prevalent in three Echinoderm species (Apostichopus japonicus, Heliocedaris erythrogramma and Asterias rubens) and their evolutionary association using comparative codon pattern analysis. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
29
|
Shafat Z, Ahmed A, Parvez MK, Parveen S. Analysis of codon usage patterns in open reading frame 4 of hepatitis E viruses. BENI-SUEF UNIVERSITY JOURNAL OF BASIC AND APPLIED SCIENCES 2022; 11:65. [PMID: 35573872 PMCID: PMC9086417 DOI: 10.1186/s43088-022-00244-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 04/19/2022] [Indexed: 12/01/2022] Open
Abstract
Background Hepatitis E virus (HEV) is a member of the family Hepeviridae and causes acute HEV infections resulting in thousands of deaths worldwide. The zoonotic nature of HEV in addition to its tendency from human to human transmission has led scientists across the globe to work on its different aspects. HEV also accounts for about 30% mortality rates in case of pregnant women. The genome of HEV is organized into three open reading frames (ORFs): ORF1 ORF2 and ORF3. A reading frame encoded protein ORF4 has recently been discovered which is exclusive to GT 1 isolates of HEV. The ORF4 is suggested to play crucial role in pregnancy-associated pathology and enhanced replication. Though studies have documented the ORF4’s importance, the genetic features of ORF4 protein genes in terms of compositional patterns have not been elucidated. As codon usage performs critical role in establishment of the host–pathogen relationship, therefore, the present study reports the codon usage analysis (based on nucleotide sequences of HEV ORF4 available in the public database) in three hosts along with the factors influencing the codon usage patterns of the protein genes of ORF4 of HEV. Results The nucleotide composition analysis indicated that ORF4 protein genes showed overrepresentation of C nucleotide and while A nucleotide was the least-represented, with random distribution of G and T(U) nucleotides. The relative synonymous codon usage (RSCU) analysis revealed biasness toward C/G-ended codons (over U/A) in all three natural HEV-hosts (human, rat and ferret). It was observed that all the ORF4 genes were richly endowed with GC content. Further, our results showed the occurrence of both coincidence and antagonistic codon usage patterns among HEV-hosts. The findings further emphasized that both mutational and selection forces influenced the codon usage patterns of ORF4 protein genes. Conclusions To the best of our knowledge, this is first bioinformatics study evaluating codon usage patterns in HEV ORF4 protein genes. The findings from this study are expected to increase our understanding toward significant factors involved in evolutionary changes of ORF4. Supplementary Information The online version contains supplementary material available at 10.1186/s43088-022-00244-w.
Collapse
|
30
|
Shafat Z, Ahmed A, Parvez MK, Parveen S. Decoding the codon usage patterns in Y-domain region of hepatitis E viruses. J Genet Eng Biotechnol 2022; 20:56. [PMID: 35404024 PMCID: PMC9001762 DOI: 10.1186/s43141-022-00319-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 02/17/2022] [Indexed: 12/17/2022]
Abstract
Background Hepatitis E virus (HEV) is a positive-sense RNA virus belonging to the family Hepeviridae. The genome of HEV is organized into three open-reading frames (ORFs): ORF1, ORF2, and ORF3. The ORF1 non-structural Y-domain region (YDR) has been demonstrated to play an important role in the HEV pathogenesis. The nucleotide composition, synonymous codon usage bias in conjunction with other factors influencing the viral YDR genes of HEV have not been studied. Codon usage represents a significant mechanism in establishing the host-pathogen relationship. The present study for the first time elucidates the detailed codon usage patterns of YDR among HEV and HEV-hosts (Human, Rabbit, Mongoose, Pig, Wild boar, Camel, Monkey). Results The overall nucleotide composition revealed the abundance of C and U nucleotides in YDR genomes. The relative synonymous codon usage (RSCU) analysis indicated biasness towards C and U over A and G ended codons in HEV across all hosts. Codon frequency comparative analyses among HEV-hosts showed both similarities and discrepancies in usage of preferred codons encoding amino acids, which revealed that HEV codon preference neither completely differed nor completely showed similarity with its hosts. Thus, our results clearly indicated that the synonymous codon usage of HEV is a mixture of the two types of codon usage: coincidence and antagonism. Mutation pressure from virus and natural selection from host seems to be accountable for shaping the codon usage patterns in YDR. The study emphasised that the influence of compositional constraints, codon usage biasness, mutational alongside the selective forces were reflected in the occurrence of YDR codon usage patterns. Conclusions Our study is the first in its kind to have reported the analysis of codon usage patterns on a total of seven different natural HEV hosts. Therefore, knowledge of preferred codons obtained from our study will not only augment our understanding towards molecular evolution but is also envisaged to provide insight into the efficient viral expression, viral adaptation, and host effects on the HEV YDR codon usage. Supplementary Information The online version contains supplementary material available at 10.1186/s43141-022-00319-2.
Collapse
Affiliation(s)
- Zoya Shafat
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Anwar Ahmed
- Centre of Excellence in Biotechnology Research, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Mohammad K Parvez
- Department of Pharmacognosy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Shama Parveen
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India.
| |
Collapse
|
31
|
Mogro EG, Bottero D, Lozano MJ. Analysis of SARS-CoV-2 synonymous codon usage evolution throughout the COVID-19 pandemic. Virology 2022; 568:56-71. [PMID: 35134624 PMCID: PMC8808327 DOI: 10.1016/j.virol.2022.01.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 01/21/2022] [Accepted: 01/21/2022] [Indexed: 12/12/2022]
Abstract
SARS-CoV-2, the seventh coronavirus known to infect humans, can cause severe life-threatening respiratory pathologies. To better understand SARS-CoV-2 evolution, genome-wide analyses have been made, including the general characterization of its codons usage profile. Here we present a bioinformatic analysis of the evolution of SARS-CoV-2 codon usage over time using complete genomes collected since December 2019. Our results show that SARS-CoV-2 codon usage pattern is antagonistic to, and it is getting farther away from that of the human host. Further, a selection of deoptimized codons over time, which was accompanied by a decrease in both the codon adaptation index and the effective number of codons, was observed. All together, these findings suggest that SARS-CoV-2 could be evolving, at least from the perspective of the synonymous codon usage, to become less pathogenic.
Collapse
Affiliation(s)
- Ezequiel G Mogro
- Instituto de Biotecnología y Biología Molecular (IBBM), CONICET, CCT-La Plata, Universidad Nacional de La Plata (UNLP), Argentina
| | - Daniela Bottero
- Instituto de Biotecnología y Biología Molecular (IBBM), CONICET, CCT-La Plata, Universidad Nacional de La Plata (UNLP), Argentina
| | - Mauricio J Lozano
- Instituto de Biotecnología y Biología Molecular (IBBM), CONICET, CCT-La Plata, Universidad Nacional de La Plata (UNLP), Argentina.
| |
Collapse
|
32
|
Gaunt ER, Digard P. Compositional biases in RNA viruses: Causes, consequences and applications. WILEY INTERDISCIPLINARY REVIEWS. RNA 2022; 13:e1679. [PMID: 34155814 PMCID: PMC8420353 DOI: 10.1002/wrna.1679] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 05/29/2021] [Accepted: 05/31/2021] [Indexed: 01/05/2023]
Abstract
If each of the four nucleotides were represented equally in the genomes of viruses and the hosts they infect, each base would occur at a frequency of 25%. However, this is not observed in nature. Similarly, the order of nucleotides is not random (e.g., in the human genome, guanine follows cytosine at a frequency of ~0.0125, or a quarter the number of times predicted by random representation). Codon usage and codon order are also nonrandom. Furthermore, nucleotide and codon biases vary between species. Such biases have various drivers, including cellular proteins that recognize specific patterns in nucleic acids, that once triggered, induce mutations or invoke intrinsic or innate immune responses. In this review we examine the types of compositional biases identified in viral genomes and current understanding of the evolutionary mechanisms underpinning these trends. Finally, we consider the potential for large scale synonymous recoding strategies to engineer RNA virus vaccines, including those with pandemic potential, such as influenza A virus and Severe Acute Respiratory Syndrome Coronavirus Virus 2. This article is categorized under: RNA in Disease and Development > RNA in Disease RNA Evolution and Genomics > Computational Analyses of RNA RNA Interactions with Proteins and Other Molecules > Protein-RNA Recognition.
Collapse
Affiliation(s)
- Eleanor R. Gaunt
- Department of Infection and ImmunityThe Roslin Institute, The University of EdinburghEdinburghUK
| | - Paul Digard
- Department of Infection and ImmunityThe Roslin Institute, The University of EdinburghEdinburghUK
| |
Collapse
|
33
|
The low abundance of CpG in the SARS-CoV-2 genome is not an evolutionarily signature of ZAP. Sci Rep 2022; 12:2420. [PMID: 35165300 PMCID: PMC8844275 DOI: 10.1038/s41598-022-06046-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 12/28/2021] [Indexed: 12/23/2022] Open
Abstract
The zinc finger antiviral protein (ZAP) is known to restrict viral replication by binding to the CpG rich regions of viral RNA, and subsequently inducing viral RNA degradation. This enzyme has recently been shown to be capable of restricting SARS-CoV-2. These data have led to the hypothesis that the low abundance of CpG in the SARS-CoV-2 genome is due to an evolutionary pressure exerted by the host ZAP. To investigate this hypothesis, we performed a detailed analysis of many coronavirus sequences and ZAP RNA binding preference data. Our analyses showed neither evidence for an evolutionary pressure acting specifically on CpG dinucleotides, nor a link between the activity of ZAP and the low CpG abundance of the SARS-CoV-2 genome.
Collapse
|
34
|
He Z, Qin L, Wang W, Ding S, Xu X, Zhang S. The dinucleotide composition of sugarcane mosaic virus is shaped more by protein coding regions than by host species. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2022; 97:105165. [PMID: 34861431 DOI: 10.1016/j.meegid.2021.105165] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/23/2021] [Accepted: 11/26/2021] [Indexed: 12/12/2022]
Abstract
Sugarcane mosaic virus (SCMV), which belongs to the Potyvirus genus of the family Potyviridae, causes mosaic diseases in canna, sugarcane and maize worldwide. Previously, the genetic variations, timescale, codon usage patterns and host adaptions of SCMV were determined. However, the dinucleotide composition and the dinucleotide bias from hosts or the protein coding regions of the virus have yet to be investigated. In this study, comprehensive analyses of the dinucleotide composition and dinucleotide bias from hosts, lineages and protein coding regions of SCMV were performed using 131 complete genomic sequences. We found that UpG and CpA were largely overrepresented while UpA, CpC, and CpG were largely underrepresented in the polyprotein and 11 protein coding region data sets. SCMV dinucleotide composition bias is more strongly dependent on the protein coding regions than on hosts. A weak association between the dinucleotide composition and SCMV lineages was also observed. Our analysis provides a novel perspective on the molecular evolutionary mechanisms of SCMV and may provide a better understanding of future research on the origin and evolutionary patterns of SCMV.
Collapse
Affiliation(s)
- Zhen He
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China; Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China.
| | - Lang Qin
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China
| | - Wenzhi Wang
- Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, PR China
| | - Shiwen Ding
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China
| | - Xiaowei Xu
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China
| | - Shuzhen Zhang
- Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, PR China.
| |
Collapse
|
35
|
Andargie M, Congyi Z. Genome-wide analysis of codon usage in sesame ( Sesamum indicum L.). Heliyon 2022; 8:e08687. [PMID: 35106386 PMCID: PMC8789531 DOI: 10.1016/j.heliyon.2021.e08687] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/20/2021] [Accepted: 12/24/2021] [Indexed: 10/28/2022] Open
Abstract
Sesamum indicum is an ancient oil crop grown in tropical and subtropical areas of the world. We have analyzed 23,538 coding sequences (CDS) of S. indicum to understand the factors shaping codon usage in this important oil crop plant. We identified eleven highly preferred codons in S. indicum that have AT-endings. The slope of a neutrality plot was less than one while effective number of codons (ENC) plot showed distribution above and below the standard curve. There is a significant relationship between protein length and relative synonymous codon usage (RSCU) at the primary axis while there is a weak correlation between protein length and Nc values. Correspondence analysis conducted on RSCU values differentiated CDS based on their GC content and their characteristic feature and showed a discrete distribution. Moreover, by determining codon usage, we found out that majority of the lignan biosynthesis related genes showed a weaker codon usage bias. These results provide insights into understanding codon evolution in sesame.
Collapse
Affiliation(s)
- Mebeaselassie Andargie
- University of Goettingen, Molecular Phytopathology and Mycotoxin Research, Grisebachstrasse 6, 37077 Goettingen, Germany
| | - Zhu Congyi
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization (MOA), Guangdong Province Key Laboratory of Tropical and Subtropical Fruit Tree Research, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| |
Collapse
|
36
|
Nair RR, Mohan M, Rudramurthy GR, Vivekanandam R, Satheshkumar PS. Strategies and Patterns of Codon Bias in Molluscum Contagiosum Virus. Pathogens 2021; 10:1649. [PMID: 34959603 PMCID: PMC8703355 DOI: 10.3390/pathogens10121649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/14/2021] [Accepted: 12/16/2021] [Indexed: 11/22/2022] Open
Abstract
Trends associated with codon usage in molluscum contagiosum virus (MCV) and factors governing the evolution of codon usage have not been investigated so far. In this study, attempts were made to decipher the codon usage trends and discover the major evolutionary forces that influence the patterns of codon usage in MCV with special reference to sub-types 1 and 2, MCV-1 and MCV-2, respectively. Three hypotheses were tested: (1) codon usage patterns of MCV-1 and MCV-2 are identical; (2) SCUB (synonymous codon usage bias) patterns of MCV-1 and MCV-2 slightly deviate from that of human host to avoid affecting the fitness of host; and (3) translational selection predominantly shapes the SCUB of MCV-1 and MCV-2. Various codon usage indices viz. relative codon usage value, effective number of codons and codon adaptation index were calculated to infer the nature of codon usage. Correspondence analysis and correlation analysis were performed to assess the relative contribution of silent base contents and significance of codon usage indices in defining bias in codon usage. Among the tested hypotheses, only the second and third hypotheses were accepted.
Collapse
Affiliation(s)
- Rahul Raveendran Nair
- Centre for Evolutionary Ecology, Aushmath Biosciences, Vadavalli Post, Coimbatore 641041, India
| | - Manikandan Mohan
- College of Pharmacy, University of Georgia, Athens, GA 30605, USA;
| | | | - Reethu Vivekanandam
- Department of Biotechnology, Bharathiyar University, Coimbatore 641046, India;
| | | |
Collapse
|
37
|
Chakraborty S, Basumatary P, Nath D, Paul S, Uddin A. Compositional features and pattern of codon usage for mitochondrial CO genes among reptiles. Mitochondrion 2021; 62:111-121. [PMID: 34793987 DOI: 10.1016/j.mito.2021.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 11/02/2021] [Accepted: 11/10/2021] [Indexed: 11/27/2022]
Abstract
The phenomenon of non-random occurrence of synonymous nucleotide triplets (codons) in the coding sequences of genes is the codon usage bias (CUB). In this study, we used bioinformatic tool kit to analyze the compositional pattern and CUB of mitogenes namely COI, COII and COIII across different orders of reptiles. Estimation of overall base composition in the protein-coding sequences of COI, COII and COIII genes of the reptilian orders revealed an uneven usage of nucleotides. The overall count of A nucleotide was found to be the highest while the overall count of G nucleotide was the least. The CO genes across the three reptilian orders were prominently AT biased. Comparison of the GC proportion at each codon position displayed that GC1 percentage ranked the highest in all the three CO genes of the reptilian orders. SCUO values indicated weaker CUB, while considerable variation of SCUO values existed in the three CO genes across the studied reptiles. Relative synonymous codon usage (RSCU) values indicated that mostly the A ending codons were preferred. Based on the parameters namely neutrality plot, mutational responsive index and translational selection, we could conclude that natural selection was the major evolutionary force in COI, COII and COIII genes in the studied reptilian orders. However, correspondence analysis, parity plot and correlation studies indicated the existence of mutation pressure as well on the CO genes.
Collapse
Affiliation(s)
- Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| | | | - Durbba Nath
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Sunanda Paul
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi788150, Assam, India.
| |
Collapse
|
38
|
Chen Z, Utro F, Platt D, DeSalle R, Parida L, Chan PKS, Burk RD. K-Mer Analyses Reveal Different Evolutionary Histories of Alpha, Beta, and Gamma Papillomaviruses. Int J Mol Sci 2021; 22:9657. [PMID: 34502564 PMCID: PMC8432194 DOI: 10.3390/ijms22179657] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/04/2021] [Accepted: 09/05/2021] [Indexed: 12/31/2022] Open
Abstract
Papillomaviruses (PVs) are a heterogeneous group of DNA viruses that can infect fish, birds, reptiles, and mammals. PVs infecting humans (HPVs) phylogenetically cluster into five genera (Alpha-, Beta-, Gamma-, Mu- and Nu-PV), with differences in tissue tropism and carcinogenicity. The evolutionary features associated with the divergence of Papillomaviridae are not well understood. Using a combination of k-mer distributions, genetic metrics, and phylogenetic algorithms, we sought to evaluate the characteristics and differences of Alpha-, Beta- and Gamma-PVs constituting the majority of HPV genomes. A total of 640 PVs including 442 HPV types, 27 non-human primate PV types, and 171 non-primate animal PV types were evaluated. Our analyses revealed the highest genetic diversity amongst Gamma-PVs compared to the Alpha and Beta PVs, suggesting reduced selective pressures on Gamma-PVs. Using a sequence alignment-free trimer (k = 3) phylogeny algorithm, we reconstructed a phylogeny that grouped most HPV types into a monophyletic clade that was further split into three branches similar to alignment-based classifications. Interestingly, a subset of low-risk Alpha HPVs (the species Alpha-2, 3, 4, and 14) split from other HPVs and were clustered with non-human primate PVs. Surprisingly, the trimer-constructed phylogeny grouped the Gamma-6 species types originally isolated from the cervicovaginal region with the main Alpha-HPV clade. These data indicate that characterization of papillomavirus heterogeneity via orthogonal approaches reveals novel insights into the biological understanding of HPV genomes.
Collapse
Affiliation(s)
- Zigui Chen
- Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China;
- Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Filippo Utro
- Computational Genomics, IBM T. J. Watson Research, Yorktown Heights, NY 10598, USA; (F.U.); (D.P.); (L.P.)
| | - Daniel Platt
- Computational Genomics, IBM T. J. Watson Research, Yorktown Heights, NY 10598, USA; (F.U.); (D.P.); (L.P.)
| | - Rob DeSalle
- Sackler Institute of Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA;
| | - Laxmi Parida
- Computational Genomics, IBM T. J. Watson Research, Yorktown Heights, NY 10598, USA; (F.U.); (D.P.); (L.P.)
| | - Paul K. S. Chan
- Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China;
- Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Robert D. Burk
- Department of Pediatrics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY 10461, USA
- Department of Obstetrics, Gynecology and Woman’s Health, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| |
Collapse
|
39
|
Mollentze N, Babayan SA, Streicker DG. Identifying and prioritizing potential human-infecting viruses from their genome sequences. PLoS Biol 2021; 19:e3001390. [PMID: 34582436 PMCID: PMC8478193 DOI: 10.1371/journal.pbio.3001390] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 08/10/2021] [Indexed: 11/18/2022] Open
Abstract
Determining which animal viruses may be capable of infecting humans is currently intractable at the time of their discovery, precluding prioritization of high-risk viruses for early investigation and outbreak preparedness. Given the increasing use of genomics in virus discovery and the otherwise sparse knowledge of the biology of newly discovered viruses, we developed machine learning models that identify candidate zoonoses solely using signatures of host range encoded in viral genomes. Within a dataset of 861 viral species with known zoonotic status, our approach outperformed models based on the phylogenetic relatedness of viruses to known human-infecting viruses (area under the receiver operating characteristic curve [AUC] = 0.773), distinguishing high-risk viruses within families that contain a minority of human-infecting species and identifying putatively undetected or so far unrealized zoonoses. Analyses of the underpinnings of model predictions suggested the existence of generalizable features of viral genomes that are independent of virus taxonomic relationships and that may preadapt viruses to infect humans. Our model reduced a second set of 645 animal-associated viruses that were excluded from training to 272 high and 41 very high-risk candidate zoonoses and showed significantly elevated predicted zoonotic risk in viruses from nonhuman primates, but not other mammalian or avian host groups. A second application showed that our models could have identified Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) as a relatively high-risk coronavirus strain and that this prediction required no prior knowledge of zoonotic Severe Acute Respiratory Syndrome (SARS)-related coronaviruses. Genome-based zoonotic risk assessment provides a rapid, low-cost approach to enable evidence-driven virus surveillance and increases the feasibility of downstream biological and ecological characterization of viruses.
Collapse
Affiliation(s)
- Nardus Mollentze
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Simon A. Babayan
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Daniel G. Streicker
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| |
Collapse
|
40
|
Pereira-Gómez M, Carrau L, Fajardo Á, Moreno P, Moratorio G. Altering Compositional Properties of Viral Genomes to Design Live-Attenuated Vaccines. Front Microbiol 2021; 12:676582. [PMID: 34276608 PMCID: PMC8278477 DOI: 10.3389/fmicb.2021.676582] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 06/01/2021] [Indexed: 12/11/2022] Open
Abstract
Live-attenuated vaccines have been historically used to successfully prevent numerous diseases caused by a broad variety of RNA viruses due to their ability to elicit strong and perdurable immune-protective responses. In recent years, various strategies have been explored to achieve viral attenuation by rational genetic design rather than using classic and empirical approaches, based on successive passages in cell culture. A deeper understanding of evolutionary implications of distinct viral genomic compositional aspects, as well as substantial advances in synthetic biology technologies, have provided a framework to achieve new viral attenuation strategies. Herein, we will discuss different approaches that are currently applied to modify compositional features of viruses in order to develop novel live-attenuated vaccines.
Collapse
Affiliation(s)
- Marianoel Pereira-Gómez
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
- Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Lucía Carrau
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Álvaro Fajardo
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
- Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Pilar Moreno
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
- Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Gonzalo Moratorio
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
- Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| |
Collapse
|
41
|
Deb B, Uddin A, Chakraborty S. Analysis of codon usage of Horseshoe Bat Hepatitis B virus and its host. Virology 2021; 561:69-79. [PMID: 34171764 DOI: 10.1016/j.virol.2021.05.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 05/07/2021] [Accepted: 05/19/2021] [Indexed: 11/28/2022]
Abstract
In the present analysis, codon usage strategies and base distribution of Horseshoe bat hepatitis B virus (HBHBV) were analyzed and compared with its host Rhinolophus sinicus, as no work was yet reported. The magnitude of synonymous codon usage bias (CUB) in the virus and its host was low with higher proportion of the base C. Notably, 21 more frequently used codons, 19 less frequently used codons and 3 underrepresented codons (TCG, ACG and GCG) were found to be similar in both virus and its host coding sequences. Neutrality plot analysis reported greater role of natural selection in HBHBV (67.84%) and R. sinicus (76.90%) over mutation pressure. Base skewness and protein properties also influenced the CUB of genes. Further, codon usage analysis depicted, HBHBV and R. sinicus had many similarities in codon usage patterns that might reflect viral adaptation to its host.
Collapse
Affiliation(s)
- Bornali Deb
- Department of Biotechnology, Assam University, Silchar, 788150, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, 788150, Assam, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, 788150, Assam, India.
| |
Collapse
|
42
|
Genome-Wide Analysis of Codon Usage Patterns of SARS-CoV-2 Virus Reveals Global Heterogeneity of COVID-19. Biomolecules 2021; 11:biom11060912. [PMID: 34207362 PMCID: PMC8233742 DOI: 10.3390/biom11060912] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/14/2021] [Accepted: 06/14/2021] [Indexed: 12/14/2022] Open
Abstract
The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons' A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome's level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.
Collapse
|
43
|
Evolutionary Patterns of Codon Usage in Major Lineages of Porcine Reproductive and Respiratory Syndrome Virus in China. Viruses 2021; 13:v13061044. [PMID: 34072978 PMCID: PMC8228872 DOI: 10.3390/v13061044] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 05/16/2021] [Accepted: 05/25/2021] [Indexed: 11/17/2022] Open
Abstract
Porcine reproductive and respiratory syndrome virus (PRRSV) is economically important and characterized by its extensive variation. The codon usage patterns and their influence on viral evolution and host adaptation among different PRRSV strains remain largely unknown. Here, the codon usage of ORF5 genes from lineages 1, 3, 5, and 8, and MLV strains of type 2 PRRSV in China was analyzed. A compositional property analysis of ORF5 genes revealed that nucleotide C is most frequently used at the third position of codons, accompanied by rich GC3s. The effective number of codon (ENC) and codon pair bias (CPB) values indicate that all ORF5 genes have low codon bias and the differences in CPB scores among four lineages are almost not significant. When compared with host codon usage patterns, lineage 1 strains show higher CAI and SiD values, with a high similarity to pig, which might relate to its predominant epidemic propensity in the field. The CAI, RCDI, and SiD values of ORF5 genes from different passages of MLV JXA1R indicate no relation between attenuation and CPB or codon adaptation decrease during serial passage on non-host cells. These findings provide a novel way of understanding the PRRSV's evolution, related to viral survival, host adaptation, and virulence.
Collapse
|
44
|
Franzo G. SARS-CoV-2 and other human coronavirus show genome patterns previously associated to reduced viral recognition and altered immune response. Sci Rep 2021; 11:10696. [PMID: 34021237 PMCID: PMC8139983 DOI: 10.1038/s41598-021-90278-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 05/10/2021] [Indexed: 12/23/2022] Open
Abstract
A new pandemic caused by the betacoronavirus SARS-CoV-2 originated in China in late 2019. Although often asymptomatic, a relevant percentage of affected people can develop severe pneumonia. Initial evidence suggests that dysregulation of the immune response could contribute to the pathogenesis, as previously demonstrated for SARS-CoV. The presence of genome composition features involved in delaying viral recognition is herein investigated for human coronaviruses (HCoVs), with a special emphasis on SARS-CoV-2. A broad collection of HCoVs polyprotein, envelope, matrix, nucleocapsid and spike coding sequences was downloaded and several statistics representative of genome composition and codon bias were investigated. A model able to evaluate and test the presence of a significant under- or over-representation of dinucleotide pairs while accounting for the underlying codon bias and protein sequence was also implemented. The study revealed the significant under-representation of CpG dinucleotide pair in all HcoV, but especially in SARS-CoV and even more in SARS-CoV-2. The presence of forces acting to minimize CpG content was confirmed by relative synonymous codon usage pattern. Codons containing the CpG pair were severely under-represented, primarily in the polyprotein and spike coding sequences of SARS-CoV-2. Additionally, a significant under-representation of the TpA pair was observed in the N and S region of SARS-CoV and SARS-CoV-2. Increasing experimental evidence has proven that CpG and TpA are targeted by innate antiviral host defences, contributing both to RNA degradation and RIG-1 mediated interferon production. The low content of these dinucleotides could contribute to a delayed interferon production, dysregulated immune response, higher viral replication and poor outcome. Significantly, the RIG-1 signalling pathway was proven to be defective in elderlies, suggesting a likely interaction between limited viral recognition and lower responsiveness in interferon production that could justify the higher disease severity and mortality in older patients.
Collapse
Affiliation(s)
- Giovanni Franzo
- Department of Animal Medicine, Production and Health (MAPS), University of Padua, Viale dell'Università 16, 35020, Legnaro, Padua, Italy.
| |
Collapse
|
45
|
Mordstein C, Cano L, Morales AC, Young B, Ho AT, Rice AM, Liss M, Hurst LD, Kudla G. Transcription, mRNA export and immune evasion shape the codon usage of viruses. Genome Biol Evol 2021; 13:6275682. [PMID: 33988683 PMCID: PMC8410142 DOI: 10.1093/gbe/evab106] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/10/2021] [Indexed: 12/15/2022] Open
Abstract
The nucleotide composition, dinucleotide composition, and codon usage of many viruses differs from their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analysed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host - instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.
Collapse
Affiliation(s)
- Christine Mordstein
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK.,The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Laura Cano
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
| | - Atahualpa Castillo Morales
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Bethan Young
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK.,The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Alexander T Ho
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Alan M Rice
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Michael Liss
- Thermo Fisher Scientific, GENEART GmbH, Regensburg, Germany
| | - Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Grzegorz Kudla
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
46
|
Hernandez-Alias X, Benisty H, Schaefer MH, Serrano L. Translational adaptation of human viruses to the tissues they infect. Cell Rep 2021; 34:108872. [PMID: 33730572 PMCID: PMC7962955 DOI: 10.1016/j.celrep.2021.108872] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 12/15/2020] [Accepted: 02/23/2021] [Indexed: 12/22/2022] Open
Abstract
Viruses need to hijack the translational machinery of the host cell for a productive infection to happen. However, given the dynamic landscape of tRNA pools among tissues, it is unclear whether different viruses infecting different tissues have adapted their codon usage toward their tropism. Here, we collect the coding sequences of 502 human-infecting viruses and determine that tropism explains changes in codon usage. Using the tRNA abundances across 23 human tissues from The Cancer Genome Atlas (TCGA), we build an in silico model of translational efficiency that validates the correspondence of the viral codon usage with the translational machinery of their tropism. For instance, we detect that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is specifically adapted to the upper respiratory tract and alveoli. Furthermore, this correspondence is specifically defined in early viral proteins. The observed tissue-specific translational efficiency could be useful for the development of antiviral therapies and vaccines.
Collapse
Affiliation(s)
- Xavier Hernandez-Alias
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain.
| | - Hannah Benisty
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Martin H Schaefer
- IEO European Institute of Oncology IRCCS, Department of Experimental Oncology, Via Adamello 16, Milan 20139, Italy.
| | - Luis Serrano
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; Universitat Pompeu Fabra (UPF), Barcelona 08002, Spain; ICREA, Pg. Lluís Companys 23, Barcelona 08010, Spain.
| |
Collapse
|
47
|
Li L, Hu Y, He M, Zhang B, Wu W, Cai P, Huo D, Hong Y. Comparative chloroplast genomes: insights into the evolution of the chloroplast genome of Camellia sinensis and the phylogeny of Camellia. BMC Genomics 2021; 22:138. [PMID: 33637038 PMCID: PMC7912895 DOI: 10.1186/s12864-021-07427-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 02/05/2021] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Chloroplast genome resources can provide useful information for the evolution of plant species. Tea plant (Camellia sinensis) is among the most economically valuable member of Camellia. Here, we determined the chloroplast genome of the first natural triploid Chinary type tea ('Wuyi narcissus' cultivar of Camellia sinensis var. sinensis, CWN) and conducted the genome comparison with the diploid Chinary type tea (Camellia sinensis var. sinensis, CSS) and two types of diploid Assamica type teas (Camellia sinensis var. assamica: Chinese Assamica type tea, CSA and Indian Assamica type tea, CIA). Further, the evolutionary mechanism of the chloroplast genome of Camellia sinensis and the relationships of Camellia species based on chloroplast genome were discussed. RESULTS Comparative analysis showed the evolutionary dynamics of chloroplast genome of Camellia sinensis were the repeats and insertion-deletions (indels), and distribution of the repeats, indels and substitutions were significantly correlated. Chinese tea and Indian tea had significant differences in the structural characteristic and the codon usage of the chloroplast genome. Analysis of sequence characterized amplified region (SCAR) using sequences of the intergenic spacers (trnE/trnT) showed none of 292 different Camellia sinensis cultivars had similar sequence characteristic to triploid CWN, but the other four Camellia species did. Estimations of the divergence time showed that CIA diverged from the common ancestor of two Assamica type teas about 6.2 Mya (CI: 4.4-8.1 Mya). CSS and CSA diverged to each other about 0.8 Mya (CI: 0.4-1.5 Mya). Moreover, phylogenetic clustering was not exactly consistent with the current taxonomy of Camellia. CONCLUSIONS The repeat-induced and indel-induced mutations were two important dynamics contributed to the diversification of the chloroplast genome in Camellia sinensis, which were not mutually exclusive. Chinese tea and Indian tea might have undergone different selection pressures. Chloroplast transfer occurred during the polyploid evolution in Camellia sinensis. In addition, our results supported the three different domestication origins of Chinary type tea, Chinese Assamica type tea and Indian Assamica type tea. And, the current classification of some Camellia species might need to be further discussed.
Collapse
Affiliation(s)
- Li Li
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China.
| | - Yunfei Hu
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Min He
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Bo Zhang
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Wei Wu
- College of Mathematics and Computer Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Pumo Cai
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Da Huo
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Yongcong Hong
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China.
| |
Collapse
|
48
|
Bergner LM, Mollentze N, Orton RJ, Tello C, Broos A, Biek R, Streicker DG. Characterizing and Evaluating the Zoonotic Potential of Novel Viruses Discovered in Vampire Bats. Viruses 2021; 13:252. [PMID: 33562073 PMCID: PMC7914986 DOI: 10.3390/v13020252] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/25/2021] [Accepted: 02/03/2021] [Indexed: 12/17/2022] Open
Abstract
The contemporary surge in metagenomic sequencing has transformed knowledge of viral diversity in wildlife. However, evaluating which newly discovered viruses pose sufficient risk of infecting humans to merit detailed laboratory characterization and surveillance remains largely speculative. Machine learning algorithms have been developed to address this imbalance by ranking the relative likelihood of human infection based on viral genome sequences, but are not yet routinely applied to viruses at the time of their discovery. Here, we characterized viral genomes detected through metagenomic sequencing of feces and saliva from common vampire bats (Desmodus rotundus) and used these data as a case study in evaluating zoonotic potential using molecular sequencing data. Of 58 detected viral families, including 17 which infect mammals, the only known zoonosis detected was rabies virus; however, additional genomes were detected from the families Hepeviridae, Coronaviridae, Reoviridae, Astroviridae and Picornaviridae, all of which contain human-infecting species. In phylogenetic analyses, novel vampire bat viruses most frequently grouped with other bat viruses that are not currently known to infect humans. In agreement, machine learning models built from only phylogenetic information ranked all novel viruses similarly, yielding little insight into zoonotic potential. In contrast, genome composition-based machine learning models estimated different levels of zoonotic potential, even for closely related viruses, categorizing one out of four detected hepeviruses and two out of three picornaviruses as having high priority for further research. We highlight the value of evaluating zoonotic potential beyond ad hoc consideration of phylogeny and provide surveillance recommendations for novel viruses in a wildlife host which has frequent contact with humans and domestic animals.
Collapse
Affiliation(s)
- Laura M. Bergner
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK; (N.M.); (R.B.); (D.G.S.)
- MRC–University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK; (R.J.O.); (A.B.)
| | - Nardus Mollentze
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK; (N.M.); (R.B.); (D.G.S.)
- MRC–University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK; (R.J.O.); (A.B.)
| | - Richard J. Orton
- MRC–University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK; (R.J.O.); (A.B.)
| | - Carlos Tello
- Association for the Conservation and Development of Natural Resources, Lima 15037, Peru;
- Yunkawasi, Lima 15049, Peru
| | - Alice Broos
- MRC–University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK; (R.J.O.); (A.B.)
| | - Roman Biek
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK; (N.M.); (R.B.); (D.G.S.)
| | - Daniel G. Streicker
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK; (N.M.); (R.B.); (D.G.S.)
- MRC–University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK; (R.J.O.); (A.B.)
| |
Collapse
|
49
|
Deb B, Uddin A, Chakraborty S. Composition, codon usage pattern, protein properties, and influencing factors in the genomes of members of the family Anelloviridae. Arch Virol 2021; 166:461-474. [PMID: 33392821 PMCID: PMC7779081 DOI: 10.1007/s00705-020-04890-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 10/02/2020] [Indexed: 01/31/2023]
Abstract
The present study was carried out on 62 genome sequences of members of the family Anelloviridae, as there have been no reports of genome analysis of these DNA viruses using a bioinformatics approach. The genes were found to be rich in AC content with low codon usage bias (CUB). Relative synonymous codon usage (RSCU) values identified the preferred codons for each amino acid in the family. The codon AGA was overrepresented, while the codons TCG, TTG, CGG, CGT, ACG, GCG and GAT were underrepresented in all of the genomes. A significant correlation was found between the effective number of codons (ENC) and base constraints, indicating that compositional properties might have influenced the CUB. A highly significant correlation was observed between the overall base content and the base content at the third codon position, indicating that mutations might have affected the CUB. A highly significant positive correlation was observed between GC12 and GC3 (r = 0.904, p < 0.01), which indicated that directional mutation pressure influenced all three codon positions. A neutrality plot revealed that the contribution of mutation and natural selection in determining the CUB was 58.6% and 41.4%, respectively.
Collapse
Affiliation(s)
- Bornali Deb
- Department of Biotechnology, Assam University, Silchar, Assam 788150 India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, Assam 788150 India
| | | |
Collapse
|
50
|
Chen Z, Boon SS, Wang MH, Chan RWY, Chan PKS. Genomic and evolutionary comparison between SARS-CoV-2 and other human coronaviruses. J Virol Methods 2020; 289:114032. [PMID: 33290786 PMCID: PMC7718587 DOI: 10.1016/j.jviromet.2020.114032] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 11/15/2020] [Accepted: 12/02/2020] [Indexed: 11/28/2022]
Abstract
Three highly pathogenic human coronaviruses can cause severe acute respiratory syndrome (SARS-CoV, SARS-CoV-2 and MERS-CoV). Although phylogenetic analyses have indicated ancient origin of human coronaviruses from animal relatives, their evolutionary history remains to be established. Using phylogenetics and “high order genomic structures” including trimer spectrums, codon usage and dinucleotide suppression, we observed distinct clustering of all human coronaviruses that formed phylogenetic clades with their closest animal relatives, indicating they have encompassed long evolutionary histories within specific ecological niches before jumping species barrier to infect humans. The close relationships between SARS-CoV and SARS-CoV-2 imply similar evolutionary origin. However, a lower Effective Codon Number (ENC) pattern and CpG dinucleotide suppression in SARS-CoV-2 genomes compared to SARS-CoV and MERS-CoV may imply a better host fitness, and thus their success in sustaining a pandemic. Characterization of coronavirus heterogeneity via complementary approaches enriches our understanding on the evolution and virus-host interaction of these emerging human pathogens while the underlying mechanistic basis in pathogenicity warrants further investigation.
Collapse
Affiliation(s)
- Zigui Chen
- Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region.
| | - Siaw S Boon
- Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region
| | - Maggie H Wang
- Jockey Club School of Public Health and Primary Care, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region
| | - Renee W Y Chan
- Department of Paediatrics, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region
| | - Paul K S Chan
- Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region; Stanley Ho Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong Special Administrative Region
| |
Collapse
|