1
|
Fang R, Yang X, Guo Y, Peng B, Dong R, Li S, Xu S. SARS-CoV-2 infection in animals: Patterns, transmission routes, and drivers. ECO-ENVIRONMENT & HEALTH 2024; 3:45-54. [PMID: 38169914 PMCID: PMC10758742 DOI: 10.1016/j.eehl.2023.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 09/05/2023] [Accepted: 09/17/2023] [Indexed: 01/05/2024]
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is more widespread in animals than previously thought, and it may be able to infect a wider range of domestic and wild species. To effectively control the spread of the virus and protect animal health, it is crucial to understand the cross-species transmission mechanisms and risk factors of SARS-CoV-2. This article collects published literature on SARS-CoV-2 in animals and examines the distribution, transmission routes, biophysical, and anthropogenic drivers of infected animals. The reported cases of infection in animals are mainly concentrated in South America, North America, and Europe, and species affected include lions, white-tailed deer, pangolins, minks, and cats. Biophysical factors influencing infection of animals with SARS-CoV-2 include environmental determinants, high-risk landscapes, air quality, and susceptibility of different animal species, while anthropogenic factors comprise human behavior, intensive livestock farming, animal markets, and land management. Due to current research gaps and surveillance capacity shortcomings, future mitigation strategies need to be designed from a One Health perspective, with research focused on key regions with significant data gaps in Asia and Africa to understand the drivers, pathways, and spatiotemporal dynamics of interspecies transmission.
Collapse
Affiliation(s)
- Ruying Fang
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Xin Yang
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yiyang Guo
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Bingjie Peng
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Ruixuan Dong
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Sen Li
- School of Environmental Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Shunqing Xu
- School of Life Sciences, Hainan University, Haikou 570228, China
| |
Collapse
|
2
|
Li J, Tian F, Zhang S, Liu SS, Kang XP, Li YD, Wei JQ, Lin W, Lei Z, Feng Y, Jiang JF, Jiang T, Tong Y. Genomic representation predicts an asymptotic host adaptation of bat coronaviruses using deep learning. Front Microbiol 2023; 14:1157608. [PMID: 37213516 PMCID: PMC10198438 DOI: 10.3389/fmicb.2023.1157608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 04/03/2023] [Indexed: 05/23/2023] Open
Abstract
Introduction Coronaviruses (CoVs) are naturally found in bats and can occasionally cause infection and transmission in humans and other mammals. Our study aimed to build a deep learning (DL) method to predict the adaptation of bat CoVs to other mammals. Methods The CoV genome was represented with a method of dinucleotide composition representation (DCR) for the two main viral genes, ORF1ab and Spike. DCR features were first analyzed for their distribution among adaptive hosts and then trained with a DL classifier of convolutional neural networks (CNN) to predict the adaptation of bat CoVs. Results and discussion The results demonstrated inter-host separation and intra-host clustering of DCR-represented CoVs for six host types: Artiodactyla, Carnivora, Chiroptera, Primates, Rodentia/Lagomorpha, and Suiformes. The DCR-based CNN with five host labels (without Chiroptera) predicted a dominant adaptation of bat CoVs to Artiodactyla hosts, then to Carnivora and Rodentia/Lagomorpha mammals, and later to primates. Moreover, a linear asymptotic adaptation of all CoVs (except Suiformes) from Artiodactyla to Carnivora and Rodentia/Lagomorpha and then to Primates indicates an asymptotic bats-other mammals-human adaptation. Conclusion Genomic dinucleotides represented as DCR indicate a host-specific separation, and clustering predicts a linear asymptotic adaptation shift of bat CoVs from other mammals to humans via deep learning.
Collapse
Affiliation(s)
- Jing Li
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Fengjuan Tian
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
| | - Sen Zhang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Shun-Shuai Liu
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Xiao-Ping Kang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Ya-Dan Li
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Jun-Qing Wei
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
| | - Wei Lin
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
| | - Zhongyi Lei
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
| | - Ye Feng
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
| | - Jia-Fu Jiang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
- Jia-Fu Jiang
| | - Tao Jiang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, AMMS, Beijing, China
- Tao Jiang
| | - Yigang Tong
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
- *Correspondence: Yigang Tong
| |
Collapse
|
3
|
Tai JH, Sun HY, Tseng YC, Li G, Chang SY, Yeh SH, Chen PJ, Chaw SM, Wang HY. Contrasting patterns in the early stage of SARS-CoV-2 evolution between humans and minks. Mol Biol Evol 2022; 39:6658056. [PMID: 35934827 PMCID: PMC9384665 DOI: 10.1093/molbev/msac156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
One of the unique features of SARS-CoV-2 is its apparent neutral evolution during the early pandemic (before February 2020). This contrasts with the preceding SARS-CoV epidemics, where viruses evolved adaptively. SARS-CoV-2 may exhibit a unique or adaptive feature which deviates from other coronaviruses. Alternatively, the virus may have been cryptically circulating in humans for a sufficient time to have acquired adaptive changes before the onset of the current pandemic. To test the scenarios above, we analyzed the SARS-CoV-2 sequences from minks (Neovision vision) and parental humans. In the early phase of the mink epidemic (April to May 2020), nonsynonymous to synonymous mutation ratio per site in the spike protein is 2.93, indicating a selection process favoring adaptive amino acid changes. Mutations in the spike protein were concentrated within its receptor binding domain and receptor binding motif. An excess of high frequency derived variants produced by genetic hitchhiking was found during the middle (June to July 2020) and late phase I (August to September 2020) of the mink epidemic. In contrast, the site frequency spectra of early SARS-CoV-2 in humans only show an excess of low frequency mutations, consistent with the recent outbreak of the virus. Strong positive selection in the mink SARS-CoV-2 implies the virus may not be pre-adapted to a wide range of hosts and illustrates how a virus evolves to establish a continuous infection in a new host. Therefore, the lack of positive selection signal during the early pandemic in humans deserves further investigation.
Collapse
Affiliation(s)
- Jui Hung Tai
- Graduate Institute of Clinical Medicine, College of Medicine, National Taiwan University, Taipei 10002, Taiwan.,Genome and Systems Biology Degree Program, National Taiwan University and Academia Sinica, Taipei 10617, Taiwan
| | - Hsiao Yu Sun
- Taipei Municipal Zhongshan Girls High School, Taipei 10490, Taiwan
| | - Yi Cheng Tseng
- Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei 10617, Taiwan
| | - Guanghao Li
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Sui Yuan Chang
- Department of Clinical Laboratory Sciences and Medical Biotechnology, College of Medicine, National Taiwan University, Taipei 10002, Taiwan
| | - Shiou Hwei Yeh
- Department of Microbiology, College of Medicine, National Taiwan University, Taipei 10617, Taiwan
| | - Pei Jer Chen
- Graduate Institute of Clinical Medicine, College of Medicine, National Taiwan University, Taipei 10002, Taiwan.,Department of Microbiology, College of Medicine, National Taiwan University, Taipei 10617, Taiwan.,Hepatitis Research Center, National Taiwan University College of Medicine and National Taiwan University Hospital, Taipei 10002, Taiwan.,Department of Internal Medicine, National Taiwan University College of Medicine and National Taiwan University Hospital, Taipei 10002, Taiwan.,Department of Medical Research, National Taiwan University College of Medicine and National Taiwan University Hospital, Taipei 10002, Taiwan
| | - Shu Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Hurng Yi Wang
- Graduate Institute of Clinical Medicine, College of Medicine, National Taiwan University, Taipei 10002, Taiwan.,Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei 10617, Taiwan.,Graduate Institute of Medical Genomics and Proteomics, National Taiwan University College of Medicine, Taipei 10002, Taiwan
| |
Collapse
|
4
|
Li J, Wu YN, Zhang S, Kang XP, Jiang T. Deep learning based on biologically interpretable genome representation predicts two types of human adaptation of SARS-CoV-2 variants. Brief Bioinform 2022; 23:6540151. [PMID: 35233612 PMCID: PMC9116219 DOI: 10.1093/bib/bbac036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 01/23/2022] [Accepted: 01/25/2022] [Indexed: 12/27/2022] Open
Abstract
Explosively emerging SARS-CoV-2 variants challenge current nomenclature schemes based on genetic diversity and biological significance. Genomic composition-based machine learning methods have recently performed well in identifying phenotype–genotype relationships. We introduced a framework involving dinucleotide (DNT) composition representation (DCR) to parse the general human adaptation of RNA viruses and applied a three-dimensional convolutional neural network (3D CNN) analysis to learn the human adaptation of other existing coronaviruses (CoVs) and predict the adaptation of SARS-CoV-2 variants of concern (VOCs). A markedly separable, linear DCR distribution was observed in two major genes—receptor-binding glycoprotein and RNA-dependent RNA polymerase (RdRp)—of six families of single-stranded (ssRNA) viruses. Additionally, there was a general host-specific distribution of both the spike proteins and RdRps of CoVs. The 3D CNN based on spike DCR predicted a dominant type II adaptation of most Beta, Delta and Omicron VOCs, with high transmissibility and low pathogenicity. Type I adaptation with opposite transmissibility and pathogenicity was predicted for SARS-CoV-2 Alpha VOCs (77%) and Kappa variants of interest (58%). The identified adaptive determinants included D1118H and A570D mutations and local DNTs. Thus, the 3D CNN model based on DCR features predicts SARS-CoV-2, a major type II human adaptation and is qualified to predict variant adaptation in real time, facilitating the risk-assessment of emerging SARS-CoV-2 variants and COVID-19 control.
Collapse
Affiliation(s)
- Jing Li
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology 10 and Epidemiology, Beijing 100071, China
| | - Ya-Nan Wu
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology 10 and Epidemiology, Beijing 100071, China
| | - Sen Zhang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology 10 and Epidemiology, Beijing 100071, China
| | - Xiao-Ping Kang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology 10 and Epidemiology, Beijing 100071, China
| | - Tao Jiang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology 10 and Epidemiology, Beijing 100071, China
| |
Collapse
|
5
|
The low abundance of CpG in the SARS-CoV-2 genome is not an evolutionarily signature of ZAP. Sci Rep 2022; 12:2420. [PMID: 35165300 PMCID: PMC8844275 DOI: 10.1038/s41598-022-06046-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 12/28/2021] [Indexed: 12/23/2022] Open
Abstract
The zinc finger antiviral protein (ZAP) is known to restrict viral replication by binding to the CpG rich regions of viral RNA, and subsequently inducing viral RNA degradation. This enzyme has recently been shown to be capable of restricting SARS-CoV-2. These data have led to the hypothesis that the low abundance of CpG in the SARS-CoV-2 genome is due to an evolutionary pressure exerted by the host ZAP. To investigate this hypothesis, we performed a detailed analysis of many coronavirus sequences and ZAP RNA binding preference data. Our analyses showed neither evidence for an evolutionary pressure acting specifically on CpG dinucleotides, nor a link between the activity of ZAP and the low CpG abundance of the SARS-CoV-2 genome.
Collapse
|
6
|
Kumar A, Goyal N, Saranathan N, Dhamija S, Saraswat S, Menon MB, Vivekanandan P. The slowing rate of CpG depletion in SARS-CoV-2 genomes is consistent with adaptations to the human host. Mol Biol Evol 2022; 39:6521032. [PMID: 35134218 PMCID: PMC8892944 DOI: 10.1093/molbev/msac029] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Depletion of CpG dinucleotides in SARS-CoV-2 genomes has been linked to virus evolution, host-switching, virus replication, and innate immune responses. Temporal variations, if any, in the rate of CpG depletion during virus evolution in the host remain poorly understood. Here, we analysed the CpG content of over 1.4 million full-length SARS-CoV-2 genomes representing over 170 million documented infections during the first 17 months of the pandemic. Our findings suggest that the extent of CpG depletion in SARS-CoV-2 genomes is modest. Interestingly, the rate of CpG depletion is highest during early evolution in humans and it gradually tapers off almost reaching an equilibrium; this is consistent with adaptations to the human host. Furthermore, within the coding regions, CpG depletion occurs predominantly at codon positions 2-3 and 3-1. Loss of ZAP-binding motifs in SARS-CoV-2 genomes is primarily driven by the loss of the terminal CpG in the motifs. Nonetheless, majority of the CpG depletion in SARS-CoV-2 genomes occurs outside ZAP-binding motifs. SARS-CoV-2 genomes selectively lose CpGs-motifs from a U-rich context; this may help avoid immune recognition by TLR7. SARS-CoV-2 alpha-, beta- and delta-variants of concern have reduced CpG content compared to sequences from the beginning of the pandemic. In sum, we provide evidence that the rate of CpG depletion in virus genomes is not uniform and it greatly varies over time and during adaptations to the host. This work highlights how temporal variations in selection pressures during virus adaption may impact the rate and the extent of CpG depletion in virus genomes.
Collapse
Affiliation(s)
- Akhil Kumar
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Nishank Goyal
- Department of Chemical Engineering, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Nandhini Saranathan
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Sonam Dhamija
- CSIR-Institute of Genomics and Integrative Biology, New Delhi-110025, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad- 201002, India
| | - Saurabh Saraswat
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Manoj B Menon
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Perumal Vivekanandan
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| |
Collapse
|
7
|
Gomes Noll JC, do Nascimento GM, Diel DG. Natural Transmission and Experimental Models of SARS CoV-2 Infection in Animals. Comp Med 2021; 71:369-382. [PMID: 34702427 PMCID: PMC8594260 DOI: 10.30802/aalas-cm-21-000046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 06/14/2021] [Accepted: 09/21/2021] [Indexed: 12/21/2022]
Abstract
Since the World Health Organization declared COVID-19 a pandemic in March 2020, millions of people have contracted SARS-CoV-2 and died from the infection. Several domestic and wild species have contracted the disease as well. From the beginning, scientists have been working to develop vaccines and establish therapies that can prevent disease development and improve the clinical outcome in infected people. To understand various aspects of viral pathogenesis and infection dynamics and to support preclinical evaluation of vaccines and therapeutics, a diverse number of animal species have been evaluated for use as models of the disease and infection in humans. Here, we discuss natural SARS-CoV-2 infection of domestic and captive wild animals, as well as the susceptibility of several species to experimental infection with this virus.
Collapse
Key Words
- aav, adeno-associated virus
- ace2, angiotensin-converting enzyme 2
- adv, adenovirus 5
- covid-19, coronavirus disease 2019
- dpi, days post-inoculation
- hace2, human angiotensin-converting enzyme 2
- k18-hace2, keratin 18 humanized angiotensin-converting enzyme 2
- mers-cov, middle east respiratory syndrome coronavirus
- rbd, receptor-binding domain
- sars-cov, severe acute respiratory syndrome coronavirus
- sars-cov-2, severe acute respiratory syndrome coronavirus 2
Collapse
Affiliation(s)
- Jessica C Gomes Noll
- Department of Population Medicine and Diagnostic Sciences, Animal Health Diagnostic Center, College of Veterinary Medicine, Cornell University, Ithaca, New York
| | - Gabriela M do Nascimento
- Department of Population Medicine and Diagnostic Sciences, Animal Health Diagnostic Center, College of Veterinary Medicine, Cornell University, Ithaca, New York
| | - Diego G Diel
- Department of Population Medicine and Diagnostic Sciences, Animal Health Diagnostic Center, College of Veterinary Medicine, Cornell University, Ithaca, New York
| |
Collapse
|
8
|
Postnikova OA, Uppal S, Huang W, Kane MA, Villasmil R, Rogozin IB, Poliakov E, Redmond TM. The Functional Consequences of the Novel Ribosomal Pausing Site in SARS-CoV-2 Spike Glycoprotein RNA. Int J Mol Sci 2021; 22:6490. [PMID: 34204305 PMCID: PMC8235447 DOI: 10.3390/ijms22126490] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 06/08/2021] [Accepted: 06/09/2021] [Indexed: 11/24/2022] Open
Abstract
The SARS-CoV-2 Spike glycoprotein (S protein) acquired a unique new 4 amino acid -PRRA- insertion sequence at amino acid residues (aa) 681-684 that forms a new furin cleavage site in S protein as well as several new glycosylation sites. We studied various statistical properties of the -PRRA- insertion at the RNA level (CCUCGGCGGGCA). The nucleotide composition and codon usage of this sequence are different from the rest of the SARS-CoV-2 genome. One of such features is two tandem CGG codons, although the CGG codon is the rarest codon in the SARS-CoV-2 genome. This suggests that the insertion sequence could cause ribosome pausing as the result of these rare codons. Due to population variants, the Nextstrain divergence measure of the CCU codon is extremely large. We cannot exclude that this divergence might affect host immune responses/effectiveness of SARS-CoV-2 vaccines, possibilities awaiting further investigation. Our experimental studies show that the expression level of original RNA sequence "wildtype" spike protein is much lower than for codon-optimized spike protein in all studied cell lines. Interestingly, the original spike sequence produces a higher titer of pseudoviral particles and a higher level of infection. Further mutagenesis experiments suggest that this dual-effect insert, comprised of a combination of overlapping translation pausing and furin sites, has allowed SARS-CoV-2 to infect its new host (human) more readily. This underlines the importance of ribosome pausing to allow efficient regulation of protein expression and also of cotranslational subdomain folding.
Collapse
Affiliation(s)
- Olga A. Postnikova
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA; (O.A.P.); (S.U.)
| | - Sheetal Uppal
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA; (O.A.P.); (S.U.)
| | - Weiliang Huang
- Department of Pharmaceutical Sciences, School of Pharmacy Mass Spectrometry Center, University of Maryland, Baltimore, MD 21201, USA; (W.H.); (M.A.K.)
| | - Maureen A. Kane
- Department of Pharmaceutical Sciences, School of Pharmacy Mass Spectrometry Center, University of Maryland, Baltimore, MD 21201, USA; (W.H.); (M.A.K.)
| | - Rafael Villasmil
- Flow Cytometry Core Facility, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA;
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugenia Poliakov
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA; (O.A.P.); (S.U.)
| | - T. Michael Redmond
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA; (O.A.P.); (S.U.)
| |
Collapse
|
9
|
Brierley L, Fowler A. Predicting the animal hosts of coronaviruses from compositional biases of spike protein and whole genome sequences through machine learning. PLoS Pathog 2021; 17:e1009149. [PMID: 33878118 PMCID: PMC8087038 DOI: 10.1371/journal.ppat.1009149] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 04/30/2021] [Accepted: 04/09/2021] [Indexed: 12/21/2022] Open
Abstract
The COVID-19 pandemic has demonstrated the serious potential for novel zoonotic coronaviruses to emerge and cause major outbreaks. The immediate animal origin of the causative virus, SARS-CoV-2, remains unknown, a notoriously challenging task for emerging disease investigations. Coevolution with hosts leads to specific evolutionary signatures within viral genomes that can inform likely animal origins. We obtained a set of 650 spike protein and 511 whole genome nucleotide sequences from 222 and 185 viruses belonging to the family Coronaviridae, respectively. We then trained random forest models independently on genome composition biases of spike protein and whole genome sequences, including dinucleotide and codon usage biases in order to predict animal host (of nine possible categories, including human). In hold-one-out cross-validation, predictive accuracy on unseen coronaviruses consistently reached ~73%, indicating evolutionary signal in spike proteins to be just as informative as whole genome sequences. However, different composition biases were informative in each case. Applying optimised random forest models to classify human sequences of MERS-CoV and SARS-CoV revealed evolutionary signatures consistent with their recognised intermediate hosts (camelids, carnivores), while human sequences of SARS-CoV-2 were predicted as having bat hosts (suborder Yinpterochiroptera), supporting bats as the suspected origins of the current pandemic. In addition to phylogeny, variation in genome composition can act as an informative approach to predict emerging virus traits as soon as sequences are available. More widely, this work demonstrates the potential in combining genetic resources with machine learning algorithms to address long-standing challenges in emerging infectious diseases.
Collapse
Affiliation(s)
- Liam Brierley
- Department of Health Data Science, University of Liverpool, Brownlow Street, Liverpool, United Kingdom
| | - Anna Fowler
- Department of Health Data Science, University of Liverpool, Brownlow Street, Liverpool, United Kingdom
| |
Collapse
|
10
|
Seyran M, Hassan SS, Uversky VN, Pal Choudhury P, Uhal BD, Lundstrom K, Attrish D, Rezaei N, Aljabali AAA, Ghosh S, Pizzol D, Adadi P, El-Aziz TMA, Kandimalla R, Tambuwala MM, Lal A, Azad GK, Sherchan SP, Baetas-da-Cruz W, Palù G, Brufsky AM. Urgent Need for Field Surveys of Coronaviruses in Southeast Asia to Understand the SARS-CoV-2 Phylogeny and Risk Assessment for Future Outbreaks. Biomolecules 2021; 11:398. [PMID: 33803118 PMCID: PMC7999587 DOI: 10.3390/biom11030398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Revised: 02/19/2021] [Accepted: 02/20/2021] [Indexed: 02/06/2023] Open
Abstract
Phylogenetic analysis of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is focused on a single isolate of bat coronaviruses (bat CoVs) which does not adequately represent genetically related coronaviruses (CoVs) [...].
Collapse
Affiliation(s)
- Murat Seyran
- Doctoral Studies in Natural and Technical Sciences (SPL 44), University of Vienna, Währinger Straße, A-1090 Vienna, Austria;
| | - Sk. Sarif Hassan
- Department of Mathematics, Pingla Thana Mahavidyalaya, Maligram, Paschim Medinipur 721140, West Bengal, India;
| | - Vladimir N. Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Pabitra Pal Choudhury
- Applied Statistics Unit, Indian Statistical Institute, Kolkata 700108, West Bengal, India;
| | - Bruce D. Uhal
- Department of Physiology, Michigan State University, East Lansing, MI 48824, USA;
| | | | - Diksha Attrish
- Dr. B R Ambedkar Center for Biomedical Research (ACBR), University of Delhi (North Camps), Delhi-110007, India;
| | - Nima Rezaei
- Research Center for Immunodeficiencies, Pediatrics Center of Excellence, Children’s Medical Center, Tehran, University of Medical Sciences, Tehran 1419733151, Iran;
- Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Tehran 1419733151, Iran
| | - Alaa A. A. Aljabali
- Department of Pharmaceutics and Pharmaceutical Technology, Yarmouk University-Faculty of Pharmacy, Irbid 566, Jordan;
| | - Shinjini Ghosh
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata 700009, West Bengal, India;
| | - Damiano Pizzol
- Italian Agency for Development Cooperation—Khartoum, Sudan Street 33, Al Amarat 13374, Sudan;
| | - Parise Adadi
- Department of Food Science, University of Otago, Dunedin 9054, New Zealand;
| | - Tarek Mohamed Abd El-Aziz
- Department of Cellular and Integrative Physiology, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Dr, San Antonio, TX 78229-3900, USA;
- Zoology Department, Faculty of Science, Minia University, El-Minia 61519, Egypt
| | - Ramesh Kandimalla
- CSIR-Indian Institute of Chemical Technology Uppal Road, Tarnaka, Hyderabad 500007, Telangana State, India;
| | - Murtaza M. Tambuwala
- School of Pharmacy and Pharmaceutical Science, Ulster University, Coleraine BT52 1SA, Northern Ireland, UK;
| | - Amos Lal
- Division of Pulmonary and Critical Care Medicine, Mayo Clinic, Rochester, MN 55905, USA;
| | | | - Samendra P. Sherchan
- Department of Environmental Health Sciences, Tulane University, New Orleans, LA 70112, USA;
| | - Wagner Baetas-da-Cruz
- Translational Laboratory in Molecular Physiology, Centre for Experimental Surgery, College of Medicine, Federal University of Rio de Janeiro (UFRJ), Rio de Janeiro 21941901, Brazil;
| | - Giorgio Palù
- Department of Molecular Medicine, University of Padova, Via Gabelli 63, 35121 Padova, Italy
| | - Adam M. Brufsky
- UPMC Hillman Cancer Center, Department of Medicine, Division of Hematology/Oncology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA;
| |
Collapse
|
11
|
MacLean OA, Lytras S, Weaver S, Singer JB, Boni MF, Lemey P, Kosakovsky Pond SL, Robertson DL. Natural selection in the evolution of SARS-CoV-2 in bats created a generalist virus and highly capable human pathogen. PLoS Biol 2021; 19:e3001115. [PMID: 33711012 PMCID: PMC7990310 DOI: 10.1371/journal.pbio.3001115] [Citation(s) in RCA: 126] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 03/24/2021] [Accepted: 01/25/2021] [Indexed: 02/08/2023] Open
Abstract
Virus host shifts are generally associated with novel adaptations to exploit the cells of the new host species optimally. Surprisingly, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has apparently required little to no significant adaptation to humans since the start of the Coronavirus Disease 2019 (COVID-19) pandemic and to October 2020. Here we assess the types of natural selection taking place in Sarbecoviruses in horseshoe bats versus the early SARS-CoV-2 evolution in humans. While there is moderate evidence of diversifying positive selection in SARS-CoV-2 in humans, it is limited to the early phase of the pandemic, and purifying selection is much weaker in SARS-CoV-2 than in related bat Sarbecoviruses. In contrast, our analysis detects evidence for significant positive episodic diversifying selection acting at the base of the bat virus lineage SARS-CoV-2 emerged from, accompanied by an adaptive depletion in CpG composition presumed to be linked to the action of antiviral mechanisms in these ancestral bat hosts. The closest bat virus to SARS-CoV-2, RmYN02 (sharing an ancestor about 1976), is a recombinant with a structure that includes differential CpG content in Spike; clear evidence of coinfection and evolution in bats without involvement of other species. While an undiscovered "facilitating" intermediate species cannot be discounted, collectively, our results support the progenitor of SARS-CoV-2 being capable of efficient human-human transmission as a consequence of its adaptive evolutionary history in bats, not humans, which created a relatively generalist virus.
Collapse
Affiliation(s)
- Oscar A. MacLean
- MRC-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
| | - Spyros Lytras
- MRC-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
| | - Steven Weaver
- Temple University, Institute for Genomics and Evolutionary Medicine, Philadelphia, Pennsylvania, United States of America
| | - Joshua B. Singer
- MRC-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
| | - Maciej F. Boni
- Center for Infectious Disease Dynamics, Department of Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Sergei L. Kosakovsky Pond
- Temple University, Institute for Genomics and Evolutionary Medicine, Philadelphia, Pennsylvania, United States of America
| | - David L. Robertson
- MRC-University of Glasgow Centre for Virus Research, Glasgow, United Kingdom
| |
Collapse
|
12
|
Subedi S, Koirala S, Chai L. COVID-19 in Farm Animals: Host Susceptibility and Prevention Strategies. Animals (Basel) 2021; 11:640. [PMID: 33670889 PMCID: PMC7997237 DOI: 10.3390/ani11030640] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 02/18/2021] [Accepted: 02/24/2021] [Indexed: 12/11/2022] Open
Abstract
COVID-19 is caused by the virus SARS-CoV-2 that belongings to the family of Coronaviridae, which has affected multiple species and demonstrated zoonotic potential. The COVID-19 infections have been reported on farm animals (e.g., minks) and pets, which were discussed and summarized in this study. Although the damage of COVID-19 has not been reported as serious as highly pathogenic avian influenza (HPAI) for poultry and African Swine Fever (ASF) for pigs on commercial farms so far, the transmission mechanism of COVID-19 among group animals/farms and its long-term impacts are still not clear. Prior to the marketing of efficient vaccines for livestock and animals, on-farm biosecurity measures (e.g., conventional disinfection strategies and innovated technologies) need to be considered or innovated in preventing the direct contact spread or the airborne transmission of COVID-19.
Collapse
Affiliation(s)
- Sachin Subedi
- Faculty of Animal Science, Veterinary Science and Fisheries, Agriculture & Forestry University, Chitwan 44200, Nepal; (S.S.); (S.K.)
| | - Sulove Koirala
- Faculty of Animal Science, Veterinary Science and Fisheries, Agriculture & Forestry University, Chitwan 44200, Nepal; (S.S.); (S.K.)
| | - Lilong Chai
- Department of Poultry Science, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
13
|
The Long-Term Evolutionary History of Gradual Reduction of CpG Dinucleotides in the SARS-CoV-2 Lineage. BIOLOGY 2021; 10:biology10010052. [PMID: 33445785 PMCID: PMC7828247 DOI: 10.3390/biology10010052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 12/29/2020] [Accepted: 01/09/2021] [Indexed: 12/24/2022]
Abstract
Simple Summary Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused the coronavirus disease 2019 (COVID-19), a pandemic that infected over 81 million people worldwide. This has led the scientific community to characterize the genome of this virus, including its nucleotide composition. Investigation of the dinucleotide frequency revealed that the proportion of CG dinucleotides (CpG) is highly reduced in the viral genomes. Since CpG dinucleotides is the target site for the host antiviral zinc finger protein, it has been suggested that the reduction in the proportion of CpG is the viral response to escape from the host defense machinery. In the present study, we investigated the time of origin of reduction in the CpG content. Whole genome analyses based on all representative viral genomes of the group Betacoronavirus revealed that the CpG content in the lineage of SARS-CoV-2 has been progressively declining over the past 1213 years. The depletion of CpG was found to occur at neutral—as well as selectively constrained—positions of the viral genomes. Abstract Recent studies suggested that the fraction of CG dinucleotides (CpG) is severely reduced in the genome of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The CpG deficiency was predicted to be the adaptive response of the virus to evade degradation of the viral RNA by the antiviral zinc finger protein that specifically binds to CpG nucleotides. By comparing all representative genomes belonging to the genus Betacoronavirus, this study examined the potential time of origin of CpG depletion. The results of this investigation revealed a highly significant correlation between the proportions of CpG nucleotide (CpG content) of the betacoronavirus species and their times of divergence from SARS-CoV-2. Species that are distantly related to SARS-CoV-2 had much higher CpG contents than that of SARS-CoV-2. Conversely, closely related species had low CpG contents that are similar to or slightly higher than that of SARS-CoV-2. These results suggest a systematic and continuous reduction in the CpG content in the SARS-CoV-2 lineage that might have started since the Sarbecovirus + Hibecovirus clade separated from Nobecovirus, which was estimated to be 1213 years ago. This depletion was not found to be mediated by the GC contents of the genomes. Our results also showed that the depletion of CpG occurred at neutral positions of the genome as well as those under selection. The latter is evident from the progressive reduction in the proportion of arginine amino acid (coded by CpG dinucleotides) in the SARS-CoV-2 lineage over time. The results of this study suggest that shedding CpG nucleotides from their genome is a continuing process in this viral lineage, potentially to escape from their host defense mechanisms.
Collapse
|
14
|
Abstract
Identifying the origin of SARS-CoV-2, the etiological agent of the current COVID-19 pandemic, may help us to avoid future epidemics of coronavirus and other zoonoses. Several theories about the zoonotic origin of SARS-CoV-2 have recently been proposed. While Betacoronavirus found in Rhinolophus bats from China have been broadly implicated, their genetic dissimilarity to SARS-CoV-2 is so high that they are highly unlikely to be its direct ancestors. Thus, an intermediary host is suspected to link bat to human coronaviruses. Based on genomic CpG dinucleotide patterns in different coronaviruses from different hosts, it was suggested that SARS-CoV-2 might have evolved in a canid gastro-intestinal tract prior to transmission to humans. However, similar CpG patterns are now reported in coronaviruses from other hosts, including bats themselves and pangolins. Therefore, reduced genomic CpG alone is not a highly predictive biomarker, suggesting a need for additional biomarkers to reveal intermediate hosts or tissues. The hunt for the zoonotic origin of SARS-CoV-2 continues.
Collapse
Affiliation(s)
- Thomas Leitner
- Theroretical Biology and Biophysics group, Los Alamos National Laboratory, Los Alamos, NM
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA.,Center for Excellence in Genome Medicine and Research, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|