1
|
Forna A, Weedop KB, Damodaran L, Hassell N, Kondor R, Bahl J, Drake JM, Rohani P. Sequence-based detection of emerging antigenically novel influenza A viruses. Proc Biol Sci 2024; 291:20240790. [PMID: 39140324 PMCID: PMC11323087 DOI: 10.1098/rspb.2024.0790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 05/21/2024] [Accepted: 07/11/2024] [Indexed: 08/15/2024] Open
Abstract
The detection of evolutionary transitions in influenza A (H3N2) viruses' antigenicity is a major obstacle to effective vaccine design and development. In this study, we describe Novel Influenza Virus A Detector (NIAViD), an unsupervised machine learning tool, adept at identifying these transitions, using the HA1 sequence and associated physico-chemical properties. NIAViD performed with 88.9% (95% CI, 56.5-98.0%) and 72.7% (95% CI, 43.4-90.3%) sensitivity in training and validation, respectively, outperforming the uncalibrated null model-33.3% (95% CI, 12.1-64.6%) and does not require potentially biased, time-consuming and costly laboratory assays. The pivotal role of the Boman's index, indicative of the virus's cell surface binding potential, is underscored, enhancing the precision of detecting antigenic transitions. NIAViD's efficacy is not only in identifying influenza isolates that belong to novel antigenic clusters, but also in pinpointing potential sites driving significant antigenic changes, without the reliance on explicit modelling of haemagglutinin inhibition titres. We believe this approach holds promise to augment existing surveillance networks, offering timely insights for the development of updated, effective influenza vaccines. Consequently, NIAViD, in conjunction with other resources, could be used to support surveillance efforts and inform the development of updated influenza vaccines.
Collapse
Affiliation(s)
- Alpha Forna
- Odum School of Ecology, University of Georgia, Athens, GA30602, USA
- Center for the Ecology of Infectious Diseases, University of Georgia, Athens, GA30602, USA
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA30606, USA
| | - K. Bodie Weedop
- Odum School of Ecology, University of Georgia, Athens, GA30602, USA
| | - Lambodhar Damodaran
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA30606, USA
| | - Norman Hassell
- Centers for Disease Control and Prevention, Atlanta, GA30329, USA
| | - Rebecca Kondor
- Centers for Disease Control and Prevention, Atlanta, GA30329, USA
| | - Justin Bahl
- Center for the Ecology of Infectious Diseases, University of Georgia, Athens, GA30602, USA
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA30606, USA
| | - John M. Drake
- Odum School of Ecology, University of Georgia, Athens, GA30602, USA
- Center for the Ecology of Infectious Diseases, University of Georgia, Athens, GA30602, USA
- Center for Influenza Disease & Emergence Research (CIDER), Athens, GA30602, USA
| | - Pejman Rohani
- Odum School of Ecology, University of Georgia, Athens, GA30602, USA
- Center for the Ecology of Infectious Diseases, University of Georgia, Athens, GA30602, USA
- Center for Influenza Disease & Emergence Research (CIDER), Athens, GA30602, USA
- Department of Infectious Diseases, University of Georgia, Athens, GA30602, USA
| |
Collapse
|
2
|
Jia Q, Xia Y, Dong F, Li W. MetaFluAD: meta-learning for predicting antigenic distances among influenza viruses. Brief Bioinform 2024; 25:bbae395. [PMID: 39129362 PMCID: PMC11317534 DOI: 10.1093/bib/bbae395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Revised: 06/24/2024] [Accepted: 07/27/2024] [Indexed: 08/13/2024] Open
Abstract
Influenza viruses rapidly evolve to evade previously acquired human immunity. Maintaining vaccine efficacy necessitates continuous monitoring of antigenic differences among strains. Traditional serological methods for assessing these differences are labor-intensive and time-consuming, highlighting the need for efficient computational approaches. This paper proposes MetaFluAD, a meta-learning-based method designed to predict quantitative antigenic distances among strains. This method models antigenic relationships between strains, represented by their hemagglutinin (HA) sequences, as a weighted attributed network. Employing a graph neural network (GNN)-based encoder combined with a robust meta-learning framework, MetaFluAD learns comprehensive strain representations within a unified space encompassing both antigenic and genetic features. Furthermore, the meta-learning framework enables knowledge transfer across different influenza subtypes, allowing MetaFluAD to achieve remarkable performance with limited data. MetaFluAD demonstrates excellent performance and overall robustness across various influenza subtypes, including A/H3N2, A/H1N1, A/H5N1, B/Victoria, and B/Yamagata. MetaFluAD synthesizes the strengths of GNN-based encoding and meta-learning to offer a promising approach for accurate antigenic distance prediction. Additionally, MetaFluAD can effectively identify dominant antigenic clusters within seasonal influenza viruses, aiding in the development of effective vaccines and efficient monitoring of viral evolution.
Collapse
Affiliation(s)
- Qitao Jia
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| | - Yuanling Xia
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650500, China
| | - Fanglin Dong
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| | - Weihua Li
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| |
Collapse
|
3
|
Shah SAW, Palomar DP, Barr I, Poon LLM, Quadeer AA, McKay MR. Seasonal antigenic prediction of influenza A H3N2 using machine learning. Nat Commun 2024; 15:3833. [PMID: 38714654 PMCID: PMC11076571 DOI: 10.1038/s41467-024-47862-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 04/10/2024] [Indexed: 05/10/2024] Open
Abstract
Antigenic characterization of circulating influenza A virus (IAV) isolates is routinely assessed by using the hemagglutination inhibition (HI) assays for surveillance purposes. It is also used to determine the need for annual influenza vaccine updates as well as for pandemic preparedness. Performing antigenic characterization of IAV on a global scale is confronted with high costs, animal availability, and other practical challenges. Here we present a machine learning model that accurately predicts (normalized) outputs of HI assays involving circulating human IAV H3N2 viruses, using their hemagglutinin subunit 1 (HA1) sequences and associated metadata. Each season, the model learns an updated nonlinear mapping of genetic to antigenic changes using data from past seasons only. The model accurately distinguishes antigenic variants from non-variants and adaptively characterizes seasonal dynamics of HA1 sites having the strongest influence on antigenic change. Antigenic predictions produced by the model can aid influenza surveillance, public health management, and vaccine strain selection activities.
Collapse
Affiliation(s)
- Syed Awais W Shah
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
| | - Daniel P Palomar
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
- Department of Industrial Engineering & Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
| | - Ian Barr
- WHO Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria, Australia
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
| | - Leo L M Poon
- School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- Centre for Immunology & Infection, Hong Kong SAR, China
| | - Ahmed Abdul Quadeer
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China.
- Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria, Australia.
| | - Matthew R McKay
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia.
- Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria, Australia.
| |
Collapse
|
4
|
Meng J, Liu J, Song W, Li H, Wang J, Zhang L, Peng Y, Wu A, Jiang T. PREDAC-CNN: predicting antigenic clusters of seasonal influenza A viruses with convolutional neural network. Brief Bioinform 2024; 25:bbae033. [PMID: 38343322 PMCID: PMC10859661 DOI: 10.1093/bib/bbae033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Revised: 01/13/2024] [Accepted: 01/18/2024] [Indexed: 02/15/2024] Open
Abstract
Vaccination stands as the most effective and economical strategy for prevention and control of influenza. The primary target of neutralizing antibodies is the surface antigen hemagglutinin (HA). However, ongoing mutations in the HA sequence result in antigenic drift. The success of a vaccine is contingent on its antigenic congruence with circulating strains. Thus, predicting antigenic variants and deducing antigenic clusters of influenza viruses are pivotal for recommendation of vaccine strains. The antigenicity of influenza A viruses is determined by the interplay of amino acids in the HA1 sequence. In this study, we exploit the ability of convolutional neural networks (CNNs) to extract spatial feature representations in the convolutional layers, which can discern interactions between amino acid sites. We introduce PREDAC-CNN, a model designed to track antigenic evolution of seasonal influenza A viruses. Accessible at http://predac-cnn.cloudna.cn, PREDAC-CNN formulates a spatially oriented representation of the HA1 sequence, optimized for the convolutional framework. It effectively probes interactions among amino acid sites in the HA1 sequence. Also, PREDAC-CNN focuses exclusively on physicochemical attributes crucial for the antigenicity of influenza viruses, thereby eliminating unnecessary amino acid embeddings. Together, PREDAC-CNN is adept at capturing interactions of amino acid sites within the HA1 sequence and examining the collective impact of point mutations on antigenic variation. Through 5-fold cross-validation and retrospective testing, PREDAC-CNN has shown superior performance in predicting antigenic variants compared to its counterparts. Additionally, PREDAC-CNN has been instrumental in identifying predominant antigenic clusters for A/H3N2 (1968-2023) and A/H1N1 (1977-2023) viruses, significantly aiding in vaccine strain recommendation.
Collapse
Affiliation(s)
- Jing Meng
- State Key Laboratory of Common Mechanism Research for Major Diseases, Suzhou Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou 215123, Jiangsu, China
| | - Jingze Liu
- State Key Laboratory of Common Mechanism Research for Major Diseases, Suzhou Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou 215123, Jiangsu, China
| | - Wenkai Song
- College of Computer Science, Sichuan University, Chengdu 610065, China
| | - Honglei Li
- Beijing Cloudna Technology Company, Limited, Beijing 100029, China
| | | | - Le Zhang
- College of Computer Science, Sichuan University, Chengdu 610065, China
| | - Yousong Peng
- College of Biology, Hunan University, Changsha 410082, China
| | - Aiping Wu
- State Key Laboratory of Common Mechanism Research for Major Diseases, Suzhou Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou 215123, Jiangsu, China
| | - Taijiao Jiang
- State Key Laboratory of Common Mechanism Research for Major Diseases, Suzhou Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou 215123, Jiangsu, China
- Guangzhou National Laboratory, Guangzhou 510005, China
- State Key Laboratory of Respiratory Disease, the First Affiliated Hospital of Guangzhou Medical University, Guangzhou Medical University, Guangzhou, 510120, China
| |
Collapse
|
5
|
Liu M, Liu J, Song W, Peng Y, Ding X, Deng L, Jiang T. Development of PREDAC-H1pdm to model the antigenic evolution of influenza A/(H1N1) pdm09 viruses. Virol Sin 2023; 38:541-548. [PMID: 37211247 PMCID: PMC10436056 DOI: 10.1016/j.virs.2023.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 05/17/2023] [Indexed: 05/23/2023] Open
Abstract
The Influenza A (H1N1) pdm09 virus caused a global pandemic in 2009 and has circulated seasonally ever since. As the continual genetic evolution of hemagglutinin in this virus leads to antigenic drift, rapid identification of antigenic variants and characterization of the antigenic evolution are needed. In this study, we developed PREDAC-H1pdm, a model to predict antigenic relationships between H1N1pdm viruses and identify antigenic clusters for post-2009 pandemic H1N1 strains. Our model performed well in predicting antigenic variants, which was helpful in influenza surveillance. By mapping the antigenic clusters for H1N1pdm, we found that substitutions on the Sa epitope were common for H1N1pdm, whereas for the former seasonal H1N1, substitutions on the Sb epitope were more common in antigenic evolution. Additionally, the localized epidemic pattern of H1N1pdm was more obvious than that of the former seasonal H1N1, which could make vaccine recommendation more sophisticated. Overall, the antigenic relationship prediction model we developed provides a rapid determination method for identifying antigenic variants, and the further analysis of evolutionary and epidemic characteristics can facilitate vaccine recommendations and influenza surveillance for H1N1pdm.
Collapse
Affiliation(s)
- Mi Liu
- Jiangsu Institute of Clinical Immunology, The First Affiliated Hospital of Soochow University, Suzhou, 215006, China
| | - Jingze Liu
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China; Suzhou Institute of Systems Medicine, Suzhou, 215123, China
| | - Wenjun Song
- Guangzhou Laboratory, Guangzhou, 510005, China
| | - Yousong Peng
- Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha, 410082, China
| | - Xiao Ding
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China; Suzhou Institute of Systems Medicine, Suzhou, 215123, China
| | - Lizong Deng
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China; Suzhou Institute of Systems Medicine, Suzhou, 215123, China
| | - Taijiao Jiang
- Suzhou Institute of Systems Medicine, Suzhou, 215123, China; Guangzhou Laboratory, Guangzhou, 510005, China; State Key Laboratory of Respiratory Disease, the First Affiliated Hospital of Guangzhou Medical University, Guangzhou Medical University, Guangzhou, 510120, China.
| |
Collapse
|
6
|
Peng F, Xia Y, Li W. Prediction of Antigenic Distance in Influenza A Using Attribute Network Embedding. Viruses 2023; 15:1478. [PMID: 37515165 PMCID: PMC10385503 DOI: 10.3390/v15071478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 06/23/2023] [Accepted: 06/28/2023] [Indexed: 07/30/2023] Open
Abstract
Owing to the rapid changes in the antigenicity of influenza viruses, it is difficult for humans to obtain lasting immunity through antiviral therapy. Hence, tracking the dynamic changes in the antigenicity of influenza viruses can provide a basis for vaccines and drug treatments to cope with the spread of influenza viruses. In this paper, we developed a novel quantitative prediction method to predict the antigenic distance between virus strains using attribute network embedding techniques. An antigenic network is built to model and combine the genetic and antigenic characteristics of the influenza A virus H3N2, using the continuous distributed representation of the virus strain protein sequence (ProtVec) as a node attribute and the antigenic distance between virus strains as an edge weight. The results show a strong positive correlation between supplementing genetic features and antigenic distance prediction accuracy. Further analysis indicates that our prediction model can comprehensively and accurately track the differences in antigenic distances between vaccines and influenza virus strains, and it outperforms existing methods in predicting antigenic distances between strains.
Collapse
Affiliation(s)
- Fujun Peng
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| | - Yuanling Xia
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650500, China
| | - Weihua Li
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| |
Collapse
|
7
|
Makau DN, Prieto C, Martínez-Lobo FJ, Paploski IAD, VanderWaal K. Predicting Antigenic Distance from Genetic Data for PRRSV-Type 1: Applications of Machine Learning. Microbiol Spectr 2023; 11:e0408522. [PMID: 36511691 PMCID: PMC9927307 DOI: 10.1128/spectrum.04085-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 11/18/2022] [Indexed: 12/15/2022] Open
Abstract
The control of porcine reproductive and respiratory syndrome (PRRS) remains a significant challenge due to the genetic and antigenic variability of the causative virus (PRRSV). Predominantly, PRRSV management includes using vaccines and live virus inoculations to confer immunity against PRRSV on farms. While understanding cross-protection among strains is crucial for the continued success of these interventions, understanding how genetic diversity translates to antigenic diversity remains elusive. We developed machine learning algorithms to estimate antigenic distance in silico, based on genetic sequence data, and identify differences in specific amino acid sites associated with antigenic differences between viruses. First, we obtained antigenic distance estimates derived from serum neutralization assays cross-reacting PRRSV monospecific antisera with virus isolates from 27 PRRSV1 viruses circulating in Europe. Antigenic distances were weakly to moderately associated with ectodomain amino acid distance for open reading frames (ORFs) 2 to 4 (ρ < 0.2) and ORF5 (ρ = 0.3), respectively. Dividing the antigenic distance values at the median, we then categorized the sera-virus pairs into two levels: low and high antigenic distance (dissimilarity). In the machine learning models, we used amino acid distances in the ectodomains of ORFs 2 to 5 and site-wise amino acid differences between the viruses as potential predictors of antigenic dissimilarity. Using mixed-effect gradient boosting models, we estimated the antigenic distance (high versus low) between serum-virus pairs with an accuracy of 81% (95% confidence interval, 76 to 85%); sensitivity and specificity were 86% and 75%, respectively. We demonstrate that using sequence data we can estimate antigenic distance and potential cross-protection between PRRSV1 strains. IMPORTANCE Understanding cross-protection between cocirculating PRRSV1 strains is crucial to reducing losses associated with PRRS outbreaks on farms. While experimental studies to determine cross-protection are instrumental, these in vivo studies are not always practical or timely for the many cocirculating and emerging PRRSV strains. In this study, we demonstrate the ability to rapidly estimate potential immunologic cross-reaction between different PRRSV1 strains in silico using sequence data routinely collected by production systems. These models can provide fast turn-around information crucial for improving PRRS management decisions such as selecting vaccines/live virus inoculation to be used on farms and assessing the risk of outbreaks by emerging strains on farms previously exposed to certain PRRSV strains and vaccine development among others.
Collapse
Affiliation(s)
- Dennis N. Makau
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| | - Cinta Prieto
- Departamento de Sanidad Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, Madrid, Spain
| | | | - I. A. D. Paploski
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| | - Kimberly VanderWaal
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| |
Collapse
|
8
|
Bhardwaj A, Sharma R, Grover A. Immuno-informatics guided designing of a multi-epitope vaccine against Dengue and Zika. J Biomol Struct Dyn 2023; 41:1-15. [PMID: 34796791 DOI: 10.1080/07391102.2021.2002720] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Dengue and zika are amongst the most prevalent mosquito-borne diseases caused by closely related members Dengue virus (DENV) and Zika virus (ZIKV), respectively, of the Flaviviridae family. DENV and ZIKV have been reported to co-infect several people, resulting in fatalities across the world. A vaccine that can safeguard against both these pathogens concurrently, can offer several advantages. This study has employed immuno-informatics for devising a multi-epitope, multi-pathogenic vaccine against both these viruses. Since, the two viruses share a common vector source, whose salivary components are reported to aid viral pathogenesis; antigenic salivary proteins from Aedes aegypti were also incorporated into the design of the vaccine along with conserved structural and non-structural viral proteins. Conserved B- and T-cell epitopes were identified for all the selected antigenic proteins. These epitopes were merged and further supplemented with β-defensin as an adjuvant, to yield an immunogenic vaccine construct. In-silico 3D modeling and structural validation of the vaccine construct was conducted, followed by its molecular docking and molecular dynamics simulation studies with human TLR2. Immune simulation study was also performed, and it further provided support that the designed vaccine can mount an effective immune response and hence provide protection against both DENV and ZIKV. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Aditi Bhardwaj
- School of Biosciences & Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Ritika Sharma
- School of Biotechnology, Jawaharlal Nehru University (JNU), Delhi, India
| | - Abhinav Grover
- School of Biotechnology, Jawaharlal Nehru University (JNU), Delhi, India
| |
Collapse
|
9
|
Abbas ME, Chengzhang Z, Fathalla A, Xiao Y. End-to-end antigenic variant generation for H1N1 influenza HA protein using sequence to sequence models. PLoS One 2022; 17:e0266198. [PMID: 35344562 PMCID: PMC8959165 DOI: 10.1371/journal.pone.0266198] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 03/16/2022] [Indexed: 11/23/2022] Open
Abstract
The growing risk of new variants of the influenza A virus is the most significant to public health. The risk imposed from new variants may have been lethal, as witnessed in the year 2009. Even though the improvement in predicting antigenicity of influenza viruses has rapidly progressed, few studies employed deep learning methodologies. The most recent literature mostly relied on classification techniques, while a model that generates the HA protein of the antigenic variant is not developed. However, the antigenic pair of influenza virus A can be determined in a laboratory setup, the process needs a tremendous amount of time and labor. Antigenic shift and drift which are caused by changes in surface protein favored the influenza A virus in evading immunity. The high frequency of the minor changes in the surface protein poses a challenge to identifying the antigenic variant of an emerging virus. These changes slow down vaccine selection and the manufacturing process. In this vein, the proposed model could help save the time and efforts exerted to identify the antigenic pair of the influenza virus. The proposed model utilized an end-to-end learning methodology relying on deep sequence-to-sequence architecture to generate the antigenic variant of a given influenza A virus using surface protein. Employing the BLEU score to evaluate the generated HA protein of the antigenic variant of influenza virus A against the actual variant, the proposed model achieved a mean accuracy of 97.57%.
Collapse
Affiliation(s)
- Mohamed Elsayed Abbas
- School of Computer Science and Engineering, Central South University, Changsha, China
- Mobile Health Ministry of Education-China Mobile Joint Laboratory, Changsha, China
| | - Zhu Chengzhang
- School of Computer Science and Engineering, Central South University, Changsha, China
- The College of Literature and Journalism, Central South University, Changsha, China
- Mobile Health Ministry of Education-China Mobile Joint Laboratory, Changsha, China
| | - Ahmed Fathalla
- Department of Mathematics, Faculty of Science,Suez Canal University, Ismailia, Egypt
| | - Yalong Xiao
- School of Computer Science and Engineering, Central South University, Changsha, China
- The College of Literature and Journalism, Central South University, Changsha, China
| |
Collapse
|
10
|
Wang H, Zang Y, Zhao Y, Hao D, Kang Y, Zhang J, Zhang Z, Zhang L, Yang Z, Zhang S. Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning. Viruses 2022; 14:v14030469. [PMID: 35336876 PMCID: PMC8950662 DOI: 10.3390/v14030469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 02/15/2022] [Accepted: 02/17/2022] [Indexed: 01/27/2023] Open
Abstract
To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we applied principal component analysis and hierarchical clustering analysis on thousands of HA and NA sequences of A/H1N1 and A/H3N2. We discovered significant coevolution between HA and NA at the sequence level, which is closely related to the type of host species and virus epidemic years. Furthermore, we propose a sequence-to-sequence transformer model (S2STM), which mainly consists of an encoder and a decoder that adopts a multi-head attention mechanism for establishing the mapping relationship between HA and NA sequences. The training results reveal that the S2STM can effectively realize the “translation” from HA to NA or vice versa, thereby building a relationship network between them. Our work combines unsupervised and supervised machine learning methods to identify the sequence matching between HA and NA, which will advance our understanding of IAVs’ evolution and also provide a novel idea for sequence analysis methods.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Zhiwei Yang
- Correspondence: (Z.Y.); (S.Z.); Tel.: +86-029-8266-8634 (Z.Y.); +86-029-8266-0915 (S.Z.)
| | - Shengli Zhang
- Correspondence: (Z.Y.); (S.Z.); Tel.: +86-029-8266-8634 (Z.Y.); +86-029-8266-0915 (S.Z.)
| |
Collapse
|
11
|
A statistical analysis of antigenic similarity among influenza A (H3N2) viruses. Heliyon 2021; 7:e08384. [PMID: 34825090 PMCID: PMC8605065 DOI: 10.1016/j.heliyon.2021.e08384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 04/21/2021] [Accepted: 11/10/2021] [Indexed: 11/20/2022] Open
Abstract
An accurate assessment of antigenic similarity between influenza viruses is important for vaccine strain recommendations and influenza surveillance. Due to the mechanisms that result in frequent changes in the antigenicities of strains, it is desirable to obtain an antigenic similarity measure that accounts for specific changes in strains that are of epidemiological importance in influenza. Empirically grounded statistical models best achieve this. In this study, an interpretable machine-learning model was developed using distinguishing features of antigenic variants to analyze antigenic similarity. The features comprised of cluster information, amino acid sequences located in known antigenic and receptor-binding sites of influenza A (H3N2). In order to assess validity of parameters, accuracy and relevance of model to vaccine effectiveness, the model was applied to influenza A (H3N2) viruses due to their abundant genetic data and epidemiological relevance to influenza surveillance. An application of the model revealed that all model parameters were statistically significant to determining antigenic similarity between strains. Furthermore, upon evaluating the model for predicting antigenic similarity between strains, it achieved 95% area under Receiver Operating Characteristic curve (AUC), 94% accuracy, 76% precision, 97% specificity, 68% sensitivity and a diagnostic odds ratio (DOR) of 83.19. Above all, the model was found to be strongly related to influenza vaccine effectiveness to indicate the correlation between vaccine effectiveness and antigenic similarity between vaccine and circulating strains in an epidemic. The study predicts probabilities of antigenic similarity and estimates changes in strains that lead to antigenic variants. A successful application of the methods presented in this study would complement the global efforts in influenza surveillance.
Collapse
|
12
|
Gill HK, Sehgal VK, Verma AK. A deep neural network based context-aware smart epidemic surveillance in smart cities. LIBRARY HI TECH 2021. [DOI: 10.1108/lht-02-2021-0063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
Epidemics not only affect the public health but also are a threat to a nation's growth and economy as well. Early prediction of epidemic can be beneficial to take preventive measures and to reduce the impact of epidemic in an area.
Design/methodology/approach
A deep neural network (DNN) based context aware smart epidemic system has been proposed to prevent and monitor epidemic spread in a geographical area. Various neural networks (NNs) have been used: LSTM, RNN, BPNN to detect the level of disease, direction of spread of disease in a geographical area and marking the high-risk areas. Multiple DNNs collect and process various data points and these DNNs are decided based on type of data points. Output of one DNN is used by another DNN to reach to final prediction.
Findings
The experimental evaluation of the proposed framework achieved the accuracy of 87% for the synthetic dataset generated for Zika epidemic in Brazil in 2016.
Originality/value
The proposed framework is designed in a way that every data point is carefully processed and contributes to the final decision. These multiple DNNs will act as a single DNN for the end user.
Collapse
|
13
|
Tarasova O, Poroikov V. Machine Learning in Discovery of New Antivirals and Optimization of Viral Infections Therapy. Curr Med Chem 2021; 28:7840-7861. [PMID: 33949929 DOI: 10.2174/0929867328666210504114351] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 02/13/2021] [Accepted: 02/24/2021] [Indexed: 11/22/2022]
Abstract
Nowadays, computational approaches play an important role in the design of new drug-like compounds and optimization of pharmacotherapeutic treatment of diseases. The emerging growth of viral infections, including those caused by the Human Immunodeficiency Virus (HIV), Ebola virus, recently detected coronavirus, and some others, leads to many newly infected people with a high risk of death or severe complications. A huge amount of chemical, biological, clinical data is at the disposal of the researchers. Therefore, there are many opportunities to find the relationships between the particular features of chemical data and the antiviral activity of biologically active compounds based on machine learning approaches. Biological and clinical data can also be used for building models to predict relationships between viral genotype and drug resistance, which might help determine the clinical outcome of treatment. In the current study, we consider machine-learning approaches in the antiviral research carried out during the past decade. We overview in detail the application of machine-learning methods for the design of new potential antiviral agents and vaccines, drug resistance prediction, and analysis of virus-host interactions. Our review also covers the perspectives of using the machine-learning approaches for antiviral research, including Dengue, Ebola viruses, Influenza A, Human Immunodeficiency Virus, coronaviruses, and some others.
Collapse
Affiliation(s)
- Olga Tarasova
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow. Russian Federation
| | - Vladimir Poroikov
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow. Russian Federation
| |
Collapse
|