1
|
Choi WJ, Park J, Seong DY, Chung DS, Hong D. A prediction of mutations in infectious viruses using artificial intelligence. Genomics Inform 2024; 22:15. [PMID: 39380083 PMCID: PMC11463117 DOI: 10.1186/s44342-024-00019-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2024] [Accepted: 09/18/2024] [Indexed: 10/10/2024] Open
Abstract
Many subtypes of SARS-CoV-2 have emerged since its early stages, with mutations showing regional and racial differences. These mutations significantly affected the infectivity and severity of the virus. This study aimed to predict the mutations that occur during the evolution of SARS-CoV-2 and identify the key characteristics for making these predictions. We collected and organized data on the lineage, date, clade, and mutations of SARS-CoV-2 from publicly available databases and processed them to predict the mutations. In addition, we utilized various artificial intelligence models to predict newly emerging mutations and created various training sets based on clade information. Using only mutation information resulted in low performance of the learning models, whereas incorporating clade differentiation resulted in high performance in machine learning models, including XGBoost (accuracy: 0.999). However, mutations fixed in the receptor-binding motif (RBM) region of Omicron resulted in decreased predictive performance. Using these models, we predicted potential mutation positions for 24C, following the recently emerged 24A and 24B clades. We identified a mutation at position Q493 in the RBM region. Our study developed effective artificial intelligence models and characteristics for predicting new mutations in continuously evolving infectious viruses.
Collapse
Affiliation(s)
- Won Jong Choi
- Department of Precision Medicine and Big Data, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea
- Department of Medical Informatics, The Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Jongkeun Park
- Department of Medical Informatics, The Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Do Young Seong
- Department of Precision Medicine and Big Data, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea
- Department of Medical Informatics, The Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Dae Sun Chung
- Department of Medical Informatics, The Catholic University of Korea, Seoul, 06591, Republic of Korea
- Department of Medical Sciences, Graduate Schoolof, College of Medicine , The Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Dongwan Hong
- Department of Precision Medicine and Big Data, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea.
- Department of Medical Informatics, The Catholic University of Korea, Seoul, 06591, Republic of Korea.
- Department of Medical Sciences, Graduate Schoolof, College of Medicine , The Catholic University of Korea, Seoul, 06591, Republic of Korea.
- Precision Medicine Research Center, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea.
- Cancer Evolution Research Center, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea.
- College of Medicine, CMC Institute for Basic Medical Science, The Catholic University of Korea, Seoul, 06591, Republic of Korea.
| |
Collapse
|
2
|
Sultana N, Nagesha SN, Reddy CNL, Ramesh BN, Shyamalamma S, Shashidhara KS, Satish KM, Pradeep C, Vidyadhar GD. Computational analysis of affinity dynamics between the variants of SARS-CoV-2 spike protein (RBD) and human ACE-2 receptor. Virol J 2024; 21:88. [PMID: 38641844 PMCID: PMC11031966 DOI: 10.1186/s12985-024-02365-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 04/13/2024] [Indexed: 04/21/2024] Open
Abstract
The novel coronavirus SARS-CoV-2 resulted in a significant worldwide health emergency known as the COVID-19 pandemic. This crisis has been marked by the widespread of various variants, with certain ones causing notable apprehension. In this study, we harnessed computational techniques to scrutinize these Variants of Concern (VOCs), including various Omicron subvariants. Our approach involved the use of protein structure prediction algorithms and molecular docking techniques, we have investigated the effects of mutations within the Receptor Binding Domain (RBD) of SARS-CoV-2 and how these mutations influence its interactions with the human angiotensin-converting enzyme 2 (hACE-2) receptor. Further we have predicted the structural alterations in the RBD of naturally occurring SARS-CoV-2 variants using the tr-Rosetta algorithm. Subsequent docking and binding analysis employing HADDOCK and PRODIGY illuminated crucial interactions occurring at the Receptor-Binding Motif (RBM). Our findings revealed a hierarchy of increased binding affinity between the human ACE2 receptor and the various RBDs, in the order of wild type (Wuhan-strain) < Beta < Alpha < Gamma < Omicron-B.1.1.529 < Delta < Omicron-BA.2.12.1 < Omicron-BA.5.2.1 < Omicron-BA.1.1. Notably, Omicron-BA.1.1 demonstrated the highest binding affinity of -17.4 kcal mol-1 to the hACE2 receptor when compared to all the mutant complexes. Additionally, our examination indicated that mutations occurring in active residues of the Receptor Binding Domain (RBD) consistently improved the binding affinity and intermolecular interactions in all mutant complexes. Analysis of the differences among variants has laid a foundation for the structure-based drug design targeting the RBD region of SARS-CoV-2.
Collapse
Affiliation(s)
- Nishad Sultana
- Department of Plant Biotechnology, University of Agricultural Sciences, GKVK, Bangalore, 560 065, India
| | - S N Nagesha
- Department of Plant Biotechnology, College of Agriculture, Hassan, UAS, Bangalore, 573 225, India.
| | | | - B N Ramesh
- ICAR-PHT, UAS, GKVK, Bangalore, 560 065, India
| | - S Shyamalamma
- Department of Plant Biotechnology, University of Agricultural Sciences, GKVK, Bangalore, 560 065, India
| | - K S Shashidhara
- Department of Genetics and Plant Breeding, College of Agriculture, Hassan, UAS, Bangalore, 573 225, India
| | - K M Satish
- Department Biotechnology, KSNUAHS, Shivamogga, 577 412, India
| | - C Pradeep
- Department of Plant Biotechnology, University of Agricultural Sciences, GKVK, Bangalore, 560 065, India
| | - G D Vidyadhar
- Department of Plant Biotechnology, University of Agricultural Sciences, GKVK, Bangalore, 560 065, India
| |
Collapse
|
3
|
Sila T, Surasombatpattana S, Rajborirug S, Laochareonsuk W, Choochuen P, Kongkamol C, Ingviya T, Prompat N, Mahasirimongkol S, Sangkhathat S, Aiewsakun P. SARS-CoV-2 variant with the spike protein mutation F306L in the southern border provinces of Thailand. Sci Rep 2024; 14:7729. [PMID: 38565881 PMCID: PMC10987673 DOI: 10.1038/s41598-024-56646-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 03/08/2024] [Indexed: 04/04/2024] Open
Abstract
The southernmost part of Thailand is a unique and culturally diverse region that has been greatly affected by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) outbreak during the coronavirus disease-2019 pandemic. To gain insights into this situation, we analyzed 1942 whole-genome sequences of SARS-CoV-2 obtained from the five southernmost provinces of Thailand between April 2021 and March 2022, together with those publicly available in the Global Initiative on Sharing All Influenza Data database. Our analysis revealed evidence for transboundary transmissions of the virus in and out of the five southernmost provinces during the study period, from both domestic and international sources. The most prevalent viral variant in our sequence dataset was the Delta B.1.617.2.85 variant, also known as the Delta AY.85 variant, with many samples carrying a non-synonymous mutation F306L in their spike protein. Protein-protein docking and binding interface analyses suggested that the mutation may enhance the binding between the spike protein and host cell receptor protein angiotensin-converting enzyme 2, and we found that the mutation was significantly associated with an increased fatality rate. This mutation has also been observed in other SARS-CoV-2 variants, suggesting that it is of particular interest and should be monitored.
Collapse
Affiliation(s)
- Thanit Sila
- Department of Pathology, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Smonrapat Surasombatpattana
- Department of Pathology, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Songyos Rajborirug
- Department of Epidemiology, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Wison Laochareonsuk
- Division of Surgery, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
- Translational Medicine Research Center, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Pongsakorn Choochuen
- Translational Medicine Research Center, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
- Department of Biomedical Sciences and Biomedical Engineering, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Chanon Kongkamol
- Department of Family Medicine and Preventive Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Thammasin Ingviya
- Department of Family Medicine and Preventive Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand
| | - Napat Prompat
- Faculty of Medical Technology, Medical of Technology Service Center, Prince of Songkla University, Songkhla, 90110, Thailand
| | - Surakameth Mahasirimongkol
- Department of Medical Sciences, Genetics Center, Medical Life Sciences Institute, Ministry of Public Health, Nonthaburi, 11000, Thailand
| | - Surasak Sangkhathat
- Division of Surgery, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand.
- Translational Medicine Research Center, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand.
| | - Pakorn Aiewsakun
- Pornchai Matangkasombut Center for Microbial Genomics, Department of Microbiology, Faculty of Science, Mahidol University, Bangkok, 10400, Thailand.
- Department of Microbiology, Faculty of Science, Mahidol University, Bangkok, 10400, Thailand.
| |
Collapse
|
4
|
Kosenko M, Onkhonova G, Susloparov I, Ryzhikov A. SARS-CoV-2 proteins structural studies using synchrotron radiation. Biophys Rev 2023; 15:1185-1194. [PMID: 37974992 PMCID: PMC10643813 DOI: 10.1007/s12551-023-01153-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 09/20/2023] [Indexed: 11/19/2023] Open
Abstract
In the process of the development of structural biology, both the size and the complexity of the determined macromolecular structures have grown significantly. As a result, the range of application areas for the results of structural studies of biological macromolecules has expanded. Significant progress in the development of structural biology methods has been largely achieved through the use of synchrotron radiation. Modern sources of synchrotron radiation allow to conduct high-performance structural studies with high temporal and spatial resolution. Thus, modern techniques make it possible to obtain not only static structures, but also to study dynamic processes, which play a key role in understanding biological mechanisms. One of the key directions in the development of structural research is the drug design based on the structures of biomolecules. Synchrotron radiation offers insights into the three-dimensional time-resolved structure of individual viral proteins and their complexes at atomic resolution. The rapid and accurate determination of protein structures is crucial for understanding viral pathogenicity and designing targeted therapeutics. Through the application of experimental techniques, including X-ray crystallography and small-angle X-ray scattering (SAXS), it is possible to elucidate the structural details of SARS-CoV-2 virion containing 4 structural, 16 nonstructural proteins (nsp), and several accessory proteins. The most studied potential targets for vaccines and drugs are the structural spike (S) protein, which is responsible for entering the host cell, as well as nonstructural proteins essential for replication and transcription, such as main protease (Mpro), papain-like protease (PLpro), and RNA-dependent RNA polymerase (RdRp). This article provides a brief overview of structural analysis techniques, with focus on synchrotron radiation-based methods applied to the analysis of SARS-CoV-2 proteins.
Collapse
Affiliation(s)
- Maksim Kosenko
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Galina Onkhonova
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Ivan Susloparov
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| | - Alexander Ryzhikov
- Federal Budgetary Research Institution State Research Center of Virology and Biotechnology “Vector” Rospotrebnadzor, Koltsovo, 630559 Russia
| |
Collapse
|
5
|
Paiardini A. Protein Structure Prediction in Drug Discovery. Biomolecules 2023; 13:1258. [PMID: 37627323 PMCID: PMC10452104 DOI: 10.3390/biom13081258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 08/15/2023] [Indexed: 08/27/2023] Open
Abstract
When the results of DeepMind's AlphaFold2 at CASP were announced in 2020, the scientific world was so amazed by how effectively it performed that "it will change everything" became the motto for this revolution [...].
Collapse
Affiliation(s)
- Alessandro Paiardini
- Department of Biochemical Sciences "A. Rossi Fanelli", Sapienza University, 00185 Rome, Italy
| |
Collapse
|
6
|
Gutnik D, Evseev P, Miroshnikov K, Shneider M. Using AlphaFold Predictions in Viral Research. Curr Issues Mol Biol 2023; 45:3705-3732. [PMID: 37185764 PMCID: PMC10136805 DOI: 10.3390/cimb45040240] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/19/2023] [Accepted: 04/20/2023] [Indexed: 05/17/2023] Open
Abstract
Elucidation of the tertiary structure of proteins is an important task for biological and medical studies. AlphaFold, a modern deep-learning algorithm, enables the prediction of protein structure to a high level of accuracy. It has been applied in numerous studies in various areas of biology and medicine. Viruses are biological entities infecting eukaryotic and procaryotic organisms. They can pose a danger for humans and economically significant animals and plants, but they can also be useful for biological control, suppressing populations of pests and pathogens. AlphaFold can be used for studies of molecular mechanisms of viral infection to facilitate several activities, including drug design. Computational prediction and analysis of the structure of bacteriophage receptor-binding proteins can contribute to more efficient phage therapy. In addition, AlphaFold predictions can be used for the discovery of enzymes of bacteriophage origin that are able to degrade the cell wall of bacterial pathogens. The use of AlphaFold can assist fundamental viral research, including evolutionary studies. The ongoing development and improvement of AlphaFold can ensure that its contribution to the study of viral proteins will be significant in the future.
Collapse
Affiliation(s)
- Daria Gutnik
- Limnological Institute of the Siberian Branch of the Russian Academy of Sciences, 3 Ulan-Batorskaya Str., 664033 Irkutsk, Russia
| | - Peter Evseev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 16/10 Miklukho-Maklaya Str., GSP-7, 117997 Moscow, Russia
| | - Konstantin Miroshnikov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 16/10 Miklukho-Maklaya Str., GSP-7, 117997 Moscow, Russia
| | - Mikhail Shneider
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 16/10 Miklukho-Maklaya Str., GSP-7, 117997 Moscow, Russia
| |
Collapse
|