1
|
Defilippo A, Veltri P, Lió P, Guzzi PH. Leveraging graph neural networks for supporting automatic triage of patients. Sci Rep 2024; 14:12548. [PMID: 38822012 PMCID: PMC11143315 DOI: 10.1038/s41598-024-63376-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 05/28/2024] [Indexed: 06/02/2024] Open
Abstract
Patient triage is crucial in emergency departments, ensuring timely and appropriate care based on correctly evaluating the emergency grade of patient conditions. Triage methods are generally performed by human operator based on her own experience and information that are gathered from the patient management process. Thus, it is a process that can generate errors in emergency-level associations. Recently, Traditional triage methods heavily rely on human decisions, which can be subjective and prone to errors. A growing interest has recently been focused on leveraging artificial intelligence (AI) to develop algorithms to maximize information gathering and minimize errors in patient triage processing. We define and implement an AI-based module to manage patients' emergency code assignments in emergency departments. It uses historical data from the emergency department to train the medical decision-making process. Data containing relevant patient information, such as vital signs, symptoms, and medical history, accurately classify patients into triage categories. Experimental results demonstrate that the proposed algorithm achieved high accuracy outperforming traditional triage methods. By using the proposed method, we claim that healthcare professionals can predict severity index to guide patient management processing and resource allocation.
Collapse
Affiliation(s)
- Annamaria Defilippo
- Dept. Medical and Surgical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy
| | - Pierangelo Veltri
- DIMES Department of Informatics, Modeling, Electronics and Systems, UNICAL, Rende, Cosenza, Italy
| | - Pietro Lió
- Department of Computer Science and Technology, Cambridge University, Cambridge, UK
| | - Pietro Hiram Guzzi
- Dept. Medical and Surgical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy.
| |
Collapse
|
2
|
Giancotti R, Lomoio U, Puccio B, Tradigo G, Vizza P, Torti C, Veltri P, Guzzi PH. The Omicron XBB.1 Variant and Its Descendants: Genomic Mutations, Rapid Dissemination and Notable Characteristics. BIOLOGY 2024; 13:90. [PMID: 38392308 PMCID: PMC10886209 DOI: 10.3390/biology13020090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 01/26/2024] [Accepted: 01/30/2024] [Indexed: 02/24/2024]
Abstract
The SARS-CoV-2 virus, which is a major threat to human health, has undergone many mutations during the replication process due to errors in the replication steps and modifications in the structure of viral proteins. The XBB variant was identified for the first time in Singapore in the fall of 2022. It was then detected in other countries, including the United States, Canada, and the United Kingdom. We study the impact of sequence changes on spike protein structure on the subvariants of XBB, with particular attention to the velocity of variant diffusion and virus activity with respect to its diffusion. We examine the structural and functional distinctions of the variants in three different conformations: (i) spike glycoprotein in complex with ACE2 (1-up state), (ii) spike glycoprotein (closed-1 state), and (iii) S protein (open-1 state). We also estimate the affinity binding between the spike protein and ACE2. The market binding affinity observed in specific variants raises questions about the efficacy of current vaccines in preparing the immune system for virus variant recognition. This work may be useful in devising strategies to manage the ongoing COVID-19 pandemic. To stay ahead of the virus evolution, further research and surveillance should be carried out to adjust public health measures accordingly.
Collapse
Affiliation(s)
- Raffaele Giancotti
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Ugo Lomoio
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Barbara Puccio
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | | | - Patrizia Vizza
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Carlo Torti
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Pierangelo Veltri
- Department of Computer Engineering, Modelling, Electronics and System, University of Calabria, 87036 Rende, Italy
| | - Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| |
Collapse
|
3
|
Tradigo G, Das JK, Vizza P, Roy S, Guzzi PH, Veltri P. Strategies and Trends in COVID-19 Vaccination Delivery: What We Learn and What We May Use for the Future. Vaccines (Basel) 2023; 11:1496. [PMID: 37766172 PMCID: PMC10535057 DOI: 10.3390/vaccines11091496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/03/2023] [Accepted: 09/13/2023] [Indexed: 09/29/2023] Open
Abstract
Vaccination has been the most effective way to control the outbreak of the COVID-19 pandemic. The numbers and types of vaccines have reached considerable proportions, even if the question of vaccine procedures and frequency still needs to be resolved. We have come to learn the necessity of defining vaccination distribution strategies with regard to COVID-19 that could be used for any future pandemics of similar gravity. In fact, vaccine monitoring implies the existence of a strategy that should be measurable in terms of input and output, based on a mathematical model, including death rates, the spread of infections, symptoms, hospitalization, and so on. This paper addresses the issue of vaccine diffusion and strategies for monitoring the pandemic. It provides a description of the importance and take up of vaccines and the links between procedures and the containment of COVID-19 variants, as well as the long-term effects. Finally, the paper focuses on the global scenario in a world undergoing profound social and political change, with particular attention on current and future health provision. This contribution would represent an example of vaccination experiences, which can be useful in other pandemic or epidemiological contexts.
Collapse
Affiliation(s)
- Giuseppe Tradigo
- Department of Computer Science, eCampus University, 22060 Novedrate, Italy;
| | - Jayanta Kumar Das
- Longitudinal Studies Section, Translation Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA;
| | - Patrizia Vizza
- Department of Surgical and Medical Science, Magna Græcia University, 88100 Catanzaro, Italy;
| | - Swarup Roy
- Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok 737102, India;
| | - Pietro Hiram Guzzi
- Department of Surgical and Medical Science, Magna Græcia University, 88100 Catanzaro, Italy;
| | - Pierangelo Veltri
- Department of Computer Science, Modelling, Electronics and Systems, University of Calabria, 87036 Rende, Italy;
| |
Collapse
|
4
|
Shukla N, Srivastava N, Gupta R, Srivastava P, Narayan J. COVID Variants, Villain and Victory: A Bioinformatics Perspective. Microorganisms 2023; 11:2039. [PMID: 37630599 PMCID: PMC10459809 DOI: 10.3390/microorganisms11082039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 07/11/2023] [Accepted: 07/11/2023] [Indexed: 08/27/2023] Open
Abstract
The SARS-CoV-2 virus, a novel member of the Coronaviridae family, is responsible for the viral infection known as Coronavirus Disease 2019 (COVID-19). In response to the urgent and critical need for rapid detection, diagnosis, analysis, interpretation, and treatment of COVID-19, a wide variety of bioinformatics tools have been developed. Given the virulence of SARS-CoV-2, it is crucial to explore the pathophysiology of the virus. We intend to examine how bioinformatics, in conjunction with next-generation sequencing techniques, can be leveraged to improve current diagnostic tools and streamline vaccine development for emerging SARS-CoV-2 variants. We also emphasize how bioinformatics, in general, can contribute to critical areas of biomedicine, including clinical diagnostics, SARS-CoV-2 genomic surveillance and its evolution, identification of potential drug targets, and development of therapeutic strategies. Currently, state-of-the-art bioinformatics tools have helped overcome technical obstacles with respect to genomic surveillance and have assisted in rapid detection, diagnosis, and delivering precise treatment to individuals on time.
Collapse
Affiliation(s)
- Nityendra Shukla
- CSIR Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India; (N.S.); (R.G.)
| | - Neha Srivastava
- Amity Institute of Biotechnology, Amity University, Uttar Pradesh, Lucknow Campus, Lucknow 226010, India; (N.S.); (P.S.)
| | - Rohit Gupta
- CSIR Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India; (N.S.); (R.G.)
| | - Prachi Srivastava
- Amity Institute of Biotechnology, Amity University, Uttar Pradesh, Lucknow Campus, Lucknow 226010, India; (N.S.); (P.S.)
| | - Jitendra Narayan
- CSIR Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India; (N.S.); (R.G.)
| |
Collapse
|
5
|
Lomoio U, Puccio B, Tradigo G, Guzzi PH, Veltri P. SARS-CoV-2 protein structure and sequence mutations: Evolutionary analysis and effects on virus variants. PLoS One 2023; 18:e0283400. [PMID: 37471335 PMCID: PMC10358949 DOI: 10.1371/journal.pone.0283400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 07/04/2023] [Indexed: 07/22/2023] Open
Abstract
The structure and sequence of proteins strongly influence their biological functions. New models and algorithms can help researchers in understanding how the evolution of sequences and structures is related to changes in functions. Recently, studies of SARS-CoV-2 Spike (S) protein structures have been performed to predict binding receptors and infection activity in COVID-19, hence the scientific interest in the effects of virus mutations due to sequence, structure and vaccination arises. However, there is the need for models and tools to study the links between the evolution of S protein sequence, structure and functions, and virus transmissibility and the effects of vaccination. As studies on S protein have been generated a large amount of relevant information, we propose in this work to use Protein Contact Networks (PCNs) to relate protein structures with biological properties by means of network topology properties. Topological properties are used to compare the structural changes with sequence changes. We find that both node centrality and community extraction analysis can be used to relate protein stability and functionality with sequence mutations. Starting from this we compare structural evolution to sequence changes and study mutations from a temporal perspective focusing on virus variants. Finally by applying our model to the Omicron variant we report a timeline correlation between Omicron and the vaccination campaign.
Collapse
Affiliation(s)
- Ugo Lomoio
- Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Italy
| | - Barbara Puccio
- Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Italy
| | | | - Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Italy
| | | |
Collapse
|
6
|
Combi C, Facelli JC, Haddawy P, Holmes JH, Koch S, Liu H, Meyer J, Peleg M, Pozzi G, Stiglic G, Veltri P, Yang CC. The IHI Rochester Report 2022 on Healthcare Informatics Research: Resuming After the CoViD-19. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023; 7:169-202. [PMID: 37359193 PMCID: PMC10150351 DOI: 10.1007/s41666-023-00126-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/01/2022] [Accepted: 02/02/2023] [Indexed: 06/28/2023]
Abstract
In 2020, the CoViD-19 pandemic spread worldwide in an unexpected way and suddenly modified many life issues, including social habits, social relationships, teaching modalities, and more. Such changes were also observable in many different healthcare and medical contexts. Moreover, the CoViD-19 pandemic acted as a stress test for many research endeavors, and revealed some limitations, especially in contexts where research results had an immediate impact on the social and healthcare habits of millions of people. As a result, the research community is called to perform a deep analysis of the steps already taken, and to re-think steps for the near and far future to capitalize on the lessons learned due to the pandemic. In this direction, on June 09th-11th, 2022, a group of twelve healthcare informatics researchers met in Rochester, MN, USA. This meeting was initiated by the Institute for Healthcare Informatics-IHI, and hosted by the Mayo Clinic. The goal of the meeting was to discuss and propose a research agenda for biomedical and health informatics for the next decade, in light of the changes and the lessons learned from the CoViD-19 pandemic. This article reports the main topics discussed and the conclusions reached. The intended readers of this paper, besides the biomedical and health informatics research community, are all those stakeholders in academia, industry, and government, who could benefit from the new research findings in biomedical and health informatics research. Indeed, research directions and social and policy implications are the main focus of the research agenda we propose, according to three levels: the care of individuals, the healthcare system view, and the population view.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Pierangelo Veltri
- University Magna Græcia, Catanzaro, Italy
- University of Calabria, Rende, Italy
| | | |
Collapse
|
7
|
Guzzi PH, di Paola L, Puccio B, Lomoio U, Giuliani A, Veltri P. Computational analysis of the sequence-structure relation in SARS-CoV-2 spike protein using protein contact networks. Sci Rep 2023; 13:2837. [PMID: 36808182 PMCID: PMC9936485 DOI: 10.1038/s41598-023-30052-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 02/15/2023] [Indexed: 02/19/2023] Open
Abstract
The structure of proteins impacts directly on the function they perform. Mutations in the primary sequence can provoke structural changes with consequent modification of functional properties. SARS-CoV-2 proteins have been extensively studied during the pandemic. This wide dataset, related to sequence and structure, has enabled joint sequence-structure analysis. In this work, we focus on the SARS-CoV-2 S (Spike) protein and the relations between sequence mutations and structure variations, in order to shed light on the structural changes stemming from the position of mutated amino acid residues in three different SARS-CoV-2 strains. We propose the use of protein contact network (PCN) formalism to: (i) obtain a global metric space and compare various molecular entities, (ii) give a structural explanation of the observed phenotype, and (iii) provide context dependent descriptors of single mutations. PCNs have been used to compare sequence and structure of the Alpha, Delta, and Omicron SARS-CoV-2 variants, and we found that omicron has a unique mutational pattern leading to different structural consequences from mutations of other strains. The non-random distribution of changes in network centrality along the chain has allowed to shed light on the structural (and functional) consequences of mutations.
Collapse
Affiliation(s)
- Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy.
| | - Luisa di Paola
- grid.9657.d0000 0004 1757 5329Unit of Chemical-Physics Fundamentals in Chemical Engineering, Department of Engineering, Universita Campus Bio-Medico di Roma, via Alvaro del Portillo 21, 00128 Rome, Italy
| | - Barbara Puccio
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy
| | - Ugo Lomoio
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy
| | - Alessandro Giuliani
- grid.416651.10000 0000 9120 6856Environment and Health Department, Istituto Superiore di Sanita, Rome, Italy
| | - Pierangelo Veltri
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy ,grid.7778.f0000 0004 1937 0319Department of Computer, Modeling, Electronics and System Engineering, University of Calabria, Rende, Italy
| |
Collapse
|
8
|
Structural analysis of SARS-CoV-2 Spike protein variants through graph embedding. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS 2023; 12:3. [PMID: 36506261 PMCID: PMC9718452 DOI: 10.1007/s13721-022-00397-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 10/21/2022] [Accepted: 11/16/2022] [Indexed: 12/03/2022]
Abstract
Since December 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has affected almost all countries. The unprecedented spreading of this virus has led to the insurgence of many variants that impact protein sequence and structure that need continuous monitoring and analysis of the sequences to understand the genetic evolution and to prevent possible dangerous outcomes. Some variants causing the modification of the structure of the proteins, such as the Spike protein S, need to be monitored. Protein contact networks (PCNs) have been recently proposed as a modelling framework for protein structures. In such a framework, the protein structure is represented as an unweighted graph whose nodes are the central atoms of the backbones (C- α ), and edges connect two atoms falling in the spatial distance between 4 and 7 Å. PCN may also be a data-rich representation since we may add to each node/atom biological and topological information. Such formalism enables the possibility of using algorithms from graph theory to analyze the graph. In particular, we refer to graph embedding methods enabling the analysis of such graphs with deep learning methods. In this work, we explore the possibility of embedding PCN using Graph Neural Networks and then analyze in the embedded space each residue to distinguish mutated residues from non-mutated ones. In particular, we analyzed the structure of the Spike protein of the coronavirus. First, we obtained the PCNs of the Spike protein for the wild-type, α , β , and δ variants. Then we used the GraphSage embedding algorithm to obtain an unsupervised embedding. Then we analyzed the point of mutation in the embedded space. Results show the characteristics of the mutation point in the embedding space.
Collapse
|
9
|
Hosseinzadeh MM, Cannataro M, Guzzi PH, Dondi R. Temporal networks in biology and medicine: a survey on models, algorithms, and tools. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS 2022; 12:10. [PMID: 36618274 PMCID: PMC9803903 DOI: 10.1007/s13721-022-00406-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/16/2022] [Accepted: 12/17/2022] [Indexed: 01/01/2023]
Abstract
The use of static graphs for modelling and analysis of biological and biomedical data plays a key role in biomedical research. However, many real-world scenarios present dynamic behaviours resulting in both node and edges modification as well as feature evolution. Consequently, ad-hoc models for capturing these evolutions along the time have been introduced, also referred to as dynamic, temporal, time-varying graphs. Here, we focus on temporal graphs, i.e., graphs whose evolution is represented by a sequence of time-ordered snapshots. Each snapshot represents a graph active in a particular timestamp. We survey temporal graph models and related algorithms, presenting fundamentals aspects and the recent advances. We formally define temporal graphs, focusing on the problem setting and we present their main applications in biology and medicine. We also present temporal graph embedding and the application to recent problems such as epidemic modelling. Finally, we further state some promising research directions in the area. Main results of this study include a systematic review of fundamental temporal network problems and their algorithmic solutions considered in the literature, in particular those having application in computational biology and medicine. We also include the main software developed in this context.
Collapse
Affiliation(s)
| | - Mario Cannataro
- Department of Surgical and Medical Sciences and Data Analytics Research Center, University Magna Graecia of Catanzaro, Catanzaro, Italy
| | - Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences and Data Analytics Research Center, University Magna Graecia of Catanzaro, Catanzaro, Italy
| | - Riccardo Dondi
- Department of Literature, Philosophy, Communication Studies, University of Bergamo, Bergamo, Italy
| |
Collapse
|
10
|
Prosperi M, Rife B, Marini S, Salemi M. Transmission cluster characteristics of global, regional, and lineage-specific SARS-CoV-2 phylogenies. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2022; 2022:2940-2944. [PMID: 36780250 PMCID: PMC9912475 DOI: 10.1109/bibm55620.2022.9995364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The SARS-CoV-2 pandemic has been presenting in periodic waves and multiple variants, of which some dominated over time with increased transmissibility. SARS-CoV-2 is still adapting in the human population, thus it is crucial to understand its evolutionary patterns and dynamics ahead of time. In this work, we analyzed transmission clusters and topology of SARS-CoV-2 phylogenies at the global, regional (North America) and clade-specific (Delta and Omicron) epidemic scales. We used the Nextstrain's nCov open global all-time phylogeny (September 2022, 2,698 strains, 2,243 for North America, 499 for Delta21A, and 543 for Omicron20M), with Nextstrain's clade annotation and Pango lineages. Transmission clusters were identified using Phylopart, DYNAMITE, and several tree imbalance measures were calculated, including staircase-ness, Sackin and Colless index. We found that the phylogenetic clustering profiles of the global epidemic have highest diversification at a distance threshold of 3% (divergence of 10, where the tree sampled median is 49). Phylopart and DYNAMITE clusters moderately-to-highly agree with the Pango nomenclature and the Nextstrain's clade. At the regional and clade-specific scale, transmission clustering profiles tend to flatten and similar clusters are found at distance thresholds between 0.05% and 25%. All the considered phylogenies exhibit high tree imbalance with respect to what expected in random phylogenies, suggesting short infection times and antigenic drift, perhaps due to progressive transition from innate to adaptive immunity in the population.
Collapse
Affiliation(s)
- Mattia Prosperi
- Department of Epidemiology, College of Public Health and
Health Professions, University of Florida Gainesville, Fl,
USA
| | - Brittany Rife
- Department of Pathology, Immunology and Laboratory
Medicine, College of Medicine, University of Florida
Gainesville, Fl, USA
| | - Simone Marini
- Department of Epidemiology, College of Public Health and
Health Professions, University of Florida Gainesville, Fl,
USA
| | - Marco Salemi
- Department of Pathology, Immunology and Laboratory
Medicine, College of Medicine, University of Florida
Gainesville, Fl, USA
| |
Collapse
|
11
|
Guzzi PH, Di Paola L, Giuliani A, Veltri P. PCN-Miner: an open-source extensible tool for the analysis of Protein Contact Networks. Bioinformatics 2022; 38:4235-4237. [PMID: 35799364 DOI: 10.1093/bioinformatics/btac450] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 06/14/2022] [Accepted: 07/04/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Protein Contact Network (PCN) is a powerful method for analysing the structure and function of proteins, with a specific focus on disclosing the molecular features of allosteric regulation through the discovery of modular substructures. The importance of PCN analysis has been shown in many contexts, such as the analysis of SARS-CoV-2 Spike protein and its complexes with the Angiotensin Converting Enzyme 2 (ACE2) human receptors. Even if there exist many software tools implementing such methods, there is a growing need for the introduction of tools integrating existing approaches. RESULTS We present PCN-Miner, a software tool implemented in the Python programming language, able to (i) import protein structures from the Protein Data Bank; (ii) generate the corresponding PCN; (iii) model, analyse and visualize PCNs and related protein structures by using a set of known algorithms and metrics. The PCN-Miner can cover a large set of applications: from clustering to embedding and subsequent analysis. AVAILABILITY AND IMPLEMENTATION The PCN-Miner tool is freely available at the following GitHub repository: https://github.com/hguzzi/ProteinContactNetworks. It is also available in the Python Package Index (PyPI) repository.
Collapse
Affiliation(s)
- Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Luisa Di Paola
- Unit of Chemical-Physics Fundamentals in Chemical Engineering, Department of Engineering, Università Campus Bio-Medico di Roma, 00128 Rome, Italy
| | - Alessandro Giuliani
- Environment and Health Department, Istituto Superiore di Sanità, 00161Rome, Italy
| | - Pierangelo Veltri
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| |
Collapse
|
12
|
Hiram Guzzi P, Petrizzelli F, Mazza T. Disease spreading modeling and analysis: a survey. Brief Bioinform 2022; 23:6606045. [PMID: 35692095 DOI: 10.1093/bib/bbac230] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 05/16/2022] [Accepted: 05/17/2022] [Indexed: 12/18/2022] Open
Abstract
MOTIVATION The control of the diffusion of diseases is a critical subject of a broad research area, which involves both clinical and political aspects. It makes wide use of computational tools, such as ordinary differential equations, stochastic simulation frameworks and graph theory, and interaction data, from molecular to social granularity levels, to model the ways diseases arise and spread. The coronavirus disease 2019 (COVID-19) is a perfect testbench example to show how these models may help avoid severe lockdown by suggesting, for instance, the best strategies of vaccine prioritization. RESULTS Here, we focus on and discuss some graph-based epidemiological models and show how their use may significantly improve the disease spreading control. We offer some examples related to the recent COVID-19 pandemic and discuss how to generalize them to other diseases.
Collapse
Affiliation(s)
- Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University, Catanzaro, 88110, Italy
| | - Francesco Petrizzelli
- Bioinformatics unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, 71013, Italy
| | - Tommaso Mazza
- Bioinformatics unit, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, 71013, Italy
| |
Collapse
|
13
|
Petrizzelli F, Guzzi PH, Mazza T. Beyond COVID-19 Pandemic: Topology-aware optimization of vaccination strategy for minimizing virus spreading. Comput Struct Biotechnol J 2022; 20:2664-2671. [PMID: 35664237 PMCID: PMC9135485 DOI: 10.1016/j.csbj.2022.05.040] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 12/12/2022] Open
Abstract
Paper discusses the relevance of the adoption of ad-hoc vaccination strategies. Paper shows how to evaluate the impact of different vaccination strategy by considering network-based models. Tailored interventions, e.g., vaccination, applied on central nodes of these networks may efficiently stop the propagation of an infection. The way node "centrality" is defined is the key to curb infection spreading.
The mitigation of an infectious disease spreading has recently gained considerable attention from the research community. It may be obtained by adopting sanitary measurements (e.g., vaccination, wearing masks), social rules (e.g., social distancing), together with an extensive vaccination campaign. Vaccination is currently the primary way for mitigating the Coronavirus Disease (COVID-19) outbreak without severe lockdown. Its effectiveness also depends on the number and timeliness of administrations and thus demands strict prioritization criteria. Almost all countries have prioritized similar classes of exposed workers: healthcare professionals and the elderly, obtaining to maximize the survival of patients and years of life saved. Nevertheless, the virus is currently spreading at high rates, and any prioritization criterion so far adopted did not account for the structural organization of the contact networks. We reckon that a network where nodes are people while the edges represent their social contacts may efficiently model the virus’s spreading. It is known that tailored interventions (e.g., vaccination) on central nodes may efficiently stop the propagation, thereby eliminating the “bridge edges.” We then introduce such a model and consider both synthetic and real datasets. We present the benefits of a topology-aware versus an age-based vaccination strategy to mitigate the spreading of the virus. The code is available at https://github.com/mazzalab/playgrounds.
Collapse
Affiliation(s)
- Francesco Petrizzelli
- Laboratory of Bioinformatics, Fondazione IRCCS Casa Sollievo della Sofferenza, Viale Capuccini, 71013 S. Giovanni Rotondo, Fg, Italy
| | - Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, University of Catanzaro, Catanzaro, Campus S Venuta, 88100, Italy
- Corresponding authors.
| | - Tommaso Mazza
- Laboratory of Bioinformatics, Fondazione IRCCS Casa Sollievo della Sofferenza, Viale Capuccini, 71013 S. Giovanni Rotondo, Fg, Italy
- Corresponding authors.
| |
Collapse
|
14
|
Jukič M, Kores K, Janežič D, Bren U. Repurposing of Drugs for SARS-CoV-2 Using Inverse Docking Fingerprints. Front Chem 2021; 9:757826. [PMID: 35028304 PMCID: PMC8748264 DOI: 10.3389/fchem.2021.757826] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 11/12/2021] [Indexed: 01/08/2023] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 or SARS-CoV-2 is a virus that belongs to the Coronaviridae family. This group of viruses commonly causes colds but possesses a tremendous pathogenic potential. In humans, an outbreak of SARS caused by the SARS-CoV virus was first reported in 2003, followed by 2012 when the Middle East respiratory syndrome coronavirus (MERS-CoV) led to an outbreak of Middle East respiratory syndrome (MERS). Moreover, COVID-19 represents a serious socioeconomic and global health problem that has already claimed more than four million lives. To date, there are only a handful of therapeutic options to combat this disease, and only a single direct-acting antiviral, the conditionally approved remdesivir. Since there is an urgent need for active drugs against SARS-CoV-2, the strategy of drug repurposing represents one of the fastest ways to achieve this goal. An in silico drug repurposing study using two methods was conducted. A structure-based virtual screening of the FDA-approved drug database on SARS-CoV-2 main protease was performed, and the 11 highest-scoring compounds with known 3CLpro activity were identified while the methodology was used to report further 11 potential and completely novel 3CLpro inhibitors. Then, inverse molecular docking was performed on the entire viral protein database as well as on the Coronaviridae family protein subset to examine the hit compounds in detail. Instead of target fishing, inverse docking fingerprints were generated for each hit compound as well as for the five most frequently reported and direct-acting repurposed drugs that served as controls. In this way, the target-hitting space was examined and compared and we can support the further biological evaluation of all 11 newly reported hits on SARS-CoV-2 3CLpro as well as recommend further in-depth studies on antihelminthic class member compounds. The authors acknowledge the general usefulness of this approach for a full-fledged inverse docking fingerprint screening in the future.
Collapse
Affiliation(s)
- Marko Jukič
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| | - Katarina Kores
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
| | - Dušanka Janežič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| | - Urban Bren
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| |
Collapse
|
15
|
Yang X, Wang W, Ma JL, Qiu YL, Lu K, Cao DS, Wu CK. BioNet: a large-scale and heterogeneous biological network model for interaction prediction with graph convolution. Brief Bioinform 2021; 23:6440126. [PMID: 34849567 PMCID: PMC8690188 DOI: 10.1093/bib/bbab491] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 01/09/2023] Open
Abstract
Motivation Understanding chemical–gene interactions (CGIs) is crucial for screening drugs. Wet experiments are usually costly and laborious, which limits relevant studies to a small scale. On the contrary, computational studies enable efficient in-silico exploration. For the CGI prediction problem, a common method is to perform systematic analyses on a heterogeneous network involving various biomedical entities. Recently, graph neural networks become popular in the field of relation prediction. However, the inherent heterogeneous complexity of biological interaction networks and the massive amount of data pose enormous challenges. This paper aims to develop a data-driven model that is capable of learning latent information from the interaction network and making correct predictions. Results We developed BioNet, a deep biological networkmodel with a graph encoder–decoder architecture. The graph encoder utilizes graph convolution to learn latent information embedded in complex interactions among chemicals, genes, diseases and biological pathways. The learning process is featured by two consecutive steps. Then, embedded information learnt by the encoder is then employed to make multi-type interaction predictions between chemicals and genes with a tensor decomposition decoder based on the RESCAL algorithm. BioNet includes 79 325 entities as nodes, and 34 005 501 relations as edges. To train such a massive deep graph model, BioNet introduces a parallel training algorithm utilizing multiple Graphics Processing Unit (GPUs). The evaluation experiments indicated that BioNet exhibits outstanding prediction performance with a best area under Receiver Operating Characteristic (ROC) curve of 0.952, which significantly surpasses state-of-theart methods. For further validation, top predicted CGIs of cancer and COVID-19 by BioNet were verified by external curated data and published literature.
Collapse
Affiliation(s)
- Xi Yang
- College of Computer, National University of Defense Technology, China
| | - Wei Wang
- National Supercomputer Center in Tianjin, China
| | - Jing-Lun Ma
- College of Computer, National University of Defense Technology, China
| | - Yan-Long Qiu
- College of Computer, National University of Defense Technology, China
| | - Kai Lu
- College of Computer, National University of Defense Technology, China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, China
| | - Cheng-Kun Wu
- Institute for Quantum Information & State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
| |
Collapse
|
16
|
Das J, Thakuri B, MohanKumar K, Roy S, Sljoka A, Sun GQ, Chakraborty A. Mutation-Induced Long-Range Allosteric Interactions in the Spike Protein Determine the Infectivity of SARS-CoV-2 Emerging Variants. ACS OMEGA 2021; 6:31312-31327. [PMID: 34805715 PMCID: PMC8592041 DOI: 10.1021/acsomega.1c05155] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/01/2021] [Indexed: 05/04/2023]
Abstract
The emergence of a variety of highly transmissible SARS-CoV-2 variants, the causative agent of COVID-19, with multiple spike mutations poses serious challenges in overcoming the ongoing deadly pandemic. It is, therefore, essential to understand how these variants gain enhanced ability to evade immune responses with a higher rate of spreading infection. To address this question, here we have individually assessed the effects of SARS-CoV-2 variant-specific spike (S) protein receptor-binding domain (RBD) mutations E484K, K417N, L452Q, L452R, N501Y, and T478K that characterize and differentiate several emerging variants. Despite the hundreds of apparently neutral mutations observed in the domains other than the RBD, we have shown that each RBD mutation site is differentially engaged in an interdomain allosteric network involving mutation sites from a distant domain, affecting interactions with the human receptor angiotensin-converting enzyme-2 (ACE2). This allosteric network couples the residues of the N-terminal domain (NTD) and the RBD, which are modulated by the RBD-specific mutations and are capable of propagating mutation-induced perturbations between these domains through a combination of structural changes and effector-dependent modulations of dynamics. One key feature of this network is the inclusion of compensatory mutations segregated into three characteristically different clusters, where each cluster residue site is allosterically coupled with specific RBD mutation sites. Notably, each RBD mutation acted like a positive allosteric modulator; nevertheless, K417N was shown to have the largest effects among all of the mutations on the allostery and thereby holds the highest binding affinity with ACE2. This result will be useful for designing the targeted control measure and therapeutic efforts aiming at allosteric modulators.
Collapse
Affiliation(s)
- Jayanta
Kumar Das
- Department
of Pediatrics, Johns Hopkins University
School of Medicine, Baltimore, Maryland 21287, United States
| | - Bikash Thakuri
- Department
of Mathematics, Sikkim University, Gangtok, Sikkim 737102, India
| | - Krishnan MohanKumar
- Department
of Pediatrics, Johns Hopkins University
School of Medicine, Baltimore, Maryland 21287, United States
| | - Swarup Roy
- Department
of Computer Applications, Sikkim University, Gangtok, Sikkim 737102, India
| | - Adnan Sljoka
- RIKEN
Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku Tokyo 103-0027, Japan
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 1A1, Canada
| | - Gui-Quan Sun
- Department
of Mathematics, North University of China, Taiyuan, Shanxi 030051, China
- Complex
Systems Research Center, Shanxi University, Taiyuan, Shanxi 030006, China
| | - Amit Chakraborty
- Department
of Mathematics, Sikkim University, Gangtok, Sikkim 737102, India
- , . Phone: +91 9784811895
| |
Collapse
|
17
|
Tayara H, Abdelbaky I, To Chong K. Recent omics-based computational methods for COVID-19 drug discovery and repurposing. Brief Bioinform 2021; 22:6355836. [PMID: 34423353 DOI: 10.1093/bib/bbab339] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 07/09/2021] [Indexed: 12/22/2022] Open
Abstract
The coronavirus disease 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the main reason for the increasing number of deaths worldwide. Although strict quarantine measures were followed in many countries, the disease situation is still intractable. Thus, it is needed to utilize all possible means to confront this pandemic. Therefore, researchers are in a race against the time to produce potential treatments to cure or reduce the increasing infections of COVID-19. Computational methods are widely proving rapid successes in biological related problems, including diagnosis and treatment of diseases. Many efforts in recent months utilized Artificial Intelligence (AI) techniques in the context of fighting the spread of COVID-19. Providing periodic reviews and discussions of recent efforts saves the time of researchers and helps to link their endeavors for a faster and efficient confrontation of the pandemic. In this review, we discuss the recent promising studies that used Omics-based data and utilized AI algorithms and other computational tools to achieve this goal. We review the established datasets and the developed methods that were basically directed to new or repurposed drugs, vaccinations and diagnosis. The tools and methods varied depending on the level of details in the available information such as structures, sequences or metabolic data.
Collapse
Affiliation(s)
- Hilal Tayara
- School of international Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Ibrahim Abdelbaky
- Artificial Intelligence Department, Faculty of Computers and Artificial Intelligence, Benha University, Banha 13518, Egypt
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju, Jeollabukdo 54896, Republic of Korea.,Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
18
|
Das JK, Roy S, Guzzi PH. Analyzing host-viral interactome of SARS-CoV-2 for identifying vulnerable host proteins during COVID-19 pathogenesis. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2021; 93:104921. [PMID: 34004362 PMCID: PMC8123524 DOI: 10.1016/j.meegid.2021.104921] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 05/04/2021] [Accepted: 05/07/2021] [Indexed: 02/07/2023]
Abstract
The development of therapeutic targets for COVID-19 relies on understanding the molecular mechanism of pathogenesis. Identifying genes or proteins involved in the infection mechanism is the key to shedding light on the complex molecular mechanisms. The combined effort of many laboratories distributed throughout the world has produced protein and genetic interactions. We integrated available results and obtained a host protein-protein interaction network composed of 1432 human proteins. Next, we performed network centrality analysis to identify critical proteins in the derived network. Finally, we performed a functional enrichment analysis of central proteins. We observed that the identified proteins are primarily associated with several crucial pathways, including cellular process, signaling transduction, neurodegenerative diseases. We focused on the proteins that are involved in human respiratory tract diseases. We highlighted many potential therapeutic targets, including RBX1, HSPA5, ITCH, RAB7A, RAB5A, RAB8A, PSMC5, CAPZB, CANX, IGF2R, and HSPA1A, which are central and also associated with multiple diseases.
Collapse
Affiliation(s)
- Jayanta Kumar Das
- Department of Pediatrics, School of Medicine, Johns Hopkins University, MD, USA
| | - Swarup Roy
- Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India,Corresponding authors
| | - Pietro Hiram Guzzi
- Department of Medical and Surgical Sciences, Magna Graecia University, Catanzaro, Italy,Corresponding authors
| |
Collapse
|
19
|
Mercatelli D, Pedace E, Veltri P, Giorgi FM, Guzzi PH. Exploiting the molecular basis of age and gender differences in outcomes of SARS-CoV-2 infections. Comput Struct Biotechnol J 2021; 19:4092-4100. [PMID: 34306570 PMCID: PMC8271029 DOI: 10.1016/j.csbj.2021.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 07/06/2021] [Accepted: 07/06/2021] [Indexed: 12/15/2022] Open
Abstract
Motivation: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection (coronavirus disease, 2019; COVID-19) is associated with adverse outcomes in patients. It has been observed that lethality seems to be related to the age of patients. While ageing has been extensively demonstrated to be accompanied by some modifications at the gene expression level, a possible link with COVID-19 manifestation still need to be investigated at the molecular level. Objectives: This study aims to shed out light on a possible link between the increased COVID-19 lethality and the molecular changes that occur in elderly people. Methods: We considered public datasets of ageing-related genes and their expression at the tissue level. We selected human proteins interacting with viral ones that are known to be related to the ageing process. Finally, we investigated changes in the expression level of coding genes at the tissue, gender and age level. Results: We observed a significant intersection between some SARS-CoV-2 interactors and ageing-related genes, suggesting that those genes are particularly affected by COVID-19 infection. Our analysis evidenced that virus infection particularly involves ageing molecular mechanisms centred around proteins EEF2, NPM1, HMGA1, HMGA2, APEX1, CHEK1, PRKDC, and GPX4. We found that HMGA1 and NPM1 have different expressions in the lung of males, while HMGA1, APEX1, CHEK1, EEF2, and NPM1 present changes in expression in males due to ageing effects. Conclusion: Our study generated a mechanistic framework to clarify the correlation between COVID-19 incidence in elderly patients and molecular mechanisms of ageing. We also provide testable hypotheses for future investigation and pharmacological solutions tailored to specific age ranges.
Collapse
Affiliation(s)
| | | | - Pierangelo Veltri
- University of Catanzaro, Department of Medical and Surgical Sciences, Italy
| | | | - Pietro Hiram Guzzi
- University of Catanzaro, Department of Medical and Surgical Sciences, Italy
| |
Collapse
|
20
|
Das JK, Chakraborty S, Roy S. A scheme for inferring viral-host associations based on codon usage patterns identifies the most affected signaling pathways during COVID-19. J Biomed Inform 2021; 118:103801. [PMID: 33965637 PMCID: PMC8102073 DOI: 10.1016/j.jbi.2021.103801] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 05/02/2021] [Accepted: 05/03/2021] [Indexed: 12/16/2022]
Abstract
Understanding the molecular mechanism of COVID-19 pathogenesis helps in the rapid therapeutic target identification. Usually, viral protein targets host proteins in an organized fashion. The expression of any viral gene depends mostly on the host translational machinery. Recent studies report the great significance of codon usage biases in establishing host-viral protein–protein interactions (PPI). Exploring the codon usage patterns between a pair of co-evolved host and viral proteins may present novel insight into the host-viral protein interactomes during disease pathogenesis. Leveraging the similarity in codon usage patterns, we propose a computational scheme to recreate the host-viral protein–protein interaction network. We use host proteins from seventeen (17) essential signaling pathways for our current work towards understanding the possible targeting mechanism of SARS-CoV-2 proteins. We infer both negatively and positively interacting edges in the network. Further, extensive analysis is performed to understand the host PPI network topologically and the attacking behavior of the viral proteins. Our study reveals that viral proteins mostly utilize codons, rare in the targeted host proteins (negatively correlated interaction). Among them, non-structural proteins, NSP3 and structural protein, Spike (S), are the most influential proteins in interacting with multiple host proteins. While ranking the most affected pathways, MAPK pathways observe to be the worst affected during the SARS-CoV-2 infection. Several proteins participating in multiple pathways are highly central in host PPI and mostly targeted by multiple viral proteins. We observe many potential targets (host proteins) from the affected pathways associated with the various drug molecules, including Arsenic trioxide, Dexamethasone, Hydroxychloroquine, Ritonavir, and Interferon beta, which are either under clinical trial or in use during COVID-19.
Collapse
Affiliation(s)
- Jayanta Kumar Das
- Department of Pediatrics, Johns Hopkins University, School of Medicine, MD, USA
| | | | - Swarup Roy
- Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India.
| |
Collapse
|
21
|
Characterizing genomic variants and mutations in SARS-CoV-2 proteins from Indian isolates. GENE REPORTS 2021; 25:101044. [PMID: 33623833 PMCID: PMC7893251 DOI: 10.1016/j.genrep.2021.101044] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/25/2020] [Accepted: 01/29/2021] [Indexed: 12/17/2022]
Abstract
SARS-CoV-2 is mutating and creating divergent variants by altering the composition of essential constituent proteins. Pharmacologically, it is crucial to understand the diverse mechanism of mutations for stable vaccine or anti-viral drug design. Our current study concentrates on all the constituent proteins of 469 SARS-CoV-2 genome samples, derived from Indian patients. However, the study may easily be extended to the samples across the globe. We perform clustering analysis towards identifying unique variants in each of the SARS-CoV-2 proteins. A total of 536 mutated positions within the coding regions of SARS-CoV-2 proteins are detected among the identified variants from Indian isolates. We quantify mutations by focusing on the unique variants of each SARS-CoV-2 protein. We report the average number of mutation per variant, percentage of mutated positions, synonymous and non-synonymous mutations, mutations occurring in three codon positions and so on. Our study reveals the most susceptible six (06) proteins, which are ORF1ab, Spike (S), Nucleocapsid (N), ORF3a, ORF7a, and ORF8. Several non-synonymous substitutions are observed to be unique in different SARS-CoV-2 proteins. A total of 57 possible deleterious amino acid substitutions are predicted, which may impact on the protein functions. Several mutations show a large decrease in protein stability and are observed in putative functional domains of the proteins that might have some role in disease pathogenesis. We observe a good number of physicochemical property change during above deleterious substitutions.
Collapse
|