1
|
Guzzi PH, di Paola L, Puccio B, Lomoio U, Giuliani A, Veltri P. Computational analysis of the sequence-structure relation in SARS-CoV-2 spike protein using protein contact networks. Sci Rep 2023; 13:2837. [PMID: 36808182 PMCID: PMC9936485 DOI: 10.1038/s41598-023-30052-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 02/15/2023] [Indexed: 02/19/2023] Open
Abstract
The structure of proteins impacts directly on the function they perform. Mutations in the primary sequence can provoke structural changes with consequent modification of functional properties. SARS-CoV-2 proteins have been extensively studied during the pandemic. This wide dataset, related to sequence and structure, has enabled joint sequence-structure analysis. In this work, we focus on the SARS-CoV-2 S (Spike) protein and the relations between sequence mutations and structure variations, in order to shed light on the structural changes stemming from the position of mutated amino acid residues in three different SARS-CoV-2 strains. We propose the use of protein contact network (PCN) formalism to: (i) obtain a global metric space and compare various molecular entities, (ii) give a structural explanation of the observed phenotype, and (iii) provide context dependent descriptors of single mutations. PCNs have been used to compare sequence and structure of the Alpha, Delta, and Omicron SARS-CoV-2 variants, and we found that omicron has a unique mutational pattern leading to different structural consequences from mutations of other strains. The non-random distribution of changes in network centrality along the chain has allowed to shed light on the structural (and functional) consequences of mutations.
Collapse
Affiliation(s)
- Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy.
| | - Luisa di Paola
- grid.9657.d0000 0004 1757 5329Unit of Chemical-Physics Fundamentals in Chemical Engineering, Department of Engineering, Universita Campus Bio-Medico di Roma, via Alvaro del Portillo 21, 00128 Rome, Italy
| | - Barbara Puccio
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy
| | - Ugo Lomoio
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy
| | - Alessandro Giuliani
- grid.416651.10000 0000 9120 6856Environment and Health Department, Istituto Superiore di Sanita, Rome, Italy
| | - Pierangelo Veltri
- grid.411489.10000 0001 2168 2547Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, Catanzaro, Italy ,grid.7778.f0000 0004 1937 0319Department of Computer, Modeling, Electronics and System Engineering, University of Calabria, Rende, Italy
| |
Collapse
|
2
|
Damodaran K, Khan T, Bickel D, Jaya S, Vranken WF, Sudandiradoss C. New simulation insights on the structural transition mechanism of bovine rhodopsin activation. Proteins 2023; 91:771-780. [PMID: 36629258 DOI: 10.1002/prot.26465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 12/02/2022] [Accepted: 01/02/2023] [Indexed: 01/12/2023]
Abstract
Inactive rhodopsin can absorb photons, which induces different structural transitions that finally activate rhodopsin. We have examined the change in spatial configurations and physicochemical factors that result during the transition mechanism from the inactive to the active rhodopsin state via intermediates. During the activation process, many existing atomic contacts are disrupted, and new ones are formed. This is related to the movement of Helix 5, which tilts away from Helix 3 in the intermediate state in lumirhodopsin and moves closer to Helix 3 again in the active state. Similar patterns of changing atomic contacts are observed between Helices 3 and 5 of the adenosine and neurotensin receptors. In addition, residues 220-238 of rhodopsin, which are disordered in the inactive state, fold in the active state before binding to the Gα, where it catalyzes GDP/GTP exchange on the Gα subunit. Finally, molecular dynamics simulations in the membrane environment revealed that the arrestin binding region adopts a more flexible extended conformation upon phosphorylation, likely promoting arrestin binding and inactivation. In summary, our results provide additional structural understanding of specific rhodopsin activation which might be relevant to other Class A G protein-coupled receptor proteins.
Collapse
Affiliation(s)
- Kamalesh Damodaran
- Department of Integrative Biology, School of Bioscience and Technology, Vellore Institute of Technology, Vellore, India.,Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, Brussels, Belgium
| | - Taushif Khan
- Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, Brussels, Belgium.,Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - David Bickel
- Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, Brussels, Belgium.,Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Sreeshma Jaya
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Wim F Vranken
- Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, Brussels, Belgium.,Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Chinnappan Sudandiradoss
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| |
Collapse
|
3
|
Structural analysis of SARS-CoV-2 Spike protein variants through graph embedding. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS 2023; 12:3. [PMID: 36506261 PMCID: PMC9718452 DOI: 10.1007/s13721-022-00397-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 10/21/2022] [Accepted: 11/16/2022] [Indexed: 12/03/2022]
Abstract
Since December 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has affected almost all countries. The unprecedented spreading of this virus has led to the insurgence of many variants that impact protein sequence and structure that need continuous monitoring and analysis of the sequences to understand the genetic evolution and to prevent possible dangerous outcomes. Some variants causing the modification of the structure of the proteins, such as the Spike protein S, need to be monitored. Protein contact networks (PCNs) have been recently proposed as a modelling framework for protein structures. In such a framework, the protein structure is represented as an unweighted graph whose nodes are the central atoms of the backbones (C- α ), and edges connect two atoms falling in the spatial distance between 4 and 7 Å. PCN may also be a data-rich representation since we may add to each node/atom biological and topological information. Such formalism enables the possibility of using algorithms from graph theory to analyze the graph. In particular, we refer to graph embedding methods enabling the analysis of such graphs with deep learning methods. In this work, we explore the possibility of embedding PCN using Graph Neural Networks and then analyze in the embedded space each residue to distinguish mutated residues from non-mutated ones. In particular, we analyzed the structure of the Spike protein of the coronavirus. First, we obtained the PCNs of the Spike protein for the wild-type, α , β , and δ variants. Then we used the GraphSage embedding algorithm to obtain an unsupervised embedding. Then we analyzed the point of mutation in the embedded space. Results show the characteristics of the mutation point in the embedding space.
Collapse
|
4
|
Guzzi PH, Di Paola L, Giuliani A, Veltri P. PCN-Miner: an open-source extensible tool for the analysis of Protein Contact Networks. Bioinformatics 2022; 38:4235-4237. [PMID: 35799364 DOI: 10.1093/bioinformatics/btac450] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 06/14/2022] [Accepted: 07/04/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Protein Contact Network (PCN) is a powerful method for analysing the structure and function of proteins, with a specific focus on disclosing the molecular features of allosteric regulation through the discovery of modular substructures. The importance of PCN analysis has been shown in many contexts, such as the analysis of SARS-CoV-2 Spike protein and its complexes with the Angiotensin Converting Enzyme 2 (ACE2) human receptors. Even if there exist many software tools implementing such methods, there is a growing need for the introduction of tools integrating existing approaches. RESULTS We present PCN-Miner, a software tool implemented in the Python programming language, able to (i) import protein structures from the Protein Data Bank; (ii) generate the corresponding PCN; (iii) model, analyse and visualize PCNs and related protein structures by using a set of known algorithms and metrics. The PCN-Miner can cover a large set of applications: from clustering to embedding and subsequent analysis. AVAILABILITY AND IMPLEMENTATION The PCN-Miner tool is freely available at the following GitHub repository: https://github.com/hguzzi/ProteinContactNetworks. It is also available in the Python Package Index (PyPI) repository.
Collapse
Affiliation(s)
- Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| | - Luisa Di Paola
- Unit of Chemical-Physics Fundamentals in Chemical Engineering, Department of Engineering, Università Campus Bio-Medico di Roma, 00128 Rome, Italy
| | - Alessandro Giuliani
- Environment and Health Department, Istituto Superiore di Sanità, 00161Rome, Italy
| | - Pierangelo Veltri
- Department of Surgical and Medical Sciences, Magna Graecia University of Catanzaro, 88100 Catanzaro, Italy
| |
Collapse
|
5
|
Pseudo-Symmetric Assembly of Protodomains as a Common Denominator in the Evolution of Polytopic Helical Membrane Proteins. J Mol Evol 2020; 88:319-344. [PMID: 32189026 PMCID: PMC7162841 DOI: 10.1007/s00239-020-09934-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 02/16/2020] [Indexed: 11/05/2022]
Abstract
The polytopic helical membrane proteome is dominated by proteins containing seven transmembrane helices (7TMHs). They cannot be grouped under a monolithic fold or superfold. However, a parallel structural analysis of folds around that magic number of seven in distinct protein superfamilies (SWEET, PnuC, TRIC, FocA, Aquaporin, GPCRs) reveals a common homology, not in their structural fold, but in their systematic pseudo-symmetric construction during their evolution. Our analysis leads to guiding principles of intragenic duplication and pseudo-symmetric assembly of ancestral transmembrane helical protodomains, consisting of 3 (or 4) helices. A parallel deconstruction and reconstruction of these domains provides a structural and mechanistic framework for their evolutionary paths. It highlights the conformational plasticity inherent to fold formation itself, the role of structural as well as functional constraints in shaping that fold, and the usefulness of protodomains as a tool to probe convergent vs divergent evolution. In the case of FocA vs. Aquaporin, this protodomain analysis sheds new light on their potential divergent evolution at the protodomain level followed by duplication and parallel evolution of the two folds. GPCR domains, whose function does not seem to require symmetry, nevertheless exhibit structural pseudo-symmetry. Their construction follows the same protodomain assembly as any other pseudo-symmetric protein suggesting their potential evolutionary origins. Interestingly, all the 6/7/8TMH pseudo-symmetric folds in this study also assemble as oligomeric forms in the membrane, emphasizing the role of symmetry in evolution, revealing self-assembly and co-evolution not only at the protodomain level but also at the domain level.
Collapse
|
6
|
Structural modules of the stress-induced protein HflX: an outlook on its evolution and biological role. Curr Genet 2018; 65:363-370. [PMID: 30448945 DOI: 10.1007/s00294-018-0905-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 11/05/2018] [Accepted: 11/13/2018] [Indexed: 12/23/2022]
Abstract
Multifunctional proteins often show modular structures. A functional domain and the structural modules within the domain show evolutionary conservation of their spatial arrangement since that gives the protein its functionality. However, the question remains as to how members of different domains of life (Archaea, Bacteria, Eukarya), polish and perfect these modules within conserved multidomain proteins, to tailor functional proteins according to their specific requirements. In the quest for plausible answers to this question, we studied the bacterial protein HflX. HflX is a universally conserved member of the Obg-GTPase superfamily but its functional role in Archaea and Eukarya is barely known. It is a multidomain protein and possesses, in addition to its conserved GTPase domain, an ATP-binding N-terminal domain. It is involved in heat stress response in Escherichia coli and our laboratory recently identified an ATP-dependent RNA helicase activity of E. coli HflX, which is likely instrumental in rescuing ribosomes during heat stress. Because perception and response to stress is expected to be different in different life forms, the question is whether this activity is preserved in higher organisms or not. Thus, we explored the evolution pattern of different structural modules of HflX, with particular emphasis on the ATP-binding domain, to understand plausible biological role of HflX in other forms of life. Our analyses indicate that, while the evolutionary pattern of the GTPase domain follows a conserved phylogeny, conservation of the ATP-binding domain shows a complicated pattern. The limited analysis described here hints towards possible evolutionary adaptations and modifications of the domain, something which needs to be investigated in more depth in homologs from other life forms. Deciphering how nature 'tweaks' such modules, both structurally and functionally, may help in understanding the evolution of such proteins, and, on a large-scale, of stress-related proteins in general as well.
Collapse
|
7
|
Khan T, Panday SK, Ghosh I. ProLego: tool for extracting and visualizing topological modules in protein structures. BMC Bioinformatics 2018; 19:167. [PMID: 29728050 PMCID: PMC5935970 DOI: 10.1186/s12859-018-2171-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 04/30/2018] [Indexed: 11/10/2022] Open
Abstract
Background In protein design, correct use of topology is among the initial and most critical feature. Meticulous selection of backbone topology aids in drastically reducing the structure search space. With ProLego, we present a server application to explore the component aspect of protein structures and provide an intuitive and efficient way to scan the protein topology space. Result We have implemented in-house developed “topological representation” in an automated-pipeline to extract protein topology from given protein structure. Using the topology string, ProLego, compares topology against a non-redundant extensive topology database (ProLegoDB) as well as extracts constituent topological modules. The platform offers interactive topology visualization graphs. Conclusion ProLego, provides an alternative but comprehensive way to scan and visualize protein topology along with an extensive database of protein topology. ProLego can be found at http://www.proteinlego.com Electronic supplementary material The online version of this article (10.1186/s12859-018-2171-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Taushif Khan
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110067, India.
| | - Shailesh Kumar Panday
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110067, India
| | - Indira Ghosh
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110067, India
| |
Collapse
|