1
|
Yu CC, Raj N, Chu JW. Statistical Learning of Protein Elastic Network from Positional Covariance Matrix. Comput Struct Biotechnol J 2023; 21:2524-2535. [PMID: 37095762 PMCID: PMC10121796 DOI: 10.1016/j.csbj.2023.03.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/20/2023] [Accepted: 03/20/2023] [Indexed: 03/30/2023] Open
Abstract
Positional fluctuation and covariance during protein dynamics are key observables for understanding the molecular origin of biological functions. A frequently employed potential energy function for describing protein structural variation at the coarse-gained level is elastic network model (ENM). A long-standing issue in biomolecular simulation is thus the parametrization of ENM spring constants from the components of positional covariance matrix (PCM). Based on sensitivity analysis of PCM, the direct-coupling statistics of each spring, which is a specific combination of position fluctuation and covariance, is found to exhibit prominent signal of parameter dependence. This finding provides the basis for devising the objective function and the scheme of running through the effective one-dimensional optimization of every spring by self-consistent iteration. Formal derivation of the positional covariance statistical learning (PCSL) method also motivates the necessary data regularization for stable calculations. Robust convergence of PCSL is achieved in taking an all-atom molecular dynamics trajectory or an ensemble of homologous structures as input data. The PCSL framework can also be generalized with mixed objective functions to capture specific property such as the residue flexibility profile. Such physical chemistry-based statistical learning thus provides a useful platform for integrating the mechanical information encoded in various experimental or computational data.
Collapse
|
2
|
Wordom update 2: A user-friendly program for the analysis of molecular structures and conformational ensembles. Comput Struct Biotechnol J 2023; 21:1390-1402. [PMID: 36817953 PMCID: PMC9929209 DOI: 10.1016/j.csbj.2023.01.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/19/2023] [Accepted: 01/20/2023] [Indexed: 01/29/2023] Open
Abstract
We present the second update of Wordom, a user-friendly and efficient program for manipulation and analysis of conformational ensembles from molecular simulations. The actual update expands some of the existing modules and adds 21 new modules to the update 1 published in 2011. The new adds can be divided into three sets that: 1) analyze atomic fluctuations and structural communication; 2) explore ion-channel conformational dynamics and ionic translocation; and 3) compute geometrical indices of structural deformation. Set 1 serves to compute correlations of motions, find geometrically stable domains, identify a dynamically invariant core, find changes in domain-domain separation and mutual orientation, perform wavelet analysis of large-scale simulations, process the output of principal component analysis of atomic fluctuations, perform functional mode analysis, infer regions of mechanical rigidity, analyze overall fluctuations, and perform the perturbation response scanning. Set 2 includes modules specific for ion channels, which serve to monitor the pore radius as well as water or ion fluxes, and measure functional collective motions like receptor twisting or tilting angles. Finally, set 3 includes tools to monitor structural deformations by computing angles, perimeter, area, volume, β-sheet curvature, radial distribution function, and center of mass. The ring perception module is also included, helpful to monitor supramolecular self-assemblies. This update places Wordom among the most suitable, complete, user-friendly, and efficient software for the analysis of biomolecular simulations. The source code of Wordom and the relative documentation are available under the GNU general public license at http://wordom.sf.net.
Collapse
|
3
|
Felline A, Gentile S, Fanelli F. psnGPCRdb: The Structure-network Database of G Protein Coupled Receptors. J Mol Biol 2023:167950. [PMID: 36646374 DOI: 10.1016/j.jmb.2023.167950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/02/2023] [Accepted: 01/03/2023] [Indexed: 01/15/2023]
Abstract
G protein coupled receptors (GPCRs) are critical eukaryotic signal transduction gatekeepers and represent the largest protein superfamily in the human proteome, with more than 800 members. They share seven transmembrane helices organized in an up-down bundle architecture. GPCR-mediated signaling pathways have been linked to numerous human diseases, and GPCRs are the targets of approximately 35% of all drugs currently on the market. Structure network analysis, a graph theory-based approach, represents a cutting-edge tool to deeply understand GPCR function, which strongly relies on communication between the extracellular and intracellular poles of their structure. psnGPCRdb stores the structure networks (i.e., linked nodes, hubs, communities and communication pathways) computed on all updated GPCR structures in the Protein Data Bank, in their isolated states or in complex with extracellular and/or intracellular molecules. The structure communication signatures of a sub-family or family of GPCRs as well as of their small-molecule activators or inhibitors are stored as consensus networks. The database stores also all meaningful structure network-based comparisons (i.e., difference networks) of functionally different states (i.e., inactive or active) of a given receptor sub-type, or of consensus networks representative of a receptor sub-type, type, sub-family or family. Single or consensus GPCR networks hold also information on amino acid conservation. The database allows to graphically analyze 3D structure networks together with interactive data-tables. Ligand-centric networks can be analyzed as well. psnGPCRdb is unique and represents a powerful resource to unravel GPCR function with important implications in cell signaling and drug design. psnGPCRdb is freely available at: http://webpsn.hpc.unimo.it/psngpcr.php.
Collapse
Affiliation(s)
- Angelo Felline
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campy 103, 41125 Modena, Italy
| | - Sara Gentile
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campy 103, 41125 Modena, Italy
| | - Francesca Fanelli
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campy 103, 41125 Modena, Italy; Center for Neuroscience and Neurotechnology, University of Modena and Reggio Emilia, via Campi 287, 41125 Modena, Italy.
| |
Collapse
|
4
|
Felline A, Raimondi F, Gentile S, Fanelli F. Structural communication between the GTPase Sec4p and its activator Sec2p: Determinants of GEF activity and early deformations to nucleotide release. Comput Struct Biotechnol J 2022; 20:5162-5180. [PMID: 36187918 PMCID: PMC9508438 DOI: 10.1016/j.csbj.2022.09.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 09/08/2022] [Accepted: 09/08/2022] [Indexed: 11/29/2022] Open
Abstract
Ras GTPases are molecular switches that cycle between OFF and ON states depending on the bound nucleotide (i.e. GDP-bound and GTP-bound, respectively). The Rab GTPase, Sec4p, plays regulatory roles in multiple steps of intracellular vesicle trafficking. Nucleotide release is catalyzed by the Guanine Nucleotide Exchange Factor (GEF) Sec2p. Here, the integration of structural information with molecular dynamics (MD) simulations addressed a number of questions concerning the intrinsic and stimulated dynamics of Sec2p and Sec4p as well as the chain of structural deformations leading to GEF-assisted activation of the Rab GTPase. Sec2p holds an intrinsic ability to adopt the conformation found in the crystallographic complexes with Sec4p, thus suggesting that the latter selects and shifts the conformational equilibrium towards a pre-existing bound-like conformation of Sec2p. The anchoring of Sec4p to a suitable conformation of Sec2p favors the Sec2p-assisted pulling on itself of the α1/switch 1 (SWI) loop and of SWI, which loose any contact with GDP. Those deformations of Sec4p would occur earlier. Formation of the final Sec2p-Sec4p hydrophobic interface, accomplishes later. Disruption of the nucleotide cage would cause firstly loss of interactions with the guanine ring and secondly loss of interactions with the phosphates. The ease in sampling the energy landscape and adopting a bound-like conformation likely favors the catalyzing ability of GEFs for Ras GTPases.
Collapse
|
5
|
Felline A, Seeber M, Fanelli F. PSNtools for standalone and web-based structure network analyses of conformational ensembles. Comput Struct Biotechnol J 2022; 20:640-649. [PMID: 35140884 PMCID: PMC8801349 DOI: 10.1016/j.csbj.2021.12.044] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2021] [Revised: 12/22/2021] [Accepted: 12/30/2021] [Indexed: 12/03/2022] Open
Abstract
Structure graphs, in which interacting amino acids/nucleotides correspond to linked nodes, represent cutting-edge tools to investigate macromolecular function. The graph-based approach defined as Protein Structure Network (PSN) was initially implemented in the Wordom software and subsequently in the webPSN server. PSNs are computed either on a molecular dynamics (MD) trajectory (PSN-MD) or on a single structure. In the latter case, information on atomic fluctuations is inferred from the Elastic Network Model-Normal Mode Analysis (ENM-NMA) (PSN-ENM). While Wordom performs both PSN-ENM and PSN-MD analyses but without output post-processing, the webPSN server performs only single-structure PSN-EMN but assisting the user in input setup and output analysis. Here we release for the first time the standalone software PSNtools, which allows calculation and post-processing of PSN analyses carried out either on single structures or on conformational ensembles. Relevant unique and novel features of PSNtools are either comparisons of two networks or computations of consensus networks on sets of homologous/analogous macromolecular structures or conformational ensembles. Network comparisons and consensus serve to infer differences in functionally different states of the same system or network-based signatures in groups of bio-macromolecules sharing either the same functionality or the same fold. In addition to the new software, here we release also an updated version of the webPSN server, which allows performing an interactive graphical analysis of PSN-MD, following the upload of the PSNtools output. PSNtools, the auxiliary binary version of Wordom software, and the WebPSN server are freely available at http://webpsn.hpc.unimo.it/wpsn3.php.
Collapse
|
6
|
Felline A, Seeber M, Fanelli F. webPSN v2.0: a webserver to infer fingerprints of structural communication in biomacromolecules. Nucleic Acids Res 2020; 48:W94-W103. [PMID: 32427333 PMCID: PMC7319592 DOI: 10.1093/nar/gkaa397] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 04/21/2020] [Accepted: 05/17/2020] [Indexed: 12/25/2022] Open
Abstract
A mixed Protein Structure Network (PSN) and Elastic Network Model-Normal Mode Analysis (ENM-NMA)-based strategy (i.e. PSN-ENM) was developed to investigate structural communication in bio-macromolecules. Protein Structure Graphs (PSGs) are computed on a single structure, whereas information on system dynamics is supplied by ENM-NMA. The approach was implemented in a webserver (webPSN), which was significantly updated herein. The webserver now handles both proteins and nucleic acids and relies on an internal upgradable database of network parameters for ions and small molecules in all PDB structures. Apart from the radical restyle of the server and some changes in the calculation setup, other major novelties concern the possibility to: a) compute the differences in nodes, links, and communication pathways between two structures (i.e. network difference) and b) infer links, hubs, communities, and metapaths from consensus networks computed on a number of structures. These new features are useful to identify commonalties and differences between two different functional states of the same system or structural-communication signatures in homologous or analogous systems. The output analysis relies on 3D-representations, interactive tables and graphs, also available for download. Speed and accuracy make this server suitable to comparatively investigate structural communication in large sets of bio-macromolecular systems. URL: http://webpsn.hpc.unimore.it.
Collapse
Affiliation(s)
- Angelo Felline
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Michele Seeber
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Francesca Fanelli
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy.,Center for Neuroscience and Neurotechnology, University of Modena and Reggio Emilia, Modena 41125, Italy
| |
Collapse
|
7
|
Bellucci L, Felline A, Fanelli F. Dynamics and structural communication in the ternary complex of fully phosphorylated V2 vasopressin receptor, vasopressin, and β-arrestin 1. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2020; 1862:183355. [PMID: 32413442 DOI: 10.1016/j.bbamem.2020.183355] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 05/08/2020] [Accepted: 05/10/2020] [Indexed: 12/12/2022]
Abstract
G protein-coupled receptors (GPCRs) are critically regulated by arrestins, which not only desensitize G-protein signaling but also initiate a G protein-independent wave of signaling. The information from structure determination was herein exploited to build a structural model of the ternary complex, comprising fully phosphorylated V2 vasopressin receptor (V2R), the agonist arginine vasopressin (AVP), and β-arrestin 1 (β-arr1). Molecular simulations served to explore dynamics and structural communication in the ternary complex. Flexibility and mechanical profiles reflect fold of V2R and β-arr1. Highly conserved amino acids tend to behave as hubs in the structure network and contribute the most to the mechanical rigidity of V2R seven-helix bundle and of β-arr1. Two structurally and dynamically distinct receptor-arrestin interfaces assist the twist of the N- and C-terminal domains (ND and CD, respectively) of β-arr1 with respect to each other, which is linked to arrestin activation. While motion of the ND is essentially assisted by the fully phosphorylated C-tail of V2R (V2RCt), that of CD is assisted by the second and third intracellular loops and the cytosolic extensions of helices 5 and 6. In the presence of the receptor, the β-arr1 inter-domain twist angle correlates with the modes describing the essential subspace of the ternary complex. β-arr1 motions are also influenced by the anchoring to the membrane of the C-edge-loops in the β-arr1-CD. Overall fluctuations reveal a coupling between motions of the agonist binding site and of β-arr1-ND, which are in allosteric communication between each other. Mechanical rigidity points, often acting as hubs in the structure network and distributed along the main axis of the receptor helix bundle, contribute to establish a preferential communication pathway between agonist ligand and the ND of arrestin. Such communication, mediated by highly conserved amino acids, involves also the first amino acid in the arrestin C-tail, which is highly dynamic and is involved in clathrin-mediated GPCR internalization.
Collapse
Affiliation(s)
- Luca Bellucci
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campi 103, 41125 Modena, Italy; NEST, Istituto Nanoscienze-CNR, Piazza San Silvestro 12, 56127 Pisa, Italy
| | - Angelo Felline
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campi 103, 41125 Modena, Italy
| | - Francesca Fanelli
- Department of Life Sciences, University of Modena and Reggio Emilia, via Campi 103, 41125 Modena, Italy; Center for Neuroscience and Neurotechnology, University of Modena and Reggio Emilia, via Campi 287, 41125 Modena, Italy.
| |
Collapse
|
8
|
Felline A, Belmonte L, Raimondi F, Bellucci L, Fanelli F. Interconnecting Flexibility, Structural Communication, and Function in RhoGEF Oncoproteins. J Chem Inf Model 2019; 59:4300-4313. [PMID: 31490066 DOI: 10.1021/acs.jcim.9b00271] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Dbl family Rho guanine nucleotide exchange factors (RhoGEFs) play a central role in cell biology by catalyzing the exchange of guanosine 5'-triphosphate for guanosine 5'-diphosphate (GDP) on RhoA. Insights into the oncogenic constitutive activity of the Lbc RhoGEF were gained by analyzing the structure and dynamics of the protein in different functional states and in comparison with a close homologue, leukemia-associated RhoGEF. Higher intrinsic flexibility, less dense and extended structure network, and less stable allosteric communication pathways in Lbc, compared to the nonconstitutively active homologue, emerged as major determinants of the constitutive activity. Independent of the state, the essential dynamics of the two RhoGEFs is contributed by the last 10 amino acids of Dbl homology (DH) and the whole pleckstrin homology (PH) domains and tends to be equalized by the presence of RhoA. The catalytic activity of the RhoGEF relies on the scaffolding action of the DH domain that primarily turns the switch I (SWI) of RhoA on itself through highly conserved amino acids participating in the stability core and essential for function. Changes in the conformation of SWI and disorganization of the RhoA regions deputed to nucleotide binding are among the major RhoGEF effects leading to GDP release. Binding of RhoA reorganizes the allosteric communication on RhoGEF, strengthening the communication among the canonical RhoA binding site on DH, a secondary RhoA binding site on PH, and the binding site for heterotrimeric G proteins, suggesting dual roles for RhoA as a catalysis substrate and as a regulatory protein. The structure network-based analysis tool employed in this study proved to be useful for predicting potentially druggable regulatory sites in protein structures.
Collapse
Affiliation(s)
- Angelo Felline
- Department of Life Sciences , University of Modena and Reggio Emilia , via Campi 103 , 41125 Modena , Italy
| | - Luca Belmonte
- Department of Life Sciences , University of Modena and Reggio Emilia , via Campi 103 , 41125 Modena , Italy
| | - Francesco Raimondi
- Department of Life Sciences , University of Modena and Reggio Emilia , via Campi 103 , 41125 Modena , Italy
| | - Luca Bellucci
- Department of Life Sciences , University of Modena and Reggio Emilia , via Campi 103 , 41125 Modena , Italy
| | - Francesca Fanelli
- Department of Life Sciences , University of Modena and Reggio Emilia , via Campi 103 , 41125 Modena , Italy.,Center for Neuroscience and Neurotechnology , University of Modena and Reggio Emilia , via Campi 287 , 41125 Modena , Italy
| |
Collapse
|
9
|
Fanelli F, Felline A. Uncovering GPCR and G Protein Function by Protein Structure Network Analysis. COMPUTATIONAL TOOLS FOR CHEMICAL BIOLOGY 2017. [DOI: 10.1039/9781788010139-00198] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Protein structure network (PSN) analysis is one of the graph theory-based approaches currently used for investigating structural communication in biomolecular systems. Information on the system's dynamics can be provided by atomistic molecular dynamics (MD) simulations or coarse grained elastic network models paired with normal mode analysis (ENM-NMA). This chapter reports on selected applications of PSN analysis to uncover the structural communication in G protein coupled receptors (GPCRs) and G proteins. Strategies to highlight changes in structural communication caused by mutations, ligand and protein binding are described. Conserved amino acids, sites of misfolding mutations, or ligands acting as functional switches tend to behave as hubs in the native structure networks. Densely linked regions in the protein structure graphs could be identified as playing central roles in protein stability and function. Changes in the communication pathway fingerprints depending on the bound ligand or following amino acid mutation could be highlighted as well. A bridge between misfolding and misrouting could be established in rhodopsin mutants linked to inherited blindness. The analysis of native network perturbations by misfolding mutations served to infer key structural elements of protein responsiveness to small chaperones with implications for drug discovery.
Collapse
Affiliation(s)
- Francesca Fanelli
- Department of Life Sciences University of Modena and Reggio Emilia Italy
- Center for Neuroscience and Neurotechnology University of Modena and Reggio Emilia Italy
| | - Angelo Felline
- Department of Life Sciences University of Modena and Reggio Emilia Italy
| |
Collapse
|
10
|
Dissecting intrinsic and ligand-induced structural communication in the β3 headpiece of integrins. Biochim Biophys Acta Gen Subj 2017; 1861:2367-2381. [DOI: 10.1016/j.bbagen.2017.05.018] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Revised: 05/20/2017] [Accepted: 05/22/2017] [Indexed: 12/15/2022]
|