1
|
Giulini M, Fiorentini R, Tubiana L, Potestio R, Menichetti R. EXCOGITO, an Extensible Coarse-Graining Toolbox for the Investigation of Biomolecules by Means of Low-Resolution Representations. J Chem Inf Model 2024; 64:4912-4927. [PMID: 38860513 DOI: 10.1021/acs.jcim.4c00490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]
Abstract
Bottom-up coarse-grained (CG) models proved to be essential to complement and sometimes even replace all-atom representations of soft matter systems and biological macromolecules. The development of low-resolution models takes the moves from the reduction of the degrees of freedom employed, that is, the definition of a mapping between a system's high-resolution description and its simplified counterpart. Even in the absence of an explicit parametrization and simulation of a CG model, the observation of the atomistic system in simpler terms can be informative: this idea is leveraged by the mapping entropy, a measure of the information loss inherent to the process of coarsening. Mapping entropy lies at the heart of the extensible coarse-graining toolbox, EXCOGITO, developed to perform a number of operations and analyses on molecular systems pivoting around the properties of mappings. EXCOGITO can process an all-atom trajectory to compute the mapping entropy, identify the mapping that minimizes it, and establish quantitative relations between a low-resolution representation and the geometrical, structural, and energetic features of the system. Here, the software, which is available free of charge under an open-source license, is presented and showcased to introduce potential users to its capabilities and usage.
Collapse
Affiliation(s)
- Marco Giulini
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Raffaele Fiorentini
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Luca Tubiana
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Roberto Menichetti
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| |
Collapse
|
2
|
Morand J, Yip S, Velegrakis Y, Lattanzi G, Potestio R, Tubiana L. Quality assessment and community detection methods for anonymized mobility data in the Italian Covid context. Sci Rep 2024; 14:4636. [PMID: 38409411 PMCID: PMC10897296 DOI: 10.1038/s41598-024-54878-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Accepted: 02/17/2024] [Indexed: 02/28/2024] Open
Abstract
We discuss how to assess the reliability of partial, anonymized mobility data and compare two different methods to identify spatial communities based on movements: Greedy Modularity Clustering (GMC) and the novel Critical Variable Selection (CVS). These capture different aspects of mobility: direct population fluxes (GMC) and the probability for individuals to move between two nodes (CVS). As a test case, we consider movements of Italians before and during the SARS-Cov2 pandemic, using Facebook users' data and publicly available information from the Italian National Institute of Statistics (Istat) to construct daily mobility networks at the interprovincial level. Using the Perron-Frobenius (PF) theorem, we show how the mean stochastic network has a stationary population density state comparable with data from Istat, and how this ceases to be the case if even a moderate amount of pruning is applied to the network. We then identify the first two national lockdowns through temporal clustering of the mobility networks, define two representative graphs for the lockdown and non-lockdown conditions and perform optimal spatial community identification on both graphs using the GMC and CVS approaches. Despite the fundamental differences in the methods, the variation of information (VI) between them assesses that they return similar partitions of the Italian provincial networks in both situations. The information provided can be used to inform policy, for example, to define an optimal scale for lockdown measures. Our approach is general and can be applied to other countries or geographical scales.
Collapse
Affiliation(s)
- Jules Morand
- University of Trento, via Sommarive 14, 38123, Trento, Italy.
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123, Trento, Italy.
| | - Shoichi Yip
- University of Trento, via Sommarive 14, 38123, Trento, Italy
| | - Yannis Velegrakis
- University of Trento, via Sommarive 14, 38123, Trento, Italy
- Utrecht University, Princetonplein 5, 3584 CC, Utrecht, The Netherlands
| | - Gianluca Lattanzi
- University of Trento, via Sommarive 14, 38123, Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123, Trento, Italy
| | - Raffaello Potestio
- University of Trento, via Sommarive 14, 38123, Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123, Trento, Italy
| | - Luca Tubiana
- University of Trento, via Sommarive 14, 38123, Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123, Trento, Italy
| |
Collapse
|
3
|
Kidder KM, Shell MS, Noid WG. Surveying the energy landscape of coarse-grained mappings. J Chem Phys 2024; 160:054105. [PMID: 38310476 DOI: 10.1063/5.0182524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/28/2023] [Indexed: 02/05/2024] Open
Abstract
Simulations of soft materials often adopt low-resolution coarse-grained (CG) models. However, the CG representation is not unique and its impact upon simulated properties is poorly understood. In this work, we investigate the space of CG representations for ubiquitin, which is a typical globular protein with 72 amino acids. We employ Monte Carlo methods to ergodically sample this space and to characterize its landscape. By adopting the Gaussian network model as an analytically tractable atomistic model for equilibrium fluctuations, we exactly assess the intrinsic quality of each CG representation without introducing any approximations in sampling configurations or in modeling interactions. We focus on two metrics, the spectral quality and the information content, that quantify the extent to which the CG representation preserves low-frequency, large-amplitude motions and configurational information, respectively. The spectral quality and information content are weakly correlated among high-resolution representations but become strongly anticorrelated among low-resolution representations. Representations with maximal spectral quality appear consistent with physical intuition, while low-resolution representations with maximal information content do not. Interestingly, quenching studies indicate that the energy landscape of mapping space is very smooth and highly connected. Moreover, our study suggests a critical resolution below which a "phase transition" qualitatively distinguishes good and bad representations.
Collapse
Affiliation(s)
- Katherine M Kidder
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - M Scott Shell
- Department of Chemical Engineering, University of California, Santa Barbara, California 93106, USA
| | - W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|