1
|
Herrera Segura C, Montoya E, Tapias D. Subaging in underparametrized Deep Neural Networks. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac8f1b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
In this work, we consider a simple classification problem to show that the dynamics of finite--width Deep Neural Networks in the underparametrized regime gives rise to effects similar to those associated with glassy systems, namely a slow evolution of the loss function and aging. Remarkably, the aging is sublinear in the waiting time (subaging) and the power--law exponent characterizing it is robust to different architectures under the constraint of a constant total number of parameters. Our results are maintained in the more complex scenario of the MNIST database. We find that for this database there is a unique exponent ruling the subaging behavior in the whole phase.
Collapse
|
2
|
Dicks L, Wales DJ. Elucidating the solution structure of the K-means cost function using energy landscape theory. J Chem Phys 2022; 156:054109. [DOI: 10.1063/5.0078793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- L. Dicks
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - D. J. Wales
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
3
|
Jesus WS, Prudente FV, Marques JMC, Pereira FB. Modeling microsolvation clusters with electronic-structure calculations guided by analytical potentials and predictive machine learning techniques. Phys Chem Chem Phys 2021; 23:1738-1749. [PMID: 33427847 DOI: 10.1039/d0cp05200k] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
We propose a new methodology to study, at the density functional theory (DFT) level, the clusters resulting from the microsolvation of alkali-metal ions with rare-gas atoms. The workflow begins with a global optimization search to generate a pool of low-energy minimum structures for different cluster sizes. This is achieved by employing an analytical potential energy surface (PES) and an evolutionary algorithm (EA). The next main stage of the methodology is devoted to establish an adequate DFT approach to treat the microsolvation system, through a systematic benchmark study involving several combinations of functionals and basis sets, in order to characterize the global minimum structures of the smaller clusters. In the next stage, we apply machine learning (ML) classification algorithms to predict how the low-energy minima of the analytical PES map to the DFT ones. An early and accurate detection of likely DFT local minima is extremely important to guide the choice of the most promising low-energy minima of large clusters to be re-optimized at the DFT level of theory. In this work, the methodology was applied to the Li+Krn (n = 2-14 and 16) microsolvation clusters for which the most competitive DFT approach was found to be the B3LYP-D3/aug-pcseg-1. Additionally, the ML classifier was able to accurately predict most of the solutions to be re-optimized at the DFT level of theory, thereby greatly enhancing the efficiency of the process and allowing its applicability to larger clusters.
Collapse
Affiliation(s)
- W S Jesus
- Instituto de Física, Universidade Federal da Bahia, 40170-115 Salvador, BA, Brazil.
| | - F V Prudente
- Instituto de Física, Universidade Federal da Bahia, 40170-115 Salvador, BA, Brazil.
| | - J M C Marques
- CQC, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal.
| | - F B Pereira
- Coimbra Polytechnic - ISEC, Coimbra, Portugal and Centro de Informática e Sistemas da Universidade de Coimbra (CISUC), Coimbra, Portugal.
| |
Collapse
|
4
|
Verpoort PC, Lee AA, Wales DJ. Archetypal landscapes for deep neural networks. Proc Natl Acad Sci U S A 2020; 117:21857-21864. [PMID: 32843349 PMCID: PMC7486703 DOI: 10.1073/pnas.1919995117] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The predictive capabilities of deep neural networks (DNNs) continue to evolve to increasingly impressive levels. However, it is still unclear how training procedures for DNNs succeed in finding parameters that produce good results for such high-dimensional and nonconvex loss functions. In particular, we wish to understand why simple optimization schemes, such as stochastic gradient descent, do not end up trapped in local minima with high loss values that would not yield useful predictions. We explain the optimizability of DNNs by characterizing the local minima and transition states of the loss-function landscape (LFL) along with their connectivity. We show that the LFL of a DNN in the shallow network or data-abundant limit is funneled, and thus easy to optimize. Crucially, in the opposite low-data/deep limit, although the number of minima increases, the landscape is characterized by many minima with similar loss values separated by low barriers. This organization is different from the hierarchical landscapes of structural glass formers and explains why minimization procedures commonly employed by the machine-learning community can navigate the LFL successfully and reach low-lying solutions.
Collapse
Affiliation(s)
- Philipp C Verpoort
- Department of Physics, University of Cambridge, Cambridge CB3 0HE, United Kingdom;
| | - Alpha A Lee
- Department of Physics, University of Cambridge, Cambridge CB3 0HE, United Kingdom
| | - David J Wales
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
5
|
Chitturi SR, Verpoort PC, Lee AA, Wales DJ. Perspective: new insights from loss function landscapes of neural networks. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/ab7aef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
We investigate the structure of the loss function landscape for neural networks subject to dataset mislabelling, increased training set diversity, and reduced node connectivity, using various techniques developed for energy landscape exploration. The benchmarking models are classification problems for atomic geometry optimisation and hand-written digit prediction. We consider the effect of varying the size of the atomic configuration space used to generate initial geometries and find that the number of stationary points increases rapidly with the size of the training configuration space. We introduce a measure of node locality to limit network connectivity and perturb permutational weight symmetry, and examine how this parameter affects the resulting landscapes. We find that highly-reduced systems have low capacity and exhibit landscapes with very few minima. On the other hand, small amounts of reduced connectivity can enhance network expressibility and can yield more complex landscapes. Investigating the effect of deliberate classification errors in the training data, we find that the variance in testing AUC, computed over a sample of minima, grows significantly with the training error, providing new insight into the role of the variance-bias trade-off when training under noise. Finally, we illustrate how the number of local minima for networks with two and three hidden layers, but a comparable number of variable edge weights, increases significantly with the number of layers, and as the number of training data decreases. This work helps shed further light on neural network loss landscapes and provides guidance for future work on neural network training and optimisation.
Collapse
|
6
|
Becker S, Zhang Y, Lee AA. Geometry of Energy Landscapes and the Optimizability of Deep Neural Networks. PHYSICAL REVIEW LETTERS 2020; 124:108301. [PMID: 32216422 DOI: 10.1103/physrevlett.124.108301] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 11/13/2019] [Accepted: 02/06/2020] [Indexed: 06/10/2023]
Abstract
Deep neural networks are workhorse models in machine learning with multiple layers of nonlinear functions composed in series. Their loss function is highly nonconvex, yet empirically even gradient descent minimization is sufficient to arrive at accurate and predictive models. It is hitherto unknown why deep neural networks are easily optimizable. We analyze the energy landscape of a spin glass model of deep neural networks using random matrix theory and algebraic geometry. We analytically show that the multilayered structure holds the key to optimizability: Fixing the number of parameters and increasing network depth, the number of stationary points in the loss function decreases, minima become more clustered in parameter space, and the trade-off between the depth and width of minima becomes less severe. Our analytical results are numerically verified through comparison with neural networks trained on a set of classical benchmark datasets. Our model uncovers generic design principles of machine learning models.
Collapse
Affiliation(s)
- Simon Becker
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
| | - Yao Zhang
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
- Cavendish Laboratory, University of Cambridge, Cambridge CB3 0HE, United Kingdom
| | - Alpha A Lee
- Cavendish Laboratory, University of Cambridge, Cambridge CB3 0HE, United Kingdom
| |
Collapse
|
7
|
Hernández-Rojas J, Calvo F. The Structure of Adamantane Clusters: Atomistic vs. Coarse-Grained Predictions From Global Optimization. Front Chem 2019; 7:573. [PMID: 31475136 PMCID: PMC6707085 DOI: 10.3389/fchem.2019.00573] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 07/29/2019] [Indexed: 11/24/2022] Open
Abstract
Candidate structures for the global minima of adamantane clusters, (C10H16)N, are presented. Based on a rigid model for individual molecules with atom-atom pairwise interactions that include Lennard-Jones and Coulomb contributions, low-energy structures were obtained up to N = 42 using the basin-hopping method. The results indicate that adamantane clusters initially grow accordingly with an icosahedral packing scheme, followed above N = 14 by a structural transition toward face-centered cubic structures. The special stabilities obtained at N = 13, 19, and 38 are consistent with these two structural families, and agree with recent mass spectrometry measurements on cationic adamantane clusters. Coarse-graining the intermolecular potential by averaging over all possible orientations only partially confirm the all-atom results, the magic numbers at 13 and 38 being preserved. However, the details near the structural transition are not captured well, because despite their high symmetry the adamantane molecules are still rather anisotropic.
Collapse
Affiliation(s)
- Javier Hernández-Rojas
- Departamento de Física e IUdEA, Universidad de La Laguna, San Cristóbal de La Laguna, Spain
| | | |
Collapse
|
8
|
Abstract
Recent advances in the potential energy landscapes approach are highlighted, including both theoretical and computational contributions. Treating the high dimensionality of molecular and condensed matter systems of contemporary interest is important for understanding how emergent properties are encoded in the landscape and for calculating these properties while faithfully representing barriers between different morphologies. The pathways characterized in full dimensionality, which are used to construct kinetic transition networks, may prove useful in guiding such calculations. The energy landscape perspective has also produced new procedures for structure prediction and analysis of thermodynamic properties. Basin-hopping global optimization, with alternative acceptance criteria and generalizations to multiple metric spaces, has been used to treat systems ranging from biomolecules to nanoalloy clusters and condensed matter. This review also illustrates how all this methodology, developed in the context of chemical physics, can be transferred to landscapes defined by cost functions associated with machine learning.
Collapse
Affiliation(s)
- David J Wales
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom;
| |
Collapse
|
9
|
Zhang Y, Saxe AM, Advani MS, Lee AA. Energy–entropy competition and the effectiveness of stochastic gradient descent in machine learning. Mol Phys 2018. [DOI: 10.1080/00268976.2018.1483535] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Yao Zhang
- Cavendish Laboratory, University of Cambridge, Cambridge, UK
| | - Andrew M. Saxe
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Madhu S. Advani
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Alpha A. Lee
- Cavendish Laboratory, University of Cambridge, Cambridge, UK
| |
Collapse
|
10
|
Mehta D, Zhao X, Bernal EA, Wales DJ. Loss surface of XOR artificial neural networks. Phys Rev E 2018; 97:052307. [PMID: 29906831 DOI: 10.1103/physreve.97.052307] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Indexed: 11/07/2022]
Abstract
Training an artificial neural network involves an optimization process over the landscape defined by the cost (loss) as a function of the network parameters. We explore these landscapes using optimization tools developed for potential energy landscapes in molecular science. The number of local minima and transition states (saddle points of index one), as well as the ratio of transition states to minima, grow rapidly with the number of nodes in the network. There is also a strong dependence on the regularization parameter, with the landscape becoming more convex (fewer minima) as the regularization term increases. We demonstrate that in our formulation, stationary points for networks with N_{h} hidden nodes, including the minimal network required to fit the XOR data, are also stationary points for networks with N_{h}+1 hidden nodes when all the weights involving the additional node are zero. Hence, smaller networks trained on XOR data are embedded in the landscapes of larger networks. Our results clarify certain aspects of the classification and sensitivity (to perturbations in the input data) of minima and saddle points for this system, and may provide insight into dropout and network compression.
Collapse
Affiliation(s)
- Dhagash Mehta
- Systems Department, United Technologies Research Center, East Hartford, Connecticut 06108, USA
| | - Xiaojun Zhao
- Systems Department, United Technologies Research Center, East Hartford, Connecticut 06108, USA
| | - Edgar A Bernal
- Systems Department, United Technologies Research Center, East Hartford, Connecticut 06108, USA
| | - David J Wales
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK
| |
Collapse
|
11
|
Ballard AJ, Das R, Martiniani S, Mehta D, Sagun L, Stevenson JD, Wales DJ. Energy landscapes for machine learning. Phys Chem Chem Phys 2018; 19:12585-12603. [PMID: 28367548 DOI: 10.1039/c7cp01108c] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Machine learning techniques are being increasingly used as flexible non-linear fitting and prediction tools in the physical sciences. Fitting functions that exhibit multiple solutions as local minima can be analysed in terms of the corresponding machine learning landscape. Methods to explore and visualise molecular potential energy landscapes can be applied to these machine learning landscapes to gain new insight into the solution space involved in training and the nature of the corresponding predictions. In particular, we can define quantities analogous to molecular structure, thermodynamics, and kinetics, and relate these emergent properties to the structure of the underlying landscape. This Perspective aims to describe these analogies with examples from recent applications, and suggest avenues for new interdisciplinary research.
Collapse
Affiliation(s)
- Andrew J Ballard
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK.
| | - Ritankar Das
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK.
| | - Stefano Martiniani
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK.
| | - Dhagash Mehta
- Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, IN, USA
| | - Levent Sagun
- Mathematics Department, Courant Institute, New York University, NY, USA
| | | | - David J Wales
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK.
| |
Collapse
|
12
|
Das R, Wales DJ. Machine learning landscapes and predictions for patient outcomes. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170175. [PMID: 28791144 PMCID: PMC5541539 DOI: 10.1098/rsos.170175] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 06/23/2017] [Indexed: 06/07/2023]
Abstract
The theory and computational tools developed to interpret and explore energy landscapes in molecular science are applied to the landscapes defined by local minima for neural networks. These machine learning landscapes correspond to fits of training data, where the inputs are vital signs and laboratory measurements for a database of patients, and the objective is to predict a clinical outcome. In this contribution, we test the predictions obtained by fitting to single measurements, and then to combinations of between 2 and 10 different patient medical data items. The effect of including measurements over different time intervals from the 48 h period in question is analysed, and the most recent values are found to be the most important. We also compare results obtained for neural networks as a function of the number of hidden nodes, and for different values of a regularization parameter. The predictions are compared with an alternative convex fitting function, and a strong correlation is observed. The dependence of these results on the patients randomly selected for training and testing decreases systematically with the size of the database available. The machine learning landscapes defined by neural network fits in this investigation have single-funnel character, which probably explains why it is relatively straightforward to obtain the global minimum solution, or a fit that behaves similarly to this optimal parameterization.
Collapse
Affiliation(s)
| | - David J. Wales
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, UK
| |
Collapse
|
13
|
Angelié C, Soudan JM. Nanothermodynamics of iron clusters: Small clusters, icosahedral and fcc-cuboctahedral structures. J Chem Phys 2017; 146:174303. [PMID: 28477605 DOI: 10.1063/1.4982252] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The study of the thermodynamics and structures of iron clusters has been carried on, focusing on small clusters and initial icosahedral and fcc-cuboctahedral structures. Two combined tools are used. First, energy intervals are explored by the Monte Carlo algorithm, called σ-mapping, detailed in the work of Soudan et al. [J. Chem. Phys. 135, 144109 (2011), Paper I]. In its flat histogram version, it provides the classical density of states, gp(Ep), in terms of the potential energy of the system. Second, the iron system is described by a potential which is called "corrected EAM" (cEAM), explained in the work of Basire et al. [J. Chem. Phys. 141, 104304 (2014), Paper II]. Small clusters from 3 to 12 atoms in their ground state have been compared first with published Density Functional Theory (DFT) calculations, giving a complete agreement of geometries. The series of 13, 55, 147, and 309 atom icosahedrons is shown to be the most stable form for the cEAM potential. However, the 147 atom cluster has a special behaviour, since decreasing the energy from the liquid zone leads to the irreversible trapping of the cluster in a reproducible amorphous state, 7.38 eV higher in energy than the icosahedron. This behaviour is not observed at the higher size of 309 atoms. The heat capacity of the 55, 147, and 309 atom clusters revealed a pronounced peak in the solid zone, related to a solid-solid transition, prior to the melting peak. The corresponding series of 13, 55, and 147 atom cuboctahedrons has been compared, underscoring the unstability towards the icosahedral structure. This unstability occurs clearly in several steps for the 147 atom cluster, with a sudden transformation at a transition state. This illustrates the concerted icosahedron-cuboctahedron transformation of Buckminster Fuller-Mackay, which is calculated for the cEAM potential. Two other clusters of initial fcc structures with 24 and 38 atoms have been studied, as well as a 302 atom cluster. Each one relaxes towards a more stable structure without regularity. The 38 atom cluster exhibits a nearly glassy relaxation, through a cascade of six metastable states of long life. This behaviour, as that of the 147 atom cluster towards the amorphous state, shows that difficulties to reach ergodicity in the lower half of the solid zone are related to particular features of the potential energy landscape, and not necessarily to a too large size of the system. Comparisons of the cEAM iron system with published results about Lennard-Jones systems and DFT calculations are made. The results of the previous clusters have been combined with that of Paper II to plot the cohesive energy Ec and the melting temperature Tm in terms of the cluster atom number Nat. The Nat-1/3 linear dependence of the melting temperature (Pawlow law) is observed again for Nat > 150. In contrast, for Nat < 150, the curve diverges strongly from the Pawlow law, giving it an overall V-shape, with a linear increase of Tm when Nat goes from 55 to 13 atoms. Surprisingly, the 38 atom cluster is anomalously below the overall curve.
Collapse
Affiliation(s)
- C Angelié
- CEA/DRF/IRAMIS/LIDYL-UMR 9222, LIDYL, CEA, CNRS, Université Paris-Saclay, CEA Saclay, 91191 Gif-sur-Yvette Cedex, France
| | - J-M Soudan
- CEA/DRF/IRAMIS/LIDYL-UMR 9222, LIDYL, CEA, CNRS, Université Paris-Saclay, CEA Saclay, 91191 Gif-sur-Yvette Cedex, France
| |
Collapse
|
14
|
Wales DJ. Decoding heat capacity features from the energy landscape. Phys Rev E 2017; 95:030105. [PMID: 28415307 DOI: 10.1103/physreve.95.030105] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Indexed: 04/28/2023]
Abstract
A general scheme is derived to connect transitions in configuration space with features in the heat capacity. A formulation in terms of occupation probabilities for local minima that define the potential energy landscape provides a quantitative description of how contributions arise from competition between different states. The theory does not rely on a structural interpretation for the local minima, so it is equally applicable to molecular energy landscapes and the landscapes defined by abstract functions. Applications are presented for low-temperature solid-solid transitions in atomic clusters, which involve just a few local minima with different morphologies, and for cluster melting, which is driven by the landscape entropy associated with the more numerous high-energy minima. Analyzing these features in terms of the balance between states with increasing and decreasing occupation probabilities provides a direct interpretation of the underlying transitions. This approach enables us to identify a qualitatively different transition that is caused by a single local minimum associated with an exceptionally large catchment volume in configuration space for a machine learning landscape.
Collapse
Affiliation(s)
- David J Wales
- University Chemical Laboratories, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
15
|
Zhai H, Alexandrova AN. Fluxionality of Catalytic Clusters: When It Matters and How to Address It. ACS Catal 2017. [DOI: 10.1021/acscatal.6b03243] [Citation(s) in RCA: 117] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Huanchen Zhai
- Department
of Chemistry and Biochemistry, University of California, Los Angeles, Los
Angeles, California 90095, United States
| | - Anastassia N. Alexandrova
- Department
of Chemistry and Biochemistry, University of California, Los Angeles, Los
Angeles, California 90095, United States
- California NanoSystems Institute, Los Angeles, California 90095, United States
| |
Collapse
|
16
|
|
17
|
Zhai H, Alexandrova AN. Ensemble-Average Representation of Pt Clusters in Conditions of Catalysis Accessed through GPU Accelerated Deep Neural Network Fitting Global Optimization. J Chem Theory Comput 2016; 12:6213-6226. [DOI: 10.1021/acs.jctc.6b00994] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Huanchen Zhai
- Department
of Chemistry and Biochemistry, University of California, Los Angeles, Los
Angeles, California 90095, United States
| | - Anastassia N. Alexandrova
- Department
of Chemistry and Biochemistry, University of California, Los Angeles, Los
Angeles, California 90095, United States
- California NanoSystems
Institute, Los Angeles, California 90095, United States
| |
Collapse
|
18
|
Martiniani S, Schrenk KJ, Stevenson JD, Wales DJ, Frenkel D. Structural analysis of high-dimensional basins of attraction. Phys Rev E 2016; 94:031301. [PMID: 27739758 DOI: 10.1103/physreve.94.031301] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Indexed: 06/06/2023]
Abstract
We propose an efficient Monte Carlo method for the computation of the volumes of high-dimensional bodies with arbitrary shape. We start with a region of known volume within the interior of the manifold and then use the multistate Bennett acceptance-ratio method to compute the dimensionless free-energy difference between a series of equilibrium simulations performed within this object. The method produces results that are in excellent agreement with thermodynamic integration, as well as a direct estimate of the associated statistical uncertainties. The histogram method also allows us to directly obtain an estimate of the interior radial probability density profile, thus yielding useful insight into the structural properties of such a high-dimensional body. We illustrate the method by analyzing the effect of structural disorder on the basins of attraction of mechanically stable packings of soft repulsive spheres.
Collapse
Affiliation(s)
- Stefano Martiniani
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - K Julian Schrenk
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Jacob D Stevenson
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- Microsoft Research Limited, 21 Station Road, Cambridge CB1 2FB, United Kingdom
| | - David J Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Daan Frenkel
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
19
|
Das R, Wales DJ. Energy landscapes for a machine-learning prediction of patient discharge. Phys Rev E 2016; 93:063310. [PMID: 27415390 DOI: 10.1103/physreve.93.063310] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Indexed: 11/07/2022]
Abstract
The energy landscapes framework is applied to a configuration space generated by training the parameters of a neural network. In this study the input data consists of time series for a collection of vital signs monitored for hospital patients, and the outcomes are patient discharge or continued hospitalisation. Using machine learning as a predictive diagnostic tool to identify patterns in large quantities of electronic health record data in real time is a very attractive approach for supporting clinical decisions, which have the potential to improve patient outcomes and reduce waiting times for discharge. Here we report some preliminary analysis to show how machine learning might be applied. In particular, we visualize the fitting landscape in terms of locally optimal neural networks and the connections between them in parameter space. We anticipate that these results, and analogues of thermodynamic properties for molecular systems, may help in the future design of improved predictive tools.
Collapse
Affiliation(s)
- Ritankar Das
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David J Wales
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|