1
|
Wang Z, Tao P, Chen L. Brain-inspired chaotic spiking backpropagation. Natl Sci Rev 2024; 11:nwae037. [PMID: 38707198 PMCID: PMC11067972 DOI: 10.1093/nsr/nwae037] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/19/2023] [Accepted: 01/17/2024] [Indexed: 05/07/2024] Open
Abstract
Spiking neural networks (SNNs) have superior energy efficiency due to their spiking signal transmission, which mimics biological nervous systems, but they are difficult to train effectively. Although surrogate gradient-based methods offer a workable solution, trained SNNs frequently fall into local minima because they are still primarily based on gradient dynamics. Inspired by the chaotic dynamics in animal brain learning, we propose a chaotic spiking backpropagation (CSBP) method that introduces a loss function to generate brain-like chaotic dynamics and further takes advantage of the ergodic and pseudo-random nature to make SNN learning effective and robust. From a computational viewpoint, we found that CSBP significantly outperforms current state-of-the-art methods on both neuromorphic data sets (e.g. DVS-CIFAR10 and DVS-Gesture) and large-scale static data sets (e.g. CIFAR100 and ImageNet) in terms of accuracy and robustness. From a theoretical viewpoint, we show that the learning process of CSBP is initially chaotic, then subject to various bifurcations and eventually converges to gradient dynamics, consistently with the observation of animal brain activity. Our work provides a superior core tool for direct SNN training and offers new insights into understanding the learning process of a biological brain.
Collapse
Affiliation(s)
- Zijian Wang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
| | - Peng Tao
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
| | - Luonan Chen
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai 200031, China
- Guangdong Institute of Intelligence Science and Technology, Hengqin, Zhuhai 519031, China
- Pazhou Laboratory (Huangpu), Guangzhou 510555, China
| |
Collapse
|
2
|
Annesi BL, Lauditi C, Lucibello C, Malatesta EM, Perugini G, Pittorino F, Saglietti L. Star-Shaped Space of Solutions of the Spherical Negative Perceptron. PHYSICAL REVIEW LETTERS 2023; 131:227301. [PMID: 38101365 DOI: 10.1103/physrevlett.131.227301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 09/05/2023] [Accepted: 11/08/2023] [Indexed: 12/17/2023]
Abstract
Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where zero-energy paths between pairs of distant solutions can be constructed. Here, we consider the spherical negative perceptron, a prototypical nonconvex neural network model framed as a continuous constraint satisfaction problem. We introduce a general analytical method for computing energy barriers in the simplex with vertex configurations sampled from the equilibrium. We find that in the overparametrized regime the solution manifold displays simple connectivity properties. There exists a large geodesically convex component that is attractive for a wide range of optimization dynamics. Inside this region we identify a subset of atypical high-margin solutions that are geodesically connected with most other solutions, giving rise to a star-shaped geometry. We analytically characterize the organization of the connected space of solutions and show numerical evidence of a transition, at larger constraint densities, where the aforementioned simple geodesic connectivity breaks down.
Collapse
Affiliation(s)
| | - Clarissa Lauditi
- Department of Applied Science and Technology, Politecnico di Torino, 10129 Torino, Italy
| | - Carlo Lucibello
- Department of Computing Sciences, Bocconi University, 20136 Milano, Italy
- Bocconi Institute for Data Science and Analytics, 20136 Milano, Italy
| | - Enrico M Malatesta
- Department of Computing Sciences, Bocconi University, 20136 Milano, Italy
- Bocconi Institute for Data Science and Analytics, 20136 Milano, Italy
| | - Gabriele Perugini
- Department of Computing Sciences, Bocconi University, 20136 Milano, Italy
| | - Fabrizio Pittorino
- Bocconi Institute for Data Science and Analytics, 20136 Milano, Italy
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, 20125 Milano, Italy
| | - Luca Saglietti
- Department of Computing Sciences, Bocconi University, 20136 Milano, Italy
- Bocconi Institute for Data Science and Analytics, 20136 Milano, Italy
| |
Collapse
|
3
|
Rodríguez-Cruz C, Molaei M, Thirumalaiswamy A, Feitosa K, Manoharan VN, Sivarajan S, Reich DH, Riggleman RA, Crocker JC. Experimental observations of fractal landscape dynamics in a dense emulsion. SOFT MATTER 2023; 19:6805-6813. [PMID: 37650227 DOI: 10.1039/d3sm00852e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
Many soft and biological materials display so-called 'soft glassy' dynamics; their constituents undergo anomalous random motions and complex cooperative rearrangements. A recent simulation model of one soft glassy material, a coarsening foam, suggested that the random motions of its bubbles are due to the system configuration moving over a fractal energy landscape in high-dimensional space. Here we show that the salient geometrical features of such high-dimensional fractal landscapes can be explored and reliably quantified, using empirical trajectory data from many degrees of freedom, in a model-free manner. For a mayonnaise-like dense emulsion, analysis of the observed trajectories of oil droplets quantitatively reproduces the high-dimensional fractal geometry of the configuration path and its associated local energy minima generated using a computational model. That geometry in turn drives the droplets' complex random motion observed in real space. Our results indicate that experimental studies can elucidate whether the similar dynamics in different soft and biological materials may also be due to fractal landscape dynamics.
Collapse
Affiliation(s)
- Clary Rodríguez-Cruz
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| | - Mehdi Molaei
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| | - Amruthesh Thirumalaiswamy
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| | - Klebert Feitosa
- Department of Physics and Astronomy, James Madison University, Harrisonburg, Virginia, USA
| | - Vinothan N Manoharan
- Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts, USA
- Department of Physics, Harvard University, Cambridge, Massachusetts, USA
| | - Shankar Sivarajan
- Department of Physics and Astronomy, Johns Hopkins University, Baltimore, Maryland, USA
| | - Daniel H Reich
- Department of Physics and Astronomy, Johns Hopkins University, Baltimore, Maryland, USA
| | - Robert A Riggleman
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| | - John C Crocker
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
4
|
Exploring canyons in glassy energy landscapes using metadynamics. Proc Natl Acad Sci U S A 2022; 119:e2210535119. [PMID: 36256806 PMCID: PMC9618120 DOI: 10.1073/pnas.2210535119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The complex physics of glass-forming systems is controlled by the structure of the low-energy portions of their potential energy landscapes. Here we report that a modified metadynamics algorithm efficiently explores and samples low-energy regions of such high-dimensional landscapes. In the energy landscape for a model foam, our algorithm finds and descends meandering canyons in the landscape, which contain dense clusters of energy minima along their floors. Similar canyon structures in the energy landscapes of two model glass formers—hard sphere fluids and the Kob–Andersen glass—allow us to reach high densities and low energies, respectively. In the hard sphere system, fluid configurations are found to form continuous regions that cover the canyon floors up to densities well above the jamming transition. For the Kob–Andersen glass former, our technique samples low-energy states with modest computational effort, with the lowest energies found approaching the predicted Kauzmann limit.
Collapse
|
5
|
Herrera Segura C, Montoya E, Tapias D. Subaging in underparametrized Deep Neural Networks. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac8f1b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
In this work, we consider a simple classification problem to show that the dynamics of finite--width Deep Neural Networks in the underparametrized regime gives rise to effects similar to those associated with glassy systems, namely a slow evolution of the loss function and aging. Remarkably, the aging is sublinear in the waiting time (subaging) and the power--law exponent characterizing it is robust to different architectures under the constraint of a constant total number of parameters. Our results are maintained in the more complex scenario of the MNIST database. We find that for this database there is a unique exponent ruling the subaging behavior in the whole phase.
Collapse
|