1. Lee H, Kim Y, Yang SY, Choi H. Improved weight initialization for deep and narrow feedforward neural network. Neural Netw 2024; 176:106362. PMID: 38733795. DOI: 10.1016/j.neunet.2024.106362.
Abstract
Appropriate weight initialization, together with the ReLU activation function, has become a cornerstone of modern deep learning, enabling the training and deployment of highly effective and efficient neural network models across diverse areas of artificial intelligence. The problem of "dying ReLU," where ReLU neurons become inactive and yield zero output, presents a significant challenge in the training of deep neural networks with the ReLU activation function. Theoretical research and various methods have been introduced to address the problem, yet training remains challenging for extremely deep and narrow feedforward networks with the ReLU activation function. In this paper, we propose a novel weight initialization method to address this issue. We establish several properties of our initial weight matrix and demonstrate how these properties enable the effective propagation of signal vectors. Through a series of experiments and comparisons with existing methods, we demonstrate the effectiveness of the proposed initialization method.
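The failure mode motivating this work is easy to reproduce. The sketch below is our own illustration, not the authors' initialization: it forwards a random input through a deep, narrow ReLU network under standard He initialization and counts how often the signal dies outright.

```python
import numpy as np

def dies(depth, width, rng):
    """Forward a random input through a deep, narrow ReLU MLP with
    He (Kaiming) initialization and report whether the signal dies,
    i.e. every activation hits exactly zero at some layer."""
    x = rng.standard_normal(width)
    for _ in range(depth):
        w = rng.standard_normal((width, width)) * np.sqrt(2.0 / width)
        x = np.maximum(w @ x, 0.0)   # ReLU
        if not x.any():              # all neurons inactive: signal is gone for good
            return True
    return False

rng = np.random.default_rng(0)
dead = sum(dies(depth=200, width=4, rng=rng) for _ in range(100))
print(f"{dead}/100 networks died")
```

At width 4, each layer has roughly a (1/2)^4 chance of zeroing every neuron at once, so over hundreds of layers nearly every network goes permanently silent; wider layers make this event exponentially rarer, which is why the problem is specific to deep and narrow architectures.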
Affiliation(s)
- Hyunwoo Lee: Department of Mathematics, Kyungpook National University, Daegu 41566, Republic of Korea.
- Yunho Kim: Department of Mathematical Sciences, Ulsan National Institute of Science and Technology, Ulsan 44919, Republic of Korea.
- Seung Yeop Yang: Department of Mathematics, Kyungpook National University, Daegu 41566, Republic of Korea; KNU LAMP Research Center, KNU Institute of Basic Sciences, Kyungpook National University, Daegu 41566, Republic of Korea.
- Hayoung Choi: Department of Mathematics, Kyungpook National University, Daegu 41566, Republic of Korea.
2. Bahri Y, Dyer E, Kaplan J, Lee J, Sharma U. Explaining neural scaling laws. Proc Natl Acad Sci U S A 2024; 121:e2311878121. PMID: 38913889. PMCID: PMC11228526. DOI: 10.1073/pnas.2311878121.
Abstract
The population loss of trained deep neural networks often follows precise power-law scaling relations with either the size of the training dataset or the number of parameters in the network. We propose a theory that explains the origins of and connects these scaling laws. We identify variance-limited and resolution-limited scaling behavior for both dataset and model size, for a total of four scaling regimes. The variance-limited scaling follows simply from the existence of a well-behaved infinite data or infinite width limit, while the resolution-limited regime can be explained by positing that models are effectively resolving a smooth data manifold. In the large width limit, this can be equivalently obtained from the spectrum of certain kernels, and we present evidence that large width and large dataset resolution-limited scaling exponents are related by a duality. We exhibit all four scaling regimes in the controlled setting of large random feature and pretrained models and test the predictions empirically on a range of standard architectures and datasets. We also observe several empirical relationships between datasets and scaling exponents under modifications of task and architecture aspect ratio. Our work provides a taxonomy for classifying different scaling regimes, underscores that there can be different mechanisms driving improvements in loss, and lends insight into the microscopic origin and relationships between scaling exponents.
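The empirical starting point of this theory is the power-law fit itself. A minimal toy example (synthetic data, our own illustration): generate losses obeying L(N) ∝ N^(−α) and recover the exponent by least squares in log-log space.

```python
import numpy as np

# Synthetic losses following L(N) = c * N**(-alpha), with small
# multiplicative noise, then recover the scaling exponent from a
# straight-line fit in log-log coordinates.
rng = np.random.default_rng(1)
alpha_true, c = 0.35, 4.0
N = np.logspace(3, 7, 12)                        # dataset sizes
L = c * N**(-alpha_true) * np.exp(rng.normal(0, 0.02, N.size))

slope, intercept = np.polyfit(np.log(N), np.log(L), 1)
alpha_hat = -slope
print(f"recovered exponent: {alpha_hat:.3f}")    # close to alpha_true = 0.35
```

Deciding which of the paper's four regimes (variance- vs resolution-limited, in data or in model size) produced a measured exponent is exactly where its taxonomy comes in.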
Affiliation(s)
- Jared Kaplan: Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218.
- Utkarsh Sharma: Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218.
3. Mastrovito D, Liu YH, Kusmierz L, Shea-Brown E, Koch C, Mihalas S. Transition to chaos separates learning regimes and relates to measure of consciousness in recurrent neural networks. bioRxiv [Preprint] 2024:2024.05.15.594236. PMID: 38798582. PMCID: PMC11118502. DOI: 10.1101/2024.05.15.594236.
Abstract
Recurrent neural networks exhibit chaotic dynamics when the variance in their connection strengths exceeds a critical value. Recent work indicates connection variance also modulates learning strategies; networks learn "rich" representations when initialized with low coupling and "lazier" solutions with larger variance. Using Watts-Strogatz networks of varying sparsity, structure, and hidden weight variance, we find that the critical coupling strength dividing chaotic from ordered dynamics also differentiates rich and lazy learning strategies. Training moves both stable and chaotic networks closer to the edge of chaos, with networks learning richer representations before the transition to chaos. In contrast, biologically realistic connectivity structures foster stability over a wide range of variances. The transition to chaos is also reflected in a measure that clinically discriminates levels of consciousness, the perturbational complexity index (PCIst). Networks with high values of PCIst exhibit stable dynamics and rich learning, suggesting a consciousness prior may promote rich learning. The results suggest a clear relationship between critical dynamics, learning regimes and complexity-based measures of consciousness.
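The critical coupling referred to above is the classical g = 1 boundary for random rate networks: with i.i.d. Gaussian couplings of variance g²/N, the circular law places the spectral radius of the connectivity matrix near g. A small sketch of that criterion (our illustration; the paper's Watts-Strogatz networks add sparsity and structure on top of this):

```python
import numpy as np

rng = np.random.default_rng(2)
N = 500  # number of units

def spectral_radius(g):
    """Largest eigenvalue modulus of a random coupling matrix whose
    entries have variance g**2 / N (circular law: radius ~ g)."""
    J = rng.normal(0.0, g / np.sqrt(N), size=(N, N))
    return np.abs(np.linalg.eigvals(J)).max()

r_low, r_high = spectral_radius(0.5), spectral_radius(1.5)
print(f"g=0.5 -> radius {r_low:.2f} (ordered); g=1.5 -> radius {r_high:.2f} (chaotic)")
```

When the radius crosses 1, the linearized dynamics around the origin become unstable, the conventional onset of chaos in such rate models.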
4. Thomas T, Straub D, Tatai F, Shene M, Tosik T, Kersting K, Rothkopf CA. Modelling dataset bias in machine-learned theories of economic decision-making. Nat Hum Behav 2024; 8:679-691. PMID: 38216691. PMCID: PMC11045447. DOI: 10.1038/s41562-023-01784-6.
Abstract
Normative and descriptive models have long vied to explain and predict human risky choices, such as those between goods or gambles. A recent study reported the discovery of a new, more accurate model of human decision-making by training neural networks on a new online large-scale dataset, choices13k. Here we systematically analyse the relationships between several models and datasets using machine-learning methods and find evidence for dataset bias. Because participants' choices in stochastically dominated gambles were consistently skewed towards equipreference in the choices13k dataset, we hypothesized that this reflected increased decision noise. Indeed, a probabilistic generative model adding structured decision noise to a neural network trained on data from a laboratory study transferred best, that is, outperformed all models apart from those trained on choices13k. We conclude that a careful combination of theory and data analysis is still required to understand the complex interactions of machine-learning models and data of human risky choices.
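One simple way to formalize "skewed towards equipreference" is a lapse-style noise model in which some fraction of choices are uniform guesses. This is our simplification for illustration, not the paper's structured generative model:

```python
import numpy as np

# With probability eps the participant guesses uniformly between the two
# gambles, pulling every choice probability toward equipreference (0.5),
# which is the qualitative pattern reported for the choices13k dataset.
def add_decision_noise(p_choose_A, eps):
    p = np.asarray(p_choose_A, dtype=float)
    return (1.0 - eps) * p + eps * 0.5

model_p = np.array([0.05, 0.50, 0.95])  # e.g. dominated / neutral / dominant gambles
print(add_decision_noise(model_p, eps=0.4))  # extremes move toward 0.5
```

Under such a model, stochastically dominated gambles, where a noiseless chooser should sit near 0 or 1, are exactly where the compression toward 0.5 is most visible.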
Affiliation(s)
- Tobias Thomas: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany; Hessian Center for Artificial Intelligence, Darmstadt, Germany.
- Dominik Straub: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany.
- Fabian Tatai: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany.
- Megan Shene: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany.
- Tümer Tosik: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany.
- Kristian Kersting: Hessian Center for Artificial Intelligence, Darmstadt, Germany; Centre for Cognitive Science and Computer Science Department, Technical University of Darmstadt, Darmstadt, Germany.
- Constantin A Rothkopf: Centre for Cognitive Science and Institute of Psychology, Technical University of Darmstadt, Darmstadt, Germany; Hessian Center for Artificial Intelligence, Darmstadt, Germany.
5. Katsuno H, Kimura Y, Yamazaki T, Takigawa I. Machine Learning Refinement of In Situ Images Acquired by Low Electron Dose LC-TEM. Microsc Microanal 2024; 30:77-84. PMID: 38285924. DOI: 10.1093/micmic/ozad142.
Abstract
We have studied a machine learning (ML) technique for refining images acquired during in situ observation using liquid-cell transmission electron microscopy. Our model is constructed using a U-Net architecture and a ResNet encoder. For training our ML model, we prepared an original image dataset that contained pairs of images of samples acquired with and without a solution present. The former images were used as noisy images, and the latter images were used as corresponding ground truth images. The number of pairs of image sets was 1,204, and the image sets included images acquired at several different magnifications and electron doses. The trained model converted a noisy image into a clear image. The time necessary for the conversion was on the order of 10 ms, and we applied the model to in situ observations using the software Gatan DigitalMicrograph (DM). Even if a nanoparticle was not visible in a view window in the DM software because of the low electron dose, it was visible in a successive refined image generated by our ML model.
Affiliation(s)
- Hiroyasu Katsuno: Emerging Media Initiative, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192 Ishikawa, Japan.
- Yuki Kimura: Institute of Low Temperature Science, Hokkaido University, Kita-19, Nishi-8, Kita-ku, Sapporo, 060-0819 Hokkaido, Japan.
- Tomoya Yamazaki: Institute of Low Temperature Science, Hokkaido University, Kita-19, Nishi-8, Kita-ku, Sapporo, 060-0819 Hokkaido, Japan.
- Ichigaku Takigawa: Institute for Liberal Arts and Sciences, Kyoto University, 302 Konoe-kae, 69 Konoe-cho, Sakyo-ku, Kyoto, 606-8315 Kyoto, Japan; Institute for Chemical Reaction Design and Discovery, Hokkaido University, N21 W10, Kita-ku, Sapporo, 001-0021 Hokkaido, Japan.
6. Huang L, Zhang C, Zhang H. Self-Adaptive Training: Bridging Supervised and Self-Supervised Learning. IEEE Trans Pattern Anal Mach Intell 2024; 46:1362-1377. PMID: 36306295. DOI: 10.1109/tpami.2022.3217792.
Abstract
We propose self-adaptive training, a unified training algorithm that dynamically calibrates and enhances the training process using model predictions without incurring extra computational cost, to advance both supervised and self-supervised learning of deep neural networks. We analyze the training dynamics of deep networks on training data that are corrupted by, e.g., random noise and adversarial examples. Our analysis shows that model predictions are able to magnify useful underlying information in data, and that this phenomenon occurs broadly even in the absence of any label information, highlighting that model predictions can substantially benefit the training process: self-adaptive training improves the generalization of deep networks under noise and enhances self-supervised representation learning. The analysis also sheds light on understanding deep learning, e.g., a potential explanation of the recently discovered double-descent phenomenon in empirical risk minimization and the collapsing issue of state-of-the-art self-supervised learning algorithms. Experiments on the CIFAR, STL, and ImageNet datasets verify the effectiveness of our approach in three applications: classification with label noise, selective classification, and linear evaluation. To facilitate future research, the code has been made publicly available at https://github.com/LayneH/self-adaptive-training.
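The calibration idea lends itself to a few-line sketch: soft training targets start at the (possibly noisy) labels and are exponentially averaged with the model's own predictions, so a consistently confident prediction can gradually override a corrupted label. This is a minimal sketch of that target-update idea, our simplification rather than the paper's full algorithm (which includes warm-up and per-sample bookkeeping):

```python
import numpy as np

def update_targets(targets, predictions, momentum=0.9):
    """Exponential moving average of soft targets toward model predictions."""
    return momentum * targets + (1.0 - momentum) * predictions

targets = np.array([0.0, 1.0, 0.0])       # noisy one-hot label (wrong class)
prediction = np.array([0.9, 0.05, 0.05])  # the model consistently predicts class 0
for _ in range(50):                       # one update per epoch, say
    targets = update_targets(targets, prediction)
print(targets.argmax())                   # the target has flipped to class 0
```

After 50 updates the initial label carries weight 0.9^50 ≈ 0.005, so the target now agrees with the model's stable prediction while remaining a valid probability vector.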
7. Lasko TA, Strobl EV, Stead WW. Why do probabilistic clinical models fail to transport between sites. NPJ Digit Med 2024; 7:53. PMID: 38429353. PMCID: PMC10907678. DOI: 10.1038/s41746-024-01037-4.
Abstract
The rising popularity of artificial intelligence in healthcare is highlighting the problem that a computational model achieving super-human clinical performance at its training sites may perform substantially worse at new sites. In this perspective, we argue that we should typically expect this failure to transport, and we present its common sources, divided into those under the control of the experimenter and those inherent to the clinical data-generating process. Among the inherent sources, we look more closely at site-specific clinical practices that can affect the data distribution, and we propose a potential solution intended to isolate the imprint of those practices on the data from the patterns of disease cause and effect that are the usual target of probabilistic clinical models.
Affiliation(s)
- Thomas A Lasko: Vanderbilt University Medical Center, Nashville, TN, USA.
- Eric V Strobl: Vanderbilt University Medical Center, Nashville, TN, USA.
8. Liu YH, Baratin A, Cornford J, Mihalas S, Shea-Brown E, Lajoie G. How connectivity structure shapes rich and lazy learning in neural circuits. arXiv [Preprint] 2024:arXiv:2310.08513v2. PMID: 37873007. PMCID: PMC10593070.
Abstract
In theoretical neuroscience, recent work leverages deep learning tools to explore how certain network attributes critically influence learning dynamics. Notably, initial weight distributions with small (resp. large) variance may yield a rich (resp. lazy) regime, in which significant (resp. minor) changes to network states and representations are observed over the course of learning. In biology, however, neural circuit connectivity may exhibit a low-rank structure and therefore differs markedly from the random initializations generally used in these studies. We therefore investigate how the structure of the initial weights, in particular their effective rank, influences the network learning regime. Through both empirical and theoretical analyses, we find that high-rank initializations typically yield smaller network changes indicative of lazier learning, a finding we also confirm with experimentally driven initial connectivity in recurrent neural networks. Conversely, low-rank initialization biases networks toward richer learning. Importantly, as an exception to this rule, we find that lazier learning can still occur with a low-rank initialization that aligns with task and data statistics. Our research highlights the pivotal role of initial weight structure in shaping learning regimes, with implications for the metabolic costs of plasticity and the risk of catastrophic forgetting.
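Effective rank, the key knob in this abstract, can be measured as the exponential of the entropy of the normalized singular-value spectrum (one common definition; the paper may use a different estimator). A quick sketch contrasting a rank-2 initialization with a dense random one:

```python
import numpy as np

def effective_rank(W):
    """exp of the entropy of the normalized singular-value spectrum."""
    s = np.linalg.svd(W, compute_uv=False)
    p = s / s.sum()
    return float(np.exp(-(p * np.log(p + 1e-12)).sum()))

rng = np.random.default_rng(3)
n, r = 200, 2
low_rank = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))  # rank-2 init
dense = rng.standard_normal((n, n))                                   # generic random init
print(f"rank-2: {effective_rank(low_rank):.1f}; dense: {effective_rank(dense):.1f}")
```

The low-rank product scores near 2 while the dense Gaussian matrix scores in the hundreds, the axis along which the abstract's rich-to-lazy transition is organized.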
9. Fischer B, Chemnitz M, Zhu Y, Perron N, Roztocki P, MacLellan B, Di Lauro L, Aadhi A, Rimoldi C, Falk TH, Morandotti R. Neuromorphic Computing via Fission-based Broadband Frequency Generation. Adv Sci (Weinh) 2023; 10:e2303835. PMID: 37786262. PMCID: PMC10724387. DOI: 10.1002/advs.202303835.
Abstract
The performance limitations of traditional computer architectures have led to the rise of brain-inspired hardware, with optical solutions gaining popularity due to the energy efficiency, high speed, and scalability of linear operations. However, the use of optics to emulate the synaptic activity of neurons has remained a challenge since the integration of nonlinear nodes is power-hungry and, thus, hard to scale. Neuromorphic wave computing offers a new paradigm for energy-efficient information processing, building upon transient and passively nonlinear interactions between optical modes in a waveguide. Here, an implementation of this concept is presented using broadband frequency conversion by coherent higher-order soliton fission in a single-mode fiber. It is shown that phase encoding on femtosecond pulses at the input, alongside frequency selection and weighting at the system output, makes transient spectro-temporal system states interpretable and allows for the energy-efficient emulation of various digital neural networks. The experiments in a compact, fully fiber-integrated setup substantiate an anticipated enhancement in computational performance with increasing system nonlinearity. The findings suggest that broadband frequency generation, accessible on-chip and in-fiber with off-the-shelf components, may challenge the traditional approach to node-based brain-inspired hardware design, ultimately leading to energy-efficient, scalable, and dependable computing with minimal optical hardware requirements.
Affiliation(s)
- Bennet Fischer: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada; Leibniz Institute of Photonic Technology, Albert-Einstein-Str. 9, 07745 Jena, Germany.
- Mario Chemnitz: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada; Leibniz Institute of Photonic Technology, Albert-Einstein-Str. 9, 07745 Jena, Germany.
- Yi Zhu: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- Nicolas Perron: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- Piotr Roztocki: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada; Ki3 Photonics Technologies, 2547 Rue Sicard, Montreal, Quebec H1V 2Y8, Canada.
- Benjamin MacLellan: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- Luigi Di Lauro: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- A. Aadhi: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- Cristina Rimoldi: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada; Dipartimento di Elettronica e Telecomunicazioni, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Torino, Italy.
- Tiago H. Falk: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
- Roberto Morandotti: Institut National de la Recherche Scientifique – Énergie Matériaux et Télécommunications, 1650 Blvd. Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada.
10. Sun W, Advani M, Spruston N, Saxe A, Fitzgerald JE. Organizing memories for generalization in complementary learning systems. Nat Neurosci 2023; 26:1438-1448. PMID: 37474639. PMCID: PMC10400413. DOI: 10.1038/s41593-023-01382-9.
Abstract
Memorization and generalization are complementary cognitive processes that jointly promote adaptive behavior. For example, animals should memorize safe routes to specific water sources and generalize from these memories to discover environmental features that predict new ones. These functions depend on systems consolidation mechanisms that construct neocortical memory traces from hippocampal precursors, but why systems consolidation only applies to a subset of hippocampal memories is unclear. Here we introduce a new neural network formalization of systems consolidation that reveals an overlooked tension: unregulated neocortical memory transfer can cause overfitting and harm generalization in an unpredictable world. We resolve this tension by postulating that memories only consolidate when doing so aids generalization. This framework accounts for partial hippocampal-cortical memory transfer and provides a normative principle for reconceptualizing numerous observations in the field. Generalization-optimized systems consolidation thus provides new insight into how adaptive behavior benefits from complementary learning systems specialized for memorization and generalization.
Affiliation(s)
- Weinan Sun: Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, USA.
- Madhu Advani: Center for Brain Science, Harvard University, Cambridge, MA, USA.
- Nelson Spruston: Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, USA.
- Andrew Saxe: Center for Brain Science, Harvard University, Cambridge, MA, USA; Department of Experimental Psychology, University of Oxford, Oxford, UK; Gatsby Computational Neuroscience Unit & Sainsbury Wellcome Centre, UCL, London, UK; CIFAR Azrieli Global Scholars Program, CIFAR, Toronto, Ontario, Canada.
- James E Fitzgerald: Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, USA.
11. Hanin B, Zlokapa A. Bayesian interpolation with deep linear networks. Proc Natl Acad Sci U S A 2023; 120:e2301345120. PMID: 37252994. PMCID: PMC10266010. DOI: 10.1073/pnas.2301345120.
Abstract
Characterizing how neural network depth, width, and dataset size jointly impact model quality is a central problem in deep learning theory. We give here a complete solution in the special case of linear networks with output dimension one trained using zero noise Bayesian inference with Gaussian weight priors and mean squared error as a negative log-likelihood. For any training dataset, network depth, and hidden layer widths, we find non-asymptotic expressions for the predictive posterior and Bayesian model evidence in terms of Meijer-G functions, a class of meromorphic special functions of a single complex variable. Through novel asymptotic expansions of these Meijer-G functions, a rich new picture of the joint role of depth, width, and dataset size emerges. We show that linear networks make provably optimal predictions at infinite depth: the posterior of infinitely deep linear networks with data-agnostic priors is the same as that of shallow networks with evidence-maximizing data-dependent priors. This yields a principled reason to prefer deeper networks when priors are forced to be data-agnostic. Moreover, we show that with data-agnostic priors, Bayesian model evidence in wide linear networks is maximized at infinite depth, elucidating the salutary role of increased depth for model selection. Underpinning our results is a novel emergent notion of effective depth, given by the number of hidden layers times the number of data points divided by the network width; this determines the structure of the posterior in the large-data limit.
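The effective depth highlighted in the closing sentence admits a one-line formula. In our notation (the abstract states it in words), with $L$ hidden layers, $P$ training points, and hidden width $N$:

```latex
\lambda_{\mathrm{eff}} \;=\; \frac{L\,P}{N},
```

and, per the abstract, this combination determines the structure of the posterior in the large-data limit, so very deep networks at fixed width and dataset size occupy a different regime than wide, shallow ones.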
Affiliation(s)
- Boris Hanin: Department of Operations Research and Financial Engineering, Princeton University, Princeton, NJ 08540.
- Alexander Zlokapa: Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139; Google Quantum AI, Venice, CA 90291.
12. Shan H, Sompolinsky H. Minimum perturbation theory of deep perceptual learning. Phys Rev E 2022; 106:064406. PMID: 36671118. DOI: 10.1103/physreve.106.064406.
Abstract
Perceptual learning (PL) involves long-lasting improvement in perceptual tasks following extensive training and is accompanied by modified neuronal responses in sensory cortical areas in the brain. Understanding the dynamics of PL and the resultant synaptic changes is important for causally connecting PL to the observed neural plasticity. This is theoretically challenging because learning-related changes are distributed across many stages of the sensory hierarchy. In this paper, we modeled the sensory hierarchy as a deep nonlinear neural network and studied PL of fine discrimination, a common and well-studied paradigm of PL. Using tools from statistical physics, we developed a mean-field theory of the network in the limit of a large number of neurons and large number of examples. Our theory suggests that, in this thermodynamic limit, the input-output function of the network can be exactly mapped to that of a deep linear network, allowing us to characterize the space of solutions for the task. Surprisingly, we found that modifying synaptic weights in the first layer of the hierarchy is both sufficient and necessary for PL. To address the degeneracy of the space of solutions, we postulate that PL dynamics are constrained by a normative minimum perturbation (MP) principle, which favors weight matrices with minimal changes relative to their prelearning values. Interestingly, MP plasticity induces changes to weights and neural representations in all layers of the network, except for the readout weight vector. While weight changes in higher layers are not necessary for learning, they help reduce overall perturbation to the network. In addition, such plasticity can be learned simply through slow learning. We further elucidate the properties of MP changes and compare them against experimental findings. Overall, our statistical mechanics theory of PL provides mechanistic and normative understanding of several important empirical findings of PL.
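The minimum perturbation principle postulated here can be written as a constrained optimization. The notation below is ours, a formalization of the abstract's verbal statement, with $W$ the pre-learning weights and $f_W$ the network's input-output map:

```latex
\Delta W^{\star} \;=\; \operatorname*{arg\,min}_{\Delta W}\; \lVert \Delta W \rVert^{2}
\qquad \text{subject to } f_{W+\Delta W} \text{ solving the fine-discrimination task.}
```

Learning then moves the weights to $W + \Delta W^{\star}$, the solution minimally displaced from the pre-learning configuration, consistent with the distributed, compensatory changes across layers described in the abstract.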
Affiliation(s)
- Haozhe Shan: Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA; Program in Neuroscience, Harvard Medical School, Boston, Massachusetts 02115, USA.
- Haim Sompolinsky: Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA; Edmond and Lily Safra Center for Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190401, Israel.
13. Saglietti L, Mannelli SS, Saxe A. An analytical theory of curriculum learning in teacher-student networks. J Stat Mech 2022; 2022:114014. PMID: 37817944. PMCID: PMC10561397. DOI: 10.1088/1742-5468/ac9b3c.
Abstract
In animals and humans, curriculum learning (presenting data in a curated order) is critical to rapid learning and effective pedagogy. A long history of experiments has demonstrated the impact of curricula in a variety of animals but, despite its ubiquitous presence, a theoretical understanding of the phenomenon is still lacking. Surprisingly, in contrast to animal learning, curriculum strategies are not widely used in machine learning, and recent simulation studies conclude that curricula are moderately effective or even ineffective in most cases. This stark difference in the importance of curriculum raises a fundamental theoretical question: when and why does curriculum learning help? In this work, we analyse a prototypical neural network model of curriculum learning in the high-dimensional limit, employing statistical physics methods. We study a task in which a sparse set of informative features is embedded amidst a large set of noisy features. We analytically derive average learning trajectories for simple neural networks on this task, which establish a clear speed benefit for curriculum learning in the online setting. However, when training experiences can be stored and replayed (for instance, during sleep), the advantage of curriculum in standard neural networks disappears, in line with observations from the deep learning literature. Inspired by synaptic consolidation techniques developed to combat catastrophic forgetting, we propose curriculum-aware algorithms that consolidate synapses at curriculum change points and investigate whether this can boost the benefits of curricula. We derive generalisation performance as a function of consolidation strength (implemented as an L2 regularisation/elastic coupling connecting learning phases), and show that curriculum-aware algorithms can yield a large improvement in test performance. Our reduced analytical descriptions help reconcile apparently conflicting empirical results, trace regimes where curriculum learning yields the largest gains, and provide experimentally accessible predictions for the impact of task parameters on curriculum benefits. More broadly, our results suggest that fully exploiting a curriculum may require explicit adjustments in the loss.
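The consolidation device can be sketched concretely: at a curriculum change point the current weights become an anchor, and later training minimizes the task loss plus an elastic (L2) coupling to that anchor. The function name and scalar form below are our illustration of that penalty, not the paper's full objective:

```python
import numpy as np

def consolidated_loss(task_loss, w, w_anchor, strength):
    """Task loss plus an elastic (L2) coupling pulling the weights
    toward their value at the last curriculum change point."""
    return task_loss + 0.5 * strength * np.sum((w - w_anchor) ** 2)

w_anchor = np.array([1.0, -2.0])  # weights frozen in at the curriculum change point
w = np.array([1.5, -1.0])         # current weights, drifting in the new phase
total = consolidated_loss(0.8, w, w_anchor, strength=2.0)
print(total)                      # 0.8 task loss + 1.25 elastic penalty
```

The consolidation strength is the knob the paper sweeps when deriving generalisation performance: at zero the phases decouple, and at large values the easy-phase solution is effectively frozen in.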
Affiliation(s)
- Luca Saglietti: Institute for Data Science and Analytics, Bocconi University, Italy.
- Stefano Sarao Mannelli: Gatsby Computational Neuroscience Unit and Sainsbury Wellcome Centre, University College London, London, United Kingdom.
- Andrew Saxe: Institute for Data Science and Analytics, Bocconi University, Italy; FAIR, Meta AI, United States of America.
14. Ma X, Sardy S, Hengartner N, Bobenko N, Lin YT. A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks. Stat Comput 2022; 32:99. PMID: 36299529. PMCID: PMC9587964. DOI: 10.1007/s11222-022-10169-0.
Abstract
To fit sparse linear associations, a LASSO sparsity-inducing penalty with a single hyperparameter provably allows the important features (needles) to be recovered with high probability in certain regimes, even if the sample size is smaller than the dimension of the input vector (haystack). More recently, learners known as artificial neural networks (ANNs) have shown great success in many machine learning tasks, in particular in fitting nonlinear associations. Small learning rates, the stochastic gradient descent algorithm, and large training sets help to cope with the explosion in the number of parameters present in deep neural networks. Yet few ANN learners have been developed and studied to find needles in nonlinear haystacks. Driven by a single hyperparameter, our ANN learner, as for sparse linear associations, exhibits a phase transition in the probability of retrieving the needles, which we do not observe with other ANN learners. To select our penalty parameter, we generalize the universal threshold of Donoho and Johnstone (Biometrika 81(3):425-455, 1994), which is a better rule than conservative (too many false detections) and expensive cross-validation. In the spirit of simulated annealing, we propose a warm-start sparsity-inducing algorithm to solve the high-dimensional, non-convex and non-differentiable optimization problem. We perform Monte Carlo experiments on simulated and real data to quantify the effectiveness of our approach.
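The sparsity mechanism underlying such a learner is the LASSO proximal operator, soft-thresholding, which snaps sub-threshold weights exactly to zero. A minimal sketch (the paper's warm-start annealing schedule and universal-threshold choice of the hyperparameter are not reproduced here):

```python
import numpy as np

def soft_threshold(w, lam):
    """Proximal operator of the L1 (LASSO) penalty: shrink every weight
    by lam and set anything that crosses zero exactly to zero."""
    return np.sign(w) * np.maximum(np.abs(w) - lam, 0.0)

w = np.array([0.03, -0.4, 1.2, -0.01])  # mostly "hay", two sizeable "needles"
print(soft_threshold(w, lam=0.05))      # small weights snap exactly to zero
```

Because the output is exactly sparse rather than merely small, sweeping lam makes the needle-retrieval probability a well-defined quantity, which is where the phase transition is observed.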
Affiliation(s)
- Xiaoyu Ma
- Shandong University, Jinan, China
- Department of Mathematics, University of Geneva, Geneva, Switzerland
- Sylvain Sardy
- Department of Mathematics, University of Geneva, Geneva, Switzerland
- Nick Hengartner
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, USA
- Nikolai Bobenko
- Department of Mathematics, University of Geneva, Geneva, Switzerland
- Yen Ting Lin
- Information Sciences Group, Los Alamos National Laboratory, Los Alamos, USA
15
Ingrosso A, Goldt S. Data-driven emergence of convolutional structure in neural networks. Proc Natl Acad Sci U S A 2022; 119:e2201854119. [PMID: 36161906 PMCID: PMC9546588 DOI: 10.1073/pnas.2201854119]
Abstract
Exploiting data invariances is crucial for efficient learning in both artificial and biological neural circuits. Understanding how neural networks can discover appropriate representations capable of harnessing the underlying symmetries of their inputs is therefore a central question in machine learning and neuroscience. Convolutional neural networks, for example, were designed to exploit translation symmetry, and their capabilities triggered the first wave of deep learning successes. However, learning convolutions directly from translation-invariant data with a fully connected network has so far proven elusive. Here we show how initially fully connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs, resulting in localized, space-tiling receptive fields. These receptive fields match the filters of a convolutional network trained on the same task. By carefully designing data models for the visual scene, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs, which has long been recognized as the hallmark of natural images. We provide an analytical and numerical characterization of the pattern formation mechanism responsible for this phenomenon in a simple model and find an unexpected link between receptive field formation and tensor decomposition of higher-order input correlations. These results provide a perspective on the development of low-level feature detectors in various sensory modalities and pave the way for studying the impact of higher-order statistics on learning in neural networks.
Affiliation(s)
- Alessandro Ingrosso
- Quantitative Life Sciences, The Abdus Salam International Centre for Theoretical Physics, 34151 Trieste, Italy
- Sebastian Goldt
- Department of Physics, International School of Advanced Studies, 34136 Trieste, Italy
16
Rocks JW, Mehta P. Bias-variance decomposition of overparameterized regression with random linear features. Phys Rev E 2022; 106:025304. [PMID: 36109970 PMCID: PMC9906786 DOI: 10.1103/physreve.106.025304]
Abstract
In classical statistics, the bias-variance trade-off describes how varying a model's complexity (e.g., number of fit parameters) affects its ability to make accurate predictions. According to this trade-off, optimal performance is achieved when a model is expressive enough to capture trends in the data, yet not so complex that it overfits idiosyncratic features of the training data. Recently, it has become clear that this classic understanding of the bias-variance trade-off must be fundamentally revisited in light of the incredible predictive performance of overparameterized models: models that avoid overfitting even when the number of fit parameters is large enough to perfectly fit the training data. Here, we present results for one of the simplest examples of an overparameterized model: regression with random linear features (i.e., a two-layer neural network with a linear activation function). Using the zero-temperature cavity method, we derive analytic expressions for the training error, test error, bias, and variance. We show that the random linear features model exhibits three phase transitions: two different transitions to an interpolation regime where the training error is zero, along with an additional transition between regimes with large bias and minimal bias. Using random matrix theory, we show how each transition arises due to small nonzero eigenvalues in the Hessian matrix. Finally, we compare and contrast the phase diagram of the random linear features model to the random nonlinear features model and ordinary regression, highlighting the additional phase transitions that result from the use of linear basis functions.
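The model is small enough to simulate directly. A sketch of the random linear features setup, with illustrative sizes (not the paper's cavity-method analysis): a fixed random first layer with linear activation, a least-squares second layer, and a training error that drops to zero once the feature count reaches the sample size.

```python
import numpy as np

rng = np.random.default_rng(1)
n, d = 20, 50                     # samples, input dimension
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)

def train_error(p):
    # Random linear features: z = W x with W fixed at random ("linear activation")
    W = rng.standard_normal((p, d)) / np.sqrt(d)
    Z = X @ W.T
    a = np.linalg.pinv(Z) @ y     # minimum-norm least-squares fit of the second layer
    return np.mean((Z @ a - y) ** 2)

for p in (5, 10, 50):
    print(p, train_error(p))      # error reaches ~0 once p >= n (interpolation regime)
```

With p >= n the feature matrix has full row rank almost surely, so the min-norm solution interpolates the training data exactly.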
Affiliation(s)
- Jason W. Rocks
- Department of Physics, Boston University, Boston, Massachusetts 02215, USA
- Pankaj Mehta
- Department of Physics, Boston University, Boston, Massachusetts 02215, USA
- Faculty of Computing and Data Sciences, Boston University, Boston, Massachusetts 02215, USA
17
Gu K, Masotto X, Bachani V, Lakshminarayanan B, Nikodem J, Yin D. An instance-dependent simulation framework for learning with label noise. Mach Learn 2022. [DOI: 10.1007/s10994-022-06207-7]
18
Gradient-based learning drives robust representations in recurrent neural networks by balancing compression and expansion. Nat Mach Intell 2022. [DOI: 10.1038/s42256-022-00498-0]
19
Sarraf A, Khalili S. An upper bound on the variance of scalar multilayer perceptrons for log-concave distributions. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.11.062]
20
Zavatone-Veth JA, Tong WL, Pehlevan C. Contrasting random and learned features in deep Bayesian linear regression. Phys Rev E 2022; 105:064118. [PMID: 35854590 DOI: 10.1103/physreve.105.064118]
Abstract
Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are trained, we provide a detailed characterization of the interplay between width, depth, data density, and prior mismatch. We show that both models display samplewise double-descent behavior in the presence of label noise. Random feature models can also display modelwise double descent if there are narrow bottleneck layers, while deep networks do not show these divergences. Random feature models can have particular widths that are optimal for generalization at a given data density, while making neural networks as wide or as narrow as possible is always optimal. Moreover, we show that the leading-order correction to the kernel-limit learning curve cannot distinguish between random feature models and deep networks in which all layers are trained. Taken together, our findings begin to elucidate how architectural details affect generalization performance in this simple class of deep regression models.
Affiliation(s)
- Jacob A Zavatone-Veth
- Department of Physics, Harvard University, Cambridge, Massachusetts 02138, USA
- Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA
- William L Tong
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, USA
- Cengiz Pehlevan
- Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, USA
21
Sahs J, Pyle R, Damaraju A, Caro JO, Tavaslioglu O, Lu A, Anselmi F, Patel AB. Shallow Univariate ReLU Networks as Splines: Initialization, Loss Surface, Hessian, and Gradient Flow Dynamics. Front Artif Intell 2022; 5:889981. [PMID: 35647529 PMCID: PMC9131019 DOI: 10.3389/frai.2022.889981]
Abstract
Understanding the learning dynamics and inductive bias of neural networks (NNs) is hindered by the opacity of the relationship between NN parameters and the function represented. This is partly due to symmetries inherent in the NN parameterization, which allow many different parameter settings to yield an identical output function, creating redundant degrees of freedom. The NN parameterization is invariant under two symmetries: permutation of the neurons and a continuous family of transformations of the scale of the weight and bias parameters. We propose taking a quotient with respect to the second symmetry group and reparametrizing ReLU NNs as continuous piecewise-linear splines. Using this spline lens, we study learning dynamics in shallow univariate ReLU NNs, finding unexpected insights and explanations for several perplexing phenomena. We develop a surprisingly simple and transparent view of the structure of the loss surface, including its critical and fixed points, Hessian, and Hessian spectrum. We also show that standard weight initializations yield very flat initial functions, and that this flatness, together with overparametrization and the initial weight scale, is responsible for the strength and type of implicit regularization, consistent with previous work. Our implicit regularization results are complementary to recent work showing that initialization scale critically controls implicit regularization via a kernel-based argument. Overall, removing the weight-scale symmetry enables us to prove these results more simply, to prove new results, and to gain new insights, while offering a far more transparent and intuitive picture. Looking forward, our quotiented spline-based approach will extend naturally to the multivariate and deep settings and, alongside the kernel-based view, we believe it will play a foundational role in efforts to understand neural networks.
Videos of learning dynamics using a spline-based visualization are available at http://shorturl.at/tFWZ2.
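The spline view is easy to verify numerically: a shallow univariate ReLU network is continuous and piecewise linear, with one knot per hidden unit where that unit's pre-activation crosses zero. A minimal sketch, with all sizes and random parameters invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
H = 8                                   # hidden units
w = rng.standard_normal(H)
b = rng.standard_normal(H)
a = rng.standard_normal(H)

def f(x):
    # Shallow univariate ReLU network: a weighted sum of hinge functions
    return np.maximum(np.outer(x, w) + b, 0.0) @ a

# Each hidden unit contributes one spline knot where w_i * x + b_i = 0
knots = np.sort(-b / w)
print(knots)

# Between consecutive knots the function is exactly affine
for lo, hi in zip(knots[:-1], knots[1:]):
    xs = np.linspace(lo, hi, 5)[1:-1]   # points strictly inside the interval
    slopes = np.diff(f(xs)) / np.diff(xs)
    assert np.allclose(slopes, slopes[0])
```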
Affiliation(s)
- Justin Sahs
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, United States
- Ryan Pyle
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, United States
- Aneel Damaraju
- Department of Electrical Engineering, Rice University, Houston, TX, United States
- Josue Ortega Caro
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, United States
- Onur Tavaslioglu
- Department of Computational and Applied Mathematics, Rice University, Houston, TX, United States
- Andy Lu
- Department of Electrical Engineering, Rice University, Houston, TX, United States
- Fabio Anselmi
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, United States
- Ankit B. Patel
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, United States
- Department of Electrical Engineering, Rice University, Houston, TX, United States
22
Hastie T, Montanari A, Rosset S, Tibshirani RJ. Surprises in high-dimensional ridgeless least squares interpolation. Ann Stat 2022; 50:949-986. [PMID: 36120512 PMCID: PMC9481183 DOI: 10.1214/21-aos2133]
Abstract
Interpolators, estimators that achieve zero training error, have attracted growing attention in machine learning, mainly because state-of-the-art neural networks appear to be models of this type. In this paper, we study minimum ℓ2-norm ("ridgeless") interpolation least squares regression, focusing on the high-dimensional regime in which the number of unknown parameters p is of the same order as the number of samples n. We consider two different models for the feature distribution: a linear model, where the feature vectors x_i ∈ ℝ^p are obtained by applying a linear transform to a vector of i.i.d. entries, x_i = Σ^{1/2} z_i (with z_i ∈ ℝ^p); and a nonlinear model, where the feature vectors are obtained by passing the input through a random one-layer neural network, x_i = φ(W z_i) (with z_i ∈ ℝ^d, W ∈ ℝ^{p×d} a matrix of i.i.d. entries, and φ an activation function acting componentwise on W z_i). We recover, in a precise quantitative way, several phenomena that have been observed in large-scale neural networks and kernel machines, including the "double descent" behavior of the prediction risk and the potential benefits of overparametrization.
Affiliation(s)
- Trevor Hastie
- Department of Statistics and Department of Biomedical Data Science, Stanford University
- Andrea Montanari
- Department of Statistics and Department of Electrical Engineering, Stanford University
- Ryan J. Tibshirani
- Department of Statistics and Department of Machine Learning, Carnegie Mellon University
23
Hiratani N, Latham PE. Developmental and evolutionary constraints on olfactory circuit selection. Proc Natl Acad Sci U S A 2022; 119:e2100600119. [PMID: 35263217 PMCID: PMC8931209 DOI: 10.1073/pnas.2100600119]
Abstract
Significance: In this work, we explore the hypothesis that biological neural networks optimize their architecture, through evolution, for learning. We study early olfactory circuits of mammals and insects, which have relatively similar structure but a huge diversity in size. We approximate these circuits as three-layer networks and estimate, analytically, the scaling of the optimal hidden-layer size with input-layer size. We find that both longevity and information in the genome constrain the hidden-layer size, so a range of allometric scalings is possible. However, the experimentally observed allometric scalings in mammals and insects are consistent with biologically plausible values. This analysis should pave the way for a deeper understanding of both biological and artificial networks.
Affiliation(s)
- Naoki Hiratani
- Gatsby Computational Neuroscience Unit, University College London, London W1T 4JG, United Kingdom
- Peter E. Latham
- Gatsby Computational Neuroscience Unit, University College London, London W1T 4JG, United Kingdom
24
Rocks JW, Mehta P. Memorizing without overfitting: bias, variance, and interpolation in overparameterized models. Physical Review Research 2022; 4:013201. [PMID: 36713351 PMCID: PMC9879296 DOI: 10.1103/physrevresearch.4.013201]
Abstract
The bias-variance trade-off is a central concept in supervised learning. In classical statistics, increasing the complexity of a model (e.g., number of parameters) reduces bias but also increases variance. Until recently, it was commonly believed that optimal performance is achieved at intermediate model complexities which strike a balance between bias and variance. Modern Deep Learning methods flout this dogma, achieving state-of-the-art performance using "over-parameterized models" where the number of fit parameters is large enough to perfectly fit the training data. As a result, understanding bias and variance in over-parameterized models has emerged as a fundamental problem in machine learning. Here, we use methods from statistical physics to derive analytic expressions for bias and variance in two minimal models of over-parameterization (linear regression and two-layer neural networks with nonlinear data distributions), allowing us to disentangle properties stemming from the model architecture and random sampling of data. In both models, increasing the number of fit parameters leads to a phase transition where the training error goes to zero and the test error diverges as a result of the variance (while the bias remains finite). Beyond this threshold, the test error of the two-layer neural network decreases due to a monotonic decrease in both the bias and variance in contrast with the classical bias-variance trade-off. We also show that in contrast with classical intuition, over-parameterized models can overfit even in the absence of noise and exhibit bias even if the student and teacher models match. We synthesize these results to construct a holistic understanding of generalization error and the bias-variance trade-off in over-parameterized models and relate our results to random matrix theory.
Affiliation(s)
- Jason W Rocks
- Department of Physics, Boston University, Boston, Massachusetts 02215, USA
- Pankaj Mehta
- Department of Physics, Boston University, Boston, Massachusetts 02215, USA
- Faculty of Computing and Data Sciences, Boston University, Boston, Massachusetts 02215, USA
25
Gerace F, Saglietti L, Sarao Mannelli S, Saxe A, Zdeborová L. Probing transfer learning with a model of synthetic correlated datasets. Machine Learning: Science and Technology 2022. [DOI: 10.1088/2632-2153/ac4f3f]
Abstract
Transfer learning can significantly improve the sample efficiency of neural networks by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable model of synthetic data as a framework for modeling correlation between datasets. This setup allows for an analytic characterization of the generalization performance obtained when transferring the learned feature map from the source to the target task. Focusing on the problem of training two-layer networks in a binary classification setting, we show that our model can capture a range of salient features of transfer learning with real data. Moreover, by exploiting parametric control over the correlation between the two datasets, we systematically investigate under which conditions the transfer of features is beneficial for generalization.
26
D'Amario V, Srivastava S, Sasaki T, Boix X. The Data Efficiency of Deep Learning Is Degraded by Unnecessary Input Dimensions. Front Comput Neurosci 2022; 16:760085. [PMID: 35173595 PMCID: PMC8842477 DOI: 10.3389/fncom.2022.760085]
Abstract
Biological learning systems are outstanding in their ability to learn from limited training data compared to the most successful learning machines, i.e., deep neural networks (DNNs). Which key aspects underlie this data-efficiency gap is an unresolved question at the core of biological and artificial intelligence. We hypothesize that one important aspect is that biological systems rely on mechanisms such as foveation to reduce unnecessary input dimensions for the task at hand, e.g., the background in object recognition, while state-of-the-art DNNs do not. Datasets used to train DNNs often contain such unnecessary input dimensions, and these lead to more trainable parameters. Yet it is not clear whether this affects DNNs' data efficiency, because DNNs are robust to increasing the number of parameters in the hidden layers, and it is uncertain whether this holds true for the input layer. In this paper, we investigate the impact of unnecessary input dimensions on DNNs' data efficiency, namely, the number of examples needed to achieve a given generalization performance. Our results show that task-unrelated input dimensions substantially degrade data efficiency. This highlights the need for mechanisms that remove task-unrelated dimensions, such as foveation for image classification, in order to enable data-efficiency gains.
Affiliation(s)
- Vanessa D'Amario
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States
- Center for Brains, Minds and Machines, Cambridge, MA, United States
- Sanjana Srivastava
- Center for Brains, Minds and Machines, Cambridge, MA, United States
- Department of Computer Science, Stanford University, Stanford, CA, United States
- Tomotake Sasaki
- Artificial Intelligence Laboratory, Fujitsu Limited, Kawasaki, Japan
- Xavier Boix
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States
- Center for Brains, Minds and Machines, Cambridge, MA, United States
27
Schneider J. Correlated Initialization for Correlated Data. Neural Process Lett 2022. [DOI: 10.1007/s11063-021-10728-y]
28
Allaire F, Mallet V, Filippi JB. Emulation of wildland fire spread simulation using deep learning. Neural Netw 2021; 141:184-198. [PMID: 33906084 DOI: 10.1016/j.neunet.2021.04.006]
Abstract
Numerical simulation of wildland fire spread is useful for predicting the locations that are likely to burn and for supporting decisions in an operational context, notably in crisis situations and long-term planning. In the short term, the computational time of traditional simulators is too high to be tractable over large zones such as a country or part of a country, especially for fire danger mapping. This issue is tackled by emulating the area of the burned surface returned after simulation of a fire igniting anywhere on the island of Corsica and spreading freely for one hour, under a wide range of possible environmental input conditions. A deep neural network with a hybrid architecture is used to account for two types of inputs: the spatial fields describing the surrounding landscape and the remaining scalar inputs. After training on a large simulation dataset, the network shows a satisfactory approximation error on a complementary test dataset, with a MAPE of 32.8%. The convolutional part is pre-computed, and the emulator is defined as the remaining part of the network, saving significant computational time. On a 32-core machine, the emulator has a speed-up factor of several thousand compared to the simulator, and the overall relationship between its inputs and output is consistent with the expected physical behavior of fire spread. This reduction in computational time allows the computation of a one-hour burned-area map for the whole island of Corsica in less than a minute, opening new applications in short-term fire danger mapping.
Affiliation(s)
- Frédéric Allaire
- Institut national de recherche en informatique et en automatique (INRIA), 2 rue Simone Iff, Paris, France; Sorbonne Université, Laboratoire Jacques-Louis Lions, France.
- Vivien Mallet
- Institut national de recherche en informatique et en automatique (INRIA), 2 rue Simone Iff, Paris, France; Sorbonne Université, Laboratoire Jacques-Louis Lions, France
- Jean-Baptiste Filippi
- Centre national de la recherche scientifique (CNRS), Sciences pour l'Environnement - Unité Mixte de Recherche 6134, Università di Corsica, Campus Grossetti, Corte, France
29
A Statistician Teaches Deep Learning. Journal of Statistical Theory and Practice 2021. [DOI: 10.1007/s42519-021-00193-0]
30
Rossbroich J, Trotter D, Beninger J, Tóth K, Naud R. Linear-nonlinear cascades capture synaptic dynamics. PLoS Comput Biol 2021; 17:e1008013. [PMID: 33720935 PMCID: PMC7993773 DOI: 10.1371/journal.pcbi.1008013]
Abstract
Short-term synaptic dynamics differ markedly across connections and strongly regulate how action potentials communicate information. To model the range of synaptic dynamics observed in experiments, we have developed a flexible mathematical framework based on a linear-nonlinear operation. This model can capture various experimentally observed features of synaptic dynamics and different types of heteroskedasticity. Despite its conceptual simplicity, we show that it is more adaptable than previous models. Combined with a standard maximum likelihood approach, synaptic dynamics can be accurately and efficiently characterized using naturalistic stimulation patterns. These results make explicit that synaptic processing bears algorithmic similarities with information processing in convolutional neural networks.
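The linear-nonlinear idea can be sketched schematically: convolve the presynaptic spike train with a linear kernel, then squash the result through a sigmoidal nonlinearity to obtain a release efficacy per time bin. This is not the paper's fitted model; the kernel shape and all parameters below are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(6)

def linear_nonlinear_efficacy(spikes, kernel, b):
    # Linear stage: causal convolution of the spike train with a kernel
    drive = np.convolve(spikes, kernel)[:len(spikes)]
    # Nonlinear stage: sigmoidal readout gives an efficacy in (0, 1)
    return 1.0 / (1.0 + np.exp(-(drive + b)))

spikes = (rng.random(200) < 0.05).astype(float)   # 5% firing probability per bin
kernel = -2.0 * np.exp(-np.arange(50) / 10.0)     # negative kernel mimics depression
eff = linear_nonlinear_efficacy(spikes, kernel, b=1.0)
print(eff[:5])
```

With a negative kernel, efficacy dips after each spike and recovers with the kernel's time constant, a cartoon of short-term depression.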
Affiliation(s)
- Julian Rossbroich
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
- Daniel Trotter
- Department of Physics, University of Ottawa, Ottawa, ON, Canada
- John Beninger
- uOttawa Brain Mind Institute, Center for Neural Dynamics, Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON, Canada
- Katalin Tóth
- uOttawa Brain Mind Institute, Center for Neural Dynamics, Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON, Canada
- Richard Naud
- Department of Physics, University of Ottawa, Ottawa, ON, Canada
- uOttawa Brain Mind Institute, Center for Neural Dynamics, Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON, Canada
31
Steinberg J, Advani M, Sompolinsky H. New role for circuit expansion for learning in neural networks. Phys Rev E 2021; 103:022404. [PMID: 33736047 DOI: 10.1103/physreve.103.022404]
Abstract
Many sensory pathways in the brain include sparsely active populations of neurons downstream from the input stimuli. The biological purpose of this expanded structure is unclear, but it may be beneficial due to the increased expressive power of the network. In this work, we show that certain ways of expanding a neural network can improve its generalization performance even when the expanded structure is pruned after the learning period. To study this setting, we use a teacher-student framework where a perceptron teacher network generates labels corrupted with small amounts of noise. We then train a student network structurally matched to the teacher. In this scenario, the student can achieve optimal accuracy if given the teacher's synaptic weights. We find that sparse expansion of the input layer of a student perceptron network both increases its capacity and improves its generalization performance when learning a noisy rule from a teacher perceptron, even when the expansion is pruned after learning. We find similar behavior when the expanded units are stochastic and uncorrelated with the input, and we analyze this network in the mean-field limit. By solving the mean-field equations, we show that the generalization error of the stochastic expanded student network continues to drop as the size of the network increases. This improvement in generalization performance occurs despite the increased complexity of the student network relative to the teacher it is trying to learn. We show that this effect is closely related to the addition of slack variables in artificial neural networks and suggest possible implications for artificial and biological neural networks.
Affiliation(s)
- Julia Steinberg
- Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA
- Department of Physics, Harvard University, Cambridge, Massachusetts 02138, USA
- Madhu Advani
- Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA
- Haim Sompolinsky
- Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA
- Edmond and Lily Safra Center for Brain Sciences, Hebrew University, Jerusalem 91904, Israel
32
Goldt S, Advani MS, Saxe AM, Krzakala F, Zdeborová L. Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup. Journal of Statistical Mechanics (Online) 2020; 2020:124010. [PMID: 34262607 PMCID: PMC8252911 DOI: 10.1088/1742-5468/abc61e]
Abstract
Deep neural networks achieve stellar generalisation even when they have enough parameters to easily fit all their training data. We study this phenomenon by analysing the dynamics and the performance of over-parameterised two-layer neural networks in the teacher-student setup, where one network, the student, is trained on data generated by another network, called the teacher. We show how the dynamics of stochastic gradient descent (SGD) is captured by a set of differential equations and prove that this description is asymptotically exact in the limit of large inputs. Using this framework, we calculate the final generalisation error of student networks that have more parameters than their teachers. We find that the final generalisation error of the student increases with network size when training only the first layer, but stays constant or even decreases with size when training both layers. We show that these different behaviours have their root in the different solutions SGD finds for different activation functions. Our results indicate that achieving good generalisation in neural networks goes beyond the properties of SGD alone and depends on the interplay of at least the algorithm, the model architecture, and the data set.
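A minimal teacher-student simulation in this spirit, assuming tanh activations, a fixed second layer, and illustrative sizes (a sketch of the setup, not the paper's exact model or its ODE analysis): the over-parameterised student is trained by online SGD on labels generated by a smaller teacher, and its generalisation error falls during training.

```python
import numpy as np

rng = np.random.default_rng(4)
d, k, m = 50, 2, 8          # input dimension, teacher width, student width

# Teacher network that generates the labels (a small soft committee machine)
Wt = rng.standard_normal((k, d)) / np.sqrt(d)
def teacher(X):
    return np.tanh(X @ Wt.T).sum(axis=1)

def gen_error(W, n_test=2000):
    # Generalisation error on fresh Gaussian inputs
    X = rng.standard_normal((n_test, d))
    return np.mean((np.tanh(X @ W.T).sum(axis=1) - teacher(X)) ** 2)

# Over-parameterised student (m > k); train only the first layer by online SGD
Ws = 0.1 * rng.standard_normal((m, d)) / np.sqrt(d)
e0 = gen_error(Ws)
lr = 0.1 / d                # small learning rate for stability
for _ in range(50_000):
    x = rng.standard_normal(d)
    h = np.tanh(Ws @ x)
    err = h.sum() - teacher(x[None])[0]
    Ws -= lr * err * np.outer(1 - h ** 2, x)   # gradient of the loss 0.5 * err**2

print(e0, gen_error(Ws))    # generalisation error drops during training
```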
Affiliation(s)
- Sebastian Goldt
- Institut de Physique Théorique, CNRS, CEA, Université Paris-Saclay, France
- Madhu S Advani
- Center for Brain Science, Harvard University, Cambridge, MA 02138, United States of America
- Andrew M Saxe
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
- Florent Krzakala
- Laboratoire de Physique Statistique, Sorbonne Universités, Université Pierre et Marie Curie Paris 6, Ecole Normale Supérieure, 75005 Paris, France
- Lenka Zdeborová
- Institut de Physique Théorique, CNRS, CEA, Université Paris-Saclay, France