1. Agliari E, Alemanno F, Aquaro M, Fachechi A. Regularization, early-stopping and dreaming: A Hopfield-like setup to address generalization and overfitting. Neural Netw 2024; 177:106389. PMID: 38788291. DOI: 10.1016/j.neunet.2024.106389.
Abstract
In this work we approach attractor neural networks from a machine learning perspective: we look for optimal network parameters by applying gradient descent over a regularized loss function. Within this framework, the optimal neuron-interaction matrices turn out to be a class of matrices corresponding to Hebbian kernels revised by a reiterated unlearning protocol. Remarkably, the extent of such unlearning is proved to be related to the regularization hyperparameter of the loss function and to the training time. Thus, we can design strategies to avoid overfitting that are formulated in terms of regularization and early-stopping tuning. The generalization capabilities of these attractor networks are also investigated: analytical results are obtained for random synthetic datasets; next, the emerging picture is corroborated by numerical experiments that highlight the existence of several regimes (i.e., overfitting, failure and success) as the dataset parameters are varied.
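As a concrete companion to the abstract, the sketch below (not the authors' code; the sizes and the "dreaming time" t are illustrative, and the paper's mapping from the regularization hyperparameter and training time onto t is left abstract here) builds the Hebbian coupling matrix for random binary patterns, applies the closed-form unlearning revision used in the dreaming literature (written out in the docstring), and checks how pattern stability changes with t.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 200, 20                               # neurons and stored patterns (illustrative sizes)

xi = rng.choice([-1.0, 1.0], size=(K, N))    # K random binary patterns
C = xi @ xi.T / N                            # K x K pattern-overlap (correlation) matrix

def coupling(t):
    """Hebbian kernel revised by reiterated unlearning ("dreaming"):
    J(t) = (1 + t)/N * xi^T (I + t C)^{-1} xi.
    t = 0 recovers the plain Hebbian rule; t -> infinity approaches the projector rule."""
    J = (1.0 + t) / N * xi.T @ np.linalg.solve(np.eye(K) + t * C, xi)
    np.fill_diagonal(J, 0.0)                 # no self-couplings
    return J

for t in (0.0, 1.0, 5.0, 50.0):
    J = coupling(t)
    stable = np.mean(np.sign(xi @ J) == xi)  # fraction of spins aligned with their local field
    print(f"t = {t:5.1f}: fraction of stable pattern spins = {stable:.3f}")
```

In the paper, t is not a free knob: its effective value is tied to the ridge penalty and to how long gradient descent runs, which is what turns regularization and early stopping into overfitting controls.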
Affiliation(s)
- E Agliari: Dipartimento di Matematica "Guido Castelnuovo", Sapienza Università di Roma, Italy; GNFM-INdAM, Gruppo Nazionale di Fisica Matematica (Istituto Nazionale di Alta Matematica), Italy.
- F Alemanno: Dipartimento di Matematica, Università di Bologna, Italy; GNFM-INdAM, Gruppo Nazionale di Fisica Matematica (Istituto Nazionale di Alta Matematica), Italy.
- M Aquaro: Dipartimento di Matematica "Guido Castelnuovo", Sapienza Università di Roma, Italy; GNFM-INdAM, Gruppo Nazionale di Fisica Matematica (Istituto Nazionale di Alta Matematica), Italy.
- A Fachechi: Dipartimento di Matematica "Guido Castelnuovo", Sapienza Università di Roma, Italy; GNFM-INdAM, Gruppo Nazionale di Fisica Matematica (Istituto Nazionale di Alta Matematica), Italy.
2. Bahri Y, Dyer E, Kaplan J, Lee J, Sharma U. Explaining neural scaling laws. Proc Natl Acad Sci U S A 2024; 121:e2311878121. PMID: 38913889. PMCID: PMC11228526. DOI: 10.1073/pnas.2311878121.
Abstract
The population loss of trained deep neural networks often follows precise power-law scaling relations with either the size of the training dataset or the number of parameters in the network. We propose a theory that explains the origins of and connects these scaling laws. We identify variance-limited and resolution-limited scaling behavior for both dataset and model size, for a total of four scaling regimes. The variance-limited scaling follows simply from the existence of a well-behaved infinite data or infinite width limit, while the resolution-limited regime can be explained by positing that models are effectively resolving a smooth data manifold. In the large width limit, this can be equivalently obtained from the spectrum of certain kernels, and we present evidence that large width and large dataset resolution-limited scaling exponents are related by a duality. We exhibit all four scaling regimes in the controlled setting of large random feature and pretrained models and test the predictions empirically on a range of standard architectures and datasets. We also observe several empirical relationships between datasets and scaling exponents under modifications of task and architecture aspect ratio. Our work provides a taxonomy for classifying different scaling regimes, underscores that there can be different mechanisms driving improvements in loss, and lends insight into the microscopic origin and relationships between scaling exponents.
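A hedged, minimal illustration of what a resolution-limited scaling law looks like operationally (this is not the authors' pipeline; the curve, exponent, and noise level are made up for the example): generate a loss curve of the form L(D) = c·D^(-alpha) + L_inf and recover the exponent with a log-log fit.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic resolution-limited learning curve: L(D) = c * D**(-alpha) + L_inf (illustrative values).
alpha_true, c, L_inf = 0.35, 2.0, 0.05
D = np.logspace(2, 6, num=20)                    # dataset sizes
loss = c * D ** (-alpha_true) * np.exp(0.02 * rng.standard_normal(D.size)) + L_inf

# On log-log axes the excess loss is linear: log(L - L_inf) = log c - alpha * log D.
# In real experiments L_inf must itself be estimated; here it is assumed known for simplicity.
slope, _ = np.polyfit(np.log(D), np.log(loss - L_inf), deg=1)
print(f"fitted scaling exponent: {-slope:.3f}  (ground truth: {alpha_true})")
```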
Affiliation(s)
- Jared Kaplan: Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218.
- Utkarsh Sharma: Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218.
3. Gerace F, Krzakala F, Loureiro B, Stephan L, Zdeborová L. Gaussian universality of perceptrons with random labels. Phys Rev E 2024; 109:034305. PMID: 38632742. DOI: 10.1103/physreve.109.034305.
Abstract
While classical in many theoretical settings, in particular in works inspired by statistical physics, the assumption of Gaussian i.i.d. input data is often perceived as a strong limitation in the context of statistics and machine learning. In this study, we redeem this line of work in the case of generalized linear classification, also known as the perceptron model, with random labels. We argue that there is a large universality class of high-dimensional input data for which we obtain the same minimum training loss as for Gaussian data with corresponding data covariance. In the limit of vanishing regularization, we further demonstrate that the training loss is independent of the data covariance. On the theoretical side, we prove this universality for an arbitrary mixture of homogeneous Gaussian clouds. Empirically, we show that the universality also holds for a broad range of real data sets.
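The universality claim lends itself to a quick numerical sanity check. The sketch below uses simplified assumptions (square loss with a small ridge penalty standing in for a generic convex loss, scaled Rademacher inputs as the non-Gaussian data; none of this is the paper's exact protocol) and compares the minimum regularized training loss on random labels for non-Gaussian inputs and for Gaussian inputs with matching covariance.

```python
import numpy as np

rng = np.random.default_rng(2)
n, d, lam = 400, 800, 1e-2                    # samples, dimension, ridge strength (illustrative)

def min_train_loss(X, y, lam):
    """Minimize (1/2n) * ||y - Xw||^2 + (lam/2) * ||w||^2 and return the optimal objective value."""
    w = np.linalg.solve(X.T @ X / len(y) + lam * np.eye(X.shape[1]), X.T @ y / len(y))
    return 0.5 * np.mean((y - X @ w) ** 2) + 0.5 * lam * w @ w

y = rng.choice([-1.0, 1.0], size=n)           # random labels

scales = np.linspace(0.5, 1.5, d)             # anisotropic coordinate-wise scales
X_nongauss = rng.choice([-1.0, 1.0], size=(n, d)) * scales   # scaled Rademacher inputs
X_gauss = rng.standard_normal((n, d)) * scales                # Gaussian inputs with the same covariance

print("min training loss, non-Gaussian inputs:", round(min_train_loss(X_nongauss, y, lam), 4))
print("min training loss, Gaussian surrogate :", round(min_train_loss(X_gauss, y, lam), 4))
```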
Affiliation(s)
- Federica Gerace: International School of Advanced Studies (SISSA), Via Bonomea 265, 34136 Trieste, Italy; EPFL, Statistical Physics of Computation (SPOC) Laboratory, Rte Cantonale, 1015 Lausanne, Switzerland.
- Florent Krzakala: EPFL, Information, Learning and Physics (IdePHICS) Laboratory, Rte Cantonale, 1015 Lausanne, Switzerland.
- Bruno Loureiro: EPFL, Information, Learning and Physics (IdePHICS) Laboratory, Rte Cantonale, 1015 Lausanne, Switzerland; Département d'Informatique, École Normale Supérieure (ENS)-PSL & CNRS, F-75230 Paris Cedex 05, France.
- Ludovic Stephan: EPFL, Information, Learning and Physics (IdePHICS) Laboratory, Rte Cantonale, 1015 Lausanne, Switzerland.
- Lenka Zdeborová: EPFL, Statistical Physics of Computation (SPOC) Laboratory, Rte Cantonale, 1015 Lausanne, Switzerland.
4. Ruben BS, Pehlevan C. Learning Curves for Noisy Heterogeneous Feature-Subsampled Ridge Ensembles. arXiv 2024; arXiv:2307.03176v3. PMID: 37461424. PMCID: PMC10350086.
Abstract
Feature bagging is a well-established ensembling method which aims to reduce prediction variance by combining predictions of many estimators trained on subsets or projections of features. Here, we develop a theory of feature-bagging in noisy least-squares ridge ensembles and simplify the resulting learning curves in the special case of equicorrelated data. Using analytical learning curves, we demonstrate that subsampling shifts the double-descent peak of a linear predictor. This leads us to introduce heterogeneous feature ensembling, with estimators built on varying numbers of feature dimensions, as a computationally efficient method to mitigate double-descent. Then, we compare the performance of a feature-subsampling ensemble to a single linear predictor, describing a trade-off between noise amplification due to subsampling and noise reduction due to ensembling. Our qualitative insights carry over to linear classifiers applied to image classification tasks with realistic datasets constructed using a state-of-the-art deep learning feature map.
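To make the construction concrete, here is a small, illustrative feature-bagging ridge ensemble on equicorrelated Gaussian data (the sizes, correlation level, noise, and ridge strength below are assumptions for the example, not the paper's settings), compared against a single ridge predictor trained on all features.

```python
import numpy as np

rng = np.random.default_rng(3)
n_train, n_test, d, s = 100, 2000, 200, 0.3       # samples, features, equicorrelation (illustrative)
sigma, lam, k, n_members = 0.5, 1e-3, 60, 10       # label noise, ridge, features per member, members

# Equicorrelated features: covariance (1 - s) * I + s * 11^T.
L = np.linalg.cholesky((1 - s) * np.eye(d) + s * np.ones((d, d)))
w_star = rng.standard_normal(d) / np.sqrt(d)       # teacher weights

def sample(n):
    X = rng.standard_normal((n, d)) @ L.T
    return X, X @ w_star + sigma * rng.standard_normal(n)

def ridge(X, y, lam):
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

X_tr, y_tr = sample(n_train)
X_te, y_te = sample(n_test)

# Single predictor using all features.
w_full = ridge(X_tr, y_tr, lam)
err_full = np.mean((X_te @ w_full - y_te) ** 2)

# Feature-subsampled ensemble: each member sees k randomly chosen coordinates; predictions averaged.
preds = np.zeros(n_test)
for _ in range(n_members):
    idx = rng.choice(d, size=k, replace=False)
    w_m = ridge(X_tr[:, idx], y_tr, lam)
    preds += X_te[:, idx] @ w_m
preds /= n_members

print(f"test MSE, single ridge : {err_full:.3f}")
print(f"test MSE, ensemble     : {np.mean((preds - y_te) ** 2):.3f}")
```

Varying k relative to the number of training samples moves each member through its own interpolation threshold, which is the mechanism behind the shifted double-descent peak described in the abstract.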
Affiliation(s)
- Cengiz Pehlevan: Center for Brain Science; John A. Paulson School of Engineering and Applied Sciences; and Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA 02138.
5. Hanin B, Zlokapa A. Bayesian interpolation with deep linear networks. Proc Natl Acad Sci U S A 2023; 120:e2301345120. PMID: 37252994. PMCID: PMC10266010. DOI: 10.1073/pnas.2301345120.
Abstract
Characterizing how neural network depth, width, and dataset size jointly impact model quality is a central problem in deep learning theory. We give here a complete solution in the special case of linear networks with output dimension one trained using zero noise Bayesian inference with Gaussian weight priors and mean squared error as a negative log-likelihood. For any training dataset, network depth, and hidden layer widths, we find non-asymptotic expressions for the predictive posterior and Bayesian model evidence in terms of Meijer-G functions, a class of meromorphic special functions of a single complex variable. Through novel asymptotic expansions of these Meijer-G functions, a rich new picture of the joint role of depth, width, and dataset size emerges. We show that linear networks make provably optimal predictions at infinite depth: the posterior of infinitely deep linear networks with data-agnostic priors is the same as that of shallow networks with evidence-maximizing data-dependent priors. This yields a principled reason to prefer deeper networks when priors are forced to be data-agnostic. Moreover, we show that with data-agnostic priors, Bayesian model evidence in wide linear networks is maximized at infinite depth, elucidating the salutary role of increased depth for model selection. Underpinning our results is a novel emergent notion of effective depth, given by the number of hidden layers times the number of data points divided by the network width; this determines the structure of the posterior in the large-data limit.
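The "effective depth" invoked at the end of the abstract can be written compactly; a hedged restatement (notation and normalization are read off the abstract, so the paper's precise definition may differ in constants):

```latex
% L = number of hidden layers, P = number of data points, N = hidden-layer width.
\lambda_{\mathrm{eff}} \;=\; \frac{L\,P}{N}
```

According to the abstract, it is this single combination, rather than depth, width, or dataset size separately, that controls the structure of the posterior in the large-data limit.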
Affiliation(s)
- Boris Hanin: Department of Operations Research and Financial Engineering, Princeton University, Princeton, NJ 08540.
- Alexander Zlokapa: Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139; Google Quantum AI, Venice, CA 90291.
6. Okuno A, Yano K. A generalization gap estimation for overparameterized models via the Langevin functional variance. J Comput Graph Stat 2023. DOI: 10.1080/10618600.2023.2197488.
Affiliation(s)
- Akifumi Okuno: The Institute of Statistical Mathematics and RIKEN AIP.
7. DeBacker JR, McMillan GP, Martchenke N, Lacey CM, Stuehm HR, Hungerford ME, Konrad-Martin D. Ototoxicity prognostic models in adult and pediatric cancer patients: a rapid review. J Cancer Surviv 2023; 17:82-100. PMID: 36729346. DOI: 10.1007/s11764-022-01315-8.
Abstract
PURPOSE: A cornerstone of treatment for many cancers is the administration of platinum-based chemotherapies and/or ionizing radiation, which can be ototoxic. An accurate ototoxicity risk assessment would be useful for counseling, treatment planning, and survivorship follow-up in patients with cancer.
METHODS: This review evaluated the literature on predictive models for estimating a patient's risk of chemotherapy-related auditory injury, with the aim of accelerating development of computational approaches for the clinical management of ototoxicity in cancer patients. Of the 1195 articles identified in a PubMed search from 2010 forward, 15 studies met inclusion criteria for the review.
CONCLUSIONS: All but 1 study used an abstraction of the audiogram as the modeled outcome; however, specific outcome measures varied. Consistently used predictors were age, baseline hearing, cumulative cisplatin dose, and radiation dose to the cochlea. Just 5 studies were judged to have an overall low risk of bias. Future studies should attempt to minimize bias by following statistical best practices, including not selecting multivariate predictors based on univariate analysis, validating in independent cohorts, and clearly reporting the management of missing and censored data. Future modeling efforts should adopt a transdisciplinary approach to define a unified set of clinical, treatment, and/or genetic risk factors. Creating a flexible model that uses a common set of predictors to forecast the full post-treatment audiogram may accelerate work in this area. Such a model could be adapted for use in counseling, treatment planning, and follow-up by audiologists and oncologists, and could be incorporated into ototoxicity genetic association studies as well as clinical trials investigating otoprotective agents.
IMPLICATIONS FOR CANCER SURVIVORS: Improvements in the ability to model post-treatment hearing loss can help improve patient quality of life following cancer care. The improvements advocated in this review should accelerate advances in modeling the auditory impact of these treatments to support treatment planning and patient counseling during and after care.
Affiliation(s)
- J R DeBacker, G P McMillan, N Martchenke, H R Stuehm, M E Hungerford, D Konrad-Martin: VA RR&D National Center for Rehabilitative Auditory Research, VA Portland Health Care System, 3710 SW US Veterans Hospital Road (NCRAR - P5), Portland, OR 97239, USA; Oregon Health and Science University, Portland, OR, USA.
- C M Lacey: VA RR&D National Center for Rehabilitative Auditory Research, VA Portland Health Care System, 3710 SW US Veterans Hospital Road (NCRAR - P5), Portland, OR 97239, USA; University of Pittsburgh, Pittsburgh, PA, USA.
8. Anceschi N, Fasano A, Durante D, Zanella G. Bayesian Conjugacy in Probit, Tobit, Multinomial Probit and Extensions: A Review and New Results. J Am Stat Assoc 2023. DOI: 10.1080/01621459.2023.2169150.
Affiliation(s)
- Niccolò Anceschi: Department of Decision Sciences and Bocconi Institute for Data Science and Analytics, Bocconi University, Milan, Italy.
- Daniele Durante: Department of Decision Sciences and Bocconi Institute for Data Science and Analytics, Bocconi University, Milan, Italy.
- Giacomo Zanella: Department of Decision Sciences and Bocconi Institute for Data Science and Analytics, Bocconi University, Milan, Italy.
9. Ngampruetikorn V, Schwab DJ. Information bottleneck theory of high-dimensional regression: relevancy, efficiency and optimality. Adv Neural Inf Process Syst 2022; 35:9784-9796. PMID: 37332888. PMCID: PMC10275337.
Abstract
Avoiding overfitting is a central challenge in machine learning, yet many large neural networks readily achieve zero training loss. This puzzling contradiction necessitates new approaches to the study of overfitting. Here we quantify overfitting via residual information, defined as the bits in fitted models that encode noise in training data. Information efficient learning algorithms minimize residual information while maximizing the relevant bits, which are predictive of the unknown generative models. We solve this optimization to obtain the information content of optimal algorithms for a linear regression problem and compare it to that of randomized ridge regression. Our results demonstrate the fundamental trade-off between residual and relevant information and characterize the relative information efficiency of randomized regression with respect to optimal algorithms. Finally, using results from random matrix theory, we reveal the information complexity of learning a linear map in high dimensions and unveil information-theoretic analogs of double and multiple descent phenomena.
Affiliation(s)
- David J. Schwab: Initiative for the Theoretical Sciences, The Graduate Center, CUNY.
10. Ma X, Sardy S, Hengartner N, Bobenko N, Lin YT. A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks. Stat Comput 2022; 32:99. PMID: 36299529. PMCID: PMC9587964. DOI: 10.1007/s11222-022-10169-0.
Abstract
To fit sparse linear associations, a LASSO sparsity-inducing penalty with a single hyperparameter provably allows recovery of the important features (needles) with high probability in certain regimes, even if the sample size is smaller than the dimension of the input vector (haystack). More recently, learners known as artificial neural networks (ANNs) have shown great success in many machine learning tasks, in particular in fitting nonlinear associations. Small learning rates, the stochastic gradient descent algorithm, and large training sets help cope with the explosion in the number of parameters present in deep neural networks. Yet few ANN learners have been developed and studied to find needles in nonlinear haystacks. Driven by a single hyperparameter, our ANN learner, as in the sparse linear case, exhibits a phase transition in the probability of retrieving the needles, which we do not observe with other ANN learners. To select our penalty parameter, we generalize the universal threshold of Donoho and Johnstone (Biometrika 81(3):425-455, 1994); this rule outperforms cross-validation, which is conservative (too many false detections) and computationally expensive. In the spirit of simulated annealing, we propose a warm-start, sparsity-inducing algorithm to solve the high-dimensional, non-convex and non-differentiable optimization problem. We perform Monte Carlo experiments on simulated and real data to quantify the effectiveness of our approach.
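For orientation, the classical universal threshold of Donoho and Johnstone (1994) that the authors generalize has the following form for n observations with noise standard deviation sigma (the paper's generalization to nonlinear ANN learners differs in detail; this is only the standard starting point):

```latex
% Classical universal threshold for n i.i.d. N(0, sigma^2) noise terms.
\lambda_n \;=\; \sigma \sqrt{2 \log n}
```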
Affiliation(s)
- Xiaoyu Ma: Shandong University, Jinan, China; Department of Mathematics, University of Geneva, Geneva, Switzerland.
- Sylvain Sardy: Department of Mathematics, University of Geneva, Geneva, Switzerland.
- Nick Hengartner: Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, USA.
- Nikolai Bobenko: Department of Mathematics, University of Geneva, Geneva, Switzerland.
- Yen Ting Lin: Information Sciences Group, Los Alamos National Laboratory, Los Alamos, USA.
11. Montanari A, Zhong Y. The interpolation phase transition in neural networks: Memorization and generalization under lazy training. Ann Stat 2022. DOI: 10.1214/22-aos2211.
Affiliation(s)
- Andrea Montanari: Department of Electrical Engineering and Department of Statistics, Stanford University.
- Yiqiao Zhong: Department of Electrical Engineering and Department of Statistics, Stanford University.
12. Chinot G, Löffler M, van de Geer S. On the robustness of minimum norm interpolators and regularized empirical risk minimizers. Ann Stat 2022. DOI: 10.1214/22-aos2190.
Affiliation(s)
- Geoffrey Chinot: Seminar for Statistics, Department of Mathematics, ETH Zürich.
13. Javanmard A, Soltanolkotabi M. Precise statistical analysis of classification accuracies for adversarial training. Ann Stat 2022. DOI: 10.1214/22-aos2180.
Affiliation(s)
- Adel Javanmard: Department of Data Sciences and Operations, University of Southern California.
- Mahdi Soltanolkotabi: Department of Electrical and Computer Engineering, University of Southern California.
14. Ariosto S, Pacelli R, Ginelli F, Gherardi M, Rotondo P. Universal mean-field upper bound for the generalization gap of deep neural networks. Phys Rev E 2022; 105:064309. PMID: 35854557. DOI: 10.1103/physreve.105.064309.
Abstract
Modern deep neural networks (DNNs) represent a formidable challenge for theorists: according to the commonly accepted probabilistic framework that describes their performance, these architectures should overfit due to the huge number of parameters to train, but in practice they do not. Here we employ results from replica mean-field theory to compute the generalization gap of machine learning models with quenched features, in the teacher-student scenario and for regression problems with a quadratic loss function. Notably, this framework includes the case of DNNs where the last layer is optimized given a specific realization of the remaining weights. We show how these results, combined with ideas from statistical learning theory, provide a stringent asymptotic upper bound on the generalization gap of fully trained DNNs as a function of the dataset size P. In particular, in the limit of large P and N_out (where N_out is the size of the last layer) with N_out ≪ P, the generalization gap approaches zero faster than 2N_out/P, for any choice of both architecture and teacher function. This result greatly improves existing bounds from statistical learning theory. We test our predictions on a broad range of architectures, from toy fully connected neural networks with few hidden layers to state-of-the-art deep convolutional neural networks.
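Restated in display form, the bound described in the abstract reads (with the generalization gap understood as test loss minus training loss; constants and exact conditions are in the paper):

```latex
% Asymptotic bound for large P and N_out with N_out << P, any architecture and teacher function.
\epsilon_g \;\lesssim\; \frac{2\,N_{\mathrm{out}}}{P}
```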
Affiliation(s)
- S Ariosto: Dipartimento di Scienza e Alta Tecnologia and Center for Nonlinear and Complex Systems, Università degli Studi dell'Insubria, Via Valleggio 11, 22100 Como, Italy; I.N.F.N. Sezione di Milano, Via Celoria 16, 20133 Milan, Italy.
- R Pacelli: Dipartimento di Scienza Applicata e Tecnologia, Politecnico di Torino, 10129 Turin, Italy.
- F Ginelli: Dipartimento di Scienza e Alta Tecnologia and Center for Nonlinear and Complex Systems, Università degli Studi dell'Insubria, Via Valleggio 11, 22100 Como, Italy; I.N.F.N. Sezione di Milano, Via Celoria 16, 20133 Milan, Italy.
- M Gherardi: I.N.F.N. Sezione di Milano, Via Celoria 16, 20133 Milan, Italy; Università degli Studi di Milano, Via Celoria 16, 20133 Milan, Italy.
- P Rotondo: I.N.F.N. Sezione di Milano, Via Celoria 16, 20133 Milan, Italy; Università degli Studi di Milano, Via Celoria 16, 20133 Milan, Italy.
15. Rocks JW, Mehta P. Memorizing without overfitting: Bias, variance, and interpolation in overparameterized models. Phys Rev Res 2022; 4:013201. PMID: 36713351. PMCID: PMC9879296. DOI: 10.1103/physrevresearch.4.013201.
Abstract
The bias-variance trade-off is a central concept in supervised learning. In classical statistics, increasing the complexity of a model (e.g., number of parameters) reduces bias but also increases variance. Until recently, it was commonly believed that optimal performance is achieved at intermediate model complexities which strike a balance between bias and variance. Modern Deep Learning methods flout this dogma, achieving state-of-the-art performance using "over-parameterized models" where the number of fit parameters is large enough to perfectly fit the training data. As a result, understanding bias and variance in over-parameterized models has emerged as a fundamental problem in machine learning. Here, we use methods from statistical physics to derive analytic expressions for bias and variance in two minimal models of over-parameterization (linear regression and two-layer neural networks with nonlinear data distributions), allowing us to disentangle properties stemming from the model architecture and random sampling of data. In both models, increasing the number of fit parameters leads to a phase transition where the training error goes to zero and the test error diverges as a result of the variance (while the bias remains finite). Beyond this threshold, the test error of the two-layer neural network decreases due to a monotonic decrease in both the bias and variance in contrast with the classical bias-variance trade-off. We also show that in contrast with classical intuition, over-parameterized models can overfit even in the absence of noise and exhibit bias even if the student and teacher models match. We synthesize these results to construct a holistic understanding of generalization error and the bias-variance trade-off in over-parameterized models and relate our results to random matrix theory.
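A minimal sketch of the interpolation transition described here, using an ordinary least-squares student on random linear projections (an illustrative stand-in; the paper's minimal models, sizes, and noise level differ): as the number of fit parameters p crosses the number of training samples, the training error reaches zero while the test error peaks and then descends again.

```python
import numpy as np

rng = np.random.default_rng(4)
n_train, n_test, d, sigma = 100, 2000, 400, 0.2        # samples, input dim, label noise (illustrative)

w_star = rng.standard_normal(d) / np.sqrt(d)            # teacher weights
X_tr = rng.standard_normal((n_train, d))
X_te = rng.standard_normal((n_test, d))
y_tr = X_tr @ w_star + sigma * rng.standard_normal(n_train)
y_te = X_te @ w_star + sigma * rng.standard_normal(n_test)

# Student: p random linear features, fit by (minimum-norm) least squares.
for p in (20, 50, 90, 100, 110, 200, 400):
    F = rng.standard_normal((p, d)) / np.sqrt(d)        # random projection defining the features
    Phi_tr, Phi_te = X_tr @ F.T, X_te @ F.T
    w = np.linalg.lstsq(Phi_tr, y_tr, rcond=None)[0]    # minimum-norm solution once p >= n_train
    tr = np.mean((Phi_tr @ w - y_tr) ** 2)
    te = np.mean((Phi_te @ w - y_te) ** 2)
    print(f"p = {p:4d}: train MSE = {tr:6.3f}, test MSE = {te:8.3f}")
```

The qualitative shape, training error pinned at zero past p = n_train while the test error spikes near the interpolation threshold and then falls, mirrors the phase transition described in the abstract.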
Affiliation(s)
- Jason W Rocks: Department of Physics, Boston University, Boston, Massachusetts 02215, USA.
- Pankaj Mehta: Department of Physics, Boston University, Boston, Massachusetts 02215, USA; Faculty of Computing and Data Sciences, Boston University, Boston, Massachusetts 02215, USA.
16. Chen X, Liu Q, Tong XT. Dimension independent excess risk by stochastic gradient descent. Electron J Stat 2022. DOI: 10.1214/22-ejs2055.
Affiliation(s)
- Xi Chen: Stern School of Business, New York University.
- Qiang Liu: School of Statistics and Management, Shanghai University of Finance and Economics.
- Xin T. Tong: Department of Mathematics, National University of Singapore.