Stéphane d'Ascoli
Scaling description of generalization with number of parameters in deep learning
Mario Geiger, Arthur Jacot, Stefano Spigler, Franck Gabriel, Levent Sagun, Stéphane d’Ascoli, Giulio Biroli, Clément Hongler, Matthieu Wyart
January 2020
ArXiv
J. Stat. Mech
Type: Journal article
Publication: Journal of Statistical Mechanics: Theory and Experiment
Related
Jamming transition as a paradigm to understand the loss landscape of deep neural networks
Transformed CNNs: recasting pre-trained convolutional layers with self-attention
On the interplay between data structure and loss function in classification problems
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Triple descent and the two kinds of overfitting: where and why do they appear?