Stéphane d'Ascoli
Stéphane d'Ascoli
Bio
Research
Outreach
Music
Travel
CV
Light
Dark
Automatic
Length generalization in arithmetic transformers
Samy Jelassi
,
Stéphane d'Ascoli
,
Carles Domingo-Enrich
,
Yuhuai Wu
,
Yuanzhi Li
,
François Charton
January 2023
Cite
ArXiv
Type
Journal article
Publication
arXiv preprint arXiv:2306.15400
Related
End-to-end symbolic regression with transformers
Boolformer: Symbolic Regression of Logic Functions with Transformers
Optimal learning rate schedules in high-dimensional non-convex optimization problems
Transformed CNNs: recasting pre-trained convolutional layers with self-attention
On the interplay between data structure and loss function in classification problems
Cite
×