Stéphane d'Ascoli

Research Scientist

Meta AI, Paris

Biography

Hi! I’m a Research Scientist at Meta AI, working in the Brain & AI team. Previously, I was an AI4Science research fellow at EPFL, and completed a Ph.D. in deep learning, during which I shared my time between the Center for Data Science of ENS Paris and Facebook AI Research – you can find my thesis here. Prior to that, I studied Theoretical Physics at ENS Paris, and worked with NASA on black hole mergers. You can download my CV here.

My current research focuses on decoding neural activity, with the aim of better understanding how the brain works and, perhaps one day, helping those who have difficulty speaking or typing. I am also interested in understanding large neural networks and applying them to computer vision, symbolic regression and the natural sciences in general. Outside work, I love communicating about science (I wrote a few books for the general public), playing the clarinet and travelling very far on my bicycle!

Education

PhD in Artificial Intelligence, 2022
Ecole Normale Supérieure, Paris

Master's in Theoretical Physics, 2018
Ecole Normale Supérieure, Paris

Bachelor's in Physics, 2016
Ecole Normale Supérieure, Paris

Research

Stéphane d'Ascoli, Samy Bengio, Josh Susskind, Emmanuel Abbé

January 2023 arXiv preprint arXiv:2309.12207

Boolformer: Symbolic Regression of Logic Functions with Transformers

ArXiv Code Demo Twitter

Samy Jelassi, Stéphane d'Ascoli, Carles Domingo-Enrich, Yuhuai Wu, Yuanzhi Li, François Charton

January 2023 arXiv preprint arXiv:2306.15400

Length generalization in arithmetic transformers

ArXiv

Stéphane d'Ascoli, Sören Becker, Alexander Mathis, Philippe Schwaller, Niki Kilbertus

January 2023

ODEFormer: Symbolic Regression of Dynamical Systems with Transformers

ArXiv Code Demo Twitter

Stéphane d’Ascoli, Pierre-Alexandre Kamienny, Guillaume Lample, François Charton

January 2022 International Conference on Machine Learning

Deep symbolic regression for recurrence prediction

ArXiv Yannic Kilcher Demo Code Talk Twitter

Pierre-Alexandre Kamienny, Stéphane d'Ascoli, Guillaume Lample, François Charton

January 2022 Advances in Neural Information Processing Systems

End-to-end symbolic regression with transformers

ArXiv Demo Code Talk Twitter

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli

January 2022 arXiv preprint arXiv:2202.04509

Optimal learning rate schedules in high-dimensional non-convex optimization problems

ArXiv Code Twitter

Maria Refinetti, Stéphane d'Ascoli, Ruben Ohana, Sebastian Goldt

January 2021 International Conference on Machine Learning

Align, then memorise: the dynamics of learning with feedback alignment

ArXiv ICML J. Phys. A Code Talk

Stéphane d'Ascoli, Hugo Touvron, Matthew L Leavitt, Ari S Morcos, Giulio Biroli, Levent Sagun

January 2021 International Conference on Machine Learning

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

ArXiv ICML Blog post Code Long talk Short talk Twitter

Stéphane d'Ascoli, Marylou Gabrié, Levent Sagun, Giulio Biroli

January 2021 Advances in Neural Information Processing Systems

On the interplay between data structure and loss function in classification problems

ArXiv NeurIPS Code Talk

Stéphane d'Ascoli, Levent Sagun, Giulio Biroli, Ari Morcos

January 2021 arXiv preprint arXiv:2106.05795

Transformed CNNs: recasting pre-trained convolutional layers with self-attention

ArXiv

Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge

January 2020 International Conference on Statistical Language and Speech Processing

Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

ArXiv Springer Code

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

January 2020 International Conference on Machine Learning

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime

ArXiv ICML Medium Code Talk

Mario Geiger, Arthur Jacot, Stefano Spigler, Franck Gabriel, Levent Sagun, Stéphane d’Ascoli, Giulio Biroli, Clément Hongler, Matthieu Wyart

January 2020 Journal of Statistical Mechanics: Theory and Experiment

Scaling description of generalization with number of parameters in deep learning

ArXiv J. Stat. Mech

Stéphane d'Ascoli, Levent Sagun, Giulio Biroli

January 2020 Advances in Neural Information Processing Systems

Triple descent and the two kinds of overfitting: where and why do they appear?

ArXiv NeurIPS J. Stat Code Talk

S Spigler, M Geiger, S d’Ascoli, L Sagun, G Biroli, M Wyart

January 2019 Journal of Physics A: Mathematical and Theoretical

A jamming transition from under- to over-parametrization affects generalization in deep learning

ArXiv J. Phys. A Code

Stéphane d'Ascoli, Levent Sagun, Giulio Biroli, Joan Bruna

January 2019 Advances in Neural Information Processing Systems

Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias

ArXiv NeurIPS Slides Code

Mario Geiger, Stefano Spigler, Stéphane d'Ascoli, Levent Sagun, Marco Baity-Jesi, Giulio Biroli, Matthieu Wyart

January 2019 Physical Review E

Jamming transition as a paradigm to understand the loss landscape of deep neural networks

ArXiv Phys. Rev. E Code

Stéphane d’Ascoli, Scott C Noble, Dennis B Bowen, Manuela Campanelli, Julian H Krolik, Vassilios Mewes

January 2018 The Astrophysical Journal

Electromagnetic Emission from Supermassive Binary Black Holes Approaching Merger

ArXiv Ap. J. NASA press release Video

Outreach

Podcasts

L’espace-temps est courbe: qu’est-ce à dire?

Podcast “La Conversation Scientifique” with Etienne Klein, France Culture, June 2021.

Big Bang et Trous Noirs

Podcast “Minute Papillon” with Sidonie Bonnec, France Bleu, May 2021.

Books

Voyage au Coeur de l’Atome

Book on quantum mechanics, co-written with Adrien Bouscal, published by First Editions, May 2022.

Voyage au Coeur de l’Espace-Temps

Book on relativity, co-written with Arthur Touati, published by First Editions, March 2021. Also available on Audible as an audio book.

Comprendre la révolution de l’Intelligence Artificielle

Book on AI, published by First Editions, March 2020.

L’Intelligence Artificielle en 5 minutes par jour

Short book on AI, published by First Editions, September 2020.

Videos

Les Intelligences Artificielles les plus flippantes

Co-wrote the script for Dr. Nozman, November 2022.

Deep Symbolic Regression for Recurrent Sequences

Interview with Yannic Kilcher, January 2022.

Qu’est-ce que l’Intelligence Artificielle?

Conference “Les assises du livre numérique”, organized by the Syndicat National de l’Edition, December 2021.

Comprendre la révolution de l’Intelligence Artificielle

Conference “Les Mardis Scientifiques”, organized by Université du Temps Libre, November 2021.

Simulation Reveals Spiraling Supermassive Black Holes

Explanatory video on black hole mergers, co-produced with NASA, October 2018.

360-degree Simulated View of the Sky Between Two Supermassive Black Holes

VR visualization of binary black holes, co-produced with NASA, October 2018.
