Align, then memorise: the dynamics of learning with feedback alignment
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems
Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime
Triple descent and the two kinds of overfitting: where and why do they appear?
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias