Publications
2025
- PreprintAsymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention NetworksarXiv preprint, Jun 2025
2024
- Preprint
- PreprintRepetita iuvant: Data repetition allows sgd to learn high-dimensional multi-index functionsarXiv preprint, May 2024
2023
- ConferenceUniversality laws for Gaussian mixtures in generalized linear modelsIn Advances in Neural Information Processing Systems, Dec 2023
- PreprintEscaping mediocrity: how two-layer networks learn hard single-index models with SGDarXiv preprint, May 2023
2022
2021
- Preprint