research
Analytic bounds linking smoothing scale to Fourier decay and optimization geometry, motivated by energy-based and score-based generative models.
Training transformer scratchpads to learn modular multiplication for the SAIR Modular Arithmetic Challenge.