(S) GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability

Publication
Advances in Neural Information Processing Systems (NeurIPS)
Suriya Gunasekar
Suriya Gunasekar
Senior Researcher