Non-Euclidean Gradient Descent Operates at the Edge of Stability
This paper extends the Edge of Stability phenomenon to non-Euclidean gradient descent by introducing a generalized sharpness measure based on directional smoothness, demonstrating that diverse optimizers exhibit similar stability thresholds and oscillatory behaviors across arbitrary geometric norms.