Can Network Flatness Explain the Training Speed-Generalisation Connection?

Published in Bayesian Deep Learning Workshop at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPs), 2021