This paper considers several aspects of random matrix universality in de...
We investigate the local spectral statistics of the loss surface Hessian...
We conjecture that the reason for the difference in generalisation betwe...
We study the effect of mini-batching on the loss landscape of deep neura...
Hessian based measures of flatness, such as the trace, Frobenius and spe...
We investigate whether the Wigner semi-circle and Marcenko-Pastur
distri...
Iterate averaging has a rich history in optimisation, but has only very
...
We present MLRG Deep Curvature suite, a PyTorch-based, open-source packa...
Graph spectral techniques for measuring graph similarity, or for learnin...
Efficient approximation lies at the heart of large-scale machine learnin...
We present a novel algorithm for learning the spectral density of large ...
Evaluating the log determinant of a positive definite matrix is ubiquito...
The ability of many powerful machine learning algorithms to deal with la...
The scalable calculation of matrix determinants has been a bottleneck to...