When using Stochastic Gradient Descent (SGD) for training machine learni...
The success of SGD in deep learning has been ascribed by prior works to ...
Finetuning a pretrained model has become a standard approach for trainin...
Large neural networks trained in the overparameterized regime are able t...
In machine learning, we traditionally evaluate the performance of a sing...
It is well established that training deep neural networks gives useful
r...
We prove a new upper bound on the generalization gap of classifiers that...
Despite the vast success of Deep Neural Networks in numerous application...
We show that a variety of modern deep learning tasks exhibit a
"double-d...
We perform an experimental study of the dynamics of Stochastic Gradient
...
In this paper we study the problem of robust influence maximization in t...