We develop a contrastive framework for learning better prior distributio...
We investigate the efficacy of treating all the parameters in a Bayesian...
Training on web-scale data can take months. But most computation and tim...
We introduce Goldilocks Selection, a technique for faster model training...
There remains much uncertainty about the relative effectiveness of diffe...
In many real-world applications of machine learning, data are distribute...