In this work, we develop and release Llama 2, a collection of pretrained...
We present a theory for the previously unexplained divergent behavior no...
This paper studies the role of over-parametrization in solving non-conve...
The paper studies the performance of the Model-Agnostic Meta-Learning (M...