We test the hypothesis that language models trained with reinforcement
l...
As AI systems become more capable, we would like to enlist their help to...
We describe our early efforts to red team language models in order to
si...
We study whether language models can evaluate the validity of their own
...
Despite progress in training neural networks for lossy image compression...
Accurate image classification given small amounts of labelled data (few-...