Eli Tran-Johnson | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Generator

Featured Co-authors

Samuel R. Bowman
79 publications
Ethan Perez
28 publications
Dario Amodei
27 publications
Stanislav Fort
25 publications
Jack Clark
22 publications
Dawn Drain
21 publications
Jared Kaplan
21 publications
Azalia Mirhoseini
20 publications
Sam McCandlish
19 publications
Tom Brown
17 publications
Amanda Askell
16 publications

research

∙ 02/15/2023

The Capacity for Moral Self-Correction in Large Language Models

We test the hypothesis that language models trained with reinforcement l...

0 Deep Ganguli, et al. ∙

research

∙ 12/15/2022

Constitutional AI: Harmlessness from AI Feedback

As AI systems become more capable, we would like to enlist their help to...

0 Yuntao Bai, et al. ∙

research

∙ 11/04/2022

Measuring Progress on Scalable Oversight for Large Language Models

Developing safe and useful general-purpose AI systems will require us to...

0 Samuel R. Bowman, et al. ∙

research

∙ 08/23/2022

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

We describe our early efforts to red team language models in order to si...

0 Deep Ganguli, et al. ∙

research

∙ 07/11/2022

Language Models (Mostly) Know What They Know

We study whether language models can evaluate the validity of their own ...

12 Saurav Kadavath, et al. ∙