Reinforcement learning from human feedback (RLHF) is a technique for tra...
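As a concrete illustration of the training signal most RLHF pipelines rely on, the sketch below fits a reward model to pairwise human preference labels with a Bradley-Terry likelihood. It is a minimal sketch, not any particular paper's implementation; the toy feature vectors, dimensions, and learning rate are all made up for illustration.

```python
# Minimal sketch of the pairwise-preference reward-modeling step used in many
# RLHF pipelines (Bradley-Terry likelihood). The feature vectors stand in for
# learned response embeddings and are purely hypothetical.
import numpy as np

rng = np.random.default_rng(0)

dim = 8
# Toy data: each row pairs the features of a human-preferred response with
# the features of the rejected response from the same comparison.
preferred = rng.normal(loc=0.5, size=(256, dim))
rejected = rng.normal(loc=0.0, size=(256, dim))

w = np.zeros(dim)  # linear reward model r(x) = w @ x

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Maximize log P(preferred > rejected) = log sigmoid(r(preferred) - r(rejected))
lr = 0.1
for _ in range(500):
    margin = (preferred - rejected) @ w  # r(preferred) - r(rejected)
    grad = ((sigmoid(margin) - 1.0)[:, None] * (preferred - rejected)).mean(axis=0)
    w -= lr * grad  # gradient step on the negative log-likelihood

accuracy = ((preferred - rejected) @ w > 0).mean()
print(f"fraction of training pairs ranked correctly: {accuracy:.2f}")
```

The learned reward model would then be used as the optimization target for an RL step (e.g. a policy-gradient method), which is the part of the pipeline this sketch leaves out.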
Research in Fairness, Accountability, Transparency, and Ethics (FATE) ha...
We provide the first formal definition of reward hacking, a phenomenon w...
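The definition itself is not reproduced here, but one common way to make "reward hacking" precise is in terms of a proxy/true reward pair: the proxy is hackable when improving proxy return can strictly reduce true return. The display below is a sketch of such a formalization, not necessarily the exact one given in the cited work; $J_R(\pi)$ denotes the expected return of policy $\pi$ under reward function $R$, and $\Pi$ is a fixed set of policies.

```latex
% Sketch: R_proxy is hackable with respect to R_true over a policy set \Pi
% if some policy change that increases proxy return decreases true return.
\[
\exists\, \pi, \pi' \in \Pi:\quad
J_{R_{\text{proxy}}}(\pi') > J_{R_{\text{proxy}}}(\pi)
\;\;\text{and}\;\;
J_{R_{\text{true}}}(\pi') < J_{R_{\text{true}}}(\pi).
\]
```

Under a sketch like this, "unhackable" proxies are those for which no such pair of policies exists, i.e. proxy improvements never come at the expense of true return.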
Given two sources of evidence about a latent variable, one can combine t...
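A minimal numerical sketch of one standard way to do this combination: treat the two sources as conditionally independent given the latent variable, multiply their likelihoods (equivalently, add log-likelihoods), and renormalize. The discrete latent values, prior, and likelihood tables below are hypothetical.

```python
# Minimal sketch: combining two independent sources of evidence about a
# discrete latent variable by multiplying likelihoods and renormalizing.
# The latent values, prior, and likelihood tables are hypothetical.
import numpy as np

latent_values = np.array(["A", "B", "C"])
prior = np.array([1 / 3, 1 / 3, 1 / 3])

# P(evidence_i | latent) for each source, one entry per latent value.
likelihood_source_1 = np.array([0.7, 0.2, 0.1])
likelihood_source_2 = np.array([0.6, 0.3, 0.1])

# Assuming the sources are conditionally independent given the latent,
# the joint likelihood is the product of the individual likelihoods.
posterior = prior * likelihood_source_1 * likelihood_source_2
posterior /= posterior.sum()

for value, p in zip(latent_values, posterior):
    print(f"P(latent={value} | both sources) = {p:.3f}")
```

The conditional-independence assumption is what licenses the simple product; when the two sources are correlated, multiplying likelihoods double-counts the shared information.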
Reinforcement learning (RL) agents optimize only the features specified ...
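As a toy illustration of that indifference, the snippet below scores two trajectories with a linear reward over hand-picked features; because the reward designer assigned no weight to the "vase_broken" feature, the reward cannot distinguish the trajectory that breaks the vase from the one that does not. All feature names and numbers are invented for illustration.

```python
# Toy illustration: a linear reward over specified features is indifferent to
# any feature it leaves out. Feature names and values are hypothetical.
import numpy as np

feature_names = ["distance_to_goal_reduced", "time_penalty", "vase_broken"]

# The designer specified goal progress and a time cost, but said nothing
# about the vase, so its weight is implicitly zero.
reward_weights = np.array([1.0, -0.1, 0.0])

careful_trajectory = np.array([1.0, 5.0, 0.0])   # reaches the goal, avoids the vase
careless_trajectory = np.array([1.0, 5.0, 1.0])  # reaches the goal, breaks the vase

for name, phi in [("careful", careful_trajectory), ("careless", careless_trajectory)]:
    print(f"{name}: reward = {reward_weights @ phi:+.2f}")
# Both trajectories receive identical reward: the unspecified feature has no
# effect on the objective, so the agent has no reason to avoid breaking the vase.
```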