Reinforcement learning from human feedback (RLHF) is a technique for tra...
Reward learning algorithms utilize human feedback to infer a reward func...
Reinforcement learning from human feedback (RLHF) is a powerful techniqu...
Specifying reward functions for robots that operate in environments with...
AI algorithms increasingly make decisions that impact entire groups of
h...
The efficient and fair allocation of limited resources is a classical pr...