Markus Kunesch

DeepAI


Featured Co-authors

Nando de Freitas
83 publications
Marcus Hutter
83 publications
Joel Z. Leibo
53 publications
Shane Legg
41 publications
Julien Perolat
34 publications
Yutian Chen
30 publications
Pedro A. Ortega
30 publications
Bilal Piot
26 publications
Tom Everitt
25 publications
Scott Reed
24 publications
Tim Genewein
19 publications

research

∙ 05/29/2023

Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity

Is it possible to evaluate the moral cognition of complex artificial age...

Yiran Mao, et al.

research

∙ 09/30/2022

Beyond Bayes-optimality: meta-learning what you know you don't know

Meta-training agents with memory has been shown to culminate in Bayes-op...

Jordi Grau-Moya, et al.

research

∙ 03/23/2022

Your Policy Regularizer is Secretly an Adversary

Policy regularization methods such as maximum entropy regularization are...

Rob Brekelmans, et al.

research

∙ 11/04/2021

Model-Free Risk-Sensitive Reinforcement Learning

We extend temporal-difference (TD) learning in order to obtain risk-sens...

Grégoire Delétang, et al.

research

∙ 10/20/2021

Shaking the foundations: delusions in sequence models for interaction and control

The recent phenomenal success of language models has reinvigorated machi...

Pedro A. Ortega, et al.

research

∙ 03/05/2021

Causal Analysis of Agent Behavior for AI Safety

As machine learning systems become more powerful they also become increa...

Grégoire Delétang, et al.

research

∙ 10/14/2020

Human-interpretable model explainability on high-dimensional data

The importance of explainability in machine learning continues to grow, ...

Damien de Mijolla, et al.
