Low impact agency: review and discussion

03/06/2023
by Danilo Naiff, et al.

Powerful artificial intelligence poses an existential threat if the AI decides to drastically change the world in pursuit of its goals. The hope of low-impact artificial intelligence is to incentivize the AI to avoid such changes precisely because they cause a large impact on the world. In this work, we first review the concept of low-impact agency and previous proposals for approaching the problem, and then propose future research directions on the topic, with the goal of ensuring that low-impactedness is useful in making AI safe.
