Thoughts: Favorite tweets

Wednesday, August 14, 2019

Favorite tweets

With time, we hope these design principles can help inform best practices for how to build capable RL agents without reward tampering incentives.

This builds upon our previous work on understanding agent incentives with causal influence diagrams: https://t.co/WBc7fUJCbf
— DeepMind (@DeepMindAI) August 14, 2019

from Twitter https://twitter.com/DeepMindAI

August 14, 2019 at 08:47AM
via IFTTT

No comments:

Post a Comment