An agent which learned to play Mario without rewards. Instead, it was incentivized to avoid "boredom" (that is, getting into states where it can predict what will happen next). Discovered warp levels, how to defeat bosses, etc. More details: https://t.co/lGw3rZUbv3 pic.twitter.com/6ObS35iZZS
— Greg Brockman (@gdb) October 31, 2018
from Twitter https://twitter.com/gdb
October 31, 2018 at 10:24AM
via IFTTT
No comments:
Post a Comment