Introducing a New Framework for Flexible and Reproducible Reinforcement Learning Research

As more teams use the Arcade Learning Environment to train RL models on Atari games, it becomes more important to (a) ensure that comparisons are truly like-for-like and reproducible, and (b) speed up and simplify model iteration. Google has developed Dopamine, a TensorFlow-based framework that lets researchers focus on the substance of their experiments and code rather than spending time iterating and tweaking.
https://ai.googleblog.com/2018/08/introducing-new-framework-for-flexible.html
Posted by Pablo Samuel Castro, Research Software Developer, and Marc G. Bellemare, Research Scientist, Google Brain Team
Reinforcement lear…