As more teams use the Arcade Learning Environment to train reinforcement learning (RL) models on Atari games, it becomes more important to (a) ensure that comparisons are truly like-for-like and reproducible, and (b) find ways to speed up and simplify model iteration. Google has developed a TensorFlow-based framework called Dopamine that lets researchers focus on the content of their tests and code rather than spending time iterating and tweaking.
https://ai.googleblog.com/2018/08/introducing-new-framework-for-flexible.html
Posted by Pablo Samuel Castro, Research Software Developer, and Marc G. Bellemare, Research Scientist, Google Brain Team. Reinforcement lear…
