As more and more teams work with the Arcade Learning Environment to train RL models on Atari games, it becomes more important to a) ensure that comparisons are truly like-for-like and reproducible, and b) find ways to speed up and simplify model iteration. Google has developed a Tensorflow framework, called Dopamine, allowing researchers to focus on the content of their tests and code, rather than spending time iterating and tweaking.
https://rob.al/2NaPvKZ
Posted by Pablo Samuel Castro, Research Software Developer and Marc G. Bellemare, Research Scientist, Google Brain Team Reinforcement lear…