Unofficial code reproducing Agent57
This repository contains unofficial code reproducing Agent57, which outperformed humans in all Atari games. agent.py define agent to play a supecific environment. buffer.py define buffer to store experiences with priorites. learner.py define learner to update parameter such as q networks and functions related to intrinsic reward. main.py run the main pipeline. model.py define some models such as q network and functions related to intrinsic reward. segment_tree.py define segment tree which decide segment index according to the priority. tester.py define tester […]
Read more