Skip to content

hedrox/random-network-distillation

This branch is 13 commits ahead of openai/random-network-distillation:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

685d18c · Jan 16, 2022

History

24 Commits
May 20, 2020
Mar 23, 2020
Jan 16, 2022
May 4, 2021
Oct 12, 2020
Oct 16, 2018
Mar 23, 2020
Oct 16, 2018
Oct 16, 2018
May 14, 2019
May 14, 2019
Mar 23, 2020
May 4, 2021
Mar 30, 2021
May 20, 2020
May 20, 2020
May 4, 2021
May 20, 2020
Jul 24, 2019
Jul 24, 2019
May 20, 2020
Mar 23, 2020

Repository files navigation

Andrei Marin, Traian Rebedea, Ionel Hosu
Politehnica University of Bucharest

Abstract

Games on the Atari 2600 platform have served as a benchmark for reinforcement learning algorithms in recent years, and while deep reinforcement learning approaches make progress on most games, there are still some games that the majority of these algorithms struggle with. These are called hard exploration games. We introduce two new developments for the Random Network Distillation (RND) architecture. We apply self-attention and the mechanism of ego motion on the RND architecture and we evaluate them on three hard exploration tasks from the Atari platform. We find that the proposed ego network model improve the baseline of the RND architecture on these tasks.

Installation Guide

First install the conda environment

conda create --name <env_name> --file conda_requirements.txt

Then install dependencies that cannot be installed with conda

pip install -r pip_requirements.txt

Usage

To train an Ego RND agent on Montezuma's Revenge, run the following command

python run_atari.py --save_model

Acknowledgement

This work is based on OpenAI's Exploration by Random Network Distillation by Yuri Burda, Harri Edwards, Amos Storkey, Oleg Klimov