Dreaming of Reinforcement Learning: GAN-enhanced Experience-Replay

Bachelor’s Project Thesis

The last decade has seen an extraordinary rise in popularity of Deep Reinforcement Learning (DRL) techniques, due to their capacity of attaining super-human results in most types of control tasks. However, they all suffer from the substantial drawback of long training times. Past studies have tried to minimize training time by improving the trajectories of the ExperienceReplay (ER) memory buffer. This thesis explores the possibility of improving the convergence time of the Deep Q-Learning algorithm by initializing its ER memory buffer with GAN-generated trajectories. The Deep Q-Network (DQN) agents are tested on the Atari 2600 game of Pong, in which the states to be generated are represented by frames of the game. Results show that the DQN agent corresponding to the initialized memory buffer does not exhibit a significantly faster convergence time to the same reward as the non-initialized agent.

Atari 2600 Pong

Hand-picked images from the original dataset (left column) and randomly-picked generated images (right column). Each overlapping frame is colored as to showcase the movement of the ball and paddles. The images used in the algorithm are completely gray-scale.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
__pycache__		__pycache__
misc		misc
src		src
.gitignore		.gitignore
Dreaming_of_Reinforcement_Learning.pdf		Dreaming_of_Reinforcement_Learning.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dreaming of Reinforcement Learning: GAN-enhanced Experience-Replay

Bachelor’s Project Thesis

About

Releases

Packages

Languages

a-turcu/dreaming-rl

Folders and files

Latest commit

History

Repository files navigation

Dreaming of Reinforcement Learning: GAN-enhanced Experience-Replay

Bachelor’s Project Thesis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages