Skip to content

Solving openAI's game 'Lunar-Lander' with Deep Reinforcement Learning

License

Notifications You must be signed in to change notification settings

Kyziridis/Lunar-Network

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lunar-Lander with Lunar-Network

OpenAI's gym module called Box2D game Lunar-Lander

Instructions

  1. In order to run both scripts you need to install openAI-gym lib.

  2. For gym install follow the next steps:

    $ pip install --upgrade pip
    $ git clone https://github.com/openai/gym.git
    $ cd gym
    $ pip install -e '.[box2d]'

    # test that it worked 
    $ python >>> import gym >>> gym.make('LunarLander-v2')

    # The above instructions are for linux systems. 
    # We do not use proprietary software so we do not have instructions for Windows or MacOS.  

You can visit openAI gym in order to inspect this kind of environments.

  1. Other dependencies : Keras, matplotlib, numpy, random, rensor flow

  2. Do not use any Python-IDE to run the code if you want to see the game-simulation (environment-render). Run the code in a console or a terminal by typping: $ python script-name.py

Comments:

The DQN.py uses either a ready model for training or it builds a new one. Under /pretrain folder you can find 5 different models which won the game. If you want to use one pretrain model of those, copy it to the parent directory where the DQN.py is located and rename it as "lunar-lander_solved.h5". If you use a pretrained model, when you run the script you will see the message >>> LOAD an existing model, and under this you will see the summary of that model. If you do not include a pretrained model the script will build a new model and the corresponding message will be >>> Build a new model.

The Naiv-NN.py it does not use any pretrained models. Under the folder "pretrain" you can find also some plots and some of the videos we monitored.

For further information about the procedure you can read the Lunar_Network.pdf which reports the ideas and the procedure behind the problem. It was part of a free-choice assignment for Neural-Networks master-course.

Enjoy!

SUPPORT >_ GNU/Linux >_ Free Software Foundation