This is the repository for the masters thesis titled "Interactive Reinforcement Learning for Adaptive Thermal Comfort"
-
Updated
Oct 2, 2024 - Jupyter Notebook
This is the repository for the masters thesis titled "Interactive Reinforcement Learning for Adaptive Thermal Comfort"
This project implements an agent for playing the SonicTheHedgehog2 game from a ROM file using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.
Distributed training for RL algo on pytorch
🍄 Reinforcement Learning agent for Super Mario Bros
A reinforcement learning A3C implementation trained to play Super Mario Bros
Autonomous 1:10 race car with a reinforcement learning based approach
Obstacle avoidance agent in a custom Gym environment
This repository contains implementations for reward shaping based governance kernel layer experiments
In this project, I created an agent to solve the CartPole task using the stablebaselines3 library. CartPole is a problem from the OpenAI Gym catalog, in which the goal is to maintain balance of a wooden pole using motors attached to its ends. The agent must decide whether to move the pole left or right to maintain balance.
Nokia's classic 'snake' game, written in NumPy and converted into a Gymnasium Environment() for use with gradient-based reinforcement learning algorithms
Training an agent to land a spacecraft in the LunarLander environment.
Git repository for LQR and Reinforcement Learning labs. Code for modeling human movement and solving optimization using PPO algorithm.
Example of Reinforcement Learning Environment on Minecraft with Stable-Baselines3 and CraftGround
This project implements an agent for playing the VizDoom game on various levels using the Proximal Policy Optimization (PPO) algorithm from the stablebaselines3 library. The agent is trained to learn the optimal actions to take at each step in the game in order to complete the level and maximize the score.
Predicting prime numbers as list of bits.
Implementation of RL algorithms using the stable baselines library
Pilotage d'un pendule de Furuta avec un Raspberry PI
Analyzing policy entropy of reinforcement learning agents
Add a description, image, and links to the stable-baselines3 topic page so that developers can more easily learn about it.
To associate your repository with the stable-baselines3 topic, visit your repo's landing page and select "manage topics."