direct-preference-optimization Public
Forked from eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Python Apache License 2.0 Updated Dec 29, 2023
sycophancy-intervention Public
Forked from google/sycophancy-intervention
Scripts for generating synthetic finetuning data for reducing sycophancy.
Python Apache License 2.0 Updated Aug 16, 2023
End-to-End-TTS-Fine-Tune Public
Forked from hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
Python MIT License Updated Jul 30, 2023
rl-teacher Public
Forked from nottombrown/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
crazyswarm Public
Forked from USC-ACTLab/crazyswarm
A Large Quadcopter Swarm
Python MIT License Updated Jul 22, 2022
PettingZoo Public
Forked from Farama-Foundation/PettingZoo
Gym for multi-agent reinforcement learning
Python Other Updated Jun 30, 2022
crazyflie-firmware Public
Forked from snulion-study/crazyflie-firmware
The main firmware for the Crazyflie Nano Quadcopter, Crazyflie Bolt Quadcopter and Roadrunner Positioning Tag.
C GNU General Public License v3.0 Updated Jun 7, 2022
quad_sim2multireal Public
Forked from amolchanov86/quad_sim2multireal
Repository for IROS 2019
Python Updated Jun 5, 2022
SuperSuit Public
Forked from Farama-Foundation/SuperSuit
Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments
Python MIT License Updated May 6, 2022
Popular-RL-Algorithms Public
Forked from quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Jupyter Notebook Apache License 2.0 Updated Mar 14, 2022
crazyflie-clients-python Public
Forked from bitcraze/crazyflie-clients-python
Host applications and library for Crazyflie written in Python.
Python Other Updated Feb 8, 2022
BPref Public
Forked from rll-research/BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.
Python MIT License Updated Jan 27, 2022
crazyflie-lib-python Public
Forked from bitcraze/crazyflie-lib-python
Python library to communicate with Crazyflie
Python Other Updated Jan 27, 2022
SNU_SLAM_2021-2 Public
Re-VLOAM: Recurrent Neural Networks based Visual LiDAR Odometry and Mapping
crazyflie_ros Public
Forked from whoenig/crazyflie_ros
ROS Driver for Bitcraze Crazyflie
C++ MIT License Updated Aug 3, 2021
RainbowDQN_highway Public
RainbowDQN algorithm for the Gym highway environment
sac-discrete.pytorch Public
Forked from toshikwa/sac-discrete.pytorch
A PyTorch implementation of SAC-Discrete.
Python MIT License Updated May 27, 2021