OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
- Updated
Jul 29, 2025 - Python
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Reinforced Recommendation toolkit built around pytorch 1.7
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
DI-engine docs (Chinese and English)
Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)
Train an RL agent to localize actively (PyTorch)
A high-performance Atari A3C agent in 180 lines of PyTorch
A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Pytorch solutions for UC Berkeley's cs285 assignments
A repository for implementation of deep reinforcement learning lectured at Samsung
A PyTorch Implementation of "Optimization of Molecules via Deep Reinforcement Learning".
Pytorch starter code for UC Berkeley's cs285 assignments
Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG
PyTorch implementation of Constrained Policy Optimization
[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
This code is the result of the collaboration of RL Turkey team.
A repository for code of reinforcement learning algorithms with PyTorch
Add a description, image, and links to the pytorch-rl topic page so that developers can more easily learn about it.
To associate your repository with the pytorch-rl topic, visit your repo's landing page and select "manage topics."