This repository contains the code implementing Hierarchical Deep Deterministic Policy Gradient (HDDPG) with Hindsight Experience Replay (HER) and Random Network Distillation (RND). Our experiment environment is the MuJoCo robot environment, including the Reach, Push, PickAndPlace, and Slide tasks. However, we have only finished the Reach task so far.
To run the code, execute, for example, `python run_HAC.py --layers 1 --her --normalize --retrain --env reach --episodes 5000 --threadings 1`. The meaning of each flag is easy to understand, and you can read option.py to see all the flags. A "performance.jpg" showing the training accuracy is produced only when threadings is 1.
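As a rough guide, the flags in the command above suggest an argument parser along these lines. This is a minimal sketch inferred from the example command only; the actual option.py may define more flags or different defaults.

```python
import argparse

def parse_args():
    # Hypothetical reconstruction of the flags shown in the example command.
    parser = argparse.ArgumentParser(description="HDDPG + HER + RND training options")
    parser.add_argument("--layers", type=int, default=1,
                        help="number of hierarchy levels in HDDPG")
    parser.add_argument("--her", action="store_true",
                        help="enable Hindsight Experience Replay")
    parser.add_argument("--normalize", action="store_true",
                        help="normalize observations (state/goal)")
    parser.add_argument("--retrain", action="store_true",
                        help="train from scratch instead of loading a saved model")
    parser.add_argument("--env", type=str, default="reach",
                        help="task name, e.g. reach or push")
    parser.add_argument("--episodes", type=int, default=5000,
                        help="number of training episodes")
    parser.add_argument("--threadings", type=int, default=1,
                        help="number of parallel experiment processes")
    return parser.parse_args()
```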
Our RND is an off-policy implementation, whereas most of the popular curiosity-driven methods are currently on-policy. Because the RND predictor network keeps changing during training, the intrinsic reward must be recomputed for every batch sampled from the replay buffer rather than stored in the buffer.
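The following is a minimal sketch of this per-batch intrinsic reward computation, assuming PyTorch; network sizes and names are illustrative and are not this repository's actual code.

```python
import torch
import torch.nn as nn

class RND(nn.Module):
    def __init__(self, obs_dim, feat_dim=64):
        super().__init__()
        # Fixed random target network and a trainable predictor network.
        self.target = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, feat_dim))
        self.predictor = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, feat_dim))
        for p in self.target.parameters():
            p.requires_grad = False  # the target is never trained

    def intrinsic_reward(self, obs):
        # The predictor's error against the frozen target's features is
        # the curiosity bonus (and also serves as the predictor's loss).
        with torch.no_grad():
            target_feat = self.target(obs)
        pred_feat = self.predictor(obs)
        return ((pred_feat - target_feat) ** 2).mean(dim=-1)

# Since the predictor changes after every update, the bonus is recomputed
# for each batch drawn from the replay buffer instead of being stored:
def augment_batch_reward(rnd, batch_obs, batch_reward, beta=0.1):
    r_int = rnd.intrinsic_reward(batch_obs).detach()
    return batch_reward + beta * r_int
```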
More details will be added later.
Thanks to the authors of HAC, HER, and RND.
- Hierarchical DDPG and HER (a sketch of HER goal relabeling follows this list);
- Observation (State/Goal) Normalization;
- RND;
- Multiprocessing (so we can run many experiments at the same time);
- Reach and Push environments;
- Use gym to create the environment class (so it is easy to use other environments);
- Hand Reach environment.
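To illustrate the HER item above, here is a minimal sketch of "future"-strategy goal relabeling over a stored episode; the function and field names are hypothetical, not this repository's actual code.

```python
import random

def her_relabel(episode, reward_fn, k=4):
    """Relabel transitions with achieved goals from later in the episode
    ('future' strategy). Each transition is a dict with keys
    'obs', 'action', 'achieved_goal', 'goal', and 'reward'."""
    relabeled = []
    for t, transition in enumerate(episode):
        for _ in range(k):
            # Pick an achieved goal from a later timestep as the new goal.
            future = random.randint(t, len(episode) - 1)
            new_goal = episode[future]["achieved_goal"]
            new = dict(transition)
            new["goal"] = new_goal
            # Recompute the reward against the substituted goal.
            new["reward"] = reward_fn(transition["achieved_goal"], new_goal)
            relabeled.append(new)
    return relabeled
```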