Breakend (Peter Henderson) / Starred

Stars

3 stars written in Python

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,870 843 Updated May 29, 2022

jeanharb / option_critic

Implementation of the Option-Critic Architecture on the Atari (ALE) environment

Python 181 54 Updated Sep 21, 2017

reglab / casehold

Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"

Python 93 19 Updated Mar 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Peter Henderson Breakend

Achievements

Achievements

Block or report Breakend

Stars

ikostrikov / pytorch-a2c-ppo-acktr-gail

jeanharb / option_critic

reglab / casehold