Skip to content
View Breakend's full-sized avatar

Block or report Breakend

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
3 stars written in Python
Clear filter

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,870 843 Updated May 29, 2022

Implementation of the Option-Critic Architecture on the Atari (ALE) environment

Python 181 54 Updated Sep 21, 2017

Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"

Python 93 19 Updated Mar 27, 2023