action-value-function

Here are 2 public repositories matching this topic...

antonio-f / Dynamic-Programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

reinforcement-learning openai-gym gym dynamic-programming policy-evaluation policy-iteration value-iteration bellman-equation frozenlake policy-improvement state-value-function action-value-function

Updated Apr 3, 2019
Jupyter Notebook

antonio-f / MonteCarlo-methods

Star

Monte Carlo methods for Reinforcement Learning (from Udacity's "Deep Reinforcement Learning Nanodegree Program").

reinforcement-learning openai-gym gym monte-carlo-methods reinforcement-learning-excercises action-value-function blackjack-env state-va mc-prediciton mc-control

Updated Apr 16, 2019
Jupyter Notebook

Improve this page

Add a description, image, and links to the action-value-function topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the action-value-function topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly