The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is fo…
actions deep-reinforcement-learning prediction data-visualization convergence dqn epsilon-greedy states rl rewards hyperparameter-tuning model-evaluation model-building optimal-policy markov-decision-process epsilon-decay mdp-framework training-dqn-agent q-values-tracking minibatch-gradient-descent
- Updated
Jul 9, 2021 - Jupyter Notebook