banditml / offline-policy-evaluation (224 stars). Implementations and examples of common offline policy evaluation methods in Python. Topics: importance-sampling, counterfactual-learning, off-policy-evaluation, doubly-robust, offline-policy-evaluation, counterfactual-policy-evaluation. Updated Feb 11, 2023. Language: Python.
PlaytikaOSS / pybandits (51 stars). Python library for Multi-Armed Bandits. Topics: reinforcement-learning, thompson-sampling, multi-armed-bandits, multi-armed-bandit, bayesian-neural-networks, contextual-bandits, multiarmed-bandits, stochastic-bandit-algorithms, stochastic-bandit, contextual-bandit-algorithms, offline-policy-evaluation. Updated Dec 23, 2025. Language: Python.