# bandits
Here are 26 public repositories matching this topic...
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
Updated Aug 21, 2019 - Jupyter Notebook
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Updated Jun 4, 2021 - Python
Another A/B test library
scala
public
functional-programming
functional-reactive-programming
ab-testing
bayesian
bandits
bayesian-analysis
bandit
bandit-algorithm
Updated Oct 18, 2021 - Scala
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
Updated Nov 14, 2019 - Jupyter Notebook
Thompson Sampling for Bandits using UCB policy
Updated Jul 29, 2017 - Python
Python implementation of common RL algorithms using OpenAI gym environments
Updated Jan 8, 2021 - Python
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
reinforcement-learning
paper
semi-supervised-learning
bandits
bandit
contextual-bandits
contextual-bandit
self-supervised-learning
nonstationary-environments
Updated Sep 21, 2020 - MATLAB
Code for our paper: "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior".
reinforcement-learning
game-theory
multiplayer-game
behavioral-cloning
multiagent-systems
human-behavior
bandits
contextual-bandits
prisoner-dilemma
Updated Jun 15, 2020 - Python
Code for our ICDMW 2018 paper: "Contextual Bandit with Adaptive Feature Extraction".
reinforcement-learning
feature-extraction
icdm
representation-learning
bandits
contextual-bandits
nonstationary
icdm2018
Updated Jun 15, 2020 - MATLAB
A Python library for (finite) Partial Monitoring algorithms
Updated Sep 12, 2017 - Jupyter Notebook
This repo contains all the stuff I encountered while playing OverTheWire games.
Updated Dec 25, 2020
Collaborative project for documenting ML/DS learnings.
Updated Jun 29, 2021 - Jupyter Notebook
Updated May 28, 2021 - Jupyter Notebook
An assignment for the implementation of Online Learning, Bandits and Reinforcement Learning
Updated Dec 18, 2018 - Jupyter Notebook
Updated Nov 7, 2019 - Python
Foundations of Intelligent and Learning Agents
Updated Dec 13, 2019 - Python
Simple implementations of bandit algorithms in Python
bandit-learning
multi-armed-bandits
online-learning
bandits
bandit
online-learning-algorithms
bandit-algorithms
online-learning-python
Updated Oct 7, 2021 - Python
Updated Nov 16, 2017 - Python
Updated Oct 3, 2021 - Python
Play Rock, Paper, Scissors (Kaggle competition) with Reinforcement Learning: bandits, tabular Q-learning and PPO with LSTM.
Updated Mar 2, 2021 - Python
reinforcement-learning
policy-gradient
dynamic-programming
markov-decision-processes
bandits
sarsa-lambda
Updated Aug 16, 2017 - Jupyter Notebook
The docstring for tf_py_environment.__getattr__ indicates that certain PyEnvironment methods might be incompatible with TF.