Multi-armed bandit algorithms. Research done at Dr. Ji's SNAIL lab @ Virginia Tech.
Implements several bandit algorithms that address the exploration-vs-exploitation trade-off (see the sketch after the list):
- Epsilon-greedy: Explores a uniformly random arm with probability ε, otherwise exploits the arm with the highest estimated reward
- UCB: Selects the arm with the highest upper confidence bound on its estimated reward, giving rarely tried arms an exploration bonus
- Thompson Sampling: Bayesian method that samples from each arm's posterior reward distribution and plays the arm with the best sample
- Contextual bandits: Uses side information (context features) observed at each round to choose actions
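For orientation, here is a minimal, self-contained sketch of the selection rules for the first three algorithms on a simulated Bernoulli bandit. It is illustrative only; the function names, bandit setup, and hyperparameters (ε = 0.1, Beta(1, 1) priors) are assumptions and not the code in this repository.

```python
# Sketch of epsilon-greedy, UCB1, and Thompson sampling on a Bernoulli bandit.
# Names, setup, and hyperparameters are illustrative, not this repo's API.
import numpy as np

rng = np.random.default_rng(0)
true_means = np.array([0.2, 0.5, 0.7])   # hypothetical arm reward probabilities
n_arms = len(true_means)

def pull(arm):
    """Simulate one Bernoulli reward from the chosen arm."""
    return float(rng.random() < true_means[arm])

def run(select, n_steps=2000):
    """Run one strategy; `select` maps (counts, sums, t) -> arm index."""
    counts = np.zeros(n_arms)   # number of pulls per arm
    sums = np.zeros(n_arms)     # total reward per arm
    total = 0.0
    for t in range(1, n_steps + 1):
        arm = select(counts, sums, t)
        r = pull(arm)
        counts[arm] += 1
        sums[arm] += r
        total += r
    return total / n_steps

def eps_greedy(counts, sums, t, eps=0.1):
    # Explore a random arm with probability eps, otherwise exploit the best estimate.
    if rng.random() < eps or counts.min() == 0:
        return int(rng.integers(n_arms))
    return int(np.argmax(sums / counts))

def ucb1(counts, sums, t):
    # Pull each arm once, then pick the largest mean + confidence bonus.
    if counts.min() == 0:
        return int(np.argmin(counts))
    means = sums / counts
    bonus = np.sqrt(2 * np.log(t) / counts)
    return int(np.argmax(means + bonus))

def thompson(counts, sums, t):
    # Sample from each arm's Beta posterior and play the best sample.
    samples = rng.beta(1 + sums, 1 + counts - sums)
    return int(np.argmax(samples))

for name, strat in [("eps-greedy", eps_greedy), ("UCB1", ucb1), ("Thompson", thompson)]:
    print(f"{name}: average reward {run(strat):.3f}")
```

Each strategy should converge toward the best arm's mean reward (0.7 here); the repository's implementations also track regret and plot results with matplotlib.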
Dependencies: numpy, matplotlib, scipy