Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

This is a blog post based on the paper "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble", which proposes the offline RL algorithm EDAC (Ensemble-Diversified Actor Critic).

There are several implementations:

Run the blog locally

pip install -r blog/requirements.txt

quarto preview blog/blog.ipynb

The main content is in blog/blog.ipynb, the references are in blog/references.bib, and the images are in blog/figures/.
