Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

This is a blog post based on the paper Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble, which proposes the offline RL algorithm EDAC (Ensemble Diversified Actor Critic).

There are several implementations:

run blog locally

pip install -r blog/requirements.txt

quarto preview blog/blog.ipynb

The main content is in blog/blog.ipynb, the references are in blog/references.bib and the images are in blog/figures/.