This is a blog post based on the paper Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble, which proposes the offline RL algorithm EDAC (Ensemble Diversified Actor Critic).
There are several implementations:

To preview the blog locally:
- install Quarto
- clone this repo
- run `pip install -r blog/requirements.txt`
- run `quarto preview blog/blog.ipynb`

The main content is in `blog/blog.ipynb`, the references are in `blog/references.bib`, and the images are in `blog/figures/`.