❓ Question
Hi, I was looking into the code inside the SAC algorithm, and I encountered an odd line here.
When comparing to make the update of the target, should not be using self._n_updates, or something similar instead gradient_step? such variable is always zero, hence, is always entering in the condition?.
There is maybe any specific reason to do it this way?
Thanks!
Checklist