In this line:

```python
backward(total_loss, scaler)
```
we are accumulating the gradients and performing the optimizer step only after accumulating gradients for `accum_freq` steps.
I am wondering whether we need to divide `total_loss` by `accum_freq` to scale the loss properly.
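
For reference, here is a minimal, self-contained sketch of the plain gradient-accumulation pattern I have in mind, assuming the loss is a mean over each micro-batch; the model, optimizer, and data here are placeholders, not the actual training-script objects:

```python
import torch
from torch import nn

# Placeholder setup; the real model/optimizer/scaler come from the training script.
model = nn.Linear(10, 1).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
scaler = torch.cuda.amp.GradScaler()
accum_freq = 4  # number of micro-batches to accumulate before one optimizer step

optimizer.zero_grad()
for step in range(100):
    x = torch.randn(8, 10, device="cuda")
    y = torch.randn(8, 1, device="cuda")
    with torch.autocast(device_type="cuda"):
        total_loss = nn.functional.mse_loss(model(x), y)
    # Dividing by accum_freq makes the summed (accumulated) gradients equal
    # the gradient of the mean loss over the accum_freq micro-batches.
    scaler.scale(total_loss / accum_freq).backward()
    if (step + 1) % accum_freq == 0:
        scaler.step(optimizer)   # unscales grads, then calls optimizer.step()
        scaler.update()
        optimizer.zero_grad()
```

Without the division, the accumulated gradient corresponds to the *sum* of the micro-batch losses rather than their mean, which effectively multiplies the learning rate by `accum_freq`.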