Replies: 1 comment
-
and another error..
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Ive noticed this new optimization method, it is suggested this is this suitable argument to use,
decouple=True weight_decay=0.6
What do the different values do? Is there an optimal range of decay?
Many thanks
EDIT:
Actually getting an error including weight_decay 0.6
Beta Was this translation helpful? Give feedback.
All reactions