-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
@sethaxen if i remember correctly you suggested using p=1/N for the augmented softmax, I can see that the RMSE plots for that version are near straight lines or weird curves in some parametrizations and alright in some, the error isn't high or something but yeah. they come out similar to the rest when i take p=0.5 or something.
in contrast with this for p=0.5. do you have any thoughts about which values of p we should we go for in the actual paper