Open
Description
I've been trying to replicate the results of your leaderboard, but I found a number of things confusing (based on the "medium" data in the linked colab):
- leaderboard is based on "realworld" level, but colab is based on "medium" level, do you have ready medium results?
- using a vgg-16 model (the one found in mc_dropout/model) and training, I found the below results:
for deterministic:
(accuracy with pink the deterministic)
and for mc_dropout:
with numbers (first is mc_dropout and second is deterministic)
In your paper mc_dropout outperformed the deterministic approach by a quite a bit, I didn't expect the deterministic approach to perform so badly, these results seem a bit more sensible but not to this other extent, can you find the reason for this discrepancy?
- AUC results behave weirdly:
for mc_dropout
here is a colab to replicate the above
also recommend updating your linked colab with the proper required packages as in it's current form it does not run
Metadata
Metadata
Assignees
Labels
No labels