replicating results of leaderboard

I've been trying to replicate the results of your leaderboard, but I found a number of things confusing (based on the "medium" data in the linked colab):
1) leaderboard is based on "realworld" level, but colab is based on "medium" level, do you have ready medium results?
2) using a vgg-16 model (the one found in mc_dropout/model) and training, I found the below results:

for deterministic: 
![image](https://user-images.githubusercontent.com/25182234/76882009-66d30800-6850-11ea-9df5-5cfd50fac59b.png) (accuracy with pink the deterministic)

and  for mc_dropout:
![image](https://user-images.githubusercontent.com/25182234/76882517-2758eb80-6851-11ea-9405-2d8e3e03f373.png)
with numbers (first is mc_dropout and second is deterministic)
![image](https://user-images.githubusercontent.com/25182234/76882696-6ab35a00-6851-11ea-84cc-78425fd8d97f.png)

In your [paper ](https://arxiv.org/abs/1912.10481) mc_dropout outperformed the deterministic approach by a quite a bit, I didn't expect the deterministic approach to perform so badly, these results seem a bit more sensible but not to this other extent, can you find the reason for this discrepancy? 

3) AUC results behave weirdly: 
for mc_dropout

![image](https://user-images.githubusercontent.com/25182234/76883194-1eb4e500-6852-11ea-9c79-7d74a339dc15.png)


[here is a colab to replicate the above ](https://colab.research.google.com/drive/1eyRquycs6PFNoCTJVLcPym8g8ptddci8
)
also recommend updating your linked colab with the proper required packages as in it's current form it does not run


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

replicating results of leaderboard #8

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

replicating results of leaderboard #8

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions