
replicating results of leaderboard #8

Open
@husseinmozannar


I've been trying to replicate your leaderboard results using the "medium" data from the linked colab, and I found a number of things confusing:

  1. The leaderboard is based on the "realworld" level, but the colab is based on the "medium" level. Do you have results for the "medium" level available?
  2. After training a VGG-16 model (the one found in mc_dropout/model), I got the results below:

for deterministic:
image (accuracy; pink is the deterministic model)

and for mc_dropout:
image
with the numbers (mc_dropout first, deterministic second):
image

In your paper, mc_dropout outperformed the deterministic approach by quite a bit, and I didn't expect the deterministic approach to perform so badly there. The results above seem a bit more sensible to me, but they differ from yours more than I would expect. Can you find the reason for this discrepancy?

  3. The AUC results behave weirdly for mc_dropout:

image

Here is a colab to replicate the above.
I'd also recommend updating your linked colab with the required packages, as in its current form it does not run.
