Any place I can find the exact training method/configuration used for the pretrained models? #1602

Answered by rwightman
rezafuru asked this question in Q&A

@rezafuru the checkpoints come from a WIDE variety of sources, and even for my own training I didn't link to specific hparams. Keeping that sort of info synced and up to date is more than a one-person job.

It's a double-edged sword. For every person who understands that any set of hparams needs adjusting for different GPU counts, etc., there are more who don't get this and file complaint issues: "cannot reproduce this result exactly on my ... blah blah".
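
To illustrate the GPU-count point, here is a minimal sketch of one common heuristic, linear learning-rate scaling with global batch size. This is an assumption for illustration only, not the recipe behind any particular checkpoint; the function name `scale_lr` and all of the numbers are hypothetical.

```python
def scale_lr(base_lr: float,
             base_global_batch: int,
             per_gpu_batch: int,
             num_gpus: int,
             grad_accum_steps: int = 1) -> float:
    """Rescale a published learning rate to a different hardware setup.

    Linear scaling heuristic: keep lr / global_batch_size constant.
    This is a starting point for tuning, not a guarantee of matching results.
    """
    # Effective number of samples contributing to each optimizer step.
    global_batch = per_gpu_batch * num_gpus * grad_accum_steps
    return base_lr * global_batch / base_global_batch


# Hypothetical published setting: lr=0.1 at a global batch size of 256.
lr_8_gpus = scale_lr(0.1, 256, per_gpu_batch=128, num_gpus=8)  # global batch 1024
lr_2_gpus = scale_lr(0.1, 256, per_gpu_batch=128, num_gpus=2)  # global batch 256
print(lr_8_gpus, lr_2_gpus)
```

Even with the learning rate rescaled this way, warmup length, augmentation strength, and other settings can interact with batch size, so exact reproduction on different hardware is not guaranteed.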

As I update the various models to support the new multi-weight pretrained configs, I am trying to add tags that differentiate some of the models a bit better by training type. I have some internal 'codes' for my hparam sets that will be put in a g…

Replies: 2 comments 1 reply

Answer selected by rezafuru