Replies: 1 comment 2 replies
-
1 epoch means going through all the images once (including repeats) Also I'd recommend that instead of 100 repeats, you'd do 10 repeats and 10 epochs. I don't train anything for more than 3000-4000 steps which is with my batch size of 4-6 less than 1000 steps. Because if it isn't working with that, it ain't working and I need to rethink the whole things. Most things I train I get done at like 2500 steps. Here is a thought thou. Give this a try, which is what I do. I personally am firm believer in "You don't need reg images by default", this is because by testing and talking with people who also train a lot... We been just able to get things done without them - almost always. Only in some specific cases where the training is failing to catch on I add regularisation - but I generally have bad luck with getting them to work. I have realised that if it doesn't work without, it probably wont work with them and this is because I am doing something fundamentally wrong. And I'm nore sure if I understoof your question right. But the order of images in training is set by the seed. |
Beta Was this translation helpful? Give feedback.
-
Hi all,
I have 5000 reg images and 50 training images of myself.
Right now, I'm using 100 repeats so that I can make use of all 5k reg images, because 50*100 gives me 5k training samples.
In total, this yields 501002 = 10k steps.
I noticed, that quite early, I'd say at step 2-3k, the models are quite good already.
To avoid overfitting, I want to use the "earliest" checkpoint (least number of steps) which fits my needs w.r.t. reproducing my model.
Now the question is:
let's assume the earliest modell which fits my needs is already at around step 2000 and I chose this one.
It means, 1k steps were reg-images, 1k steps are training samples.
Do those 1k samples effectively contain ALL my 50 training images (only repeated for *20 times)?
Or is every training image repeated for 100 times first and than the next image is trained, which would mean, at step 1k, only 10 samples have been used for training with 100 repeats each?
Thanks a lot!
Beta Was this translation helpful? Give feedback.
All reactions