-
Notifications
You must be signed in to change notification settings - Fork 13
Closed
Description
Hi,
Congratulations on the paper—it is truly interesting! I have a few questions regarding the implementation and the reproducibility of the results.
For the Cityscapes dataset, I downloaded the leftImg8bit images along with the gtFine annotations. Then, I used the CityscapesScripts to obtain the trainLabelIds. Is this the correct procedure to set up the data?
Additionally, I have some specific questions about the training script:
- In
train.py,optimizer.zero_grad()appears to be missing. Is this intentional? - The default learning rate in the code is 0.0001, whereas the paper mentions a
learning rateof 0.004. Which one is correct? - When enabling all the losses, the MAV loss becomes extremely large and the training becomes very unstable. Essentially, the model does not learn anything. I have tried using both learning rates mentioned above, as well as including and excluding
optimizer.zero_grad(). Why is this happening? Are there specific hyperparameters required for the algorithm to function correctly?
Lastly, if possible, it would be incredibly helpful to have a set of instructions to reproduce the results.
Thanks!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels