Strange output log #2
The uploaded log_rank0.txt is the pretraining log from one of the eight GPUs.
I also encountered the same problem.
@launchauto @michuanhaohao Me too, but I ran it with precision O0. Did you also run with O0 precision?
I ran into the same problem! The loss stays at 16 and never drops.
How can I train without apex mixed precision? When I train with Swin Transformer, the loss drops and converges, and I noticed that the Swin Transformer project does not use apex mixed precision.
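Not the repo's actual training code, just a minimal sketch of how apex mixed precision is usually enabled or skipped: mixed precision only takes effect after `amp.initialize`, so either passing `opt_level="O0"` (pure FP32) or not wrapping the model/optimizer at all amounts to training without it. The tiny model, dummy data, and the `USE_AMP` flag below are placeholders, not this project's API.

```python
import torch
import torch.nn as nn
from apex import amp

model = nn.Linear(128, 10).cuda()                        # stand-in for the real model
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()

USE_AMP = False                                          # False -> plain FP32 training, no apex
if USE_AMP:
    # "O1" is the usual mixed-precision level; "O0" would keep everything in FP32
    model, optimizer = amp.initialize(model, optimizer, opt_level="O1")

x = torch.randn(32, 128).cuda()
y = torch.randint(0, 10, (32,)).cuda()

loss = criterion(model(x), y)
optimizer.zero_grad()
if USE_AMP:
    with amp.scale_loss(loss, optimizer) as scaled_loss:  # apex handles loss scaling
        scaled_loss.backward()
else:
    loss.backward()
optimizer.step()
```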
Is it normal for the loss value to be around 16? Has anyone encountered this problem? |
Have you solved this problem?
Me too.
Excuse me, have you solved the problem where the loss drops to 8.9 and then rises back up? Is it caused by apex mixed-precision training?
Not yet /(ㄒoㄒ)/~~
Could it be a problem with the loss function? Are you still maintaining this code? My loss has been 16 from the start and never goes down.
I haven't solved it either.
Hi authors, I have pretrained your moby_swin_tiny model on 8 Tesla V100 GPUs and reproduced your results on the downstream tasks: 74.394% on linear evaluation, 43.1% on COCO object detection, and 39.3% on COCO segmentation. But the loss and grad_norm are really weird during training. Can you show me your log?
Here is my log. The loss drops to 7, then rises back to 16 and never drops again. During pretraining, the average grad norm sometimes becomes infinite.
log_rank0.txt
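For anyone hitting the infinite grad_norm, here is a small diagnostic sketch (not from this repo; the helper name and where it is called are hypothetical) that computes the global gradient norm after `backward()` and flags non-finite values so the offending step can be inspected or skipped:

```python
import torch

def total_grad_norm(parameters):
    """Global L2 norm over all parameter gradients that exist."""
    norms = [p.grad.detach().norm(2) for p in parameters if p.grad is not None]
    if not norms:
        return torch.tensor(0.0)
    return torch.norm(torch.stack(norms), 2)

# Inside the training loop, right after loss.backward():
#   grad_norm = total_grad_norm(model.parameters())
#   if not torch.isfinite(grad_norm):
#       # corresponds to the inf grad_norm reported in the log; common mitigations
#       # are gradient clipping or skipping the optimizer step for this batch
#       optimizer.zero_grad()
```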