Skip to content

out of mamory on gpu 0 #16171

@gunh4mmer

Description

@gunh4mmer

🔎 Search before asking

  • I have searched the PaddleOCR Docs and found no similar bug report.
  • I have searched the PaddleOCR Issues and found no similar bug report.
  • I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (问题描述)

使用ocrv5的server模型训练det,train和val的batch均为1,训练过程正常,val时第二张图像即爆显存(图像较大,均为1200w-2000w像素),根据#6989添加了limit_type_side: 960、limit_type: max,但是仍然没效果;

再次测试,将val数据集减少到一张图片,发现训练时显存占用3.8gb,val时占用15gb,并且首次val之后继续训练时显存仍然保持在13gb以上,第二次val时也炸了显存。

请求提供帮助

PP-OCRv5_server_det - 副本.txt

🏃‍♂️ Environment (运行环境)

ubuntu20.04 paddleocr3.1 paddlepaddle3.1

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

train.py

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions