Got errors when initiating training #8

```
[rank6]:     self.ema_model = EMAModel(self.accelerator.unwrap_model(self.model), self.device)
[rank6]:                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank6]:   File "/mnt/workspace/miniconda3/envs/blip3o/lib/python3.11/site-packages/accelerate/accelerator.py", line 2866, in unwrap_model
[rank6]:     return extract_model_from_parallel(model, keep_fp32_wrapper, keep_torch_compile)
[rank6]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank6]:   File "/mnt/workspace/miniconda3/envs/blip3o/lib/python3.11/site-packages/accelerate/utils/other.py", line 180, in extract_model_from_parallel
[rank6]:     model = model._orig_mod
[rank6]:             ^^^^^^^^^^^^^^^
[rank6]:   File "/mnt/workspace/miniconda3/envs/blip3o/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1931, in __getattr__
[rank6]:     raise AttributeError(
[rank6]: AttributeError: 'DistributedDataParallel' object has no attribute '_orig_mod'
```
This error seems to be related to DDP and accelerate, but I could find hardly any information about it online. Do you have any idea how to address this?
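
For context, the traceback shows accelerate's `extract_model_from_parallel` reading `_orig_mod` directly on a `DistributedDataParallel` object. `_orig_mod` is the attribute that `torch.compile` sets on the `OptimizedModule` wrapper it returns, whereas DDP keeps the wrapped model in `.module`. One possible reading, which is a guess and not confirmed anywhere in this thread, is that the installed accelerate version treated the model as compiled while its outermost wrapper was still DDP. A minimal sketch of a workaround under that assumption is to peel the wrappers off by hand instead of calling `unwrap_model`. The helper name `unwrap_by_hand` is hypothetical; it matches wrappers by class name so the snippet has no import-time dependency on torch.

```python
# Hypothetical helper: not part of accelerate, torch, or the training repo.
# It matches wrapper classes by name, which is an assumption that holds for
# stock PyTorch but would miss custom wrapper subclasses with other names.
_PARALLEL_WRAPPERS = {"DistributedDataParallel", "DataParallel"}
_COMPILE_WRAPPER = "OptimizedModule"  # class returned by torch.compile for an nn.Module


def unwrap_by_hand(model):
    """Strip DDP/DataParallel and torch.compile wrappers in whatever order they nest."""
    while True:
        name = type(model).__name__
        if name in _PARALLEL_WRAPPERS:
            # DDP and DataParallel store the wrapped model in `.module`.
            model = model.module
        elif name == _COMPILE_WRAPPER:
            # torch.compile stores the original module in `._orig_mod`.
            model = model._orig_mod
        else:
            # No known wrapper left: this is the bare model.
            return model
```

If this fits the setup, the failing line could be tried as `EMAModel(unwrap_by_hand(self.model), self.device)`. It may also be worth checking whether a different accelerate release behaves differently, since the failing code lives in `accelerate/utils/other.py` and not in the training script.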
