CPU Offloading

I saw the line `pipe.enable_model_cpu_offload()` here https://github.com/instantX-research/InstantIR/blob/main/pipelines/sdxl_instantir.py#L113C13-L113C44 and tried the approach with the gradio app, but get the following error:

```
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)
```

What else needs to be done in the code to fix this error and make this work?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CPU Offloading #8

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

CPU Offloading #8

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions