How to use quantizer after pipeline loaded?

How to use quantizer after pipeline loaded? 

- Currently

```python
# Quantization occurs at load time.
pipe = QwenImagePipeline.from_pretrained(
    (
        args.model_path
        if args.model_path is not None
        else os.environ.get(
            "QWEN_IMAGE_DIR",
            "Qwen/Qwen-Image",
        )
    ),
    scheduler=scheduler,
    torch_dtype=torch.bfloat16,
    quantization_config=quantization_config,
)
```

- What i want  

```python
# Load on CPU -> Load and fuse lora -> quantize -> to GPU
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to use quantizer after pipeline loaded? #12823

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How to use quantizer after pipeline loaded? #12823

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions