Conversation

@kohya-ss (Owner) commented Sep 25, 2025:

Add --fp8_scaled (block-wise fp8 quantization) for FLUX.1 and Chroma models.

See kohya-ss/musubi-tuner#564 for details about fp8 quantization.

Use --fp8_scaled instead of --fp8_base for flux_train_network.py.
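For readers unfamiliar with the technique, here is a minimal sketch of block-wise fp8 quantization. It is an illustration only, not the code in this PR: the block size, the `float8_e4m3fn` dtype, and the function names are assumptions.

```python
import torch

# Illustrative sketch of block-wise fp8 quantization; block size, dtype, and
# function names are assumptions, not taken from this PR.
def quantize_blockwise_fp8(weight: torch.Tensor, block_size: int = 64):
    """Quantize a weight tensor to fp8 with one scale per block of `block_size` values.

    Assumes weight.numel() is divisible by block_size.
    """
    fp8_dtype = torch.float8_e4m3fn
    fp8_max = torch.finfo(fp8_dtype).max                  # 448.0 for e4m3fn
    w = weight.float().reshape(-1, block_size)            # (num_blocks, block_size)
    scale = w.abs().amax(dim=1, keepdim=True) / fp8_max   # one scale per block
    scale = scale.clamp(min=1e-12)                        # avoid division by zero
    w_fp8 = (w / scale).clamp(-fp8_max, fp8_max).to(fp8_dtype)
    return w_fp8.reshape(weight.shape), scale

def dequantize_blockwise_fp8(w_fp8: torch.Tensor, scale: torch.Tensor, block_size: int = 64):
    """Reconstruct a bfloat16 weight from fp8 values and per-block scales."""
    w = w_fp8.float().reshape(-1, block_size) * scale
    return w.reshape(w_fp8.shape).to(torch.bfloat16)
```

Compared with a single per-tensor scale, per-block scales confine the effect of outlier values to the block that contains them, which is the usual motivation for block-wise quantization.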

@kohya-ss (Owner, Author) commented:

Comparison of inference results with Chroma1-HD:

bfloat16: [image: bf16_20250926_083953]

fp8_scaled (block-wise quantization): [image: fp8_scaled_20250926_084213]

fp8 (no quantization): [image: fp8_20250926_084057]

The generation settings match those at https://huggingface.co/lodestones/Chroma1-HD#how-to-use, except that the noise is generated on CUDA.

@iqddd commented Oct 6, 2025:

But why use_scaled_mm=False?
