How to debug with IREE when a neural network contains large weights? #20418
Answered by ScottTodd
FlintWangacc asked this question in Q&A
Recently I have been debugging the deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B model on IREE. Its weights are about 15 GB, so when I try to dump the IR generated by IREE it takes a lot of disk space and a lot of time. Can we separate the code from the weights? Or is there another way to do this?
I found it is possible to use iree-turbine to make the neural network weights external.
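For anyone following along, here is a minimal sketch of that iree-turbine flow. The toy model, shapes, and file names are placeholders, and the aot function names are my reading of the iree.turbine.aot API, so verify them against the current iree-turbine docs:

```python
import torch
from iree.turbine import aot

# Placeholder module standing in for DeepSeek-R1-Distill-Qwen-1.5B.
model = torch.nn.Linear(4096, 4096)
example_input = torch.randn(1, 4096)

# Mark the module's parameters as externally provided instead of inlining
# them into the exported IR as dense constants.
aot.externalize_module_parameters(model)

# Write the actual weights to an IREE parameter archive (.irpa).
aot.save_module_parameters("model_params.irpa", model)

# Export the program itself; the MLIR stays small because the weights are
# now external parameter references rather than multi-gigabyte constants.
exported = aot.export(model, example_input)
exported.save_mlir("model.mlir")
```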
Answer from ScottTodd (Mar 31, 2025):

Yep, we recommend separating model parameters (weights) from program code. Here are our docs on using parameter files:
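As a rough sketch of that workflow (the file names, the parameter scope `model`, and the entry function are placeholders; the runtime tools take `--parameters=<scope>=<file>` as described in the parameters docs):

```shell
# Compile the weight-free program; the weights live in model_params.irpa.
iree-compile model.mlir --iree-hal-target-backends=llvm-cpu -o model.vmfb

# Inspect what the parameter archive contains.
iree-dump-parameters --parameters=model=model_params.irpa

# Run, supplying the weights separately from the compiled module.
iree-run-module --module=model.vmfb \
  --parameters=model=model_params.irpa \
  --function=main \
  --input=1x4096xf32=0
```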
Also, if you are dumping IR, see the IR printing options at https://mlir.llvm.org/docs/PassManagement/#ir-printing together with https://mlir.llvm.org/getting_started/Debugging/. You can do something like --mlir-print-ir-after-all --mlir-elide-elementsattrs-if-larger=64 to dump all IR while eliding (redacting) constants with more than 64 elements. IREE adds more options on top of those, like --dump-compilation-phases-to=${PATH}, which prints after high-level phases in t…
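Putting those flags together on an iree-compile invocation might look like this (paths and the target backend are placeholders; the per-pass IR is printed to stderr, so redirect it to a file):

```shell
# Dump IR after every pass, eliding any elements attribute with more than
# 64 elements so the multi-gigabyte weights never reach the log.
iree-compile model.mlir --iree-hal-target-backends=llvm-cpu -o model.vmfb \
  --mlir-print-ir-after-all \
  --mlir-elide-elementsattrs-if-larger=64 \
  2> ir_after_all.mlir

# Coarser-grained alternative: one IR snapshot per major compilation phase.
iree-compile model.mlir --iree-hal-target-backends=llvm-cpu -o model.vmfb \
  --dump-compilation-phases-to=/tmp/phases
```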