Feat/quantization mode #179

ptoupas · 2025-11-11T16:05:56Z

Purpose

Introduce a new quantization related feature to the modelconverter tool by adding an optional, high-level configuration parameter (quantization_mode) to simplify model quantization during RVC4 convertion.
The goal is to make quantization strategies more accessible through the UI, while still allowing full control through advanced flags.

Specification

Introduced a new quantization_mode parameter for RVC4 model conversion, which has 5 available options: [INT8_STANDARD, INT8_ACCURACY_FOCUSED, INT8_INT16_MIXED, FP16_STANDARD, CUSTOM]. Defaults to INT8_STANDARD and the current default behavior of the quantized conversion should not be affected.
When this parameter is set to anything but CUSTOM it will take precedence over any user-defined SNPE flags provided via the snpe_onnx_to_dlc_args, snpe_dlc_quant_args, and snpe_dlc_graph_prepare_args.
Removed the compress_to_fp16 parameter which is now replaced by the FP16_STANDARD mode in the quantization_mode parameter.
When the quantization_mode is set to INT8_INT16_MIXED a .json encodings file is automatically generated to keep the inputs and outputs of the model into 8-bit precision so that the generated DLC can be executed and be compatible with DAI.
The remaining RVC4 related parameters such as use_per_channel_quantization, use_per_row_quantization, optimization_level, and others, should not be affected by the changes introduced in that PR.

Dependencies & Potential Impact

None / not applicable

Deployment Plan

None / not applicable

Testing & Validation

Tested the conversion of the following models under different configurations and settings through the CLI.

…ertion.

modelconverter/hub/README.md

klemen1999

Left some minor comments but generally LGTM.

modelconverter/packages/rvc4/exporter.py

modelconverter/utils/types.py

ptoupas added 2 commits November 11, 2025 17:47

Add QuantizationMode and removed compress_to_fp16 option on RVC4 conv…

3861659

…ertion.

Add RVC4 quantization_mode parameter to README

248214b

klemen1999 reviewed Nov 12, 2025

View reviewed changes

modelconverter/hub/README.md Show resolved Hide resolved

ptoupas added 4 commits November 12, 2025 11:08

Revert changes on hub convert.

22c60d2

Remove 'quantization_mode' from hub README

a32c83a

Add rvc4 disable_calibration flag to defaults.taml file

ae49a12

Update README with info on the new rvc4.quantization_mode option.

aa01f08

ptoupas marked this pull request as ready for review November 12, 2025 11:59

ptoupas requested a review from a team as a code owner November 12, 2025 11:59

ptoupas requested review from conorsim, klemen1999, kozlov721 and tersekmatija and removed request for a team November 12, 2025 11:59

ptoupas self-assigned this Nov 12, 2025

ptoupas added enhancement New feature or request RVC4 Changes affecting RVC4 export labels Nov 12, 2025

kozlov721 and others added 2 commits November 12, 2025 13:42

updated action

ad7315e

Remove the addition of the qcs8550 soc when converting to fp16 for RVC4.

1bf9e09

klemen1999 approved these changes Nov 13, 2025

View reviewed changes

modelconverter/packages/rvc4/exporter.py Outdated Show resolved Hide resolved

modelconverter/utils/types.py Outdated Show resolved Hide resolved

Addressed PR comments.

18c0787

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/quantization mode #179

Feat/quantization mode #179

Uh oh!

ptoupas commented Nov 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

klemen1999 left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Feat/quantization mode #179

Are you sure you want to change the base?

Feat/quantization mode #179

Uh oh!

Conversation

ptoupas commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Specification

Dependencies & Potential Impact

Deployment Plan

Testing & Validation

Uh oh!

Uh oh!

klemen1999 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ptoupas commented Nov 11, 2025 •

edited

Loading