Use llama.cpp quantization with QAT finetune #11170
Unanswered
anyinlover asked this question in Q&A
I'm finetuning a 1B model with QAT using the torchtune library, but I wonder whether it matches the llama.cpp quantization algorithm. Does anyone have experience with this?
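
For context, my understanding is that llama.cpp's Q4_0 format (per the reference quantization code in ggml) works on blocks of 32 weights with a single scale per block, taken from the largest-magnitude element divided by -8, and 4-bit codes centered on an implicit zero-point of 8. Here is a minimal NumPy sketch of that scheme as I read it; the exact rounding in ggml may differ slightly, so treat this as an approximation rather than the reference implementation:

```python
import numpy as np

def quantize_q4_0_block(x: np.ndarray):
    """Quantize one block of 32 floats roughly the way llama.cpp's Q4_0
    does: one scale per block, values mapped to 4-bit codes in [0, 15]
    around an implicit zero-point of 8."""
    assert x.shape == (32,)
    # The scale comes from the element with the largest magnitude,
    # divided by -8 so that element lands on the edge of the 4-bit grid.
    amax = x[np.argmax(np.abs(x))]
    d = amax / -8.0 if amax != 0 else 1.0
    q = np.clip(np.round(x / d) + 8, 0, 15).astype(np.uint8)
    return d, q

def dequantize_q4_0_block(d: float, q: np.ndarray) -> np.ndarray:
    """Reconstruct the block from its scale and 4-bit codes."""
    return d * (q.astype(np.float32) - 8)

# Round-trip error on a random block.
x = np.random.randn(32).astype(np.float32)
d, q = quantize_q4_0_block(x)
print(np.abs(x - dequantize_q4_0_block(d, q)).max())
```

If I understand correctly, torchtune's QAT recipe uses torchao's `Int8DynActInt4WeightQATQuantizer`, which simulates group-wise symmetric int4 weight quantization with a configurable group size. So unless the group size and rounding happen to line up with this 32-element block scheme, the fake-quant grid trained during QAT would not exactly match the grid llama.cpp quantizes to afterwards, which is what I'm trying to confirm.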