
How to reproduce your evals? #2


Open

eldarkurtic opened this issue Apr 16, 2025 · 2 comments

@eldarkurtic

Hey folks, great work on GPTQv2!

I would like to reproduce your W4g128 evaluations from the README.md, but I am getting significantly worse results than the ones you report here.

[Image: attached evaluation results]

Could you help me reproduce your evals by:

  1. releasing your model weights on the HF Hub, or providing the exact commands to recreate them (W4g128 GPTQv2); a sketch of what I mean follows this list
  2. providing the exact lm-evaluation-harness commands you used to obtain these scores
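
For context, here is the kind of recipe I am trying (a minimal sketch, assuming GPTQModel's Python API; the base model, calibration data, and the GPTQ v2 switch are my placeholders, not your settings):

```python
from datasets import load_dataset
from gptqmodel import GPTQModel, QuantizeConfig

# Placeholder base model; substitute whichever model the README table evaluates.
model_id = "meta-llama/Llama-3.1-8B"

# W4g128 = 4-bit weights, group size 128. The v2 flag is assumed to select the
# GPTQv2 solver in recent GPTQModel releases; the exact option name may differ.
quant_config = QuantizeConfig(bits=4, group_size=128, v2=True)

# Small ad-hoc calibration set; the calibration data you actually used is unknown to me.
stream = load_dataset("allenai/c4", "en", split="train", streaming=True)
calibration_dataset = [row["text"] for _, row in zip(range(256), stream)]

model = GPTQModel.load(model_id, quant_config)
model.quantize(calibration_dataset)
model.save("Llama-3.1-8B-W4g128-gptqv2")
```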
yhhhli (Contributor) commented Apr 16, 2025

The results were reported by the GPTQModel package; we recommend running the test files in their repository.

We will upload the checkpoints to Hugging Face.
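
In the meantime, scores of this kind can be re-checked directly with lm-evaluation-harness (a minimal sketch via its Python entry point; the checkpoint path, task list, and batch size below are placeholders, not the exact settings behind the README numbers):

```python
import lm_eval

# Placeholder checkpoint path, task list, and batch size.
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=Llama-3.1-8B-W4g128-gptqv2",
    tasks=["arc_challenge", "hellaswag", "winogrande"],
    batch_size=8,
)
print(results["results"])  # per-task metrics
```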


Qubitium commented Apr 17, 2025

@eldarkurtic @yhhhli Please refer to the GPTQModel issue (ModelCloud/GPTQModel#1545) for full reproduction of the quantization code, the eval code, and the models uploaded to HF.
