Evalution for Qwen2 #176

kris-singh · 2025-04-06T02:30:46Z

Hi,
Thanks for the great work!

I have a couple of questions.

The README.md file says you used the conv_mode=phi for training, while the train_qwen2_base.sh file has the conv_mode=qwen2_base. Is this a typo in the README file?
To evaluate models trained with the Qwen2 model, what should the conv_mode argument be set to? Is it qwen2_base or qwen2_instruct? I am assuming that it is qwen2_instruct.
Finally, what should one consider for evaluating the model on more benchmarks, such as SugarCrepe? Is it just the conv_mode parameter or something else as well?

Update:

Moreover, I found out a bigger bug with the script files. The Phi-2 model has a max length of size 2048 and not 3072.

Thanks,
Krish

ZhangXJ199 · 2025-04-24T01:37:33Z

conv_mode is related to the type of language model. When using Qwen as the language model, use qwen2_base.
It depends on whether you're using the base model or the instruct model of Qwen2.
As mentioned above, conv_mode is only related to the language model.
We have changed it to 2048.

Provide feedback