Can't reproduce AE-LC numbers with the HF checkpoints (Llama-3-8b-SFT-DPO, Llama-3-8b-SFT-SimPO) #77

1. Every vllm version I tried conflicts with the current environment. Even if I separate the eval and training environments, I still need to know which vllm version to use.
2. In a separate env, using the command below, I tried different annotator engines (4o and 4-turbo) and got numbers that do not make sense. Have you ever tried different annotators? With 4o I get a result where DPO > SimPO, while 4-turbo gives the opposite.
![image](https://github.com/user-attachments/assets/cff6fa57-7714-4ceb-ba3d-5908e5241d51)
> alpaca_eval evaluate_from_model \
 --model_configs /mnt/vepfs/fs_users/\*\*\*/xAI-RLHF/\*\*\*/SimPO/eval/alpacaeval2/configs/Llama-3-Base-8B-SFT-SimPO.yaml \
 --annotators_config weighted_alpaca_eval_gpt4_turbo
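
To make the version mismatch in point 1 easier to pin down, here is a minimal sketch for listing what is installed in a given env, so the training and eval envs can be compared side by side. The package list is my assumption about what matters here, not something taken from the repo, and `installed_versions` is just a hypothetical helper name.

```python
from importlib import metadata

# Assumed package list; adjust to whatever the train and eval envs actually pin.
PACKAGES = ["vllm", "torch", "transformers", "alpaca_eval"]


def installed_versions(packages=PACKAGES):
    """Return {package: version string, or None if the package is not installed}."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Print one line per package so the output can be pasted into the issue.
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Running this once in each env and diffing the two outputs should show which packages disagree.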
