Skip to content

用评测代码测试 Qwen2.5-Math-1.5B 结果和 report 的结果出入比较大 #47

Open
@pipixiaqishi1

Description

@pipixiaqishi1

您好,请问 base model 的评测是有专门的 prompt 吗?直接用对 instruct 模型的评测代码测试Qwen2.5-Math-1.5B,结果与 report 结果差距有点大。

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions