Does DPO training support an optional length-penalty parameter? #4048

@leileilin

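I went through all the parameters in the latest documentation and could not find one related to this. Does that mean the feature has not been implemented?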
