Skip to content

Recommended fine-tuning recipe for 2b model #115

@zhuqiangLu

Description

@zhuqiangLu

For those who have successfully fine-tune the 2b model, could you please share your fine-tuning recipe?

I was trying fine-tune the 2b model with 20k images with --pn 0.06M and the result is not satisfying. Also, I noticed the learning rate in the training script is set to 0.006, I was wondering perhaps this value is way too high for fine-tuning.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions