都是lora训练。dpo 的create_new_adapter为false。dpo的lora参数需要和sft一致吗,比如lora_rank、lora_target这些 #8698
Unanswered
zouzoutingting
asked this question in
Q&A
Replies: 2 comments
-
|
可以发下重复的相关issue吗,我去看下 @hiyouga |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
需要一致 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Reminder
System Info
。
Reproduction
Others
No response
Beta Was this translation helpful? Give feedback.
All reactions