Replies: 1 comment 9 replies
-
Hey! This is the first request we had for distill. Could you let us know which tool that you would normally use for distillation? We'll need to then discuss internally on whether to open a CLI for this option. |
Beta Was this translation helpful? Give feedback.
9 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Now, Distill+SFT+RL is a standardized posttraining method that improves model performance and achieves good results. Hope it can be realized in axolotl .
Beta Was this translation helpful? Give feedback.
All reactions