-
Notifications
You must be signed in to change notification settings - Fork 7.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Models: Add Gemma-2-9b-it-GGUF #2803
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: ThiloteE <[email protected]>
I am a little unsure, if the Ready for review. |
Signed-off-by: ThiloteE <[email protected]>
Signed-off-by: ThiloteE <[email protected]>
Signed-off-by: ThiloteE <[email protected]>
This model is not supported on the Nomic Vulkan backend. |
First of all, thank you for the time and effort you've put into this Pull Request. Your work is greatly appreciated! 🙌 I was wondering if there’s an expected timeline for merging this PR? If there’s anything I can do to help move it forward, please let me know. Thanks again for your support and for maintaining this awesome project! |
We need configurable stop sequences first |
Describe your changes
Adds model support for Gemma-2-9b-it
Description of Model
At the date of writing, the model has strong results in benchmarks (for its parameter size). It claims to support a context of up to 8k.
Personal Impression:
For 9 billion parameters, the model has reasonable output. I tested the model with a 14k character conversation and there were no tokenizer issues and no severe repetition problems as far as I could discern. I have seen refusals when it was tasked with certain things and it seems to be finetuned with a particular alignment. Its quality of responses makes it a good model, if you can bear its alignment or your use case happens to fall within the originally intended use cases of the model. It mainly will appeal to English speaking users.
Clayton reported, the model has a tendency to keep asking questions, even if instructed not to.
Critique:
Motivation for this pull-request
Checklist before requesting a review