Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove binary state from high-level API and use Jinja templates #3147

Draft
wants to merge 22 commits into
base: main
Choose a base branch
from

Conversation

cebtenzzre
Copy link
Member

See the top commit, "WIP". Everything else is #3073.

cebtenzzre and others added 16 commits November 5, 2024 12:15
- Python bindings use `jinja2`
- server.cpp is not implemented
- chatapi.cpp is not implemented

Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
We still need this for models that don't include bos_token in their chat
template. Llama 3.1 8B Instruct sets this to false.

Signed-off-by: Jared Van Bortel <[email protected]>
Importantly, the non-chat completions endpoint (`/v1/completions`) no
longer uses a system prompt or LocalDocs, as those are not applicable.

Signed-off-by: Jared Van Bortel <[email protected]>
Before this PR, GPT4All inserted a system message into non-chat
completions, and it attempted to use LocalDocs with them. It no longer
does either of these things because they do not make sense here.

This changes the output slightly, so the test needs to be updated.

Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Signed-off-by: Jared Van Bortel <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants