Fix spacing issues with deepseek models: #3470
Conversation
1) If the think tag is empty, don't show it at all. 2) Always trim the responses for whitespace, so if the model generates <think>blah</think> [new line] final response, the final response is trimmed of the newline.
Signed-off-by: Adam Treat <[email protected]>
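For illustration, a minimal standalone sketch of the whitespace behavior described above (not the actual chatllm.cpp code paths): QString::trimmed() strips leading and trailing whitespace but leaves interior newlines alone, which is what makes both the empty-think check and the newline removal work.

#include <QDebug>
#include <QString>

int main() {
    // What the model might emit after its reasoning block: a stray newline, then the answer.
    QString afterThink = QStringLiteral("\nfinal response");
    qDebug().noquote() << afterThink.trimmed();   // prints: final response

    // An empty think block trims down to an empty string, so there is nothing to show.
    QString emptyThink = QStringLiteral("  \n  ");
    qDebug() << emptyThink.trimmed().isEmpty();   // prints: true
    return 0;
}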
gpt4all-chat/src/chatllm.cpp
Outdated
@@ -976,7 +976,7 @@ class ChatViewResponseHandler : public BaseResponseHandler {
    {
        Q_UNUSED(bufferIdx)
        try {
-           m_cllm->m_chatModel->setResponseValue(response);
+           m_cllm->m_chatModel->setResponseValue(response.trimmed());
If we trim leading whitespace here, then we no longer need to do so in onRegularResponse:
-auto respStr = QString::fromUtf8(m_result->response);
-return onBufferResponse(removeLeadingWhitespace(respStr), 0);
+return onBufferResponse(QString::fromUtf8(m_result->response), 0);
I think the reason we don't currently remove trailing whitespace while generating is that it causes a "jump": if the model outputs e.g. 10 blank lines and then one new word, the display only changes once it generates the word.
Is there any reason related to tool calling that we would need to start doing that here?
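To make the "jump" concrete, here is a small standalone sketch (the token stream and repaint loop are hypothetical, not the gpt4all streaming API): if the displayed text is derived with trimmed(), nothing visibly changes while the model emits blank lines, and the whole tail then appears at once when a word arrives.

#include <QDebug>
#include <QString>
#include <QStringList>

int main() {
    QString response;                                    // raw text accumulated so far
    QString shown;                                       // what the UI last displayed
    const QStringList tokens = {"Answer:", "\n", "\n", "\n", "done"};
    for (const QString &tok : tokens) {
        response += tok;
        const QString next = response.trimmed();         // trailing whitespace stripped
        if (next != shown) {                             // UI only repaints on change
            shown = next;
            qDebug().noquote() << "repaint:" << shown;
        }
    }
    // Only two repaints: "Answer:" at the start, then "Answer:\n\n\ndone" all at once.
    return 0;
}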
Maybe we should just trim leading whitespace here?
See the new version. Taking the temporary is gross, though.
I don't mind; it's just like glib's g_strchug (or C functions like strtok).
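For reference, a sketch of the in-place style being discussed; the real removeLeadingWhitespace in chatllm.cpp may look different, this just shows the g_strchug-like shape where the caller's (possibly temporary) string is mutated and a reference to it is handed back.

#include <QString>

// Hypothetical shape of the helper: chops leading whitespace off the caller's string
// in place and returns a reference to it, much like GLib's g_strchug does for a char*.
static QString &removeLeadingWhitespace(QString &s)
{
    int i = 0;
    while (i < s.size() && s.at(i).isSpace())
        ++i;
    s.remove(0, i);
    return s;
}

// Usage with a named temporary, as in onRegularResponse:
//     auto respStr = QString::fromUtf8(m_result->response);
//     onBufferResponse(removeLeadingWhitespace(respStr), 0);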
Signed-off-by: Adam Treat <[email protected]>
Signed-off-by: AT <[email protected]>
            auto respStr = QString::fromUtf8(result.response);
            if (!respStr.isEmpty() && (std::as_const(respStr).back().isSpace() || finalBuffers.size() > 1)) {
                if (finalBuffers.size() > 1)
-                   m_chatModel->setResponseValue(finalBuffers.last());
+                   m_chatModel->setResponseValue(finalBuffers.last().trimmed());
                else
                    m_chatModel->setResponseValue(respStr.trimmed());
                emit responseChanged();
While we're here, shouldn't this be more like:
auto respStr = finalBuffers.size() > 1 ? finalBuffers.last() : QString::fromUtf8(result.response);
if (!respStr.isEmpty() && std::as_const(respStr).back().isSpace()) {
    m_chatModel->setResponseValue(respStr.trimmed());
    emit responseChanged();
}
The purpose of this code is to trim the trailing whitespace from the end of the response if needed, since the response handler won't do it while generating. I don't see why any check on result.response is necessary if we aren't using its value.
Yeah, could be simplified.