Skip to content

qwen3如何不使用思维链微调 #4038

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
zzhiyun opened this issue Apr 29, 2025 · 1 comment
Open

qwen3如何不使用思维链微调 #4038

zzhiyun opened this issue Apr 29, 2025 · 1 comment

Comments

@zzhiyun
Copy link

zzhiyun commented Apr 29, 2025

您好,请问sft时不使用思维链的情况下是直接在微调数据里面添加对话模版中不使用思维链情况下的填充吗?

@Jintao-Huang
Copy link
Collaborator

row['query'] = row['query'] + ' /no_think'
row['response'] = '<think>\n\n</think>\n\n' + row['response']

参考这里。如果不考虑思维链的丢失情况,数据集格式和之前没区别也可以

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants