[INFO:swift] Dataset filtered, origin length: 1124869, filtered dataset length: 586472
swift version: 3.4.0
swift 3.4.0 filters the dataset. What causes this? swift 2.6.0 did not filter it.
With the command-line argument `--truncation_strategy delete`, samples whose token count exceeds `max_length` are deleted.
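To make the behavior described above concrete, here is a minimal sketch of a "delete" strategy: samples longer than the limit are dropped rather than truncated. This is not swift's actual implementation. The function `filter_too_long`, the whitespace tokenizer, and the sample data are all hypothetical stand-ins chosen for illustration.

```python
from typing import Callable, List


def whitespace_tokenize(text: str) -> List[str]:
    # Hypothetical stand-in for a real model tokenizer.
    return text.split()


def filter_too_long(
    samples: List[str],
    tokenize: Callable[[str], List[str]],
    max_length: int,
) -> List[str]:
    # "delete" strategy: drop any sample whose token count exceeds
    # max_length instead of cutting it down to size.
    return [s for s in samples if len(tokenize(s)) <= max_length]


samples = [
    "short sample",
    "this sample has far too many tokens to keep",
    "ok",
]
kept = filter_too_long(samples, whitespace_tokenize, max_length=4)
print(f"origin length: {len(samples)}, filtered length: {len(kept)}")
```

Running a check like this with the real tokenizer and the configured `max_length` is one way to tell whether length alone accounts for the gap between the original and filtered dataset lengths.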
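The samples are not being deleted for exceeding max_length; they are deleted during the dataset map processing step.
Take a look at the error message above.
It is printed when filtering happens.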