@jmichaelov
Contributor

@jmichaelov jmichaelov commented Sep 11, 2025

  • Individual task names as specified in the README now match the names in the .yaml files.
  • The trust_remote_code argument is now set to true (needed for the dataset to load).

@jmichaelov jmichaelov changed the title from "Correct lambada_multilingual_stablelm task names in README" to "Fix lambada_multilingual_stablelm" Sep 11, 2025
@baberabb
Contributor

Hi! datasets removed support for trust_remote_code (and for script-based datasets on the Hub) in the latest release (4.0). We have temporarily pinned datasets to 3.6 while we identify and update the affected datasets on the Hub. In the meantime, you should be able to pass --trust_remote_code from the CLI for the tasks that require it.
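
To illustrate the version cutoff described above, here is a minimal sketch that decides whether trust_remote_code can still be passed, based only on a datasets version string. The helper names supports_trust_remote_code and load_kwargs are hypothetical and exist only for this example; they are not part of datasets or the evaluation harness, and the check assumes a plain "major.minor.patch" version string.

```python
def supports_trust_remote_code(datasets_version: str) -> bool:
    """Return True if the given `datasets` version predates 4.0.

    Hypothetical helper: per the comment above, support for
    trust_remote_code was removed in the 4.0 release.
    """
    major = int(datasets_version.split(".")[0])
    return major < 4


def load_kwargs(datasets_version: str) -> dict:
    """Build extra keyword arguments for loading a script-based dataset.

    Hypothetical helper: only includes trust_remote_code when the
    installed version is old enough to accept it.
    """
    if supports_trust_remote_code(datasets_version):
        return {"trust_remote_code": True}
    return {}


if __name__ == "__main__":
    print(load_kwargs("3.6.0"))  # {'trust_remote_code': True}
    print(load_kwargs("4.0.0"))  # {}
```

With the pinned 3.6 release the argument is included; from 4.0 onward it is omitted, which is why the affected datasets need to be re-uploaded in a script-free format.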

If you have the bandwidth, I'd appreciate it if you could upload an updated dataset. Otherwise I'll try doing it!

@baberabb
Contributor

Adapted the datasets-cli script to convert the dataset to Parquet.

@baberabb baberabb merged commit 54e606f into EleutherAI:main Nov 19, 2025
6 checks passed