Skip to content

new dataset commonsense_qa#13

Open
wendlerc wants to merge 1 commit intobasefrom
dataset/commonsense_qa
Open

new dataset commonsense_qa#13
wendlerc wants to merge 1 commit intobasefrom
dataset/commonsense_qa

Conversation

@wendlerc
Copy link
Collaborator

Included a new dataset: Commonsense QA.

Getting started:

  1. python scripts/download/commonsense_qa.py
  2. python scripts/data_generation/gsm8k_pause_injector.py --n_pauses_per_patterns {} --n_pauses_end_of_question 50 --pause_token "<PAUSE>" --tokenizer_hf_name /dlabscratch1/wendler/code/nanotron/hf_models/tinyllamapause --augm_dataset_save_location data/commonsense_qa_injected_50 --dataset_location data/commonsense_qa
  3. python src/trl_train.py experiment=trl_train/sft_pause_tiny_llama_pause_qa rl_algorithm.policy.model.language_model.config.detach_ctrl_tok_clf=True;

@wendlerc
Copy link
Collaborator Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments