What is the training config?

Hello, thanks for your work! I want to try to implement this work myself, but I cann't achieve the high performance by xP3 and mT0-xxl as shown in the paper Crosslingual Generalization through Multitask Finetuning. I wonder the training details of this work, how many steps do you train the model, and what is your lr-decay-ratio? Could I get the config file to implement your result? Thank you very much!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What is the training config? #6

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What is the training config? #6

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions