Skip to content

Setting hugging face datasets in SpeechLM2 #14268

@mhenrichsen

Description

@mhenrichsen

Hi,

I looked through your Canary-qwen example config and was wondering if it's possible to use HF datasets. If not, what format do the trainer expect the datasets to be in?

Column names etc.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions