Should the prompts is masked/Ignored when the loss is calculated?

Should the prompts (<|im_start|>user ....) part should counted when the loss is calculated?

<img width="889" height="340" alt="Image" src="https://github.com/user-attachments/assets/8dd77d04-bcfa-4503-b0f4-3d485ec9eaaa" />

The prompts are the context, which will not be changed for specific task, during the inference. Should the prompts parttern be learned during the training?

Namely，should we make modification：
`
tgt = ids0 + tgt_audio + ids1 
`
to
`
tgt = [IGNORE_TOKEN_ID] * len(ids0) + tgt_audio + [IGNORE_TOKEN_ID] * len(ids1)
`

in [extractor_touch_asu](https://github.com/wenet-e2e/west/blob/b7b50520ed85b11c48ed63eda44025392550f299/west/models/touch_asu/extractor_touch_asu.py#L117)




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should the prompts is masked/Ignored when the loss is calculated? #94

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Should the prompts is masked/Ignored when the loss is calculated? #94

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions