Skip to content

Should the prompts is masked/Ignored when the loss is calculated? #94

@jackyguo624

Description

@jackyguo624

Should the prompts (<|im_start|>user ....) part should counted when the loss is calculated?

Image

The prompts are the context, which will not be changed for specific task, during the inference. Should the prompts parttern be learned during the training?

Namely,should we make modification:
tgt = ids0 + tgt_audio + ids1
to
tgt = [IGNORE_TOKEN_ID] * len(ids0) + tgt_audio + [IGNORE_TOKEN_ID] * len(ids1)

in extractor_touch_asu

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions