How can T5 model  output options ABCDE directly?

Hi, this is really interesting and excellent work!

Here I have a question that when I used command 'sh run_scripts/sevila/inference/nextqa_infer.sh' on Next-qa dataset for 0-shot test, I found T5 model would genarate answers like '['Option A']'(using self.t5_tokenizer.batch_decode(outputs_qa.sequences, skip_special_tokens=True)). Thus pred_logits_qa = outputs_qa.scores[1] make sense. 
![image](https://github.com/user-attachments/assets/23715d52-0091-4e19-8c30-4514dbb02026)

I am curious why the T5 model can follow instructions so well and output options directly? But when I use the same prompts to ask questions to large language models, such as LLaVa, it output a piece of text instead of the options like ABCD. Is it because the pre-trained model is also trained on multiple choice questions?

Thanks！

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How can T5 model output options ABCDE directly? #28

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How can T5 model output options ABCDE directly? #28

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions