Question about baseline reward in `caption_mplug_scst.py`

The [code](https://github.com/X-PLUG/mPLUG/blob/c666bfa1044bde5a6ce47fa1b4ae22d7bf9de633/caption_mplug_scst.py#L84-L86) in this repo computes the baseline reward by averaging the rewards of the generated captions. However, the [original SCST implementation](https://github.com/ruotianluo/self-critical.pytorch), as well as some other SCST implementations (e.g., [VALOR](https://github.com/TXH-mercury/VALOR)), compute the baseline reward from the caption generated by greedy search. Is there a reference or explanation for the current implementation in this repo? I would really appreciate any help.
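
To make the difference concrete, here is a minimal sketch of the two baselines as I understand them. This is not the code from any of the linked repos: the function names, the reward values, and the assumption that rewards are grouped per image are all hypothetical and only meant to illustrate the question.

```python
from statistics import mean


def advantages_mean_baseline(sample_rewards):
    """Baseline = mean reward of the sampled captions (hypothetical helper).

    This mirrors my reading of the averaging approach: each sampled caption
    is compared against the average reward of the group it belongs to.
    """
    baseline = mean(sample_rewards)
    return [r - baseline for r in sample_rewards]


def advantages_greedy_baseline(sample_rewards, greedy_reward):
    """Baseline = reward of the greedy-search caption (hypothetical helper).

    This mirrors the greedy-baseline variant: each sampled caption is
    compared against the reward of the greedy-decoded caption.
    """
    return [r - greedy_reward for r in sample_rewards]


# Made-up rewards (e.g., CIDEr-like scores) for three sampled captions of one image.
sample_rewards = [1.0, 0.5, 1.5]
# Made-up reward of the greedy-search caption for the same image.
greedy_reward = 1.25

mean_adv = advantages_mean_baseline(sample_rewards)
greedy_adv = advantages_greedy_baseline(sample_rewards, greedy_reward)

print(mean_adv)    # advantages relative to the sample mean
print(greedy_adv)  # advantages relative to the greedy caption
```

With the mean baseline the advantages always sum to zero within a group, while with the greedy baseline their sign depends on whether a sample beats the greedy caption. I would like to know which reasoning motivated the choice made here.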

Question about baseline reward in `caption_mplug_scst.py` #9

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about baseline reward in caption_mplug_scst.py #9

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Question about baseline reward in `caption_mplug_scst.py` #9