Analysis of the hallucination benchmark result in Appendix of your paper #1

@laserwave

Hi, nice work.

In Table 7, you report the POPE results, which decreased in some sets of experiments (comparing the with and without settings). Since your method assigns low weights to contradictory text tokens, I would expect the hallucination benchmark metrics to increase rather than decrease.

Do you have any comments on this? Thank you.
