LongForm Semantic Entropy

**Is your feature request related to a problem? Please describe.**
We would like to support the long-form version of Semantic Entropy proposed by [Farquhar et al., 2024](https://www.nature.com/articles/s41586-024-07421-0). The approach has four steps:
1. Decompose an LLM response into individual 'factoids'.
2. For each factoid, create a small set of questions for which the factoid is the answer.
3. For each question created in step (2), generate $m$ sampled responses and compute semantic entropy over them.
4. Average the semantic entropy across the factoids in the response to get the long-form semantic entropy score.

<img width="1002" height="556" alt="Image" src="https://github.com/user-attachments/assets/b6af5705-dfa2-4427-a702-9313e4212ac4" />
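The four steps above could be wired together roughly as follows. This is only a sketch to anchor discussion, not a proposed API: `decompose`, `make_questions`, `sample_answers`, and `cluster` are hypothetical callables standing in for the LLM-backed pieces, and the entropy here is the discrete (cluster-frequency) estimate rather than a likelihood-weighted one.

```python
import math
from collections import Counter
from typing import Callable, Hashable, Sequence


def discrete_semantic_entropy(cluster_ids: Sequence[Hashable]) -> float:
    """Entropy (in nats) of the empirical distribution over semantic clusters."""
    counts = Counter(cluster_ids)
    total = len(cluster_ids)
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def longform_semantic_entropy(
    response: str,
    decompose: Callable[[str], list[str]],        # step 1 (hypothetical, LLM-backed)
    make_questions: Callable[[str], list[str]],   # step 2 (hypothetical, LLM-backed)
    sample_answers: Callable[[str, int], list[str]],          # step 3: m samples
    cluster: Callable[[str, list[str]], list[Hashable]],      # semantic cluster id per answer
    m: int = 5,
) -> float:
    """Average per-factoid semantic entropy over all factoids in `response`."""
    factoid_scores = []
    for factoid in decompose(response):
        question_entropies = []
        for question in make_questions(factoid):
            answers = sample_answers(question, m)
            cluster_ids = cluster(question, answers)
            question_entropies.append(discrete_semantic_entropy(cluster_ids))
        if question_entropies:  # skip factoids for which no question was produced
            factoid_scores.append(sum(question_entropies) / len(question_entropies))
    if not factoid_scores:
        raise ValueError("no scorable factoids found in response")
    # step 4: average over factoids
    return sum(factoid_scores) / len(factoid_scores)
```

Passing the LLM-backed pieces in as callables keeps the scoring logic testable with simple stubs; how they map onto existing classes in the library is left to the implementer.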

**Describe the solution you'd like**
Assigning this issue to @mohitcek; I trust his decision-making on the design.

**Describe alternatives you've considered**
Only offering short-form (regular) semantic entropy.

**Additional context**
The claim-to-question step will require an LLM. Let's use the same prompt as in the original paper.


LongForm Semantic Entropy #195