Skip to content

Conversation

nayana1729
Copy link

This PR integrates the HELMET evaluation task into lighteval. Files added include:

  • helmet.py task implementation
  • json datasets: asqa_revised.json and qampari_revised.json
  • test_helmet.py tests to check dataset loading and prompt retrieval (uses pytest)

References issue: #731

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants