ItaEval

This repository contains the configuration and code utilities to run the ItaEval evaluation suite.

The repository is a fork of the lm-eval-harness. We last aligned on 1980a13.

Exploring the Suite

All the configuration file are under lm_eval/tasks/ita_eval. We also have included it as a "benchmark" under lm_eval/tasks/benchmarks.

Getting Started

We release several runner bash scripts to run base and chat models against the suite. Head to bash/ to find them.

Note that the recipes listed in the folder are tailored to our hardware and you will very likely need to adapt them to yours.

Run your own model

In a scenario where all of the dependencies are installed correctly, you should be able to run your model on ItaEval with

MODEL="your-model-id-on-the-huggingface-hub"
lm_eval --model hf \
    --model_args pretrained=${MODEL},dtype=bfloat16 \
    --tasks ita_eval \
    --batch_size 1 \
    --log_samples \
    --output_path "."

Add a model to the Leaderboard

Follow these steps:

Run the evaluation with the code above. You will end up with a folder containing a file starting with results_
Copy and push that folder into this directory: https://huggingface.co/datasets/RiTA-nlp/ita-eval-results/
Edit the model_info.yaml file to add the information about the new model(s)
Run this script from the main directory of the ita-eval-results repository.
Push the changes.

Note, points 2 through 5 require having access to the results repository.

Acknowledgments

ItaEval and TweetyIta are the results of the joint effort of members of the Risorse per la Lingua Italiana community. We thank every member that dedicated their personal time to the sprints. We thank CINECA for providing the computational resources (ISCRA grant: HP10C3RW9F).

Cite

@inproceedings{attanasio2024itaeval,
  title={ItaEval and TweetyIta: A New Extensive Benchmark and Efficiency-First Language Model for Italian},
  author={Attanasio, Giuseppe and Delobelle, Pieter and La Quatra, Moreno and Santilli, Andrea and Savoldi, Beatrice},
  booktitle={CLiC-it 2024: Tenth Italian Conference on Computational Linguistics, Date: 2024/12/04-2024/12/06, Location: Pisa, Italy},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3,598 Commits
.github/workflows		.github/workflows
bash		bash
docs		docs
examples		examples
formatting_tests		formatting_tests
lm_eval		lm_eval
scripts		scripts
templates/new_yaml_task		templates/new_yaml_task
tests		tests
.coveragerc		.coveragerc
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CITATION.bib		CITATION.bib
CODEOWNERS		CODEOWNERS
LICENSE.md		LICENSE.md
README.md		README.md
ignore.txt		ignore.txt
mypy.ini		mypy.ini
pile_statistics.json		pile_statistics.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ItaEval

Exploring the Suite

Getting Started

Run your own model

Add a model to the Leaderboard

Acknowledgments

Cite

About

Uh oh!

Languages

License

RiTA-nlp/ita-eval

Folders and files

Latest commit

History

Repository files navigation

ItaEval

Exploring the Suite

Getting Started

Run your own model

Add a model to the Leaderboard

Acknowledgments

Cite

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages