Restructure opengpt-x tasks #86

Merged
merged 41 commits into OpenGPTX:master on Jun 27, 2023
Conversation

katrinklug

Restructuring the OpenGPT-X tasks so they live in their own folder. This gives us a better overview of the tasks and makes it easier to apply upstream changes.

@KlaudiaTH
Collaborator

Reminder: Don't forget to move the tasks on branches that are not merged yet

ghstgs and others added 23 commits June 9, 2023 12:12
…patibility) ; --limit can now be a fraction of the dataset size
@KlaudiaTH
Collaborator

KlaudiaTH commented Jun 20, 2023

@katrinklug
I can't merge the PR since there are build errors caused by missing dependencies (e.g. importlib-resources). These need to be fixed first. Also, one of the build checks (flake8) failed because of formatting; it probably checks PEP 8 rules, so please format the code accordingly (running the formatting tool black should be enough).

@KlaudiaTH
Collaborator

@katrinklug #87
(I have implemented a patch for this.)

Collaborator

@katrinklug There seem to be inconsistencies with some of the arguments. For example, you included no_tokenizer_check for the "hf" and "gpt2" models but missed adding it to the other model types such as "hf-causal", "hf-causal-experimental", or "gpt3". I have added a temporary fix in ...thellmann1/workdir/lm_eval_setup/opengptx/lm-evaluation-harness/lm_eval/evaluator.py (from line 67), but arguments of this type definitely belong in the model args and need to be handled differently:

    # this is just a temporary fix: build the extra arguments here instead of
    # passing them through the model_args string
    if isinstance(model, str):
        if model_args is None:
            model_args = ""
        additional_args = {
            "batch_size": batch_size,
            "device": device,
        }
        # no_tokenizer_check is currently only wired up for these two model types
        if model in ("hf", "gpt2"):
            additional_args["no_tokenizer_check"] = no_tokenizer_check
        lm = lm_eval.models.get_model(model).create_from_arg_string(
            model_args,
            additional_args,
        )
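
For illustration, a minimal sketch of how this could look if the flag travelled with the model args instead. The names and values below are assumptions for the example, not part of this PR; it relies on the harness's usual comma-separated key=value parsing in create_from_arg_string, and the model class would still have to accept no_tokenizer_check in its constructor:

    # hypothetical usage: no_tokenizer_check passed via the model_args string,
    # so every model type receives it the same way instead of being special-cased
    import lm_eval.models

    model = "hf-causal"  # example model type, no longer special-cased
    model_args = "pretrained=gpt2,no_tokenizer_check=True"

    lm = lm_eval.models.get_model(model).create_from_arg_string(
        model_args,
        {"batch_size": 8, "device": "cuda:0"},  # runtime-only arguments
    )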

Collaborator

@katrinklug #87
I have implemented a patch for this. Can you please apply the patch?
You'll find it in lm_eval_setup/0001-Pass-trust_remote_code-from-model_args-to-AutoTokeni.patch on the lm_eval_setup branch.

e.g.
git am ~/path/to/evaluation/repo/lm_eval_setup/0001-Pass-trust_remote_code-from-model_args-to-AutoTokeni.patch
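For reference, a rough sketch of what the patch title suggests it does (this is not the actual patch contents; the model identifier below is a placeholder, and the example only uses the standard transformers AutoTokenizer API):

    # hypothetical illustration: forward trust_remote_code from the parsed
    # model_args to the tokenizer constructor
    from transformers import AutoTokenizer

    trust_remote_code = True  # would come from model_args in the harness
    tokenizer = AutoTokenizer.from_pretrained(
        "org/model-id",  # placeholder model identifier
        trust_remote_code=trust_remote_code,
    )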

KlaudiaTH merged commit 5a330e9 into OpenGPTX:master on Jun 27, 2023
3 participants