Hi, I also meet some errors when trying to reproduce the result of eight multi-task results.

I run the config gen command like: python ./launch.py gen --template mixlora --tasks "arc-c;arc-e;boolq;obqa;piqa;siqa;hellaswag;winogrande" --multi_task True --adapter_name mixlora --num_epochs 3 --batch_size 4 --micro_batch_size 1 --learning_rate 3e-4 --cutoff_len 512 before start training.
Is this due to any error of my env?

python ./launch.py run --base_model /nfsdat/home/bzzhangslm/model/LLM-Research/Meta-Llama-3___1-8B-Instruct --config moe_peft.json
[2025-02-27 21:21:04,031] MoE-PEFT: NVIDIA CUDA initialized successfully.
[2025-02-27 21:21:04,035] MoE-PEFT: Initializing pre-trained model.
[2025-02-27 21:21:04,035] MoE-PEFT: Loading model with half precision.
Loading checkpoint shards: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:09<00:00,  2.29s/it]
[2025-02-27 21:21:13,637] MoE-PEFT: Use eager as attention implementation.
[2025-02-27 21:21:13,932] MoE-PEFT: Detecting <pad> is None, setting to <eos> by default.
[2025-02-27 21:21:14,180] MoE-PEFT: Using efficient operators.
[2025-02-27 21:21:14,182] MoE-PEFT: mixlora_0 total trainable params: 241172480
[2025-02-27 21:21:14,183] MoE-PEFT: mixlora_0 total trainable params (except gates): 240123904
[2025-02-27 21:21:15,165] MoE-PEFT: Preparing data for 7 tasks
[2025-02-27 21:21:23,069] MoE-PEFT: Preparing data for ARC-Challenge
[2025-02-27 21:21:28,187] MoE-PEFT: Preparing data for ARC-Easy
[2025-02-27 21:21:33,871] MoE-PEFT: Preparing data for BoolQ
[2025-02-27 21:21:41,593] MoE-PEFT: Preparing data for OpenBookQA
README.md: 6.81kB [00:00, 14.6MB/s]                                                                                                                                                    
Traceback (most recent call last):
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft.py", line 291, in <module>
    moe_peft.train(
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/trainer.py", line 312, in train
    input_args = dispatcher.get_train_data()
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/dispatcher.py", line 290, in get_train_data
    self.__dispatch_task_in()
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/dispatcher.py", line 271, in __dispatch_task_in
    task.load_data()
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/dispatcher.py", line 85, in load_data
    self.train_token_data_ = self.dataload_function_(self.tokenizer_)
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/trainer.py", line 79, in _dataload_fn
    data = self.task_.loading_data(True, self.data_path)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/tasks/common.py", line 199, in loading_data
    data.extend(task.loading_data(is_train, None if len(path) == 0 else path))
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/llm/MoS/MoE-PEFT/moe_peft/tasks/qa_tasks.py", line 158, in loading_data
    data = hf_datasets.load_dataset(
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/miniconda3/envs/moe_peft/lib/python3.12/site-packages/datasets/load.py", line 2129, in load_dataset
    builder_instance = load_dataset_builder(
                       ^^^^^^^^^^^^^^^^^^^^^
  File "/nfsdat/home/bzzhangslm/miniconda3/envs/moe_peft/lib/python3.12/site-packages/datasets/load.py", line 1886, in load_dataset_builder
    builder_instance: DatasetBuilder = builder_cls(
                                       ^^^^^^^^^^^^
TypeError: 'NoneType' object is not callable

How can I evaluate these 8 datasets to get the eval results? Based on evaluator provided by MoE-PEFT, maybe using:

python ./evaluator.py \
    --base_model /nfsdat/home/bzzhangslm/model/LLM-Research/Meta-Llama-3___1-8B-Instruct \
    --task_name arc-c \
    --data_path arc-c \
    --lora_weights ./casual_0 \
    --load_16bit True \
    --save_file ./saved/eval/eval.json

Are the task_name and data_path settings right?
Thanks!

Evaluation on benchmarks after training. #24

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions