Description
I downloaded and extracted chinese-poetry and Erya-dataset.
The directory layout is as follows:
Erya-dataset/
dataset finetune.tgz stage_2 trans.tgz
chinese-poetry/
曹操诗集 论语 纳兰性德 诗经 四书五经 五代诗词 御定全唐詩 _config.yml LICENSE log.log README.md strains
楚辞 蒙学 全唐诗 水墨唐诗 宋词 幽梦影 元曲 images loader rank requirements.txt test_poetry.py
I ran:
python gen_dataset.py --data-root=./dataset --output=data.jsonl
It fails with:
Traceback (most recent call last):
  File "/usr/local/linxu/workp310/ancient-chat-llm/dataset/gen_dataset.py", line 304, in <module>
    get_erya_data(
  File "/usr/local/linxu/workp310/ancient-chat-llm/dataset/gen_dataset.py", line 58, in get_erya_data
    for sub_dir in data_path.iterdir():
  File "/usr/local/linxu/miniconda3/envs/ancient-chat-llm/lib/python3.10/pathlib.py", line 1017, in iterdir
    for name in self._accessor.listdir(self):
FileNotFoundError: [Errno 2] No such file or directory: 'dataset/Erya-dataset/dataset'
The folders are definitely in the right place, so I don't know what the problem is.
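
A quick way to narrow this down: the path in the error message, 'dataset/Erya-dataset/dataset', is relative, and `pathlib` resolves relative paths against the current working directory. The sketch below prints where `--data-root=./dataset` actually points and whether the sub-directory named in the traceback exists there. `resolve_data_paths` is a hypothetical helper written for this check, not part of gen_dataset.py, and the only sub-directory it assumes is the one taken from the error message.

```python
from pathlib import Path


def resolve_data_paths(data_root, subdirs=("Erya-dataset/dataset",)):
    """Return {absolute_path: exists} for each expected sub-directory.

    Hypothetical diagnostic helper, not part of gen_dataset.py.
    The default sub-directory is copied from the FileNotFoundError message.
    """
    # A relative data_root is resolved against the current working directory.
    root = Path(data_root).resolve()
    return {str(root / sub): (root / sub).is_dir() for sub in subdirs}


if __name__ == "__main__":
    print("cwd:", Path.cwd())
    # Same value that was passed as --data-root in the failing command.
    for path, found in resolve_data_paths("./dataset").items():
        print("found  " if found else "MISSING", path)
```

Run it from the same directory as the failing command. Since the command was `python gen_dataset.py` and the traceback places that script in `ancient-chat-llm/dataset/`, `./dataset` would resolve to `ancient-chat-llm/dataset/dataset/`. If Erya-dataset and chinese-poetry were extracted directly into `ancient-chat-llm/dataset/`, that would explain the error, and passing `--data-root=.` or an absolute path might be enough. This is an inference from the traceback paths, not something confirmed against the script.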