Open
Labels
bug (Something isn't working), good first issue (Good for newcomers), good-title (The title was checked by ChatGPT), help wanted (Extra attention is needed)
Description
Problem: The training section references scripts and directories that don't exist in the repo (e.g., `scripts/target/08`). The sentence "But you can use your own datasets" implies that `scripts/target/08` should already contain datasets provided by the author or development team. Is that the intent? And will retraining fail if the dataset paths are not specified?
Lines 270 to 277 in c8a8d91
> 6. You have to specify train and test dataset: set the `HOME_TRAIN_DATASET`
> environment variable
> for train dataset and the `HOME_TEST_DATASET` environment variable for test
> dataset.
> Usually, these files are in `scripts/target/08` directory after dataset
> collection (if you have not skipped it).
> But you can use your own datasets.
Suggestion 1: Add the datasets (if they aren't private) to the GitHub repository.
Suggestion 2: Add the datasets (if they aren't private) to HuggingFace and build a dataset pull pipeline. HuggingFace has documentation for this.
Suggestion 3: Find open datasets to serve as train and test examples and build a dataset pull pipeline for them.
Suggestion 4: Rewrite the sentence so it states clearly where the datasets come from and what happens when the paths are not set.
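Whichever suggestion is picked, a fail-fast check before training would answer the "will it fail?" question explicitly. The sketch below is only an illustration: the variable names `HOME_TRAIN_DATASET` and `HOME_TEST_DATASET` come from the README, while the function name `resolve_dataset_paths`, the error messages, and the assumption that the training entry point is Python are all hypothetical and not taken from the repo.

```python
import os
from pathlib import Path

# Environment variable names as quoted from the README.
DATASET_VARS = ("HOME_TRAIN_DATASET", "HOME_TEST_DATASET")


def resolve_dataset_paths(env=None):
    """Return (train_path, test_path), or raise with a readable message.

    Hypothetical helper: the repo may handle missing paths differently.
    """
    env = os.environ if env is None else env
    resolved = []
    for name in DATASET_VARS:
        value = env.get(name)
        if not value:
            # Variable is unset or empty: tell the user what to do next.
            raise RuntimeError(
                f"{name} is not set; point it at a dataset produced by the "
                "collection step or at your own dataset"
            )
        path = Path(value)
        if not path.exists():
            # Variable is set but points at nothing on disk.
            raise FileNotFoundError(f"{name}={value} does not exist")
        resolved.append(path)
    return resolved[0], resolved[1]
```

Calling `resolve_dataset_paths()` at the start of training would turn a missing or wrong path into an immediate, self-explanatory error instead of a failure somewhere later in the run.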