Improved Lora finetuning script #1179

pattplatt · 2024-03-22T13:53:38Z

Added code to check if val data is longer than train data which prevents errors if this case occurs. Also added code that allows to use of the validation data in model training except for the example prompt. These two changes could also be implemented in the other finetuning/training scripts.

…nts errors if this case occurs, also added feature to use val data in model training

litgpt/finetune/lora.py

carmocca · 2024-03-25T02:23:28Z

litgpt/finetune/lora.py

+    rand = random.randint(0, 50)
+    try:
+        instruction = val_dataloader.dataset.data[rand]["instruction"]
+    except Exception as e:
+        print(f"Import of validation data failed: {e}")
+        instruction = "Recommend a movie for me to watch during the weekend and explain the reason."


I can understand the desire for not hardcoding the instruction. On the other hand, always using the same one is useful to observe progress in the continuation.

Maybe it's best to drop this bit entirely instead

My thought was that it is helpful to get a feeling if the model generalizes well over the validation data. If you always have the same prompt you can't really tell, or am I missing something?

I see this as a preference without no right or wrong. I'll defer this decision to the folks who finetune the most: @rasbt and @awaelchli

I'd say for the generalization aspect, we already calculate the loss over the validation set. The fixed prompt here is more of a small visual check, and I do think it helps having it the same prompt.

We could potentially do it like this:

by default select a random validation set instruction (like we do now) and keep it constant over the training for visual purposes

let users override this in the config perhaps via a "rotate" argument or so. Where validation_instruction: str = "fixed" defaults to the current behavior but that might be overkill

It's not clear to me when this happens. Everytime the validation function is called?

To add what Sebastian said I would say that we can't tell very well from a single example by how much the model is improving. Whether it is a sample from the dataset or one we provide doesn't matter much. It's there as a sanity check to make sure the model eventually starts following instructions and adopting the prompt template. I am in favor of keeping it simple.

It's not clear to me when this happens. Everytime the validation function is called?

I'd say if we were to add the rotation, that would be also done every eval.interval steps (the default is 100) (which currently includes both calculating the loss over the entire validation set and then also using the one example prompt/instruction for a quick visual sanity check that the model generates coherent text.)

litgpt/finetune/lora.py

added code to check if val data is longer than train data which preve…

0cae6b4

…nts errors if this case occurs, also added feature to use val data in model training

pattplatt requested review from awaelchli, carmocca and lantiga as code owners March 22, 2024 13:53

carmocca reviewed Mar 25, 2024

View reviewed changes

rasbt reviewed Mar 26, 2024

View reviewed changes

litgpt/finetune/lora.py Outdated Show resolved Hide resolved

carmocca mentioned this pull request Mar 26, 2024

Add back support for longest sequence first #1195

Open

carmocca mentioned this pull request Apr 2, 2024

Harcoded incorrect (and repeated) validation example #796

Closed

rasbt added 2 commits April 16, 2024 09:59

Update litgpt/finetune/lora.py

e9cd2b1

Merge branch 'main' into improve_lora_finetuning

6bf87f0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved Lora finetuning script #1179

Improved Lora finetuning script #1179

pattplatt commented Mar 22, 2024

carmocca Mar 25, 2024

pattplatt Mar 25, 2024 •

edited

Loading

carmocca Mar 25, 2024

rasbt Mar 25, 2024

pattplatt Mar 25, 2024

awaelchli Mar 26, 2024

rasbt Mar 26, 2024

Improved Lora finetuning script #1179

Are you sure you want to change the base?

Improved Lora finetuning script #1179

Conversation

pattplatt commented Mar 22, 2024

carmocca Mar 25, 2024

Choose a reason for hiding this comment

pattplatt Mar 25, 2024 • edited Loading

Choose a reason for hiding this comment

carmocca Mar 25, 2024

Choose a reason for hiding this comment

rasbt Mar 25, 2024

Choose a reason for hiding this comment

pattplatt Mar 25, 2024

Choose a reason for hiding this comment

awaelchli Mar 26, 2024

Choose a reason for hiding this comment

rasbt Mar 26, 2024

Choose a reason for hiding this comment

pattplatt Mar 25, 2024 •

edited

Loading