Finetuning #255

AkshitaB · 2022-04-04T06:14:25Z

Changes proposed in this pull request:

Adds the transformers::finetune step, which mostly mimics TorchTrainStep, with additions for tokenizing the data and updating the model embeddings. It also provides model-specific defaults for the data collator.
RunGeneration now accepts the trained model object as input.
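
To make the tokenizing and collating part more concrete, here is a minimal sketch of one common way to prepare causal-LM batches so that the loss is computed only on target tokens. It is not the code from this PR: `build_causal_lm_example` and `pad_batch` are hypothetical helpers, and the only convention assumed is that a label of -100 is ignored by PyTorch's cross-entropy loss by default.

```python
from typing import Dict, List

# Assumption: -100 is the default ignore_index of PyTorch's CrossEntropyLoss,
# so positions labeled with it do not contribute to the loss.
IGNORE_INDEX = -100


def build_causal_lm_example(source_ids: List[int], target_ids: List[int]) -> Dict[str, List[int]]:
    """Concatenate source and target tokens; only target positions keep labels."""
    input_ids = list(source_ids) + list(target_ids)
    labels = [IGNORE_INDEX] * len(source_ids) + list(target_ids)
    return {"input_ids": input_ids, "labels": labels}


def pad_batch(examples: List[Dict[str, List[int]]], pad_token_id: int) -> Dict[str, List[List[int]]]:
    """Right-pad every example to the longest one and mask the padding."""
    max_len = max(len(e["input_ids"]) for e in examples)
    batch: Dict[str, List[List[int]]] = {"input_ids": [], "attention_mask": [], "labels": []}
    for e in examples:
        length = len(e["input_ids"])
        n_pad = max_len - length
        batch["input_ids"].append(e["input_ids"] + [pad_token_id] * n_pad)
        batch["attention_mask"].append([1] * length + [0] * n_pad)
        # Padding positions are also excluded from the loss.
        batch["labels"].append(e["labels"] + [IGNORE_INDEX] * n_pad)
    return batch
```

A sequence-to-sequence model would typically keep source and target in separate fields instead of concatenating them, which is one reason a collator default might depend on the model type.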

Before submitting

I've read and followed all steps in the Making a pull request section of the CONTRIBUTING docs.
I've updated or added any relevant docstrings following the syntax described in the Writing docstrings section of the CONTRIBUTING docs.
If this PR fixes a bug, I've added a test that will fail without my fix.
If this PR adds a new feature, I've added tests that sufficiently cover my new functionality.

After submitting

All GitHub Actions jobs for my pull request have passed.

dirkgr · 2022-04-12T18:35:04Z

examples/finetune/snli_steps.py

+
+
+@Step.register("subset-data")
+class SubsetData(Step):


We have the DatasetRemix step for Tango's DatasetDict. Can we have the same for HF's datasets?

AkshitaB · 2022-04-15

Will work on it separately: #268. This is technically unrelated to finetuning.
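
For readers unfamiliar with what a subsetting step does, the following is a rough, library-free sketch of the idea. `subset_splits` is a hypothetical helper, not the `SubsetData` step from this PR and not Tango's DatasetRemix; it simply keeps at most `max_samples` rows from every split, optionally after a seeded shuffle.

```python
import random
from typing import Dict, List, Optional, Sequence, TypeVar

T = TypeVar("T")


def subset_splits(
    splits: Dict[str, Sequence[T]],
    max_samples: int,
    seed: Optional[int] = None,
) -> Dict[str, List[T]]:
    """Keep at most `max_samples` rows per split (hypothetical helper)."""
    if max_samples < 0:
        raise ValueError("max_samples must be non-negative")
    result: Dict[str, List[T]] = {}
    for name, rows in splits.items():
        indices = list(range(len(rows)))
        if seed is not None:
            # A fresh seeded generator per split keeps the selection reproducible.
            random.Random(seed).shuffle(indices)
        result[name] = [rows[i] for i in indices[:max_samples]]
    return result
```

With Hugging Face datasets, the same list of indices could be passed to `Dataset.select` rather than indexing rows by hand.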

dirkgr · 2022-04-12T19:01:28Z

tango/integrations/transformers/finetune.py

+
+    def run(  # type: ignore[override]
+        self,
+        model: Lazy[Model],


If I want to do some sort of curriculum learning, can I pass in the output of another training step here?

AkshitaB · 2022-04-15

This can be done once we fix this: #269.

AkshitaB · 2022-04-15

And this: #270.
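
The question in this thread is whether a step that takes a lazily constructed model could also take a model produced by an earlier training step. The sketch below illustrates that general idea in plain Python. `LazyLike`, `resolve_model`, and `finetune` are hypothetical stand-ins, not Tango's `Lazy` or the step added in this PR, and the sketch makes no claim about what #269 or #270 change.

```python
from typing import Callable, Dict, Generic, List, TypeVar, Union

M = TypeVar("M")


class LazyLike(Generic[M]):
    """Hypothetical stand-in for a lazily constructed object."""

    def __init__(self, factory: Callable[[], M]) -> None:
        self._factory = factory

    def construct(self) -> M:
        return self._factory()


def resolve_model(model: Union[LazyLike[M], M]) -> M:
    """Build the model if it is lazy; otherwise use the given instance."""
    if isinstance(model, LazyLike):
        return model.construct()
    return model


def finetune(model: Union[LazyLike[Dict[str, int]], Dict[str, int]], data: List[float]) -> Dict[str, int]:
    """Toy 'training' stage: copies the weights and counts the examples seen."""
    weights = dict(resolve_model(model))  # copy so earlier stages stay unchanged
    weights["seen"] = weights.get("seen", 0) + len(data)
    return weights


# Curriculum: stage 2 starts from the output of stage 1.
stage1 = finetune(LazyLike(lambda: {"seen": 0}), [0.1, 0.2, 0.3])
stage2 = finetune(stage1, [0.4, 0.5])
```

Accepting either form at the step boundary is one possible design; another would be to wrap the earlier step's output in a lazy object before passing it on.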

AkshitaB added 11 commits March 24, 2022 01:09

temp commit

d72d4ac

move_to_device should work for UserDict too

3f08572

works

9a07a1b

clean up

f7fc850

run generation with model

77ade5f

causal lm

71c485c

Merge branch 'main' into finetuning

747893f

change label

ded17f4

Merge branch 'main' into finetuning

613a744

single step finetune

87ff48e

docstrings, tests, cleanup

ff1e6a3

AkshitaB marked this pull request as draft April 4, 2022 06:14

AkshitaB added 5 commits April 3, 2022 23:14

Merge branch 'main' into finetuning

5bb6a72

fix bug with num tokens

bfa8b24

update changelog

dbfe36e

fix test

0fe64a8

test with different model

b04c8ff

AkshitaB requested review from epwalsh and dirkgr April 4, 2022 21:58

AkshitaB added 2 commits April 4, 2022 15:19

simplify

2ace306

limit loss calculation to actual labels

da291db

AkshitaB marked this pull request as ready for review April 12, 2022 18:17

dirkgr requested changes Apr 12, 2022

AkshitaB and others added 3 commits April 14, 2022 23:22

address comments

c247c48

Merge branch 'main' into finetuning

92d84c7

Merge branch 'main' into finetuning

d9033b7

dirkgr approved these changes Apr 19, 2022

dirkgr enabled auto-merge (squash) April 19, 2022 22:16

dirkgr merged commit 1083049 into main Apr 19, 2022

dirkgr deleted the finetuning branch April 19, 2022 22:23
