[WIP] Test out thunder.jit w/ NeMo models. #1694

tfogal · 2025-01-24T21:28:11Z

What does this PR do?

Demonstrates how to test thunder.jit with NeMo models.

PR review

This isn't meant to be merged or reviewed. If we actually want to do this, someone would want to extend the testing to test both thunder.jit and thunder.dynamo.thunderfx, not remove the thunderfx testing (as done here).

Did you have fun?

With this group, always :-)

for more information, see https://pre-commit.ci

crcrpar · 2025-01-25T02:21:45Z

thunder/tests/test_networks.py

@@ -445,7 +454,8 @@ def test_hf_for_nemo(model_id):
    # fullgraph=True used to work with transformers 4.45.2, but it doesn't work
    # with 4.46.2 because of re.findall usage in the loss function
    fullgraph = False
-    compiled_model = thunderfx(model, fullgraph=fullgraph)
+    # compiled_model = thunderfx(model, fullgraph=fullgraph)
+    compiled_model = thunder.jit(model, fullgraph=fullgraph)


Wasn't this failing due to unsupported argument as I thought thunder.jit doesn't have fullgraph argument?
Or **compile_options takes keyword argument so there wouldn't be errors for unsupported args?

Suggested change

compiled_model = thunder.jit(model, fullgraph=fullgraph)

compiled_model = thunder.jit(model, fullgraph=fullgraph)

Nope. One can see the logs by clicking through the CI links below:

=========================== short test summary info ============================ FAILED thunder/tests/test_networks.py::test_hf_for_nemo[bigcode/starcoder2-7b] - AssertionError FAILED thunder/tests/test_networks.py::test_hf_for_nemo[microsoft/Phi-3-mini-128k-instruct] - AssertionError: expected tensor with (48,), cuda:0, torch.float32, requires_grad=False, got (1,), cuda:0, torch.bfloat16, False FAILED thunder/tests/test_networks.py::test_thunderfx_mistral_nemo_small - AssertionError ============ 3 failed, 29 passed, 172 warnings in 172.63s (0:02:52) ============

so I guess your theory is correct that we do not error out due to unsupported (kw)args.
I have a vague memory of us discussing doing that though; maybe it's just a warning?

That is one of the things I dislike about the options.

tfogal and others added 2 commits January 24, 2025 13:26

Test out thunder.jit w/ NeMo models.

9799dbe

[pre-commit.ci] auto fixes from pre-commit.com hooks

de69f46

for more information, see https://pre-commit.ci

crcrpar reviewed Jan 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Test out thunder.jit w/ NeMo models. #1694

[WIP] Test out thunder.jit w/ NeMo models. #1694

tfogal commented Jan 24, 2025

crcrpar Jan 25, 2025

tfogal Jan 25, 2025

t-vi Jan 26, 2025

	compiled_model = thunder.jit(model, fullgraph=fullgraph)
	compiled_model = thunder.jit(model, fullgraph=fullgraph)

[WIP] Test out thunder.jit w/ NeMo models. #1694

Are you sure you want to change the base?

[WIP] Test out thunder.jit w/ NeMo models. #1694

Conversation

tfogal commented Jan 24, 2025

What does this PR do?

PR review

Did you have fun?

crcrpar Jan 25, 2025

Choose a reason for hiding this comment

tfogal Jan 25, 2025

Choose a reason for hiding this comment

t-vi Jan 26, 2025

Choose a reason for hiding this comment