Allow Proxy creation without a set TraceCtx #1598

IvanYashchuk · 2024-12-30T09:12:44Z

Creating a TensorProxy is now as simple as creating a tensor in PyTorch:

In [1]: from thunder.core.proxies import TensorProxy

In [2]: from thunder.core.dtypes import float32

In [3]: from thunder.core.devices import cpu

In [4]: TensorProxy(shape=(1,), device=cpu, dtype=float32)
Out[4]: <TensorProxy(name="None", dtype=thunder.dtypes.float32, shape=(1,))>

Base PR: #1597.
Fixes #1593.

mruberry · 2024-12-30T15:29:27Z

I'm surprised this appears to be causing CI failures! I wonder what's going on.

Fixing #1593 sounds great, and this looks like a surgical way to support the creation of tensor proxies outside of a trace. What can we actually do with these tensor proxies — just call meta functions directly on them? That alone sounds interesting.

Could we make the creation of traceless tensor proxies more explicit, at least for now, for example by requiring a special function be called? The function could pass a new optional kwarg to the existing functions (like Proxy creation and name origination), like traceless=True, which they could query to change their behavior. The error when creating a proxy outside a trace without setting traceless could be more explicit then, too, preventing accidents.

In the future (or this PR), we could also imagine that name generation is still automatic when traceless=True is set. We could have a traceless state that keeps a counter, for example, and returns names like t0, t1, ...

t-vi · 2024-12-30T16:21:58Z

This is exactly the opposite direction of where we should move. Traces need to own proxies more, not less.
We have had a ton of misery from name clashes in 2024 and we don't need to increase that going forward.

mruberry · 2024-12-30T16:24:44Z

> This is exactly the opposite direction of where we should move. Traces need to own proxies more, not less. We have had a ton of misery from name clashes in 2024 and we don't need to increase that going forward.

I don't think these ideas have to be in conflict. The traceless proxies couldn't be used in any trace (because they're not owned by any trace), so they shouldn't interoperate with proxies owned by a trace.

IvanYashchuk · 2024-12-31T08:22:30Z

> I'm surprised this appears to be causing CI failures! I wonder what's going on.

All the failures came from the base branch (#1596) and are now resolved. They occurred because jit_ext.py assumed that a new name would be generated whenever the requested name was already in use (fixed now).
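
To make the two naming behaviours concrete, here is a minimal sketch of a name registry that either silently derives a fresh name on a clash or refuses the request. It is a toy model only: NameRegistry and its strict flag are hypothetical, the error in strict mode is an assumption about what "not generating a new name" might look like, and none of this reflects the actual code in jit_ext.py or the base branch.

```python
class NameRegistry:
    """Toy registry of used names with two clash policies."""

    def __init__(self, *, strict: bool) -> None:
        self.strict = strict
        self._used = set()

    def claim(self, requested: str) -> str:
        """Return the name actually assigned for a requested name."""
        if requested not in self._used:
            self._used.add(requested)
            return requested
        if self.strict:
            # Strict policy: a clash is an error the caller must handle.
            raise ValueError(f"name {requested!r} is already in use")
        # Lenient policy: derive a fresh name by appending a numeric suffix.
        suffix = 1
        while f"{requested}_{suffix}" in self._used:
            suffix += 1
        fresh = f"{requested}_{suffix}"
        self._used.add(fresh)
        return fresh
```

Code written against the lenient policy can request the same name twice without noticing; under the strict policy the same call sequence raises, which is the kind of hidden assumption described above.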

IvanYashchuk · 2024-12-31T08:59:52Z

> What can we actually do with these tensor proxies: just call meta functions directly on them?

Yes, call meta functions, but only simple ones that do not require an active trace (ones that do not call other symbols). It's useful for development, exploration, and debugging. There are fewer concepts to explain, and fewer for a new developer to keep in mind, when introducing the internals of Thunder.
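
As a rough picture of what "calling a simple meta function" means, the sketch below computes only the metadata of an elementwise add (shape, dtype, device) without touching any data or any trace. ToyTensorProxy, broadcast_shapes, and add_meta are hypothetical names for this illustration and are not Thunder's classes or functions.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ToyTensorProxy:
    """Metadata-only stand-in for a tensor: no storage, no owning trace."""

    shape: Tuple[int, ...]
    dtype: str
    device: str


def broadcast_shapes(a: Tuple[int, ...], b: Tuple[int, ...]) -> Tuple[int, ...]:
    """Broadcast two shapes, aligning dimensions from the right."""
    result = []
    for i in range(1, max(len(a), len(b)) + 1):
        da = a[-i] if i <= len(a) else 1  # missing leading dims behave like 1
        db = b[-i] if i <= len(b) else 1
        if da != db and da != 1 and db != 1:
            raise ValueError(f"shapes {a} and {b} are not broadcastable")
        result.append(db if da == 1 else da)
    return tuple(reversed(result))


def add_meta(a: ToyTensorProxy, b: ToyTensorProxy) -> ToyTensorProxy:
    """Meta function for elementwise add: validates inputs, returns output metadata."""
    if a.dtype != b.dtype:
        raise TypeError(f"dtype mismatch: {a.dtype} vs {b.dtype}")
    if a.device != b.device:
        raise ValueError(f"device mismatch: {a.device} vs {b.device}")
    return ToyTensorProxy(broadcast_shapes(a.shape, b.shape), a.dtype, a.device)
```

Because add_meta calls no other symbols and records nothing, it needs no active trace, which is the restriction mentioned above.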

> In the future (or this PR), we could also imagine that name generation is still automatic when traceless=True is set. We could have a traceless state that keeps a counter, for example, and returns names like t0, t1, ...

Yes, we used to have a global counter in the early versions. Actual names are needed when a bound symbol is constructed to generate a Python line of code.

> Could we make the creation of traceless tensor proxies more explicit, at least for now, for example by requiring a special function be called? The function could pass a new optional kwarg to the existing functions (like Proxy creation and name origination), like traceless=True, which they could query to change their behavior. The error when creating a proxy outside a trace without setting traceless could be more explicit then, too, preventing accidents.

Sure, we can do that. My intention is to decrease cognitive friction for someone coming from other array frameworks. Setting traceless=True means someone needs to understand (and we need to explain) what a "traceful" proxy is.

t-vi · 2025-01-07T08:50:32Z

Can we make this a utility function "traceless_tensorproxy" or so rather than baking it into the proxy main path, please?

I would prefer that proxies which will subsequently be added to a trace cannot be created outside the trace context by accident.
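
One possible reading of this request, sketched as a toy model: the main construction path always requires a trace, a separate utility function is the only way to get an unowned proxy, and a trace refuses proxies it does not own. Only the name traceless_tensorproxy comes from the comment above; ToyTrace, ToyProxy, and make_proxy are hypothetical and do not correspond to Thunder's implementation.

```python
class ToyTrace:
    """Toy trace that only accepts proxies it owns."""

    def __init__(self) -> None:
        self.proxies = []

    def add(self, proxy: "ToyProxy") -> None:
        if proxy.owner is not self:
            raise ValueError("proxy is not owned by this trace")
        self.proxies.append(proxy)


class ToyProxy:
    def __init__(self, shape, owner) -> None:
        self.shape = tuple(shape)
        self.owner = owner  # the owning ToyTrace, or None for a traceless proxy


def make_proxy(shape, trace: ToyTrace) -> ToyProxy:
    """Main path: a trace is mandatory, so nothing is created unowned by accident."""
    if trace is None:
        raise RuntimeError("a trace is required; use traceless_tensorproxy for exploration")
    proxy = ToyProxy(shape, owner=trace)
    trace.add(proxy)
    return proxy


def traceless_tensorproxy(shape) -> ToyProxy:
    """Explicit opt-in utility: returns a proxy that no trace owns."""
    return ToyProxy(shape, owner=None)
```

Keeping the opt-in in a separately named function leaves the main path unchanged, and the ownership check means a traceless proxy cannot later slip into a trace.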

Allow Proxy creation without a set TraceCtx

369e8ad

IvanYashchuk added the tracing architecture label Dec 30, 2024

IvanYashchuk added 2 commits December 30, 2024 11:36

Merge branch 'proxy-update2' into proxy-update3

cea37a3

Merge branch 'proxy-update2' into proxy-update3

7b0b34c

Merge branch 'proxy-update2' into proxy-update3

8e436cf

Merge branch 'proxy-update2' into proxy-update3

609b2d4
