Does flax have an implementation of a full decoder-only model? #3702
Replies: 3 comments
Hi, AFAIK Flax does not provide implementations for ~complex transformer components such as |
@epignatelli In general you can just take one of the model classes from |
Thanks @davisyoshida. The problem with those is that they come with a lot of bells and whistles that, when not needed, make everything very hard to read, maintain, and debug. That's why I was looking for a plain implementation.
As per the title: does Flax have an implementation of a full decoder-only model that is decoupled from its use in NLP?
I mean a generic implementation that can be used, for example, in RL.
Till now, I have found:
but I have not found the implementation of a full transformer.
Can anybody point me to one?