Open
Description
I see the two functions appear in a lot of places in the code base. Shall we unify them into a single place?
And can we treat eager_attention_forward
as another option in ALL_ATTENTION_FUNCTIONS
? Any concerns?
Metadata
Metadata
Assignees
Labels
No labels