Currently, the `MultiHeadDotProductAttention` layer's call method signature is `MultiHeadDotProductAttention.__call__(inputs_q, inputs_kv, mask=None, deterministic=None)`. As discussed in #1737, there are cases where passing separate tensors for the keys and values is desirable, which isn't possible with the current API. PR #3379 adds two more arguments, `inputs_k` and `inputs_v`, to the call method and changes the signature to `MultiHeadDotProductAttention.__call__(inputs_q, inputs_k=None, inputs_v=None, *, inputs_kv=None, mask=None, deterministic=None)`. Note that `inputs_kv`, `mask` and `deterministic` are now keyword-only arguments. The new arguments default as follows (see the sketch after the list):
- If `inputs_k` and `inputs_v` are both `None`, they copy the value of `inputs_q` (i.e. self-attention).
- If only `inputs_v` is `None`, it copies the value of `inputs_k` (same behavior as the previous API, i.e. `module.apply(inputs_q=query, inputs_k=key_value, ...)` is equivalent to `module.apply(inputs_q=query, inputs_kv=key_value, ...)`).
- If `inputs_kv` is not `None`, both `inputs_k` and `inputs_v` copy the value of `inputs_kv`.
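As a rough illustration of these defaults, here is a minimal sketch; the shapes, `num_heads=2`, and variable names are placeholders I chose for the example, not values from the original post:

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

# Hypothetical toy inputs; shapes are illustrative only.
query = jnp.ones((1, 4, 8))      # (batch, q_length, features)
key_value = jnp.ones((1, 6, 8))  # (batch, kv_length, features)

attn = nn.MultiHeadDotProductAttention(num_heads=2)
variables = attn.init(jax.random.PRNGKey(0), query, key_value)

# inputs_k and inputs_v omitted -> both default to inputs_q (self-attention).
self_out = attn.apply(variables, query)

# inputs_v omitted -> it defaults to inputs_k (keys and values share one tensor).
shared_kv_out = attn.apply(variables, query, key_value)

# inputs_kv passed -> inputs_k and inputs_v both take its value (deprecated path).
legacy_out = attn.apply(variables, query, inputs_kv=key_value)
```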
Users can still pass `inputs_kv`, but a `DeprecationWarning` will be raised, and `inputs_kv` will be removed in the future.
Since self-attention can be expressed with this new API, the `SelfAttention` layer will also raise a `DeprecationWarning` and will be removed in the future.
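For migrating away from the deprecated layer, a minimal sketch of the swap (assuming a Flax version where `SelfAttention` is still available; shapes and `num_heads` are made up for illustration):

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

x = jnp.ones((1, 4, 8))  # hypothetical (batch, length, features)

# Deprecated path: the dedicated SelfAttention layer.
old_layer = nn.SelfAttention(num_heads=2)
old_vars = old_layer.init(jax.random.PRNGKey(0), x)
old_out = old_layer.apply(old_vars, x)

# Replacement: MultiHeadDotProductAttention called with only inputs_q;
# inputs_k and inputs_v default to inputs_q, i.e. self-attention.
new_layer = nn.MultiHeadDotProductAttention(num_heads=2)
new_vars = new_layer.init(jax.random.PRNGKey(0), x)
new_out = new_layer.apply(new_vars, x)
```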
Some examples of porting your code over to the new method signature:
| Old call | New call |
| --- | --- |
| `module.apply(query, key_value, mask, deterministic)` | `module.apply(query, key_value, mask=mask, deterministic=deterministic)` |
| `module.apply(inputs_q=query, inputs_kv=key_value, mask=mask, deterministic=deterministic)` | `module.apply(inputs_q=query, inputs_k=key_value, mask=mask, deterministic=deterministic)` |
| `sa_module.apply(query, mask, deterministic)` | `module.apply(query, mask=mask, deterministic=deterministic)` |
| `sa_module.apply(inputs_q=query, mask=mask, deterministic=deterministic)` | `module.apply(inputs_q=query, mask=mask, deterministic=deterministic)` |

For additional context, check out PR #3379 and the discussion thread #1737.
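The calls in the table above elide the variables collection that `module.apply` takes as its first argument. Below is a self-contained sketch of the main new capability, passing separate key and value tensors; the shapes and `num_heads` are made up for illustration:

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

# Hypothetical shapes: (batch, length, features).
query = jnp.ones((2, 4, 16))
key = jnp.ones((2, 6, 16))
value = jnp.ones((2, 6, 16))

attn = nn.MultiHeadDotProductAttention(num_heads=4)
variables = attn.init(jax.random.PRNGKey(0), query, key, value)

# Keys and values now come from separate tensors, which the old
# inputs_kv-only API could not express.
out = attn.apply(variables, query, key, value)
print(out.shape)  # (2, 4, 16)
```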