[Model] Add T5 model (2/2) #11901

NickLucche · 2025-01-09T14:47:11Z

Add support fot T5 (encoder-decoder model).
Follow-up and based on #11334, so it needs this other PR merged before it can be addressed, as it assumes to have a backend that supports passing a custom attention bias in both prefill and decode (xformers+pagedattention as of now).

Some topics I'd like to discuss here:

xFormers is a hard dependency for T5 as of now, do we silently enforce the backend or just raise an error? Surely we need to raise one on platforms xformers does not support (rocm)
T5 uses the same decoder_start_token_id as it does for padding, but there's no explicit BOS. Current logic in preprocess.py would just crash. Is this the best approach to handle the quirk of T5?
change to xformers.py I'd rather have been able to spare this one, but it was assuming alibi_slopes were the only way to have multiple attention biases (one per sequence).

Signed-off-by: NickLucche <[email protected]>

github-actions · 2025-01-09T14:47:23Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

Signed-off-by: NickLucche <[email protected]>

NickLucche · 2025-01-09T16:37:29Z

tests/models/encoder_decoder/language/test_bart.py

@@ -2,170 +2,12 @@

 Run `pytest tests/models/encoder_decoder/language/test_bart.py`.
 """
-from typing import List, Optional, Tuple, Type
-
 import pytest


we can drop this folder altogether, ../language/language was never really great

NickLucche added 15 commits January 9, 2025 14:03

wip

5e59703

Signed-off-by: NickLucche <[email protected]>

add working kernel with padded_max_seq_len as arg

416412d

Signed-off-by: NickLucche <[email protected]>

add attn_bias case to pagedattn tests

1d1f2a0

Signed-off-by: NickLucche <[email protected]>

format

7fb263d

Signed-off-by: NickLucche <[email protected]>

format

ac6bf63

Signed-off-by: NickLucche <[email protected]>

enforce last dim of attn bias to be block aligned

5c47f43

Signed-off-by: NickLucche <[email protected]>

wip

f97939f

Signed-off-by: NickLucche <[email protected]>

wip

f8df36a

Signed-off-by: NickLucche <[email protected]>

first working version :)

0d7b0c5

clean up

43eca38

Signed-off-by: NickLucche <[email protected]>

address missing bos token case

b481f5d

Signed-off-by: NickLucche <[email protected]>

format and clean up

67bdbbc

Signed-off-by: NickLucche <[email protected]>

t5 tests

bd264c7

Signed-off-by: NickLucche <[email protected]>

format

2d5b4fb

Signed-off-by: NickLucche <[email protected]>

sync with custom attn bias pr

dc25e4d

Signed-off-by: NickLucche <[email protected]>

NickLucche requested review from tlrmchlsmth, WoosukKwon, DarkLight1337 and ywang96 as code owners January 9, 2025 14:47

NickLucche added 2 commits January 9, 2025 15:23

remove spurious files

3eae4f6

Signed-off-by: NickLucche <[email protected]>

update to use new attention_type interface

455d0cb

Signed-off-by: NickLucche <[email protected]>

NickLucche commented Jan 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add T5 model (2/2) #11901

[Model] Add T5 model (2/2) #11901

NickLucche commented Jan 9, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Jan 9, 2025

NickLucche Jan 9, 2025

[Model] Add T5 model (2/2) #11901

Are you sure you want to change the base?

[Model] Add T5 model (2/2) #11901

Conversation

NickLucche commented Jan 9, 2025 • edited by github-actions bot Loading

github-actions bot commented Jan 9, 2025

NickLucche Jan 9, 2025

Choose a reason for hiding this comment

NickLucche commented Jan 9, 2025 •

edited by github-actions bot

Loading