
Conversation

zheliuyu (Contributor) commented Oct 13, 2025

What does this PR do?

Avoid installing kernels-community/flash-attn and kernels-community/vllm-flash-attn3 when attn_implementation="flash_attention_2" is specified on NPU, so the kernels library does not attempt unnecessary downloads. The Hub kernel has no CANN build, so the fetch currently fails with a FileNotFoundError (see the "Before" log below).
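
For context, here is a minimal sketch of the kind of device guard this change introduces. It is illustrative only, not the actual diff: should_fetch_hub_flash_attn is a hypothetical helper name, while is_torch_npu_available (from transformers.utils) and kernels.get_kernel are existing APIs.

from transformers.utils import is_torch_npu_available


def should_fetch_hub_flash_attn() -> bool:
    # Hypothetical helper: decide whether to download kernels-community/flash-attn.
    # On NPU there is no CANN build of the Hub kernel (see the "Before" traceback),
    # so skip the download and rely on the native NPU flash-attention path instead.
    return not is_torch_npu_available()


if should_fetch_hub_flash_attn():
    from kernels import get_kernel

    flash_attn = get_kernel("kernels-community/flash-attn")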

Test script

# Repro: load a model with attn_implementation="flash_attention_2" on NPU.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-0.6B",
    device_map="auto",
    torch_dtype="auto",
    attn_implementation="flash_attention_2",
).eval()
print("Operation successful")

Before

`torch_dtype` is deprecated! Use `dtype` instead!
Fetching 0 files: 0it [00:00, ?it/s]
Traceback (most recent call last):
  File "/root/kernels-main/src/kernels/utils.py", line 144, in install_kernel
    return _load_kernel_from_path(repo_path, package_name, variant_locks)
  File "/root/kernels-main/src/kernels/utils.py", line 177, in _load_kernel_from_path
    raise FileNotFoundError(
FileNotFoundError: Kernel at path `/root/.cache/huggingface/hub/models--kernels-community--flash-attn/snapshots/90b3e941627659b28ff001c08b218315e1b7183b` does not have build: torch27-cxx11-cann81-aarch64-linux

After

`torch_dtype` is deprecated! Use `dtype` instead!
Operation successful

zheliuyu (Contributor, Author) commented

@ArthurZucker @MekkCyber This PR is now ready for review.

MekkCyber (Contributor) left a comment


Indeed! Thanks for fixing.
