Skip to content

Remove graph breaks for torch.compile() in flash_attention_forward when Lllama Model is padding free tuned#33932

Merged
ArthurZucker merged 38 commits intohuggingface:mainfrom
Abhishek-TAMU:compile_llama
Oct 24, 2024
Merged

Remove graph breaks for torch.compile() in flash_attention_forward when Lllama Model is padding free tuned#33932
ArthurZucker merged 38 commits intohuggingface:mainfrom
Abhishek-TAMU:compile_llama

Commits

Commits on Oct 3, 2024

Commits on Oct 7, 2024

Commits on Oct 8, 2024

Commits on Oct 9, 2024

Commits on Oct 10, 2024

Commits on Oct 11, 2024

Commits on Oct 14, 2024

Commits on Oct 15, 2024

Commits on Oct 16, 2024

Commits on Oct 18, 2024

Commits on Oct 21, 2024

Commits on Oct 22, 2024

Commits on Oct 23, 2024