### 🐛 Describe the bug illegal memory of FusedLinearCrossEntropy ### Reproduce _No response_ ### Versions triton 3.5.0 and PyTorch 2.9