### 🐛 Describe the bug DyT (tolerance too tight) [nvidia ci](https://github.com/linkedin/Liger-Kernel/actions/runs/15644195860/job/44078368232#step:5:1911) multi token attention (only failed on intel ci) [intel ci](https://github.com/linkedin/Liger-Kernel/actions/runs/15644195864/job/44078367392#step:7:1969) ### Reproduce _No response_ ### Versions liger_kernel==https://github.com/linkedin/Liger-Kernel/commit/1f640a505a7c730c404560007edfd393a9e8f747