Fix reuse kv cache for torch attention #1539

ShashankMosaicML · 2024-09-21T06:13:37Z

Fixes KV cache reuse for torch attention. Also modifies KV cache reuse tests to also test for torch attention.

ShashankMosaicML added 2 commits September 20, 2024 17:12

..

28e5320

adding test

6f1e38b

ShashankMosaicML marked this pull request as ready for review September 21, 2024 06:49

ShashankMosaicML requested a review from a team as a code owner September 21, 2024 06:49

ShashankMosaicML requested a review from dakinggg September 21, 2024 06:50

adding sequence id based tests

bd40475

dakinggg approved these changes Sep 21, 2024

View reviewed changes

ShashankMosaicML merged commit d7c7822 into mosaicml:main Sep 22, 2024
9 checks passed

ShashankMosaicML deleted the fix_reuse_kv_torch branch September 22, 2024 18:03

Provide feedback