Closed as not planned
Description
I tried adding Flash Attention into qLoRA, I receive the following error:
RuntimeError: FlashAttention only support fp16 and bf16 data type
Is it possible to add support for 4-bit qLoRA?
Metadata
Metadata
Assignees
Labels
No labels