Skip to content

Make k-quants work with tensor dimensions that are not multiple of 256 #1919

Closed
@ikawrakow

Description

@ikawrakow

As discussed in #1602, k-quants do not work for the Falcon-7B model. This is due to the fact that the number of columns in many tensors (4544) is not divisible by 256, which is the super-block size of the k-quants.

It would be useful if k-quants could be adapted to work in such cases.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestmodelModel specific

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions