
whisper: use vulkan as gpu backend when available #2302


Merged: 2 commits into ggml-org:master on Jul 16, 2024

Conversation

mstephenson6
Contributor

  • When built with Vulkan support, attempt to initialize the ggml Vulkan backend as a GPU backend

@mstephenson6 mstephenson6 changed the title ggml: use vulkan as gpu backend when available whisper: use vulkan as gpu backend when available Jul 14, 2024
@ggerganov
Member

Hi, have you tested that it works correctly when Vulkan is enabled?

@mstephenson6
Contributor Author

@ggerganov - yes, so far just on an Apple M1 Pro running Fedora, with the "Honeykrisp" Vulkan driver.

I'm excited about this working, it's the very first GPU-accelerated ML I've been able to do on Linux + M1.

Happy to test other GPU/OS combos too.

VK_LOG_DEBUG("ggml_vk_init(" << ctx->name << ", " << idx << ")");
ggml_vk_instance_init();
GGML_ASSERT(idx < vk_instance.device_indices.size());
Member


Is this change necessary? The original code works in https://github.com/ggerganov/llama.cpp, not sure why we would need to change it here

Contributor Author


I changed it so ggml_vk_instance_init() would populate device_indices right before the assertion, and it would avoid another vk-specific call in whisper.cpp. Without this, the assertion failed every time, since no other code had attempted to initialize the devices.

I didn't look at how llama.cpp did it, but I can check it out.

Member


I see, seems the change is good. cc @0cc4m to confirm that it's OK to move the assert after the init call

Contributor


Yeah, that's fine. Makes more sense that way, too.

@ggerganov ggerganov merged commit f68298c into ggml-org:master Jul 16, 2024
45 of 46 checks passed
iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024
* ggml: use vulkan as gpu backend when available

Signed-off-by: Matt Stephenson <[email protected]>

* whisper: enable using vk as default buffer type

Signed-off-by: Matt Stephenson <[email protected]>

---------

Signed-off-by: Matt Stephenson <[email protected]>
3 participants