llama.cpp/ggml/src/ggml-vulkan
Jeff Bolz a91a41364b
vulkan: optimize coopmat2 dequant functions (#10855)
Change the code to do 16-bit loads when possible and extract the appropriate
component late, so the code effectively decodes a pair of elements and
then selects one. This can let the compiler do more commoning (common
subexpression elimination) when neighboring elements are loaded.
2024-12-21 08:04:45 +01:00
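
The idea in the commit message above (load 16 bits, decode both packed elements together, select one at the end) can be sketched in plain C++. This is an illustration only, not the shader code from the commit: the `Block4` layout, the function names, and the fixed offset of 8 are assumptions chosen to resemble a 4-bit quantized block, and the actual shaders are written in GLSL.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 4-bit block (an assumption for this sketch): 32 elements in 16 bytes.
// Element i (0..15) is the low nibble of qs[i]; element i + 16 is the high nibble.
struct Block4 {
    float   scale;
    uint8_t qs[16];
};

// Baseline: one 8-bit load per element, nibble extracted immediately.
inline float dequant_byte_load(const Block4 &b, uint32_t idx) {
    assert(idx < 32);
    const uint32_t byte  = b.qs[idx & 0xF];
    const uint32_t shift = (idx & 0x10) >> 2;            // 0 for low nibble, 4 for high
    const uint32_t q     = (byte >> shift) & 0xF;
    return (float(q) - 8.0f) * b.scale;
}

// Pair variant: fetch the two neighboring bytes as one 16-bit value, decode both
// nibbles with a single shift and mask, and pick the wanted element last.
inline float dequant_pair_load(const Block4 &b, uint32_t idx) {
    assert(idx < 32);
    const uint32_t base = idx & 0xE;                      // even byte index shared by idx and idx ^ 1
    // Models a 16-bit load; assembled from bytes here so the sketch is endian-independent.
    uint32_t pair = uint32_t(b.qs[base]) | (uint32_t(b.qs[base + 1]) << 8);
    const uint32_t shift = (idx & 0x10) >> 2;
    pair = (pair >> shift) & 0x0F0F;                      // both elements decoded at once
    const uint32_t q = (pair >> ((idx & 1) * 8)) & 0xFF;  // late selection of one component
    return (float(q) - 8.0f) * b.scale;
}

// Returns true when both variants agree on every element of the block.
inline bool all_elements_match(const Block4 &b) {
    for (uint32_t i = 0; i < 32; ++i) {
        if (dequant_byte_load(b, i) != dequant_pair_load(b, i)) {
            return false;
        }
    }
    return true;
}
```

For `idx` and `idx ^ 1`, everything in `dequant_pair_load` up to the final selection is identical, which is the kind of shared work a compiler may be able to merge when both neighbors are requested.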
vulkan-shaders vulkan: optimize coopmat2 dequant functions (#10855) 2024-12-21 08:04:45 +01:00
CMakeLists.txt vulkan: disable spirv-opt for coopmat shaders (#10763) 2024-12-10 18:22:20 +01:00
ggml-vulkan.cpp vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809) 2024-12-17 06:52:55 +01:00