llama.cpp/ggml
Jeff Bolz 466300fe14
vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206)
Do masking on whole dwords, fetch all scales at once.
2025-01-16 22:23:49 +01:00
..
include CUDA: backwards pass for misc. ops, add tests (#11257) 2025-01-16 16:43:38 +01:00
src vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206) 2025-01-16 22:23:49 +01:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt fix: ggml: fix vulkan-shaders-gen build (#10448) 2025-01-15 14:17:42 +01:00