llama.cpp

Jeff Bolz 466300fe14 vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206) Do masking on whole dwords, fetch all scales at once.	2025-01-16 22:23:49 +01:00
include	CUDA: backwards pass for misc. ops, add tests (#11257)	2025-01-16 16:43:38 +01:00
src	vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206)	2025-01-16 22:23:49 +01:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	fix: ggml: fix vulkan-shaders-gen build (#10448)	2025-01-15 14:17:42 +01:00