llama.cpp

vbatts/llama.cpp

Commit graph

Author	SHA1	Message	Date
0cc4m	45c0e2e4c1	Refactor Vulkan backend to allow multiple contexts (#7961) * Refactor Vulkan backend to allow multiple contexts * Fix too many shader groups called validation error in llama3 on AMD and Intel GPUs * Fix Vulkan debug build error	2024-06-23 10:21:25 +02:00
0cc4m	7c7836d9d4	Vulkan Shader Refactor, Memory Debugging Option (#7947) * Refactor shaders, extract GLSL code from ggml_vk_generate_shaders.py into vulkan-shaders directory * Improve debug log code * Add memory debug output option * Fix flake8 * Fix unnecessary high llama-3 VRAM use	2024-06-16 07:17:31 +02:00

2 commits