Vulkan Mixture of Experts (MoE) support (#7628)
* Finish Vulkan mul_mat_id implementation
* Add Vulkan sum_rows and div ops
* Fix MUL_MAT_ID matrix matrix shader
* Fix MUL_MAT_ID matrix vector shader dispatch size
* Fix MUL_MAT_ID matrix vector shader and dispatch code
* Update Vulkan CPU offload for MUL_MAT_ID
* Fix crash when using split mode none and setting a main GPU
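
For readers unfamiliar with the operations named in the commit message, the following is a minimal CPU-side sketch of what an indexed matrix multiplication (MUL_MAT_ID) computes in a Mixture of Experts layer: each token is multiplied only by the expert matrices selected for it. It is not the Vulkan shader from this commit, and it simplifies the tensor layout. The names `mul_mat_id_ref` and `normalize_weights` are hypothetical. The second helper shows one plausible use of a row sum followed by a division, namely normalizing the selected experts' routing weights so they sum to 1. That pairing is an assumption about why sum_rows and div matter for MoE, not something the commit message states.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical reference, NOT the shader code from this commit.
// experts: one row-major (rows x cols) weight matrix per expert.
// x:       input vector of length cols for a single token.
// ids:     indices of the experts selected for this token.
// Returns one output vector of length rows per selected expert.
std::vector<std::vector<float>> mul_mat_id_ref(
        const std::vector<std::vector<float>>& experts,
        std::size_t rows, std::size_t cols,
        const std::vector<float>& x,
        const std::vector<int>& ids) {
    std::vector<std::vector<float>> out;
    out.reserve(ids.size());
    for (int id : ids) {
        // Pick the weight matrix by expert index, then do a plain mat-vec.
        const std::vector<float>& w = experts[static_cast<std::size_t>(id)];
        std::vector<float> y(rows, 0.0f);
        for (std::size_t r = 0; r < rows; ++r) {
            for (std::size_t c = 0; c < cols; ++c) {
                y[r] += w[r * cols + c] * x[c];
            }
        }
        out.push_back(std::move(y));
    }
    return out;
}

// Assumed use of "sum_rows" followed by "div": scale one row of routing
// weights so that its entries sum to 1.
std::vector<float> normalize_weights(const std::vector<float>& w) {
    float sum = 0.0f;
    for (float v : w) sum += v;          // sum over the row
    std::vector<float> out(w.size());
    for (std::size_t i = 0; i < w.size(); ++i) {
        out[i] = w[i] / sum;             // element-wise division by the sum
    }
    return out;
}
```

In a full MoE layer, the per-expert outputs would then be scaled by the normalized weights and summed per token. The sketch stops before that step to keep the indexed multiplication itself in focus.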
parent a10cda58d3
commit 3d7ebf6312
5 changed files with 73389 additions and 13839 deletions
85978
ggml-vulkan-shaders.hpp
File diff suppressed because it is too large