llama.cpp/ggml/src/ggml-sycl
2025-02-07 09:27:53 +00:00
..
dpct SYCL: Introducing memory host pool (#11251) 2025-01-19 21:33:34 +08:00
backend.hpp SYCL: Add gated linear attention kernel (#11175) 2025-01-15 11:20:17 +08:00
CMakeLists.txt SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (#10584) 2024-12-04 09:29:20 +08:00
common.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
common.hpp SYCL: Introducing memory host pool (#11251) 2025-01-19 21:33:34 +08:00
concat.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
concat.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
conv.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
conv.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
convert.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
convert.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
dequantize.hpp Fixed dequant precision issues in Q4_1 and Q5_1 (#9711) 2024-10-03 07:50:44 +01:00
dmmv.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
dmmv.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
element_wise.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
element_wise.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
gemm.hpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
ggml-sycl.cpp SYCL: remove XMX info from print devices (#11712) 2025-02-07 09:27:53 +00:00
gla.cpp SYCL: Add gated linear attention kernel (#11175) 2025-01-15 11:20:17 +08:00
gla.hpp SYCL: Add gated linear attention kernel (#11175) 2025-01-15 11:20:17 +08:00
im2col.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
im2col.hpp [SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052) 2024-08-20 23:06:51 +08:00
mmq.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
mmq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mmvq.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
mmvq.hpp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
norm.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
norm.hpp [SYCL] Fix the sub group size of Intel (#8106) 2024-07-02 10:16:00 +08:00
outprod.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
outprod.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
presets.hpp Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (#10133) 2024-11-07 15:19:10 +08:00
rope.cpp SYCL: Reduce most of the compiler warnings (#10748) 2024-12-13 12:12:15 +05:30
rope.hpp [SYCL] Update SYCL-Rope op and Refactor (#8157) 2024-07-01 19:39:06 +08:00
softmax.cpp SYCL : SOFTMAX F16 mask support and other fixes (#11261) 2025-01-28 09:56:58 +00:00
softmax.hpp SYCL : SOFTMAX F16 mask support and other fixes (#11261) 2025-01-28 09:56:58 +00:00
tsembd.cpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
tsembd.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00
vecdotq.hpp sycl: Use syclcompat::dp4a (#10267) 2024-11-15 11:09:12 +08:00
wkv6.cpp llama: add support for QRWKV6 model architecture (#11001) 2025-01-10 09:58:08 +08:00
wkv6.hpp SYCL: Refactor ggml_sycl_compute_forward (#11121) 2025-01-10 08:13:03 +08:00