abhilash1910
|
d6fc1a0309
|
fix mac build
|
2024-01-23 07:19:30 -08:00 |
|
Abhilash Majumder
|
5f83a12382
|
fix blas matmul function
|
2024-01-23 17:56:37 +05:30 |
|
jianyuzh
|
b42a32d31a
|
replace tab by space
|
2024-01-23 20:20:16 +08:00 |
|
jianyuzh
|
756c4accaf
|
skip build sycl tool for other code path
|
2024-01-23 20:06:20 +08:00 |
|
Abhilash Majumder
|
88f64b7d3d
|
Remove unused headers
|
2024-01-23 17:34:57 +05:30 |
|
Abhilash Majumder
|
d097e2a4ef
|
editor format fix
|
2024-01-23 17:32:42 +05:30 |
|
Abhilash Majumder
|
be31379ef8
|
format fixes
|
2024-01-23 14:47:30 +05:30 |
|
Abhilash Majumder
|
bd716b2594
|
format fixes
|
2024-01-23 14:45:36 +05:30 |
|
abhilash1910
|
1ddaf44c30
|
editor config format
|
2024-01-23 01:03:34 -08:00 |
|
jianyuzh
|
97cbe18dd2
|
rename macro to intel hardware
|
2024-01-23 14:35:33 +08:00 |
|
jianyuzh
|
27c08c0429
|
Merge branch 'sycl' of https://github.com/abhilash1910/llama.cpp into sycl
|
2024-01-23 14:16:24 +08:00 |
|
jianyuzh
|
a0a1304b0c
|
add build&run script, clean CMakefile, update guide by review comments
|
2024-01-23 14:16:01 +08:00 |
|
Meng, Hengyu
|
b403784228
|
remove extra endif
|
2024-01-23 06:09:19 +00:00 |
|
abhilash1910
|
dd7f1396f9
|
cleanup 1
|
2024-01-22 21:37:16 -08:00 |
|
jianyuzh
|
533c647d0e
|
check for sycl blas, better performance
|
2024-01-23 13:34:05 +08:00 |
|
Meng, Hengyu
|
67e6b3cb7d
|
align pr4766
|
2024-01-23 03:32:09 +00:00 |
|
luoyu-intel
|
f008cc7b68
|
enable SYCL_F16 support
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
f396a3b65e
|
add know issue for pvc hang issue
|
2024-01-23 02:38:44 +00:00 |
|
luoyu-intel
|
623d8031cb
|
fix code err
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
e3481faa2f
|
rm original sycl code before refactor
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
ae941b1b57
|
add syc and link for sycl readme
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
35a0daaaa1
|
restore rm code to fix hang issue
|
2024-01-23 02:38:44 +00:00 |
|
luoyu-intel
|
d5f7d364f6
|
remove sycl version from include path
|
2024-01-23 02:38:44 +00:00 |
|
luoyu-intel
|
57e9fbadb2
|
fix return type
|
2024-01-23 02:38:44 +00:00 |
|
Neo Zhang Jianyu
|
593ce001e2
|
Update README_sycl.md
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
d80dd65f42
|
dos2unix
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
09b5619df4
|
rm rear space
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
7350fd48ef
|
add ls-sycl-device, rm unused files
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
0d6e7219b6
|
add ls-sycl-device tool
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
79d30d7713
|
add run script, comment debug code
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
a8936f4902
|
set nthread=1 when sycl, increase performance
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
95daece908
|
fix build with sycl
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
ca2cb6982a
|
update readme, refactor build script
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
c3c5b20ac5
|
mv dpct definition from folder dpct to ggml-sycl.h
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
c67c2ab228
|
refactor device log
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
a47f5ec42e
|
summary dpct definition in one header file to replace folder:dpct
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
5b5389941e
|
fix error: wrong result in 658746bb26702e50f2c59c0e4ada8e9da6010481
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
bd38129aeb
|
add print tensor function to debug
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
3645f25d74
|
correct queue: rm dtct:get_queue
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
fa3a58605b
|
clear CMAKE to rm unused lib and options
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
c709c3cb37
|
ren ggml-sycl.hpp -> ggml-sycl.h
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
69d76c8b58
|
fix error of select non-zero device, format device list
|
2024-01-23 02:38:44 +00:00 |
|
jianyuzh
|
c2ef7a9cb9
|
step 8, rename all macro & func from cuda by sycl
|
2024-01-23 02:38:42 +00:00 |
|
jianyuzh
|
3b1a743e82
|
step7 add debug for code path, rm log
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
65f895d41b
|
support main device is non-zero
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
3a9d2c54ba
|
step6, enhance error check, remove CUDA macro, enhance device id to fix none-zero id issue
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
6dd32789b4
|
step 5 format device and print
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
da752edaf5
|
add GGML_LIST_DEVICE function
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
43f2c35859
|
step3 add fp16, slower 31->28
|
2024-01-23 02:15:32 +00:00 |
|
jianyuzh
|
02dffb68b8
|
step 2
|
2024-01-23 02:15:32 +00:00 |
|