abhilash1910
1ddaf44c30
editor config format
2024-01-23 01:03:34 -08:00
Xuan Son Nguyen
2bed4aa3f3
devops : add intel oneapi dockerfile ( #5068 )
Co-authored-by: Xuan Son Nguyen <xuanson.nguyen@snowpack.eu>
2024-01-23 09:11:39 +02:00
Michael Coppola
125d03a503
llama.vim : added api key support ( #5090 )
Co-authored-by: Michael Coppola <info@michaeljcoppola.com>
2024-01-23 08:51:27 +02:00
jianyuzh
97cbe18dd2
rename macro to intel hardware
2024-01-23 14:35:33 +08:00
jianyuzh
27c08c0429
Merge branch 'sycl' of https://github.com/abhilash1910/llama.cpp into sycl
2024-01-23 14:16:24 +08:00
jianyuzh
a0a1304b0c
add build&run script, clean CMakefile, update guide by review comments
2024-01-23 14:16:01 +08:00
Meng, Hengyu
b403784228
remove extra endif
2024-01-23 06:09:19 +00:00
abhilash1910
dd7f1396f9
cleanup 1
2024-01-22 21:37:16 -08:00
jianyuzh
533c647d0e
check for sycl blas, better performance
2024-01-23 13:34:05 +08:00
Meng, Hengyu
67e6b3cb7d
align pr4766
2024-01-23 03:32:09 +00:00
luoyu-intel
f008cc7b68
enable SYCL_F16 support
2024-01-23 02:38:44 +00:00
jianyuzh
f396a3b65e
add known issue for pvc hang issue
2024-01-23 02:38:44 +00:00
luoyu-intel
623d8031cb
fix code err
2024-01-23 02:38:44 +00:00
jianyuzh
e3481faa2f
rm original sycl code before refactor
2024-01-23 02:38:44 +00:00
jianyuzh
ae941b1b57
add syc and link for sycl readme
2024-01-23 02:38:44 +00:00
jianyuzh
35a0daaaa1
restore rm code to fix hang issue
2024-01-23 02:38:44 +00:00
luoyu-intel
d5f7d364f6
remove sycl version from include path
2024-01-23 02:38:44 +00:00
luoyu-intel
57e9fbadb2
fix return type
2024-01-23 02:38:44 +00:00
Neo Zhang Jianyu
593ce001e2
Update README_sycl.md
2024-01-23 02:38:44 +00:00
jianyuzh
d80dd65f42
dos2unix
2024-01-23 02:38:44 +00:00
jianyuzh
09b5619df4
rm rear space
2024-01-23 02:38:44 +00:00
jianyuzh
7350fd48ef
add ls-sycl-device, rm unused files
2024-01-23 02:38:44 +00:00
jianyuzh
0d6e7219b6
add ls-sycl-device tool
2024-01-23 02:38:44 +00:00
jianyuzh
79d30d7713
add run script, comment debug code
2024-01-23 02:38:44 +00:00
jianyuzh
a8936f4902
set nthread=1 when sycl, increase performance
2024-01-23 02:38:44 +00:00
jianyuzh
95daece908
fix build with sycl
2024-01-23 02:38:44 +00:00
jianyuzh
ca2cb6982a
update readme, refactor build script
2024-01-23 02:38:44 +00:00
jianyuzh
c3c5b20ac5
mv dpct definition from folder dpct to ggml-sycl.h
2024-01-23 02:38:44 +00:00
jianyuzh
c67c2ab228
refactor device log
2024-01-23 02:38:44 +00:00
jianyuzh
a47f5ec42e
summary dpct definition in one header file to replace folder:dpct
2024-01-23 02:38:44 +00:00
jianyuzh
5b5389941e
fix error: wrong result in 658746bb26702e50f2c59c0e4ada8e9da6010481
2024-01-23 02:38:44 +00:00
jianyuzh
bd38129aeb
add print tensor function to debug
2024-01-23 02:38:44 +00:00
jianyuzh
3645f25d74
correct queue: rm dpct:get_queue
2024-01-23 02:38:44 +00:00
jianyuzh
fa3a58605b
clear CMAKE to rm unused lib and options
2024-01-23 02:38:44 +00:00
jianyuzh
c709c3cb37
ren ggml-sycl.hpp -> ggml-sycl.h
2024-01-23 02:38:44 +00:00
jianyuzh
69d76c8b58
fix error of select non-zero device, format device list
2024-01-23 02:38:44 +00:00
jianyuzh
c2ef7a9cb9
step 8, rename all macro & func from cuda by sycl
2024-01-23 02:38:42 +00:00
jianyuzh
3b1a743e82
step7 add debug for code path, rm log
2024-01-23 02:15:32 +00:00
jianyuzh
65f895d41b
support main device is non-zero
2024-01-23 02:15:32 +00:00
jianyuzh
3a9d2c54ba
step6, enhance error check, remove CUDA macro, enhance device id to fix non-zero id issue
2024-01-23 02:15:32 +00:00
jianyuzh
6dd32789b4
step 5 format device and print
2024-01-23 02:15:32 +00:00
jianyuzh
da752edaf5
add GGML_LIST_DEVICE function
2024-01-23 02:15:32 +00:00
jianyuzh
43f2c35859
step3 add fp16, slower 31->28
2024-01-23 02:15:32 +00:00
jianyuzh
02dffb68b8
step 2
2024-01-23 02:15:32 +00:00
jianyuzh
ff83711055
step 1
2024-01-23 02:15:32 +00:00
jianyuzh
0c00b4f654
add debug function, commit all help code
2024-01-23 02:15:32 +00:00
jianyuzh
233876936b
update init_cublas
2024-01-23 02:15:32 +00:00
jianyuzh
7a4343df61
first update for migration
2024-01-23 02:15:32 +00:00
slaren
011e8ec577
llama : fix not enough space in buffer with Qwen ( #5086 )
2024-01-22 23:42:41 +01:00
Kawrakow
6f9939d119
KL-divergence ( #5076 )
* kl-divergence: be able to save all logits to a file
* Add ability to compute KL-divergence
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-01-22 16:10:14 +02:00