llama.cpp

Author	SHA1	Message	Date
abhilash1910	1ddaf44c30	editor config format	2024-01-23 01:03:34 -08:00
Xuan Son Nguyen	2bed4aa3f3	devops : add intel oneapi dockerfile (#5068 ) Co-authored-by: Xuan Son Nguyen <xuanson.nguyen@snowpack.eu>	2024-01-23 09:11:39 +02:00
Michael Coppola	125d03a503	llama.vim : added api key support (#5090 ) Co-authored-by: Michael Coppola <info@michaeljcoppola.com>	2024-01-23 08:51:27 +02:00
jianyuzh	97cbe18dd2	rename macro to intel hardware	2024-01-23 14:35:33 +08:00
jianyuzh	27c08c0429	Merge branch 'sycl' of https://github.com/abhilash1910/llama.cpp into sycl	2024-01-23 14:16:24 +08:00
jianyuzh	a0a1304b0c	add build&run script, clean CMakefile, update guide by review comments	2024-01-23 14:16:01 +08:00
Meng, Hengyu	b403784228	remove extra endif	2024-01-23 06:09:19 +00:00
abhilash1910	dd7f1396f9	cleanup 1	2024-01-22 21:37:16 -08:00
jianyuzh	533c647d0e	check for sycl blas, better performance	2024-01-23 13:34:05 +08:00
Meng, Hengyu	67e6b3cb7d	align pr4766	2024-01-23 03:32:09 +00:00
luoyu-intel	f008cc7b68	enable SYCL_F16 support	2024-01-23 02:38:44 +00:00
jianyuzh	f396a3b65e	add know issue for pvc hang issue	2024-01-23 02:38:44 +00:00
luoyu-intel	623d8031cb	fix code err	2024-01-23 02:38:44 +00:00
jianyuzh	e3481faa2f	rm original sycl code before refactor	2024-01-23 02:38:44 +00:00
jianyuzh	ae941b1b57	add syc and link for sycl readme	2024-01-23 02:38:44 +00:00
jianyuzh	35a0daaaa1	restore rm code to fix hang issue	2024-01-23 02:38:44 +00:00
luoyu-intel	d5f7d364f6	remove sycl version from include path	2024-01-23 02:38:44 +00:00
luoyu-intel	57e9fbadb2	fix return type	2024-01-23 02:38:44 +00:00
Neo Zhang Jianyu	593ce001e2	Update README_sycl.md	2024-01-23 02:38:44 +00:00
jianyuzh	d80dd65f42	dos2unix	2024-01-23 02:38:44 +00:00
jianyuzh	09b5619df4	rm rear space	2024-01-23 02:38:44 +00:00
jianyuzh	7350fd48ef	add ls-sycl-device, rm unused files	2024-01-23 02:38:44 +00:00
jianyuzh	0d6e7219b6	add ls-sycl-device tool	2024-01-23 02:38:44 +00:00
jianyuzh	79d30d7713	add run script, comment debug code	2024-01-23 02:38:44 +00:00
jianyuzh	a8936f4902	set nthread=1 when sycl, increase performance	2024-01-23 02:38:44 +00:00
jianyuzh	95daece908	fix build with sycl	2024-01-23 02:38:44 +00:00
jianyuzh	ca2cb6982a	update readme, refactor build script	2024-01-23 02:38:44 +00:00
jianyuzh	c3c5b20ac5	mv dpct definition from folder dpct to ggml-sycl.h	2024-01-23 02:38:44 +00:00
jianyuzh	c67c2ab228	refactor device log	2024-01-23 02:38:44 +00:00
jianyuzh	a47f5ec42e	summary dpct definition in one header file to replace folder:dpct	2024-01-23 02:38:44 +00:00
jianyuzh	5b5389941e	fix error: wrong result in 658746bb26702e50f2c59c0e4ada8e9da6010481	2024-01-23 02:38:44 +00:00
jianyuzh	bd38129aeb	add print tensor function to debug	2024-01-23 02:38:44 +00:00
jianyuzh	3645f25d74	correct queue: rm dtct:get_queue	2024-01-23 02:38:44 +00:00
jianyuzh	fa3a58605b	clear CMAKE to rm unused lib and options	2024-01-23 02:38:44 +00:00
jianyuzh	c709c3cb37	ren ggml-sycl.hpp -> ggml-sycl.h	2024-01-23 02:38:44 +00:00
jianyuzh	69d76c8b58	fix error of select non-zero device, format device list	2024-01-23 02:38:44 +00:00
jianyuzh	c2ef7a9cb9	step 8, rename all macro & func from cuda by sycl	2024-01-23 02:38:42 +00:00
jianyuzh	3b1a743e82	step7 add debug for code path, rm log	2024-01-23 02:15:32 +00:00
jianyuzh	65f895d41b	support main device is non-zero	2024-01-23 02:15:32 +00:00
jianyuzh	3a9d2c54ba	step6, enhance error check, remove CUDA macro, enhance device id to fix none-zero id issue	2024-01-23 02:15:32 +00:00
jianyuzh	6dd32789b4	step 5 format device and print	2024-01-23 02:15:32 +00:00
jianyuzh	da752edaf5	add GGML_LIST_DEVICE function	2024-01-23 02:15:32 +00:00
jianyuzh	43f2c35859	step3 add fp16, slower 31->28	2024-01-23 02:15:32 +00:00
jianyuzh	02dffb68b8	step 2	2024-01-23 02:15:32 +00:00
jianyuzh	ff83711055	step 1	2024-01-23 02:15:32 +00:00
jianyuzh	0c00b4f654	add debug functio, commit all help code	2024-01-23 02:15:32 +00:00
jianyuzh	233876936b	update init_cublas	2024-01-23 02:15:32 +00:00
jianyuzh	7a4343df61	first update for migration	2024-01-23 02:15:32 +00:00
slaren	011e8ec577	llama : fix not enough space in buffer with Qwen (#5086 )	2024-01-22 23:42:41 +01:00
Kawrakow	6f9939d119	KL-divergence (#5076 ) * kl-divergence: be able to save all logits to a file * Add ability to compute KL-divergence --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>	2024-01-22 16:10:14 +02:00

1 2 3 4 5 ...

2052 commits