llama.cpp

History

Shanshan Shen 9a4b79bcfa CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454 ) * improve inferencing performance for ascend npu. Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com> * some modification after review * some modifications after review * restore some modifications * restore some modifications --------- Co-authored-by: shanshan shen <shanshanshen333@gmail.com> Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com>		2024-11-26 18:08:37 +08:00
..
kernels	CANN: Support Ascend310P to accelerate F32 and F16 Model (#10216 )	2024-11-22 14:07:20 +08:00
.clang-format	[CANN] Add Ascend NPU backend (#6035 )	2024-07-17 14:23:50 +03:00
acl_tensor.cpp	cann: support q4_0 model (#8822 )	2024-08-05 12:22:30 +08:00
acl_tensor.h	cann: support q4_0 model (#8822 )	2024-08-05 12:22:30 +08:00
aclnn_ops.cpp	CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454 )	2024-11-26 18:08:37 +08:00
aclnn_ops.h	[CANN] Add Ascend NPU backend (#6035 )	2024-07-17 14:23:50 +03:00
CMakeLists.txt	ggml : add support for dynamic loading of backends (#10469 )	2024-11-25 15:13:39 +01:00
common.h	CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454 )	2024-11-26 18:08:37 +08:00
Doxyfile	cann : fix doxy (ggml/0)	2024-09-08 11:05:55 +03:00
ggml-cann.cpp	CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454 )	2024-11-26 18:08:37 +08:00