ggml : add CLBlast support (#1164)

* Allow use of OpenCL GPU-based BLAS using ClBlast instead of OpenBLAS for context processing

* Improve ClBlast implementation, avoid recreating buffers, remove redundant transfers

* Finish merge of ClBlast support

* Move CLBlast implementation to separate file

Add buffer reuse code (adapted from slaren's cuda implementation)

* Add q4_2 and q4_3 CLBlast support, improve code

* Double CLBlast speed by disabling OpenBLAS thread workaround

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
Co-authored-by: slaren <2141330+slaren@users.noreply.github.com>

* Fix device selection env variable names

* Fix cast in opencl kernels

* Add CLBlast to CMakeLists.txt

* Replace buffer pool with static buffers a, b, qb, c

Fix compile warnings

* Fix typos, use GGML_TYPE defines, improve code

* Improve btype dequant kernel selection code, add error if type is unsupported

* Improve code quality

* Move internal stuff out of header
* Use internal enums instead of CLBlast enums
* Remove leftover C++ includes and defines
* Make event use easier to read

Co-authored-by: Henri Vasserman <henv@hot.ee>

* Use c compiler for opencl files

* Simplify code, fix include

* First check error, then release event

* Make globals static, fix indentation

* Rename dequant kernels file to conform with other file names

* Fix import cl file name

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
Co-authored-by: slaren <2141330+slaren@users.noreply.github.com>
Co-authored-by: Henri Vasserman <henv@hot.ee>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

This commit is contained in:

0cc4m

2023-04-28 16:57:16 +02:00

• committed by

GitHub

parent 78ec543733

commit 7296c961d9

No known key found for this signature in database

GPG key ID: 4AEE18F83AFDEB23

8 changed files with 411 additions and 16 deletions

									
										3

ggml.h
									
										View file
										
				@ -858,10 +858,11 @@ extern "C" {

				    GGML_API int ggml_cpu_has_wasm_simd  (void);

				    GGML_API int ggml_cpu_has_blas       (void);

				    GGML_API int ggml_cpu_has_cublas     (void);

				    GGML_API int ggml_cpu_has_clblast    (void);

				    GGML_API int ggml_cpu_has_gpublas    (void);

				    GGML_API int ggml_cpu_has_sse3       (void);

				    GGML_API int ggml_cpu_has_vsx        (void);

				    //

				    // Internal types and functions exposed for tests and benchmarks

				    //

Rows
Columns

ggml : add CLBlast support (#1164)

3 ggml.h Unescape Escape View file

3

ggml.h

View file