OpenCL Token Generation Acceleration (#1459)

* Move back to C++ for OpenCL
* Refactor OpenCL code to work more like the CUDA code, add missing functions
* Deduplicate dequant kernels
* Add OpenCL compile options
* Use compile args for preprocessing constants
* Restore default platform + device selection by id behavior

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Co-authored-by: Henri Vasserman <henv@hot.ee>
2023-05-22 23:33:24 +02:00
commit 2e6cd4b025
parent 7e4ea5beff
8 changed files with 1113 additions and 536 deletions
--- a/ggml.h
+++ b/ggml.h
@@ -249,6 +249,7 @@ extern "C" {
    enum ggml_backend {
        GGML_BACKEND_CPU = 0,
        GGML_BACKEND_CUDA = 1,
+        GGML_BACKEND_CL = 2,
    };

    // model file types