llama.cpp/ggml/src/ggml-sycl/gla.hpp at f446c2cf6a56a750b67c967505e717a996d2f2fd - vbatts/llama.cpp - Git - Batts Cloud

vbatts/llama.cpp

Akarshan Biswas f446c2cf6a

SYCL: Add gated linear attention kernel (#11175 )

* SYCL: Add Gated Linear attention kernel

* glahpp: add a space at the end of file

* gla: Put the barrier inside the main logic loop

2025-01-15 11:20:17 +08:00

8 lines

195 B

C++

Raw Blame History

 #ifndef GGML_SYCL_GLA_HPP
 #define GGML_SYCL_GLA_HPP
 #include "common.hpp"
 void ggml_sycl_op_gated_linear_attn(ggml_backend_sycl_context & ctx, ggml_tensor * dst);
 #endif  // GGML_SYCL_GLA_HPP