llama.cpp/ggml/src/ggml-metal
PAB efb6ae9630 feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019)
* implemented argmax kernel

* tpig -> tgpig

* change to strides

* contiguous assertions

* kernel working and tested

* argmax simd parallel implementation

* added 2 new tests for argmax in test-backend-ops

* cosmit

* added 3 tests cases for perf eval

* add test_argmax in make_test_cases_perf

* Update test-backend-ops.cpp

Co-authored-by: Diego Devesa <slarengh@gmail.com>

---------

Co-authored-by: Diego Devesa <slarengh@gmail.com>
2024-12-03 20:04:49 +02:00
..
CMakeLists.txt ggml : add support for dynamic loading of backends (#10469) 2024-11-25 15:13:39 +01:00
ggml-metal-impl.h metal : small-batch mat-mul kernels (#10581) 2024-12-03 11:52:33 +02:00
ggml-metal.m feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019) 2024-12-03 20:04:49 +02:00
ggml-metal.metal feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019) 2024-12-03 20:04:49 +02:00