ggml : unit test for quantization functions (#953)

* Unit test for quantization functions Use the ggml_internal_get_quantize_fn function to loop through all quantization formats and run a sanity check on the result. Also add a microbenchmark that times these functions directly without running the rest of the GGML graph. * test-quantize-fns: CI fixes Fix issues uncovered in CI - need to use sizes divisible by 32*8 for loop unrolling - use intrinsic header that should work on Mac * test-quantize: remove Per PR comment, subsumed by test-quantize-fns * test-quantize: fix for q8_0 intermediates
2023-04-22 11:10:39 +02:00 · 2023-04-22 11:10:39 +02:00 · 5f939498d5
commit 5f939498d5
parent 36b4f7e064
4 changed files with 466 additions and 43 deletions
--- a/tests/CMakeLists.txt
+++ b/tests/CMakeLists.txt
@ -6,5 +6,6 @@ function(llama_add_test source)
 endfunction()

 # llama_add_test(test-double-float.c) # SLOW
-llama_add_test(test-quantize.c)
+llama_add_test(test-quantize-fns.cpp)
+llama_add_test(test-quantize-perf.cpp)
 llama_add_test(test-tokenizer-0.cpp ${CMAKE_CURRENT_SOURCE_DIR}/../models/ggml-vocab.bin)