ggml : quantization refactoring (#3833)

* ggml : factor all quantization code in ggml-quants

ggml-ci

* ggml-quants : fix Zig and Swift builds + quantize tool

ggml-ci

* quantize : --pure option for disabling k-quant mixtures

---------

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>
This commit is contained in:
Georgi Gerganov 2023-10-29 18:32:28 +02:00 committed by GitHub
parent ff3bad83e2
commit d69d777c02
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
11 changed files with 2372 additions and 2385 deletions

7280
ggml-quants.c Normal file

File diff suppressed because it is too large Load diff