gguf : add ftype meta info to the model (#2710)

* llama : add ftype meta info to the model ggml-ci * convert.py : add ftype when converting (does not work) * convert.py : fix Enum to IntEnum ggml-ci
2023-08-22 20:05:59 +03:00 · 2023-08-22 20:05:59 +03:00 · deb7dfca4b
commit deb7dfca4b
parent bac66994cf
4 changed files with 47 additions and 9 deletions
--- a/llama.h
+++ b/llama.h
@ -103,6 +103,8 @@ extern "C" {
        LLAMA_FTYPE_MOSTLY_Q5_K_S        = 16,// except 1d tensors
        LLAMA_FTYPE_MOSTLY_Q5_K_M        = 17,// except 1d tensors
        LLAMA_FTYPE_MOSTLY_Q6_K          = 18,// except 1d tensors
+
+        LLAMA_FTYPE_GUESSED = 1024, // not specified in the model file
    };

    typedef struct llama_token_data {