ggml : remove q1_3 and q2_2
* llama : remove the separate scale tensors of BitNet b1.58 They won't be needed, since the remaining ternary quant types have built-in scales.
This commit is contained in:
parent
45719a2472
commit
04eec58112
12 changed files with 45 additions and 693 deletions
|
@ -392,8 +392,6 @@ extern "C" {
|
|||
GGML_TYPE_Q4_0_8_8 = 33,
|
||||
GGML_TYPE_TQ1_0 = 34,
|
||||
GGML_TYPE_TQ2_0 = 35,
|
||||
GGML_TYPE_Q2_2 = 36,
|
||||
GGML_TYPE_Q1_3 = 37,
|
||||
GGML_TYPE_COUNT,
|
||||
};
|
||||
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue