From cbb31807a38485a27aba73657ec01b358e491b34 Mon Sep 17 00:00:00 2001
From: John <78893154+cmp-nct@users.noreply.github.com>
Date: Sat, 17 Jun 2023 21:34:24 +0200
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index e122703e5..4e8867a1d 100644
--- a/README.md
+++ b/README.md
@@ -1,7 +1,7 @@
 llama.cpp modification to run Falcon (work in progress)
 
 Status/Bugs:
-* Quantization works except for Q_K_ types
+* Quantization with QK_ types appears to fail on 7B models (Q_ works on both, QK_ works on 40B)
 * CUDA not yet functional
 * python conversion script is very basic (produces ggml v0)
 * On Linux, a Q5_1 7B user reports a batch token ingestion context memory issue; with -b 1 it's gone. Not reproduced on Windows