Update README.md
This commit is contained in:
parent
abc77a7496
commit
6e137abe56
1 changed file with 2 additions and 1 deletion
@@ -2,7 +2,8 @@ llama.cpp modification to run Falcon (work in progress)
Status:
* Quantization works except for Q_K_ types
* CUDA not yet functional
* Context size calculation is not yet correct (CUDA as well as CPU)
Q5 Falcon 40B inference on CPU appears to match A100 fp16 inference speed, at about 2 tokens/second