readme : add note that Q4 and Q5 have been changed
parent ca7f069f39
commit b58b1f4bf6
1 changed files with 1 additions and 1 deletions
@@ -9,8 +9,8 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
**Hot topics:**
- Quantization formats `Q4` and `Q5` have changed - requantize any old models [(info)](https://github.com/ggerganov/llama.cpp/pull/1405)
- [Roadmap May 2023](https://github.com/ggerganov/llama.cpp/discussions/1220)
- [New quantization methods](https://github.com/ggerganov/llama.cpp#quantization)
<details>
<summary>Table of Contents</summary>