docfix: server readme: quantum models -> quantized models.

2024-07-21 14:18:16 +05:30
commit e18281940e
parent 0ab192f500
1 changed file with 1 addition and 1 deletion
--- a/examples/server/README.md
+++ b/examples/server/README.md
@@ -5,7 +5,7 @@ Fast, lightweight, pure C/C++ HTTP server based on [httplib](https://github.com/
 Set of LLM REST APIs and a simple web front end to interact with llama.cpp.

 **Features:**
- * LLM inference of F16 and quantum models on GPU and CPU
+ * LLM inference of F16 and quantized models on GPU and CPU
 * [OpenAI API](https://github.com/openai/openai-openapi) compatible chat completions and embeddings routes
 * Parallel decoding with multi-user support
 * Continuous batching