From f616b38b6bc0e169b0bb79508d69ac697eb7e7d9 Mon Sep 17 00:00:00 2001
From: Pierrick HYMBERT
Date: Fri, 22 Mar 2024 12:12:13 +0100
Subject: [PATCH] docs: add model shard in hot topic

---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index c2f3342f0..a4990e5ad 100644
--- a/README.md
+++ b/README.md
@@ -21,6 +21,7 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
 - Looking for contributions to add Deepseek support: https://github.com/ggerganov/llama.cpp/issues/5981
 - Quantization blind testing: https://github.com/ggerganov/llama.cpp/discussions/5962
 - Initial Mamba support has been added: https://github.com/ggerganov/llama.cpp/pull/5328
+- Support loading sharded model, using `gguf-split` CLI https://github.com/ggerganov/llama.cpp/pull/6187

 ----