tool-calls: add deepseek r1 template + accommodate broken official template slightly better

2025-02-03 19:59:33 +00:00
commit 7dc271fb37
parent 0be7f652e9
3 changed files with 102 additions and 22 deletions
--- a/examples/server/README.md
+++ b/examples/server/README.md
@@ -1202,11 +1202,19 @@ curl http://localhost:8080/v1/chat/completions \

  ```shell
  # Native support:
+
  llama-server --jinja -fa -hf bartowski/Qwen2.5-7B-Instruct-GGUF:Q4_K_M
  llama-server --jinja -fa -hf bartowski/Mistral-Nemo-Instruct-2407-GGUF:Q6_K_L
  llama-server --jinja -fa -hf bartowski/functionary-small-v3.2-GGUF:Q4_K_M
  llama-server --jinja -fa -hf bartowski/Llama-3.3-70B-Instruct-GGUF:Q4_K_M
-  llama-server --jinja -fa -hf bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF:Q4_K_M
+
+  # Native support for DeepSeek R1 works best w/ our own template (official template buggy)
+
+  llama-server --jinja -fa -hf bartowski/DeepSeek-R1-Distill-Qwen-7B-GGUF:Q6_K_L \
+    --chat-template-file models/templates/llama-cpp-deepseek-r1.jinja
+
+  llama-server --jinja -fa -hf bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF:Q4_K_M \
+    --chat-template-file models/templates/llama-cpp-deepseek-r1.jinja

  # Native support requires the right template for these GGUFs: