Merge branch 'master' into compilade/convert-hf-refactor

2024-04-30 14:08:05 -04:00 · 0d720acb91
commit 0d720acb91
parent 47e02eb7bc f364eb6fb5
30 changed files with 3316 additions and 739 deletions
--- a/examples/server/tests/features/embeddings.feature
+++ b/examples/server/tests/features/embeddings.feature
@@ -5,7 +5,7 @@ Feature: llama.cpp server
  Background: Server startup
    Given a server listening on localhost:8080
    And   a model url https://huggingface.co/ggml-org/models/resolve/main/bert-bge-small/ggml-model-f16.gguf
-    And   a model file ggml-model-f16.gguf
+    And   a model file bert-bge-small.gguf
    And   a model alias bert-bge-small
    And   42 as server seed
    And   2 slots