server: tests: embeddings, use different KV Cache size

This commit is contained in:
Pierrick HYMBERT 2024-03-06 21:03:26 +01:00
parent 36e12f8fd3
commit ec6ba3bff1

View file

@ -9,7 +9,7 @@ Feature: llama.cpp server
And 42 as server seed
And 2 slots
And 1024 as batch size
And 1024 KV cache size
And 4096 KV cache size
And embeddings extraction
Then the server is starting
Then the server is healthy