server: tests: embeddings, use different KV Cache size

This commit is contained in:
Pierrick HYMBERT 2024-03-06 21:03:26 +01:00
parent 36e12f8fd3
commit ec6ba3bff1

View file

@ -9,7 +9,7 @@ Feature: llama.cpp server
And 42 as server seed And 42 as server seed
And 2 slots And 2 slots
And 1024 as batch size And 1024 as batch size
And 1024 KV cache size And 4096 KV cache size
And embeddings extraction And embeddings extraction
Then the server is starting Then the server is starting
Then the server is healthy Then the server is healthy