server, tests : bump batch to fit 1 embedding prompt

This commit is contained in:
Georgi Gerganov 2024-03-06 21:28:10 +02:00
parent 79ef3c0585
commit 36e12f8fd3
No known key found for this signature in database
GPG key ID: 449E073F9DC10735

View file

@ -8,7 +8,7 @@ Feature: llama.cpp server
And a model alias bert-bge-small And a model alias bert-bge-small
And 42 as server seed And 42 as server seed
And 2 slots And 2 slots
And 512 as batch size And 1024 as batch size
And 1024 KV cache size And 1024 KV cache size
And embeddings extraction And embeddings extraction
Then the server is starting Then the server is starting