server, tests : bump batch to fit 1 embedding prompt

This commit is contained in:
Georgi Gerganov 2024-03-06 21:28:10 +02:00
parent 79ef3c0585
commit 36e12f8fd3
No known key found for this signature in database
GPG key ID: 449E073F9DC10735

View file

@ -8,7 +8,7 @@ Feature: llama.cpp server
And a model alias bert-bge-small
And 42 as server seed
And 2 slots
And 512 as batch size
And 1024 as batch size
And 1024 KV cache size
And embeddings extraction
Then the server is starting