server, tests : bump batch to fit 1 embedding prompt
This commit is contained in:
parent
79ef3c0585
commit
36e12f8fd3
1 changed files with 1 additions and 1 deletions
|
@ -8,7 +8,7 @@ Feature: llama.cpp server
|
|||
And a model alias bert-bge-small
|
||||
And 42 as server seed
|
||||
And 2 slots
|
||||
And 512 as batch size
|
||||
And 1024 as batch size
|
||||
And 1024 KV cache size
|
||||
And embeddings extraction
|
||||
Then the server is starting
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue