server: tests: fix the multi users infinite loop test

Pierrick HYMBERT 2024-02-21 02:29:50 +01:00
parent 469af4b4ec
commit 2a37bd6b86

@@ -98,7 +98,12 @@ Feature: llama.cpp server
"""
Write a very long poem.
"""
And 1024 max tokens to predict
And a prompt:
"""
Write a very long joke.
"""
And 512 max tokens to predict
And a user api key llama.cpp
Given concurrent completion requests
Then the server is busy
And all slots are busy