server: tests: fix the multi users infinite loop test

This commit is contained in:
Pierrick HYMBERT 2024-02-21 02:29:50 +01:00
parent 469af4b4ec
commit 2a37bd6b86

View file

@ -98,7 +98,12 @@ Feature: llama.cpp server
""" """
Write a very long poem. Write a very long poem.
""" """
And 1024 max tokens to predict And a prompt:
"""
Write a very long joke.
"""
And 512 max tokens to predict
And a user api key llama.cpp
Given concurrent completion requests Given concurrent completion requests
Then the server is busy Then the server is busy
And all slots are busy And all slots are busy