server: tests: check for infinite loops

This commit is contained in:
Pierrick HYMBERT 2024-02-20 22:45:30 +01:00
parent c355f76427
commit 367b59a15c

View file

@ -54,6 +54,10 @@ Feature: llama.cpp server
It was her greeting to Prince Vassily, a man high in rank
and office, who was the first to arrive at her soirée.
"""
And a prompt:
"""
Write another very long music lyrics.
"""
Given concurrent completion requests
Then the server is busy
And all slots are busy
@ -65,7 +69,7 @@ Feature: llama.cpp server
Scenario: Multi users OAI Compatibility
Given a system prompt "You are an AI assistant."
And a model tinyllama-2
And 1024 max tokens to predict
And 512 max tokens to predict
And streaming is enabled
Given a prompt:
"""
@ -77,7 +81,7 @@ Feature: llama.cpp server
"""
And a prompt:
"""
Write yet another very long music lyrics.
I believe the meaning of life is
"""
Given concurrent OAI completions requests
Then the server is busy
@ -85,3 +89,4 @@ Feature: llama.cpp server
Then the server is idle
And all slots are idle
Then all prompts are predicted