* server : simplify state machine for slot
* add SLOT_STATE_DONE_PROMPT
* pop_deferred_task
* add missing notify_one
* fix passkey test
* metrics : add n_busy_slots_per_decode
* fix test step
* add test
* maybe fix AddressSanitizer?
* fix deque ?
* missing lock
* pop_deferred_task: also notify
* Update examples/server/server.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

| Name | | |
|---|---|---|
| steps | ||
| embeddings.feature | ||
| environment.py | ||
| issues.feature | ||
| lora.feature | ||
| parallel.feature | ||
| passkey.feature | ||
| results.feature | ||
| security.feature | ||
| server.feature | ||
| slotsave.feature | ||
| wrong_usages.feature | ||