Update server README.md
parent 668750357e
commit 006167dd65
1 changed file with 10 additions and 2 deletions
@@ -762,7 +762,15 @@ Example:
 
 Possible values for `slot[i].state` are:
-- `0`: SLOT_STATE_IDLE
-- `1`: SLOT_STATE_PROCESSING
+- `0`: SLOT_STATE_IDLE
+The slot is idle and ready to use.
+- `1`: SLOT_STATE_PROCESSING_PROMPT
+The slot is processing the input prompt tokens.
+- `2`: SLOT_STATE_DONE_PROMPT
+The slot has finished processing the input prompt. For embedding and rerank tasks the slot will be released soon, otherwise the slot will be used for generation.
+- `3`: SLOT_STATE_GENERATING
+The slot is generating output tokens.
+
+[State diagram](https://github.com/ggerganov/llama.cpp/pull/9283)
 
 ### GET `/metrics`: Prometheus compatible metrics exporter
 
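Reviewer note: a client that reads `slot[i].state` could decode the four values documented in this change roughly as sketched below. This is a minimal sketch, not code from the server: the `SlotState` enum, the `describe_slots` helper, and the sample payload are hypothetical names introduced for illustration, and only the `state` field and its four values come from the README text in the diff.

```python
from enum import IntEnum


class SlotState(IntEnum):
    """Slot states as listed in the updated README text (names shortened)."""
    IDLE = 0               # idle and ready to use
    PROCESSING_PROMPT = 1  # processing the input prompt tokens
    DONE_PROMPT = 2        # prompt done; released soon or used for generation
    GENERATING = 3         # generating output tokens


def describe_slots(slots):
    """Return (index, state name) pairs; unknown values are reported, not dropped."""
    result = []
    for i, slot in enumerate(slots):
        raw = slot["state"]
        try:
            name = SlotState(raw).name
        except ValueError:
            # A value outside 0..3 is not covered by the documented list.
            name = f"UNKNOWN({raw})"
        result.append((i, name))
    return result


# Hypothetical payload: only the `state` field is taken from the README text.
sample = [{"state": 0}, {"state": 3}, {"state": 7}]
print(describe_slots(sample))
# prints [(0, 'IDLE'), (1, 'GENERATING'), (2, 'UNKNOWN(7)')]
```

Reporting unknown values explicitly, rather than raising, lets a monitoring script keep working if the set of states changes again, as it does in this commit.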