From f425240e1dc40db18cfffbabbbe0756761518807 Mon Sep 17 00:00:00 2001 From: Pierrick HYMBERT Date: Sat, 9 Mar 2024 01:23:52 +0100 Subject: [PATCH] server: bench: fix doc --- examples/server/bench/README.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/examples/server/bench/README.md b/examples/server/bench/README.md index 0c8f6b516..481dc5c4d 100644 --- a/examples/server/bench/README.md +++ b/examples/server/bench/README.md @@ -53,7 +53,8 @@ The benchmark values can be overridden with: - `SERVER_BENCH_URL` server url prefix for chat completions, default `http://localhost:8080/v1` - `SERVER_BENCH_N_PROMPTS` total prompts to randomly select in the benchmark, default `480` - `SERVER_BENCH_MODEL_ALIAS` model alias to pass in the completion request, default `my-model` -- `SERVER_BENCH_MAX_TOKENS` max tokens to predict, default: `1024` +- `SERVER_BENCH_MAX_TOKENS` max tokens to predict, default: `512` +- `SERVER_BENCH_DATASET` path to the benchmark dataset file Or with [k6 options](https://k6.io/docs/using-k6/k6-options/reference/):