From 858d5469c84c625a35aad30fc1dc84742ae07638 Mon Sep 17 00:00:00 2001 From: slaren Date: Sat, 23 Sep 2023 21:47:42 +0200 Subject: [PATCH] minor edit --- examples/llama-bench/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/examples/llama-bench/README.md b/examples/llama-bench/README.md index e8e24954a..d02824bfa 100644 --- a/examples/llama-bench/README.md +++ b/examples/llama-bench/README.md @@ -45,7 +45,7 @@ llama-bench can perform two types of tests: - Prompt processing (pp): processing a prompt in batches (`-p`) - Text generation (tg): generating a sequence of tokens (`-n`) -With the exception of `-r`, `-o` and `-v`, all options can be specified multiple times to run multiple tests. Each pp and tg test is run with all combinations of the specified options. To specify multiple values for an option, the values can be separated by commas (eg. `-n 16,32`), or the option can be specified multiple times (eg. `-n 16 -n 32`). +With the exception of `-r`, `-o` and `-v`, all options can be specified multiple times to run multiple tests. Each pp and tg test is run with all combinations of the specified options. To specify multiple values for an option, the values can be separated by commas (e.g. `-n 16,32`), or the option can be specified multiple times (e.g. `-n 16 -n 32`). Each test is repeated the number of times given by `-r`, and the results are averaged. The results are given in average tokens per second (t/s) and standard deviation. Some output formats (e.g. json) also include the individual results of each repetition.