Introduce llama-run (#10291)
It's like simple-chat but it uses smart pointers to avoid manual memory cleanups. Less memory leaks in the code now. Avoid printing multiple dots. Split code into smaller functions. Uses no exception handling. Signed-off-by: Eric Curtin <ecurtin@redhat.com>
This commit is contained in:
parent
50d5cecbda
commit
0cc63754b8
7 changed files with 458 additions and 2 deletions
7
examples/run/README.md
Normal file
7
examples/run/README.md
Normal file
|
@ -0,0 +1,7 @@
|
|||
# llama.cpp/example/run
|
||||
|
||||
The purpose of this example is to demonstrate a minimal usage of llama.cpp for running models.
|
||||
|
||||
```bash
|
||||
./llama-run Meta-Llama-3.1-8B-Instruct.gguf
|
||||
...
|
Loading…
Add table
Add a link
Reference in a new issue