Update README.md
This commit is contained in:
parent
2b188521a1
commit
42f307ef6a
1 changed files with 2 additions and 2 deletions
|
@ -4,11 +4,11 @@ A hacky little script from Concedo that exposes llama.cpp function bindings, all
|
|||
|
||||
It's not very usable as there is a fundamental flaw with llama.cpp, which causes generation delay to scale linearly with original prompt length. Nobody knows why or really cares much, so I'm just going to publish whatever I have at this point.
|
||||
|
||||
If you care, please contribute to [this discussion](https://github.com/ggerganov/llama.cpp/discussions/229) which, if resolved, will actually make this viable.
|
||||
If you care, **please contribute to [this discussion](https://github.com/ggerganov/llama.cpp/discussions/229)** which, if resolved, will actually make this viable.
|
||||
|
||||
## Considerations
|
||||
- Don't want to use pybind11 due to dependencies on MSVCC
|
||||
- ZERO or MINIMAL changes as possible to main.cpp - do not move their function declarations elsewhere!
|
||||
- Leave main.cpp UNTOUCHED, We want to be able to update the repo and pull any changes automatically.
|
||||
- No dynamic memory allocation! Setup structs with FIXED (known) shapes and sizes for ALL output fields. Python will ALWAYS provide the memory, we just write to it.
|
||||
- No external libraries or dependencies. That means no Flask, Pybind and whatever. All You Need Is Python.
|
||||
- No external libraries or dependencies. That means no Flask, Pybind and whatever. All You Need Is Python.
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue