ochafik
f62e688387
tool-call: fix crash / test non-tool call case (added llama_sampler_is_grammar_empty)
2024-09-27 06:04:41 +01:00
ochafik
0abfa36ca7
tool-call: move usage examples to examples/agent
2024-09-27 05:10:30 +01:00
ochafik
6610ecf965
server: rm bad debug code
2024-09-27 04:07:35 +01:00
ochafik
27cd07a056
json: fix grammar conversion typo
2024-09-27 03:57:48 +01:00
ochafik
9295ca95db
tool-call: fix agent type lints
2024-09-27 03:53:56 +01:00
ochafik
1e5c0e747e
chat-template: fix jinja tests (make safe a passthrough)
2024-09-27 03:50:04 +01:00
ochafik
f9c1743bb5
minja: fix iterables
2024-09-27 03:36:49 +01:00
ochafik
8299fac07c
tool-call: adapt very simple agent + docker isolation from https://github.com/ggerganov/llama.cpp/pull/6389
2024-09-26 21:07:46 +01:00
ochafik
10f9fe8d49
tool-call: fix tool call return format
2024-09-26 21:01:04 +01:00
ochafik
c88c932d98
fix gcc error + lint
2024-09-26 19:18:40 +01:00
ochafik
2926089c5d
fix lints
2024-09-26 19:06:29 +01:00
ochafik
5840e10069
tool-call: merge & fix jinja template tests into test-chat-template
2024-09-26 19:05:00 +01:00
ochafik
50685f837f
minja: add str.title()
2024-09-26 19:03:59 +01:00
ochafik
296331bba3
minja: update chat template goldens w/ llama.3.1 arguments workaround
2024-09-26 18:10:27 +01:00
ochafik
9cfe4d7202
tool-call: refactor llama_chat_template class + use in validate_model_chat_template
2024-09-26 18:06:03 +01:00
ochafik
cf7bece6a7
tool-call: factor chat template away from legacy API
2024-09-26 17:19:29 +01:00
ochafik
d7ec84f78c
tool-call: allow <|python_tag|> in functionary-medium-3.1
2024-09-26 06:52:34 +01:00
ochafik
3d2650ce65
fix gcc build
2024-09-26 06:52:34 +01:00
ochafik
749a21c67a
gcc appeasement
2024-09-26 06:08:18 +01:00
ochafik
0c870133d8
tool-call: test/fix functionary-medium-v3.1's template (can "look" like llama3.1 template)
2024-09-26 05:56:15 +01:00
ochafik
8e4a9bad8a
minja: allow none input to selectattr, and add safe passthrough filter
2024-09-26 05:53:12 +01:00
ochafik
5f5be9cde7
minja: gcc tweaks
2024-09-26 05:06:11 +01:00
ochafik
2eb29bf8b8
tool-call: update chat templates/goldens
2024-09-26 04:00:10 +01:00
ochafik
4cd82d61dd
tool-call: fix pyright type errors
2024-09-26 03:59:38 +01:00
ochafik
059babdd9b
minja: try to please gcc
2024-09-26 03:58:18 +01:00
ochafik
94377d743c
server: catch errors in format_final_response_oaicompat instead of taking server down
2024-09-26 03:42:36 +01:00
ochafik
595e11cb11
tool-call: fix/test functionary v3
2024-09-26 03:42:05 +01:00
ochafik
c124ab48ea
minja: add str.endswith
2024-09-26 03:21:23 +01:00
ochafik
76d2938ef8
fix flake8 lints
2024-09-26 02:30:17 +01:00
ochafik
1b6280102b
fix editorconfig lints
2024-09-26 02:27:46 +01:00
ochafik
ab25e3fbf9
tool-call: allow empty message content when there's tool_calls in format_chat
2024-09-26 02:19:04 +01:00
ochafik
d928ff4dfd
server: catch errors in oaicompat_completion_params_parse instead of taking server down
2024-09-26 02:18:01 +01:00
ochafik
a774093a99
tool-call: add server tests for llama 3.1
2024-09-26 02:17:30 +01:00
ochafik
9e366b3d03
server: fix trailing comma in completions_seed
2024-09-26 02:15:48 +01:00
ochafik
45b243b4a5
minja: fix llama_chat_apply_template + add use_jinja param to validate_model_chat_template
2024-09-26 02:14:42 +01:00
ochafik
e983c9d0de
tool-call: fix llama_chat_apply_template signature / test-chat-template
2024-09-25 22:02:58 +01:00
ochafik
97d0620968
minja: fetch more templates (add models from test-chat-template)
2024-09-25 19:22:43 +01:00
ochafik
d15dcfb09d
tool-call: add output example to readme
2024-09-25 19:22:16 +01:00
ochafik
33ea20edd1
Merge remote-tracking branch 'origin/master' into tool-call
2024-09-25 18:58:54 +01:00
ochafik
8f25531c44
tool-call: add basic usage example to server readme
2024-09-25 18:00:31 +01:00
ochafik
4706bdbae1
tool-call: support Functionary v3 vs. v3-llama3.1 variants
2024-09-25 17:33:00 +01:00
Xuan Son Nguyen
ea9c32be71
ci : fix docker build number and tag name (#9638)
* ci : fix docker build number and tag name
* fine-grained permissions
2024-09-25 17:26:01 +02:00
ochafik
41103c0ed6
server: add --chat-template-file
2024-09-25 16:14:46 +01:00
ochafik
e309c6a47f
tool-call: integrate minja & tool-call to server when --jinja is set
2024-09-25 16:14:46 +01:00
ochafik
3cfc21ea71
tool-call: basic Functionary 3.2, Llama 3.1, Hermes 2 Pro grammar generators + parsers
2024-09-25 16:14:22 +01:00
ochafik
26c175b416
json: build_grammar helper
2024-09-25 16:14:22 +01:00
ochafik
eaca756ecc
minja: minimalist Jinja templating engine for LLM chat templates
2024-09-25 16:14:22 +01:00
ochafik
5b6d5040d5
grammar: trigger words + refactor of antiprompts
2024-09-25 16:14:22 +01:00
Charles Xu
1e43630218
ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)
* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels
* added fallback mechanism when the offline re-quantized model is not optimized for the underlying target.
* fix for build errors
* remove prints from the low-level code
* Rebase to the latest upstream
2024-09-25 16:12:20 +03:00
Xuan Son Nguyen
afbbfaa537
server : add more env vars, improve gen-docs (#9635)
* server : add more env vars, improve gen-docs
* update server docs
* LLAMA_ARG_NO_CONTEXT_SHIFT
2024-09-25 14:05:13 +02:00