Commit graph

3874 commits

Author SHA1 Message Date
ochafik
e33b342da7 tool-call: fix passing of tools to template + allow agent to finish 2024-09-27 06:24:22 +01:00
ochafik
f62e688387 tool-call: fix crash / test non-tool call case (added llama_sampler_is_grammar_empty) 2024-09-27 06:04:41 +01:00
ochafik
0abfa36ca7 tool-call: move usage examples to examples/agent 2024-09-27 05:10:30 +01:00
ochafik
6610ecf965 server: rm bad debug code 2024-09-27 04:07:35 +01:00
ochafik
27cd07a056 json: fix grammar conversion typo 2024-09-27 03:57:48 +01:00
ochafik
9295ca95db tool-call: fix agent type lints 2024-09-27 03:53:56 +01:00
ochafik
1e5c0e747e chat-template: fix jinja tests (make safe a passthrough) 2024-09-27 03:50:04 +01:00
ochafik
f9c1743bb5 minja: fix iterables 2024-09-27 03:36:49 +01:00
ochafik
8299fac07c tool-call: adapt very simple agent + docker isolation from https://github.com/ggerganov/llama.cpp/pull/6389 2024-09-26 21:07:46 +01:00
ochafik
10f9fe8d49 tool-call: fix tool call return format 2024-09-26 21:01:04 +01:00
ochafik
c88c932d98 fix gcc error + lint 2024-09-26 19:18:40 +01:00
ochafik
2926089c5d fix lints 2024-09-26 19:06:29 +01:00
ochafik
5840e10069 tool-call: merge & fix jinja template tests into test-chat-template 2024-09-26 19:05:00 +01:00
ochafik
50685f837f minja: add str.title() 2024-09-26 19:03:59 +01:00
ochafik
296331bba3 minja: update chat template goldens w/ llama.3.1 arguments workaround 2024-09-26 18:10:27 +01:00
ochafik
9cfe4d7202 tool-call: refactor llama_chat_template class + use in validate_model_chat_template 2024-09-26 18:06:03 +01:00
ochafik
cf7bece6a7 tool-call: factor chat template away from legacy API 2024-09-26 17:19:29 +01:00
ochafik
d7ec84f78c tool-call: allow <|python_tag|> in functionary-medium-3.1 2024-09-26 06:52:34 +01:00
ochafik
3d2650ce65 fix gcc build 2024-09-26 06:52:34 +01:00
ochafik
749a21c67a gcc appeasement 2024-09-26 06:08:18 +01:00
ochafik
0c870133d8 tool-call: test/fix functionary-medium-v3.1's template (can "look" like llama3.1 template) 2024-09-26 05:56:15 +01:00
ochafik
8e4a9bad8a minja: allow none input to selectattr, and add safe passthrough filter 2024-09-26 05:53:12 +01:00
ochafik
5f5be9cde7 minja: gcc tweaks 2024-09-26 05:06:11 +01:00
ochafik
2eb29bf8b8 tool-call: update chat templates/goldens 2024-09-26 04:00:10 +01:00
ochafik
4cd82d61dd tool-call: fix pyright type errors 2024-09-26 03:59:38 +01:00
ochafik
059babdd9b minja: try to please gcc 2024-09-26 03:58:18 +01:00
ochafik
94377d743c server: catch errors in format_final_response_oaicompat instead of taking server down 2024-09-26 03:42:36 +01:00
ochafik
595e11cb11 tool-call: fix/test functionary v3 2024-09-26 03:42:05 +01:00
ochafik
c124ab48ea minja: add str.endswith 2024-09-26 03:21:23 +01:00
ochafik
76d2938ef8 fix flake8 lints 2024-09-26 02:30:17 +01:00
ochafik
1b6280102b fix editorconfig lints 2024-09-26 02:27:46 +01:00
ochafik
ab25e3fbf9 tool-call: allow empty message content when there's tool_calls in format_chat 2024-09-26 02:19:04 +01:00
ochafik
d928ff4dfd server: catch errors in oaicompat_completion_params_parse instead of taking server down 2024-09-26 02:18:01 +01:00
ochafik
a774093a99 tool-call: add server tests for llama 3.1 2024-09-26 02:17:30 +01:00
ochafik
9e366b3d03 server: fix tailing comma in completions_seed 2024-09-26 02:15:48 +01:00
ochafik
45b243b4a5 minja: fix llama_chat_apply_template + adde use_jinja param to validate_model_chat_template 2024-09-26 02:14:42 +01:00
ochafik
e983c9d0de tool-call: fix llama_chat_apply_template signature / test-chat-template 2024-09-25 22:02:58 +01:00
ochafik
97d0620968 minja: fetch more templates (add models from test-chat-template) 2024-09-25 19:22:43 +01:00
ochafik
d15dcfb09d tool-call: add output example to readme 2024-09-25 19:22:16 +01:00
ochafik
33ea20edd1 Merge remote-tracking branch 'origin/master' into tool-call 2024-09-25 18:58:54 +01:00
ochafik
8f25531c44 tool-call: add basic usage example to server readme 2024-09-25 18:00:31 +01:00
ochafik
4706bdbae1 tool-call: support Functionary v3 vs. v3-llama3.1 variants 2024-09-25 17:33:00 +01:00
Xuan Son Nguyen
ea9c32be71 ci : fix docker build number and tag name (#9638) 2024-09-25 17:26:01 +02:00
* ci : fix docker build number and tag name
* fine-grant permissions
ochafik
41103c0ed6 server: add --chat-template-file 2024-09-25 16:14:46 +01:00
ochafik
e309c6a47f tool-call: integrate minja & tool-call to server when --jinja is set 2024-09-25 16:14:46 +01:00
ochafik
3cfc21ea71 tool-call: basic Functionary 3.2, Llama 3.1, Hermes 2 Pro grammar generators + parsers 2024-09-25 16:14:22 +01:00
ochafik
26c175b416 json: build_grammar helper 2024-09-25 16:14:22 +01:00
ochafik
eaca756ecc minja: minimalist Jinja templating engine for LLM chat templates 2024-09-25 16:14:22 +01:00
ochafik
5b6d5040d5 grammar: trigger words + refactor of antiprompts 2024-09-25 16:14:22 +01:00
Charles Xu
1e43630218 ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217) 2024-09-25 16:12:20 +03:00
* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels
* added fallback mechanism when the offline re-quantized model is not optimized for the underlying target.
* fix for build errors
* remove prints from the low-level code
* Rebase to the latest upstream