Commit graph

4849 commits

Author SHA1 Message Date
ochafik
cad1448ac7 Disable test-chat-handler on win32 like the other grammar-related tests 2025-01-28 14:46:37 +00:00
ochafik
cd63ba435e beef up test-chat-handler w/ delta expectations 2025-01-28 14:40:23 +00:00
ochafik
ba10b47ae5 Add missing link dep for windows build 2025-01-28 10:52:14 +00:00
ochafik
b5a74d1a24 Simplify parser defs (incremental parsing for streaming will need more thinking) 2025-01-28 10:48:11 +00:00
ochafik
ec4aeaf18a Revert "Allow tool use + streaming" (This reverts commit 62717145f7.) 2025-01-28 10:29:17 +00:00
ochafik
62d45a552f Disable slow tests where appropriate, + nits 2025-01-28 09:47:41 +00:00
ochafik
d274ffcc95 build: Add missing optional include for gcc 2025-01-28 09:29:31 +00:00
ochafik
0a51e514f6 Update test-chat-handler.cpp 2025-01-28 09:24:35 +00:00
Olivier Chafik
2f99236f77 Tool-call: do last partial parse upon limit stop 2025-01-28 09:23:19 +00:00
Olivier Chafik
6d5682909f Cleanup dead code in llama_3_1 tool call code 2025-01-28 09:22:26 +00:00
Olivier Chafik
62717145f7 Allow tool use + streaming 2025-01-28 09:22:03 +00:00
ochafik
ef9efc9ed3 Fix Llama 3.1 (incl. constrained builtin tools e.g. <|python_tag|>foo.call(arg=vallue)) 2025-01-28 01:04:06 +00:00
ochafik
2d607f1a68 Update test-chat-handler.cpp 2025-01-27 23:29:28 +00:00
ochafik
b565ab2ab1 comment out broken tests in test_tool_call.py 2025-01-27 23:02:15 +00:00
ochafik
cafea60922 Split e2e test_tool_call from test_chat_completion 2025-01-27 22:46:33 +00:00
ochafik
90effb845f Pass grammar laziness all the way down to sampler (need to print special trigger tokens e.g. for Nemo even w/ tool_choice=required) 2025-01-27 22:46:17 +00:00
ochafik
ad229783c5 updated tool call example to be less ambiguous (deepseek likes to rant about hello world) 2025-01-27 22:44:44 +00:00
ochafik
fa065eb095 Rehabilitate test_format_detection 2025-01-27 20:46:03 +00:00
ochafik
add9124115 fix test-chat-handler grammar tests 2025-01-27 20:13:09 +00:00
ochafik
118f799ae4 DeepSeek-R1: implement grammar constraints 2025-01-27 17:52:46 +00:00
ochafik
92ac336dfa Prepare DeepSeek-R1-Distill-Llama-8B support 2025-01-27 17:26:43 +00:00
ochafik
09971e626c Update test_chat_completion.py 2025-01-27 15:43:03 +00:00
ochafik
67709552ad tool-call: compact json output to cap # tokens generated 2025-01-27 15:42:27 +00:00
ochafik
57f40e366b tool-call: fix lazy grammar & mixed content + tool calls parsing 2025-01-27 15:41:54 +00:00
ochafik
2efa0c27bf tool-call: add weather tool e2e tests 2025-01-27 15:02:09 +00:00
ochafik
15ec01e896 jinja: only add special tokens if template doesn't seem to handle them 2025-01-27 14:28:11 +00:00
ochafik
da606d8d41 tool-call: remove nonsensical code_interpreter code 2025-01-27 14:19:20 +00:00
ochafik
bddc1bebcc tool-call: fix special handling of special trigger tokens (Nemo) 2025-01-27 11:37:41 +00:00
ochafik
ca0c837b6a nits 2025-01-27 01:08:29 +00:00
ochafik
f7078cab36 tool-call: fix functionary v3.1 required test 2025-01-26 23:23:09 +00:00
ochafik
5ec4c5e4d3 reshuffle chat handlers 2025-01-26 21:38:07 +00:00
ochafik
43385b2ff2 sync: minja 2025-01-26 21:36:25 +00:00
ochafik
11594557e3 Merge branch 'tool-call' into tool-call-handler 2025-01-26 15:32:53 +00:00
ochafik
3f3fc03983 nit: trailing spaces 2025-01-26 15:32:13 +00:00
Olivier Chafik
51b7aab841 Update test_chat_completion.py 2025-01-25 04:57:40 +00:00
Olivier Chafik
a6463c1e35 jinja: don't add bos when jinja enabled 2025-01-25 04:52:42 +00:00
Olivier Chafik
0208b20767 Update test_chat_completion.py 2025-01-25 04:52:03 +00:00
Olivier Chafik
c479d39abd tool-call: allow special tokens that are grammar triggers 2025-01-25 04:51:53 +00:00
Olivier Chafik
36ed106f84 WIP chat handlers 2025-01-24 02:31:37 +00:00
Olivier Chafik
46415d7a51 Fix lazy trigger handling 2025-01-22 19:08:19 +00:00
Olivier Chafik
c2d836f9d0 Update real tool call tests (use less models) 2025-01-22 18:47:32 +00:00
Olivier Chafik
a46de6a03a Add grammar options + rename builder to common_grammar_builder 2025-01-22 18:36:04 +00:00
Olivier Chafik
cdfa8b9d4f Update chat-template.hpp 2025-01-22 18:35:24 +00:00
Olivier Chafik
5e358ade59 fix msg init warning 2025-01-22 18:35:20 +00:00
Olivier Chafik
f0231a586e fix common_chat_msg invocations 2025-01-22 16:25:51 +00:00
Olivier Chafik
d186721e41 Merge remote-tracking branch 'origin/master' into tool-call 2025-01-22 16:22:16 +00:00
Olivier Chafik
c64d2becb1 minja: sync at 0f5f7f2b37 (#11352) 2025-01-22 16:16:27 +00:00
Olivier Chafik
9ccc62b3c9 Sync minja after https://github.com/google/minja/pull/29 2025-01-22 14:32:18 +00:00
Jiří Podivín
96f4053934 Adding logprobs to /v1/completions (#11344) (Signed-off-by: Jiri Podivin <jpodivin@redhat.com>) 2025-01-22 12:51:32 +01:00
Olivier Chafik
30d33d9f68 Update test_chat_completion.py 2025-01-22 11:42:36 +00:00