ochafik
|
76f6ab19ad
|
Update test_tool_call.py
|
2025-01-29 17:04:30 +00:00 |
|
ochafik
|
41eec4622b
|
rm unused templates, rename one
|
2025-01-29 16:50:54 +00:00 |
|
ochafik
|
40cc3f2fde
|
Merge branch 'tool-call' of github.com:ochafik/llama.cpp into tool-call
|
2025-01-29 16:45:59 +00:00 |
|
Olivier Chafik
|
384f54a135
|
Split bulk of tool call tests to slow lane
|
2025-01-29 16:13:45 +00:00 |
|
Olivier Chafik
|
923c805d04
|
rm dead code + nits
|
2025-01-29 15:57:58 +00:00 |
|
ochafik
|
4a1e8e9f91
|
refactor test-chat-handler
|
2025-01-29 04:00:01 +00:00 |
|
ochafik
|
18d5a1b2ca
|
nits
|
2025-01-29 02:15:34 +00:00 |
|
ochafik
|
47be437356
|
Text fireworks v2 template
|
2025-01-29 01:51:07 +00:00 |
|
ochafik
|
4cdbb8c53f
|
Revert breaking minja change
|
2025-01-29 01:50:49 +00:00 |
|
ochafik
|
64263910d8
|
Fix firefunction w/ jinja: requires two variables, use the chat handlers everywhere templates are used
|
2025-01-29 01:15:44 +00:00 |
|
ochafik
|
d603d067d5
|
sync: minja
|
2025-01-28 23:49:04 +00:00 |
|
ochafik
|
4f257550a2
|
minja: sync on https://github.com/google/minja/pull/33
|
2025-01-28 23:46:51 +00:00 |
|
ochafik
|
cad1448ac7
|
Disable test-chat-handler on win32 like the other grammar-related tests
|
2025-01-28 14:46:37 +00:00 |
|
ochafik
|
cd63ba435e
|
beef up test-chat-handler w/ delta expectations
|
2025-01-28 14:40:23 +00:00 |
|
ochafik
|
ba10b47ae5
|
Add missing link dep for windows build
|
2025-01-28 10:52:14 +00:00 |
|
ochafik
|
b5a74d1a24
|
Simplify parser defs (incremental parsing for streaming will need more thinking)
|
2025-01-28 10:48:11 +00:00 |
|
ochafik
|
ec4aeaf18a
|
Revert "Allow tool use + streaming"
This reverts commit 62717145f7 .
|
2025-01-28 10:29:17 +00:00 |
|
ochafik
|
62d45a552f
|
Disable slow tests where appropriate, + nits
|
2025-01-28 09:47:41 +00:00 |
|
ochafik
|
d274ffcc95
|
build: Add missing optional include for gcc
|
2025-01-28 09:29:31 +00:00 |
|
ochafik
|
0a51e514f6
|
Update test-chat-handler.cpp
|
2025-01-28 09:24:35 +00:00 |
|
Olivier Chafik
|
2f99236f77
|
Tool-call: do last partial parse upon limit stop
|
2025-01-28 09:23:19 +00:00 |
|
Olivier Chafik
|
6d5682909f
|
Cleanup dead code in llama_3_1 tool call code
|
2025-01-28 09:22:26 +00:00 |
|
Olivier Chafik
|
62717145f7
|
Allow tool use + streaming
|
2025-01-28 09:22:03 +00:00 |
|
ochafik
|
ef9efc9ed3
|
Fix Llama 3.1 (incl. constrained builtin tools e.g. <|python_tag|>foo.call(arg=vallue) )
|
2025-01-28 01:04:06 +00:00 |
|
ochafik
|
2d607f1a68
|
Update test-chat-handler.cpp
|
2025-01-27 23:29:28 +00:00 |
|
ochafik
|
b565ab2ab1
|
comment out broken tests in test_tool_call.py
|
2025-01-27 23:02:15 +00:00 |
|
ochafik
|
cafea60922
|
Split e2e test_tool_call from test_chat_completion
|
2025-01-27 22:46:33 +00:00 |
|
ochafik
|
90effb845f
|
Pass grammar laziness all the way down to sampler (need to print special trigger tokens e.g. for Nemo even w/ tool_choice=required)
|
2025-01-27 22:46:17 +00:00 |
|
ochafik
|
ad229783c5
|
updated tool call example to be less ambiguous (deepseek likes to rant about hello world)
|
2025-01-27 22:44:44 +00:00 |
|
ochafik
|
fa065eb095
|
Rehabilitate test_format_detection
|
2025-01-27 20:46:03 +00:00 |
|
ochafik
|
add9124115
|
fix test-chat-handler grammar tests
|
2025-01-27 20:13:09 +00:00 |
|
ochafik
|
118f799ae4
|
DeepSeek-R1: implement grammar constraints
|
2025-01-27 17:52:46 +00:00 |
|
ochafik
|
92ac336dfa
|
Prepare DeepSeek-R1-Distill-Llama-8B support
|
2025-01-27 17:26:43 +00:00 |
|
ochafik
|
09971e626c
|
Update test_chat_completion.py
|
2025-01-27 15:43:03 +00:00 |
|
ochafik
|
67709552ad
|
tool-call: compact json output to cap # tokens generated
|
2025-01-27 15:42:27 +00:00 |
|
ochafik
|
57f40e366b
|
tool-call: fix lazy grammar & mixed content + tool calls parsing
|
2025-01-27 15:41:54 +00:00 |
|
ochafik
|
2efa0c27bf
|
tool-call: add weather tool e2e tests
|
2025-01-27 15:02:09 +00:00 |
|
ochafik
|
15ec01e896
|
jinja: only add special tokens if template doesn't seem to handle them
|
2025-01-27 14:28:11 +00:00 |
|
ochafik
|
da606d8d41
|
tool-call: remove nonsensical code_interpreter code
|
2025-01-27 14:19:20 +00:00 |
|
ochafik
|
bddc1bebcc
|
tool-call: fix special handling of special trigger tokens (Nemo)
|
2025-01-27 11:37:41 +00:00 |
|
ochafik
|
ca0c837b6a
|
nits
|
2025-01-27 01:08:29 +00:00 |
|
ochafik
|
f7078cab36
|
tool-call: fix functionary v3.1 required test
|
2025-01-26 23:23:09 +00:00 |
|
ochafik
|
5ec4c5e4d3
|
reshuffle chat handlers
|
2025-01-26 21:38:07 +00:00 |
|
ochafik
|
43385b2ff2
|
sync: minja
|
2025-01-26 21:36:25 +00:00 |
|
ochafik
|
11594557e3
|
Merge branch 'tool-call' into tool-call-handler
|
2025-01-26 15:32:53 +00:00 |
|
ochafik
|
3f3fc03983
|
nit: trailing spaces
|
2025-01-26 15:32:13 +00:00 |
|
Olivier Chafik
|
51b7aab841
|
Update test_chat_completion.py
|
2025-01-25 04:57:40 +00:00 |
|
Olivier Chafik
|
a6463c1e35
|
jinja: don't add bos when jinja enabled
|
2025-01-25 04:52:42 +00:00 |
|
Olivier Chafik
|
0208b20767
|
Update test_chat_completion.py
|
2025-01-25 04:52:03 +00:00 |
|
Olivier Chafik
|
c479d39abd
|
tool-call: allow special tokens that are grammar triggers
|
2025-01-25 04:51:53 +00:00 |
|