ochafik
|
ef9efc9ed3
|
Fix Llama 3.1 (incl. constrained builtin tools e.g. <|python_tag|>foo.call(arg=vallue) )
|
2025-01-28 01:04:06 +00:00 |
|
ochafik
|
2d607f1a68
|
Update test-chat-handler.cpp
|
2025-01-27 23:29:28 +00:00 |
|
ochafik
|
b565ab2ab1
|
comment out broken tests in test_tool_call.py
|
2025-01-27 23:02:15 +00:00 |
|
ochafik
|
cafea60922
|
Split e2e test_tool_call from test_chat_completion
|
2025-01-27 22:46:33 +00:00 |
|
ochafik
|
90effb845f
|
Pass grammar laziness all the way down to sampler (need to print special trigger tokens e.g. for Nemo even w/ tool_choice=required)
|
2025-01-27 22:46:17 +00:00 |
|
ochafik
|
ad229783c5
|
updated tool call example to be less ambiguous (deepseek likes to rant about hello world)
|
2025-01-27 22:44:44 +00:00 |
|
ochafik
|
fa065eb095
|
Rehabilitate test_format_detection
|
2025-01-27 20:46:03 +00:00 |
|
ochafik
|
add9124115
|
fix test-chat-handler grammar tests
|
2025-01-27 20:13:09 +00:00 |
|
ochafik
|
118f799ae4
|
DeepSeek-R1: implement grammar constraints
|
2025-01-27 17:52:46 +00:00 |
|
ochafik
|
92ac336dfa
|
Prepare DeepSeek-R1-Distill-Llama-8B support
|
2025-01-27 17:26:43 +00:00 |
|
ochafik
|
09971e626c
|
Update test_chat_completion.py
|
2025-01-27 15:43:03 +00:00 |
|
ochafik
|
67709552ad
|
tool-call: compact json output to cap # tokens generated
|
2025-01-27 15:42:27 +00:00 |
|
ochafik
|
57f40e366b
|
tool-call: fix lazy grammar & mixed content + tool calls parsing
|
2025-01-27 15:41:54 +00:00 |
|
ochafik
|
2efa0c27bf
|
tool-call: add weather tool e2e tests
|
2025-01-27 15:02:09 +00:00 |
|
ochafik
|
15ec01e896
|
jinja: only add special tokens if template doesn't seem to handle them
|
2025-01-27 14:28:11 +00:00 |
|
ochafik
|
da606d8d41
|
tool-call: remove nonsensical code_interpreter code
|
2025-01-27 14:19:20 +00:00 |
|
ochafik
|
bddc1bebcc
|
tool-call: fix special handling of special trigger tokens (Nemo)
|
2025-01-27 11:37:41 +00:00 |
|
ochafik
|
ca0c837b6a
|
nits
|
2025-01-27 01:08:29 +00:00 |
|
ochafik
|
f7078cab36
|
tool-call: fix functionary v3.1 required test
|
2025-01-26 23:23:09 +00:00 |
|
ochafik
|
5ec4c5e4d3
|
reshuffle chat handlers
|
2025-01-26 21:38:07 +00:00 |
|
ochafik
|
43385b2ff2
|
sync: minja
|
2025-01-26 21:36:25 +00:00 |
|
ochafik
|
11594557e3
|
Merge branch 'tool-call' into tool-call-handler
|
2025-01-26 15:32:53 +00:00 |
|
ochafik
|
3f3fc03983
|
nit: trailing spaces
|
2025-01-26 15:32:13 +00:00 |
|
Olivier Chafik
|
51b7aab841
|
Update test_chat_completion.py
|
2025-01-25 04:57:40 +00:00 |
|
Olivier Chafik
|
a6463c1e35
|
jinja: don't add bos when jinja enabled
|
2025-01-25 04:52:42 +00:00 |
|
Olivier Chafik
|
0208b20767
|
Update test_chat_completion.py
|
2025-01-25 04:52:03 +00:00 |
|
Olivier Chafik
|
c479d39abd
|
tool-call: allow special tokens that are grammar triggers
|
2025-01-25 04:51:53 +00:00 |
|
Olivier Chafik
|
36ed106f84
|
WIP chat handlers
|
2025-01-24 02:31:37 +00:00 |
|
Olivier Chafik
|
46415d7a51
|
Fix lazy trigger handling
|
2025-01-22 19:08:19 +00:00 |
|
Olivier Chafik
|
c2d836f9d0
|
Update real tool call tests (use less models)
|
2025-01-22 18:47:32 +00:00 |
|
Olivier Chafik
|
a46de6a03a
|
Add grammar options + rename builder to common_grammar_builder
|
2025-01-22 18:36:04 +00:00 |
|
Olivier Chafik
|
cdfa8b9d4f
|
Update chat-template.hpp
|
2025-01-22 18:35:24 +00:00 |
|
Olivier Chafik
|
5e358ade59
|
fix msg init warning
|
2025-01-22 18:35:20 +00:00 |
|
Olivier Chafik
|
f0231a586e
|
fix common_chat_msg invocations
|
2025-01-22 16:25:51 +00:00 |
|
Olivier Chafik
|
d186721e41
|
Merge remote-tracking branch 'origin/master' into tool-call
|
2025-01-22 16:22:16 +00:00 |
|
Olivier Chafik
|
c64d2becb1
|
minja : sync at 0f5f7f2b37 (#11352)
|
2025-01-22 16:16:27 +00:00 |
|
Olivier Chafik
|
9ccc62b3c9
|
Sync minja after https://github.com/google/minja/pull/29
|
2025-01-22 14:32:18 +00:00 |
|
Jiří Podivín
|
96f4053934
|
Adding logprobs to /v1/completions (#11344)
Signed-off-by: Jiri Podivin <jpodivin@redhat.com>
|
2025-01-22 12:51:32 +01:00 |
|
Olivier Chafik
|
30d33d9f68
|
Update test_chat_completion.py
|
2025-01-22 11:42:36 +00:00 |
|
Olivier Chafik
|
c6a22edc57
|
Greedy sampling in tool call tests
|
2025-01-22 11:41:43 +00:00 |
|
Olivier Chafik
|
cce1166b37
|
Update tool-call.cpp
|
2025-01-22 11:25:26 +00:00 |
|
Olivier Chafik
|
a4226365bf
|
nits
|
2025-01-22 11:23:37 +00:00 |
|
Olivier Chafik
|
63387c6dca
|
smaller diff
|
2025-01-22 11:14:25 +00:00 |
|
Olivier Chafik
|
82b6e9a5c3
|
merge common_tool_calls into common_chat_msg
|
2025-01-22 11:05:05 +00:00 |
|
Olivier Chafik
|
01b345be0f
|
Merge remote-tracking branch 'origin/master' into tool-call
|
2025-01-22 10:02:23 +00:00 |
|
Olivier Chafik
|
a94f3b2727
|
common : utils to split / join / repeat strings (from json converter) (#11342)
* Factor string_join, string_split, string_repeat into common
* json: refactor to surface a versatile builder
* Update common.cpp
|
2025-01-22 09:51:44 +00:00 |
|
tc-mb
|
3e3357fd77
|
llava : support Minicpm-omni (#11289)
* init
* add readme
* update readme
* no use make
* update readme
* update fix code
* fix editorconfig-checker
* no change convert py
* use clip_image_u8_free
|
2025-01-22 09:35:48 +02:00 |
|
Olivier Chafik
|
2dd09c792f
|
more cleanups
|
2025-01-22 03:20:47 +00:00 |
|
Olivier Chafik
|
28cac497a6
|
drop llama_sampler_accept_str
|
2025-01-22 02:38:04 +00:00 |
|
Olivier Chafik
|
e211629b89
|
Merge branch 'string_utils' into tool-call
|
2025-01-22 02:27:10 +00:00 |
|