llama.cpp

Author	SHA1	Message	Date
ochafik	a59fde2955	update model template / format mapping	2025-02-08 18:21:29 +00:00
ochafik	c0f972bb45	Use --reasoning-format, remove forced thinking for now	2025-02-08 17:58:33 +00:00
Olivier Chafik	e6d9b52480	align Command R7B w/ --think / reasoning_content behaviour	2025-02-05 15:47:37 +00:00
ochafik	9d7c3cc51b	--think to force any model to return reasoning_content (or just parse <think> for deepseek r1)	2025-02-05 12:16:37 +00:00
Olivier Chafik	933f7a186e	Merge branch 'master' into r1-toolcall	2025-02-04 15:56:25 +00:00
Olivier Chafik	db288b60cb	`tool-call`: command r7b fix for normal responses (#11608 ) * fix command r7b normal response regex + add to server test * test multiline non-tool-call responses in test-chat	2025-02-04 15:48:53 +00:00
Olivier Chafik	39c1d8163b	return thoughts in reasoning_content field	2025-02-04 11:37:09 +00:00
ochafik	438ce0b8a1	fix test-chat	2025-02-04 04:51:36 +00:00
ochafik	86994db697	fix spaces	2025-02-04 03:47:52 +00:00
ochafik	f0154a6479	Fix / test models/templates/llama-cpp-deepseek-r1.jinja	2025-02-04 03:09:15 +00:00
ochafik	a682d1216d	fix / test parsing of r1 parser	2025-02-04 02:23:31 +00:00
ochafik	130ca222c9	DeepSeek R1: parse thoughts / return in separate field in API (non streamed mode)	2025-02-03 02:24:30 +00:00
Olivier Chafik	bfcce4d693	`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) (#11585 ) * `tool-call`: support Command R7B (w/ tool_plan return) * `tool-call`: cleaner preservation of tokens + warn when likely bad chat template override * `tool-call`: test cleanup / handle lazy grammar triggers	2025-02-02 09:25:38 +00:00
Olivier Chafik	8b576b6c55	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 ) --------- Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Xuan Son Nguyen <son@huggingface.co>	2025-01-30 19:13:58 +00:00

14 commits