llama.cpp

Author	SHA1	Message	Date
HanishKVC	2cbb00c340	SimpCfg: Add support for boolean fields wrt key-value	2024-05-06 11:27:56 +05:30
HanishKVC	f728dbddd0	ChatON: Add simpcfg based config file matching chaton_meta.json Add missing begin and end fields wrt deepseek-coder assistant in chaton_meta.json. Idea is to avoid json library dependency by adding a simple text based config file support.	2024-05-06 11:27:56 +05:30
HanishKVC	aea6850131	SimpCfg: Keep compiler happy, also add newline wrt alt logging def	2024-05-06 11:27:56 +05:30
HanishKVC	f4687fa5d4	SimpCfg:Parse config file and load string key-value fields	2024-05-06 11:27:56 +05:30
HanishKVC	ce75d434dc	SimpCfg: Initial skeleton : get and set string and bool values	2024-05-06 11:27:56 +05:30
HanishKVC	af9a0a211b	ChatON:ChatTmplApply: Avoid the stringstream	2024-05-06 11:27:56 +05:30
HanishKVC	889a45ff28	ChatON:ChatTmplApply:Update the function notes	2024-05-06 11:27:56 +05:30
HanishKVC	ff5f68826b	ChatON:ChatTmplApplySingle: Avoid streamstring, update func notes	2024-05-06 11:27:56 +05:30
HanishKVC	c4e829d492	ChatON:Mistral: Decouple \n from suffix, use wrt sys msg	2024-05-06 11:27:56 +05:30
HanishKVC	a724fd90bd	ChatON:Tests: Add a test templates program for chaton	2024-05-06 11:27:56 +05:30
HanishKVC	32e672c5dd	ChatON: Dont log final tagged message string to screen	2024-05-06 11:27:56 +05:30
HanishKVC	cad50c527e	ChatON: Update the note to match current logic	2024-05-06 11:27:56 +05:30
HanishKVC	55e3d63f13	ChatON:Mistral: Update to match jinja file	2024-05-06 11:27:56 +05:30
HanishKVC	ad5e5216ce	ChatON:Mistral: Add detailed meta json entries	2024-05-06 11:27:56 +05:30
HanishKVC	368fbf17a1	ChatON:ChatML: Update wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	a64dcd7796	ChatON:Zephyr: Update wrt detailed meta json, also update eos Pick eos from zephyr's tokenizer_config, which is different from what was hardcoded in the existing llama_chat_apply_template.	2024-05-06 11:27:56 +05:30
HanishKVC	18cd12524f	ChatON:Monarch:Update wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	006a398ebf	ChatON:DeepSeekCoder: Update tmplid and wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	1b2e921186	ChatON:DeepSeek: Update support wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	403a6c4323	ChatON:Gemma: update for detailed meta json Also as part of same add user role entry for system role also.	2024-05-06 11:27:56 +05:30
HanishKVC	a4b3285034	ChatON:Show Log on screen when template is applied	2024-05-06 11:27:56 +05:30
HanishKVC	d61b071b8d	Chaton:Common:Add missing newline wrt cmdline arg usage	2024-05-06 11:27:56 +05:30
HanishKVC	fee887fe31	ChatON:Common:Update the cmdline argument name used Had forgotten to update it before	2024-05-06 11:27:56 +05:30
HanishKVC	58e1ff16bc	ChatON: switch to ordered_json from json library to be in sync with the json namespace in server.	2024-05-06 11:27:56 +05:30
HanishKVC	a630564c48	ChatON:ChatTemplateApplyCAPI remaining base logic As c doesnt have the concept of pass by reference, and inturn the existing c api uses pointers wrt llama chat message structure, so switching to same wrt chat_tmpl_apply logics. Also fix a oversight in previous commit and add the remaining logic.	2024-05-06 11:27:56 +05:30
HanishKVC	308d3bf3ff	ChatON:WIP:Add c api wrapper for chat_template_apply Initial skeletons Update existing logics to help with same. Also the inbetween helper was having a bad signature wrt returning status and data, thats also fixed.	2024-05-06 11:27:56 +05:30
HanishKVC	e62699f923	ChatON: Add alertAssistantAtEnd flag & logic wrt MultiMsgs Apply While sending the current chat session along with new user query to the model, many models expect that a tag be added at the end to indicate that user is expecting the model to respond, this flags allows for the same.	2024-05-06 11:27:56 +05:30
HanishKVC	ea3a0f19cc	ChatON: Rather check for tmpl existance in single_ex	2024-05-06 11:27:56 +05:30
HanishKVC	01c8db70f7	ChatON+Main: Add C_API wrapper for single Add a c api wrapper for a single message tagging scenario. Inturn to match convention followed by existing chat_apply_template code, make it return the size expected of the tagged message string buffer. Update internal single logic to help with same. Explicitly check if tmpl specified is available in the loaded json or not and then return a error if not found.	2024-05-06 11:27:56 +05:30
HanishKVC	13857f29d6	ChatON+Main: Updates wrt detailed meta json Fix a oversight wrt key name. Add a alert in case if passed meta json file contains begin(BoS) wrt assistant role, similar to check for end (EoS) wrt user role. Bcas normally both (ie EoS wrt User and BoS wrt Assistant) shouldnt be needed. Update main wrt begin & prefix and suffix & end addition.	2024-05-06 11:27:56 +05:30
HanishKVC	b9e31304a5	ChatON: Update to new detailed format wrt llama2 and llama3 Wrt llama2 * add bos wrt llama2 system and user begins, but not assistant * split system suffix into suffix and end, and add systemuser-system flags so that end can be avoided wrt system+user message combo * add eos wrt assistant end * With these potentially this should work with main and server flows Wrt llama3 * add empty begin, end fields and systemuser-system flags * This should potentially work with main and server flows	2024-05-06 11:27:56 +05:30
HanishKVC	bf1167bfdb	ChatON: Backup the current simple meta json file	2024-05-06 11:27:56 +05:30
HanishKVC	0cd7c62706	ChatON: Keep compiler happy Move helpers to the begining, so can avoid adding prototype declerations/function signatures to the begining Get the char * wrt string data in the c++ string.	2024-05-06 11:27:56 +05:30
HanishKVC	6a0214c067	ChatON:MetaOK->MetaDump: Alert if user->end is needed or not Because user messages dont normally need a EoS token.	2024-05-06 11:27:56 +05:30
HanishKVC	344857b6cb	ChatOn:ChatOnTemplateApply: suffix,end flag based control Also fix a oversight wrt begin, when flag based begin adding control was introduced. NOTE: Currently system role suffix/end conditional adding always triggered, if 1st system prompt seen or additional system prompt is seen.	2024-05-06 11:27:56 +05:30
HanishKVC	f8ae21cec7	ChatON:ChatTemplateApplySingle: update begin+prefix, suffix+end	2024-05-06 11:27:56 +05:30
HanishKVC	5d76f08d37	ChatON: Need to explicitly specify string to use c_str	2024-05-06 11:27:56 +05:30
HanishKVC	7ba0144e42	ChatOn:chaton_tmpl_role_kv: try except to ignore missing ifany Cas of above reason, switch to directly accessing the keys in dump helper, which is inturn used by meta_ok check	2024-05-06 11:27:56 +05:30
HanishKVC	adab5775bf	ChatON: more detailed/spreadout json fields	2024-05-06 11:27:56 +05:30
HanishKVC	3f09eb5dea	ChatOn: ChatTemplateApply[Ex] return tagged msgs parts detail Now there is a simple and extended version of returning tagged messages. The extended version returns the tagged string, as well as the details of the parts that make up that tagged message interms of the type of parts and the lengths of the parts.	2024-05-06 11:27:56 +05:30
HanishKVC	825a78abaa	ChatOn: ChatTemplateApplySingle[Ex] return parts detail Now there is a simple and extended version of returning tagged message wrt a single role and its content. The extended version returns the tagged string, as well as the details of the parts that make up that tagged message interms of the type of parts and the lengths of the parts.	2024-05-06 11:27:56 +05:30
HanishKVC	92e780fb1a	ChatON:ChatParts: Allow flexibility for more refined tokenization	2024-05-06 11:27:56 +05:30
HanishKVC	6b23f15ffe	ChatON:ChatOnMetaJSon: Add suffix wrt assistant messages	2024-05-06 11:27:56 +05:30
HanishKVC	d1899728aa	ChatON: Test ChatParts in chat-template-apply	2024-05-06 11:27:56 +05:30
HanishKVC	9de1d6017f	ChatON:ChatParts class initial go Helps keep user prompt and chat-hs-template tag parts seperate, but in sequence	2024-05-06 11:27:56 +05:30
HanishKVC	3064a36e74	ChatON+:Update tmpl_role_kv to retrieve wrt multiple keys Use the same for user role's begin and prefix entries.	2024-05-06 11:27:56 +05:30
HanishKVC	f1f39c5256	ChatON:Add Monarch model template, which uses Begin + Prefix Inturn Begin/BoS is added only for non 1st user messages in a system+user prompts chain.	2024-05-06 11:27:56 +05:30
HanishKVC	724ff38345	ChatOn: Wrap getting begin in try-catch, so that even if a role doesnt contain begin, the logic will work fine.	2024-05-06 11:27:56 +05:30
HanishKVC	d70fca7a45	ChatOn: Add begin to the mix along with prefix Dump shows user->begin. chat-template-apply[-single] updated to work with begin and prefix TODO: need to wrap begin in a try-catch, so that irrespective of role, begin+prefix will work, irrespoective of whether that role has a begin entry or not.	2024-05-06 11:27:56 +05:30
HanishKVC	0f713d4c4f	ChatOn: meta json update wrt the new begin related fields	2024-05-06 11:27:56 +05:30

1 2 3 4 5 ...

2867 commits