llama.cpp

arch-btw 61715d5cc8 llama : Add IBM granite template (#10013)	2024-10-28 18:45:33 +01:00

* Add granite template to llama.cpp
* Add granite template to test-chat-template.cpp
* Update src/llama.cpp
* Update tests/test-chat-template.cpp
* Added proper template and expected output
* Small change to \n
* Add code space &
* Fix spacing
* Apply suggestions from code review
* Update src/llama.cpp

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
..
CMakeLists.txt	llama : move vocab, grammar and sampling into separate files (#8508)	2024-07-23 13:10:17 +03:00
llama-grammar.cpp	llama : refactor sampling v2 (#9294)	2024-09-07 15:16:19 +03:00
llama-grammar.h	llama : refactor sampling v2 (#9294)	2024-09-07 15:16:19 +03:00
llama-impl.h	log : add CONT level for continuing previous log entry (#9610)	2024-09-24 10:15:35 +03:00
llama-sampling.cpp	llama : add DRY sampler (#9702)	2024-10-25 19:07:34 +03:00
llama-sampling.h	llama : add DRY sampler (#9702)	2024-10-25 19:07:34 +03:00
llama-vocab.cpp	llama : add DRY sampler (#9702)	2024-10-25 19:07:34 +03:00
llama-vocab.h	llama : add DRY sampler (#9702)	2024-10-25 19:07:34 +03:00
llama.cpp	llama : Add IBM granite template (#10013)	2024-10-28 18:45:33 +01:00
unicode-data.cpp	server : better security control for public deployments (#9776)	2024-10-08 13:27:04 +02:00
unicode-data.h	llama : reduce compile time and binary size (#9712)	2024-10-02 15:49:55 +02:00
unicode.cpp	llama : reduce compile time and binary size (#9712)	2024-10-02 15:49:55 +02:00
unicode.h	llama : move vocab, grammar and sampling into separate files (#8508)	2024-07-23 13:10:17 +03:00