llama : add Qwen support (#4281)

* enable qwen to llama.cpp
* llama : do not GPU split bias tensors

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-12-02 02:16:31 +08:00
commit 37c746d687
parent 880f57973b
5 changed files with 372 additions and 9 deletions
--- a/prompts/chat-with-qwen.txt
+++ b/prompts/chat-with-qwen.txt
@@ -0,0 +1 @@
+You are a helpful assistant.