llama : add Qwen support (#4281)

* enable qwen to llama.cpp

* llama : do not GPU split bias tensors

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
This commit is contained in:
Shijie 2023-12-02 02:16:31 +08:00 committed by GitHub
parent 880f57973b
commit 37c746d687
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
5 changed files with 372 additions and 9 deletions

View file

@ -0,0 +1 @@
You are a helpful assistant.