server : fix context shift (#5195)

* server : fix context shift + simplify self-extend

* server : take system_tokens into account

* server : more n_past fixes

* server : rever n_past_se changes
This commit is contained in:
Georgi Gerganov 2024-01-30 20:17:30 +02:00 committed by GitHub
parent 4003be0e5f
commit e6f291d158
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2 changed files with 60 additions and 50 deletions

View file

@ -48,6 +48,7 @@ chat_completion() {
top_p: 0.9,
n_keep: $n_keep,
n_predict: 256,
cache_prompt: true,
stop: ["\n### Human:"],
stream: true
}')"