sampling : refactor init to use llama_sampling_params (#3696)

* sampling : refactor init to use llama_sampling_params

* llama : combine repetition, frequency and presence penalties in 1 call

* examples : remove embd-input and gptneox-wip

* sampling : rename penalty params + reduce size of "prev" vector

* sampling : add llama_sampling_print helper

* sampling : hide prev behind API and apply #3661

ggml-ci

Georgi Gerganov, 2023-10-20 21:07:23 +03:00, committed by GitHub

parent 8cf19d60dc

commit d1031cf49c

No known key found for this signature in database

GPG key ID: 4AEE18F83AFDEB23

30 changed files with 365 additions and 4502 deletions

1083 examples/gptneox-wip/gptneox-main.cpp

File diff suppressed because it is too large
