sampling : refactor init to use llama_sampling_params (#3696)

* sampling : refactor init to use llama_sampling_params

* llama : combine repetition, frequency and presence penalties in 1 call

* examples : remove embd-input and gptneox-wip

* sampling : rename penalty params + reduce size of "prev" vector

* sampling : add llama_sampling_print helper

* sampling : hide prev behind API and apply #3661

ggml-ci

This commit is contained in:

Georgi Gerganov

2023-10-20 21:07:23 +03:00

• committed by

GitHub

parent 8cf19d60dc

commit d1031cf49c

No known key found for this signature in database

GPG key ID: 4AEE18F83AFDEB23

30 changed files with 365 additions and 4502 deletions

4

examples/embd-input/.gitignore vendored

View file

 @ -1,4 +0,0 @@
 PandaGPT
 MiniGPT-4
 *.pth

Rows
Columns

sampling : refactor init to use llama_sampling_params (#3696)

4 examples/embd-input/.gitignore vendored Unescape Escape View file

4

examples/embd-input/.gitignore vendored

View file