Commit graph

2 commits

Author SHA1 Message Date
Georgi Gerganov
0dc0e9aa42
models : convert vocab files to LFS
ggml-ci
2024-05-08 09:54:38 +03:00
Galunid
daab3d7f45
Add more tokenizer tests (#3742)
* Add more tokenizer tests

* Add starcoder

* Update test vocab files

* Restrict bpe tokenizer tests to unicode planes

* Update comment

* Comment cosmetics

* Remove bloom vocab/test
2023-10-24 09:17:17 +02:00