gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

* gguf-py: Refactor and add file reading support

* Replay changes from #3871

Credit to @cebtenzzre for that pull

* Various type annotation fixes.

* sort imports with isort (again)

* Fix missing return statement in add_tensor

* style cleanup with flake8

* fix NamedTuple and Enum usage

* Fix an issue with state init in GGUFReader

Move examples to an examples/ directory

Clean up examples

Add an example of modifying keys in a GGUF file

Update documentation with info on examples

Try to support people importing gguf/gguf.py directly

* Damagage is not a word.

* Clean up gguf-py/examples/modify_gguf.py whitespace

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Update gguf-py/examples/modify_gguf.py formatting

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Update gguf-py/gguf/gguf_reader.py type hint

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Make examples executable, formatting changes

* Add more information to GGUFReader and examples comments

* Include a gguf Python package version bump

* Add convert-gguf-endian.py script

* cleanup

* gguf-py : bump minor version

* Reorganize scripts

* Make GGUFReader endian detection less arbitrary

* Add JSON dumping support to gguf-dump.py

Which I kind of regret now

* A few for gguf-dump.py cleanups

* Murder accidental tuple in gguf-py/scripts/gguf-dump.py

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* cleanup

* constants : remove unneeded type annotations

* fix python 3.8 compat

* Set up gguf- scripts in pyproject.toml

* And include scripts/__init__.py, derp

* convert.py: We can't currently support Q8_0 on big endian.

* gguf-py: SpecialVocab: Always try available sources for special token ids

gguf-py: SpecialVocab: Try to load merges from merges.txt if not in tokenizer.json

gguf-py: SpecialVocab: Add 'add_bos_token' type bools to GGUF metadata
u

* cleanup

* Promote add_X_token to GGUF metadata for BOS and EOS

---------

Co-authored-by: Jared Van Bortel <jared@nomic.ai>
Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

This commit is contained in:

Kerfuffle

2023-11-10 22:04:50 -07:00

• committed by

GitHub

parent 4a4fd3eefa

commit 34b0a08207

No known key found for this signature in database

GPG key ID: 4AEE18F83AFDEB23

20 changed files with 1982 additions and 1176 deletions

									
										8

gguf-py/pyproject.toml
									
										View file
										
				@ -1,11 +1,12 @@

				[tool.poetry]

				name = "gguf"

				version = "0.4.6"

				version = "0.5.0"

				description = "Write ML models in GGUF for GGML"

				authors = ["GGML <ggml@ggml.ai>"]

				packages = [

				    {include = "gguf"},

				    {include = "gguf/py.typed"},

				    {include = "scripts"},

				]

				readme = "README.md"

				homepage = "https://ggml.ai"

				@ -27,3 +28,8 @@ pytest = "^5.2"

				[build-system]

				requires = ["poetry-core>=1.0.0"]

				build-backend = "poetry.core.masonry.api"

				[tool.poetry.scripts]

				gguf-convert-endian = "scripts:gguf_convert_endian_entrypoint"

				gguf-dump = "scripts:gguf_dump_entrypoint"

				gguf-set-metadata = "scripts:gguf_set_metadata_entrypoint"

Rows
Columns

gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

8 gguf-py/pyproject.toml Unescape Escape View file

8

gguf-py/pyproject.toml

View file