llama.cpp

Author	SHA1	Message	Date
Brian	c7b3616449	Update convert_hf_to_gguf.py It might help with the convert_lora_to_gguf.py script if default values were added here Co-authored-by: compilade <git@compilade.net>	2024-07-16 07:02:07 +10:00
brian khuu	9a925b56a0	metadata.py: account for decimal point in size label within model id components	2024-07-16 06:42:38 +10:00
Francis Couture-Harpin	417d7a7c62	convert_hf : use GGUFWriter to count model parameters	2024-07-16 06:42:38 +10:00
Francis Couture-Harpin	78a42fbee5	gguf-py : use pyyaml instead of python-frontmatter HF transformers already depends on pyyaml for model cards, so it should already be in the environment of the users of the convert scripts, unlike python-frontmatter. This should be completely equivalent since the model cards seem to use only YAML and never TOML.	2024-07-16 06:42:38 +10:00
brian khuu	3b1766a992	convert-*.py: flake8 remove blank line	2024-07-16 06:42:38 +10:00
brian khuu	f98f1098f9	convert-*.py: more rigorous regexp for get_model_id_components()	2024-07-16 06:42:38 +10:00
brian khuu	4e3761109d	covert-*.py: flake8 newline missing	2024-07-16 06:42:38 +10:00
brian khuu	8629b7bdc2	covert-*.py: per_model_weight_count_estimation() tensor arg type is Iterable[tuple[str, LazyTensor]]	2024-07-16 06:42:38 +10:00
brian khuu	144a7ec3a4	convert-*.py: pathlib.Path exist() --> is_file() or is_dir()	2024-07-16 06:42:38 +10:00
brian khuu	abc351c270	convert-*.py: quantized_by in model card is not relevant for converted gguf	2024-07-16 06:42:38 +10:00
brian khuu	9954b64862	convert-*.py: add logger and refactor load_model_card()	2024-07-16 06:42:38 +10:00
brian khuu	5cdb03b2fc	convert-*.py: update nix package to add python frontmatter	2024-07-16 06:42:38 +10:00
brian khuu	5ab1a84085	convert-*.py: dict_item --> Iterable	2024-07-16 06:42:38 +10:00
Brian	455c0e53ac	Apply suggestions from code review Co-authored-by: compilade <git@compilade.net>	2024-07-16 06:42:38 +10:00
brian khuu	ccff6c7fb2	convert-*.py: remove reference to uuid generation	2024-07-16 06:42:38 +10:00
Brian	8156835d4a	constants.py : Revert removal of backward compatibility KEY_GENERAL_SOURCE_URL	2024-07-16 06:42:38 +10:00
Brian	2c060303a6	Update constants.py : spacing correction	2024-07-16 06:42:38 +10:00
Brian	aa4e5892a0	Update convert_hf_to_gguf.py Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>	2024-07-16 06:42:38 +10:00
Brian	60278e4f4d	Update convert_hf_to_gguf.py Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>	2024-07-16 06:42:38 +10:00
brian khuu	ad217d7249	convert-*.py: remove autogenerated uuid	2024-07-16 06:42:38 +10:00
brian khuu	f2b425c59c	convert-*.py: import cast from typing and other refactor	2024-07-16 06:42:38 +10:00
brian khuu	04c4fffdcc	convert-*.py: prepare_tensors_for_writing() --> prepare_tensors() > Especially since it can be used for other purposes than "for writing", like preparing the tensors to then count and sum all their sizes. Co-authored-by: compilade <git@compilade.net>	2024-07-16 06:42:38 +10:00
brian khuu	64707b625c	convert-*.py: remove redundant gguf_writer.add_name() calls	2024-07-16 06:42:38 +10:00
brian khuu	f8b5931180	convert-*.py: parameter_class_attribute --> size_label	2024-07-16 06:42:38 +10:00
brian khuu	6eb08ac868	convert-*.py: Removing the redundant metadata is not None from all conditions, and indenting them. Co-authored-by: compilade <git@compilade.net>	2024-07-16 06:42:38 +10:00
brian khuu	4c91d077d2	convert-*.py: cast not required if Metadata.load_metadata_override returned a dict[str, Any] instead of a dict[str, object] Co-authored-by: compilade <git@compilade.net>	2024-07-16 06:42:38 +10:00
Brian	74383ba6d2	Apply suggestions from code review Co-authored-by: compilade <git@compilade.net>	2024-07-16 06:42:38 +10:00
brian khuu	dd14b8fdb1	convert-*.py: pyright type fixes	2024-07-16 06:42:38 +10:00
brian khuu	59a01df784	convert-*.py: refactor per model weight count estimation	2024-07-16 06:42:38 +10:00
brian khuu	2a976e1211	convert-*.py: write_tensors() --> prepare_tensors_for_writing()	2024-07-16 06:42:38 +10:00
brian khuu	fdc5a3fc80	convert-*.py: autogenerate general.uuid if missing	2024-07-16 06:42:35 +10:00
brian khuu	7ecb8f00a0	test: remove test_gguf.py and remove test_generate_any_missing_uuid()	2024-07-16 06:38:40 +10:00
brian khuu	007708e32d	gguf_writer.py: generate tensor uuid if missing	2024-07-16 06:38:40 +10:00
brian khuu	4dc8ddd35a	convert_hf_to_gguf.py: Remove code that is already in fill_templated_filename() and GGUFWriter()	2024-07-16 06:38:40 +10:00
brian khuu	2f23927d37	convert_hf_to_gguf.py: rebase error correction	2024-07-16 06:38:40 +10:00
brian khuu	5011eefeaf	convert_hf_to_gguf.py: optional, dataclass removed from type as it was unused	2024-07-16 06:38:40 +10:00
brian khuu	e9734434bd	convert-*.py: Remove self.model_name that was left in since last rebase	2024-07-16 06:38:40 +10:00
brian khuu	eaa47f5546	convert-*.py: separated unit test, hf_repo to repo_url	2024-07-16 06:38:40 +10:00
brian khuu	d060fcdbe2	convert-*.py: adjusted authorship KV store	2024-07-16 06:38:40 +10:00
brian khuu	91e65d9485	convert-*.py: add unittest to metadata class	2024-07-16 06:38:38 +10:00
brian khuu	3625a42061	convert-*.py: add heuristic to directory name fallback Also add source_url for huggingface url	2024-07-16 06:37:42 +10:00
brian khuu	39472a09da	convert-*.py: need to include self in per_model_weight_count_estimation()	2024-07-16 06:37:42 +10:00
brian khuu	54918ad14e	convert-*.py: refactor parameter weight class	2024-07-16 06:37:42 +10:00
brian khuu	32e80e094c	convert-*.py: base_model is actually in spec for model cards	2024-07-16 06:37:42 +10:00
brian khuu	4d5cd0670a	convert-*.py: use heuristics to parse _name_or_path	2024-07-16 06:37:42 +10:00
brian khuu	b0553f42da	convert-*.py: adjust help message	2024-07-16 06:37:42 +10:00
brian khuu	dd1571211e	convert-*.py: add quantized_by and enhance heuristics	2024-07-16 06:37:38 +10:00
brian khuu	5a86dfaa1c	convert-*.py: add general.organization to kv store	2024-07-16 06:36:03 +10:00
brian khuu	f7c20793b9	convert-*.py: enable --model-name direct metadata override	2024-07-16 06:36:03 +10:00
brian khuu	b1927eed82	convert-*.py: move per model weight estimation away from util back to main script plus some refactoring	2024-07-16 06:36:03 +10:00

1 2 3 4 5 ...

3460 commits