llama : add StableLM2 12B (#6635)

* StableLM2 12B support for huggingface -> GGUF

* StableLM2 12B tensor mapping and constants

* StableLM-2-12b model support

* fix

* Added 12B support

* Removed autoformatting; resolved bug where model_arch was not selecting StableLM2

* Formatting

* Do QK norm stacking in model conversion step

* Converge StableLM and StableLM2 code to simplify graph construction

* Fix accidental removal

* Removed warnings

* Revert formatter

* Move QK norm stack to private function so it's easier to read

* Refactor StableLM graph builder to support 1.6B, 3B and 12B more efficiently

* Proper check for None type for new_name to avoid crash; formatting; revert change to base class `write_tensors()`

* Format

* Formatting

* format

Co-authored-by: compilade <git@compilade.net>

* Fix incorrect check for K norm

* space after commas; Keep indentation multiple of 4 spaces

* Flake8 format

* Removed unnecessary conditional branches

* Removed unused comment

* Fixed incorrect tensor passing

* Format

---------

Co-authored-by: compilade <git@compilade.net>
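
The "QK norm stacking" step named in the message above can be pictured roughly as follows. This is a minimal sketch, not the code from this commit. It assumes the source checkpoint stores one norm weight vector per attention head under names like `model.layers.<block>.self_attn.q_layernorm.norms.<head>.weight` (an assumed naming pattern), and it uses plain Python lists in place of real tensors. The helper `stack_qk_norm` is hypothetical.

```python
def stack_qk_norm(norms, n_head, bid, layer_name="q_layernorm"):
    """Gather the per-head norm weights of one block into a single nested list.

    `norms` maps tensor names to weight vectors; entries that are consumed
    are removed from it. The name pattern is an assumption for illustration.
    """
    stacked = []
    for head in range(n_head):
        name = f"model.layers.{bid}.self_attn.{layer_name}.norms.{head}.weight"
        stacked.append(norms.pop(name))
    return stacked  # shape: [n_head][head_dim]


# Toy input: one block, 2 heads, head_dim 3.
norms = {
    "model.layers.0.self_attn.q_layernorm.norms.0.weight": [1.0, 1.0, 1.0],
    "model.layers.0.self_attn.q_layernorm.norms.1.weight": [2.0, 2.0, 2.0],
}
q_norm = stack_qk_norm(norms, n_head=2, bid=0)
```

Doing this once at conversion time means the converted file can carry a single Q norm and a single K norm tensor per block, which would line up with the `ATTN_Q_NORM` and `ATTN_K_NORM` entries added in the diff.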

Ashish authored 2024-04-16 08:48:35 -07:00; committed by GitHub

parent f4dea7da18

commit dbceec87c0

3 changed files with 134 additions and 12 deletions

									
gguf-py/gguf/constants.py (2 changes)

@@ -455,6 +455,8 @@ MODEL_TENSORS: dict[MODEL_ARCH, list[MODEL_TENSOR]] = {
         MODEL_TENSOR.FFN_GATE,
         MODEL_TENSOR.FFN_DOWN,
         MODEL_TENSOR.FFN_UP,
+        MODEL_TENSOR.ATTN_Q_NORM,
+        MODEL_TENSOR.ATTN_K_NORM,
     ],
     MODEL_ARCH.QWEN: [
         MODEL_TENSOR.TOKEN_EMBD,
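
To make the effect of this hunk concrete, here is a small sketch of how a per-architecture tensor table of this shape might be consulted. The two enums are cut-down, hypothetical stand-ins for the real `MODEL_ARCH` and `MODEL_TENSOR` definitions. Attaching the modified list to a `STABLELM` key is an assumption, since the hunk does not show which architecture the list belongs to. The helper `arch_supports` is not part of gguf-py.

```python
from enum import IntEnum, auto


class MODEL_ARCH(IntEnum):  # hypothetical, reduced stand-in
    STABLELM = auto()
    QWEN = auto()


class MODEL_TENSOR(IntEnum):  # hypothetical, reduced stand-in
    TOKEN_EMBD = auto()
    FFN_GATE = auto()
    FFN_DOWN = auto()
    FFN_UP = auto()
    ATTN_Q_NORM = auto()
    ATTN_K_NORM = auto()


# Which tensor kinds each architecture is allowed to contain.
MODEL_TENSORS: dict[MODEL_ARCH, list[MODEL_TENSOR]] = {
    MODEL_ARCH.STABLELM: [  # assumed owner of the list edited in the hunk
        MODEL_TENSOR.FFN_GATE,
        MODEL_TENSOR.FFN_DOWN,
        MODEL_TENSOR.FFN_UP,
        MODEL_TENSOR.ATTN_Q_NORM,  # added by the hunk
        MODEL_TENSOR.ATTN_K_NORM,  # added by the hunk
    ],
    MODEL_ARCH.QWEN: [
        MODEL_TENSOR.TOKEN_EMBD,
    ],
}


def arch_supports(arch: MODEL_ARCH, tensor: MODEL_TENSOR) -> bool:
    """Return True if `tensor` is registered for `arch` in the table."""
    return tensor in MODEL_TENSORS.get(arch, [])
```

With the two new entries in place, a lookup such as `arch_supports(MODEL_ARCH.STABLELM, MODEL_TENSOR.ATTN_Q_NORM)` succeeds, whereas before the change a table-driven converter would have had no slot for those tensors.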
