llama : add OpenELM support (#7359)

* Initial OpenELM support (270M only so far)

* Fill out missing entries in llama_model_type_name

* fixup! Initial OpenELM support (270M only so far)

Fix formatting

* llama : support all OpenELM models

* llama : add variable GQA and variable FFN sizes

Some metadata keys can now also be arrays to support setting
their value per-layer for models like OpenELM.

* llama : minor spacing changes

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* llama : use std::array for per-layer hparams

* llama : fix save/load state

* llama : do not print hparams for vocab-only models

* llama : handle n_head == 0

* llama : use const ref for print_f and fix division by zero

* llama : fix t5 uses of n_head and n_ff

* llama : minor comment

---------

Co-authored-by: Francis Couture-Harpin <git@compilade.net>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

This commit is contained in:

Icecream95

2024-07-05 05:14:21 +12:00

• committed by

GitHub

parent 6f63d646c1

commit d7fd29fff1

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

5 changed files with 676 additions and 176 deletions

643

src/llama.cpp

View file

File diff suppressed because it is too large Load diff

Rows
Columns

llama : add OpenELM support (#7359)

643 src/llama.cpp View file

643

src/llama.cpp

View file