llama : refactor `src/llama.cpp` (#10902)

* llama : scatter llama.cpp into multiple modules (wip)

* llama : control-vector -> adapter

* llama : arch

* llama : mmap

ggml-ci

* ci : remove BUILD_SHARED_LIBS=OFF

ggml-ci

* llama : arch (cont)

ggml-ci

* llama : chat

ggml-ci

* llama : model

ggml-ci

* llama : hparams

ggml-ci

* llama : adapter

ggml-ci

* examples : fix

ggml-ci

* rebase

ggml-ci

* minor

* llama : kv cache

ggml-ci

* llama : impl

ggml-ci

* llama : batch

ggml-ci

* cont

ggml-ci

* llama : context

ggml-ci

* minor

* llama : context (cont)

ggml-ci

* llama : model loader

ggml-ci

* common : update lora

ggml-ci

* llama : quant

ggml-ci

* llama : quant (cont)

ggml-ci

* minor [no ci]

This commit is contained in:

Georgi Gerganov

2025-01-03 10:18:53 +02:00

• committed by

GitHub

parent 2f0ee84b9b

commit f66f582927

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

61 changed files with 12193 additions and 11649 deletions

1414

src/llama-arch.cpp Normal file

View file

File diff suppressed because it is too large Load diff

Rows
Columns

llama : refactor src/llama.cpp (#10902)

1414 src/llama-arch.cpp Normal file View file

llama : refactor `src/llama.cpp` (#10902)

1414

src/llama-arch.cpp Normal file

View file