Default branch

d7b31a9d84 · sync: minja (a72057e519) (#11774) · Updated 2025-02-10 09:34:09 +00:00

Branches

c8554b80be · Merge branch 'master' of https://github.com/ggerganov/llama.cpp into ceb/fix-cuda-warning-flags · Updated 2023-12-13 17:06:01 +00:00    vbatts

3050
12

e1241d9b46 · metal : switch to execution barriers + fix one of the barriers · Updated 2023-12-13 11:56:45 +00:00    vbatts

3061
47

fc5f334689 · readme : add API change notice · Updated 2023-12-07 10:35:02 +00:00    vbatts

3063
15

af99c6fbfc · llama : remove memory_f16 and kv_f16 flags · Updated 2023-12-05 16:18:16 +00:00    vbatts

3075
26

3cb1c348b3 · metal : try to improve batched decoding · Updated 2023-12-01 20:01:58 +00:00    vbatts

3080
2

eb594c0f7d · alloc : fix build with debug · Updated 2023-12-01 08:46:05 +00:00    vbatts

3104
14

5b74310e6e · build : enable libstdc++ assertions for debug builds · Updated 2023-11-30 23:18:24 +00:00    vbatts

3089
1

bb39b87964 · ggml : restore abort() in GGML_ASSERT · Updated 2023-11-28 00:27:09 +00:00    vbatts

3108
1

87f4102a70 · llama : revert n_threads_batch logic · Updated 2023-11-27 19:47:35 +00:00    vbatts

3109
3

6272b6764a · use stride=128 if built for tensor cores · Updated 2023-11-27 18:09:14 +00:00    vbatts

3112
3

8d8b76d469 · lookahead : add comments · Updated 2023-11-26 09:26:55 +00:00    vbatts

3124
9

21b70babf7 · straightforward /v1/models endpoint · Updated 2023-11-24 16:22:39 +00:00    vbatts

3125
12

f8e9f11428 · common : add -dkvc arg for enabling kv cache dumps · Updated 2023-11-23 16:47:56 +00:00    vbatts

3131
4

f824902623 · YaRN : correction to GPT-NeoX implementation · Updated 2023-11-15 22:10:52 +00:00    vbatts

3163
1

d0445a2eff · better documentation · Updated 2023-11-10 00:38:20 +00:00    vbatts

3180
3

47d604fa2d · fix issues · Updated 2023-11-05 12:20:22 +00:00    vbatts

3194
3

3ef358fffd · Revert "cuda : use CUDA memory pool with async memory allocation/deallocation when available (#3903)" · Updated 2023-11-04 20:26:51 +00:00    vbatts

3198
2

46868a499e · metal : multi-simd softmax · Updated 2023-11-01 19:16:34 +00:00    vbatts

3223
1

a8796f9609 · llm : cleanup + comments · Updated 2023-11-01 18:08:02 +00:00    vbatts

3232
4

7420bef83e · wip wip wip · Updated 2023-11-01 06:51:43 +00:00    vbatts

3232
1