Default branch

d7b31a9d84 · sync: minja (a72057e519) (#11774) · Updated 2025-02-10 09:34:09 +00:00

Branches

5261aee8d8 · sampling : one sequence per sampling context · Updated 2023-10-12 17:36:44 +00:00    vbatts

3304 behind · 1 ahead

2fcdf869cd · batched-bench : add mmq CLI arg · Updated 2023-10-11 16:42:33 +00:00    vbatts

3316 behind · 7 ahead

ee7456926e · ggml-alloc : fix assert in debug builds · Updated 2023-10-09 12:33:12 +00:00    vbatts

3325 behind · 1 ahead

ee268b5446 · llama : no longer perform uninitialized access to the KV cache · Updated 2023-10-08 08:49:38 +00:00    vbatts

3332 behind · 5 ahead

acead654d2 · Merge branch 'master' into fix-refact · Updated 2023-10-08 08:25:16 +00:00    vbatts

3332 behind · 4 ahead

6b9554a740 · metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7 · Updated 2023-10-08 06:55:13 +00:00    vbatts

3339 behind · 5 ahead

ba44776dc2 · bump version · Updated 2023-10-07 18:47:48 +00:00    vbatts

3338 behind · 6 ahead

5ab6c2132a · server-parallel : add "--reverse-prompt" + compiler warning fixes · Updated 2023-10-06 11:32:19 +00:00    vbatts

3351 behind · 4 ahead

5418932b71 · llama : fix comments for llama_kv_cache API · Updated 2023-10-03 18:01:52 +00:00    vbatts

3376 behind · 5 ahead

c5650ed470 · server : avoid context swaps by shifting the KV cache · Updated 2023-09-28 16:03:36 +00:00    vbatts

3400 behind · 57 ahead

72e7ef4e53 · simple : fixes · Updated 2023-09-26 21:19:36 +00:00    vbatts

3426 behind · 48 ahead

784d14ed31 · llama : store non-RoPEd K cache (WIP) · Updated 2023-09-17 20:43:07 +00:00    vbatts

3438 behind · 5 ahead

92a4f86879 · llama : make starcoder graph build more consistent with others · Updated 2023-09-15 14:57:10 +00:00    vbatts

3448 behind · 20 ahead

e7e7b11455 · llama : remove experimental stuff · Updated 2023-09-14 19:52:01 +00:00    vbatts

3460 behind · 3 ahead

2f689dee06 · metal : minor · Updated 2023-09-07 12:33:21 +00:00    vbatts

3493 behind · 5 ahead

30ac7a4117 · gitignore : metal · Updated 2023-09-04 19:23:16 +00:00    vbatts

3505 behind · 12 ahead

f3a84b2e0d · llama : better express the KV cache dependencies in the graph · Updated 2023-09-04 18:44:48 +00:00    vbatts

3505 behind · 5 ahead

c79d130f74 · make : fix speculative build · Updated 2023-09-04 12:50:04 +00:00    vbatts

3506 behind · 9 ahead

847896aba7 · speculative : add --draft CLI arg · Updated 2023-09-03 10:51:07 +00:00    vbatts

3512 behind · 3 ahead

8c2b881281 · cuda : poc for norm quants (only -b 1 works) · Updated 2023-08-30 18:42:28 +00:00    vbatts

3553 behind · 3 ahead