Default branch

d7b31a9d84 · sync: minja (a72057e519) (#11774) · Updated 2025-02-10 09:34:09 +00:00

Branches

bf3494345e · metal : some mul_mv experiments · Updated 2024-11-26 12:48:50 +00:00    vbatts

502
1

b83cae088c · speculative : add infill mode · Updated 2024-11-26 09:14:17 +00:00    vbatts

507
1

1ee6c482d0 · Merge branch 'master' into compilade/mamba2 · Updated 2024-11-25 17:06:56 +00:00    vbatts

516
24

4ff0831ce6 · metal : use F16 math in mul_mat kernels · Updated 2024-11-25 13:15:26 +00:00    vbatts

520
1

f7b0233eca · wip · Updated 2024-11-16 08:33:55 +00:00    vbatts

582
1

5e6dad9322 · speculative : experimenting with Qwen2.5 · Updated 2024-11-14 09:31:31 +00:00    vbatts

604
2

33bdee667e · speculative : fix out-of-bounds access · Updated 2024-11-14 09:23:45 +00:00    vbatts

604
1

8c1b186cb5 · metal : minor Q4_0 optimization · Updated 2024-11-12 13:30:51 +00:00    vbatts

614
21

3d1fe1bb4d · metal : int -> short, style · Updated 2024-11-09 08:32:16 +00:00    vbatts

625
2

bd1198a67a · metal : fix build and some more comments · Updated 2024-11-09 08:09:50 +00:00    vbatts

625
1

a2385da59c · make : clean-up [no ci] · Updated 2024-11-08 11:46:20 +00:00    vbatts

632
9

94accca4c2 · vec move mask to shmem · Updated 2024-11-07 18:58:10 +00:00    vbatts

642
19

c5d8bb5a81 · leave only basic functions for SYCL CI · Updated 2024-11-06 07:47:50 +00:00    vbatts

707
2

4fc8673d09 · llama-bench : skip repeated values in consecutive lines · Updated 2024-11-02 14:37:33 +00:00    vbatts

667
1

20e12112fd · llama : suggest reduce ctx size when kv init fails · Updated 2024-11-01 23:55:19 +00:00    vbatts

670
2

afc4a7de65 · llama : enable flash attn automatically when supported · Updated 2024-10-30 22:30:06 +00:00    vbatts

687
1

8233009d4d · Support SYCL device register · Updated 2024-10-20 02:06:51 +00:00    vbatts

768
1

bc82fc2ed8 · llama-bench : add time-to-first-byte stat · Updated 2024-10-18 13:40:02 +00:00    vbatts

739
1

2d3fc54ac6 · add amx kernel for gemm · Updated 2024-10-18 03:35:49 +00:00    vbatts

749
1

630bce5a7f · ggml : fix possible buffer use after free in sched reserve · Updated 2024-10-17 22:21:54 +00:00    vbatts

747
1