Commit graph

2736 commits

Author SHA1 Message Date
Julia Longtin
3ff09248ff indent headers consistently. 2024-06-09 18:01:49 +00:00
Julia Longtin
3cf6eb0cc0 formatting. 2024-06-09 18:01:49 +00:00
Julia Longtin
90498c1181 minor changes. 2024-06-09 18:01:49 +00:00
Julia Longtin
33cc1d8c8e massively rewrite assembly routines. 2024-06-09 18:01:49 +00:00
Julia Longtin
20c2bc53f9 fix vector sizes. 2024-06-09 18:01:49 +00:00
Julia Longtin
2a47e5f05f separate filling aux16 from consuming aux16 by making it an array of vectors. 2024-06-09 18:01:49 +00:00
Julia Longtin
e579af1e95 loosen alignment requirements for zeros, add missing function, and promote aux8 to an array of vectors. 2024-06-09 18:01:49 +00:00
Julia Longtin
1c182a3896 promote aux8 into a vector. 2024-06-09 18:01:49 +00:00
Julia Longtin
3fef54f5ce fix our reference to src in the second place, and use a more accurate comment. 2024-06-09 18:01:49 +00:00
Julia Longtin
3cdfc9c596 spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src. 2024-06-09 18:01:49 +00:00
Julia Longtin
98c9b6972a better comments, and fix some small errors. 2024-06-09 18:01:49 +00:00
Julia Longtin
0c01d07835 perform 16 operations at a time. 2024-06-09 18:01:49 +00:00
Julia Longtin
d34e0ff835 use proper mov operator, and pass addresses. 2024-06-09 18:01:49 +00:00
Julia Longtin
e3468e041b attempt our first FMA. 2024-06-09 18:01:49 +00:00
Julia Longtin
da69ed5b3a add I32 vector memory clearing. 2024-06-09 18:01:49 +00:00
Julia Longtin
10237df57a promote aux32 to a vector. 2024-06-09 18:01:49 +00:00
Julia Longtin
3c29fd57ce add missing address of operators. 2024-06-09 18:01:49 +00:00
Julia Longtin
45c94bd89d promote aux16 to a vector. 2024-06-09 18:01:49 +00:00
Julia Longtin
31b8a5afd7 use quotes properly. 2024-06-09 18:01:49 +00:00
Julia Longtin
ed639a6cf9 use better memory save operator. 2024-06-09 18:01:49 +00:00
Julia Longtin
5c010f761f expand mask, and align memory. 2024-06-09 18:01:49 +00:00
Julia Longtin
7a00422fa3 try to use vectorized zeroing function. 2024-06-09 18:01:49 +00:00
Julia Longtin
2870bfc6dd add missing variable. 2024-06-09 18:01:48 +00:00
Julia Longtin
656bf28c91 copy right block. 2024-06-09 18:01:48 +00:00
Julia Longtin
e99f3a9bf4 fix typo. 2024-06-09 18:01:48 +00:00
Julia Longtin
84093a6be6 promote aux16 into a vector. (part three) 2024-06-09 18:01:48 +00:00
Julia Longtin
66d26d4914 promote aux16 into a vector. 2024-06-09 18:01:48 +00:00
Julia Longtin
2f0a949ae0 promote aux16 into a vector. 2024-06-09 18:01:48 +00:00
Julia Longtin
ff29b659c8 formatting improvement. 2024-06-09 18:01:48 +00:00
Julia Longtin
b3ec86e59c first fixes. 2024-06-09 18:01:48 +00:00
Julia Longtin
7f5adf3b5c attempt to speed up float clearing. 2024-06-09 18:01:48 +00:00
Julia Longtin
a015d8485e allow using code from ggml-phi-knc-dot_q5_K_q8_K.c 2024-06-09 18:01:48 +00:00
Julia Longtin
aee550af6c force to compile. 2024-06-09 18:01:48 +00:00
Julia Longtin
a7f8abeb9b tell ggml-common.h to export what we want. 2024-06-09 18:01:48 +00:00
Julia Longtin
8703abe225 pull in ggml specific types. 2024-06-09 18:01:48 +00:00
Julia Longtin
62e354354c import stdio.h for size_t. 2024-06-09 18:01:48 +00:00
Julia Longtin
3edaaca993 import stdint.h for sizeSt. 2024-06-09 18:01:48 +00:00
Julia Longtin
669ce9b720 begin work on targeting dot_q5_K_q8_K. 2024-06-09 18:01:48 +00:00
Julia Longtin
c9730c0e04 be more specific about the length of our list of run amounts. 2024-06-09 18:01:48 +00:00
Julia Longtin
a48d3b96d7 spacing changes. 2024-06-09 18:01:48 +00:00
Julia Longtin
bb73cb319c formatting changes. 2024-06-09 18:01:48 +00:00
Julia Longtin
a06fa4b1b5 use the same header as ggml.c, and remove some warnings. 2024-06-09 18:01:48 +00:00
Julia Longtin
5a9d2f5f71 remove intrinsics import, and use upConv to save 12 bytes of memory transit. 2024-06-09 18:01:48 +00:00
Julia Longtin
d095d8e9c7 Update ggml-phi-knc.c 2024-06-09 18:01:48 +00:00
Julia Longtin
a56a6f31fa add a benchmark / test binary. 2024-06-09 18:01:48 +00:00
Julia Longtin
d7d679e41a merge from upstream 2024-06-09 18:01:48 +00:00
Julia Longtin
c70b5f211b Update ggml.c 2024-06-09 18:01:48 +00:00
Julia Longtin
114e7dd762 Update ggml.c 2024-06-09 18:01:48 +00:00
Julia Longtin
83be3dbab7 Update ggml.c 2024-06-09 18:01:48 +00:00
Julia Longtin
192e4ad857 implement F32 dot products. 2024-06-09 18:01:48 +00:00