Commit graph

2480 commits

Author SHA1 Message Date
Julia Longtin
77d4ca906b spacing and capitalization changes. 2024-04-25 21:23:22 +00:00
Julia Longtin
d69cf87fce use or, instead of and. bug fix? 2024-04-24 17:50:12 +00:00
Julia Longtin
8cae9a9ef6 comment and spacing fixes. 2024-04-24 17:38:42 +00:00
Julia Longtin
90e99eaf1c fix an offset error, and get rid of tabs. 2024-04-22 18:29:31 +00:00
Julia Longtin
6d16090246 fix some small errors. 2024-04-22 18:22:22 +00:00
Julia Longtin
e298d9e65e further optimizations. 0.99 tokens per second. 2024-04-22 18:16:28 +00:00
Julia Longtin
53773e0b4a replace tabs with spaces. 2024-04-03 23:42:34 +00:00
Julia Longtin
9152143fe7 reformat, and label what these files are. 2024-04-03 23:21:24 +00:00
Julia Longtin
9ad5efafb0 use GGML_F32_EPR, and remove some dead code. 2024-04-03 22:04:45 +00:00
Julia Longtin
84df774d6a whoops. missing tab. 2024-04-03 21:58:29 +00:00
Julia Longtin
9412572205 add Makefile rule for generation .s file, for manual inspection. 2024-04-03 20:30:25 +00:00
Julia Longtin
6f67ea886f formatting changes. 2024-04-03 20:24:00 +00:00
Julia Longtin
96fdd214c8 indent headers consistently. 2024-04-03 19:01:18 +00:00
Julia Longtin
cb4422625a
Merge pull request #1 from julialongtin/k1om
K1om initial support. Round 1.
2024-04-02 17:07:46 +00:00
Julia Longtin
47190a7fe2 formatting. 2024-04-02 17:01:53 +00:00
Julia Longtin
8c17353717 minor changes. 2024-04-02 16:55:40 +00:00
Julia Longtin
9f569ca50b massively rewrite assembly routines. 2024-04-02 15:41:56 +00:00
Julia Longtin
12c9576aec fix vector sizes. 2024-03-25 19:43:37 +00:00
Julia Longtin
bc3d6db862 separate filling aux16 from consuming aux16 by making it an array of vectors. 2024-03-24 14:18:08 +00:00
Julia Longtin
ca0dc26704 loosen alignment requirements for zeros, add missing function, and promote aux8 to an array of vectors. 2024-03-24 13:35:05 +00:00
Julia Longtin
cf481cf901 promote aux8 into a vector. 2024-03-24 12:50:01 +00:00
Julia Longtin
169a145409 fix our reference to src in the second place, and use a more accurate comment. 2024-03-24 12:41:21 +00:00
Julia Longtin
c28bfe4552 spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src. 2024-03-24 12:37:47 +00:00
Julia Longtin
ba4f4129b3 better comments, and fix some small errors. 2024-03-24 12:17:06 +00:00
Julia Longtin
03a3e0eb7a perform 16 operations at a time. 2024-03-24 12:04:44 +00:00
Julia Longtin
5935bb34f4 use proper mov operator, and pass addresses. 2024-03-23 23:46:36 +00:00
Julia Longtin
a5132a1507 attempt our first FMA. 2024-03-23 22:16:57 +00:00
Julia Longtin
4477b8e123 add I32 vector memory clearing. 2024-03-23 21:16:23 +00:00
Julia Longtin
ea1edb0600 promote aux32 to a vector. 2024-03-23 21:12:35 +00:00
Julia Longtin
f967690a41 add missing address of operators. 2024-03-23 21:05:50 +00:00
Julia Longtin
2fdd11fe3a promote aux16 to a vector. 2024-03-23 21:00:51 +00:00
Julia Longtin
f09b3ed79e use quotes properly. 2024-03-23 20:53:16 +00:00
Julia Longtin
bb5eb95816 use better memory save operator. 2024-03-23 20:49:11 +00:00
Julia Longtin
9d7ca41703 expand mask, and align memory. 2024-03-23 20:48:43 +00:00
Julia Longtin
bd6d7e6238 try to use vectorized zeroing function. 2024-03-23 19:55:12 +00:00
Julia Longtin
f985372e3a add missing variable. 2024-03-23 19:49:16 +00:00
Julia Longtin
31d4f9312b copy right block. 2024-03-23 19:47:21 +00:00
Julia Longtin
e43a63e7c6 fix typo. 2024-03-23 16:29:30 +00:00
Julia Longtin
f092a10dc9 promote aux16 into a vector. (part three) 2024-03-23 16:27:11 +00:00
Julia Longtin
c72157a5a6 promote aux16 into a vector. 2024-03-23 16:24:11 +00:00
Julia Longtin
e3503c924a promote aux16 into a vector. 2024-03-23 16:21:20 +00:00
Julia Longtin
edb76ffddb formatting improvement. 2024-03-23 16:19:17 +00:00
Julia Longtin
6face8a0be first fixes. 2024-03-23 15:56:47 +00:00
Julia Longtin
0a2051aa88 attempt to speed up float clearing. 2024-03-23 15:55:00 +00:00
Julia Longtin
0b012c03ef allow using code from ggml-phi-knc-dot_q5_K_q8_K.c 2024-03-23 15:02:56 +00:00
Julia Longtin
0b3f17127f force to compile. 2024-03-23 14:58:33 +00:00
Julia Longtin
18f353987c tell ggml-common.h to export what we want. 2024-03-23 14:49:35 +00:00
Julia Longtin
cd20404250 pull in ggml specific types. 2024-03-23 14:38:15 +00:00
Julia Longtin
8f57803f58 import stdio.h for size_t. 2024-03-23 14:29:59 +00:00
Julia Longtin
9bcb8350d5 import stdint.h for sizeSt. 2024-03-23 14:28:29 +00:00