Julia Longtin
|
0124f7acd8
|
use or, instead of and. bug fix?
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
dc1f639bf0
|
comment and spacing fixes.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
4fb1547ba6
|
fix an offset error, and get rid of tabs.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
e37b7f8497
|
fix some small errors.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
c3d438bce2
|
further optimizations. 0.99 tokens per second.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
d966ac2ebe
|
replace tabs with spaces.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
fb83cd987d
|
reformat, and label what these files are.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
b8abefbec6
|
use GGML_F32_EPR, and remove some dead code.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
f84859a926
|
whoops. missing tab.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
ded4da4518
|
add Makefile rule for generation .s file, for manual inspection.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
aeb5ae85ad
|
formatting changes.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3ff09248ff
|
indent headers consistently.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3cf6eb0cc0
|
formatting.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
90498c1181
|
minor changes.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
33cc1d8c8e
|
massively rewrite assembly routines.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
20c2bc53f9
|
fix vector sizes.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
2a47e5f05f
|
separate filling aux16 from consuming aux16 by making it an array of vectors.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
e579af1e95
|
loosen alignment requirements for zeros, add missing function, and promote aux8 to an array of vectors.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
1c182a3896
|
promote aux8 into a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3fef54f5ce
|
fix our reference to src in the second place, and use a more accurate comment.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3cdfc9c596
|
spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
98c9b6972a
|
better comments, and fix some small errors.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
0c01d07835
|
perform 16 operations at a time.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
d34e0ff835
|
use proper mov operator, and pass addresses.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
e3468e041b
|
attempt our first FMA.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
da69ed5b3a
|
add I32 vector memory clearing.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
10237df57a
|
promote aux32 to a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3c29fd57ce
|
add missing address of operators.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
45c94bd89d
|
promote aux16 to a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
31b8a5afd7
|
use quotes properly.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
ed639a6cf9
|
use better memory save operator.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
5c010f761f
|
expand mask, and align memory.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
7a00422fa3
|
try to use vectorized zeroing function.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
2870bfc6dd
|
add missing variable.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
656bf28c91
|
copy right block.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
e99f3a9bf4
|
fix typo.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
84093a6be6
|
promote aux16 into a vector. (part three)
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
66d26d4914
|
promote aux16 into a vector.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
2f0a949ae0
|
promote aux16 into a vector.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
ff29b659c8
|
formatting improvement.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
b3ec86e59c
|
first fixes.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
7f5adf3b5c
|
attempt to speed up float clearing.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a015d8485e
|
allow using code from ggml-phi-knc-dot_q5_K_q8_K.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
aee550af6c
|
force to compile.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a7f8abeb9b
|
tell ggml-common.h to export what we want.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
8703abe225
|
pull in ggml specific types.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
62e354354c
|
import stdio.h for size_t.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
3edaaca993
|
import stdint.h for sizeSt.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
669ce9b720
|
begin work on targeting dot_q5_K_q8_K.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
c9730c0e04
|
be more specific about the length of our list of run amounts.
|
2024-06-09 18:01:48 +00:00 |
|