Julia Longtin
|
1c182a3896
|
promote aux8 into a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3fef54f5ce
|
fix our reference to src in the second place, and use a more accurate comment.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3cdfc9c596
|
spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
98c9b6972a
|
better comments, and fix some small errors.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
0c01d07835
|
perform 16 operations at a time.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
d34e0ff835
|
use proper mov operator, and pass addresses.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
e3468e041b
|
attempt our first FMA.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
da69ed5b3a
|
add I32 vector memory clearing.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
10237df57a
|
promote aux32 to a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
3c29fd57ce
|
add missing address of operators.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
45c94bd89d
|
promote aux16 to a vector.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
31b8a5afd7
|
use quotes properly.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
ed639a6cf9
|
use better memory save operator.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
5c010f761f
|
expand mask, and align memory.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
7a00422fa3
|
try to use vectorized zeroing function.
|
2024-06-09 18:01:49 +00:00 |
|
Julia Longtin
|
2870bfc6dd
|
add missing variable.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
656bf28c91
|
copy right block.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
e99f3a9bf4
|
fix typo.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
84093a6be6
|
promote aux16 into a vector. (part three)
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
66d26d4914
|
promote aux16 into a vector.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
2f0a949ae0
|
promote aux16 into a vector.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
ff29b659c8
|
formatting improvement.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
b3ec86e59c
|
first fixes.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
7f5adf3b5c
|
attempt to speed up float clearing.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a015d8485e
|
allow using code from ggml-phi-knc-dot_q5_K_q8_K.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
aee550af6c
|
force to compile.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a7f8abeb9b
|
tell ggml-common.h to export what we want.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
8703abe225
|
pull in ggml specific types.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
62e354354c
|
import stdio.h for size_t.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
3edaaca993
|
import stdint.h for sizeSt.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
669ce9b720
|
begin work on targeting dot_q5_K_q8_K.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
c9730c0e04
|
be more specific about the length of our list of run amounts.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a48d3b96d7
|
spacing changes.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
bb73cb319c
|
formatting changes.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a06fa4b1b5
|
use the same header as ggml.c, and remove some warnings.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
5a9d2f5f71
|
remove intrinsics import, and use upConv to save 12 bytes of memory transit.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
d095d8e9c7
|
Update ggml-phi-knc.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a56a6f31fa
|
add a benchmark / test binary.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
d7d679e41a
|
merge from upstream
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
c70b5f211b
|
Update ggml.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
114e7dd762
|
Update ggml.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
83be3dbab7
|
Update ggml.c
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
192e4ad857
|
implement F32 dot products.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
7fce3f6b67
|
import intrinsics.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
b5ea05f003
|
use right type, and define GGML_F32_VEC_ZERO.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
429d69fd22
|
try to implement one intrinsic
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
7fb8d477ca
|
try to detect the PHI cross compiler in make.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
366279e09e
|
try to detect the PHI cross compiler in make.
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
5c0d49cde4
|
instead of checking on glibc, check on SYS_getcpu
|
2024-06-09 18:01:48 +00:00 |
|
Julia Longtin
|
a83e2cadc0
|
handle the case that we have no glibc on the PHI.
|
2024-06-09 18:01:48 +00:00 |
|