Julia Longtin
|
5a6024279f
|
separate filling aux16 from consuming aux16 by making it an array of vectors.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
d351d995b0
|
loosen alignment requirements for zeros, add missing function, and promote aux8 to an array of vectors.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
185d4b8bf7
|
promote aux8 into a vector.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
a95c7b0138
|
fix our reference to src in the second place, and use a more accurate comment.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
babe051eaa
|
spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
b5c1135f4d
|
better comments, and fix some small errors.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
7e3eb5c01d
|
perform 16 operations at a time.
|
2024-05-13 22:12:55 +00:00 |
|
Julia Longtin
|
6d4535e829
|
use proper mov operator, and pass addresses.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
e72539bcc5
|
attempt our first FMA.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
b22e3e021e
|
add I32 vector memory clearing.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
1446a724df
|
promote aux32 to a vector.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
a9cc0e74d3
|
add missing address of operators.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
bff7b695b3
|
promote aux16 to a vector.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
df33835700
|
use quotes properly.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
2dc7991809
|
use better memory save operator.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
588a0b19cc
|
expand mask, and align memory.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
3994d81bf0
|
try to use vectorized zeroing function.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
e227717136
|
add missing variable.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
d5a27eb507
|
copy right block.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
9f92f9730e
|
fix typo.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
484c4abf8d
|
promote aux16 into a vector. (part three)
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
fb0fb9ff1b
|
promote aux16 into a vector.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
405b5fa731
|
promote aux16 into a vector.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
b92e06456c
|
formatting improvement.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
ea858eee03
|
first fixes.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
feed51c3f4
|
attempt to speed up float clearing.
|
2024-05-13 22:12:54 +00:00 |
|
Julia Longtin
|
2ed306623c
|
allow using code from ggml-phi-knc-dot_q5_K_q8_K.c
|
2024-05-13 22:12:50 +00:00 |
|
Julia Longtin
|
d5f39c3caa
|
force to compile.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
b794e48ff8
|
tell ggml-common.h to export what we want.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
2c5daab90f
|
pull in ggml specific types.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
7080280c5b
|
import stdio.h for size_t.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
96dce97091
|
import stdint.h for size_t.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
0e6c910db9
|
begin work on targeting dot_q5_K_q8_K.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
16cbe5dd81
|
be more specific about the length of our list of run amounts.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
c605e951dc
|
spacing changes.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
56be29fc58
|
formatting changes.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
97c69835dc
|
use the same header as ggml.c, and remove some warnings.
|
2024-05-13 22:11:16 +00:00 |
|
Julia Longtin
|
580a347e59
|
remove intrinsics import, and use upConv to save 12 bytes of memory transit.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
9ba28eaed3
|
Update ggml-phi-knc.c
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
72e2b13185
|
add a benchmark / test binary.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
6f699fc98d
|
merge from upstream
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
926b0e8076
|
Update ggml.c
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
6e1b77ad58
|
Update ggml.c
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
f940c96aac
|
Update ggml.c
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
2458643dac
|
implement F32 dot products.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
59ce785f61
|
import intrinsics.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
c08ddb831f
|
use right type, and define GGML_F32_VEC_ZERO.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
25095cac23
|
try to implement one intrinsic
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
8f6e535edc
|
try to detect the PHI cross compiler in make.
|
2024-05-13 22:11:15 +00:00 |
|
Julia Longtin
|
f7f174ecc9
|
try to detect the PHI cross compiler in make.
|
2024-05-13 22:11:15 +00:00 |
|