Concedo
|
74edc401c1
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# CMakeLists.txt
# README.md
# flake.nix
# scripts/build-info.cmake
# scripts/verify-checksum-models.py
|
2023-09-27 01:30:15 +08:00 |
|
Concedo
|
eb86cd4027
|
bump token limits
|
2023-09-27 01:26:00 +08:00 |
|
Concedo
|
8bf6f7f8b0
|
added simulated OAI endpoint
|
2023-09-27 00:49:24 +08:00 |
|
Concedo
|
7f112e2cd4
|
support genkeys in polled streaming
|
2023-09-26 23:46:07 +08:00 |
|
DAN™
|
99115f3fa6
|
cmake : fix build-info.h on MSVC (#3309)
|
2023-09-25 18:45:33 -04:00 |
|
2f38b454
|
1726f9626f
|
docs: Fix typo CLBlast_DIR var. (#3330)
|
2023-09-25 20:24:52 +02:00 |
|
Concedo
|
6c2134a860
|
improved makefile, allowing building without k quants
|
2023-09-25 22:10:47 +08:00 |
|
Erik Scholz
|
a98b1633d5
|
nix : add cuda, use a symlinked toolkit for cmake (#3202)
|
2023-09-25 13:48:30 +02:00 |
|
Concedo
|
17ee719c56
|
improved remotelink cmd, fixed lib unload, updated class.py
|
2023-09-25 17:50:00 +08:00 |
|
Concedo
|
fdadbd0fbb
|
updated lite (+1 squashed commits)
Squashed commits:
[b4408c79] updated lite
|
2023-09-24 23:07:37 +08:00 |
|
Concedo
|
8ecf505d5d
|
improved embedded horde worker (+2 squashed commit)
Squashed commit:
[99234379] improved embedded horde worker
[ebcd1968] update lite
|
2023-09-24 15:16:49 +08:00 |
|
slaren
|
c091cdfb24
|
llama-bench : add README (#3317)
* llama-bench : add README
* minor edit
|
2023-09-23 21:48:24 +02:00 |
|
Concedo
|
32cf02487e
|
colab use mmq, update lite and ver
|
2023-09-23 23:32:00 +08:00 |
|
Cebtenzzre
|
51a7cf5c6e
|
examples : fix RoPE defaults to match PR #3240 (#3315)
|
2023-09-23 12:28:50 +03:00 |
|
Concedo
|
60098a176b
|
update colab model
|
2023-09-23 16:30:40 +08:00 |
|
Concedo
|
bfc696fcc4
|
update lite, update ver
|
2023-09-23 12:35:23 +08:00 |
|
Kevin Ji
|
bedb92b603
|
scripts : use /usr/bin/env in shebang (#3313)
|
2023-09-22 23:52:23 -04:00 |
|
Concedo
|
bd2500db36
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# README.md
# build.zig
# flake.nix
|
2023-09-23 10:51:34 +08:00 |
|
Concedo
|
a64d182b8b
|
sched yield fix again
|
2023-09-23 10:44:41 +08:00 |
|
Concedo
|
1f9e36c733
|
minor lite fixes
|
2023-09-23 09:37:49 +08:00 |
|
Concedo
|
de4e27904d
|
clear reader copy on new gen
|
2023-09-23 00:13:19 +08:00 |
|
Lee Drake
|
bc9d3e3971
|
Update README.md (#3289)
* Update README.md
* Update README.md
Co-authored-by: slaren <slarengh@gmail.com>
---------
Co-authored-by: slaren <slarengh@gmail.com>
|
2023-09-21 21:00:24 +02:00 |
|
shibe2
|
36b904e200
|
ggml-opencl.cpp: Make private functions static (#3300)
|
2023-09-21 14:10:26 -04:00 |
|
Concedo
|
14295922f9
|
updated ver, updated lite (+1 squashed commits)
Squashed commits:
[891291bc] updated lite to v67
|
2023-09-21 17:44:01 +08:00 |
|
Edward Taylor
|
324f3403d5
|
zig : fix for updated c lib (#3259)
|
2023-09-21 12:08:20 +03:00 |
|
yuiseki
|
f56c418ab0
|
embedding : update README.md (#3224)
|
2023-09-21 11:57:40 +03:00 |
|
Johannes Gäßler
|
8185710a80
|
CUDA: use only 1 thread if fully offloaded (#2915)
|
2023-09-21 11:43:53 +03:00 |
|
Georgi Gerganov
|
7eb41179ed
|
readme : update hot topics
|
2023-09-20 20:48:22 +03:00 |
|
Cebtenzzre
|
a5661d7e71
|
llama : allow gguf RoPE keys to be overridden with defaults (#3240)
|
2023-09-20 12:12:47 -04:00 |
|
Cebtenzzre
|
65c2c1c5ab
|
benchmark-matmult : do not use integer abs() on a float (#3277)
|
2023-09-20 12:06:08 -04:00 |
|
Concedo
|
2dda63a4eb
|
add tensor split field
|
2023-09-20 22:46:47 +08:00 |
|
kang
|
80834daecf
|
flake : Restore default package's buildInputs (#3262)
|
2023-09-20 15:48:22 +02:00 |
|
Concedo
|
712b8423f6
|
class.py changes
|
2023-09-20 21:27:49 +08:00 |
|
Concedo
|
b63cf223c9
|
add queue info
|
2023-09-20 21:07:21 +08:00 |
|
Concedo
|
0eb52cf6c2
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# Makefile
|
2023-09-20 21:01:34 +08:00 |
|
Concedo
|
006e87cb56
|
requirements txt
|
2023-09-20 21:00:23 +08:00 |
|
Alon
|
a40f2b656f
|
CI: FreeBSD fix (#3258)
* - freebsd ci: use qemu
|
2023-09-20 14:06:36 +02:00 |
|
Concedo
|
4a0c515da7
|
rename notepad to classic
|
2023-09-20 17:51:02 +08:00 |
|
Concedo
|
436cd474cd
|
regex fix
|
2023-09-20 16:02:19 +08:00 |
|
Georgi Gerganov
|
d119c04c15
|
examples : fix benchmark-matmult (#1554)
The precision for Q4_0 has degraded since #1508
|
2023-09-20 10:02:39 +03:00 |
|
Concedo
|
2fc91d8727
|
updated lite
|
2023-09-20 14:28:55 +08:00 |
|
Concedo
|
c03409c1f6
|
grammar sampling added for lite
|
2023-09-19 00:13:30 +08:00 |
|
Concedo
|
0142760fc3
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# Makefile
# README.md
|
2023-09-18 23:20:02 +08:00 |
|
Concedo
|
8c453d1e4e
|
added grammar sampling
|
2023-09-18 23:02:00 +08:00 |
|
Cebtenzzre
|
8781013ef6
|
make : restore build-info.h dependency for several targets (#3205)
|
2023-09-18 10:03:53 -04:00 |
|
Concedo
|
951614bfc6
|
library unloading is working
|
2023-09-18 15:03:52 +08:00 |
|
Erik Scholz
|
7ddf185537
|
ci : switch cudatoolkit install on windows to networked (#3236)
|
2023-09-18 02:21:47 +02:00 |
|
Johannes Gäßler
|
ee66942d7e
|
CUDA: fix peer access logic (#3231)
|
2023-09-17 23:35:20 +02:00 |
|
Johannes Gäßler
|
111163e246
|
CUDA: enable peer access between devices (#2470)
|
2023-09-17 16:37:53 +02:00 |
|
Concedo
|
34930bfdc2
|
updated lite
|
2023-09-17 20:43:04 +08:00 |
|