Commit graph

2173 commits

Author SHA1 Message Date
Concedo
6a821b268a improved SSE streamiing 2023-09-28 17:33:34 +08:00
Concedo
38d4c6cedd updated lite 2023-09-27 16:06:17 +08:00
Concedo
cf31658cbf added a flag to keep console in foreground 2023-09-27 01:53:30 +08:00
Concedo
74edc401c1 Merge branch 'master' into concedo_experimental
# Conflicts:
#	CMakeLists.txt
#	README.md
#	flake.nix
#	scripts/build-info.cmake
#	scripts/verify-checksum-models.py
2023-09-27 01:30:15 +08:00
Concedo
eb86cd4027 bump token limits 2023-09-27 01:26:00 +08:00
Concedo
8bf6f7f8b0 added simulated OAI endpoint 2023-09-27 00:49:24 +08:00
Concedo
7f112e2cd4 support genkeys in polled streaming 2023-09-26 23:46:07 +08:00
DAN™
99115f3fa6
cmake : fix build-info.h on MSVC (#3309) 2023-09-25 18:45:33 -04:00
2f38b454
1726f9626f
docs: Fix typo CLBlast_DIR var. (#3330) 2023-09-25 20:24:52 +02:00
Concedo
6c2134a860 improved makefile, allowing building without k quants 2023-09-25 22:10:47 +08:00
Erik Scholz
a98b1633d5
nix : add cuda, use a symlinked toolkit for cmake (#3202) 2023-09-25 13:48:30 +02:00
Concedo
17ee719c56 improved remotelink cmd, fixed lib unload, updated class.py 2023-09-25 17:50:00 +08:00
Concedo
fdadbd0fbb updated lite (+1 squashed commits)
Squashed commits:

[b4408c79] updated lite
2023-09-24 23:07:37 +08:00
Concedo
8ecf505d5d improved embedded horde worker (+2 squashed commit)
Squashed commit:

[99234379] improved embedded horde worker

[ebcd1968] update lite
2023-09-24 15:16:49 +08:00
slaren
c091cdfb24
llama-bench : add README (#3317)
* llama-bench : add README

* minor edit
2023-09-23 21:48:24 +02:00
Concedo
32cf02487e colab use mmq, update lite and ver 2023-09-23 23:32:00 +08:00
Cebtenzzre
51a7cf5c6e
examples : fix RoPE defaults to match PR #3240 (#3315) 2023-09-23 12:28:50 +03:00
Concedo
60098a176b update colab model 2023-09-23 16:30:40 +08:00
Concedo
bfc696fcc4 update lite, update ver 2023-09-23 12:35:23 +08:00
Kevin Ji
bedb92b603
scripts : use /usr/bin/env in shebang (#3313) 2023-09-22 23:52:23 -04:00
Concedo
bd2500db36 Merge branch 'master' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	README.md
#	build.zig
#	flake.nix
2023-09-23 10:51:34 +08:00
Concedo
a64d182b8b sched yield fix again 2023-09-23 10:44:41 +08:00
Concedo
1f9e36c733 minor lite fixes 2023-09-23 09:37:49 +08:00
Concedo
de4e27904d clear reader copy on new gen 2023-09-23 00:13:19 +08:00
Lee Drake
bc9d3e3971
Update README.md (#3289)
* Update README.md

* Update README.md

Co-authored-by: slaren <slarengh@gmail.com>

---------

Co-authored-by: slaren <slarengh@gmail.com>
2023-09-21 21:00:24 +02:00
shibe2
36b904e200
ggml-opencl.cpp: Make private functions static (#3300) 2023-09-21 14:10:26 -04:00
Concedo
14295922f9 updated ver, updated lite (+1 squashed commits)
Squashed commits:

[891291bc] updated lite to v67
2023-09-21 17:44:01 +08:00
Edward Taylor
324f3403d5
zig : fix for updated c lib (#3259) 2023-09-21 12:08:20 +03:00
yuiseki
f56c418ab0
embedding : update README.md (#3224) 2023-09-21 11:57:40 +03:00
Johannes Gäßler
8185710a80
CUDA: use only 1 thread if fully offloaded (#2915) 2023-09-21 11:43:53 +03:00
Georgi Gerganov
7eb41179ed
readme : update hot topics 2023-09-20 20:48:22 +03:00
Cebtenzzre
a5661d7e71
llama : allow gguf RoPE keys to be overridden with defaults (#3240) 2023-09-20 12:12:47 -04:00
Cebtenzzre
65c2c1c5ab
benchmark-matmult : do not use integer abs() on a float (#3277) 2023-09-20 12:06:08 -04:00
Concedo
2dda63a4eb add tensor split field 2023-09-20 22:46:47 +08:00
kang
80834daecf
flake : Restore default package's buildInputs (#3262) 2023-09-20 15:48:22 +02:00
Concedo
712b8423f6 class.py changes 2023-09-20 21:27:49 +08:00
Concedo
b63cf223c9 add queue info 2023-09-20 21:07:21 +08:00
Concedo
0eb52cf6c2 Merge branch 'master' into concedo_experimental
# Conflicts:
#	Makefile
2023-09-20 21:01:34 +08:00
Concedo
006e87cb56 requirements txt 2023-09-20 21:00:23 +08:00
Alon
a40f2b656f
CI: FreeBSD fix (#3258)
* - freebsd ci: use qemu
2023-09-20 14:06:36 +02:00
Concedo
4a0c515da7 rename notepad to classic 2023-09-20 17:51:02 +08:00
Concedo
436cd474cd regex fix 2023-09-20 16:02:19 +08:00
Georgi Gerganov
d119c04c15
examples : fix benchmark-matmult (#1554)
The precision for Q4_0 has degraded since #1508
2023-09-20 10:02:39 +03:00
Concedo
2fc91d8727 updated lite 2023-09-20 14:28:55 +08:00
Concedo
c03409c1f6 grammar sampling added for lite 2023-09-19 00:13:30 +08:00
Concedo
0142760fc3 Merge branch 'master' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	Makefile
#	README.md
2023-09-18 23:20:02 +08:00
Concedo
8c453d1e4e added grammar sampling 2023-09-18 23:02:00 +08:00
Cebtenzzre
8781013ef6
make : restore build-info.h dependency for several targets (#3205) 2023-09-18 10:03:53 -04:00
Concedo
951614bfc6 library unloading is working 2023-09-18 15:03:52 +08:00
Erik Scholz
7ddf185537
ci : switch cudatoolkit install on windows to networked (#3236) 2023-09-18 02:21:47 +02:00