Concedo
f39def81d4
Update readme with more info
2023-04-18 21:44:26 +08:00
Concedo
3614956bc7
update readme
2023-04-18 21:39:05 +08:00
Concedo
ea01771dd5
rwkv is done
2023-04-18 20:55:01 +08:00
Concedo
a76b15b581
Merge branch 'concedo' into concedo_experimental
...
# Conflicts:
# make_pyinstaller.bat
2023-04-18 17:42:43 +08:00
Gustavo Rocha Dias
ed5b5c45a9
doc - enhanced readme explaining how to compile on Windows. ( #80 )
2023-04-18 17:40:04 +08:00
Gustavo Rocha Dias
a9253cdfba
fix - on some OSs the PyInstaller command is case-sensitive; in lowercase it doesn't work. ( #81 )
2023-04-18 17:39:06 +08:00
Concedo
ac61e34d5f
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# CMakeLists.txt
# README.md
2023-04-18 17:38:10 +08:00
Concedo
c200b674f4
updated kobold lite, work on rwkv, added exe path to model load params, added launch parameter
2023-04-18 17:36:44 +08:00
Ivan Komarov
42747220b4
Do not close file after mmap (Windows version) ( #1034 )
2023-04-18 03:15:50 +02:00
Atsushi Tatsuma
e9298af389
readme : add Ruby bindings ( #1029 )
2023-04-17 22:34:35 +03:00
Cameron
4ad73137a1
add 4_0 to default outfile namestr dict ( #1031 )
...
this came up when trying to convert the gpt4all-lora-unfiltered-quantized.bin file
2023-04-17 20:26:23 +02:00
slaren
315a95a4d3
Add LoRA support ( #820 )
2023-04-17 17:28:55 +02:00
Arik Poznanski
efd05648c8
llama : well-defined static initialization of complex objects ( #927 )
...
* Replaced static initialization of complex objects with an initialization on first use. This prevents undefined behavior at program run, for example a crash in a Release build while the Debug build works
* Replaced use of auto with the exact type to avoid requiring -std=c++14
* Made the accessor functions for static maps static const
2023-04-17 17:41:53 +03:00
Georgi Gerganov
eb17a026fd
quantize-stats : fix bug in --type argument
2023-04-17 17:31:06 +03:00
Concedo
8e923dc6e9
updated kobold lite
2023-04-17 21:33:57 +08:00
Georgi Gerganov
69b740289f
ggml : avoid using ggml_fp16_to_fp32() and ggml_fp32_to_fp16() in ggml.c
2023-04-17 16:16:23 +03:00
Ivan Komarov
f266259ad9
Speedup the AVX-512 implementation of ggml_vec_dot_q4_0() ( #933 )
2023-04-17 15:10:57 +02:00
Concedo
1f4a69c051
version number api
2023-04-17 19:31:15 +08:00
Concedo
364e2736c9
Merge branch 'master' into concedo
2023-04-17 17:34:50 +08:00
Concedo
763ad172c0
arranged files, updated kobold lite, modified makefile for extra link args on linux, started RWKV implementation
2023-04-17 17:31:45 +08:00
slaren
47f61aaa5f
Fix: do not close file on mmap ( #1017 )
2023-04-16 21:27:38 +02:00
Concedo
9581171a9f
updated embedded lite again
2023-04-16 22:42:51 +08:00
Concedo
bee6a401fd
slight clarity fix
2023-04-16 22:04:19 +08:00
Concedo
96fb12cfa2
Merge branch 'master' into concedo
2023-04-16 21:59:05 +08:00
Concedo
c757fbee1d
fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite
2023-04-16 21:54:18 +08:00
Concedo
6548d3b3fb
Added prints for stopping sequences, made makefile 1% friendlier to arch linux users
2023-04-16 20:43:17 +08:00
Georgi Gerganov
3173a62eb9
stdout : vertical align outputs for better readability
2023-04-16 13:59:27 +03:00
Concedo
525184930d
added a kobold API compatible implementation of stopping sequences
2023-04-16 18:37:49 +08:00
Pavol Rusnak
489537e6cf
examples: add missing <ctime> include for time() ( #1011 )
2023-04-16 10:13:00 +00:00
nanahi
2d3481c721
Fix msys2 build error and warnings ( #1009 )
2023-04-16 11:13:42 +02:00
Concedo
8bf2e50a11
converted the cl file to be a string literal instead
2023-04-16 15:57:30 +08:00
Concedo
5a4d1b5d15
Merge branch 'master' into concedo
...
# Conflicts:
# CMakeLists.txt
# Makefile
2023-04-16 14:08:23 +08:00
comex
74f5899df4
convert.py: Fix loading safetensors and ggml format on Windows ( #991 )
...
Calling `mmap.mmap` on Windows apparently resets the file offset of the
raw file object (and makes the BufferedReader return a *negative* file
offset). For safetensors, avoid using the file offset after calling
mmap. For GGML format, explicitly save and restore the offset.
Fixes #966.
2023-04-15 23:53:21 +02:00
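The save-and-restore approach described in the convert.py commit above can be sketched roughly as follows. This is an illustration only, not the code from that commit: the helper name `mmap_preserving_offset`, the `demo` function, and the 4-byte-header file layout are all hypothetical.

```python
import mmap
import os
import tempfile


def mmap_preserving_offset(fp):
    """Map the whole file read-only and put fp's offset back afterwards.

    Hypothetical helper: it records the offset before calling mmap.mmap
    and restores it afterwards, so later reads do not depend on whatever
    the platform did to the file position.
    """
    saved = fp.tell()  # remember where the reader currently is
    mapped = mmap.mmap(fp.fileno(), 0, access=mmap.ACCESS_READ)
    fp.seek(saved)  # restore the offset explicitly
    return mapped


def demo():
    # Assumed layout for the example: a 4-byte header followed by a payload.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(b"HEADpayload")
        path = tmp.name
    try:
        with open(path, "rb") as fp:
            header = fp.read(4)
            mapped = mmap_preserving_offset(fp)
            try:
                offset = fp.tell()  # still points just past the header
                payload = bytes(mapped[offset:])
                return header, offset, payload
            finally:
                mapped.close()
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

Restoring the offset unconditionally keeps the caller's logic identical on every platform, whether or not the mmap call disturbed the file position.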
Stephan Walter
2f7c8e014e
Fix potential int8 overflow in non-SIMD vec_dot ( #986 )
2023-04-15 18:28:56 +00:00
Concedo
ad5676810a
merge CLBlast improvements - GPU dequant
2023-04-16 01:17:40 +08:00
Concedo
3e992eabb4
Merge remote-tracking branch 'occam/clblast-gpu-dequant' into concedo
2023-04-16 00:26:54 +08:00
Stephan Walter
0ad964631f
Refactor ggml.c for future tensor types ( #1001 )
2023-04-15 16:25:38 +00:00
Concedo
3eb1c1850e
accept non positional model arg
2023-04-16 00:23:07 +08:00
0cc4m
57d046eeb6
Enable dequantization on GPU for ClBlast
2023-04-15 18:04:24 +02:00
Georgi Gerganov
e95b6554b4
ggml : add Q8_0 quantization for intermediate results ( #951 )
...
* ggml : add Q8_0 quantization for intermediate results
* quantize-stats : fix test + add it to Makefile default
* Q8: use int8_t, AVX/AVX2 optimizations
* ggml : fix quantize_row_q8_0() ARM_NEON rounding
* minor : updates after rebase to latest master
* quantize-stats : delete obsolete strings
* ggml : fix q4_1 dot func
---------
Co-authored-by: Stephan Walter <stephan@walter.name>
2023-04-15 17:53:22 +03:00
Georgi Gerganov
aa485cee33
ggml : use posix_memalign on non-Windows env
2023-04-15 14:25:45 +03:00
0cc4m
8fbfc80e03
Fix clblast device selection on Linux
2023-04-15 12:02:36 +02:00
Ivan Komarov
c12b14b77f
benchmark : fix result validation in benchmark-q4_0-matmult ( #987 )
2023-04-15 08:51:54 +03:00
katsu560
106faaf297
cmake : add finding the OpenBLAS header file ( #992 )
2023-04-15 08:51:11 +03:00
Concedo
d00b865eb1
Merge branch 'master' into concedo
...
# Conflicts:
# .devops/full.Dockerfile
# Makefile
# flake.nix
2023-04-15 11:33:43 +08:00
Pavol Rusnak
c85e03d12e
Revert "main : alternative instruct mode (Vicuna support, etc.) ( #863 )" ( #982 )
...
This reverts commit f4d277ae17.
2023-04-14 22:58:43 +03:00
Pavol Rusnak
489093548c
py : bump sentencepiece to 0.1.98 to support Python 3.11 ( #976 )
2023-04-14 19:46:49 +00:00
Stephan Walter
93265e988a
make : fix dependencies, use auto variables ( #983 )
2023-04-14 22:39:48 +03:00
Pavol Rusnak
c56b715269
Expose type name from ggml ( #970 )
...
Avoid duplication of type names in utils
Co-authored-by: Håkon H. Hitland <haakon@likedan.net>
2023-04-14 20:05:37 +02:00
Concedo
ea5d01002f
Merge branch 'concedo' of https://github.com/LostRuins/llamacpp-for-kobold into concedo
2023-04-15 01:14:10 +08:00