llama.cpp

Author	SHA1	Message	Date
at8u	ff05d05c96	miku.sh : add executable bit (#780)	2023-04-05 18:59:13 +03:00
Georgi Gerganov	62b3e81aae	media : add logos and banners	2023-04-05 18:58:31 +03:00
Georgi Gerganov	8d10406d6e	readme : change logo + add bindings + add uis + add wiki	2023-04-05 18:56:20 +03:00
iacore	ed1c214e66	zig : add build.zig (#773) Co-authored-by: Locria Cyber <74560659+locriacyber@users.noreply.github.com>	2023-04-05 18:06:02 +03:00
Ivan Stepanov	0c44427df1	make : missing host optimizations in CXXFLAGS (#763)	2023-04-05 17:38:37 +03:00
Adithya Balaji	594cc95fab	readme : update with CMake and windows example (#748) * README: Update with CMake and windows example * README: update with code-review for cmake build	2023-04-05 17:36:12 +03:00
at8u	88ed5761b8	examples : add Miku.sh (#724) * Add Miku.sh to examples * Add missing line to prompt in Miku.sh * Add --keep param to Miku.sh * Remove '[end_of_conversation]' line from Miku.sh No longer is necessary.	2023-04-05 17:32:42 +03:00
Andrew Duffy	58c438cf7d	Add Accelerate/BLAS when using Swift (#765)	2023-04-05 06:44:24 -04:00
Concedo	5c1920df43	why nobody ever told me the makefile doesnt work outside x86 xD	2023-04-05 17:15:42 +08:00
Concedo	1490cdd71d	change GPT-J and GPT2 KVs to use fp16 instead	2023-04-05 15:53:07 +08:00
Concedo	57e9f929ee	renamed misnamed ACCELERATE define, and removed all -march=native and -mtune=native flags	2023-04-05 15:22:13 +08:00
Concedo	14273fea7a	integrated gpt2 support	2023-04-04 23:15:47 +08:00
Concedo	52de932842	removed main.exe to reduce clutter, added support for rep pen in gptj	2023-04-04 20:43:13 +08:00
Concedo	9c0dbbb08b	Merge branch 'master' into concedo	2023-04-04 00:51:05 +08:00
Concedo	dd2abd8bc7	lower default thread threshold	2023-04-04 00:42:49 +08:00
mgroeber9110	53dbba7695	Windows: reactive sigint handler after each Ctrl-C (#736)	2023-04-03 18:00:55 +02:00
SebastianApel	437e77855a	10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (#654) * Performance improvement of AVX2 code * Fixed problem with MSVC compiler * Reviewer comments: removed double semicolon, deleted empty line 1962	2023-04-03 09:52:28 +02:00
Concedo	06c711d770	Merge branch 'master' into concedo # Conflicts: # .devops/full.Dockerfile # README.md	2023-04-03 15:10:08 +08:00
Concedo	eb5b22dda2	rebrand to koboldcpp	2023-04-03 10:35:18 +08:00
Ivan Stepanov	cd7fa95690	Define non-positive temperature behavior (#720)	2023-04-03 02:19:04 +02:00
bsilvereagle	a0c0516416	Remove torch GPU dependencies from the Docker.full image (#665) By using `pip install torch --index-url https://download.pytorch.org/whl/cpu` instead of `pip install torch` we can specify we want to install a CPU-only version of PyTorch without any GPU dependencies. This reduces the size of the Docker image from 7.32 GB to 1.62 GB	2023-04-03 00:13:03 +02:00
Concedo	8dd8ab1659	Various enhancement and integration pygmalion.cpp	2023-04-03 00:04:43 +08:00
Thatcher Chamberlin	d8d4e865cd	Add a missing step to the gpt4all instructions (#690) `migrate-ggml-2023-03-30-pr613.py` is needed to get gpt4all running.	2023-04-02 12:48:57 +02:00
Christian Falch	e986f94829	Added api for getting/setting the kv_cache (#685) The api provides access methods for retrieving the current memory buffer for the kv_cache and its token number. It also contains a method for setting the kv_cache from a memory buffer. This makes it possible to load/save history - maybe support --cache-prompt paramater as well? Co-authored-by: Pavol Rusnak <pavol@rusnak.io>	2023-04-02 12:23:04 +02:00
Marian Cepok	c0bb1d3ce2	ggml : change ne to int64_t (#626)	2023-04-02 13:21:31 +03:00
Concedo	3f4967b827	added new binaries	2023-04-02 17:14:38 +08:00
Concedo	bb965cc120	Merge branch 'master' into concedo # Conflicts: # README.md	2023-04-02 17:13:28 +08:00
Concedo	9aabb0d9db	massive refactor completed, GPT-J integrated	2023-04-02 17:03:30 +08:00
Leonardo Neumann	6e7801d08d	examples : add gpt4all script (#658)	2023-04-02 10:56:20 +03:00
Stephan Walter	81040f10aa	llama : do not allocate KV cache for "vocab_only == true" (#682) Fixes sanitizer CI	2023-04-02 10:18:53 +03:00
Fabian	c4f89d8d73	make : use -march=native -mtune=native on x86 (#609)	2023-04-02 10:17:05 +03:00
Murilo Santana	5b70e7de4c	fix default params for examples/main (#697)	2023-04-02 04:41:12 +02:00
Concedo	b1f08813e3	added support for gpt4all original format	2023-04-02 00:53:46 +08:00
Ikko Eltociear Ashimine	a717cba844	py: huggingface -> Hugging Face (#686)	2023-04-01 18:38:18 +02:00
rimoliga	d0a7f742e7	readme: replace termux links with homepage, play store is deprecated (#680)	2023-04-01 16:57:30 +02:00
Slaren	0d054e292e	Show error message when -f fails	2023-04-01 16:08:40 +02:00
Concedo	085a9f90a7	still refactoring	2023-04-01 11:56:34 +08:00
Concedo	6e6125ebdb	updated pyinstaller to clean temp dir,removed warning flags from makefile because they are just clutter.	2023-04-01 09:25:41 +08:00
Concedo	9ab6e87b58	Merge branch 'master' into concedo # Conflicts: # CMakeLists.txt	2023-04-01 09:05:45 +08:00
Concedo	801b178f2a	still refactoring, but need a checkpoint to prepare build for 1.0.7	2023-04-01 08:55:14 +08:00
Stephan Walter	3525899277	Enable -std= for cmake builds, fix warnings (#598)	2023-03-31 19:19:16 +00:00
Concedo	6b86f5ea22	halfway refactoring, wip adding other model types	2023-04-01 01:13:05 +08:00
slaren	1d08882afa	Optimize AVX2 ggml_vec_dot_q4_0 (#642)	2023-03-31 15:55:52 +00:00
perserk	02c5b27e91	Add AVX acceleration (#617) * ggml : add AVX quantize_row_q4_0() * ggml : add AVX ggml_vec_dot_q4_0() * ggml : refactor AVX part of ggml_vec_dot_q4_0() https://github.com/ggerganov/llama.cpp/pull/617#issuecomment-1489985645	2023-03-31 13:55:44 +02:00
Concedo	56949197fe	added HF converter base	2023-03-31 19:10:21 +08:00
Concedo	17044257a0	Merge branch 'master' into concedo	2023-03-31 19:04:47 +08:00
Concedo	559a1967f7	Backwards compatibility formats all done Merge branch 'master' into concedo # Conflicts: # CMakeLists.txt # README.md # llama.cpp	2023-03-31 19:01:33 +08:00
Concedo	9eab39fe6d	prepare legacy functions (+1 squashed commits) Squashed commits: [8bc8d0d] prepare for big merge	2023-03-31 17:45:49 +08:00
Pavol Rusnak	cbef542879	py : cleanup the code - use f-strings where possible - drop first param of encode/decode functions since "utf-8" is the default	2023-03-31 10:32:01 +02:00
Concedo	79f9743347	improved console info, fixed utf encoding bugs	2023-03-31 15:38:38 +08:00

479 commits