Olivier Chafik 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								1c641e6aac 
								
							 
						 
						
							
							
								
								build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )  
							
							... 
							
							
							
							* `main`/`server`: rename to `llama` / `llama-server` for consistency w/ homebrew
* server: update refs -> llama-server
gitignore llama-server
* server: simplify nix package
* main: update refs -> llama
fix examples/main ref
* main/server: fix targets
* update more names
* Update build.yml
* rm accidentally checked in bins
* update straggling refs
* Update .gitignore
* Update server-llm.sh
* main: target name -> llama-cli
* Prefix all example bins w/ llama-
* fix main refs
* rename {main->llama}-cmake-pkg binary
* prefix more cmake targets w/ llama-
* add/fix gbnf-validator subfolder to cmake
* sort cmake example subdirs
* rm bin files
* fix llama-lookup-* Makefile rules
* gitignore /llama-*
* rename Dockerfiles
* rename llama|main -> llama-cli; consistent RPM bin prefixes
* fix some missing -cli suffixes
* rename dockerfile w/ llama-cli
* rename(make): llama-baby-llama
* update dockerfile refs
* more llama-cli(.exe)
* fix test-eval-callback
* rename: llama-cli-cmake-pkg(.exe)
* address gbnf-validator unused fread warning (switched to C++ / ifstream)
* add two missing llama- prefixes
* Updating docs for eval-callback binary to use new `llama-` prefix.
* Updating a few lingering doc references for rename of main to llama-cli
* Updating `run-with-preset.py` to use new binary names.
Updating docs around `perplexity` binary rename.
* Updating documentation references for lookup-merge and export-lora
* Updating two small `main` references missed earlier in the finetune docs.
* Update apps.nix
* update grammar/README.md w/ new llama-* names
* update llama-rpc-server bin name + doc
* Revert "update llama-rpc-server bin name + doc"
This reverts commit e474ef1df4 
							
						 
						
							2024-06-13 00:41:52 +01:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Galunid 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								9c4c9cc83f 
								
							 
						 
						
							
							
								
								Move convert.py to examples/convert-legacy-llama.py ( #7430 )  
							
							... 
							
							
							
							* Move convert.py to examples/convert-no-torch.py
* Fix CI, scripts, readme files
* convert-no-torch -> convert-legacy-llama
* Move vocab thing to vocab.py
* Fix convert-no-torch -> convert-legacy-llama
* Fix lost convert.py in ci/run.sh
* Fix imports
* Fix gguf not imported correctly
* Fix flake8 complaints
* Fix check-requirements.sh
* Get rid of ADDED_TOKENS_FILE, FAST_TOKENIZER_FILE
* Review fixes 
							
						 
						
							2024-05-30 21:40:00 +10:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Juraj Bednar 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								3bd2c7ce1b 
								
							 
						 
						
							
							
								
								docker : add finetune option ( #4211 )  
							
							
							
						 
						
							2023-11-30 23:46:01 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Henri Vasserman 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								71d6975559 
								
							 
						 
						
							
							
								
								[Docker] fix tools.sh argument passing. ( #2884 )  
							
							... 
							
							
							
							* [Docker] fix tools.sh argument passing.
This should allow passing multiple arguments to containers with
the full image that are using the tools.sh frontend.
Fix from https://github.com/ggerganov/llama.cpp/issues/2535#issuecomment-1697091734  
							
						 
						
							2023-08-30 19:14:53 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Bodo Graumann 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								b782422a3e 
								
							 
						 
						
							
							
								
								devops : add missing quotes to bash script ( #2193 )  
							
							... 
							
							
							
							This prevents accidentally expanding arguments that contain spaces. 
							
						 
						
							2023-07-13 16:49:14 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jinwoo Jeong 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								3ec7e596b2 
								
							 
						 
						
							
							
								
								docker : add '--server' option ( #2174 )  
							
							
							
						 
						
							2023-07-11 19:12:35 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Jiří Podivín 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								b5c85468a3 
								
							 
						 
						
							
							
								
								Docker: change to calling convert.py ( #1641 )  
							
							... 
							
							
							
							Deprecation disclaimer was added to convert-pth-to-ggml.py 
							
						 
						
							2023-06-03 15:11:53 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Pavol Rusnak 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								859fee6dfb 
								
							 
						 
						
							
							
								
								quantize : use map to assign quantization type from string ( #1191 )  
							
							... 
							
							
							
							instead of `int` (while `int` option still being supported)
This allows the following usage:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin q4_0`
instead of:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin 2` 
							
						 
						
							2023-04-26 18:43:27 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								4cc053b6d5 
								
							 
						 
						
							
							
								
								Remove oboslete command from Docker script  
							
							
							
						 
						
							2023-03-23 22:39:44 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Stephan Walter 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								367946c668 
								
							 
						 
						
							
							
								
								Don't tell users to use a bad number of threads ( #243 )  
							
							... 
							
							
							
							The readme tells people to use the command line option "-t 8", causing 8
threads to be started. On systems with fewer than 8 cores, this causes a
significant slowdown. Remove the option from the example command lines
and use /proc/cpuinfo on Linux to determine a sensible default. 
							
						 
						
							2023-03-17 19:47:35 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Bernat Vadell 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								2af23d3043 
								
							 
						 
						
							
							
								
								🚀  Dockerize llamacpp ( #132 )  
							
							... 
							
							
							
							* feat: dockerize llamacpp
* feat: split build & runtime stages
* split dockerfile into main & tools
* add quantize into tool docker image
* Update .devops/tools.sh
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* add docker action pipeline
* change CI to publish at github docker registry
* fix name runs-on macOS-latest is macos-latest (lowercase)
* include docker versioned images
* fix github action docker
* fix docker.yml
* feat: include all-in-one command tool & update readme.md
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2023-03-17 10:47:06 +01:00