Diego Devesa 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								c7499c557c 
								
							 
						 
						
							
							
								
								examples : do not use common library in simple example ( #9803 )  
							
							... 
							
							
							
							* examples : do not use common library in simple example
* add command line parser, simplify code 
							
						 
						
							2024-10-10 19:50:49 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Diego Devesa 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								c81f3bbb05 
								
							 
						 
						
							
							
								
								cmake : do not build common library by default when standalone ( #9804 )  
							
							
							
						 
						
							2024-10-09 18:49:52 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								e7022064ab 
								
							 
						 
						
							
							
								
								perplexity : fix integer overflow ( #9783 )  
							
							... 
							
							
							
							* perplexity : fix integer overflow
ggml-ci
* perplexity : keep n_vocab as int and make appropriate casts
ggml-ci 
							
						 
						
							2024-10-09 17:00:18 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								3dc48fe75a 
								
							 
						 
						
							
							
								
								examples : remove llama.vim  
							
							... 
							
							
							
							An updated version will be added in #9787  
							
						 
						
							2024-10-09 10:55:42 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Diego Devesa 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								dca1d4b58a 
								
							 
						 
						
							
							
								
								ggml : fix BLAS with unsupported types ( #9775 )  
							
							... 
							
							
							
							* ggml : do not use BLAS with types without to_float
* ggml : return pointer from ggml_internal_get_type_traits to avoid unnecessary copies
* ggml : rename ggml_internal_get_type_traits -> ggml_get_type_traits
it's not really internal if everybody uses it 
							
						 
						
							2024-10-08 14:21:43 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								458367a906 
								
							 
						 
						
							
							
								
								server : better security control for public deployments ( #9776 )  
							
							... 
							
							
							
							* server : more explicit endpoint access settings
* protect /props endpoint
* fix tests
* update server docs
* fix typo
* fix tests 
							
						 
						
							2024-10-08 13:27:04 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								f4b2dcdf49 
								
							 
						 
						
							
							
								
								readme : fix typo [no ci]  
							
							
							
						 
						
							2024-10-06 13:49:41 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8c475b97b8 
								
							 
						 
						
							
							
								
								rerank : use [SEP] token instead of [BOS] ( #9737 )  
							
							... 
							
							
							
							* rerank : use [SEP] token instead of [BOS]
ggml-ci
* common : sanity check for non-NULL tokens
ggml-ci
* ci : adjust rank score interval
ggml-ci
* ci : add shebang to run.sh
ggml-ci 
							
						 
						
							2024-10-05 15:55:04 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Daniel Kleine 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								133c7b46b3 
								
							 
						 
						
							
							
								
								Fixed RNG seed docs ( #9723 )  
							
							... 
							
							
							
							* Update README.md
fixed RNG seed info
* changed print format to unsigned 
							
						 
						
							2024-10-04 10:54:44 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Radoslav Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								841713e1e4 
								
							 
						 
						
							
							
								
								rpc : enable vulkan ( #9714 )  
							
							... 
							
							
							
							closes  #8536  
						
							2024-10-03 13:00:52 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Zhenwei Jin 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								76b37d1541 
								
							 
						 
						
							
							
								
								gguf-split : improve --split and --merge logic ( #9619 )  
							
							... 
							
							
							
							* make sure params --split and --merge are not specified at same time
* update gguf-split params parse logic
* Update examples/gguf-split/gguf-split.cpp
Co-authored-by: slaren <slarengh@gmail.com>
---------
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com> 
							
						 
						
							2024-10-02 10:21:57 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								148844fe97 
								
							 
						 
						
							
							
								
								examples : remove benchmark ( #9704 )  
							
							... 
							
							
							
							ggml-ci 
							
						 
						
							2024-10-02 10:14:44 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								cad341d889 
								
							 
						 
						
							
							
								
								metal : reduce command encoding overhead ( #9698 )  
							
							... 
							
							
							
							* metal : reduce command encoding overhead
ggml-ci
* metal : add comments 
							
						 
						
							2024-10-01 16:00:25 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									compilade 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								511636df0c 
								
							 
						 
						
							
							
								
								ci : reduce severity of unused Pyright ignore comments ( #9697 )  
							
							
							
						 
						
							2024-09-30 14:13:16 -04:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									vb 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								08a43d05b6 
								
							 
						 
						
							
							
								
								py : update transfomers version ( #9694 )  
							
							... 
							
							
							
							* update transfomers version.
* update hfh version. 
							
						 
						
							2024-09-30 18:03:47 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								f4d2b8846a 
								
							 
						 
						
							
							
								
								llama : add reranking support ( #9510 )  
							
							... 
							
							
							
							* py : add XLMRobertaForSequenceClassification [no ci]
* py : fix scalar-tensor conversion [no ci]
* py : fix position embeddings chop [no ci]
* llama : read new cls tensors [no ci]
* llama : add classigication head (wip) [no ci]
* llama : add "rank" pooling type
ggml-ci
* server : add rerank endpoint
ggml-ci
* llama : aboud ggml_repeat during classification
* rerank : cleanup + comments
* server : accept /rerank endpoint in addition to /v1/rerank [no ci]
* embedding : parse special tokens
* jina : support v1 reranker
* vocab : minor style
ggml-ci
* server : initiate tests for later
ggml-ci
* server : add docs
* llama : add comment [no ci]
* llama : fix uninitialized tensors
* ci : add rerank tests
ggml-ci
* add reranking test
* change test data
* Update examples/server/server.cpp
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
* add `--reranking` argument
* update server docs
* llama : fix comment [no ci]
ggml-ci
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com> 
							
						 
						
							2024-09-28 17:42:03 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Zhenwei Jin 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								6102037bbb 
								
							 
						 
						
							
							
								
								vocab : refactor tokenizer to reduce init overhead ( #9449 )  
							
							... 
							
							
							
							* refactor tokenizer
* llama : make llm_tokenizer more private
ggml-ci
* refactor tokenizer
* refactor tokenizer
* llama : make llm_tokenizer more private
ggml-ci
* remove unused files
* remove unused fileds to avoid unused filed build error
* avoid symbol link error
* Update src/llama.cpp
* Update src/llama.cpp
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2024-09-28 15:10:58 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								afbbfaa537 
								
							 
						 
						
							
							
								
								server : add more env vars, improve gen-docs ( #9635 )  
							
							... 
							
							
							
							* server : add more env vars, improve gen-docs
* update server docs
* LLAMA_ARG_NO_CONTEXT_SHIFT 
							
						 
						
							2024-09-25 14:05:13 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								cea1486ecf 
								
							 
						 
						
							
							
								
								log : add CONT level for continuing previous log entry ( #9610 )  
							
							
							
						 
						
							2024-09-24 10:15:35 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									StrangeBytesDev 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								0aa15011e3 
								
							 
						 
						
							
							
								
								server : add newline after chat example ( #9616 )  
							
							
							
						 
						
							2024-09-24 09:04:39 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								b0f27361f3 
								
							 
						 
						
							
							
								
								sampling : avoid expensive softmax during greedy sampling ( #9605 )  
							
							... 
							
							
							
							* sampling : avoid expensive softmax during greedy sampling
ggml-ci
* speculative : fix default RNG seed + set sparams.n_probs
* Update tests/test-sampling.cpp
Co-authored-by: slaren <slarengh@gmail.com>
* sampling : add clarifying comment [no ci]
---------
Co-authored-by: slaren <slarengh@gmail.com> 
							
						 
						
							2024-09-24 09:03:17 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								0b3bf966f4 
								
							 
						 
						
							
							
								
								server : add --no-context-shift option ( #9607 )  
							
							... 
							
							
							
							* server : add --no-context-shift option
* small fix
* Update examples/server/tests/features/embeddings.feature
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* tests : minor fix
* revert usage of GGML_ASSERT
* update server documentation
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2024-09-23 22:23:54 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								37f8c7b4c9 
								
							 
						 
						
							
							
								
								perplexity : remove extra new lines after chunks ( #9596 )  
							
							
							
						 
						
							2024-09-23 11:28:02 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									slaren 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								63351143b2 
								
							 
						 
						
							
							
								
								quantize : improve type name parsing ( #9570 )  
							
							... 
							
							
							
							quantize : do not ignore invalid types in arg parsing
quantize : ignore case of type and ftype arguments 
							
						 
						
							2024-09-20 20:55:36 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								d39e26741f 
								
							 
						 
						
							
							
								
								examples : flush log upon ctrl+c ( #9559 )  
							
							
							
						 
						
							2024-09-20 11:46:56 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Sigbjørn Skjæret 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								722ec1eb51 
								
							 
						 
						
							
							
								
								perplexity : do not escape input data by default ( #9548 )  
							
							
							
						 
						
							2024-09-20 09:38:10 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								6026da52d6 
								
							 
						 
						
							
							
								
								server : clean-up completed tasks from waiting list ( #9531 )  
							
							... 
							
							
							
							ggml-ci 
							
						 
						
							2024-09-19 12:44:53 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Sigbjørn Skjæret 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								eca0fab44e 
								
							 
						 
						
							
							
								
								imatrix : disable prompt escape by default ( #9543 )  
							
							
							
						 
						
							2024-09-19 10:58:14 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Vinesh Janarthanan 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8a308354f6 
								
							 
						 
						
							
							
								
								server : match OAI structured output response ( #9527 )  
							
							
							
						 
						
							2024-09-18 09:50:34 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Eric Zhang 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								f799155ab8 
								
							 
						 
						
							
							
								
								server : fix OpenSSL build (remove obsolete LOG_INFO) ( #9529 )  
							
							
							
						 
						
							2024-09-18 09:28:20 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Neo Zhang Jianyu 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								faf67b3de4 
								
							 
						 
						
							
							
								
								[SYCL]set context default value to avoid memory issue, update guide ( #9476 )  
							
							... 
							
							
							
							* set context default to avoid memory issue, update guide
* Update docs/backend/SYCL.md
Co-authored-by: Meng, Hengyu <hengyu.meng@intel.com>
---------
Co-authored-by: arthw <14088817+arthw@users.noreply.github.com>
Co-authored-by: Meng, Hengyu <hengyu.meng@intel.com> 
							
						 
						
							2024-09-18 08:30:31 +08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Michael Podvitskiy 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								7be099fa81 
								
							 
						 
						
							
							
								
								llama-bench: correct argument parsing error message ( #9524 )  
							
							
							
						 
						
							2024-09-17 22:41:38 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Bert Wagner 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8b836ae731 
								
							 
						 
						
							
							
								
								arg : add env variable for parallel ( #9513 )  
							
							... 
							
							
							
							* add env variable for parallel
* Update README.md with env:  LLAMA_ARG_N_PARALLEL 
							
						 
						
							2024-09-17 16:35:38 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Vinesh Janarthanan 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								441b72b91f 
								
							 
						 
						
							
							
								
								main : option to disable context shift ( #9484 )  
							
							... 
							
							
							
							* added cli arg to disable context shift
* reverted precommit
* updated README.md for main
* white space
* allow disabling context shift in the server
* Update common/arg.cpp
no-context-shift only works for main example
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* added server example to --no-context-shift args
* removed server changes
* white space
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2024-09-16 09:20:01 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								6262d13e0b 
								
							 
						 
						
							
							
								
								common : reimplement logging ( #9418 )  
							
							... 
							
							
							
							https://github.com/ggerganov/llama.cpp/pull/9418  
						
							2024-09-15 20:46:12 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									slaren 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								e6deac31f7 
								
							 
						 
						
							
							
								
								gguf-split : add basic checks ( #9499 )  
							
							... 
							
							
							
							* gguf-split : do not overwrite existing files when merging
* gguf-split : error when too many arguments are passed 
							
						 
						
							2024-09-15 19:02:27 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									VoidIsVoid 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								dcdcee3a74 
								
							 
						 
						
							
							
								
								server: add data: [DONE] to /chat/completions stream response ( #9459 )  
							
							
							
						 
						
							2024-09-14 11:36:44 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								feff4aa846 
								
							 
						 
						
							
							
								
								server : add loading html page while model is loading ( #9468 )  
							
							... 
							
							
							
							* Adding loading page for '/' server requests
* set content when model is loading
* removed loading html file
* updated cmakelist
* updated makefile
* cleaned up whitespace
* cleanup for PR removed error
* updated server test to handle 503 HTML
* updated server test to handle 503 HTML
* ca†ch 503 before parsing json
* revert test
* account for both api and web browser requests
* precommit corrections
* eol fix
* revert changes to pre-commit
* removed print statement
* made loading message more descriptive
* also support .html files
---------
Co-authored-by: VJHack <flymyplane21@gmail.com>
Co-authored-by: Vinesh Janarthanan <36610342+VJHack@users.noreply.github.com> 
							
						 
						
							2024-09-13 14:23:11 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								0abc6a2c25 
								
							 
						 
						
							
							
								
								llama : llama_perf + option to disable timings during decode ( #9355 )  
							
							... 
							
							
							
							* llama : llama_perf + option to disable timings during decode
ggml-ci
* common : add llama_arg
* Update src/llama.cpp
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
* perf : separate functions in the API
ggml-ci
* perf : safer pointer handling + naming update
ggml-ci
* minor : better local var name
* perf : abort on invalid sampler pointer
ggml-ci
---------
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com> 
							
						 
						
							2024-09-13 09:53:38 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Mathijs Henquet 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								78203641fe 
								
							 
						 
						
							
							
								
								server : Add option to return token pieces in /tokenize endpoint ( #9108 )  
							
							... 
							
							
							
							* server : added with_pieces functionality to /tokenize endpoint
* server : Add tokenize with pieces tests to server.feature
* Handle case if tokenizer splits along utf8 continuation bytes
* Add example of token splitting
* Remove trailing ws
* Fix trailing ws
* Maybe fix ci
* maybe this fix windows ci?
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co> 
							
						 
						
							2024-09-12 22:30:11 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									fengerhu1 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								e665744317 
								
							 
						 
						
							
							
								
								llava : fix the script error in MobileVLM README ( #9054 )  
							
							... 
							
							
							
							Signed-off-by: Erhu Feng <2748250768@qq.com> 
							
						 
						
							2024-09-12 14:34:22 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Michael Podvitskiy 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								ff76e18516 
								
							 
						 
						
							
							
								
								cmake : fixed the order of linking libraries for llama-quantize ( #9450 )  
							
							
							
						 
						
							2024-09-12 14:27:14 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								d6a04f872d 
								
							 
						 
						
							
							
								
								ggml : hide ggml_object, ggml_cgraph, ggml_hash_set ( #9408 )  
							
							... 
							
							
							
							* ggml : hide ggml_object, ggml_cgraph, ggml_hash_set
ggml-ci
* ggml : add ggml-impl.h to backends
* ggml : fix compiler warnings
ggml-ci
* ggml : add assert upon adding nodes 
							
						 
						
							2024-09-12 14:23:49 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Neo Zhang Jianyu 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								c9c8575a1a 
								
							 
						 
						
							
							
								
								enhance run script to be easy to change the parameters ( #9448 )  
							
							... 
							
							
							
							Co-authored-by: arthw <14088817+arthw@users.noreply.github.com> 
							
						 
						
							2024-09-12 17:44:17 +08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								0996c5597f 
								
							 
						 
						
							
							
								
								llava : correct args for minicpmv-cli ( #9429 )  
							
							
							
						 
						
							2024-09-11 12:59:13 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Georgi Gerganov 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								d2b496bff4 
								
							 
						 
						
							
							
								
								batched-bench : remove unused code ( #9305 )  
							
							
							
						 
						
							2024-09-11 10:03:54 +03:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									slaren 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								49006c67b4 
								
							 
						 
						
							
							
								
								llama : move random seed generation to the samplers ( #9398 )  
							
							... 
							
							
							
							* llama_sampler_penalties : clamp penalty_last_n to zero 
							
						 
						
							2024-09-10 18:04:25 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xuan Son Nguyen 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								bfe76d4a17 
								
							 
						 
						
							
							
								
								common : move arg parser code to arg.cpp ( #9388 )  
							
							... 
							
							
							
							* common : move arg parser to arg.cpp
* better categorize args
* add cmake
* missing climits
* missing cstdarg
* common : more explicit includes
* fix build
* refactor gpt_params_parse
* update server readme
* fix test
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 
							
						 
						
							2024-09-09 23:36:09 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									slaren 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								5fb5e24811 
								
							 
						 
						
							
							
								
								llama : minor sampling refactor (2) ( #9386 )  
							
							
							
						 
						
							2024-09-09 17:10:46 +02:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Antonis Makropoulos 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								5ed087573e 
								
							 
						 
						
							
							
								
								readme : add LLMUnity to UI projects ( #9381 )  
							
							... 
							
							
							
							* add LLMUnity to UI projects
* add newline to examples/rpc/README.md to fix editorconfig-checker unit test 
							
						 
						
							2024-09-09 14:21:38 +03:00