| 
								
								
									 Diego Devesa | 7cc2d2c889 | ggml : move AMX to the CPU backend (#10570) * ggml : move AMX to the CPU backend
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | 2024-11-29 21:54:58 +01:00 |  | 
				
					
						| 
								
								
									 Diego Devesa | 7eee341bee | common : use common_ prefix for common library functions (#9805) * common : use common_ prefix for common library functions
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | 2024-10-10 22:57:42 +02:00 |  | 
				
					
						| 
								
								
									 Xuan Son Nguyen | afbbfaa537 | server : add more env vars, improve gen-docs (#9635) * server : add more env vars, improve gen-docs
* update server docs
* LLAMA_ARG_NO_CONTEXT_SHIFT | 2024-09-25 14:05:13 +02:00 |  | 
				
					
						| 
								
								
									 Xuan Son Nguyen | bfe76d4a17 | common : move arg parser code to arg.cpp(#9388)* common : move arg parser to arg.cpp
* better categorize args
* add cmake
* missing climits
* missing cstdarg
* common : more explicit includes
* fix build
* refactor gpt_params_parse
* update server readme
* fix test
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | 2024-09-09 23:36:09 +02:00 |  | 
				
					
						| 
								
								
									 Xuan Son Nguyen | 1b9ae5189c | common : refactor arg parser (#9308) * (wip) argparser v3
* migrated
* add test
* handle env
* fix linux build
* add export-docs example
* fix build (2)
* skip build test-arg-parser on windows
* update server docs
* bring back missing --alias
* bring back --n-predict
* clarify test-arg-parser
* small correction
* add comments
* fix args with 2 values
* refine example-specific args
* no more lamba capture
Co-authored-by: slaren@users.noreply.github.com
* params.sparams
* optimize more
* export-docs --> gen-docs | 2024-09-07 20:43:51 +02:00 |  |