docs: introduce gpustack and gguf-parser (#8873)
* readme: introduce gpustack

GPUStack is an open-source GPU cluster manager for running large language models, which uses llama.cpp as the backend.

Signed-off-by: thxCode <thxcode0824@gmail.com>

* readme: introduce gguf-parser

GGUF Parser is a tool to review/check the GGUF file and estimate the memory usage without downloading the whole model.

Signed-off-by: thxCode <thxcode0824@gmail.com>

---------

Signed-off-by: thxCode <thxcode0824@gmail.com>
parent 1262e7ed13
commit 84eb2f4fad
1 changed files with 2 additions and 0 deletions

@@ -186,10 +186,12 @@ Unless otherwise noted these projects are open-source with permissive licensing:
 
 - [akx/ggify](https://github.com/akx/ggify) – download PyTorch models from HuggingFace Hub and convert them to GGML
 - [crashr/gppm](https://github.com/crashr/gppm) – launch llama.cpp instances utilizing NVIDIA Tesla P40 or P100 GPUs with reduced idle power consumption
+- [gpustack/gguf-parser](https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser) - review/check the GGUF file and estimate the memory usage
 
 **Infrastructure:**
 
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
+- [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
 
 **Games:**
 - [Lucy's Labyrinth](https://github.com/MorganRO8/Lucys_Labyrinth) - A simple maze game where agents controlled by an AI model will try to trick you.