list llama-swap under tools in README

Benson Wong 2025-01-01 12:59:19 -08:00
parent 0827b2c1da
commit 61037d7e6e
GPG key ID: 8C992B23151E99AF

@@ -192,6 +192,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
- [crashr/gppm](https://github.com/crashr/gppm) launch llama.cpp instances utilizing NVIDIA Tesla P40 or P100 GPUs with reduced idle power consumption
- [gpustack/gguf-parser](https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser) - review/check the GGUF file and estimate the memory usage
- [Styled Lines](https://marketplace.unity.com/packages/tools/generative-ai/styled-lines-llama-cpp-model-292902) (proprietary licensed, async wrapper of inference part for game development in Unity3d with pre-built Mobile and Web platform wrappers and a model example)
- [llama-swap](https://github.com/mostlygeek/llama-swap) - transparent proxy for automatic model switching with llama-server
</details>