Add FA to Recent API Changes README.md

Promote new features
Jeximo 2024-05-05 12:59:01 -03:00 committed by GitHub
parent 628b299106
commit e49c581a67

@@ -10,6 +10,7 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
### Recent API changes
- [2024 Apr 30] Flash Attention implemented https://github.com/ggerganov/llama.cpp/pull/5021
- [2024 Apr 21] `llama_token_to_piece` can now optionally render special tokens https://github.com/ggerganov/llama.cpp/pull/6807
- [2024 Apr 4] State and session file functions reorganized under `llama_state_*` https://github.com/ggerganov/llama.cpp/pull/6341
- [2024 Mar 26] Logits and embeddings API updated for compactness https://github.com/ggerganov/llama.cpp/pull/6122