vbatts/llama.cpp
2646 commits · 380 branches · 3056 tags · 365 MiB
Commit graph for branch gg/flash-attn-a (2 commits)
Author           SHA1        Message                                       Date
Georgi Gerganov  08e69c5008  cuda : adapt soft_max to F16 mask and pos     2024-03-28 19:40:11 +02:00
slaren           ae1f211ce2  cuda : refactor into multiple files (#6269)   2024-03-25 13:50:23 +01:00