CUDA: faster FlashAttention, kernel for bs == 1

Johannes Gäßler 2024-03-29 23:02:39 +01:00
parent 08e69c5008
commit 912a6aa9b1

File diff suppressed because it is too large.