llamafile : improve sgemm.cpp (#6796)

* llamafile : improve sgemm.cpp

- Re-enable by default
- Fix issue described in #6716
- Make code more abstract, elegant, and maintainable
- Faster handling of weirdly shaped `m` an `n` edge cases

* Address review comments

* Help clang produce fma instructions

* Address review comments
This commit is contained in:
Justine Tunney 2024-04-22 15:00:36 -04:00 committed by GitHub
parent e931888d50
commit 192090bae4
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
4 changed files with 406 additions and 567 deletions

945
sgemm.cpp

File diff suppressed because it is too large Load diff