Update ggml/src/ggml-cuda/mma.cuh
Co-authored-by: Diego Devesa <slarengh@gmail.com>
parent 37910e42ef
commit 51670bd43e
1 changed files with 1 additions and 1 deletions
@@ -11,7 +11,7 @@
 // The functions get_i, get_j, and get_k can be used to get the physical 32 bit index of the lth element of a thread within a tile.
 // All matrix tiles have ne physical 32 bit elements per warp.
 //
-// As desribted in the documentation, all pointers for load_ldmatrix must be to shared memory and aligned to 16 bytes.
+// As described in the documentation, all pointers for load_ldmatrix must be to shared memory and aligned to 16 bytes.
 
 #include "common.cuh"
 