Transpose: Change TILE_DIM to 32 to fix bank conflicts

Fixes #175
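
Why 32 helps, as a rough host-side sketch rather than the sample's actual kernel: it assumes 32 four-byte shared-memory banks, 32 threads per warp, a float tile declared as tile[TILE_DIM][TILE_DIM + 1] (the declaration is not part of this diff), a thread block TILE_DIM threads wide, and a transposed read in which thread (x, y) touches tile[x][y]. The helper name max_bank_conflict is hypothetical.

```cpp
#include <algorithm>
#include <array>

// Assumptions (not taken from the diff): 32 banks of 4-byte words, 32-thread warps.
constexpr int kBanks = 32;
constexpr int kWarpSize = 32;

// Hypothetical helper: for the first warp of a block that is tile_dim threads
// wide, count how many threads land in the busiest bank when thread (x, y)
// reads tile[x][y] from a float array declared as tile[tile_dim][tile_dim + 1].
int max_bank_conflict(int tile_dim) {
    const int pitch = tile_dim + 1;           // row length in 4-byte words
    std::array<int, kBanks> hits{};           // per-bank access counts, zeroed
    for (int lane = 0; lane < kWarpSize; ++lane) {
        const int x = lane % tile_dim;        // threadIdx.x
        const int y = lane / tile_dim;        // threadIdx.y
        const int word = x * pitch + y;       // word offset of tile[x][y]
        ++hits[word % kBanks];
    }
    return *std::max_element(hits.begin(), hits.end());
}
```

Under these assumptions, TILE_DIM = 16 puts two rows of 16 threads in one warp, and threads (0, 0) and (15, 1) both map to bank 0, so the helper reports a 2-way conflict. With TILE_DIM = 32 a warp is a single row, word offsets are 33 * x, and every thread gets its own bank.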
Jonathan Bentz 2025-02-20 17:46:44 -06:00 committed by GitHub
parent 8d564d5e3a
commit efb46383e0


@@ -53,7 +53,7 @@ const char *sSDKsample = "Transpose";
 // TILE_DIM/BLOCK_ROWS elements. TILE_DIM must be an integral multiple of
 // BLOCK_ROWS
-#define TILE_DIM 16
+#define TILE_DIM 32
 #define BLOCK_ROWS 16
 // This sample assumes that MATRIX_SIZE_X = MATRIX_SIZE_Y