README 301 B

12345678910
  1. The dimension of input matrix should be multiple size of block size.
  2. ******Adjustable work group size*****
  3. The kernel has square shape
  4. RD_WG_SIZE_0 or RD_WG_SIZE_0_0 describe one dimension
  5. The actually dimension = RD_WG_SIZE_0 * RD_WG_SIZE_0
  6. USAGE:
  7. make clean
  8. make KERNEL_DIM="-DRD_WG_SIZE_0=16"