Andrea Gussoni a06391e685 Implemented device selection for lud 8 anni fa
..
README 1f1ceb7172 Initialized repo with Rodinia 3.1 8 anni fa
lud.cpp a06391e685 Implemented device selection for lud 7 anni fa
lud_kernel.cl 1f1ceb7172 Initialized repo with Rodinia 3.1 8 anni fa
makefile 1f1ceb7172 Initialized repo with Rodinia 3.1 8 anni fa
run 1f1ceb7172 Initialized repo with Rodinia 3.1 8 anni fa
run-cpu a06391e685 Implemented device selection for lud 7 anni fa
run-gpu a06391e685 Implemented device selection for lud 7 anni fa

README

The dimension of input matrix should be multiple size of block size.

******Adjustable work group size*****
The kernel has square shape
RD_WG_SIZE_0 or RD_WG_SIZE_0_0 describe one dimension
The actually dimension = RD_WG_SIZE_0 * RD_WG_SIZE_0

USAGE:
make clean
make KERNEL_DIM="-DRD_WG_SIZE_0=16"