Effectivly this plots seperate curves for the following strides, where N is the size of the stride in bytes and i is an integer: * `N = i * 64` * `N = i * 64 + 16` * `N = i * 64 + 32` * `N = i * 64 + 48`
|9 years ago|
|.gitignore||10 years ago|
|README.md||10 years ago|
|bandwidth.py||10 years ago|
|kernels.cl||10 years ago|
|runner.py||10 years ago|
|sweepMemSize.py||10 years ago|
|sweepStride.py||9 years ago|
This is a collection of OpenCL kernels that should be able to reproduce the global memory performance numbers given in the AMD Accelerated Parallel Processing OpenCL™ Programming Guide and NVIDIA's OpenCL Best Practices Guide. Those kernels are wrapped by python script handline all the boilerplate and the actual measurements.
There are multiple scripts to choose from. For each script you will get additional invocation options by invoking it with
bandwidth.py- Compare the bandwidth of multiple kernels for a given memory size.