Hello,
I would like to port a GCN-optimized 1.2 kernels to run on latest Intel GPUs.
Are there any general guidelines I should be following?
For example, on GCN, if work group size is smaller than 64 (wave front size), then it is possible to dispense with
memory barriers, since work items will never be executed more than one Compute Unit. Does this apply to Intel GPU?
Any other things to keep in mind?
Also, in terms of install base, what version of HD Graphics GPU is the most common in the field?
And which CPUs have these GPUs?
Thanks so much,
Aaron