I write the OpenCL kernel code for HD Graphics.
I know each EU has 4-way SIMD FPU.
But, I don't know how to use SIMD operation.
Does kernel compiled for HD Graphics is vectorized automatically?
How to know whether the kernel code is vectorized.
When my kernel compiled with intel kernel builder to CPU, it is displayed "Kernel <~~> was successfully vectorized".
I think this code is compiled with vectorize.
But, compiled to GPU, it isn't display so.
Is this kernel code vectorized automatically if compiled for HD Graphics?
If kernel code isn't vectorized automatically, how to vectorize the code?