Hi all,
I am curious about the CPU implementation of OpenCL for Intel processors.
I run a small set of benchmark from clpeak on a i7-4770S (4 cores, hyperthreading enabled) under linux.
it shows the CPU utilization can achieve almost 800% (using top), meaning all CPU resource are utilized.
However, when I run the benchmark in clpeak individually, it shows maximum 400%.
Run benchmark consecutively can benefit from OpenCL runtime.
Is that mean when a workload is issued to OpenCL CPU runtime, it will not all of the cores but part of them.
Besides, is OpenCL CPU runtime using SIMD to execute consecutive workitems?
Appreciate in advance!
Best,
Biao