Hi
1. I have a global work size of 1024 by 1024.
2. I set the local work size to 16 by 16.
3. My CPU opnecl device has a maximum work-group-size of 8192.
4. I call clEnqueueNDRangeKernel with the desired local-work-size (along with all other necessary parameters)
5. I call:
a. clGetKernelWorkGroupInfo(kernel, device, CL_KERNEL_WORK_GROUP_SIZE, sizeof(size_t), (void*)&workGroupSizeUsed, NULL);
b. clGetKernelWorkGroupInfo(kernel, device, CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE, sizeof(size_t), (void*)&workGroupSizeUsed, NULL);
6. Both calls return 8192. How is this possible?
My expectation is 16 - the value that I passed to it.
Any help?