Running Aparapi Kernels on CPU
Hi, I'm trying to execute some aparapi kernel on CPU mode, but even after installing the Intel OpenCL SDK I'm still can't use the CPU mode, always switching to JTP with the following message:WARNING:...
View ArticleIntel SDK for OpenCL Applications 2016 R3 for windows setup wizard ended...
When installing intel_sdk_for_opencl_setup_6.3.0.1904, I encountered some error attached in the file. What can i do to solve this problem?Thanks for your advise.CPU: Inter core i7-3770IDE: visual...
View ArticleExecution of sparse matrix-vector failed
I'm using a new i7-7700k and the execution of this particular OpenCL test doesn't work. You can see the error in my attached screenshot. This happens with both CPU and GPU using newest drivers. AMD and...
View ArticleWhen the debugger on Mac OS available?
Hi there: I don't know if anyone else has the same feeling that developing on mac is much more difficult, because there is no debugger to analyze my OpenCL program. Sometimes the behavior is...
View ArticleQuestions about kernel instances & the analysis methods for kernel code
4Hi there & every expert: Sorry for bother! I have a question about the specification named <The Compute Architecture of Intel® Processor Graphics Gen7.5>, because the iris pro 5200 is...
View ArticleWhy do small changes make in kernel code wrong results?
The following kernel (my_kernel()) which is written based on my_function() calculates wrong results on Intel GPU. The same code works well on Intel CPU and AMD GPU platform. If you change the type of...
View ArticleDoes Intel HD graphics 530 support SYCL or SPIR to pass classes to kernel?
own votefavorI have been working on an c/c++ OpenCL solution for the past few weeks now. For my solution, I need to pass a class from my CPU(Host) to GPU(Device). When I try to pass the class as an...
View Articlestrange behavior when float16 are used and the meaning of thread idle
Hi OpenCL experts: I saw a sentence "Thread dispatch serialization becomes a gating factor when a kernel has insufficient work per a work-item." in page 6 of the paper named <Intel® VTune™...
View ArticleArticles Index -- check here for the latest guides and info
Guide to reporting issues: https://software.intel.com/en-us/articles/ocl-issue-reporting-guideGetting started guides:Linux: https://software.intel.com/en-us/articles/sdk-for-opencl-gsgWindows:...
View Articledot product kernel doesn't work on CPUs
Hi,I'm new to OpenCL and I have implemented a program to compute the dot product. The program works as expected if I use a GPU and it returns a wrong result if I use a CPU with more than one work-item...
View Articlestrange behavior when float16 are used and the meaning of thread idle
Hi OpenCL experts: I saw a sentence "Thread dispatch serialization becomes a gating factor when a kernel has insufficient work per a work-item." in page 6 of the paper named <Intel® VTune™...
View ArticleCode Builder Platform Info Properties typo
The "SPIR version" and "SPIR-V version" fields are reversed:Core i7-6802HQ with CPU 2.1, CPU 2.0 and HD 530 devices.
View ArticleDoes the Kaby Lake OpenCL 2.1 HD Graphics driver support SPIR-V?
I see that Kaby Lake IGPs have OpenCL 2.1 support.Does this imply that SPIR-V is supported on that platform?Is there any plan to bring 2.1 support to GEN9 and GEN8?
View ArticleIs there any GEN-friendly idiom for communicating subgroup uniformity?
It seems to me that GEN might benefit more from detecting "subgroup uniform" values than other architectures because of its unique register file architecture and instruction set.Are there are any GEN...
View ArticleGPU HD4600 OpenCL SPIR compilation problem
I use OpenCL SPIR for two algorithms, each one has templated functions for uchar, ushort, uint, float and double (templates are implemented via defines/includes). So I have two SPIR binaries, each...
View ArticleWhy is it needed to use CL_MEM_ALLOC_HOST_PTR | CL_MEM_COPY_HOST_PTR instead...
In this article there are recommendations about how to use OpenCL properly to achieve zero copy behavior when using Intel HD Graphics. In particular, there is a recommendation to use...
View ArticleSVM Trouble
Hello, everyone!I've some problem with Shared Virtual Memory and its buffers.I've got structure like this:typedef struct ListOfPtr ListOfPtr; struct ListOfPtr { cl_int some_value; ListOfPtr *first;...
View ArticleOpenCL stall on Apollo Lake GPU
SummaryWhen I run my app and select the GPU OpenCL device, the feeder thread stalls inside a blocking call to clEnqueueMapBuffer(). PreambleBuild: Yocto from the Apollo Lake BSP release gold,...
View Articlebuiltin workgroup reduction performance
Hi,I'm working on writing a global reduction in OpenCL 2.0. I started with the implementation from CLOGS:https://sourceforge.net/p/clogs/wiki/Home/Essentially, the approach is just a series of...
View ArticleHow can I reduce start latencies with OpenCL on the GPU?
I'm evaluating an Intel platform for an embedded real-time processor in our systems. Our application uses OpenCL to prcoess incoming data on a very short cycle in real-time. It is critical to the...
View Article