Channel: Intel® Software - OpenCL*

New 24.20.100.6094 Win10 driver performance regression from .6025

My suite of kernel binaries takes almost twice as long to execute when compiled with the .6094 driver on Win10/x64 as when compiled with .6025.

Compiling on .6025 and executing on .6094 shows no regression.

Compiling on .6094 and executing on .6094 or .6025 shows the huge performance drop.

Furthermore, the .6094 driver has re-enabled support for dumping GEN assembly via the IOC64 -asm switch.

Inspection of the assembly produced by .6094 shows long sequences of MOV instructions that I believe are unnecessary.
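
For anyone who wants to quantify this on their own dumps, here is a minimal sketch that finds runs of consecutive MOV instructions in an assembly listing. It is not part of any Intel tool. The function name `mov_runs`, the `min_len` threshold and the assumed line layout (one instruction per line, optional parenthesized prefix such as `(W)`, `//` comments) are all assumptions for illustration, so the parsing may need adjusting to the actual dump format.

```python
import re

# Assumption: one instruction per line, optionally preceded by parenthesized
# groups such as "(W)" or "(f0.0)", with the opcode as the next token.
_OPCODE = re.compile(r"^\s*(?:\([^)]*\)\s*)*([A-Za-z_][\w.]*)")


def mov_runs(asm_text, min_len=4):
    """Return (start_line, length) for each run of at least min_len consecutive movs."""
    runs = []
    start, length = None, 0
    for lineno, line in enumerate(asm_text.splitlines(), 1):
        stripped = line.strip()
        if not stripped or stripped.startswith("//"):
            continue  # blank lines and comments do not break a run
        m = _OPCODE.match(stripped)
        # Drop any ".suffix" so a hypothetical "mov.xyz" still counts as mov.
        opcode = m.group(1).split(".")[0].lower() if m else ""
        if opcode == "mov":
            if length == 0:
                start = lineno
            length += 1
        else:
            if length >= min_len:
                runs.append((start, length))
            length = 0
    if length >= min_len:
        runs.append((start, length))
    return runs
```

Running it over the dumps from both drivers and comparing the number and length of the reported runs would give a rough measure of how much extra data movement the newer compiler emits.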

I wish there were a better way to report performance regressions (and reproducers) than here or the GitHub issues page (which is very quiet).

-ASM

