-
Notifications
You must be signed in to change notification settings - Fork 62
Open
Labels
OpenCLRunning on GPUs and similar devicesRunning on GPUs and similar devicescomp-LogicRelated to internal code logicRelated to internal code logicpri-LowKept mostly for referenceKept mostly for referenceusabilityMakes using code more convenientMakes using code more convenient
Description
The precise timing possibility of OpenCL matvec (removed in r1334) makes it
hard to track issues with the OpenCL kernels on different devices. A desired
goal would be to add an option to change the command queue into profiling mode
and get the precise timings from events which are returned by
clEnqueueNDRangeKernel. It should be possible to do this during runtime, so it
can be implemented as option to adda directly instead of a compiler option
using ifdefs.
This would help to identify performance issues of the kernels on different
devices.
r1334 - febb9ca
Original issue reported on code.google.com by [email protected] on 21 May 2014 at 7:22
Metadata
Metadata
Assignees
Labels
OpenCLRunning on GPUs and similar devicesRunning on GPUs and similar devicescomp-LogicRelated to internal code logicRelated to internal code logicpri-LowKept mostly for referenceKept mostly for referenceusabilityMakes using code more convenientMakes using code more convenient