Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

denisalevi · 2022-02-15T14:00:45Z

We are calling the push kernel with as many threads as the largest synapse group ((preID, postGroup) pair). But in bundle mode we only need as many threads as the maximum number of bundles in a synapse group. For the Brunel Hakim benchmark that is 41 threads vs. 1024 threads.

Maybe round up to next multiple of warp size.

And if implement parallel reallocation of spike queue cuda vectors, then more threads might be beneficial though.

denisalevi added the optimisation label Feb 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

denisalevi commented Feb 15, 2022

Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

Comments

denisalevi commented Feb 15, 2022