Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reduce number of threads in push kernel for heterogeneous delays in bundle mode #270

Open
denisalevi opened this issue Feb 15, 2022 · 0 comments

Comments

@denisalevi
Copy link
Member

We are calling the push kernel with as many threads as the largest synapse group ((preID, postGroup) pair). But in bundle mode we only need as many threads as the maximum number of bundles in a synapse group. For the Brunel Hakim benchmark that is 41 threads vs. 1024 threads.

Maybe round up to next multiple of warp size.

And if implement parallel reallocation of spike queue cuda vectors, then more threads might be beneficial though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant