Skip to content

Conversation

@dnikolaev-amd
Copy link

Total number of kernel threads (blocks * threads) must fit within 32 bits on ROCm
Looking for a way to limit the total number of threads if more than 2^23 threads required for triu_tril_kernel by increaseing the number of elements_per_thread

@dnikolaev-amd dnikolaev-amd force-pushed the dnikolaev/limit_threads_number_triu branch 3 times, most recently from 7cea2bb to d3771cb Compare January 24, 2026 00:58
@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 24, 2026

Jenkins build for b179e3e2d1076693fdb8c01dde507e3ae501bb00 commit finished as NOT_BUILT
Links: Pipeline Overview / Build artifacts / Test Results

@dnikolaev-amd dnikolaev-amd force-pushed the dnikolaev/limit_threads_number_triu branch from d3771cb to b179e3e Compare January 24, 2026 01:49
@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jan 24, 2026

Jenkins build for b179e3e2d1076693fdb8c01dde507e3ae501bb00 commit finished as FAILURE
Links: Pipeline Overview / Build artifacts / Test Results

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants