@draganmladjenovic
Contributor

Motivation

Ensures that users do not have to distribute kernel files separately or set AITER_ASM_DIR.

Technical Details

Embed the code objects into the binary. Use hipRegisterFatBinary so that this works seamlessly across multiple GPUs. Make the CFG tables read-only and allocate AiterAsmKernels statically.
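To illustrate the "embedded bytes plus read-only, statically allocated table" idea, here is a minimal C++17 sketch. It is not the code from this PR: `kDemoBlob`, `KernelEntry`, `kKernelTable`, and `find_kernel` are hypothetical names, the blob bytes are placeholders, and the sketch makes no HIP calls (the hipRegisterFatBinary registration step is deliberately left out).

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <string_view>

// Hypothetical stand-in for an embedded code object. A real build would
// generate these bytes from a compiled kernel file; these are placeholders.
inline constexpr std::array<std::uint8_t, 4> kDemoBlob{0x7f, 'E', 'L', 'F'};

// One row of a read-only kernel table (field names are illustrative).
struct KernelEntry {
    std::string_view name;     // lookup key
    const std::uint8_t* data;  // start of the embedded bytes
    std::size_t size;          // length in bytes
};

// constexpr storage: allocated statically, with no heap use and no file I/O.
inline constexpr std::array<KernelEntry, 1> kKernelTable{{
    {"demo_kernel", kDemoBlob.data(), kDemoBlob.size()},
}};

// Linear lookup by name; returns nullptr when the name is unknown.
constexpr const KernelEntry* find_kernel(std::string_view name) {
    for (const auto& entry : kKernelTable) {
        if (entry.name == name) return &entry;
    }
    return nullptr;
}
```

Because the table and the bytes live in the binary itself, a lookup such as `find_kernel("demo_kernel")` needs no environment variable or external directory at run time, which is the property the Motivation section asks for.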

Test Plan

Ran selected tests from op_tests on gfx942.

@draganmladjenovic requested review from a team and valarLip on January 23, 2026 23:21