Skip to content

Conversation

@djeong20
Copy link
Contributor

@djeong20 djeong20 commented Dec 2, 2025

This PR updates the OpenCL kernels for INT4 computation by eliminating compile options.
This patch now computes macros as runtime variables within the kernel functions.
This change reduces engine initialization time by initializing and building the kernel only once.

Self-evaluation:

  1. Build test: [X]Passed [ ]Failed [ ]Skipped
  2. Run test: [X]Passed [ ]Failed [ ]Skipped

…tion

This PR updates the OpenCL kernels for INT4 computation by eliminating compile options.
This patch now computes macros as runtime variables within the kernel functions.
This change reduces engine initialization time by initializing and building the kernel only once.

**Self-evaluation:**
1. Build test: [X]Passed [ ]Failed [ ]Skipped
2. Run test:   [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghyeon Jeong <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant