Skip to content

Conversation

jiqing-feng
Copy link

This blog shows: Intel Granite Rapids (C4) provides both performance gains and better cost efficiency for large MoE inference than Sapphire Rapids (C3).

* running gpt-oss on Intel Xeon

Signed-off-by: jiqing-feng <[email protected]>

* add TTFT image

Signed-off-by: jiqing-feng <[email protected]>

* add _blog.yml

Signed-off-by: jiqing-feng <[email protected]>

* minor fix

Signed-off-by: jiqing-feng <[email protected]>

* fix blog

Signed-off-by: jiqing-feng <[email protected]>

* fix content

Signed-off-by: jiqing-feng <[email protected]>

* update thumbnail

Signed-off-by: jiqing-feng <[email protected]>

* update expert parallelism diagram

Signed-off-by: jiqing-feng <[email protected]>

* fix model name and model link

Signed-off-by: jiqing-feng <[email protected]>

* fix result image links

Signed-off-by: jiqing-feng <[email protected]>

* fix script

Signed-off-by: jiqing-feng <[email protected]>

* update results

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
Signed-off-by: jiqing-feng <[email protected]>
Signed-off-by: jiqing-feng <[email protected]>
Signed-off-by: jiqing-feng <[email protected]>
@jiqing-feng jiqing-feng marked this pull request as draft September 30, 2025 14:46
@jiqing-feng jiqing-feng marked this pull request as ready for review September 30, 2025 14:46
@yao-matrix
Copy link
Contributor

@kding1 @IlyasMoutawwakil , pls help review, thx very much

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants