Skip to content

Conversation

echarlaix
Copy link
Contributor

@echarlaix echarlaix commented Sep 12, 2025

echarlaix and others added 4 commits September 16, 2025 13:38
Co-authored-by: Helena Kloosterman <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
echarlaix and others added 11 commits September 16, 2025 13:48
@echarlaix
Copy link
Contributor Author

Thanks a lot for the great review @pcuenca, didn't had time to include everything but will do in a second pass. The blog post is not ready for publication yet but once it is, I'll let you know

Copy link
Contributor

@merveenoyan merveenoyan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

super cool!

@echarlaix
Copy link
Contributor Author

Thanks a lot for your reviews @pcuenca @merveenoyan! The blog post is not ready yet (was set to draft but I should have clarified it in the description). Likely a lot will change in the following days, so don't want you to waste your time on corrections that could very well not be included in the final blog post, will let you know once ready !

@echarlaix echarlaix marked this pull request as ready for review October 7, 2025 16:00
Co-authored-by: Nikita Savelyev <[email protected]>
openvino-vlm.md Outdated
| openvino-8bit-woq| 0.247 | 0.016 | 0.482 | 63.928 |


This benchmark shows that small, optimized multimodal models, like [SmolVLM2-256M](https://huggingface.co/HuggingFaceTB/SmolVLM2-256M-Video-Instruct), can run efficiently on Intel CPUs. Weight-only quantization significantly reduces model size, improving efficiency without majorly impacting throughput.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cc @ezelanza would you mind updating once the benchmark is validated on your side

Co-authored-by: Nikita Savelyev <[email protected]>
Co-authored-by: Eze Lanza (Eze) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants