Transformer-based models #57

ChloeCarbonniere opened this issue Apr 14, 2025 · 3 comments

@ChloeCarbonniere
Hello,

Is it possible to implement transformer-based models on the STM32N6?

@RSERSTM commented Apr 14, 2025

Hello ChloeCarbonniere,
What do you mean by transformer-based models? Attention layers? Encoder-only computer vision models? Generative models?
Thank you,

@ChloeCarbonniere (Author)

My final goal is to implement a DETR model on the STM32N6.

@RSERSTM commented Apr 14, 2025

OK, I understand.
For info, the STM32N6-DK has 128 MB of Flash to store the weights, and RAM (4.2 MB internal + 32 MB external) to store the activations (the data): https://www.st.com/en/evaluation-tools/stm32n6570-dk.html
In other words, for this kind of Transformer (DETR usually takes gigabytes of space), we don't have any in the model zoo for the moment because they are often too big for the STM32N6.
But it should be possible to implement very small Transformers with quantized (8-bit) attention layers, as long as the weights and activations fit in memory.
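As a rough sanity check, here is a back-of-the-envelope budget sketch in Python. The 4-layer / d_model=128 encoder configuration and the peak-activation estimate below are illustrative assumptions, not a validated N6 deployment:

```python
# Rough memory-budget check for a tiny int8 transformer encoder on the
# STM32N6-DK. Architecture numbers are illustrative assumptions only.

FLASH_BYTES = 128 * 1024 * 1024                          # 128 MB Flash (weights)
RAM_BYTES = int(4.2 * 1024 * 1024) + 32 * 1024 * 1024    # 4.2 MB internal + 32 MB external (activations)

# Hypothetical tiny encoder: 4 layers, d_model=128, 4 heads, FFN dim 256,
# sequence length 196 (e.g. 14x14 patch tokens), int8 weights/activations.
n_layers, d_model, d_ffn, seq_len, n_heads = 4, 128, 256, 196, 4

# Per layer: Q/K/V/output projections (4 * d_model^2) plus two FFN matrices;
# int8 means 1 byte per weight.
weights_per_layer = 4 * d_model * d_model + 2 * d_model * d_ffn
total_weight_bytes = n_layers * weights_per_layer

# Peak activations: token embeddings plus one seq_len x seq_len attention
# matrix per head, which is the usual worst case for attention layers.
act_bytes = seq_len * d_model + n_heads * seq_len * seq_len

print(f"weights:     {total_weight_bytes / 1024:8.1f} KiB "
      f"(Flash budget {FLASH_BYTES / 1024 / 1024:.0f} MiB)")
print(f"activations: {act_bytes / 1024:8.1f} KiB "
      f"(RAM budget {RAM_BYTES / 1024 / 1024:.1f} MiB)")
```

With these assumed numbers the sketch reports about 512 KiB of weights and roughly 175 KiB of peak activations, which would fit comfortably; a DETR-scale model blows past both budgets by orders of magnitude.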
Regards,
