
Conversation

@danielclough
Contributor

Add xLSTM (Extended LSTM) Model and Example

Adds support for xLSTM (Extended Long Short-Term Memory), a modernized LSTM architecture that achieves performance competitive with Transformers while keeping inference complexity linear in sequence length.

Implementation

Model architecture (candle-transformers/src/models/xlstm.rs):

  • mLSTM blocks with matrix memory and exponential gating
  • Covariance update rule using the outer product of key-value pairs (see the recurrence sketch after this list)
  • Stabilized gates with soft-capping and log-space computation
  • GroupNorm without bias for multi-head normalization
  • SwiGLU FFN blocks with pre-norm residual connections
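
For context, the recurrence at the heart of the mLSTM block can be summarized as below. This is a minimal single-head sketch written against candle's tensor ops, not the code in xlstm.rs: the helper names (`soft_cap`, `mlstm_step`), the shapes, and the cap value of 15.0 are assumptions for illustration, and the output gate and multi-head GroupNorm applied in the full block are omitted.

```rust
use candle_core::{Result, Tensor};

/// Soft-capping: cap * tanh(x / cap), keeps pre-activations bounded.
fn soft_cap(x: &Tensor, cap: f64) -> Result<Tensor> {
    x.affine(1.0 / cap, 0.0)?.tanh()?.affine(cap, 0.0)
}

/// One recurrent step. Shapes (single head): q, k, v are (d,), the matrix
/// memory `c` is (d, d), the normalizer `n` is (d,), and `m`, `i_pre`,
/// `f_pre` are (1,) gate values in log space.
fn mlstm_step(
    c: &Tensor,
    n: &Tensor,
    m: &Tensor,
    q: &Tensor,
    k: &Tensor,
    v: &Tensor,
    i_pre: &Tensor,
    f_pre: &Tensor,
) -> Result<(Tensor, Tensor, Tensor, Tensor)> {
    // Soft-cap the gate pre-activations (cap value is illustrative).
    let i_pre = soft_cap(i_pre, 15.0)?;
    let f_pre = soft_cap(f_pre, 15.0)?;
    // Stabilizer: m_t = max(f_pre + m_{t-1}, i_pre), so the exponentials below
    // stay bounded even though the gates are exponential rather than sigmoid.
    let m_new = (&f_pre + m)?.maximum(&i_pre)?;
    let i_gate = (&i_pre - &m_new)?.exp()?;
    let f_gate = ((&f_pre + m)? - &m_new)?.exp()?;
    // Covariance update: C_t = f_t * C_{t-1} + i_t * v k^T (outer product).
    let outer = v.unsqueeze(1)?.matmul(&k.unsqueeze(0)?)?;
    let c_new = (c.broadcast_mul(&f_gate)? + outer.broadcast_mul(&i_gate)?)?;
    // Normalizer state: n_t = f_t * n_{t-1} + i_t * k.
    let n_new = (n.broadcast_mul(&f_gate)? + k.broadcast_mul(&i_gate)?)?;
    // Readout: h_t = (C_t q) / max(|n_t . q|, 1).
    let num = c_new.matmul(&q.unsqueeze(1)?)?.squeeze(1)?;
    let dot = n_new.unsqueeze(0)?.matmul(&q.unsqueeze(1)?)?.squeeze(1)?;
    let h = num.broadcast_div(&dot.abs()?.maximum(1.0)?)?;
    Ok((c_new, n_new, m_new, h))
}
```

Because the state is a fixed-size (d, d) matrix rather than a growing KV cache, each step costs the same regardless of how many tokens have been generated, which is where the linear inference complexity comes from.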

Text generation example (candle-examples/examples/xlstm/):

  • Single-token recurrent inference with stateful generation (see the loop sketch after this list)
  • Supports NX-AI/xLSTM-7b (~14GB VRAM in bf16, ~28GB in f32)
  • Configurable sampling (temperature, top-p, repeat penalty)
  • BOS token handling per model requirements
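
A hedged sketch of the stateful, single-token generation loop follows. The model's forward pass is abstracted as a closure because the exact API added by this PR is not reproduced here; `LogitsProcessor` and `apply_repeat_penalty` are existing candle-transformers utilities, while the seed, sampling values, and repeat-penalty window below are illustrative rather than the example's defaults.

```rust
use anyhow::{Error as E, Result};
use candle_core::{Device, Tensor};
use candle_transformers::generation::LogitsProcessor;
use candle_transformers::utils::apply_repeat_penalty;

fn generate(
    // One recurrent forward pass: token ids in, next-token logits (vocab,) out.
    mut step: impl FnMut(&Tensor) -> candle_core::Result<Tensor>,
    tokenizer: &tokenizers::Tokenizer,
    prompt: &str,
    max_tokens: usize,
    device: &Device,
) -> Result<Vec<u32>> {
    // Temperature / top-p sampling, matching the example's CLI flags.
    let mut sampler = LogitsProcessor::new(299792458, Some(0.8), Some(0.9));
    // `add_special_tokens = true` lets the tokenizer prepend BOS as the model requires.
    let mut tokens: Vec<u32> = tokenizer
        .encode(prompt, true)
        .map_err(E::msg)?
        .get_ids()
        .to_vec();
    for index in 0..max_tokens {
        // The prompt is fed once; afterwards only the latest token is passed,
        // since the recurrent state already summarizes the context.
        let ctx = if index == 0 { &tokens[..] } else { &tokens[tokens.len() - 1..] };
        let input = Tensor::new(ctx, device)?.unsqueeze(0)?;
        let logits = step(&input)?;
        // Penalize tokens seen in a recent window (window size is illustrative).
        let start = tokens.len().saturating_sub(64);
        let logits = apply_repeat_penalty(&logits, 1.1, &tokens[start..])?;
        tokens.push(sampler.sample(&logits)?);
    }
    Ok(tokens)
}
```

The actual example additionally decodes and prints tokens as they are sampled; that plumbing is omitted here.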

Usage

# Generate with default prompt (bf16, requires ~14GB VRAM)
cargo run --example xlstm --release --features cuda -- --prompt "Once upon a time" -n 50

# Use f32 precision
cargo run --example xlstm --release --features metal,accelerate -- --dtype f32 --prompt "The meaning of life is"
