Relational Transformer

This repository provides a reference implementation of the Relational Transformer architecture from the paper: Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data.

🥇 Foundation Model Capabilities of Relational Transformer:

[Figure: foundation-model capabilities of the Relational Transformer]

🤖 Overview of the Relational Transformer Architecture:

[Figure: the Relational Transformer architecture]

Installation

  1. Install pixi (https://pixi.sh/latest/#installation).

  2. Clone and install the repository:

git clone https://github.com/snap-stanford/relational-transformer
cd relational-transformer
pixi install
# compile and install the rust sampler
cd rustler
pixi run maturin develop --uv --release
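
If the install succeeded, the environment's Python dependencies should import cleanly. A quick check (torch is assumed here to be among the pixi dependencies, as implied by the torchrun commands below):

pixi run python -c "import torch; print(torch.__version__)"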

Data Preparation

  1. Download the datasets and tasks from RelBench:
cd .. # back to the root of the repository
pixi run python scripts/download_relbench.py
  2. Link the cache directory:
mkdir ~/scratch
ln -s ~/.cache/relbench ~/scratch/relbench
  3. Preprocess (per database):
cd rustler
pixi run cargo run --release -- pre rel-f1
  4. Compute text embeddings (per database):
pixi run python -m rt.embed rel-f1

Note

Steps 3 and 4 should be run for each of the databases:
rel-amazon, rel-avito, rel-event, rel-f1, rel-hm, rel-stack, rel-trial
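
To run steps 3 and 4 for every database in one go, a small shell loop (a sketch; it runs the preprocessing from within rustler/ as above, and assumes the embedding step works from the repository root):

for db in rel-amazon rel-avito rel-event rel-f1 rel-hm rel-stack rel-trial; do
  # Preprocess, then embed, one database at a time.
  (cd rustler && pixi run cargo run --release -- pre "$db")
  pixi run python -m rt.embed "$db"
done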

Download Preprocessed Data

This project’s preprocessed data is hosted at hvag976/relational-transformer on the Hugging Face Hub. You can use it directly and skip the Data Preparation steps above.

  1. Install the Hugging Face CLI:
pip install -U huggingface_hub
  2. Create the destination directory:
mkdir -p ~/scratch/pre
  3. Download the repository contents into ~/scratch/pre:
huggingface-cli download hvag976/relational-transformer \
  --repo-type dataset \
  --local-dir ~/scratch/pre \
  --local-dir-use-symlinks False
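
After the download, the preprocessed files should be in the location the training scripts expect; a quick way to confirm:

ls ~/scratch/pre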

Download Pretrained Checkpoints

This project's pretrained model checkpoints are hosted at rishabh-ranjan/relational-transformer on the Hugging Face Hub. You can directly use these checkpoints and skip the pretraining step.

Two kinds of checkpoints are provided:

  • pretrain_<dataset>_<task>.pt: pretrained with <dataset> held out.
  • contd-pretrain_<dataset>_<task>.pt: obtained by continued pretraining of pretrain_<dataset>_<task>.pt on <dataset> with <task> held out.

To use a checkpoint, pass its path as the load_ckpt_path argument of the training scripts (see the example after the download steps below).

  1. Install the Hugging Face CLI (if not already installed):
pip install -U huggingface_hub
  2. Download all checkpoints:
mkdir -p ~/scratch/rt_ckpts
huggingface-cli download rishabh-ranjan/relational-transformer \
  --repo-type model \
  --local-dir ~/scratch/rt_ckpts \
  --local-dir-use-symlinks False
  3. Alternatively, download a single checkpoint:
mkdir -p ~/scratch/rt_ckpts
huggingface-cli download rishabh-ranjan/relational-transformer \
  --repo-type model \
  --include "pretrain_rel-amazon_user-churn.pt" \
  --local-dir ~/scratch/rt_ckpts \
  --local-dir-use-symlinks False
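
For example, to finetune from a downloaded checkpoint (a sketch: the exact command-line syntax for load_ckpt_path is an assumption here; see scripts/finetune.py for how the argument is actually parsed):

pixi run torchrun --standalone --nproc_per_node=8 scripts/finetune.py \
  --load_ckpt_path ~/scratch/rt_ckpts/contd-pretrain_rel-amazon_user-churn.pt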

Experiments

First, set up wandb with pixi run wandb login or pixi run wandb disabled. The following example commands replicate the paper's results for rel-amazon/user-churn.

  • Pretrain on all datasets with rel-amazon held out (takes about 2 hours on 8xA100 GPUs):
pixi run torchrun --standalone --nproc_per_node=8 scripts/pretrain.py
  • Continue pretraining from the best checkpoint obtained above on all rel-amazon tasks with user-churn held out (takes about 15 minutes on 8xA100 GPUs):
pixi run torchrun --standalone --nproc_per_node=8 scripts/contd_pretrain.py
  • Finetune from the best checkpoint obtained above on the rel-amazon/user-churn task only (takes about 1.5 hours on 8xA100 GPUs):
pixi run torchrun --standalone --nproc_per_node=8 scripts/finetune.py
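
torchrun's --nproc_per_node flag controls how many GPU processes are launched; on a machine with fewer GPUs, the same scripts can be run with a smaller value, at the cost of correspondingly longer runtimes than quoted above. For example, on a single GPU:

pixi run torchrun --standalone --nproc_per_node=1 scripts/pretrain.py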

Citation

Please cite our paper:

@misc{ranjan2025relationaltransformer,
    title={Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data}, 
    author={Rishabh Ranjan and Valter Hudovernik and Mark Znidar and Charilaos Kanatsoulis and Roshan Upendra and Mahmoud Mohammadi and Joe Meyer and Tom Palczewski and Carlos Guestrin and Jure Leskovec},
    year={2025},
    eprint={2510.06377},
    archivePrefix={arXiv},
    primaryClass={cs.LG},
    url={https://arxiv.org/abs/2510.06377}, 
}
