HistDiST: Histopathological Diffusion-based Stain Transfer

Official PyTorch implementation of HistDiST. This repository contains scripts for fine-tuning pretrained Stable Diffusion models for stain transfer, along with the corresponding inference script.

Erik Großkopf, Valay Bundele, Mehran Hosseinzadeh, Hendrik P.A. Lensch

Abstract

Hematoxylin and Eosin (H&E) staining is the cornerstone of histopathology but lacks molecular specificity. While Immunohistochemistry (IHC) provides molecular insights, it is costly and complex, motivating H&E-to-IHC translation as a cost-effective alternative. Existing translation methods are mainly GAN-based, often struggling with training instability and limited structural fidelity, while diffusion-based approaches remain underexplored. We propose HistDiST, a Latent Diffusion Model (LDM) based framework for high-fidelity H&E-to-IHC translation. HistDiST introduces a dual-conditioning strategy, utilizing Phikon-extracted morphological embeddings alongside VAE-encoded H&E representations to ensure pathology-relevant context and structural consistency. To overcome brightness biases, we incorporate a rescaled noise schedule, v-prediction, and trailing timesteps, enforcing a zero-SNR condition at the final timestep. During inference, DDIM inversion preserves the morphological structure, while an η-cosine noise schedule introduces controlled stochasticity, balancing structural consistency and molecular fidelity. Moreover, we propose Molecular Retrieval Accuracy (MRA), a novel pathology-aware metric leveraging GigaPath embeddings to assess molecular relevance. Extensive evaluations on MIST and BCI datasets demonstrate that HistDiST significantly outperforms existing methods, achieving a 28% improvement in MRA on the H&E-to-Ki67 translation task, highlighting its effectiveness in capturing true IHC semantics.
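For reference, the zero-SNR condition mentioned above can be enforced by rescaling the beta schedule so that the cumulative signal weight vanishes at the final timestep, as proposed by Lin et al. (2023), "Common Diffusion Noise Schedules and Sample Steps are Flawed". A minimal PyTorch sketch of that standard rescaling (not the repository's exact code):

import torch

def rescale_betas_zero_terminal_snr(betas: torch.Tensor) -> torch.Tensor:
    # Rescale a beta schedule so the final timestep has zero SNR,
    # i.e. sqrt(alpha_bar_T) == 0 (Lin et al., 2023).
    alphas = 1.0 - betas
    alphas_bar = torch.cumprod(alphas, dim=0)
    sqrt_alphas_bar = alphas_bar.sqrt()

    # Shift so the last value is exactly zero, then rescale so the
    # first value is unchanged.
    sqrt_ab_0 = sqrt_alphas_bar[0].clone()
    sqrt_ab_T = sqrt_alphas_bar[-1].clone()
    sqrt_alphas_bar -= sqrt_ab_T
    sqrt_alphas_bar *= sqrt_ab_0 / (sqrt_ab_0 - sqrt_ab_T)

    # Convert the rescaled cumulative products back to betas.
    alphas_bar = sqrt_alphas_bar ** 2
    alphas = alphas_bar[1:] / alphas_bar[:-1]
    alphas = torch.cat([alphas_bar[0:1], alphas])
    return 1.0 - alphas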


Overview

HistDiST training pipeline

H&E generation (red arrows, label (2)) is conditioned on CLIP text embeddings, while H&E-to-IHC translation (green arrows, label (1)) is guided by Phikon embeddings and VAE-encoded H&E features. The VAE encoder maps images to latent space, where noise is added and later denoised by the U-Net. The Ge/Tr switch selects between the generation and translation tasks, with each (numbered input, color-coded pathway) followed independently.
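A minimal sketch of the dual-conditioned denoising step. Shapes and the channel-wise concatenation recipe are illustrative assumptions (a common choice in image-conditioned LDMs such as InstructPix2Pix); whether training/training.py does exactly this should be checked against the code:

import torch

# Illustrative shapes: 512x512 RGB -> 4x64x64 VAE latents; Phikon (ViT-B)
# yields 768-d tokens, which matches the SD 1.5 cross-attention width.
noisy_ihc_latent = torch.randn(1, 4, 64, 64)   # z_t: noised IHC latent
he_latent        = torch.randn(1, 4, 64, 64)   # VAE-encoded H&E condition
phikon_tokens    = torch.randn(1, 197, 768)    # morphological embeddings

# Dual conditioning: concatenate the H&E latent channel-wise for structural
# context, and feed Phikon tokens through cross-attention in place of the
# CLIP text embeddings used for the H&E generation task.
unet_input = torch.cat([noisy_ihc_latent, he_latent], dim=1)  # (1, 8, 64, 64)
# pred = unet(unet_input, t, encoder_hidden_states=phikon_tokens).sample
# With v-prediction, the regression target is
#   v_t = sqrt(alpha_bar_t) * eps - sqrt(1 - alpha_bar_t) * z_0.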

HistDiST inference pipeline

The VAE encoder maps the H&E image to latent space, where DDIM inversion derives the noise latent and η-noise scheduling injects noise at different timesteps during denoising. The U-Net, conditioned on Phikon embeddings, refines the features, and the VAE decoder generates the final IHC output.
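Both inference ingredients can be sketched compactly. The inversion step below is the standard deterministic DDIM update run forward in time under the epsilon parameterization, and the η-cosine curve is an illustrative assumption; the schedule actually used lives in inference/inference.py:

import math
import torch

def ddim_inversion_step(z_t, eps_pred, alpha_bar_t, alpha_bar_next):
    # One deterministic DDIM step run *forward* in time (t -> t+1),
    # mapping the encoded latent toward the noise latent.
    x0 = (z_t - (1 - alpha_bar_t).sqrt() * eps_pred) / alpha_bar_t.sqrt()
    return alpha_bar_next.sqrt() * x0 + (1 - alpha_bar_next).sqrt() * eps_pred

def eta_cosine(step: int, num_steps: int, eta_max: float = 1.0) -> float:
    # Illustrative cosine decay for DDIM's eta: high stochasticity early
    # in denoising, near-deterministic toward the end. The exact curve is
    # an assumption; see the inference script for the actual schedule.
    return eta_max * 0.5 * (1.0 + math.cos(math.pi * step / max(num_steps - 1, 1)))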

Setup

Models

All trained models can be downloaded from Google Drive. (Note: Google Drive may extract the U-Net safetensors file out of the zip into the root folder and append a suffix to its filename.)

Datasets

The MIST and BCI datasets are used to fine-tune Stable Diffusion 1.5 for H&E-to-ER/HER2/Ki67/PR transfer. 512x512 patches are randomly cropped from the 1024x1024 source images.
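Because each H&E/IHC pair is pixel-aligned, the random 512x512 crop has to be sampled once and applied at the same location to both images. A minimal torchvision sketch (the function name is ours, not from the repository):

import torchvision.transforms.functional as TF
from torchvision import transforms
from PIL import Image

def paired_random_crop(he: Image.Image, ihc: Image.Image, size: int = 512):
    # Sample one crop window and apply it to both images so the
    # H&E/IHC pair stays spatially aligned.
    i, j, h, w = transforms.RandomCrop.get_params(he, output_size=(size, size))
    return TF.crop(he, i, j, h, w), TF.crop(ihc, i, j, h, w)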

Requirements

We ran training and inference on one or more CUDA GPUs with 24 GB of VRAM each.

conda install pytorch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 pytorch-cuda=11.8 -c pytorch -c nvidia 
pip install diffusers==0.16.1 
pip install transformers==4.32.1 
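Optionally, verify the environment before training (a convenience snippet, not part of the repository):

import torch, diffusers, transformers

print("torch:", torch.__version__, "| cuda available:", torch.cuda.is_available())
print("diffusers:", diffusers.__version__)        # expected 0.16.1
print("transformers:", transformers.__version__)  # expected 4.32.1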

Training

  • Assumed dataset directory structure, following the MIST dataset layout (a minimal loader sketch follows the list):
    • TrainValAB/
      • trainA/ // source H&E images, 1024x1024
      • trainB/ // corresponding IHC images, 1024x1024
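A minimal loader sketch for this layout, assuming matching filenames in trainA/ and trainB/ (the actual dataset class is defined in training/training.py):

import os
from PIL import Image
from torch.utils.data import Dataset

class PairedStainDataset(Dataset):
    # Illustrative reader for the TrainValAB/ layout above;
    # root should point at the TrainValAB/ directory.
    def __init__(self, root):
        self.dir_a = os.path.join(root, "trainA")
        self.dir_b = os.path.join(root, "trainB")
        self.names = sorted(os.listdir(self.dir_a))  # assumes paired filenames

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        name = self.names[idx]
        he  = Image.open(os.path.join(self.dir_a, name)).convert("RGB")
        ihc = Image.open(os.path.join(self.dir_b, name)).convert("RGB")
        return he, ihc  # crop/encode as in the snippets above

Example multi-GPU training launch: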
MODEL_NAME="botp/stable-diffusion-v1-5"
DATASET_DIR=/set/here/path/to/dataset/
export CUDA_VISIBLE_DEVICES=0,1,2  # one visible GPU per process (--num_processes=3 below)
export NCCL_P2P_DISABLE="1"
export NCCL_IB_DISABLE="1"

accelerate launch --num_processes=3 --num_machines=1 --mixed_precision=bf16 --dynamo_backend=no --gpu_ids $CUDA_VISIBLE_DEVICES training/training.py \
--data_set_dir=$DATASET_DIR \
--num_train_epochs=300 \
--validation_epochs=50 \
--prediction_type="v_prediction" \
--output_dir=training/output \
--bias_he_ihc=0.5 \
--pretrained_model_name_or_path=$MODEL_NAME \
--train_batch_size=16 \
--learning_rate=1.5e-4 \
--lr_scheduler="cosine" \
--lr_warmup_steps=1000 \
--mixed_precision="bf16" \
--resolution=512 \
--translation_prompt="IHC" \
--he_generation_prompt="H&E" \
--gradient_checkpointing \
--enable_xformers_memory_efficient_attention \
--report_to="wandb" \
--checkpointing_steps=4000
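For intuition, --bias_he_ihc controls the Ge/Tr switch, i.e. how often a training step performs H&E generation versus H&E-to-IHC translation. A hedged sketch of that sampling logic; the interpretation of the flag is our assumption, so check training/training.py for the authoritative version:

import random

def sample_task(bias_he_ihc: float = 0.5) -> str:
    # Pick the task for one training step. With bias_he_ihc=0.5 the model
    # alternates evenly between translation (Phikon + H&E latent
    # conditioning) and H&E generation (CLIP text conditioning).
    return "translation" if random.random() < bias_he_ihc else "generation"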

Inference

python inference/inference.py \
--model_folder_path /set/here/path/to/er_model_folder \
--img_path inference/example_images/he.jpg

Evaluation

python evaluate.py \
--target_dir /path/to/target/dir \
--generated_dir /path/to/generated/dir
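Molecular Retrieval Accuracy (MRA), described in the abstract, can be read as embedding-space retrieval: embed generated and ground-truth IHC images with Prov-GigaPath and check whether each generated image retrieves its own ground truth as nearest neighbor. The following is a hedged sketch of that idea, not the exact evaluate.py code:

import torch
import torch.nn.functional as F

def molecular_retrieval_accuracy(gen_emb: torch.Tensor,
                                 gt_emb: torch.Tensor) -> float:
    # gen_emb, gt_emb: (N, D) GigaPath embeddings of generated and
    # ground-truth IHC images, with row i of each forming a pair.
    # Top-1 retrieval accuracy over cosine similarity (our reading of MRA).
    gen = F.normalize(gen_emb, dim=1)
    gt  = F.normalize(gt_emb, dim=1)
    sim = gen @ gt.T                       # (N, N) cosine similarities
    top1 = sim.argmax(dim=1)               # nearest ground truth per generated image
    return (top1 == torch.arange(len(gen))).float().mean().item()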

Citation

If you use this code for your research, please cite our paper.

@misc{grosskopf2025histdist,
  title={HistDiST: Histopathological Diffusion-based Stain Transfer}, 
  author={Erik Großkopf and Valay Bundele and Mehran Hosseinzadeh and Hendrik P. A. Lensch},
  year={2025}
}

Acknowledgement

We thank Fangda Li et al. for the MIST dataset, which we used for both training and evaluation, and for the evaluation script on which we based our evaluation implementation. We also thank Liu et al. for the BCI dataset, likewise used for both training and evaluation. We thank Filiot et al. for the pretrained Phikon model, which we used to encode input features during translation training, and Xu et al. for the pretrained Prov-GigaPath model used in our retrieval-accuracy metric. Finally, we thank the Hugging Face Diffusers authors for the training and inference scripts on which our implementation is based.
