
This is the official implementation of the paper

    Locality in Image Diffusion Models Emerges from Data Statistics

Artem Lukoianov 1,  Chenyang Yuan 2,  Justin Solomon 1,  Vincent Sitzmann 1

1 Massachusetts Institute of Technology,  2 Toyota Research Institute

For any questions, please email [email protected]

Main Comparison Results

[NOTE:] 🐛 Help us improve! We've noticed inconsistent generation on macOS -- check the current issues. If you encounter any bugs, inconsistent behavior, or have suggestions, please open an issue. Your feedback is valuable!

Models

The repository implements several analytical diffusion models:

  1. pca_locality (Main method): Our proposed analytical denoiser that captures locality from data statistics.
  2. optimal: The theoretically optimal denoiser (reproduces training images).
  3. wiener: Wiener filter-based denoiser.
  4. nearest_dataset: Baseline that retrieves the nearest dataset image at each step.
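
For intuition on how these relate: over an empirical training set, the theoretically optimal denoiser has a standard closed form -- the posterior mean under Gaussian noise, i.e. a softmax-weighted average of the training images, which is why it reproduces them. An informal sketch in LaTeX notation (see the paper for the precise statement), with training images x_i, noisy input x_t, and noise level sigma:

\hat{x}_0(x_t, \sigma) = \frac{\sum_i x_i \exp\left(-\|x_t - x_i\|^2 / 2\sigma^2\right)}{\sum_j \exp\left(-\|x_t - x_j\|^2 / 2\sigma^2\right)}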

Datasets

Supported datasets:

  • mnist: MNIST handwritten digits
  • fashion_mnist: Fashion-MNIST
  • cifar10: CIFAR-10
  • celeba_hq: CelebA-HQ
  • afhq: AFHQv2

Most datasets download automatically. celeba_hq and afhq do not, so please download them manually and place the data in data/datasets/, as sketched below.
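
The loaders should then find each dataset under its own subdirectory. The exact subfolder names below are an assumption -- check src/local_diffusion/data/ for the paths actually used:

data/datasets/
├── celeba_hq/   # manually downloaded CelebA-HQ images
└── afhq/        # manually downloaded AFHQv2 images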

Environment Setup

Prerequisites

  • Python 3.9 or higher
  • uv package manager
  • [Recommended] CUDA-capable GPU -- if you don't have one, make sure to change the device in the config to cpu or mps

Installation

No manual setup required! Just use uv run directly.

Alternative: Manual installation

If you prefer to set up the environment manually:

uv venv
source .venv/bin/activate  # On Linux/Mac (.venv\Scripts\activate on Windows)
uv pip install -e .

Download the baseline UNet weights and the data

First, run the script below to download the weights of the UNet models pre-trained on each of the baseline datasets. You can skip this step, but then the comparison metrics won't be available -- make sure to disable baseline_path in the config.

uv run download_baseline_weights.py

Running Experiments

Single Experiment

Now, run the command below to generate images with our analytical model. uv will automatically create the virtual environment and install all dependencies (including the package in editable mode):

uv run generate.py --config configs/pca_locality/celeba_hq.yaml

The config path can be:

  • Relative to configs/ directory: pca_locality/celeba_hq.yaml
  • Absolute path: /path/to/config.yaml

Batch Experiments

Run all baseline-dataset combinations using the provided script:

./run_all_baselines.sh

This script iterates over:

  • Baselines: pca_locality, optimal, wiener, nearest_dataset
  • Datasets: afhq, celeba_hq, cifar10, fashion_mnist, mnist

It automatically skips missing config files and runs each experiment sequentially.
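
For reference, the script's behavior is roughly equivalent to the following Python loop (a sketch for illustration, not the actual script contents):

import subprocess
from pathlib import Path

baselines = ["pca_locality", "optimal", "wiener", "nearest_dataset"]
datasets = ["afhq", "celeba_hq", "cifar10", "fashion_mnist", "mnist"]

for baseline in baselines:
    for dataset in datasets:
        config = Path("configs") / baseline / f"{dataset}.yaml"
        if not config.exists():
            continue  # skip missing config files
        # run each experiment sequentially
        subprocess.run(["uv", "run", "generate.py", "--config", str(config)], check=True)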

Notebook

For quick experimentation, you can use the Jupyter notebook: playground.ipynb

Configuration Files

Configuration files use YAML format with OmegaConf's defaults feature for inheritance. Each config inherits from configs/defaults.yaml and can override specific values.
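
Conceptually, this inheritance resolves to an OmegaConf merge along the following lines (a simplified sketch; see src/local_diffusion/configuration.py for the actual logic):

from omegaconf import OmegaConf

# Load a config and resolve its `defaults` list, assuming the listed
# paths are relative to configs/ (an assumption for this sketch).
cfg = OmegaConf.load("configs/pca_locality/celeba_hq.yaml")
merged = OmegaConf.create()
for default in cfg.pop("defaults", []):
    base = OmegaConf.load("configs/" + str(default).lstrip("/"))
    merged = OmegaConf.merge(merged, base)
merged = OmegaConf.merge(merged, cfg)  # child values override the defaults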

Configuration Structure

A typical config file (configs/pca_locality/celeba_hq.yaml) looks like:

defaults:
  - /defaults.yaml

# Run metadata: name, seed, device, tags
experiment:
  run_name: pca_locality_celeba_hq  # Name of the run - overwritten in each individual config file
  tags: [baseline, pca_locality, celeba_hq]  # Tags for experiment organization
  seed: 42  # Random seed for reproducibility
  device: cuda  # Device to run on (cuda/cpu/mps)

# Dataset configuration: name, split, resolution, batch size
dataset:
  name: celeba_hq  # Dataset name (mnist, cifar10, celeba_hq, afhq, fashion_mnist)
  split: train  # Dataset split to use
  download: false  # Whether to auto-download (set false for manual downloads)
  batch_size: 256  # Batch size for dataset loading
  resolution: 64  # Image resolution (overrides default if specified)

# Model selection and hyperparameters
# Available models: pca_locality, optimal, wiener, nearest_dataset
model:
  name: pca_locality  # Model to use
  params:
    temperature: 1.0  # Temperature parameter for softmax weighting
    mask_threshold: 0.02  # Threshold for mask binarization

# Generation parameters: number of samples, inference steps
sampling:
  num_samples: 8  # Total number of samples to generate
  batch_size: 8  # Batch size for generation
  num_inference_steps: 10  # Number of diffusion steps

# Output and logging settings: WandB, file saving
metrics:
  baseline_path: "data/models/baseline_unet/celeba_hq/ckpt_epoch_200.pt"  # Path to baseline UNet checkpoint for comparison
  output:
    save_final_images: true  # Save individual sample images
    save_image_grid: true  # Save grid of all samples
    save_intermediate_images: true  # Save intermediate diffusion steps
  wandb:
    enabled: true  # Enable Weights & Biases logging
    project: locality-diffusion  # WandB project name

Config Overrides via CLI

You can override any config value from the command line using dot notation:

uv run generate.py --config configs/pca_locality/celeba_hq.yaml \
    sampling.num_samples=16 \
    model.params.temperature=0.5 \
    experiment.device=cpu
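
Overrides of this form can be applied with OmegaConf's dot-list support; a minimal sketch of the mechanism:

from omegaconf import OmegaConf

cfg = OmegaConf.load("configs/pca_locality/celeba_hq.yaml")
overrides = OmegaConf.from_dotlist([
    "sampling.num_samples=16",
    "model.params.temperature=0.5",
    "experiment.device=cpu",
])
cfg = OmegaConf.merge(cfg, overrides)  # CLI values take precedence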

Output Structure

Each experiment creates a run directory with the following structure:

data/runs/{experiment_name}/{run_name}_{optional:timestamp}/
├── config.yaml              # Saved configuration
├── grid.png                 # Grid of generated samples
├── metrics.json             # Computed metrics
├── logs/
│   └── generate.log         # Execution log
├── artifacts/
│   ├── images/              # Individual sample images
│   │   └── sample_0000.png
│   ├── intermediate_images/ # Intermediate diffusion steps
│   │   ├── x_t/             # Noisy images at each step
│   │   └── x0_pred/         # Predicted clean images at each step
│   └── comparison/          # Comparison grids (if baseline_path set)
└── code_snapshot/           # Git-tracked code snapshot
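
To inspect a finished run programmatically, you can read the saved metrics file (the available keys depend on the configured metrics; the run path below is hypothetical):

import json
from pathlib import Path

run_dir = Path("data/runs/my_experiment/my_run")  # hypothetical run directory
with open(run_dir / "metrics.json") as f:
    metrics = json.load(f)
print(metrics)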

Weights & Biases Integration

WandB logging is enabled by default. Using WandB is convenient for studying generation results, but it can slow down the runs. To disable or configure it:

metrics:
  wandb:
    enabled: false  # Disable WandB
    mode: offline   # Use offline mode
    project: my-project

Contributing

We welcome contributions to this repository! Here are some ways you can help:

Reporting Issues

If you encounter bugs or have suggestions for improvements, please open an issue on GitHub. When reporting bugs, please include:

  • A clear description of the problem
  • Steps to reproduce the issue
  • Your environment details (Python version, OS, etc.)
  • Relevant error messages or logs

Contributing Code

  1. Fork the repository and create a new branch for your changes
  2. Follow the code style: The project uses standard Python conventions. Ensure your code is well-documented and follows the existing patterns
  3. Add tests if applicable (though the current codebase focuses on reproducibility of paper results)
  4. Update documentation if you add new features or change existing behavior
  5. Submit a pull request with a clear description of your changes

Adding New Models

To add a new analytical diffusion model:

  1. Create a new file in src/local_diffusion/models/ implementing the BaseDenoiser interface (see the sketch after this list)
  2. Register your model using the @register_model("model_name") decorator
  3. Add configuration files in configs/model_name/ for each dataset
  4. Update this README to document your model
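
A minimal sketch of such a model -- the import path, method name, and signature below are assumptions for illustration, so check src/local_diffusion/models/ for the actual BaseDenoiser interface and registration decorator:

import torch

from local_diffusion.models import BaseDenoiser, register_model  # assumed import path

@register_model("my_denoiser")
class MyDenoiser(BaseDenoiser):
    """Toy denoiser that returns the noisy input as its clean-image prediction."""

    def denoise(self, x_t: torch.Tensor, sigma: float) -> torch.Tensor:
        # A real model would use dataset statistics here
        # (see pca_locality.py for the paper's method).
        return x_t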

Project Directory Structure

The project follows a structured layout:

locality-in-diffusion-models/
├── configs/              # Configuration files
│   ├── defaults.yaml     # Base configuration with common defaults
│   ├── pca_locality/     # Configs for the method proposed in our paper
│   ├── optimal/          # Optimal denoiser baseline
│   ├── wiener/           # Wiener filter baseline
│   └── nearest_dataset/  # Nearest neighbor baseline
├── src/
│   └── local_diffusion/  # Main package code
│       ├── models/       # Model implementations (pca_locality.py, etc.)
│       ├── data/         # Dataset loading utilities
│       ├── configuration.py  # Config management
│       └── metrics.py    # Evaluation metrics
├── data/                 # Data directory (created automatically)
│   ├── datasets/         # Dataset storage
│   ├── models/           # Precomputed models (Wiener filters, etc.)
│   ├── runs/             # Experiment outputs
│   └── wandb/            # Weights & Biases logs
├── generate.py           # Main entry point for experiments
├── playground.ipynb      # Interactive Jupyter notebook for experimentation
└── run_all_baselines.sh  # Batch script to run all experiments

Citation

If you find our project useful, please consider citing it:

@inproceedings{lukoianovlocality,
      title={Locality in Image Diffusion Models Emerges from Data Statistics},
      author={Lukoianov, Artem and Yuan, Chenyang and Solomon, Justin and Sitzmann, Vincent},
      booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
      year={2025},
      primaryClass={cs.CV},
      url={https://locality.lukoianov.com/},
}
