Read the full thesis here: LANDING IN THE LATENT SAPCE, Building labeled synthetic runway datasets with a data augmentation pipeline that uses diffusion models
Access the dataset on Kaggle
"But test everything; hold fast what is good." — 1 Thessalonians 5:21
GARD is the largest publicly available synthetic runway dataset, created entirely through a modular data augmentation pipeline called Canny2Concrete that leverages latent diffusion models (Stable Diffusion XL + ControlNet). It was developed as part of a BSc Computer Science Final Project at the University of London in 2025.
GARD contains 45,486 high-resolution images, annotated with pixel-level segmentation masks and YOLO-format labels, featuring:
- Varied lighting conditions (day, night, dusk, dawn)
- Multiple weather conditions (rain, fog, snow)
- Realistic backgrounds and occlusions
- 🔧 Canny2Concrete Pipeline: A modular, open-source pipeline to generate realistic runway images from structural templates using ControlNet and Stable Diffusion XL.
- 🧪 Intrinsic and Extrinsic Evaluations: Metrics like SSIM and real-model performance using YOLOv11.
- 📊 Largest Synthetic Runway Dataset: Surpassing BARS, RLD, LARD, and FS2020 in size and environmental diversity.
- 💡 Reproducibility: Full metadata stored in JSON for every image, including generation parameters and random seeds.
First row is template image (from LARD), canny edge structure, and binary segmentation mask. Then, the other three rows are images from the Base Images, Variant Images, and Variant Images With Occlusion datasets, respectively.
📦 Kaggle:
👉 https://www.kaggle.com/datasets/depaulagu/gard2025
🧪 Includes:
BaseImages/
: 6,498 base imagesVariantImages/
: 19,494 with rotation and outpaintingVariantImagesWithOcclusion/
: 19,494 with environmental occlusion
Each image has:
.png
image.json
label with metadata.mask.png
segmentation mask.txt
YOLO-format label
Using YOLOv11:
- Trained on GARD, validated against real LARD test sets.
- GARD-trained models matched or outperformed models trained on LARD. Results published in the thesis and, along with trained segmentation models' weights, publicly available on Kaggle
- SSIM confirms image diversity while maintaining structural fidelity.
- Python, PyTorch
- Diffusers, ControlNet, Albumentations, ImgAug, OpenCV
- Stable Diffusion XL (DreamShaper XL)
- YOLOv11
- Kaggle Datasets
- Jupyter Notebooks
This work is dedicated to:
- To Our Lady, Mary, Mother of God and my mother, in gratitude for her maternal care and intercession throughout this journey.
- To Saint Thomas Aquinas, the Angelic Doctor, whose love of truth shaped the Christian intellectual tradition.
May this work, in whatever good it contains, be for the glory of God and the service of truth.