Dawid Kopeć, Wojciech Kozłowski, Maciej Wizerkaniuk, Dawid Krutul, Jan Kocoń, and Maciej Zięba

WUST, Wybrzeże Stanisława Wyspiańskiego 27, 50-370 Wrocław, Poland

{wojciech.kozlowski, jan.kocon, maciej.zieba}@pwr.edu.pl
This repository contains the implementation of our novel super-resolution (SR) method, as presented in our paper published at ICCS 2025. The repository is designed with modularity and flexibility in mind, leveraging PyTorch Lightning for training, Hydra for configuration management, and Weights & Biases (W&B) for experiment tracking.
In this work, we present SupResDiffGAN, a novel hybrid architecture that combines the strengths of Generative Adversarial Networks (GANs) and diffusion models for super-resolution tasks. By leveraging latent space representations and reducing the number of diffusion steps, SupResDiffGAN achieves significantly faster inference times than other diffusion-based super-resolution models while maintaining competitive perceptual quality. To prevent discriminator overfitting, we propose adaptive noise corruption, ensuring a stable balance between the generator and the discriminator during training. Extensive experiments on benchmark datasets show that our approach outperforms traditional diffusion models such as SR3 and I2SB in efficiency and image quality. This work bridges the performance gap between diffusion- and GAN-based methods, laying the foundation for real-time applications of diffusion models in high-resolution image generation.
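To make the adaptive noise corruption idea concrete, below is a minimal, illustrative sketch of an accuracy-driven noise schedule for discriminator inputs. The class name, thresholds, and update rule are assumptions made for illustration only; the exact mechanism used by SupResDiffGAN is defined in the paper and in this repository's model code.

```python
# Minimal sketch of the adaptive noise corruption idea: Gaussian noise is
# added to the discriminator's inputs, and its strength increases when the
# discriminator becomes too accurate and decreases otherwise. The update
# rule, thresholds, and class name are illustrative assumptions, not the
# repository's actual implementation.
import torch


class AdaptiveNoiseCorruption:
    def __init__(self, sigma: float = 0.05, step: float = 0.01,
                 target_accuracy: float = 0.8) -> None:
        self.sigma = sigma                      # current noise strength
        self.step = step                        # adjustment speed
        self.target_accuracy = target_accuracy  # accuracy that triggers more noise

    def update(self, discriminator_accuracy: float) -> None:
        # Strengthen corruption when the discriminator is winning, relax it otherwise.
        if discriminator_accuracy > self.target_accuracy:
            self.sigma = min(self.sigma + self.step, 1.0)
        else:
            self.sigma = max(self.sigma - self.step, 0.0)

    def __call__(self, images: torch.Tensor) -> torch.Tensor:
        # Corrupt discriminator inputs (real or generated) with Gaussian noise.
        return images + self.sigma * torch.randn_like(images)
```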
- SupResDiffGAN: a new approach for the Super-Resolution task 🚀✨
Perceptual quality (LPIPS, lower is better). The best and second-best results are highlighted in bold and underline, respectively. Methods are categorized into Diffusion-based and GAN-based to reflect their distinct architectural frameworks.
| Model / Dataset | Imagenet | Celeb | Div2k | RealSR-nikon | RealSR-canon | Set14 | Urban100 |
|---|---|---|---|---|---|---|---|
| Metric | LPIPS ↓ | LPIPS ↓ | LPIPS ↓ | LPIPS ↓ | LPIPS ↓ | LPIPS ↓ | LPIPS ↓ |
| GAN-based methods | |||||||
| SRGAN | 0.3452 | 0.2441 | 0.3327 | 0.3464 | 0.3050 | 0.2901 | 0.3156 |
| ESRGAN | 0.2320 | 0.1903 | 0.2649 | 0.3380 | 0.3053 | 0.2375 | 0.2408 |
| Real-ESRGAN | 0.2123 | 0.1690 | 0.2562 | 0.3309 | 0.3020 | 0.2301 | 0.2285 |
| Diffusion-based methods | |||||||
| SR3 | 0.3519 | 0.2229 | 0.3396 | 0.4018 | 0.4008 | 0.3015 | 0.2428 |
| I2SB | 0.3755 | 0.2221 | 0.3309 | 0.4069 | 0.3867 | 0.3169 | 0.2635 |
| ResShift | 0.5360 | 0.3275 | 0.4724 | 0.4959 | 0.4671 | 0.4832 | 0.4822 |
| SupResDiffGAN | 0.3079 | 0.1875 | 0.2876 | 0.3970 | 0.3853 | 0.2789 | 0.2570 |
Inference efficiency (time per batch in seconds, lower is better). The best and second-best results are highlighted in bold and underline, respectively. Methods are categorized into Diffusion-based and GAN-based to reflect their distinct architectural frameworks.
| Model / Dataset | Imagenet | Celeb | Div2k | RealSR-nikon | RealSR-canon | Set14 | Urban100 |
|---|---|---|---|---|---|---|---|
| Metric | Time per batch [s] | Time per batch [s] | Time per batch [s] | Time per batch [s] | Time per batch [s] | Time per batch [s] | Time per batch [s] |
| GAN-based methods | |||||||
| SRGAN | 0.0671 | 0.0109 | 0.0193 | 0.0367 | 0.0113 | 0.0888 | 0.0070 |
| ESRGAN | 0.2188 | 0.0870 | 0.2316 | 0.2711 | 0.1504 | 0.2049 | 0.0821 |
| Real-ESRGAN | 0.1392 | 0.0816 | 0.1899 | 0.2468 | 0.1427 | 0.2361 | 0.1013 |
| Diffusion-based methods | |||||||
| SR3 | 1.9953 | 0.3072 | 7.6377 | 8.4242 | 3.6420 | 0.8627 | 1.5028 |
| I2SB | 1.6776 | 0.1184 | 6.7292 | 7.0910 | 3.1629 | 1.8049 | 1.2395 |
| ResShift | 2.2466 | 0.4394 | 8.6647 | 8.9677 | 4.1880 | 0.5983 | 1.6762 |
| SupResDiffGAN | 0.2954 | 0.1832 | 0.9333 | 1.0021 | 0.6114 | 0.3542 | 0.3206 |
Two representative SupResDiffGAN outputs: (top) 4× face super-resolution at 128×128→512×512 pixels; (bottom) 4× natural-image super-resolution at 125×93→500×372 pixels.
Qualitative comparison of visual performance on two example images from ImageNet. Low-quality inputs are on the left, while results from bicubic upscaling and seven SR models (SRGAN, ESRGAN, Real-ESRGAN, SR3, ResShift, I2SB, and ours) are on the right.
- Python >= 3.9
- PyTorch Lightning == 2.2.2
- CUDA-enabled GPU (recommended for training)
- Clone the repository:

  ```bash
  git clone https://github.com/Dawir7/SupResDiffGAN.git
  cd SupResDiffGAN
  ```

- Create a virtual environment and activate it:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Install the required dependencies:

  ```bash
  pip install -r requirements-gpu.txt
  ```
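Optionally, you can verify that the installed PyTorch build can see your GPU before training. This is only a small sanity check, not part of the installation steps above:

```python
# Quick check that PyTorch detects a CUDA-capable GPU (recommended for training).
import torch

print("PyTorch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```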
This repository uses Hydra for managing configurations. Configuration files are located in the `conf/` directory. You can override any configuration parameter directly from the command line:

```bash
python train_model.py model.name=ESRGAN dataset.batch_size=16 trainer.max_epochs=50
```

More information about overriding parameters can be found in the Hydra documentation (Basic Override syntax).
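For reference, a minimal Hydra entry point looks like the sketch below. This is an illustrative example, not the repository's actual `train_model.py`; it only shows how command-line overrides such as `model.name` or `trainer.max_epochs` end up in the composed configuration object.

```python
# Minimal Hydra entry point (illustrative): command-line overrides are merged
# into the config loaded from conf/config.yaml before main() is called.
import hydra
from omegaconf import DictConfig, OmegaConf


@hydra.main(version_base=None, config_path="conf", config_name="config")
def main(cfg: DictConfig) -> None:
    # e.g. `python this_script.py model.name=ESRGAN trainer.max_epochs=50`
    # changes cfg.model.name and cfg.trainer.max_epochs here.
    print(OmegaConf.to_yaml(cfg))


if __name__ == "__main__":
    main()
```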
- `config.yaml`: Default configuration file.
- `config_srgan.yaml`: Configuration for SRGAN.
- `config_esrgan.yaml`: Configuration for ESRGAN.
- `config_real_esrgan.yaml`: Configuration for Real-ESRGAN.
- `config_sr3.yaml`: Configuration for SR3.
- `config_i2sb.yaml`: Configuration for I2SB.
- `config_resshift.yaml`: Configuration for ResShift.
- `config_supresdiffgan.yaml`: Configuration for SupResDiffGAN.
- `config_supresdiffgan_without_adv.yaml`: Configuration for SupResDiffGAN without a discriminator or adversarial loss.
- `config_supresdiffgan_simple_gan.yaml`: Configuration for SupResDiffGAN with a discriminator but without Gaussian noise augmentation.
This repository integrates Weights & Biases (W&B) for experiment tracking. Follow these steps to get started:
- Login to W&B:

  ```bash
  wandb login
  ```

- Track Experiments:

  - Metrics, losses, and visualizations are automatically logged to your W&B project.
  - Customize the W&B project name in the configuration file in use, e.g. (a usage sketch follows this list):

    ```yaml
    wandb_logger:
      project: 'your_project' # your wandb project
      entity: 'your_entity' # your wandb entity
    ```

- View Results:

  - Visit https://wandb.ai and navigate to your project to view experiment results.
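As a rough illustration, the sketch below shows how a `wandb_logger` section like the one above is typically turned into a PyTorch Lightning logger; the repository's actual wiring may differ.

```python
# Illustrative only: building a PyTorch Lightning WandbLogger from the
# wandb_logger config fields. Extra keyword arguments such as `entity`
# are forwarded to wandb.init().
from pytorch_lightning import Trainer
from pytorch_lightning.loggers import WandbLogger

wandb_logger = WandbLogger(project="your_project", entity="your_entity")
trainer = Trainer(logger=wandb_logger, max_epochs=50)
```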
This section outlines how to download the necessary datasets for training and evaluating the SupResDiffGAN model. We provide a convenient bash script to automate the download process.
- Activated virtual environment (as described in the Installation section).
- Note: If you haven't installed all GPU requirements using `requirements-gpu.txt`, the minimal libraries required for downloading the CelebA and ImageNet datasets are listed in `requirements-data.txt`. You can install these specifically using:

  ```bash
  pip install -r requirements-data.txt
  ```
The `get_data.sh` script will download the specified datasets to the appropriate directories (the exact locations are defined within the script). Please ensure you have sufficient disk space before running the script.
Notes:
- The specific implementation and sources for each dataset download are defined within the `get_data.sh` script. Refer to the script for more details on the download process for each dataset.
- Due to the potentially long download and processing times for some datasets, especially ImageNet and large RealSR variants, it is highly recommended to run the script within a terminal multiplexer such as `tmux` or `screen`. This allows the process to continue even if your SSH connection is interrupted.
- Crucially, each dataset is subject to its own license terms and conditions. By using any of these datasets, you are solely responsible for understanding and complying with the respective dataset's license. We, as the authors of this code repository, assume no responsibility for your usage of these datasets or any potential license violations. It is your responsibility to ensure your use adheres to the terms set forth by the dataset providers.
We strongly recommend that you familiarize yourself with the licensing terms of any dataset you choose to use before downloading and incorporating it into your workflow. Links to the official licenses are typically available on the dataset providers' websites.
- Ensure you are in the repository's root directory:

  ```bash
  cd SupResDiffGAN
  ```

- Run the `get_data.sh` script with the desired dataset flags. The script accepts the following flags:

  - `-i` or `--imagenet`: Downloads the ImageNet dataset.
  - `-c` or `--celeba`: Downloads the CelebA dataset.
  - `-d` or `--div2k`: Downloads the Div2k dataset.
  - `-r` or `--realsr`: Downloads the RealSR dataset.
  - `-s` or `--set14`: Downloads the Set14 dataset.
  - `-u` or `--urban100`: Downloads the Urban100 dataset.
- Download the ImageNet dataset:

  ```bash
  bash get_data.sh -i
  ```

- Download the ImageNet and CelebA datasets:

  ```bash
  bash get_data.sh -i -c
  ```

- Download supported datasets using full names:

  ```bash
  bash get_data.sh --celeba --div2k
  ```
To train a model, use the `train_model.py` script. Example:

```bash
python train_model.py -cn "config_supresdiffgan"
```

To evaluate a trained model, use the `evaluate_model.py` script. Example:

```bash
python evaluate_model.py "config_supresdiffgan"
```

More about configs in CONFIGS.md.
More about usage of Hydra flags: Hydra documentation
We provide pre-trained weights for SupResDiffGAN to facilitate evaluation and fine-tuning. These weights are trained on ImageNet and can be used for inference or as a starting point for further training.
To use a pre-trained model, specify the path to the checkpoint file in the `load_model` field of the configuration file. For example, in `config.yaml`:
```yaml
model:
  load_model: 'path/to/your/checkpoint_file.pth' # Path to the pre-trained model checkpoint
```

If you use this repository in your research, please cite our paper:
```bibtex
@inproceedings{kopec2025supresdiffgan,
  title={SupResDiffGAN: A New Approach for the Super-Resolution Task},
  author={Kope{\'c}, Dawid and Koz{\l}owski, Wojciech and Wizerkaniuk, Maciej and Krutul, Dawid and Koco{\'n}, Jan and Zi{\k{e}}ba, Maciej},
  booktitle={Proceedings of the International Conference on Computational Science (ICCS)},
  year={2025}
}
```
We would like to acknowledge the following repositories and works that served as inspiration or baselines for our research:
- PyTorch-GAN: A collection of PyTorch implementations of GANs.
- Real-ESRGAN-bicubic: A bicubic version of Real-ESRGAN for super-resolution tasks.
- Real-ESRGAN: A practical algorithm for general image restoration.
- ResShift: A novel approach for image super-resolution.
- I2SB: A diffusion-based method for image-to-image super-resolution.
We are grateful for the contributions of these projects to the field of super-resolution and deep learning.
This repository is licensed under the Academic Free License (AFL) v3.0. See the LICENSE.txt file for the full license text.
By using this repository, you agree to comply with the terms of the Academic Free License and any applicable third-party licenses.
Some parts of this repository are modified or adapted from other open-source projects mentioned in the Acknowledgement 🙏 section. These parts retain their original licenses, which are included in their respective directories. Please refer to the following for more details:
- Real-ESRGAN: Licensed under the BSD 3-Clause License. See the RealESRGAN/LICENSE.txt file for the full license text.
- ResShift: Licensed under the S-Lab License 1.0. See the ResShift/LICENSE.txt file for the full license text.
- I2SB: Licensed under the NVIDIA Source Code License. See the I2SB/LICENSE.txt file for the full license text.



