DiffCoRe-Mix: Context-Guided Responsible Data Augmentation with Diffusion Models [ICLRw-2025]

Khawar Islam, Naveed Akhtar

School of Computing and Information Systems, The University of Melbourne

📢 Latest Updates

Mar-15-25: Preprint is available.
Mar-13-25: Public release of the code and models.
Mar-12-25: Paper accepted at ICLRw-2025.

Key Features

Contextual & Negative Prompting: Guides the diffusion process to generate domain-specific images while suppressing undesired content.
Hard Cosine Similarity Filtration: Uses CLIP embeddings to filter out generated samples that do not meet semantic alignment criteria.
Composite Image Mixing: Combines real and generative images using both pixel-wise and patch-wise strategies.
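The filtering and mixing features above can be illustrated with a minimal, dependency-free sketch. This is not the repository's implementation: plain Python lists stand in for CLIP embeddings and image arrays, and the function names, the reference embedding, the 0.8 threshold, the mixing weight `lam`, and the patch coordinates are all hypothetical choices made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def hard_filter(gen_embeddings, ref_embedding, threshold=0.8):
    """Return indices of generated samples whose similarity to the
    reference embedding reaches the threshold (assumed value, not from the README)."""
    return [
        i
        for i, emb in enumerate(gen_embeddings)
        if cosine_similarity(emb, ref_embedding) >= threshold
    ]


def pixel_mix(real, gen, lam=0.5):
    """Pixel-wise mix: weighted average of two same-sized 2D images."""
    return [
        [lam * r + (1.0 - lam) * g for r, g in zip(real_row, gen_row)]
        for real_row, gen_row in zip(real, gen)
    ]


def patch_mix(real, gen, top, left, height, width):
    """Patch-wise mix: copy a rectangular patch of the generated image
    into a copy of the real image."""
    out = [row[:] for row in real]
    for y in range(top, top + height):
        out[y][left:left + width] = gen[y][left:left + width]
    return out
```

In a real pipeline the embeddings would come from a CLIP image or text encoder and the images would be tensors; the hard threshold simply discards any generated sample that falls below it rather than down-weighting it.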

Install

Clone this repository and navigate to the DiffCoRe-Mix folder:

git clone https://github.com/khawar-islam/DiffCoRe-Mix.git
cd DiffCoRe-Mix

Create and activate the conda environment:

conda create -n DiffCoreMix python=3.9.19 -y
conda activate DiffCoreMix

Download the pre-trained CosXL model:

https://huggingface.co/cocktailpeanut/c/blob/main/cosxl.safetensors

To run the augmentation process, use:

python main.py --dataset <DATASET_NAME> --output_folder <PATH_TO_OUTPUT_FOLDER> --aug_per <AUGMENTATION_PERCENTAGE>

For instance, to augment the CUB200 dataset at a 30% augmentation rate:

python main.py --dataset cub200 --output_folder /path/to/cub200/train --aug_per 0.3
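As a rough guide to how these flags might be consumed, the sketch below parses the same three arguments with `argparse`. The actual parser in `main.py` is not shown in this README, so the defaults, help strings, and the helper `num_to_augment` are hypothetical; in particular, reading `--aug_per` as a fraction of the training images is an assumption.

```python
import argparse


def build_parser():
    """Build a parser mirroring the three flags shown in the command above."""
    parser = argparse.ArgumentParser(description="Augmentation CLI sketch")
    parser.add_argument("--dataset", required=True, help="dataset name, e.g. cub200")
    parser.add_argument("--output_folder", required=True, help="where to write images")
    parser.add_argument("--aug_per", type=float, default=0.3,
                        help="augmentation rate in [0, 1] (assumed meaning)")
    return parser


def num_to_augment(n_images, aug_per):
    """Number of images to augment, assuming aug_per is a fraction of the dataset."""
    if not 0.0 <= aug_per <= 1.0:
        raise ValueError("aug_per must be between 0 and 1")
    return round(n_images * aug_per)


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.dataset, args.output_folder, args.aug_per)
```

Under this reading, `--aug_per 0.3` on a 1000-image training set would select 300 images for augmentation.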

Examples

Citation

If you use DiffCoRe-Mix in your research, please cite our paper:

@inproceedings{islam2025context,
  title={Context-Guided Responsible Data Augmentation with Diffusion Models},
  author={Islam, Khawar and Akhtar, Naveed},
  booktitle={ICLR 2025 Workshop on Navigating and Addressing Data Problems for Foundation Models}
}
