⭐ GraCo: Granularity-Controllable Interactive Segmentation

This is the official implementation for our CVPR'24 highlight paper "GraCo: Granularity-Controllable Interactive Segmentation".

📣 Updates 🔥🔥🔥

[2025.1.4] Add a new type of granularity control signal: Semantic Phrase, which is very useful for segmenting specific parts (head segmentation, hand segmentation, etc.).
[2025.1.4] Releases the weights, and new training and inference code that supports both the granularity slider and the semantic phrase.
[2025.1.4] Update the interactive demo to support two granularity control signals. The tool can be put directly into practice with open source weights.
[2025.1.4] Add a novel Multi-grained Mask Trie (MMT) module and an extended Granularity-Controllable Learning (GCL) strategy. The former automatically extends the granularity abundance of existing part annotations through heuristic part merging, and the latter achieves efficient scaling and training of two granularity signals through dual-branch LoRA. The original AGG is split into the Fine-grained Mask Generator (FMG) and the Mask Granularity Estimator (MGE). The FMG is the same as the mask engine of the AGG, and the MGE is responsible for estimating the two types of granularity control signals for each mask.

💡 Introduction

Current IS pipelines fall into two categories: single-granularity output and multi-granularity output. The latter aims to alleviate the spatial ambiguity present in the former. However, the multi-granularity output pipeline suffers from limited interaction flexibility and produces redundant results. We introduce Granularity-Controllable Interactive Segmentation (GraCo), a novel approach that allows precise control of prediction granularity by introducing additional parameters to input. This enhances the customization of the interactive system and eliminates redundancy while resolving ambiguity. Nevertheless, the exorbitant cost of annotating multi-granularity masks and the lack of available datasets with granularity annotations make it difficult for models to acquire the necessary guidance to control output granularity. To address this problem, we design an any-granularity mask generator that exploits the semantic property of the pre-trained IS model to automatically generate abundant mask-granularity pairs without requiring additional manual annotation. Based on these pairs, we propose a granularity-controllable learning strategy that efficiently imparts the granularity controllability to the IS model.

🚀 Quick start

📍 Install

Install torch

# Install torch (according to your own cuda version, take 11.8 as an example)
pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118

Install other dependencies

# Install other dependencies
pip install -r requirements.txt

🍇 Our interactive demo

# running on cpu
python demo.py --checkpoint path/to/weights/sbd_vit_base.pth --lora_checkpoint path/to/GraCo_base_lora.pth --cpu

# running on gpu
python demo.py --checkpoint path/to/weights/sbd_vit_base.pth --lora_checkpoint path/to/GraCo_base_lora.pth --gpu 0

🏕️ Any-Granularity mask Generator (Optional)

If you do not use the automatically generated pseudo mask proposals, simply remove --part_path in the training command.

python any_granularity_generator.py --checkpoint weights/simpleclick/sbd_vit_base.pth  \
    --save-path part_output --save-name proposal.pkl --dataset-path /path/to/datasets/SBD/dataset

🦄 Train and Evaluation

Download pre-trained weights and place them in ./weights/simpleclick/

SimpleClick models

Train

bash train.sh

Evaluation on Instance-level, Part-level, Out-of-domain benchmarks

bash eval.sh

Complementarity analysis of two types of granularity control signals

bash analysis.sh

Acknowledgements

This repository is built upon SimpleClick. The project page is built using the template of Nerfies. Thank the authors of these open source repositories for their efforts. And thank the ACs and reviewers for their effort when dealing with our paper.

✨ Citation

If you find this repository helpful, please consider citing our paper.

@inproceedings{zhao2024graco,
  title={GraCo: Granularity-Controllable Interactive Segmentation},
  author={Zhao, Yian and Li, Kehan and Cheng, Zesen and Qiao, Pengchong and Zheng, Xiawu and Ji, Rongrong and Liu, Chang and Yuan, Li and Chen, Jie},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={3501--3510},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
assets		assets
interactive_demo		interactive_demo
isegm		isegm
models		models
weights/graco		weights/graco
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analysis.sh		analysis.sh
any_granularity_generator.py		any_granularity_generator.py
config.yml		config.yml
demo.py		demo.py
eval.sh		eval.sh
evaluate.py		evaluate.py
requirements.txt		requirements.txt
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

⭐ GraCo: Granularity-Controllable Interactive Segmentation

📣 Updates 🔥🔥🔥

💡 Introduction

🚀 Quick start

📍 Install

🍇 Our interactive demo

🏕️ Any-Granularity mask Generator (Optional)

🦄 Train and Evaluation

Acknowledgements

✨ Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Zhao-Yian/GraCo

Folders and files

Latest commit

History

Repository files navigation

⭐ GraCo: Granularity-Controllable Interactive Segmentation

📣 Updates 🔥🔥🔥

💡 Introduction

🚀 Quick start

📍 Install

🍇 Our interactive demo

🏕️ Any-Granularity mask Generator (Optional)

🦄 Train and Evaluation

Acknowledgements

✨ Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages