TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Reference

Zhang, Wenqiang, Zilong Huang, Guozhong Luo, Tao Chen, Xinggang Wang, Wenyu Liu, Gang Yu,and Chunhua Shen. "TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12083-12093. 2022.

Performance

ADE20k

Model	Backbone	Resolution	Training Iters	mIoU	mIoU (flip)	mIoU (ms+flip)	Links
TopFormer-Base	topformer	512x512	160000	38.28%	38.59%	-	model \| log \| vdl
TopFormer-Small	topformer	512x512	160000	35.60%	35.83%	-	model \| log \| vdl
TopFormer-Tiny	topformer	512x512	160000	32.49%	32.75%	-	model \| log \| vdl

Note that, the input resulution of TopFormer should be a multiple of 32.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Reference

Performance

ADE20k

Files

README.md

Latest commit

History

README.md

File metadata and controls

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Reference

Performance

ADE20k