Skip to content

🌈 Introducing DiverseVAR, a training-free framework that unleashes the inherent generative diversity of Visual Autoregressive models while preserving fidelity and text–image alignment. #177

@wangtong627

Description

@wangtong627

Thanks a lot to the inspiring progress in Visual Autoregressive (VAR) models!

We introduce DiverseVAR, a simple yet effective training-free framework that restores the lost generative diversity in VAR models.
By strategically manipulating the pivotal component at early scales, DiverseVAR significantly boosts diversity without harming fidelity or text-image alignment.
On both Infinity-2B and Infinity-8B, it consistently improves Recall, Coverage, and FID while keeping CLIP scores nearly unchanged.

Image

Arxiv: https://arxiv.org/abs/2511.17074
Github: https://github.com/wangtong627/DiverseVAR
Huggingface Daily Paper: https://huggingface.co/papers/2511.17074

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions