Skip to content

Deep Compression Vector Quantize AutoEncoder? #163

Open
@markson14

Description

@markson14

It's a very impressive job! Well done.

I am wondering if you have conducted any further experiments on vector quantization. The DCAE-f128 can compress a 256x256 image into a 2x2 feature map, resulting in 4 tokens with VQ. This could lead to significant acceleration in LLM training and inference, paving the way for real-time video generation. Feel free to ask if you need any more adjustments!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions