Hi there,
I noticed that SigLIP2 was introduced in release 2.31. Are there any plans to include the training code for this model as well? Given the similarities between the two, CoCa seems like a good reference point, as mentioned in Hugging Face's explanation.
Additionally, does this mean that the pretrained weights for the AR decoder and the EMA image encoder will also be released?
Looking forward to your response. Thanks!