Is it possible to describe the training data in Chinese? #1063
Unanswered
ctgushiwei
asked this question in
Q&A
Replies: 1 comment 4 replies
-
|
@ctgushiwei it could work, but you'd need a model with a tokenizer that supports chinese or a wide variety of languages and was trained with at least some chinese captions. SigLIP i18n and SigLIP2 models should be compatible and had Chinese in the language mix. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I would like to fine-tune the clip model based on my own dataset for an application in a Chinese context. Is it appropriate to describe the images in Chinese for fine-tuning the training?
Beta Was this translation helpful? Give feedback.
All reactions