What user_query can I use? #23
Comments
You can simply use
But when I tested several images, I found that the model outputs a localization result whether or not the object is actually present in the test image. Is there a setting I can adjust, or is this caused by model hallucination?
Yes, this hallucination is probably caused by the training data: for grounding training, we only had positive QA pairs, i.e., the object mentioned in the question is guaranteed to appear in the image. To remedy it, you can curate some negative QA pairs and finetune the model.
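For what it's worth, here is a minimal sketch of how such negative QA pairs could be curated from existing annotations. It is not Groma's actual data pipeline: the function name `build_negative_pairs`, the question template, the refusal answer, and the record layout are all assumptions made for illustration, and you would need to adapt them to whatever format the finetuning script expects.

```python
import random


def build_negative_pairs(image_labels, vocabulary, per_image=2, seed=0):
    """Create negative grounding QA pairs for objects absent from each image.

    image_labels: dict mapping an image id to the set of labels present in it.
    vocabulary:   iterable of all candidate object labels.
    The question/answer wording below is hypothetical, not Groma's format.
    """
    rng = random.Random(seed)
    pairs = []
    for image_id, present in sorted(image_labels.items()):
        # Candidate negatives are labels that do not occur in this image.
        absent = sorted(set(vocabulary) - set(present))
        for label in rng.sample(absent, min(per_image, len(absent))):
            pairs.append({
                "image": image_id,
                "label": label,
                "question": f"Locate {label} in the image.",
                "answer": f"There is no {label} in the image.",
            })
    return pairs
```

Mixing these records with the original positive pairs gives the model examples where the correct behavior is to decline rather than output a box. The ratio of negatives to positives is something you would have to tune.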
If I need to fine-tune the model to locate target elements in an image, are there particular hyperparameters I should focus on?
If I want to provide the object's label and bounding box to Groma to help generate more accurate image descriptions, how should I structure the prompt?
If I want to locate a specific target, such as "a person wearing a yellow hat", what user_query can I use at inference?