Open
Description
Thank you very much for your outstanding work!
I have a question about a result I have not been able to reproduce. When I fine-tune your RS5M model on RSICD or RSITMD using the settings described in the paper (InfoNCE loss, lr=1e-6), I do not reach the expected performance. Taking RSICD as an example: the paper and the RS5M RET-2 weights you provided give an accuracy of around 38, but when I fine-tune starting from my own RS5M ViT-B/32 version, the result is only around 34. Could you provide more details on the fine-tuning setup for RET-2 or RSICD so that I can replicate the process more closely? Thank you very much.
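For reference, here is a minimal sketch of a symmetric InfoNCE loss of the kind commonly used for CLIP-style image-text fine-tuning. It is written in plain Python for clarity and is not the authors' code. The function names and the temperature of 0.07 are assumptions made for illustration; only the loss type and lr=1e-6 come from the paper's description.

```python
import math


def _normalize(vec):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def _diag_cross_entropy(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def info_nce(image_embs, text_embs, temperature=0.07):
    """Symmetric InfoNCE over a batch of paired image/text embeddings.

    temperature=0.07 is an assumed placeholder, not a value from the paper.
    """
    imgs = [_normalize(v) for v in image_embs]
    txts = [_normalize(v) for v in text_embs]
    # Cosine similarity between every image and every text, scaled by temperature.
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in txts]
        for img in imgs
    ]
    logits_t = [list(col) for col in zip(*logits)]  # text-to-image direction
    return 0.5 * (_diag_cross_entropy(logits) + _diag_cross_entropy(logits_t))


if __name__ == "__main__":
    aligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
    swapped = info_nce([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
    print(aligned < swapped)
```

If the actual fine-tuning differs from this sketch (for example in temperature handling, batch size, or how negatives are gathered across devices), that may account for part of the gap.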