When doing some work testing multimodal transformer models in the medical field sometimes the models in question use Hybrid-Clip variants, such as these: https://huggingface.co/models?search=medclip. It would be great if some models like these could be supported.
More importantly, is there any way to save evaluation data to a csv or json file?