FTIIBench

(ARXIV24) This is the official code repository for "FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion."

Dataset

The text of FTII-Bench could be download from Google Drive

The images of FTII-Bench could be download from Google Drive

Set the appropriate paths in the run_eval_fi and run_eval_sc scripts.

bash run_eval_fi.sh # for flow insertion tasks
bash run_eval_sc.sh # for single choice tasks

For evaluating with BGE models You can run ./mllm_eval/bge_eval.ipynb in the Jupyter environment.

Thanks to the open-source code from Mantis

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
ftii_data		ftii_data
images		images
mllm_eval		mllm_eval
newsbench		newsbench
LICENSE		LICENSE
README.md		README.md
conversation.py		conversation.py
requirements.txt		requirements.txt
utils.py		utils.py