Skip to content

(ARXIV24) This is the official code repository for "FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion"

License

Notifications You must be signed in to change notification settings

IAAR-Shanghai/FTIIBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FTIIBench

(ARXIV24) This is the official code repository for "FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion."

Dataset

The text of FTII-Bench could be download from Google Drive

The images of FTII-Bench could be download from Google Drive

Note that the data is only used for research purposes!

Evaluation

  1. Set the appropriate paths in the run_eval_fi and run_eval_sc scripts.
    bash run_eval_fi.sh # for flow insertion tasks
    bash run_eval_sc.sh # for single choice tasks
  2. For evaluating with BGE models You can run ./mllm_eval/bge_eval.ipynb in the Jupyter environment.

Acknowledgement

Thanks to the open-source code from Mantis

About

(ARXIV24) This is the official code repository for "FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published