- [2025-03-18] We release the data and example evaluation code!
Impossible videos refer to videos displaying counterfactual and anti-reality scenes that are impossible in the real world. We show some video examples below. Please visit our website to find more examples.
(Demo video: imp_vid_demo.mp4)
Impossible videos can be a touchstone for advanced video models. As out-of-real-world-distribution data, they require a model not to simply memorize real-world data and retrieve similar information based on the input, but to genuinely learn from real-world data and reason upon the input.
This project aims to advance video research by answering the following important questions:
- Can today's video generation models effectively follow prompts to generate impossible video content?
- Are today's video understanding models good enough for understanding impossible videos?
To answer these questions, we introduce IPV-Bench, a novel benchmark designed to evaluate and foster progress in video understanding and generation.
- §IPV Taxonomy: IPV-Bench is underpinned by a comprehensive taxonomy encompassing 4 domains and 14 categories. It features diverse scenes that defy physical, biological, geographical, or social laws.
- §IPV-Txt Prompt Suite: A prompt suite is constructed based on the taxonomy to evaluate video generation models, challenging their prompt following and creativity capabilities.
- §IPV-Vid Videos: A video benchmark is curated to assess Video-LLMs on their ability to understand impossible videos, which particularly requires reasoning over temporal dynamics and world knowledge.
First, go to Hugging Face and download our data and code, including videos, task files, and example evaluation code. The task files and example files can also be found in this GitHub repo.
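If you prefer a scripted download, a minimal sketch using `huggingface_hub` might look like the following; the `repo_id` below is a placeholder, so please use the actual dataset repo linked from this README.

```python
# Minimal sketch: download the benchmark with huggingface_hub.
# The repo_id is a placeholder (assumption), not the official one.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="your-org/IPV-Bench",  # hypothetical repo id; replace with the real one
    repo_type="dataset",
    local_dir="./ipv_bench",
)
print("Benchmark downloaded to:", local_dir)
```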
- Use `example_read_prompt.py` to read the `ipv_txt_prompt_suite.json` file and get the text prompts.
- Use the text prompts to generate videos with your models.
- Annotate the `visual quality` and `prompt following` fields for each video.
- Compute the `IPV Score` as the percentage of videos that exhibit both high visual quality and good prompt following (see the sketch after this list).
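The sketch below illustrates this workflow end to end. The JSON structure and the annotation field names (`visual_quality`, `prompt_following`) are assumptions made for illustration; adapt them to the actual files and your annotation format.

```python
# Sketch of the text-to-video evaluation loop described above (assumed field names).
import json

# Read the prompt suite (structure assumed: a list of prompt entries).
with open("ipv_txt_prompt_suite.json", "r") as f:
    prompts = json.load(f)

# Suppose each generated video has been annotated with two boolean fields.
# The entries below are dummy examples; in practice, load your own annotations.
annotations = [
    {"visual_quality": True, "prompt_following": True},
    {"visual_quality": True, "prompt_following": False},
    {"visual_quality": False, "prompt_following": True},
]

# IPV Score: percentage of videos that are both high quality and follow the prompt.
good = sum(a["visual_quality"] and a["prompt_following"] for a in annotations)
ipv_score = 100.0 * good / len(annotations)
print(f"IPV Score: {ipv_score:.1f}%")
```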
🛠️ In this study, we employ human annotation to provide reliable insights into the models. We are still polishing an automatic evaluation framework, which will be open-sourced in the future.
- The benchmark involves three tasks: Judgement, Multi-choice QA, and Open-ended QA.
- Navigate to example_eval/eval_judgement.py, example_eval/eval_mcqa.py, and example_eval/eval_openqa.py for each task.
- The example code implements the full evaluation pipeline. To evaluate your model, simply modify the `inference_one()` function to produce your model's output, as sketched below.
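A hedged sketch of what an adapted `inference_one()` might look like; the exact signature in `example_eval/*.py` may differ, and the model call is a placeholder for your own Video-LLM inference API.

```python
# Hypothetical adaptation of inference_one() for your own model.
def inference_one(video_path: str, question: str) -> str:
    """Run your video-language model on one (video, question) pair."""
    # Replace this stub with your model's actual inference call, e.g.:
    # answer = my_video_llm.generate(video=video_path, prompt=question)
    answer = "yes"  # placeholder output
    return answer
```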
You are welcome to discuss with us and help continuously improve the quality of impossible videos. Reach us via the WeChat QR code below!
If you find our work helpful, please kindly star this repo and consider citing our paper.
@misc{bai2025impossible,
title={Impossible Videos},
author={Zechen Bai and Hai Ci and Mike Zheng Shou},
year={2025},
eprint={xxxx.xxxxx},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/xxxx.xxxxx},
}