sora2 has great performance in sound. However, most video generation models are not support sound. refer : https://arxiv.org/abs/2511.04570 , their C.3.1 Example: GSM8K Problem and B.4.1 Prompts for Evaluation