UniVA-Bench Leaderboard

UniVA-Bench is an agent-oriented benchmark for unified video intelligence, covering Understanding, Generation, Editing, Segmentation, and agentic probing. We report CLIP / DINO / MLLM preference, segmentation J/F/J&F, and long-video QA accuracy, following the evaluation protocol described in our paper.

Table 1: Comparison across LongText2Video, Entities2Video and Video2Video