UniVA-Bench Leaderboard
UniVA-Bench is an agent-oriented benchmark for unified video intelligence, covering Understanding, Generation, Editing, Segmentation, and agentic probing. Following the evaluation protocol described in our paper, we report CLIP, DINO, and MLLM preference scores; J, F, and J&F for segmentation; and QA accuracy for long-video understanding.
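As a rough illustration of the segmentation metrics named above, the sketch below computes region similarity J as the Jaccard index (intersection over union) of two binary masks and J&F as the mean of J and a boundary F-measure F. This follows the common video object segmentation convention and is an assumption here, not the benchmark's own evaluation code; the function names `region_similarity` and `j_and_f` are hypothetical, and F is taken as a given number rather than computed from mask boundaries.

```python
def region_similarity(pred, gt):
    """Jaccard index (IoU) between two binary masks given as nested lists of 0/1."""
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


def j_and_f(j, f):
    """J&F as the arithmetic mean of J and F (assumed convention)."""
    return (j + f) / 2.0


# Toy 2x3 masks: 2 overlapping pixels, 4 pixels in the union.
pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]
j = region_similarity(pred, gt)
```

In practice J and F are usually averaged over all frames and objects before being combined; the per-mask version here only shows the arithmetic.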
Table 1: Comparison across LongText2Video, Entities2Video and Video2Video
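Method | LongText2Video CLIP | LongText2Video DINO | LongText2Video MLLM pref. | Entities2Video CLIP | Entities2Video DINO | Entities2Video MLLM pref. | Video2Video CLIP | Video2Video DINO | Video2Video MLLM pref. |
LTX-Video | 0.281 | 0.903 | 3.333 | 0.287 | 0.88 | 1.789 | 0.226 | 0.894 | 4.068 |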