UNO-Bench Released: One-Stop All-Modality Benchmark
The LongCat team announces UNO-Bench, a unified benchmark with strong Chinese support that evaluates single-modality and omni-modality intelligence in one framework. It reveals the Combination Law — weaker models show bottlenecks while stronger models achieve synergistic gains.
- 1,250 omni samples + 2,480 single-modality samples; 98% require cross-modal fusion
- Open-ended multi-step (MO) questions with weighted scoring; automatic scoring model reaches 95% accuracy
- Audio-visual decoupling and ablations to prevent shortcuts and enforce real fusion
- Cluster-guided sampling reduces compute by >90% with rank consistency
Combination Law (power-law): POmni ≈ 1.0332 · (PA × PV)^2.1918 + 0.2422