PLAN-Lab/HalluSegBench
Viewer
•
Updated
•
5.02k
•
95
None defined yet.
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching
PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation