FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 11 days ago • 93
PulseLM: A Foundation Dataset and Benchmark for PPG-Text Learning Paper • 2603.03331 • Published Feb 10 • 2
MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks Paper • 2602.16313 • Published Feb 18 • 3
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks Paper • 2604.11778 • Published 6 days ago • 8
StoryBlender: Inter-Shot Consistent and Editable 3D Storyboard with Spatial-temporal Dynamics Paper • 2604.03315 • Published 18 days ago • 2
Personalizing Text-to-Image Generation to Individual Taste Paper • 2604.07427 • Published 11 days ago • 8
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Paper • 2507.08128 • Published Jul 10, 2025 • 14
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published Mar 6, 2025 • 28
ArtVIP: Articulated Digital Assets of Visual Realism, Modular Interaction, and Physical Fidelity for Robot Learning Paper • 2506.04941 • Published Jun 5, 2025 • 2
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment Paper • 2604.11689 • Published 6 days ago • 11
ERNIE-Image Collection The serieas of image generation models, including text2img、img2img. • 2 items • Updated 5 days ago • 22
FoundationalASSIST: An Educational Dataset for Foundational Knowledge Tracing and Pedagogical Grounding of LLMs Paper • 2602.00070 • Published Jan 20 • 3
Physion-Eval: Evaluating Physical Realism in Generated Video via Human Reasoning Paper • 2603.19607 • Published 30 days ago • 3
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions Paper • 2502.13124 • Published Feb 18, 2025 • 8
Optimization-Guided Diffusion for Interactive Scene Generation Paper • 2512.07661 • Published Dec 8, 2025 • 5