RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Paper • 2601.05241 • Published 23 days ago • 24
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning Paper • 2509.15937 • Published Sep 19, 2025 • 20
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published Aug 20, 2025 • 69
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Paper • 2507.15454 • Published Jul 21, 2025 • 7
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation Paper • 2308.07498 • Published Aug 14, 2023
Evolving Symbolic 3D Visual Grounder with Weakly Supervised Reflection Paper • 2502.01401 • Published Feb 3, 2025 • 1