InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation Paper • 2507.17520 • Published Jul 23, 2025 • 15
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts Paper • 2509.10813 • Published Sep 13, 2025 • 31
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities Paper • 2507.13019 • Published Jul 17, 2025 • 2
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy Paper • 2510.13778 • Published Oct 15, 2025 • 17
CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation Paper • 2506.19816 • Published Jun 24, 2025
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Paper • 2506.10966 • Published Jun 12, 2025
VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs Paper • 2512.22342 • Published Dec 26, 2025 • 10
InternVLA-A1: Unifying Understanding, Generation and Action for Robotic Manipulation Paper • 2601.02456 • Published Jan 5 • 7
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds Paper • 2604.08544 • Published Apr 9 • 16
VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation Paper • 2512.19021 • Published Dec 22, 2025
EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies Paper • 2606.18239 • Published 10 days ago • 15
EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies Paper • 2606.18239 • Published 10 days ago • 15
M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM Paper • 2603.16844 • Published Mar 17 • 11
RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Paper • 2601.05241 • Published Jan 8 • 24
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning Paper • 2509.15937 • Published Sep 19, 2025 • 21
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published Aug 20, 2025 • 69