Thin Wire Avoidance
Dynamic Obstacle Avoidance
Cluttered Scenes Navigation
Outdoor Alley Navigation
| Methods | Obs. | Home SR ↑ | Home SPL ↑ | Commercial SR ↑ | Commercial SPL ↑ |
|---|---|---|---|---|---|
| DD-PPO | RGB-D | 0.4 | 0.4 | 5.3 | 5.2 |
| iPlanner | Depth | 43.0 | 40.6 | 54.6 | 52.8 |
| ViPlanner | RGB-D | 45.0 | 43.2 | 63.7 | 61.9 |
| LoGoPlanner | RGB-D | 57.3 | 52.4 | 67.1 | 63.9 |
| InternVLA-N1(S1) | RGB-D | 60.0 | 55.6 | 71.4 | 68.2 |
| NavDP | RGB-D | 60.3 | 54.7 | 74.1 | 70.5 |
| SIDP | RGB-D | 63.2 | 56.5 | 81.2 | 73.4 |
| Mixed-RL | Depth | 22.0 | 10.4 | 26.9 | 22.8 |
| Ours (without VQA) | RGB | 76.4 | 73.7 | 73.9 | 72.6 |
| Ours (single view) | RGB | 79.9 | 77.6 | 86.7 | 84.7 |
| Ours | RGB | 86.3 | 81.1 | 89.9 | 85.5 |
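
The SR and SPL columns in the table above are not defined on this page. The sketch below assumes the definitions commonly used in point-goal navigation: SR is the fraction of successful episodes, and SPL weights each success by the ratio of the shortest-path length to the path length the agent actually travelled. It also assumes both are reported as percentages, which the magnitudes in the table suggest. The `Episode` class and its field names are hypothetical and are not taken from the MM-Nav code.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Episode:
    """One evaluation episode (hypothetical record layout)."""
    success: bool                # did the agent reach the goal?
    shortest_path_length: float  # geodesic start-to-goal distance
    agent_path_length: float     # distance the agent actually travelled


def success_rate(episodes: Sequence[Episode]) -> float:
    """SR in percent: share of episodes that ended in success."""
    if not episodes:
        raise ValueError("need at least one episode")
    return 100.0 * sum(e.success for e in episodes) / len(episodes)


def spl(episodes: Sequence[Episode]) -> float:
    """SPL in percent: success weighted by path efficiency."""
    if not episodes:
        raise ValueError("need at least one episode")
    total = 0.0
    for e in episodes:
        if not e.success:
            continue  # failed episodes contribute zero
        denom = max(e.agent_path_length, e.shortest_path_length)
        # Assumption: a zero-length episode that succeeds counts as fully efficient.
        total += e.shortest_path_length / denom if denom > 0 else 1.0
    return 100.0 * total / len(episodes)


episodes = [
    Episode(True, 10.0, 10.0),   # optimal path
    Episode(True, 10.0, 20.0),   # success, but twice the optimal length
    Episode(False, 10.0, 5.0),   # failure
    Episode(True, 5.0, 4.0),     # ratio is capped at 1 by the max()
]
print(success_rate(episodes), spl(episodes))
```

Under these definitions SPL can never exceed SR, which matches every row of the table.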
Indoor VQA Ablation
Outdoor VQA Ablation
Error Recovery
Narrow Passage Squeezing
@article{xu2025mm,
title={MM-Nav: Multi-View VLA Model for Robust Visual Navigation via Multi-Expert Learning},
author={Xu, Tianyu and Chen, Jiawei and Zhang, Jiazhao and Zhang, Wenyao and Qi, Zekun and Li, Minghan and Liu, Jiahang and Yue, Lu and Zhang, Zhizheng and Wang, He},
journal={arXiv preprint arXiv:2510.03142},
year={2025}
}