UIUC · Computer Vision

Multi-Perspective Vision-Based Navigation

We extend visual navigation to learn from multiple camera perspectives, adding third-person context to reduce partial observability.

Felipe Felix Arias, Victor Gonzalez

University of Illinois at Urbana-Champaign

[Figure: Multi-perspective navigation architecture]

Overview

Third-person context for navigation.

Visual navigation is hard because an agent observes only a small part of its environment at any moment. We fuse first-person and third-person views so that the learned policy can draw on context from both.

Multi-View Inputs

RGB, segmentation, and depth across two perspectives.

[Figure: RGB, segmentation, and depth images for view A and view B]

Top row: first-person view. Bottom row: third-person view from a second robot in the same environment.
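
To make the input layout concrete, here is a minimal sketch of how the three modalities from the two perspectives could be packed into a single observation array. The page does not specify how the views are actually fused, so everything here is an assumption for illustration: the function names (`stack_modalities`, `build_observation`, `fake_view`), the 64x64 resolution, the channel order, and the choice of simple channel-wise concatenation.

```python
import numpy as np

def stack_modalities(rgb, seg, depth):
    """Concatenate one view's RGB, segmentation, and depth along the channel axis.

    Assumed shapes (hypothetical): rgb (H, W, 3) uint8, seg (H, W) class ids,
    depth (H, W) floats.
    """
    rgb = np.asarray(rgb, dtype=np.float32) / 255.0          # scale RGB to [0, 1]
    seg = np.asarray(seg, dtype=np.float32)[..., None]       # (H, W) -> (H, W, 1)
    depth = np.asarray(depth, dtype=np.float32)[..., None]   # (H, W) -> (H, W, 1)
    return np.concatenate([rgb, seg, depth], axis=-1)        # (H, W, 5)

def build_observation(first_person, third_person):
    """Stack the two perspectives into one array of shape (2, H, W, 5)."""
    views = [stack_modalities(**first_person), stack_modalities(**third_person)]
    return np.stack(views, axis=0)

def fake_view(h=64, w=64, seed=0):
    """Random stand-in for one camera's outputs (not real data)."""
    rng = np.random.default_rng(seed)
    return {
        "rgb": rng.integers(0, 256, size=(h, w, 3), dtype=np.uint8),
        "seg": rng.integers(0, 10, size=(h, w)),
        "depth": rng.random((h, w)),
    }

# Index 0 holds the first-person view, index 1 the third-person view.
obs = build_observation(fake_view(seed=0), fake_view(seed=1))
print(obs.shape)  # (2, 64, 64, 5)
```

Keeping the perspectives on a separate leading axis, rather than merging them into one 10-channel image, leaves the choice open between a shared encoder applied per view and a separate encoder for each view.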