Sign in

Correctable Landmark Discovery via Large Models for Vision-Language Navigation

By Bingqian Lin and others
Vision-Language Navigation (VLN) requires the agent to follow language instructions to reach a target position. A key factor for successful navigation is to align the landmarks implied in the instruction with diverse visual observations. However, previous VLN agents fail to perform accurate modality alignment especially in unexplored scenes, since they... Show more
June 5, 2024
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Click on play to start listening