Sign in

A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment

By Homagni Saha and others
In this paper we propose a new framework - MoViLan (Modular Vision and Language) for execution of visually grounded natural language instructions for day to day indoor household tasks. While several data-driven, end-to-end learning frameworks have been proposed for targeted navigation tasks based on the vision and language modalities, performance... Show more
January 19, 2021
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment
Click on play to start listening