Abstract: Vision-and-Language Navigation tasks require an agent to navigate to a destination following natural language instructions. We focus on a challenging VLN dataset, Aerial Vision-and-Dialog ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results