Vision-and-language navigation (VLN) enables an agent to navigate to a
...
Obtaining accurate 3D object poses is vital for numerous computer vision...
Adversarial detection aims to determine whether a given sample is an
adv...
Vision-and-language navigation (VLN) is the task of enabling an embodied a...