Interactive Object Grasping (IOG) is the task of identifying and graspin...
Language-Guided Robotic Manipulation (LGRM) is a challenging task as it
...
Visual dialog (VisDial) is a task of answering a sequence of questions
g...
Video Question Answering is a task which requires an AI agent to answer
...
Visual dialog is a task of answering a sequence of questions grounded in...
Visual dialog (VisDial) is a task which requires an AI agent to answer a...