Visual Dialog requires an agent to engage in a conversation with humans
...
Cross-lingual pre-training has achieved great successes using monolingua...
Visual dialogue is a challenging task that needs to extract implicit
inf...
Visual Dialogue task requires an agent to be engaged in a conversation w...
Different from Visual Question Answering task that requires to answer on...