We introduce a video framework for modeling the association between verb...
Synthesizing 3D human avatars interacting realistically with a scene is ...
Many machine learning methods operate by inverting a neural network at
i...
For computer vision systems to operate in dynamic situations, they need ...
Recent research in Visual Question Answering (VQA) has revealed
state-of...
We present a general computational approach that enables a machine to
ge...
Existing VQA datasets contain questions with varying levels of complexit...
An approach to make text visually appealing and memorable is semantic
re...