Video understanding tasks take many forms, from action detection to visu...
Modern deep learning requires large-scale extensively labelled datasets ...
Methods for object detection and segmentation rely on large scale
instan...
This manuscript describes our approach for the Visual Dialog Challenge 2...
Few-shot learning is a fundamental task in computer vision that carries ...
Neural networks trained on datasets such as ImageNet have led to major
a...