Methods for object detection and segmentation often require abundant ins...
Video Instance Segmentation (VIS) aims to simultaneously classify, segme...
Traditional scene graph generation methods are trained using cross-entro...
Prior work in multi-task learning has mainly focused on predictions on a...