research
∙
06/27/2021
Building a Video-and-Language Dataset with Human Actions for Multimodal Logical Inference
This paper introduces a new video-and-language dataset with human action...
research
∙
06/10/2019