Attribute-specific fashion retrieval (ASFR) is a challenging information...
This paper addresses temporal sentence grounding (TSG). Although exi...
Given an untrimmed video, temporal sentence grounding (TSG) aims to loca...
Temporal sentence localization in videos (TSLV) aims to retrieve the mos...
Temporal sentence grounding (TSG) aims to localize the temporal segment ...
Given an untrimmed video, temporal sentence localization (TSL) aims to l...
Temporal sentence grounding (TSG) aims to identify the temporal boundary...
Distantly-Supervised Named Entity Recognition (DS-NER) effectively allev...
As an increasingly popular task in multimedia information retrieval, vid...
This paper studies the multimedia problem of temporal sentence grounding...
Spatial-Temporal Video Grounding (STVG) is a challenging task which aims...
Temporal video grounding (TVG) aims to localize a target segment in a vi...
Temporal sentence grounding (TSG) is crucial and fundamental for video u...
A key solution to temporal sentence grounding (TSG) lies in how to lea...
We address the problem of temporal sentence localization in videos (TSLV...
This paper addresses the problem of temporal sentence grounding (TSG), w...
Although deep learning based methods have achieved great progress in uns...
Facial expression recognition (FER), aiming to classify the expression p...
Query-based moment localization is a new task that localizes the best ma...
Disease diagnosis on chest X-ray images is a challenging multi-label cla...
We address the problem of semi-supervised video object segmentation (VOS...