Shared benchmark problems have historically been a fundamental driver of...
Most sign language translation (SLT) methods to date require the use of ...
Speech Recognition builds a bridge between the multimedia streaming
(aud...
Product Retrieval (PR) and Grounding (PG), aiming to seek image and
obje...
Dueling bandits are widely used to model preferential feedback that is
p...
Multi-media communications facilitate global interaction among people.
H...
We present a multi-resolution approach for constructing model-based
simu...
In heterogeneous rank aggregation problems, users often exhibit various
...
In this paper, we focus on the problem of applying the transformer struc...
We propose the Heterogeneous Thurstone Model (HTM) for aggregating ranke...
This paper addresses the challenging task of video captioning which aims...