The Meta Video Dataset (MetaVD) provides annotated relations between act...
Barlow Twins and VICReg are self-supervised representation learning mode...
In recent years, automatic video caption generation has attracted consid...
This paper proposes an inexpensive way to learn an effective dissimilari...
In recent years, automatic generation of image descriptions (captions), ...
This paper discusses the effect of hubness in zero-shot learning, when r...