The goal of this work is to understand the way actions are performed in...
Precisely naming the action depicted in a video can be a challenging and...
Generative models for audio-conditioned dance motion synthesis map music...
Recognising actions in videos relies on labelled supervision during trai...
This work introduces verb-only representations for actions and interacti...
First-person vision is gaining interest as it offers a unique viewpoint ...
Manual annotations of temporal bounds for object interactions (i.e. star...
This work deviates from easy-to-define class boundaries for object inter...
We present SEMBED, an approach for embedding an egocentric object intera...