Recent advances in neural sequence-to-sequence models have led to promis...
Understanding and conversing about dynamic scenes is one of the key
capa...
An important yet rarely tackled problem in dialogue state tracking (DST)...
Speech recognition in cocktail-party environments remains a significant
...
City-identification of videos aims to determine the likelihood of a vide...