This paper presents a controllable text-to-video (T2V) diffusion model, ...
Sign Language Production (SLP) aims to translate spoken languages into s...
Conditional masked language models (CMLM) have shown impressive progress...
Continuous sign language recognition (cSLR) is a public significant task...
Since the superiority of Transformer in learning long-term dependency, t...
Non-autoregressive models generate target words in a parallel way, which...