With the rise and development of deep learning over the past decade, the...
Despite having impressive vision-language (VL) pretraining with BERT-bas...
Due to the widespread applications in real-world scenarios, metro riders...
It is widely believed that video captioning is a fundamental but challengi...
Deep learning approaches have been widely applied in sequence modeling pr...
Creating aesthetically pleasing pieces of art, including music, has been...