Synthesizing realistic videos according to a given speech is still an op...
Positive-Unlabeled (PU) Learning is a challenge presented by binary
clas...
We consider the problem of sequential recommendation, where the current
...
3D facial avatar reconstruction has been a significant research topic in...
Instruction tuning has been shown to be able to improve cross-task
gener...
In this paper, we study video synthesis with emphasis on simplifying the...
Cloth changing person re-identification(Re-ID) can work under more
compl...
Recent works on multi-modal emotion recognition move towards end-to-end
...
Most existing deep-learning-based single image dynamic scene blind deblu...
Multimodal fine-grained sentiment analysis has recently attracted increa...
Finding relevant moments and highlights in videos according to natural
l...
Multimodal sentiment analysis has attracted increasing attention and lot...
Fragmentary data is becoming more and more popular in many areas which b...
In the current salient object detection network, the most popular method...
The huge domain gap between sketches and photos and the highly abstract
...
Auxiliary losses commonly used in image inpainting lead to better
recons...
Soft actuators have shown great advantages in compliance and morphology
...
An embodied task such as embodied question answering (EmbodiedQA), requi...
Visual recognition is currently one of the most important and active res...
Localizing individuals in crowds is more in accordance with the practica...
Recently, the problem of inaccurate learning targets in crowd counting d...
Recently, most siamese network based trackers locate targets via object
...
In this letter, we investigate a UAV-enabled communication system, where...
Video analysis has been moving towards more detailed interpretation (e.g...
Existing Multiple-Object Tracking (MOT) methods either follow the
tracki...
We aim to improve the performance of Multiple Object Tracking and
Segmen...
We aim to improve the performance of Multiple Object Tracking and
Segmen...
Person re-identification (Reid) is now an active research topic for AI-b...
Most of Multiple Object Tracking (MOT) approaches compute individual tar...
Three dimensional convolutional neural networks (3DCNNs) have been appli...
This work proposes a new human-related video processing task named 3D
pa...
Compared with Generative Adversarial Networks (GAN), the Energy-Based
ge...
Although skeleton-based action recognition has achieved great success in...
For real-time semantic video segmentation, most recent works utilise a
d...
We propose an image steganographic algorithm called EncryptGAN, which
di...
An effective and efficient person re-identification (ReID) algorithm wil...
The aim of this work is learning to reshape the object in an input image...
Current approaches have made great progress on image-to-image translatio...
FRAME (Filters, Random fields, And Maximum Entropy) is an energy-based
d...
Active learning aims to reduce annotation cost by predicting which sampl...
Humans can naturally understand an image in depth with the aid of rich
k...
Active learning (AL) aims to enable training high performance classifier...
In this paper, we propose the Lipschitz margin ratio and a new metric
le...
In this paper, we strive to answer two questions: What is the current st...
Person Re-identification (re-id) faces two major challenges: the lack of...
We investigate the scalable image classification problem with a large nu...
Sparse representation based classification (SRC) has been proved to be a...