Prosodic phrasing is crucial to the naturalness and intelligibility of
e...
This paper is concerned with deep generative models (DGMs) for unsupervi...
Survival analysis plays a crucial role in many healthcare decisions, whe...
Wave propagation problems are typically formulated as partial differenti...
The application of video captioning models aims at translating the conte...
Learning contrastive representations from pairwise comparisons has achie...
In-Network Computing (INC) has found many applications for performance b...
Social media platforms such as Instagram and Twitter have emerged as cri...
Dynamic facial expression recognition (DFER) is essential to the develop...
Multimodal emotion recognition is an active research topic in artificial...
Generating realistic human motion from given action descriptions has
exp...
Transformer is popular in recent 3D human pose estimation, which utilize...
With rich visual data, such as images, becoming readily associated with
...
This technical report presents our solution to Ball Action Spotting in
v...
Large language models (LLMs) encode a vast amount of world knowledge acq...
Popularity bias is a persistent issue associated with recommendation sys...
As Deepfake contents continue to proliferate on the internet, advancing ...
Clothes-invariant feature extraction is critical to the clothes-changing...
This technical report presents our Restormer-Plus approach, which was
su...
Over the past few decades, multimodal emotion recognition has made remar...
Detection Transformer (DETR) is a Transformer architecture based object
...
Offline meta reinforcement learning (OMRL) aims to learn transferrable
k...
It is a challenging problem to predict trends of futures prices with
tra...
In this paper, we consider fine-grained image object detection in
resour...
At the intersection of computational neuroscience (CN) and data mining (...
Most of the existing audio-driven 3D facial animation methods suffered f...
We propose a novel 3d colored shape reconstruction method from a single ...
We study stochastic delayed feedback in general multi-agent sequential
d...
Noisy partial label learning (noisy PLL) is an important branch of weakl...
Recent years have witnessed many successful applications of contrastive
...
In this paper, we investigate the feasibility, robustness and optimizati...
The discovery of drug-target interactions (DTIs) is a pivotal process in...
Partial label learning (PLL) is a typical weakly supervised learning, wh...
In light of the smoothness property brought by skip connections in ResNe...
Deep learning has achieved tremendous success in computer vision, while
...
With the proliferation of user-generated online videos, Multimodal Senti...
Graph-based models have achieved great success in person re-identificati...
The decision-making of TBM operating parameters has an important guiding...
We study the change point detection problem for high-dimensional linear
...
In this paper, we propose the solution to the Multi-Task Learning (MTL)
...
A covert attack method often used by APT organizations is the DNS tunnel...
Existing face forgery detection methods usually treat face forgery detec...
Training an ensemble of different sub-models has empirically proven to b...
To break the bottlenecks of mainstream cloud-based machine learning (ML)...
Artificial neural networks have realized incredible successes at image
r...
Image-to-image translation is an important and challenging problem in
co...
Recent online Multi-Object Tracking (MOT) methods have achieved desirabl...
How to effectively sample high-quality negative instances is important f...
Input features play a crucial role in the predictive performance of DNN-...
Speech emotion recognition (SER) is a crucial research topic in
human-co...