The large-scale visual-language pre-trained model, Contrastive Language-...
Multimodal entity linking (MEL) task, which aims at resolving ambiguous
...
Affordance-centric Question-driven Task Completion (AQTC) for Egocentric...
Multimodal Large Language Model (MLLM) recently has been a new rising
re...
Graph Neural Networks (GNNs) have been broadly applied in many urban
app...
Finding multiple temporal relationships among locations can benefit a bu...
Generating facial reactions in a human-human dyadic interaction is compl...
With the increasing demand for intelligent services of online video
plat...
Large language models (LLMs) have demonstrated powerful capabilities in ...
Automatic Micro-Expression (ME) spotting in long videos is a crucial ste...
To protect user privacy and meet legal regulations, federated learning (...
kNN-MT is a straightforward yet powerful approach for fast domain
adapta...
A good feature representation is the key to image classification. In
pra...
Although remarkable progress on the neural table-to-text methods has bee...
As one of the most important psychic stress reactions, micro-expressions...
A computational graph in a deep neural network (DNN) denotes a specific ...
Urban villages (UVs) refer to the underdeveloped informal settlement fal...
Deep neural networks (DNNs) are vulnerable to backdoor attacks. The back...
As a study on the efficient usage of data, Multi-source Unsupervised Dom...
In this paper, we study the problem of inferring spatially-varying Gauss...
Affordance-centric Question-driven Task Completion for Egocentric
Assist...
Vertical federated learning (VFL) is a privacy-preserving machine learni...
End-to-End Speech Translation (E2E-ST) has received increasing attention...
Attention mechanisms have significantly boosted the performance of video...
Feature quality has an impactful effect on recommendation performance.
T...
Object recognition from images means to automatically find object(s) of
...
End-to-end speech-to-text translation (E2E-ST) is becoming increasingly
...
With the booming of pre-trained transformers, remarkable progress has be...
Trip recommender system, which targets at recommending a trip consisting...
Recently many efforts have been devoted to applying graph neural network...
Drug discovery often relies on the successful prediction of protein-liga...
Electric Vehicle (EV) has become a preferable choice in the modern
trans...
Recent years have witnessed the rapid accumulation of massive electronic...
Out-of-town recommendation is designed for those users who leave their
h...
Adaptive learning rate methods have been successfully applied in many fi...
Accurately predicting the binding affinity between drugs and proteins is...
Reinforcement learning (RL) has shown great promise in optimizing long-t...
Accurate and fast 3D object detection from point clouds is a key task in...
Learning informative representations of users and items from the interac...
Recent years have witnessed the remarkable developments made by deep lea...
Identifying the arrival times of seismic P-phases plays a significant ro...
The prevalent object detectors to date, such as Faster R-CNN and RetinaN...
The development of intelligent traffic light control systems is essentia...
For a long time, object detectors have suffered from extreme imbalance
b...
Recent studies have consistently given positive hints that morphology is...
Recently, the Network Representation Learning (NRL) techniques, which
re...
The wide spread use of online recruitment services has led to informatio...
To cope with the accelerating pace of technological changes, talents are...