Contrastive Language-Audio Pretraining (CLAP) is pre-trained to associat...
Learning meaningful frame-wise features on a partially labeled dataset i...
Autonomous navigation of ground robots on uneven terrain is being consid...
Cloth-changing Person Re-Identification (CC-ReID) is a challenging task ...
Text-based audio generation models have limitations as they cannot encom...
Recently, the ability of language models (LMs) has attracted increasing
...
Despite significant progress in single image-based 3D human mesh recover...
The current National Airspace System (NAS) is reaching capacity due to
i...
When doing private domain marketing with cloud services, the merchants
u...
Chest X-ray (CXR) anatomical abnormality detection aims at localizing an...
Considering the instance-level discriminative ability, contrastive learn...
Underwater object detection (UOD) plays a significant role in aquacultur...
Existing skeleton-based action recognition methods typically follow a
ce...
We report competitive results on RobustBench for CIFAR and SVHN using a
...
Unsupervised hashing has received extensive research focus on the past
d...
Recently, deep learning-based facial landmark detection has achieved
sig...
Underwater object detection (UOD) is crucial for marine economic develop...
Given the massive cost of language model pre-training, a non-trivial
imp...
Most existing task-oriented dialog (TOD) systems track dialog states in ...
Energy-based language models (ELMs) parameterize an unnormalized distrib...
Cross-modal retrieval has become a prominent research topic in computer
...
Despite substantial progress in 3D human pose estimation from a single-v...
Deep neural networks have been applied in many computer vision tasks and...
3D human mesh recovery from a 2D pose plays an important role in various...
Multimodal magnetic resonance imaging (MRI) provides complementary
infor...
Occluded person re-identification (Re-ID) is a challenging problem due t...
3D human pose estimation errors would propagate along the human body top...
Sequential recommendation is an important task to predict the next-item ...
The crossMoDA challenge aims to automatically segment the vestibular
sch...
Recent deep metric learning (DML) methods typically leverage solely clas...
Language modeling on large-scale datasets leads to impressive performanc...
In this paper, we describe in detail our system for DCASE 2022 Task4. Th...
Building user simulators (USs) for reinforcement learning (RL) of
task-o...
Recently, there has been progress in supervised funetuning pretrained GP...
Robot actuators directly affect the performance of robots, and robot dri...
Recently, there have merged a class of task-oriented dialogue (TOD) data...
Modular design is the foundation of on orbit construction technology of ...
Forward and backward reaching inverse kinematics (FABRIK) is a heuristic...
In this paper, we describe Apollo, to the best of our knowledge, the wor...
Developing semi-supervised task-oriented dialog (TOD) systems by leverag...
Demystifying the delay propagation mechanisms among multiple airports is...
Self-supervised skeleton-based action recognition with contrastive learn...
A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Syste...
Complicated underwater environments bring new challenges to object detec...
Graph convolutional networks have been widely used for skeleton-based ac...
Deep convolutional neural networks (CNNs) have been widely used in vario...
Modern multi-layer perceptron (MLP) models have shown competitive result...
Arbitrary-oriented object detection (AOOD) is a challenging task to dete...
Recently, Transformer based pretrained language models (PLMs), such as G...
Molecular subtypes of breast cancer are important references to personal...