Adversarial robustness poses a critical challenge in the deployment of d...
Causal Video Question Answering (CVidQA) queries not only association or...
Obtaining large-scale labeled object detection dataset can be costly and...
The large amount of data collected by LiDAR sensors brings the issue of ...
Nowadays, the need for user editing in a 3D scene has rapidly increased ...
While recent large-scale video-language pre-training made great progress...
Point cloud registration is a crucial problem in computer vision and
rob...
A spatial AI that can perform complex tasks through visual signals and
c...
Fair Active Learning (FAL) utilized active learning techniques to achiev...
Robotic peg-in-hole assembly remains a challenging task due to its high
...
Object detection in low-light conditions remains a challenging but impor...
Monocular 3D object detection is an important yet challenging task in
au...
In the field of domain adaptation, a trade-off exists between the model
...
In few-shot imitation learning (FSIL), using behavioral cloning (BC) to ...
Shifts Challenge: Robustness and Uncertainty under Real-World Distributi...
Anomaly awareness is an essential capability for safety-critical applica...
Vehicle velocity and inter-vehicle distance estimation are essential for...
Recent work indicates that, besides being a challenge in producing
perce...
Spatial-temporal prediction is a critical problem for intelligent
transp...
Understanding and comprehending video content is crucial for many real-w...
Object detection plays a deep role in visual systems by identifying inst...
Despite the success of deep learning on supervised point cloud semantic
...
Most of the 3D networks are trained from scratch owning to the lack of
l...
To effectively apply robots in working environments and assist humans, i...
Dense Depth estimation plays a key role in multiple applications such as...
While recent progress has significantly boosted few-shot classification ...
The human ability of deep cognitive skills are crucial for the developme...
We study a novel task, Video Question-Answer Generation (VQAG), for
chal...
Deep learning-based blind image deblurring plays an essential role in so...
We proposed an end-to-end grasp detection network, Grasp Detection Netwo...
Cardiac Magnetic Resonance Imaging (CMR) is widely used since it can
ill...
In recent years, few-shot learning problems have received a lot of atten...
We study the XAI (explainable AI) on the face recognition task, particul...
Depth estimation features are helpful for 3D recognition. Commodity-grad...
3D point cloud segmentation remains challenging for structureless and
te...
Face hallucination is a generative task to super-resolve the facial imag...
Existing counting methods often adopt regression-based approaches and ca...
Nowadays, social media has become a popular platform for the public to s...
Due to the prevalence of mobile devices, mobile search becomes a more
co...
We present a new supervised architecture termed Mediated Mixture-of-Expe...
Unconstrained video recognition and Deep Convolution Network (DCN) are t...