Generative models have seen an explosion in popularity with the release ...
Predicting high dimensional video sequences is a curiously difficult pro...
This work introduces a flexible architecture for real-time occupancy
for...
Rotated bounding boxes drastically reduce output ambiguity of elongated
...
Content-based medical image retrieval is an important diagnostic tool th...
Naively trained AI models can be heavily biased. This can be particularl...
The self-attention mechanism, successfully employed with the transformer...
Single image-level annotations only correctly describe an often small su...
Inspired by the concept of preconditioning, we propose a novel method to...
We propose a new video camouflaged object detection (VCOD) framework tha...
Learning and generalizing to novel concepts with few samples (Few-Shot
L...
Learning and generalizing from limited examples, i,e, few-shot learning,...
Tracking requires building a discriminative model for the target in the
...
In the real world, medical datasets often exhibit a long-tailed data
dis...
We present an approach for visualizing mobile robots through an Augmente...
An appropriate user interface to collect human demonstration data for
de...
Tracking a time-varying indefinite number of objects in a video sequence...
Humans are very skillful in communicating their intent for when and wher...
Deep neural networks are known to be data-driven and label noise can hav...
Recently, ultra-widefield (UWF) 200-degree fundus imaging by Optos camer...
Domain adaptation and generative modelling have collectively mitigated t...
This paper presents a novel ensemble learning approach called Residual
L...
To reduce the human efforts in neural network design, Neural Architectur...
Camera-based end-to-end driving neural networks bring the promise of a
l...
The increasing presence of robots alongside humans, such as in human-rob...
We study the problem of placing a grasped object on an empty flat surfac...
For decades, advances in retinal imaging technology have enabled effecti...
Event cameras are paradigm-shifting novel sensors that report asynchrono...
Many existing conditional Generative Adversarial Networks (cGANs) are li...
Instantaneous and on demand accuracy-efficiency trade-off has been recen...
Neural network quantization and pruning are two techniques commonly used...
Recent works show that Generative Adversarial Networks (GANs) can be
suc...
Continuously estimating an agent's state space and a representation of i...
Grasping is the dominant approach for robot manipulation, but only a sin...
Robots need to learn behaviors in intuitive and practical ways for wides...
We present a robotic system capable of navigating autonomously by follow...
In contrast to traditional cameras, whose pixels have a common exposure ...
State-of-the-art deep neural network recognition systems are designed fo...
Visual place recognition (VPR) - the act of recognizing a familiar visua...
The practice of transforming raw data to a feature space so that inferen...
In this paper, we introduce a novel methodology for characterising the
p...
Deployment of deep learning models in robotics as sensory information
ex...
Since the resurgence of CNNs the robotic vision community has developed ...
Recovering structure and motion parameters given a image pair or a seque...
Dataset creation is typically one of the first steps when applying Artif...
Domain adaptation for visual recognition has undergone great progress in...
In recent times, many of the breakthroughs in various vision-related tas...
We present "just-in-time reconstruction" as real-time image-guided
inpai...
We present an efficient subpixel refinement method usinga learning-based...
Estimation of surface curvature from range data is important for a range...