b'Joo Hwee Lim'

research

∙ 09/17/2023

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Many studies focus on improving pretraining or developing new backbones ...

0 Burak Satar, et al. ∙

research

∙ 09/14/2023

Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos

A key challenge with procedure planning in instructional videos lies in ...

0 Fen Fang, et al. ∙

research

∙ 08/18/2023

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

We tackle the data scarcity challenge in few-shot point cloud recognitio...

0 Xuanyu Yi, et al. ∙

research

∙ 07/25/2023

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering

The main challenge in video question answering (VideoQA) is to capture a...

0 Yi Cheng, et al. ∙

research

∙ 06/07/2023

An Overview of Challenges in Egocentric Text-Video Retrieval

Text-video retrieval contains various challenges, including biases comin...

0 Burak Satar, et al. ∙

research

∙ 12/09/2022

Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop

Bio-inspired learning has been gaining popularity recently given that Ba...

0 Manas Gupta, et al. ∙

research

∙ 11/23/2022

Reason from Context with Self-supervised Learning

A tiny object in the sky cannot be an elephant. Context reasoning is cri...

0 Xiao Liu, et al. ∙

research

∙ 11/21/2022

On the Robustness, Generalization, and Forgetting of Shape-Texture Debiased Continual Learning

Tremendous progress has been made in continual learning to maintain good...

0 Zenglin Shi, et al. ∙

research

∙ 11/09/2022

Portmanteauing Features for Scene Text Recognition

Scene text images have different shapes and are subjected to various dis...

0 Yew Lee Tan, et al. ∙

research

∙ 08/03/2022

Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition

Fine-grained action recognition is a challenging task in computer vision...

0 Mei Chee Leong, et al. ∙

research

∙ 07/27/2022

Identifying Hard Noise in Long-Tailed Sample Distribution

Conventional de-noising methods rely on the assumption that all samples ...

0 Xuanyu Yi, et al. ∙

research

∙ 06/29/2022

Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

In this report, we present our approach for EPIC-KITCHENS-100 Multi-Inst...

0 Burak Satar, et al. ∙

research

∙ 06/26/2022

Semantic Role Aware Correlation Transformer for Text to Video Retrieval

With the emergence of social media, voluminous video clips are uploaded ...

2 Burak Satar, et al. ∙

research

∙ 06/26/2022

RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval

Seas of videos are uploaded daily with the popularity of social channels...

0 Burak Satar, et al. ∙

research

∙ 05/24/2022

TAILOR: Teaching with Active and Incremental Learning for Object Registration

When deploying a robot to a new task, one often has to train it to detec...

2 Qianli Xu, et al. ∙

research

∙ 11/28/2021

FashionSearchNet-v2: Learning Attribute Representations with Localization for Image Retrieval with Attribute Manipulation

The focus of this paper is on the problem of image retrieval with attrib...

0 Kenan E. Ak, et al. ∙

research

∙ 10/12/2021

Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition

Fine-grained human action recognition is a core research topic in comput...

0 Mei Chee Leong, et al. ∙

research

∙ 09/24/2019

6D Pose Estimation with Correlation Fusion

6D object pose estimation is widely applied in robotic tasks such as gra...

0 Yi Cheng, et al. ∙

research

∙ 05/23/2019

Prototype Reminding for Continual Learning

Continual learning is a critical ability of continually acquiring and tr...

5 Mengmi Zhang, et al. ∙

research

∙ 02/01/2019

Lift-the-Flap: Context Reasoning Using Object-Centered Graphs

Children benefit from lift-the-flap books by taking on an active role in...

8 Mengmi Zhang, et al. ∙

research

∙ 08/07/2018

Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams

Segmenting video content into events provides semantic structures for in...

2 Ana García del Molino, et al. ∙

research

∙ 07/31/2018

Egocentric Spatial Memory

Egocentric spatial memory (ESM) defines a memory system with encoding, s...

0 Mengmi Zhang, et al. ∙

research

∙ 07/31/2018

What am I searching for?

Can we infer intentions and goals from a person's actions? As an example...

4 Mengmi Zhang, et al. ∙

research

∙ 07/18/2018

Finding any Waldo: zero-shot invariant and efficient visual search

Searching for a target object in a cluttered scene constitutes a fundame...

0 Mengmi Zhang, et al. ∙

Joo Hwee Lim

Featured Co-authors

Sign in with Google

Consider DeepAI Pro