Yannis Kalantidis

research

∙ 03/28/2023

Rethinking matching-based few-shot action recognition

Few-shot action recognition, i.e. recognizing new action classes given o...

0 Juliette Bertrand, et al. ∙

research

∙ 12/16/2022

Fake it till you make it: Learning(s) from a synthetic ImageNet clone

Recent large-scale image generation models such as Stable Diffusion have...

0 Mert Bülent Sarıyıldız, et al. ∙

research

∙ 10/05/2022

Granularity-aware Adaptation for Image Retrieval over Multiple Tasks

Strong image search models can be learned for a specific domain, i.e. set...

0 Jon Almazan, et al. ∙

research

∙ 08/22/2022

PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

Training state-of-the-art models for human pose estimation in videos req...

10 Fabien Baradel, et al. ∙

research

∙ 06/30/2022

Improving the Generalization of Supervised Models

We consider the problem of training a deep neural network on a given cla...

0 Mert Bülent Sarıyıldız, et al. ∙

research

∙ 01/31/2022

Learning Super-Features for Image Retrieval

Methods that combine local and global features have recently shown excel...

0 Philippe Weinzaepfel, et al. ∙

research

∙ 10/18/2021

TLDR: Twin Learning for Dimensionality Reduction

Dimensionality reduction methods are unsupervised approaches which learn...

0 Yannis Kalantidis, et al. ∙

research

∙ 10/18/2021

Leveraging MoCap Data for Human Mesh Recovery

Training state-of-the-art models for human body pose and shape recovery ...

1 Fabien Baradel, et al. ∙

research

∙ 01/13/2021

Probabilistic Embeddings for Cross-Modal Retrieval

Cross-modal retrieval methods build a common representation space for sa...

3 Sanghyuk Chun, et al. ∙

research

∙ 12/10/2020

Concept Generalization in Visual Representation Learning

Measuring concept generalization, i.e., the extent to which models train...

0 Mert Bülent Sarıyıldız, et al. ∙

research

∙ 10/02/2020

Hard Negative Mixing for Contrastive Learning

Contrastive learning has become a key component of self-supervised learn...

0 Yannis Kalantidis, et al. ∙

research

∙ 04/23/2020

Proceedings of the ICLR Workshop on Computer Vision for Agriculture (CV4A) 2020

This is the proceedings of the Computer Vision for Agriculture (CV4A) Wo...

0 Yannis Kalantidis, et al. ∙

research

∙ 10/21/2019

Decoupling Representation and Classifier for Long-Tailed Recognition

The long-tail distribution of the visual world poses great challenges fo...

0 Bingyi Kang, et al. ∙

research

∙ 06/01/2019

Learning to Generate Grounded Image Captions without Localization Supervision

When generating a sentence description for an image, it frequently remai...

0 Chih-Yao Ma, et al. ∙

research

∙ 04/10/2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

In natural images, information is conveyed at different frequencies wher...

0 Yunpeng Chen, et al. ∙

research

∙ 03/03/2019

Less is More: Learning Highlight Detection from Video Duration

Highlight detection has the potential to significantly ease video browsi...

0 Bo Xiong, et al. ∙

research

∙ 01/11/2019

DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition

Motion has been shown to be useful for video understanding, where motion is t...

0 Zheng Shou, et al. ∙

research

∙ 12/17/2018

Grounded Video Description

Video description is one of the most challenging problems in vision and ...

8 Luowei Zhou, et al. ∙

research

∙ 11/30/2018

Graph-Based Global Reasoning Networks

Globally modeling and reasoning over relations between regions can be be...

0 Yunpeng Chen, et al. ∙

research

∙ 10/27/2018

A^2-Nets: Double Attention Networks

Learning to capture long-range relations is fundamental to image/video r...

0 Yunpeng Chen, et al. ∙

research

∙ 07/30/2018

Multi-Fiber Networks for Video Recognition

In this paper, we aim to reduce the computational cost of spatio-tempora...

0 Yunpeng Chen, et al. ∙

research

∙ 04/27/2018

Large-Scale Visual Relationship Understanding

Large scale visual understanding is challenging, as it requires a model ...

0 Ji Zhang, et al. ∙

research

∙ 08/04/2017

MemexQA: Visual Memex Question Answering

This paper proposes a new task, MemexQA: given a collection of photos or...

3 Lu Jiang, et al. ∙

research

∙ 12/06/2016

Tag Prediction at Flickr: a View from the Darkroom

Automated photo tagging has established itself as one of the most compel...

0 Kofi Boakye, et al. ∙

research

∙ 04/21/2016

Visual Congruent Ads for Image Search

The quality of user experience online is affected by the relevance and p...

0 Yannis Kalantidis, et al. ∙

research

∙ 04/21/2016

LOH and behold: Web-scale visual search, recommendation and clustering using Locally Optimized Hashing

We propose a novel hashing-based matching scheme, called Locally Optimiz...

0 Yannis Kalantidis, et al. ∙

research

∙ 02/23/2016

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Despite progress in perceptual tasks such as image classification, compu...

0 Ranjay Krishna, et al. ∙

research

∙ 12/13/2015

Cross-dimensional Weighting for Aggregated Deep Convolutional Features

We propose a simple and straightforward way of creating powerful image r...

0 Yannis Kalantidis, et al. ∙

