Houdong Hu

research

∙ 04/19/2022

ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models

Learning visual representations from natural language supervision has re...

2 Chunyuan Li, et al. ∙

research

∙ 11/30/2021

MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark

Multi-camera tracking systems are gaining popularity in applications tha...

5 Xiaotian Han, et al. ∙

research

∙ 11/22/2021

Florence: A New Foundation Model for Computer Vision

Automated visual understanding of our diverse and open world demands com...

4 Lu Yuan, et al. ∙

research

∙ 07/27/2021

Image Scene Graph Generation (SGG) Benchmark

There is a surge of interest in image scene graph generation (object, at...

1 Xiaotian Han, et al. ∙

research

∙ 04/13/2020

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

Large-scale pre-training methods of learning cross-modal representations...

7 Xiujun Li, et al. ∙

research

∙ 10/28/2019

Applications of Generative Adversarial Models in Visual Search Reformulation

Query reformulation is the process by which a input search query is refi...

0 Kyle Xiao, et al. ∙

research

∙ 09/24/2019

Unified Vision-Language Pre-Training for Image Captioning and VQA

This paper presents a unified Vision-Language Pre-training (VLP) model. ...

17 Luowei Zhou, et al. ∙

research

∙ 09/22/2019

Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators

Grounding language to visual relations is critical to various language-a...

18 Kuang-Huei Lee, et al. ∙

research

∙ 04/12/2018

An Universal Image Attractiveness Ranking Framework

We propose a benchmark framework to rank image attractiveness using a no...

0 Ning Ma, et al. ∙

research

∙ 03/21/2018

Stacked Cross Attention for Image-Text Matching

In this paper, we study the problem of image-text matching. Inferring th...

0 Kuang-Huei Lee, et al. ∙

research

∙ 02/14/2018

Web-Scale Responsive Visual Search at Bing

In this paper, we introduce a web-scale general visual search system dep...

0 Houdong Hu, et al. ∙

Houdong Hu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro