Sachit Menon

research

∙ 03/14/2023

ViperGPT: Visual Inference via Python Execution for Reasoning

Answering visual queries is a complex task that requires both visual pro...

0 Dídac Surís, et al. ∙

research

∙ 01/26/2023

Affective Faces for Goal-Driven Dyadic Communication

We introduce a video framework for modeling the association between verb...

0 Scott Geng, et al. ∙

research

∙ 12/12/2022

Doubly Right Object Recognition: A Why Prompt for Visual Rationales

Many visual recognition models are evaluated only on their classificatio...

0 Chengzhi Mao, et al. ∙

research

∙ 12/08/2022

Task Bias in Vision-Language Models

Incidental supervision from language has become a popular approach for l...

0 Sachit Menon, et al. ∙

research

∙ 10/13/2022

Visual Classification via Description from Large Language Models

Vision-language models (VLMs) such as CLIP have shown promising performa...

0 Sachit Menon, et al. ∙

research

∙ 07/19/2022

Forget-me-not! Contrastive Critics for Mitigating Posterior Collapse

Variational autoencoders (VAEs) suffer from posterior collapse, where th...

2 Sachit Menon, et al. ∙

research

∙ 06/17/2022

Shadows Shed Light on 3D Objects

3D reconstruction is a fundamental problem in computer vision, and the t...

0 Ruoshi Liu, et al. ∙

research

∙ 03/08/2020

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

The primary aim of single-image super-resolution is to construct a high-...

0 Sachit Menon, et al. ∙

research

∙ 05/09/2018

New Techniques for Preserving Global Structure and Denoising with Low Information Loss in Single-Image Super-Resolution

This work identifies and addresses two important technical challenges in...

0 Yijie Bei, et al. ∙

Sachit Menon

Featured Co-authors

Sign in with Google

Consider DeepAI Pro