Noriyuki Kojima

research

∙ 09/06/2023

A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models

Key to tasks that require reasoning about natural language in visual con...

0 Noriyuki Kojima, et al. ∙

research

∙ 11/29/2022

Abstract Visual Reasoning with Tangram Shapes

We introduce KiloGram, a resource for studying abstract visual reasoning...

0 Anya Ji, et al. ∙

research

∙ 11/03/2022

lilGym: Natural Language Visual Reasoning with Reinforcement Learning

We present lilGym, a new benchmark for language-conditioned reinforcemen...

14 Anne Wu, et al. ∙

research

∙ 10/11/2022

Markup-to-Image Diffusion Models with Scheduled Sampling

Building on recent advances in image generation, we present a fully data...

0 Yuntian Deng, et al. ∙

research

∙ 08/10/2021

Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

We study continual learning for natural language instruction generation,...

0 Noriyuki Kojima, et al. ∙

research

∙ 07/26/2020

OASIS: A Large-Scale Dataset for Single Image 3D in the Wild

Single-view 3D is the task of recovering 3D properties such as depth and...

30 Weifeng Chen, et al. ∙

research

∙ 05/04/2020

What is Learned in Visually Grounded Neural Syntax Acquisition

Visual features are a promising signal for learning bootstrap textual mo...

0 Noriyuki Kojima, et al. ∙

research

∙ 07/26/2019

To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments

In this paper we compare learning-based methods and classical methods fo...

3 Noriyuki Kojima, et al. ∙

research

∙ 09/24/2018

Speaker Naming in Movies

We propose a new model for speaker naming in movies that leverages visua...

2 Mahmoud Azab, et al. ∙

Noriyuki Kojima

Featured Co-authors

Sign in with Google

Consider DeepAI Pro