Dustin Schwenk

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Ali Farhadi
106 publications
Hannaneh Hajishirzi
102 publications
Aniruddha Kembhavi
53 publications
Roozbeh Mottaghi
53 publications
Ludwig Schmidt
48 publications
Derek Hoiem
34 publications
Sachin Mehta
29 publications
Jonghyun Choi
29 publications
Luca Weihs
26 publications
Mark Yatskar
24 publications
Jiasen Lu
20 publications

research

∙ 12/15/2022

Objaverse: A Universe of Annotated 3D Objects

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebIm...

0 Matt Deitke, et al. ∙

research

∙ 06/03/2022

A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

The Visual Question Answering (VQA) task aspires to provide a meaningful...

13 Dustin Schwenk, et al. ∙

research

∙ 12/01/2021

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

Communicating with humans is challenging for AIs because it requires a s...

8 Christopher Clark, et al. ∙

research

∙ 09/23/2020

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers

Mirroring the success of masked language models, vision-and-language cou...

4 Jaemin Cho, et al. ∙

research

∙ 04/14/2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform

Visual recognition ecosystems (e.g. ImageNet, Pascal, COCO) have undenia...

2 Matt Deitke, et al. ∙

research

∙ 12/17/2019

Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game

The ubiquity of embodied gameplay, observed in a wide variety of animal ...

17 Luca Weihs, et al. ∙

research

∙ 04/10/2018

Imagine This! Scripts to Compositions to Videos

Imagining a scene described in natural language with realistic layout an...

2 Tanmay Gupta, et al. ∙

Success!

An error occurred

Dustin Schwenk

Featured Co-authors

Objaverse: A Universe of Annotated 3D Objects

A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform

Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game

Imagine This! Scripts to Compositions to Videos

Sign in with Google

Consider DeepAI Pro