b'Baoxiong Jia'

research

∙ 08/21/2023

X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events

Intuitive physics is pivotal for human understanding of the physical wor...

0 Bo Dai, et al. ∙

research

∙ 04/09/2023

ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes

Understanding the continuous states of objects is essential for task lea...

0 Ran Gong, et al. ∙

research

∙ 01/15/2023

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

We introduce SceneDiffuser, a conditional generative model for 3D scene ...

0 Siyuan Huang, et al. ∙

research

∙ 11/28/2022

Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation

Current computer vision models, unlike the human visual system, cannot y...

0 Jiangyong Huang, et al. ∙

research

∙ 10/17/2022

Unsupervised Object-Centric Learning with Bi-Level Optimized Query Slot Attention

The ability to decompose complex natural scenes into meaningful object-c...

13 Baoxiong Jia, et al. ∙

research

∙ 10/08/2022

EgoTaskQA: Understanding Human Tasks in Egocentric Videos

Understanding human tasks through video observations is an essential cap...

0 Baoxiong Jia, et al. ∙

research

∙ 03/26/2021

ACRE: Abstract Causal REasoning Beyond Covariation

Causal induction, i.e., identifying unobservable mechanisms that lead to...

0 Chi Zhang, et al. ∙

research

∙ 03/26/2021

Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution

Spatial-temporal reasoning is a challenging task in Artificial Intellige...

0 Chi Zhang, et al. ∙

research

∙ 07/31/2020

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities

Understanding and interpreting human actions is a long-standing challeng...

0 Baoxiong Jia, et al. ∙

research

∙ 11/29/2019

Learning Perceptual Inference by Contrasting

"Thinking in pictures," [1] i.e., spatial-temporal reasoning, effortless...

0 Chi Zhang, et al. ∙

research

∙ 03/07/2019

RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

Dramatic progress has been witnessed in basic vision tasks involving low...

0 Chi Zhang, et al. ∙

research

∙ 08/23/2018

Learning Human-Object Interactions by Graph Parsing Neural Networks

This paper addresses the task of detecting and recognizing human-object ...

0 Siyuan Qi, et al. ∙

research

∙ 06/09/2018

Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction

Future predictions on sequence data (e.g., videos or audios) require the...

0 Siyuan Qi, et al. ∙

Baoxiong Jia

Featured Co-authors

Sign in with Google

Consider DeepAI Pro