Jiuxiang Gu

research

∙ 06/29/2023

LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding

Instruction tuning unlocks the superior capability of Large Language Mod...

0 Yanzhe Zhang, et al. ∙

research

∙ 11/27/2022

MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding

Document images are a ubiquitous source of data where the text is organi...

0 Zilong Wang, et al. ∙

research

∙ 11/24/2022

Delving into Out-of-Distribution Detection with Vision-Language Representations

Recognizing out-of-distribution (OOD) samples is critical for machine le...

0 Yifei Ming, et al. ∙

research

∙ 11/01/2022

User-Entity Differential Privacy in Learning Natural Language Models

In this paper, we introduce a novel concept of user-entity differential ...

0 Phung Lai, et al. ∙

research

∙ 11/27/2021

LAFITE: Towards Language-Free Training for Text-to-Image Generation

One of the major challenges in training text-to-image generation models ...

9 Yufan Zhou, et al. ∙

research

∙ 03/11/2021

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models

Recent studies indicate that NLU models are prone to rely on shortcut fe...

0 Mengnan Du, et al. ∙

research

∙ 10/03/2020

Unsupervised Cross-lingual Image Captioning

Most recent image captioning works are conducted in English as the major...

0 Jiahui Gao, et al. ∙

research

∙ 09/30/2020

Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

Change Captioning is a task that aims to describe the difference between...

0 Xiangxi Shi, et al. ∙

research

∙ 11/06/2019

Resilient Load Restoration in Microgrids Considering Mobile Energy Storage Fleets: A Deep Reinforcement Learning Approach

Mobile energy storage systems (MESSs) provide mobility and flexibility t...

0 Shuhan Yao, et al. ∙

research

∙ 04/01/2019

Scene Graph Generation with External Knowledge and Image Reconstruction

Scene graph generation has received growing attention with the advanceme...

0 Jiuxiang Gu, et al. ∙

research

∙ 03/26/2019

Unpaired Image Captioning via Scene Graph Alignments

Deep neural networks have achieved great success on the image captioning...

0 Jiuxiang Gu, et al. ∙

research

∙ 07/08/2018

Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction

The explosion of video data on the internet requires effective and effic...

0 Xiangxi Shi, et al. ∙

research

∙ 03/14/2018

Unpaired Image Captioning by Language Pivoting

Image captioning is a multimodal task involving computer vision and natu...

0 Jiuxiang Gu, et al. ∙

research

∙ 11/17/2017

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models

Textual-visual cross-modal retrieval has been a hot research topic in bo...

0 Jiuxiang Gu, et al. ∙

research

∙ 09/11/2017

Stack-Captioning: Coarse-to-Fine Learning for Image Captioning

The existing image captioning approaches typically train a one-stage sen...

0 Jiuxiang Gu, et al. ∙

research

∙ 12/21/2016

An Empirical Study of Language CNN for Image Captioning

Language Models based on recurrent neural networks have dominated recent...

0 Jiuxiang Gu, et al. ∙

research

∙ 12/22/2015

Recent Advances in Convolutional Neural Networks

In the last few years, deep learning has led to very good performance on...

1 Jiuxiang Gu, et al. ∙

Jiuxiang Gu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro