Graphic layout generation, a growing research field, plays a significant...
Avoiding synthesizing specific visual concepts is an essential challenge...
The rapid advancements in large language models (LLMs) have presented ch...
Controllable video generation has gained significant attention in recent...
Code search is the task of finding code that semantically matches ...
In this report, we present our champion solution for Ego4D Natural Langu...
In this paper, we introduce a new task for code completion that focuses ...
As the capabilities of large language models (LLMs) continue to advance,...
Two-Tower Vision-Language (VL) models have shown promising improvements ...
Large language models are powerful text processors and reasoners, but ar...
Open-domain question answering is a crucial task that often requires acc...
Large Language Models (LLMs) serve as a powerful Reader in the Retrieve-then...
There are two types of approaches to solving cross-lingual transfer: mul...
Diffusion models have gained significant attention in the realm of image...
Based on the remarkable achievements of pre-trained language models in a...
Code execution is a fundamental aspect of programming language semantics...
Large language models (LLMs) can achieve highly effective performance on...
Large Language Models (LLMs) have shown remarkable performance in variou...
Effectively utilizing LLMs for complex tasks is challenging, often invol...
Evaluating the general abilities of foundation models to tackle human-le...
Chat models, such as ChatGPT, have shown impressive capabilities and hav...
Many natural language processing (NLP) tasks rely on labeled data to tra...
Artificial Intelligence (AI) has made incredible progress recently. On t...
ChatGPT is attracting cross-field interest as it provides a language i...
3D photography renders a static image into a video with appealing 3D vis...
Recently multi-lingual pre-trained language models (PLM) such as mBERT a...
Large language models can perform various reasoning tasks by using chain...
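Chain-of-thought prompting of this kind can be sketched as a few-shot prompt whose exemplars spell out intermediate reasoning before the final answer. The exemplar content and function name below are illustrative assumptions, not taken from any of the papers listed here.

```python
def chain_of_thought_prompt(question, exemplars):
    """Assemble a few-shot chain-of-thought prompt: each exemplar
    shows its intermediate reasoning before stating the answer,
    nudging the model to reason step by step on the new question."""
    parts = []
    for q, reasoning, answer in exemplars:
        parts.append(f"Q: {q}\nA: {reasoning} So the answer is {answer}.")
    parts.append(f"Q: {question}\nA:")  # the model continues from here
    return "\n\n".join(parts)

# Illustrative exemplar (not drawn from any specific paper).
demo = [(
    "If I have 3 apples and buy 2 more, how many do I have?",
    "I start with 3 apples and buying 2 more gives 3 + 2 = 5.",
    "5",
)]
print(chain_of_thought_prompt("What is 4 + 7?", demo))
```

The reasoning text in the exemplar is what distinguishes this from a standard few-shot prompt, which would map questions directly to answers.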
In this paper, we propose a large-scale language pre-training for text G...
The dual-encoder has become the de facto architecture for dense retrieva...
Dense retrieval aims to map queries and passages into low-dimensional ve...
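A minimal sketch of that dual-encoder retrieval setup, with a toy hash-based encoder standing in for a trained transformer (the encoder, dimensionality, and texts here are illustrative assumptions):

```python
import zlib
import numpy as np

def embed(texts, dim=8):
    # Toy stand-in encoder: hashes each token to a deterministic random
    # vector and sums them, then L2-normalizes. A real dual-encoder
    # would use a trained transformer; this only shows the mechanics.
    vecs = []
    for text in texts:
        v = np.zeros(dim)
        for tok in text.lower().split():
            rng = np.random.RandomState(zlib.crc32(tok.encode()) % (2**31))
            v += rng.randn(dim)
        norm = np.linalg.norm(v)
        vecs.append(v / norm if norm else v)
    return np.stack(vecs)

passages = [
    "cooking recipes for fresh pasta",
    "dense retrieval maps queries and passages to vectors",
    "hiking trails in the alps",
]
p_emb = embed(passages)                                # index built offline
q_emb = embed(["retrieval of passages with vectors"])  # query encoded online
scores = (q_emb @ p_emb.T)[0]                          # dot-product similarity
top = int(np.argmax(scores))
```

Because both sides live in the same vector space, maximum-inner-product search over a precomputed passage index replaces expensive query-passage cross-attention at retrieval time.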
Long-form numerical reasoning in financial analysis aims to generate a r...
Knowledge distillation is often used to transfer knowledge from a strong...
Developing models that can automatically generate detailed code explanat...
We introduce GENIUS: a conditional text generation model using sketches ...
Code generation models can benefit data scientists' productivity by auto...
This technical report describes the CONE approach for Ego4D Natural Lang...
Sampling proper negatives from a large document pool is vital to effecti...
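One common recipe for this (a sketch of the general technique, not any single paper's exact procedure) is to mine "hard" negatives: score the pool with the current retriever and keep the top-scoring documents that are not labeled positive.

```python
import numpy as np

def sample_hard_negatives(scores, positive_ids, k=2):
    """Pick the k highest-scoring documents that are NOT labeled
    positive; these 'hard' negatives are more informative for
    training a retriever than randomly sampled ones."""
    order = np.argsort(scores)[::-1]  # document ids, descending by score
    negs = [int(i) for i in order if int(i) not in positive_ids]
    return negs[:k]

scores = np.array([0.9, 0.8, 0.1, 0.7])
positives = {0}
print(sample_hard_negatives(scores, positives, k=2))  # → [1, 3]
```

In practice the scores come from an earlier checkpoint of the retriever itself or from a cheaper first-stage ranker such as BM25.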
Commonsense generation aims to generate a realistic sentence describing ...
This paper presents ReasonFormer, a unified reasoning framework for mirr...
Most existing pre-trained language representation models (PLMs) are sub-...
Code contrastive pre-training has recently achieved significant progress...
Retrieving evidences from tabular and textual resources is essential for...
Knowledge distillation is an effective way to transfer knowledge from a ...
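The core teacher-to-student transfer can be sketched as matching temperature-softened output distributions; the temperature value below is an illustrative choice, not taken from the papers listed here.

```python
import numpy as np

def softmax(logits, T=1.0):
    z = np.asarray(logits, dtype=float) / T
    z = z - z.max()                 # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T=2.0):
    # KL(teacher || student) over temperature-softened distributions,
    # scaled by T^2 so gradient magnitudes stay comparable across T.
    p = softmax(teacher_logits, T)  # soft targets from the teacher
    q = softmax(student_logits, T)  # student predictions
    return float(T * T * np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12))))
```

In training, this term is typically mixed with the ordinary cross-entropy on the ground-truth labels, so the student learns from both hard labels and the teacher's soft targets.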
Video temporal grounding (VTG) aims to localize temporal moments in a...
In this paper, we present NUWA-Infinity, a generative model for infinite...
Due to exposure bias, most existing natural language generation (NLG) mo...
Vision-Language (VL) models with the Two-Tower architecture have dominat...
Recent research demonstrates the effectiveness of using pretrained langu...
Recently, most successful image synthesis models follow a multi-stage process ...
Non-Autoregressive generation is a sequence generation paradigm, which r...