In this paper, we introduce a new task for code completion that focuses ...
Large Language Models (LLMs) have greatly advanced code auto-completion ...
Chat models, such as ChatGPT, have shown impressive capabilities and hav...
We present Mirror, an open-source platform for data exploration and anal...
Masked language modeling is widely used for pretraining large language m...
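For illustration, here is a minimal sketch of BERT-style masked language modeling corruption (the 80/10/10 scheme: mask, random token, or keep). The function name `mask_tokens` and its parameters are hypothetical, not taken from any specific codebase.

```python
import random

def mask_tokens(token_ids, mask_token_id, vocab_size, mask_prob=0.15):
    """Select ~15% of positions as prediction targets; of those, 80%
    become [MASK], 10% a random token, 10% stay unchanged.
    Returns (corrupted inputs, labels with -100 at non-target positions)."""
    inputs, labels = list(token_ids), [-100] * len(token_ids)
    for i, tok in enumerate(token_ids):
        if random.random() < mask_prob:
            labels[i] = tok  # model must recover the original token here
            r = random.random()
            if r < 0.8:
                inputs[i] = mask_token_id
            elif r < 0.9:
                inputs[i] = random.randrange(vocab_size)
            # else: keep the token unchanged
    return inputs, labels
```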
Intermediate-task transfer can benefit a wide range of NLP tasks with pr...
Prompt-based learning (i.e., prompting) is an emerging paradigm for expl...
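As a concrete illustration of the prompting paradigm, the sketch below reformulates sentiment classification as a cloze-style masked-token prediction, with a verbalizer mapping label words to classes. The template and names are hypothetical examples, not drawn from the paper above.

```python
# Cloze-style template: the model fills [MASK], and the verbalizer
# maps the predicted label word back to a task label.
TEMPLATE = "{text} Overall, it was a [MASK] movie."
VERBALIZER = {"great": "positive", "terrible": "negative"}

def build_prompt(text):
    return TEMPLATE.format(text=text)

print(build_prompt("The plot was thin but the acting carried it."))
# -> "The plot was thin but the acting carried it. Overall, it was a [MASK] movie."
```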
In this paper, we propose LaPraDoR, a pretrained dual-tower dense retrie...
Language models (LMs) can reproduce (or amplify) toxic language seen dur...
With recent developments in new architectures like Transformer and pretr...
Effectively scaling large Transformer models is a main driver of recent ...
PromptSource is a system for creating, sharing, and using natural langua...
Large language models have recently been shown to attain reasonable zero...
Recent studies on compression of pretrained language models (e.g., BERT)...
The scale, variety, and quantity of publicly-available NLP datasets have ...
We present Meta Learning for Knowledge Distillation (MetaDistil), a simp...
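For context, the classic knowledge distillation objective that approaches like MetaDistil build on can be sketched as follows. This shows only the standard Hinton-style loss, not MetaDistil's meta-learned teacher update, and all names are illustrative.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Standard KD objective: KL divergence between temperature-softened
    teacher and student distributions, mixed with the hard-label loss."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients match the hard-loss magnitude
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```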
Cant is important for understanding advertising, comedies, and dog-whistl...
In this paper, we propose Patience-based Early Exit, a straightforward y...
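The patience mechanism can be sketched in a few lines: run the encoder layer by layer, attach an internal classifier to each layer, and stop as soon as the prediction has stayed unchanged for a fixed number of consecutive layers. The sketch below assumes PyTorch-style callables; `layers` and `classifiers` are hypothetical stand-ins for a real model's modules.

```python
import torch

def patience_early_exit(hidden, layers, classifiers, patience=3):
    """Stop inference once `patience` consecutive internal classifiers
    agree on the prediction, skipping the remaining layers."""
    prev_pred, streak, pred = None, 0, None
    for layer, clf in zip(layers, classifiers):
        hidden = layer(hidden)
        pred = clf(hidden).argmax(dim=-1)
        if prev_pred is not None and torch.equal(pred, prev_pred):
            streak += 1
        else:
            streak = 0
        if streak >= patience:
            return pred  # prediction is stable: exit early
        prev_pred = pred
    return pred  # reached the top layer without exiting early
```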
Recently, large-scale datasets have vastly facilitated the development i...
In this paper, we propose a novel model compression approach to effectiv...
Current neural Natural Language Generation (NLG) models cannot handle em...
Identifying the named entities mentioned in text would enrich many seman...
Recently, with the prevalence of large-scale image datasets, the co-occur...
In recent years, with the prevalence of social media and smart devices, ...