We propose EAR, a query Expansion And Reranking approach for improving p...
Evaluating the factuality of long-form text generated by large language ...
Interactive semantic parsing based on natural language (NL) feedback, wh...
In recent years, pre-trained large language models (LLMs) have demonstra...
We propose a new two-stage pre-training framework for video-to-text gene...
The advent of large language models trained on code (code LLMs) has led ...
Various techniques have been developed in recent years to improve dense ...
We introduce REPLUG, a retrieval-augmented language modeling framework t...
Existing language models (LMs) predict tokens with a softmax over a fini...
Sampling diverse programs from a code language model and reranking with ...
Recent multimodal models such as DALL-E and CM3 have achieved remarkable...
Multi-vector retrieval methods combine the merits of sparse (e.g. BM25) ...
We study the problem of retrieval with instructions, where users of a re...
We introduce RoMQA, the first benchmark for robust, multi-evidence, mult...
We present an empirical study of adapting an existing pretrained text-to...
We propose structured prompt tuning, a simple and effective method to im...
Knowledge-intensive language tasks require NLP systems to both provide t...
We propose a simple and effective re-ranking method for improving passag...
Code is seldom written in a single left-to-right pass and is instead rep...
To address the increasing demands of real-world applications, t...
We propose DrBoost, a dense retrieval ensemble inspired by boosting. DrB...
Many NLP tasks require processing long contexts beyond the length limit ...
With the rise of large-scale pre-trained language models, open-domain qu...
Conventional fine-tuning of pre-trained language models tunes all model ...
Despite their recent popularity and well-known advantages, dense retriev...
Pre-training on larger datasets with ever-increasing model size is now a...
In adversarial data collection (ADC), a human workforce interacts with a...
Current NLP models are predominantly trained through a pretrain-then-fin...
In this paper, we introduce UnifiedM2, a general-purpose misinformation ...
We review the EfficientQA competition from NeurIPS 2020. The competition...
Retrieving relevant contexts from a large corpus is a crucial step for t...
Closed-book question-answering (QA) is a challenging task that requires ...
Natural language (NL) explanations of model predictions are gaining popu...
Structured information is an important knowledge source for automatic ve...
State-of-the-art Machine Reading Comprehension (MRC) models for Open-dom...
We present ELQ, a fast end-to-end entity linking model for questions, wh...
We propose a simple and efficient multi-hop dense retrieval approach for...
Recent work has suggested that language models (LMs) store both common-s...
Large pre-trained language models have been shown to store factual knowl...
Recent years have witnessed the burgeoning of pretrained language models...
Despite widely successful applications, bootstrapping and fine-tunin...
Open-domain question answering relies on efficient passage retrieval to ...
We aim to improve question answering (QA) by decomposing hard questions ...
As a promising paradigm, interactive semantic parsing has been shown to impro...
Our goal is to better comprehend procedural text, e.g., a paragraph abou...
Our goal is procedural text comprehension, namely tracking how the prope...
Many natural language questions require recognizing and reasoning with q...
Conversational machine comprehension requires a deep understanding of th...
Semantic parsing from denotations faces two key challenges in model trai...
Comprehending procedural text, e.g., a paragraph describing photosynthes...