We study the problem of efficient generative inference for Transformer
m...
Large language models have been shown to achieve remarkable performance
...
We present the zero-shot entity linking task, where mentions must be lin...
We introduce a novel method of generating synthetic question answering
c...
Hierarchical neural architectures are often used to capture long-distanc...
We introduce a new language representation model called BERT, which stan...
Program synthesis is the task of automatically generating a program
cons...
In this paper, we propose a new universal machine translation approach
f...
We study the problem of semantic code repair, which can be broadly defin...
Most recently proposed methods for Neural Program Induction work under t...
Attentional sequence-to-sequence models have become the new standard for...
We introduce the first dataset for sequential vision-to-language, and ex...
There has been an explosion of work in the vision & language community d...
In this paper, we explore different neural network architectures that ca...
Integrating vision and language has long been a dream in work on artific...
Two recent approaches have achieved state-of-the-art results in image
ca...