In this paper, we investigate the in-context learning ability of
retriev...
Large decoder-only language models (LMs) can be largely improved in term...
Pretrained large language models have become indispensable for solving
v...
Augmenting pretrained language models (LMs) with a vision encoder (e.g.,...
Parameter efficient learning methods (PERMs) have recently gained signif...
Closed-book question answering (QA) requires a model to directly answer ...
We explore the idea of compressing the prompts used to condition languag...
FP8 is a natural progression for accelerating deep learning training
inf...
Pretrained language models (LMs) are susceptible to generate text with
n...
Training large transformer models is one of the most important computati...
Existing knowledge-grounded dialogue systems typically use finetuned ver...
Pre-trained language models (LMs) are shown to easily generate toxic
lan...
Pretrained general-purpose language models can achieve state-of-the-art
...
Detecting social bias in text is challenging due to nuance, subjectivity...
Transformers have achieved success in both language and vision domains.
...
Large language models have led to state-of-the-art accuracies across a r...
Recent work on training neural retrievers for open-domain question answe...
State-of-the-art conversational agents have advanced significantly in
co...
There has been an influx of biomedical domain-specific language models,
...
Existing pre-trained large language models have shown unparalleled gener...
Non-goal oriented dialog agents (i.e. chatbots) aim to produce varying a...
We introduce a language generative model framework for generating a styl...
Question and answer generation is a data augmentation method that aims t...
We propose a novel approach for image segmentation that combines Neural
...
Recent work in unsupervised language modeling demonstrates that training...
Recent work in unsupervised language modeling demonstrates that training...
Learning to synthesize high frame rate videos via interpolation requires...
We propose and evaluate new techniques for compressing and speeding up d...
We present Deep Voice, a production-quality text-to-speech system constr...