Long-form question answering (LFQA) enables answering a wide range of
qu...
In this paper, we study the generation quality of interpolation-based
re...
To detect the deployment of large language models for malicious use case...
While machine translation evaluation metrics based on string overlap (e....
To understand what kinds of linguistic knowledge are encoded by pretrain...