To produce accurate predictions, language models (LMs) must balance betw...
Transformers typically require some form of positional encoding, such as...
NLP benchmarks have largely focused on short texts, such as sentences an...
Latent alignment objectives such as CTC and AXE significantly improve no...
Large pre-trained language models have been shown to encode large amount...