Using large language models (LLMs) for source code has recently gained
a...
Multiple pattern matching in strings is a fundamental problem in text
pr...
Finetuning large pre-trained language models with a task-specific head h...
This document describes the findings of the Third Workshop on Neural
Gen...
This document describes the findings of the Second Workshop on Neural Ma...
Training of neural machine translation (NMT) models usually uses mini-ba...
In this paper, we propose a new method for calculating the output layer ...
We describe DyNet, a toolkit for implementing neural network models base...