We introduce and demonstrate how to effectively train multilingual machi...
Incorporating language-specific (LS) modules is a proven method to boost...
Bilingual lexicons form a critical component of various natural language...
The ability to extract high-quality translation dictionaries from monoli...
Multilingual sentence representations from large models can encode seman...
Mining high-quality bitexts for low-resource languages is challenging. T...
Driven by the goal of eradicating language barriers on a global scale,
m...
Recent model pruning methods have demonstrated the ability to remove
red...
Obtaining meaningful quality scores for machine translation systems thro...
Neural Machine Translation (NMT) models are typically trained on
heterog...
Recent work in multilingual machine translation (MMT) has focused on the...
Neural Machine Translation (NMT) models are known to suffer from noisy
i...
Much recent work in bilingual lexicon induction (BLI) views word embeddi...
This paper presents the JHU-Microsoft joint submission for WMT 2021 qual...
We propose a novel scheme to use the Levenshtein Transformer to perform ...
We describe Facebook's multilingual model submission to the WMT2021 shar...
Typically, a linearly orthogonal transformation mapping is learned by
al...
As neural machine translation (NMT) systems become an important part of
...
The scarcity of parallel data is a major obstacle for training high-qual...
A popular natural language processing task decades ago, word alignment h...
Cross-lingual named-entity lexicon are an important resource to multilin...
Saliency methods are widely used to interpret neural network predictions...
Large web-crawled corpora represent an excellent resource for improving ...
Low-resource Multilingual Neural Machine Translation (MNMT) is typically...
Linear embedding transformation has been shown to be effective for zero-...
Simultaneous text translation and end-to-end speech translation have rec...
Transformer-based models have achieved state-of-the-art performance on s...
The COVID-19 pandemic is the worst pandemic to strike the world in over ...
Many valid translations exist for a given sentence, and yet machine
tran...
In this work, we exploit the simple idea that a document and its transla...
Despite the reported success of unsupervised machine translation (MT), t...
Cross-lingual document alignment aims to identify pairs of documents in ...
We share the findings of the first shared task on improving robustness o...
Despite their original goal to jointly learn to align and translate, Neu...
The term translationese has been used to describe the presence of unusua...
In this paper, we describe our submission to the WMT19 low-resource para...
Stack Long Short-Term Memory (StackLSTM) is useful for various applicati...
The vast majority of language pairs in the world are low-resource becaus...
To better understand the effectiveness of continued training, we analyze...
Standard neural machine translation (NMT) systems operate primarily on w...
We examine how various types of noise in the parallel training data impa...
Draft of textbook chapter on neural machine translation. a comprehensive...
We explore six challenges for neural machine translation: domain mismatc...
Neural machine translation (NMT) models are able to partially learn synt...