The language-independency of encoded representations within multilingual...
This paper studies the impact of layer normalization (LayerNorm) on zero...
In spite of the potential for ground-breaking achievements offered by la...
Large-scale language-agnostic sentence embedding models such as LaBSE (F...
Solving math word problems is the task that analyses the relation of qua...
Relation extraction (RE) has achieved remarkable progress with the help ...
To solve Math Word Problems, human students leverage diverse reasoning l...
Massively multilingual sentence representation models, e.g., LASER, SBER...
Contrastive pre-training on distant supervision has shown remarkable eff...
Word alignment has proven to benefit many-to-many neural machine transla...
In the present study, we propose novel sequence-to-sequence pre-training...
Large-scale models for learning fixed-dimensional cross-lingual sentence...
Neural machine translation (NMT) needs large parallel corpora for state-...
Sequence-to-sequence (S2S) pre-training using large monolingual data is ...