Pretrained large character-level language models have been recently revi...
Subword-level models have been the dominant paradigm in NLP. However, ch...
Character-based representations have important advantages over subword-b...
Massively multilingual models are promising for transfer learning across...
This paper investigates very low resource language model pretraining, wh...
This paper describes the methods behind the systems submitted by the Uni...
Recent advances in the field of multilingual dependency parsing have bro...
The transformer-based pre-trained language model BERT has helped to impr...
We propose MoNoise: a normalization model focused on generalizability an...
We present an approach to learning multi-sense word embeddings relying b...
Word representations induced from models with discrete latent variables ...