Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

04/22/2020
by   Joakim Nivre, et al.
0

Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word segmentation; a morphological layer comprising lemmas, universal part-of-speech tags, and standardized morphological features; and a syntactic layer focusing on syntactic relations between predicates, arguments and modifiers. In this paper, we describe version 2 of the guidelines (UD v2), discuss the major changes from UD v1 to UD v2, and give an overview of the currently available treebanks for 90 languages.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset