EU DGT-UD: Slovenian

Slovenian part of the JRC EU DGT Translation Memory (2016), annotated with UDPipe.

Counts
Tokens 100252375
Words 77865562
Sentences 5740018
Documents 45111
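
The counts above can be turned into a few derived ratios. The following is a minimal sketch that uses only the four figures listed in the Counts table; the variable names are illustrative and not part of the corpus interface.

```python
# Counts copied from the Counts table of this corpus.
tokens = 100_252_375
words = 77_865_562
sentences = 5_740_018
documents = 45_111

# Tokens that are not counted as words (difference of the two totals).
non_word_tokens = tokens - words

# Simple averages derived from the totals.
tokens_per_sentence = tokens / sentences
words_per_token = words / tokens
sentences_per_document = sentences / documents

print(f"non-word tokens:        {non_word_tokens}")
print(f"tokens per sentence:    {tokens_per_sentence:.2f}")
print(f"words per token:        {words_per_token:.3f}")
print(f"sentences per document: {sentences_per_document:.1f}")
```

These are corpus-wide averages only; they say nothing about how sentence or document length is distributed.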
General info
Corpus description Document
Language Slovenian
Encoding UTF-8
Compiled 08/15/2018 10:21:48
Tagset Description
Lexicon sizes
word 589611
lempos 578128
tag 1091
pos 17
feats 966
deprel 31
head_word 425281
head_lempos 404111
head_tag 1029
head_pos 18
head_feats 908
lc 500142
lemma 465391
lemma_lc 408044
head_lc 366517
head_lemma 336583
head_lemma_lc 297793
Tags legend
Noun NOUN.*
Noun proper PROPN.*
Verb VERB.*
Adjective ADJ.*
Pronoun PRON.*
Adverb ADV.*
Adposition ADP.*
Coord_Conjunction CCONJ.*
Subord_Conjunction SCONJ.*
Numeral NUM.*
Particle PART.*
Determiner DET.*
Interjection INTJ.*
Symbol SYM.*
Residual X.*
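
Each legend entry pairs a readable label with a regular expression over the tag attribute. As a rough illustration of how such patterns could be applied, the sketch below maps a tag string to its legend label with Python's `re` module. The function name `legend_label` and the sample tag strings are hypothetical, and the exact shape of this corpus's tag values is an assumption; only the label and pattern pairs come from the legend.

```python
import re

# Label/pattern pairs taken from the Tags legend.
# Assumption: the Adjective pattern is written in upper case here so that it
# matches the Universal Dependencies tag ADJ under case-sensitive matching.
TAG_LEGEND = [
    ("Noun", r"NOUN.*"),
    ("Noun proper", r"PROPN.*"),
    ("Verb", r"VERB.*"),
    ("Adjective", r"ADJ.*"),
    ("Pronoun", r"PRON.*"),
    ("Adverb", r"ADV.*"),
    ("Adposition", r"ADP.*"),
    ("Coord_Conjunction", r"CCONJ.*"),
    ("Subord_Conjunction", r"SCONJ.*"),
    ("Numeral", r"NUM.*"),
    ("Particle", r"PART.*"),
    ("Determiner", r"DET.*"),
    ("Interjection", r"INTJ.*"),
    ("Symbol", r"SYM.*"),
    ("Residual", r"X.*"),
]


def legend_label(tag):
    """Return the legend label whose pattern matches the whole tag, or None."""
    for label, pattern in TAG_LEGEND:
        if re.fullmatch(pattern, tag):
            return label
    return None


# Hypothetical sample tags, not taken from the corpus.
for sample in ["NOUN", "PROPN", "CCONJ", "PUNCT"]:
    print(sample, "->", legend_label(sample))
```

Tags that match none of the listed patterns return `None` in this sketch.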

Structures and attributes