EU DGT-UD: Czech
This action may take several minutes for large corpora, please wait.

EU DGT-UD: Czech

JRC EU DGT Translation Memory (2016) annotated with UD-Pipe: Czech part

Counts
Tokens99910405
Words76598197
Sentences5682283
Documents44195
General info
Corpus description Document
LanguageCzech
EncodingUTF-8
Compiled08/15/2018 08:56:44
Tagset Description
Lexicon sizes
word562516
lempos433705
tag 2262
pos 17
feats 2166
deprel 41
head_word 411698
head_lempos311763
head_tag 2148
head_pos 18
head_feats 2065
lc 473056
lemma392544
lemma_lc350550
head_lc 355060
head_lemma286141
head_lemma_lc259667
Tags legend
NounNOUN.*
Noun properPROPN.*
VerbVERB.*
AdjectiveAdj.*
PronounPRON.*
AdverbADV.*
AdpositionADP.*
Coord_ConjunctionCCONJ.*
Subord_ConjunctionSCONJ.*
NumeralNUM.*
ParticlePART.*
DeterminerDET.*
InterjectionINTJ.*
SymbolSYM.*
ResidualX.*

Structures and attributes