EU DGT-UD: Croatian
This action may take several minutes for large corpora, please wait.

EU DGT-UD: Croatian

JRC EU DGT Translation Memory (2016) annotated with UD-Pipe: Croatian part

Counts
Tokens28089498
Words23167343
Sentences1627529
Documents9394
General info
Corpus description Document
LanguageCroatian
EncodingUTF-8
Compiled08/15/2018 09:36:10
Tagset Description
Lexicon sizes
word376185
lempos334187
tag 1064
pos 17
feats 982
deprel 36
head_word 246184
head_lempos208355
head_tag 966
head_pos 18
head_feats 891
lc 329089
lemma287579
lemma_lc259747
head_lc 216981
head_lemma181604
head_lemma_lc164991
Tags legend
NounNOUN.*
Noun properPROPN.*
VerbVERB.*
AdjectiveAdj.*
PronounPRON.*
AdverbADV.*
AdpositionADP.*
Coord_ConjunctionCCONJ.*
Subord_ConjunctionSCONJ.*
NumeralNUM.*
ParticlePART.*
DeterminerDET.*
InterjectionINTJ.*
SymbolSYM.*
ResidualX.*

Structures and attributes