EU DGT-UD: Bulgarian
This action may take several minutes for large corpora, please wait.

EU DGT-UD: Bulgarian

JRC EU DGT Translation Memory (2016) annotated with UD-Pipe: Bulgarian part

Counts
Tokens73301874
Words59796854
Sentences3986937
Documents26259
General info
Corpus description Document
LanguageBulgarian
EncodingUTF-8
Compiled08/15/2018 08:52:06
Tagset Description
Lexicon sizes
word594850
lempos578489
tag 418
pos 17
feats 333
deprel 32
head_word 392292
head_lempos381013
head_tag 399
head_pos 18
head_feats 321
lc 513779
lemma512021
lemma_lc462762
head_lc 344337
head_lemma341051
head_lemma_lc310704
Tags legend
NounNOUN.*
Noun properPROPN.*
VerbVERB.*
AdjectiveAdj.*
PronounPRON.*
AdverbADV.*
AdpositionADP.*
Coord_ConjunctionCCONJ.*
Subord_ConjunctionSCONJ.*
NumeralNUM.*
ParticlePART.*
DeterminerDET.*
InterjectionINTJ.*
SymbolSYM.*
ResidualX.*

Structures and attributes