EU DGT-UD: Portuguese

JRC EU DGT Translation Memory (2016), annotated with UDPipe: Portuguese part

Counts
Tokens 123673320
Words 104649382
Sentences 5739461
Documents 39100
General info
Corpus description Document
Language Portuguese
Encoding UTF-8
Compiled 08/15/2018 10:09:02
Tagset Description
Lexicon sizes
word 575595
lempos 597155
tag 1085
pos 17
feats 202
deprel 36
head_word 345526
head_lempos 333759
head_tag 1011
head_pos 19
head_feats 199
lc 507121
lemma 544125
lemma_lc 493621
head_lc 300842
head_lemma 309194
head_lemma_lc 280014
Tags legend
Noun NOUN.*
Noun proper PROPN.*
Verb VERB.*
Adjective ADJ.*
Pronoun PRON.*
Adverb ADV.*
Adposition ADP.*
Coord_Conjunction CCONJ.*
Subord_Conjunction SCONJ.*
Numeral NUM.*
Particle PART.*
Determiner DET.*
Interjection INTJ.*
Symbol SYM.*
Residual X.*
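
The legend entries read as regular expressions over the tag attribute. The following is a minimal sketch of how such patterns could be applied, assuming each pattern must match the whole tag value (as is usual for attribute regexes in corpus query languages). Only a subset of the legend is included, and the helper name `category_of` and the sample tag strings are hypothetical, not taken from the corpus.

```python
import re

# Subset of the tags legend: category name -> tag pattern.
LEGEND = {
    "Noun": "NOUN.*",
    "Noun proper": "PROPN.*",
    "Verb": "VERB.*",
    "Pronoun": "PRON.*",
    "Adverb": "ADV.*",
    "Adposition": "ADP.*",
    "Numeral": "NUM.*",
    "Determiner": "DET.*",
}


def category_of(tag):
    """Return the first legend category whose pattern matches the whole tag, else None."""
    for name, pattern in LEGEND.items():
        # fullmatch anchors the pattern at both ends of the tag value (assumption).
        if re.fullmatch(pattern, tag):
            return name
    return None


if __name__ == "__main__":
    # Sample tag values are illustrative only; real tag values may carry extra suffixes.
    for sample in ["NOUN", "PROPN", "PRON", "VERB.example", "ZZZ"]:
        print(sample, "->", category_of(sample))
```

Because the patterns end in `.*`, any tag value that starts with the category prefix falls into that category, which would be consistent with the tag lexicon (1085 values) being much larger than the pos lexicon (17 values).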

Structures and attributes