This action may take several minutes for large corpora, please wait.
EU DGT-UD: Romanian
JRC EU DGT Translation Memory (2016) annotated with UD-Pipe: Romanian part
Counts |
Tokens | 72828357 |
Words | 60378180 |
Sentences | 3656735 |
Documents | 26716 |
General info |
Corpus description |
Document |
Language | Romanian |
Encoding | UTF-8 |
Compiled | 08/15/2018 10:12:21 |
Tagset |
Description |
Lexicon sizes |
word | 506965 |
lempos | 526428 |
tag
| 416 |
pos
| 17 |
feats
| 338 |
deprel
| 47 |
head_word
| 344951 |
head_lempos | 335694 |
head_tag
| 396 |
head_pos
| 18 |
head_feats
| 325 |
lc
| 446129 |
lemma | 459764 |
lemma_lc | 422617 |
head_lc
| 298883 |
head_lemma | 297648 |
head_lemma_lc | 271910 |
Tags legend |
Noun | NOUN.* |
Noun proper | PROPN.* |
Verb | VERB.* |
Adjective | Adj.* |
Pronoun | PRON.* |
Adverb | ADV.* |
Adposition | ADP.* |
Coord_Conjunction | CCONJ.* |
Subord_Conjunction | SCONJ.* |
Numeral | NUM.* |
Particle | PART.* |
Determiner | DET.* |
Interjection | INTJ.* |
Symbol | SYM.* |
Residual | X.* |
Structures and attributes