SETimes.SR
This action may take several minutes for large corpora, please wait.

SETimes.SR

Manually annotated Serbian corpus SETimes.SR (morphosyntax, syntax, named entities)

Counts
Tokens86726
Words74585
Sentences3891
Documents163
General info
Corpus description Document
LanguageSerbian
EncodingUTF-8
Compiled08/19/2018 12:20:04
Tagset Description
Lexicon sizes
word17586
id 86726
lempos 9071
tag 557
feats 878
ud_dep 36
ud_dep_head_id 30882
ud_dep_head_lemma 5426
ud_dep_head_tag 331
lc 16374
norm 16374
lemma8617
lemma_lc 8536
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
InterjectionI.*
AbbreviationY.*
ResidualX.*
Lempos suffixes
Noun-n
Verb-v
Adjective-a
Pronoun-p
Adverb-r
Preposition-s
Conjunction-c
Numeral-m
Particle-q
Interjection-i
Abbreviation-y
Residual-x

Structures and attributes