SETimes.SR

Manually annotated Serbian corpus SETimes.SR (morphosyntax, syntax, named entities)

Counts
Tokens: 86726
Words: 74585
Sentences: 3891
Documents: 163
General info
Corpus description Document
Language: Serbian
Encoding: UTF-8
Compiled: 08/19/2018 12:20:04
Tagset Description
Lexicon sizes
word
id
lempos
tag
feats
ud_dep
ud_dep_head_id
ud_dep_head_lemma
ud_dep_head_tag
lc
norm
lemma
lemma_lc
Tags legend
Noun: N.*
Noun proper: Np.*
Noun common: Nc.*
Verb: V.*
Adjective: A.*
Pronoun: P.*
Adverb: R.*
Preposition: S.*
Conjunction: C.*
Numeral: M.*
Particle: Q.*
Interjection: I.*
Abbreviation: Y.*
Residual: X.*
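
The legend above pairs each part of speech with a regular expression over the tag attribute. A minimal Python sketch of how such patterns could be applied to a single tag follows. It assumes each pattern is meant to match the whole tag value; the function name `tag_category` and the example tags in the usage lines are illustrative and not taken from the corpus documentation.

```python
import re

# Patterns copied from the tags legend. The two noun subtypes come first
# because they are more specific than the general "Noun" pattern.
TAG_LEGEND = [
    ("Noun proper", r"Np.*"),
    ("Noun common", r"Nc.*"),
    ("Noun", r"N.*"),
    ("Verb", r"V.*"),
    ("Adjective", r"A.*"),
    ("Pronoun", r"P.*"),
    ("Adverb", r"R.*"),
    ("Preposition", r"S.*"),
    ("Conjunction", r"C.*"),
    ("Numeral", r"M.*"),
    ("Particle", r"Q.*"),
    ("Interjection", r"I.*"),
    ("Abbreviation", r"Y.*"),
    ("Residual", r"X.*"),
]


def tag_category(tag):
    """Return the first legend label whose pattern matches the whole tag.

    Returns None when no pattern in the legend matches.
    """
    for label, pattern in TAG_LEGEND:
        if re.fullmatch(pattern, tag):
            return label
    return None


# Illustrative tag values (assumed, not listed in the legend itself).
print(tag_category("Ncmsn"))
print(tag_category("Vmr3s"))
```

Listing the subtypes before the general noun pattern is a design choice of this sketch: a tag starting with `Np` or `Nc` also matches `N.*`, so the order decides which label is reported.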
Lempos suffixes
Noun: -n
Verb: -v
Adjective: -a
Pronoun: -p
Adverb: -r
Preposition: -s
Conjunction: -c
Numeral: -m
Particle: -q
Interjection: -i
Abbreviation: -y
Residual: -x
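
The suffix table above can be used to build or take apart values of the lempos attribute. The sketch below assumes that a lempos value is simply the lemma followed by the part-of-speech suffix, which this page does not state explicitly. The helper names `make_lempos` and `split_lempos` and the example lemma are hypothetical.

```python
# Suffixes copied from the "Lempos suffixes" table.
LEMPOS_SUFFIXES = {
    "Noun": "-n",
    "Verb": "-v",
    "Adjective": "-a",
    "Pronoun": "-p",
    "Adverb": "-r",
    "Preposition": "-s",
    "Conjunction": "-c",
    "Numeral": "-m",
    "Particle": "-q",
    "Interjection": "-i",
    "Abbreviation": "-y",
    "Residual": "-x",
}

# Reverse lookup: suffix -> part-of-speech label.
SUFFIX_TO_CATEGORY = {suffix: label for label, suffix in LEMPOS_SUFFIXES.items()}


def make_lempos(lemma, category):
    """Append the suffix for `category` to `lemma` (assumed lempos format)."""
    return lemma + LEMPOS_SUFFIXES[category]


def split_lempos(lempos):
    """Split a lempos value into (lemma, category).

    The split is made at the last hyphen so that lemmas which themselves
    contain hyphens stay intact.
    """
    lemma, sep, letter = lempos.rpartition("-")
    category = SUFFIX_TO_CATEGORY.get(sep + letter)
    if not sep or category is None:
        raise ValueError(f"not a recognised lempos value: {lempos!r}")
    return lemma, category


# Illustrative lemma (assumed, not taken from the corpus).
print(make_lempos("vlada", "Noun"))
print(split_lempos("vlada-n"))
```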