ReLDI-hr (manually tagged Croatian tweets)
This action may take several minutes for large corpora, please wait.

ReLDI-hr (manually tagged Croatian tweets)

Croatian tweets with manualy normalised (standardised), morphosyntactically tagged and lemmatised words and named entities ReLDI-hr v2.0

Counts
Tokens89102
Words71768
Sentences7938
Documents3871
General info
Corpus description Document
LanguageCroatian
EncodingUTF-8
Compiled12/16/2017 15:17:51
Tagset Description
Lexicon sizes
word
norm
lempos
tag
diff
lc
lemma
lemma_lc
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
ArticleT.*
InterjectionI.*
AbbreviationY.*
ResidualX.*