ReLDI-hr (manually tagged Croatian tweets)
This action may take several minutes for large corpora, please wait.

ReLDI-hr (manually tagged Croatian tweets)

Croatian tweets with manualy normalised (standardised), morphosyntactically tagged and lemmatised words and named entities ReLDI-hr v2.1

Counts
Tokens89104
Words71768
Sentences7939
Documents3871
General info
Corpus description Document
LanguageCroatian
EncodingUTF-8
Compiled07/28/2019 20:02:34
Tagset Description
Lexicon sizes
word27289
norm 25395
lempos17260
tag 694
ud_pos 33
ud_feats 766
diff5
lc 25219
lemma16659
lemma_lc16020
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
ArticleT.*
InterjectionI.*
AbbreviationY.*
ResidualX.*

Structures and attributes