bsWaC (Bosnian Web)
This action may take several minutes for large corpora, please wait.

bsWaC (Bosnian Web)

Bosnian Web Corpus v1.2 (2014)

Counts
Tokens286865790
Words248478730
Sentences12886124
Paragraphs5725897
Documents896059
General info
Corpus description Document
LanguageBosnian
EncodingUTF-8
Compiled10/28/2017 19:31:05
Tagset Description
Lexicon sizes
word3148685
norm2641969
lempos2609084
tag696
lc 2681887
lemma2257982
lemma_lc2034046
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
ArticleT.*
InterjectionI.*
AbbreviationY.*
ResidualX.*

Structures and attributes