itWaC (Italian Web)

Italian WaCky Web Corpus (2010)

Counts
Tokens      1,909,698,363
Words       1,593,977,091
Sentences      68,147,599
Documents       1,867,618
General info
Corpus description   Document
Language             Italian
Encoding             UTF-8
Compiled             10/28/2017 23:00:49
Tagset               Description
Lexicon sizes
word
lempos
ctag
tag
tag_sl
lc
norm
lemma
lemma_lc
Tags legend
Noun            N.*
Noun proper     Np.*
Noun common     Nc.*
Verb            V.*
Adjective       A.*
Pronoun         P.*
Adverb          R.*
Preposition     S.*
Conjunction     C.*
Numeral         M.*
Particle        Q.*
Article         T.*
Interjection    I.*
Abbreviation    Y.*
Residual        X.*
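
The legend entries read as regular expressions over tag strings, so a tag can be mapped to its category by matching it against each pattern. The following is a minimal sketch under that assumption, using Python's standard re module. The function name categories_for and the sample tags in the usage note are hypothetical and are not taken from the tagset description.

```python
import re

# Patterns copied from the tags legend above.
# Assumption: each pattern is meant to match the whole tag string.
TAG_LEGEND = {
    "Noun": r"N.*",
    "Noun proper": r"Np.*",
    "Noun common": r"Nc.*",
    "Verb": r"V.*",
    "Adjective": r"A.*",
    "Pronoun": r"P.*",
    "Adverb": r"R.*",
    "Preposition": r"S.*",
    "Conjunction": r"C.*",
    "Numeral": r"M.*",
    "Particle": r"Q.*",
    "Article": r"T.*",
    "Interjection": r"I.*",
    "Abbreviation": r"Y.*",
    "Residual": r"X.*",
}


def categories_for(tag):
    """Return every legend category whose pattern matches the full tag.

    Hypothetical helper for illustration; matching is case-sensitive.
    """
    return [
        name
        for name, pattern in TAG_LEGEND.items()
        if re.fullmatch(pattern, tag)
    ]
```

Because N.* also covers Np.* and Nc.*, a tag beginning with Np falls under both "Noun" and "Noun proper". For example, categories_for("Np") yields both categories, while a lowercase string such as "np" matches nothing.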