itWaC (Italian Web)
This action may take several minutes for large corpora, please wait.

itWaC (Italian Web)

Italian WaCky Web Corpus (2010)

Counts
Tokens1909698363
Words1593977091
Sentences68147599
Documents1867618
General info
Corpus description Document
LanguageItalian
EncodingUTF-8
Compiled10/28/2017 23:00:49
Tagset Description
Lexicon sizes
word6278105
lempos5865706
ctag52
tag72
tag_sl72
lc 5274403
norm 5274403
lemma5653649
lemma_lc5050122
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
ArticleT.*
InterjectionI.*
AbbreviationY.*
ResidualX.*

Structures and attributes