deWaC (German Web)
This action may take several minutes for large corpora, please wait.

deWaC (German Web)

German WaCky Web Corpus (2010)

Counts
Tokens1627118296
Words1348199799
Sentences92395254
Documents1751903
General info
Corpus description Document
LanguageGerman
EncodingUTF-8
Compiled10/28/2017 21:44:56
Tagset Description
Lexicon sizes
word16097581
lempos16206346
ctag55
tag164
tag_sl164
lc 15080119
norm 15080119
lemma15631409
lemma_lc14856931
Tags legend
NounN.*
Noun properNp.*
Noun commonNc.*
VerbV.*
AdjectiveA.*
PronounP.*
AdverbR.*
PrepositionS.*
ConjunctionC.*
NumeralM.*
ParticleQ.*
ArticleT.*
InterjectionI.*
AbbreviationY.*
ResidualX.*

Structures and attributes