deWaC (German Web)

German WaCky Web Corpus (2010)

Counts
Tokens: 1,627,118,296
Words: 1,348,199,799
Sentences: 92,395,254
Documents: 1,751,903
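
The counts above can be turned into a few derived ratios. The following is a minimal sketch that uses only the four figures listed; the variable names are illustrative, and how the corpus tool distinguishes a word from a token is not stated here, so the word share is just the ratio of the two reported numbers.

```python
# Counts copied from the listing above.
TOKENS = 1627118296
WORDS = 1348199799
SENTENCES = 92395254
DOCUMENTS = 1751903

# Simple averages derived from the reported totals.
tokens_per_sentence = TOKENS / SENTENCES
sentences_per_document = SENTENCES / DOCUMENTS
tokens_per_document = TOKENS / DOCUMENTS

# Share of tokens that are counted as words (ratio of the two totals only).
word_share = WORDS / TOKENS

print(f"tokens per sentence:    {tokens_per_sentence:.2f}")
print(f"sentences per document: {sentences_per_document:.2f}")
print(f"tokens per document:    {tokens_per_document:.2f}")
print(f"words / tokens:         {word_share:.4f}")
```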
General info
Corpus description: Document
Language: German
Encoding: UTF-8
Compiled: 10/28/2017 21:44:56
Tagset: Description
Lexicon sizes
word
lempos
ctag
tag
tag_sl
lc
norm
lemma
lemma_lc
Tags legend
Noun: N.*
Noun proper: Np.*
Noun common: Nc.*
Verb: V.*
Adjective: A.*
Pronoun: P.*
Adverb: R.*
Preposition: S.*
Conjunction: C.*
Numeral: M.*
Particle: Q.*
Article: T.*
Interjection: I.*
Abbreviation: Y.*
Residual: X.*
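
The legend entries are regular-expression patterns over tag strings, so a tag can be mapped to its category by matching it against them. The sketch below is one possible way to do that in Python; the function name `classify_tag` and the sample tags in the usage lines are hypothetical and are not taken from the corpus. Because `N.*` also matches everything that `Np.*` and `Nc.*` match, the more specific noun patterns are checked first.

```python
import re

# Patterns copied from the tags legend. Order matters: the specific noun
# patterns precede the generic "N.*" so they are not shadowed by it.
TAG_LEGEND = [
    ("Noun proper", r"Np.*"),
    ("Noun common", r"Nc.*"),
    ("Noun", r"N.*"),
    ("Verb", r"V.*"),
    ("Adjective", r"A.*"),
    ("Pronoun", r"P.*"),
    ("Adverb", r"R.*"),
    ("Preposition", r"S.*"),
    ("Conjunction", r"C.*"),
    ("Numeral", r"M.*"),
    ("Particle", r"Q.*"),
    ("Article", r"T.*"),
    ("Interjection", r"I.*"),
    ("Abbreviation", r"Y.*"),
    ("Residual", r"X.*"),
]

COMPILED_LEGEND = [(label, re.compile(pattern)) for label, pattern in TAG_LEGEND]


def classify_tag(tag):
    """Return the first legend category whose pattern matches the whole tag,
    or None if no pattern matches. Matching is case-sensitive."""
    for label, pattern in COMPILED_LEGEND:
        if pattern.fullmatch(tag):
            return label
    return None


# Hypothetical tag strings, used only to exercise the patterns.
for sample in ["Npmsn", "Ncfsa", "Vmip3s", "Z"]:
    print(sample, "->", classify_tag(sample))
```

Used in a corpus query, the same patterns would normally be written directly against the tag attribute; the function here only illustrates how the legend partitions the tag space.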