jpWaC-L3 (Japanese Web, easy sentences)
This action may take several minutes for large corpora, please wait.

jpWaC-L3 (Japanese Web, easy sentences)

Japanese Web texts with automatically assigned difficulty level '3' (easy). PoS and lemma annotated with ChaSen. Crawl and annotation in 2007.

Counts
Tokens1039984
Words906111
Sentences103298
Documents23314
General info
Corpus description Document
LanguageJapanese
EncodingUTF-8
Compiled10/28/2017 18:27:04
Tagset Description
Lexicon sizes
word4089
lempos2491
tag63
ctag63
level3
lc 4089
lemma2339
lemma_lc2339

Structures and attributes