jpWaC-L4 (Japanese Web, v.easy sentences)

Japanese Web texts with automatically assigned difficulty level '4' (very easy). PoS-tagged and lemmatized with ChaSen. Crawled and annotated in 2007.

Counts
Tokens: 300958
Words: 255701
Sentences: 36563
Documents: 14316
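
The counts above imply very short sentences and documents, which fits the "very easy" level. As a rough illustration only, the sketch below derives a few averages from the four figures. The dictionary keys and the helper name ratio are hypothetical and not part of any corpus tool.

```python
# Counts copied from the listing above; the key names are illustrative.
counts = {
    "tokens": 300958,
    "words": 255701,
    "sentences": 36563,
    "documents": 14316,
}


def ratio(numerator: str, denominator: str) -> float:
    """Divide one count by another (hypothetical helper)."""
    return counts[numerator] / counts[denominator]


tokens_per_sentence = ratio("tokens", "sentences")
sentences_per_document = ratio("sentences", "documents")
tokens_per_document = ratio("tokens", "documents")
word_share = ratio("words", "tokens")  # fraction of tokens counted as words

print(f"tokens per sentence:    {tokens_per_sentence:.2f}")
print(f"sentences per document: {sentences_per_document:.2f}")
print(f"tokens per document:    {tokens_per_document:.2f}")
print(f"words / tokens:         {word_share:.2f}")
```

These are plain averages over the whole corpus and say nothing about how sentence length is distributed across documents.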
General info
Corpus description Document
Language: Japanese
Encoding: UTF-8
Compiled: 10/28/2017 18:27:01
Tagset Description
Lexicon sizes
word
lempos
tag
ctag
level
lc
lemma
lemma_lc