Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)




- Name
- jpWaC-L4.vert.gz
- Size
- 1.24 MB
- Format
- application/gzip
- Description
- L4 sentences (very easy)
- MD5
- 50843dff6fcb5068d45703312de081c4

- Name
- jpWaC-L3.vert.gz
- Size
- 4.19 MB
- Format
- application/gzip
- Description
- L3 sentences (easy)
- MD5
- ed5f6d5ac497f9bccbb777d9aa9da16b

- Name
- jpWaC-L2.vert.gz
- Size
- 19.22 MB
- Format
- application/gzip
- Description
- L2 sentences (intemediate)
- MD5
- 23ea8a0e7710a5c63b3ce16a4b420fdd

- Name
- jpWaC-L1.vert.gz
- Size
- 8 MB
- Format
- application/gzip
- Description
- L1 sentences (difficult)
- MD5
- 8e959ec1bffbd26ca0b0fec29c31d222

- Name
- jpWaC-L0.vert.gz
- Size
- 178.92 MB
- Format
- application/gzip
- Description
- L0 sentences (very difficult)
- MD5
- 7ba968fa47702c36bd91d80c36fd5e4b

- Name
- jpWaC-L.vert.gz
- Size
- 1.39 GB
- Format
- application/gzip
- Description
- Complete Web corpus
- MD5
- 08bb1469bc3a21a2a34115c884392d70