Files in this item
| Name | Size | Format | Description | MD5 |
|---|---|---|---|---|
| README.md | 9.01 KB | Unknown | README file for the MaCoCu-Genre corpora | da7ec8e525f2d18fa6e03fa0949f66ed |
| MaCoCu-Genre.bg.jsonl.gz | 13.51 GB | application/gzip | Bulgarian corpus | b0dbf58e812fbbeecd93f602cf4adde6 |
| MaCoCu-Genre.bs.jsonl.gz | 1.91 GB | application/gzip | Bosnian corpus | 025e2adcb84d2d3853b135c2dc758ead |
| MaCoCu-Genre.ca.jsonl.gz | 3.98 GB | application/gzip | Catalan corpus | 7881287c8abe16879c760598f625ad48 |
| MaCoCu-Genre.cnr.jsonl.gz | 431.93 MB | application/gzip | Montenegrin corpus | 1b3219ba3d04d16de59701107fe9dbf3 |
| MaCoCu-Genre.el.jsonl.gz | 17.71 GB | application/gzip | Greek corpus | d9f72dd228930670de623d2476d8fb79 |
| MaCoCu-Genre.hr.jsonl.gz | 6.15 GB | application/gzip | Croatian corpus | ca947f2fb0e85051727f9e91bccfb0a6 |
| MaCoCu-Genre.is.jsonl.gz | 2.29 GB | application/gzip | Icelandic corpus | 7cb6ab9e0c7e44c92b84b8e1947f81c4 |
| MaCoCu-Genre.mk.jsonl.gz | 1.94 GB | application/gzip | Macedonian corpus | 1405c45d86ce1b44afc00c0cfe04b931 |
| MaCoCu-Genre.sl.jsonl.gz | 4.78 GB | application/gzip | Slovenian corpus | d916830d1f3bbdbe42f0cf041cf8b220 |
| MaCoCu-Genre.sq.jsonl.gz | 1.49 GB | application/gzip | Albanian corpus | 47b1f9265d8696475c133a7bf1769f55 |
| MaCoCu-Genre.sr.jsonl.gz | 6.48 GB | application/gzip | Serbian corpus | 1244960d52b23e1c741fc76a18f01847 |
| MaCoCu-Genre.tr.jsonl.gz | 13.46 GB | application/gzip | Turkish corpus | abe376c21256798ded30e54770666aa0 |
| MaCoCu-Genre.uk.jsonl.gz | 27.31 GB | application/gzip | Ukrainian corpus | 5c7a1eca339b18be270993ec294b0844 |
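
Since each file is listed with an MD5 checksum and several archives exceed 10 GB, it can be worth verifying a download before using it. The following is a minimal sketch, not part of the original item: the function names `md5_of_file` and `matches_listing` and the `EXPECTED_MD5` mapping are hypothetical, and it assumes the downloaded file keeps the name shown in the listing. Only two checksums are copied in for illustration; the rest can be added the same way.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing (illustrative subset only).
EXPECTED_MD5 = {
    "README.md": "da7ec8e525f2d18fa6e03fa0949f66ed",
    "MaCoCu-Genre.cnr.jsonl.gz": "1b3219ba3d04d16de59701107fe9dbf3",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks
    so that multi-gigabyte archives are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_MD5):
    """Compare a local file's MD5 with the value recorded for its file name.

    Raises KeyError if the file name has no recorded checksum.
    """
    name = Path(path).name
    if name not in expected:
        raise KeyError(f"no recorded MD5 for {name!r}")
    return md5_of_file(path) == expected[name].lower()
```

For example, `matches_listing("MaCoCu-Genre.cnr.jsonl.gz")` would return `True` only if the local copy's digest equals the listed value; a `False` result suggests an incomplete or corrupted download.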