Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)




- Name
- bswac.gz
- Size
- 654.71 MB
- Format
- application/gzip
- Description
- Web-as-Corpus 2014 Bosnian texts
- MD5
- f40da84f9e1a4049f589480b2c6f647c

- Name
- hrwac.gz
- Size
- 3.1 GB
- Format
- application/gzip
- Description
- Web-as-Corpus 2011 and 2014 Croatian texts
- MD5
- 7306c700626c5b9a803879fd96da2d21

- Name
- cnrwac.gz
- Size
- 214.97 MB
- Format
- application/gzip
- Description
- Web-as-Corpus 2019 Montenegrin texts
- MD5
- 3169e200a089b9dd7214096bec8c0376

- Name
- srwac.gz
- Size
- 1.21 GB
- Format
- application/gzip
- Description
- Web-as-Corpus 2014 Serbian texts
- MD5
- 66f3a5e31b2c38215c8f4eddd7df6903

- Name
- cc100-hr.gz
- Size
- 7.49 GB
- Format
- application/gzip
- Description
- Common Crawl Croatian texts
- MD5
- a9702d6d1f6dd2bb34de5acb0f45f3aa

- Name
- cc100-sr.gz
- Size
- 1.79 GB
- Format
- application/gzip
- Description
- Common Crawl Serbian texts
- MD5
- c7e9b916c22a5ecc13f65d657b6815f2

- Name
- classla-bs.gz
- Size
- 1.31 GB
- Format
- application/gzip
- Description
- Web-as-Corpus 2020 Bosnian texts
- MD5
- ddffa3382d25409006b20338bb8c8e7b

- Name
- classla-hr.gz
- Size
- 3.32 GB
- Format
- application/gzip
- Description
- Web-as-Corpus 2020 Croatian texts
- MD5
- 967c44708a26cda5003696192793aab4

- Name
- classla-sr.gz
- Size
- 1.85 GB
- Format
- application/gzip
- Description
- Web-as-Corpus 2020 Serbian texts
- MD5
- 4ecf85bd53ba955ee5d2bbbf73a7d0f6

- Name
- riznica.gz
- Size
- 217.57 MB
- Format
- application/gzip
- Description
- Croatian newspaper and literary texts
- MD5
- df60a9569317ce390be0424604a446f1