dc.contributor.author | Brozović Rončević, Dunja |
dc.contributor.author | Ćavar, Damir |
dc.contributor.author | Ćavar, Małgorzata |
dc.contributor.author | Stojanov, Tomislav |
dc.contributor.author | Štrkalj Despot, Kristina |
dc.contributor.author | Ljubešić, Nikola |
dc.contributor.author | Erjavec, Tomaž |
dc.date.accessioned | 2018-03-07T14:59:31Z |
dc.date.available | 2018-03-07T14:59:31Z |
dc.date.issued | 2018-03-07 |
dc.identifier.uri | http://hdl.handle.net/11356/1180 |
dc.description | The Croatian Language Corpus was built between 2007 and 2011 at the Institute of Croatian Language and Linguistics in the scope of the research programme "Hrvatska jezična riznica" as a reference corpus of Croatian language to serve various lexicographic and other linguistic and language technology projects. The corpus consists of 28% of fiction texts and 72% of specialized texts. In 2017, the corpus was segmented, part-of-speech tagged and lemmatized inside the MREŽNIK project to be used for the development of the first Croatian corpus-based dictionary. |
dc.language.iso | hrv |
dc.publisher | Institute of Croatian Language and Linguistics |
dc.relation.isreferencedby | http://riznica.ihjj.hr/CLC-Slavicorp.pdf |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | http://riznica.ihjj.hr |
dc.subject | reference corpus |
dc.title | Croatian language corpus Riznica 0.1 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute |
sponsor | Croatian Science Foundation IP-2016-06-2141 MREŽNIK nationalFunds |
sponsor | Ministry of Science, Education and Sports 2120920 Hrvatska jezična riznica nationalFunds |
size.info | 101782863 tokens |
size.info | 85273724 words |
size.info | 4717985 sentences |
size.info | 14781 texts |
files.count | 1 |
files.size | 479969047 |
featuredService.kontext | search|https://www.clarin.si/kontext/first_form?corpname=riznica |
featuredService.noske | search|https://www.clarin.si/ske/#dashboard?corpname=riznica |
Files in this item
This item is
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)





- Name
- Riznica.zip
- Size
- 457.73 MB
- Format
- application/zip
- Description
- Corpus in vertical format
- MD5
- 8024f2685e7f661f14b8d07e73c8390e
- Riznica
- riznica.vert1 GB
- riznica.regi2 kB
- README.txt167 B