Show simple item record

 
dc.contributor.author Brozović Rončević, Dunja
dc.contributor.author Ćavar, Damir
dc.contributor.author Ćavar, Małgorzata
dc.contributor.author Stojanov, Tomislav
dc.contributor.author Štrkalj Despot, Kristina
dc.contributor.author Ljubešić, Nikola
dc.contributor.author Erjavec, Tomaž
dc.date.accessioned 2018-03-07T14:59:31Z
dc.date.available 2018-03-07T14:59:31Z
dc.date.issued 2018-03-07
dc.identifier.uri http://hdl.handle.net/11356/1180
dc.description The Croatian Language Corpus was built between 2007 and 2011 at the Institute of Croatian Language and Linguistics in the scope of the research programme "Hrvatska jezična riznica" as a reference corpus of Croatian language to serve various lexicographic and other linguistic and language technology projects. The corpus consists of 28% of fiction texts and 72% of specialized texts. In 2017, the corpus was segmented, part-of-speech tagged and lemmatized inside the MREŽNIK project to be used for the development of the first Croatian corpus-based dictionary.
dc.language.iso hrv
dc.publisher Institute of Croatian Language and Linguistics
dc.relation.isreferencedby http://riznica.ihjj.hr/CLC-Slavicorp.pdf
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label PUB
dc.source.uri http://riznica.ihjj.hr
dc.subject reference corpus
dc.title Croatian language corpus Riznica 0.1
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor Croatian Science Foundation IP-2016-06-2141 MREŽNIK nationalFunds
sponsor Ministry of Science, Education and Sports 2120920 Hrvatska jezična riznica nationalFunds
size.info 101782863 tokens
size.info 85273724 words
size.info 4717985 sentences
size.info 14781 texts
files.count 1
files.size 479969047
featuredService.kontext search|https://www.clarin.si/kontext/first_form?corpname=riznica
featuredService.noske search|https://www.clarin.si/ske/#dashboard?corpname=riznica


 Files in this item

Icon
Name
Riznica.zip
Size
457.73 MB
Format
application/zip
Description
Corpus in vertical format
MD5
8024f2685e7f661f14b8d07e73c8390e
 Download file  Preview
 File Preview  
  • Riznica
    • riznica.vert1 GB
    • riznica.regi2 kB
    • README.txt167 B

Show simple item record