Show simple item record

 
dc.contributor.author Ljubešić, Nikola
dc.date.accessioned 2019-04-02T09:29:36Z
dc.date.available 2019-04-02T09:29:36Z
dc.date.issued 2019-03-31
dc.identifier.uri http://hdl.handle.net/11356/1232
dc.description hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency, per-million frequency) 8-tuple. The (wordform, lemma, MSD) triple frequencies are calculated on the hrWaC v2.2 corpus. The MSD tagset follows the MULTEXT-East V6 tagset for the Serbo-Croatian macro-language available at http://nl.ijs.si/ME/V6/msd/html/msd-hbs.html. The UPOS + morphological features follow the UD v2 specifications available at http://universaldependencies.org/guidelines.html.
dc.language.iso hrv
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://www.lrec-conf.org/proceedings/lrec2016/summaries/340.html
dc.relation.replaces http://hdl.handle.net/11356/1072
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.subject morphology
dc.subject inflection
dc.title Inflectional lexicon hrLex 1.3
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType computationalLexicon
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
size.info 6427709 items
size.info 164206 entries
files.count 1
files.size 54477922


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
hrLex_v1.3.gz
Size
51.95 MB
Format
application/gzip
Description
Lexicon in tab-separated format
MD5
e55a21f10bbb4f6c22afe31a65803649
 Download file

Show simple item record