Show simple item record

 
dc.contributor.author Dobrovoljc, Kaja
dc.contributor.author Krek, Simon
dc.contributor.author Holozan, Peter
dc.contributor.author Erjavec, Tomaž
dc.contributor.author Romih, Miro
dc.date.accessioned 2015-06-14T07:49:46Z
dc.date.available 2015-06-14T07:49:46Z
dc.date.issued 2015-06-14
dc.identifier.uri http://hdl.handle.net/11356/1039
dc.description Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains approx. 100.000 most frequent Slovenian lemmas, their inflected or derivative word forms and the corresponding grammatical description. Lemmatization rules, part-of-speech categorization and the set of feature-value pairs follow the JOS morphosyntactic specifications. In addition to grammatical information, each word form is also given the information on its absolute corpus frequency and its compliance with the reference language standard. Note that this entry updates Sloleks 1.0 by fixing various encoding and content errors. The resource is further described in: Kaja Dobrovoljc, Simon Krek and Tomaž Erjavec, 2017: The Sloleks Morphological Lexicon and its Future Development. In (Vojko Gorjanc, Polona Gantar, Iztok Kosem and Simon Krek, eds.): Dictionary of Modern Slovene: Problems and Solutions. Ljubljana University Press, Faculty of Arts. https://e-knjige.ff.uni-lj.si/znanstvena-zalozba/catalog/download/2/1/47-1
dc.language.iso slv
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.relation.isreferencedby https://e-knjige.ff.uni-lj.si/znanstvena-zalozba/catalog/download/2/1/47-1?inline=1
dc.relation.replaces http://hdl.handle.net/11356/1033
dc.relation.isreplacedby http://hdl.handle.net/11356/1230
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label PUB
dc.source.uri http://eng.slovenscina.eu/sloleks/opis
dc.subject morphology
dc.subject inflection
dc.subject word forms
dc.subject derivation
dc.subject LMF
dc.subject lemmatisation
dc.title Morphological lexicon Sloleks 1.2
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType lexicon
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN.SI data & tools
demo.uri http://eng.slovenscina.eu/sloleks
contact.person Kaja Dobrovoljc kaja.dobrovoljc@trojina.si Trojina, Institute for Applied Slovene Studies
sponsor Ministry of Education, Science and Sport 3311-08-986003 Communication in Slovene Other
size.info 100805 entries
size.info 2774745 words
files.count 5
files.size 83642042


 Files in this item

 Download all files in item (79.77 MB)
Icon
Name
Sloleks_v1.2.zip
Size
26.96 MB
Format
application/zip
Description
Sloleks in LMF XML format, PoS tags in Slovenian.
MD5
3b15dc1a094e3ff2f4f1ef69702e6f41
 Download file  Preview
 File Preview  
    • Sloleks_v1.2.xml1 GB
Icon
Name
DTD_LMF_REV_16.zip
Size
1.67 KB
Format
application/zip
Description
Document Type Definition for Sloleks in LMF XML format.
MD5
1f68826b88d476f6ebd46dc17e0f3e05
 Download file  Preview
 File Preview  
    • DTD_LMF_REV_16.dtd7 kB
Icon
Name
sloleks-sl.tbl_v1.2.zip
Size
12.65 MB
Format
application/zip
Description
Sloleks in tabular format, PoS tags in Slovenian.
MD5
0c2886e88558df5a6b9cdde4c7e20fb3
 Download file  Preview
 File Preview  
    • sloleks-sl_v1.2.tbl99 MB
Icon
Name
sloleks-en.tbl_v1.2.zip
Size
12.74 MB
Format
application/zip
Description
Sloleks in tabular format, PoS tags in English.
MD5
1c2594b2bc7de40e0b73e18c7646bd6d
 Download file  Preview
 File Preview  
    • sloleks-en_v1.2.tbl101 MB
Icon
Name
sloleksUD-en_v1.2.zip
Size
27.42 MB
Format
application/zip
Description
Sloleks in tabular format, PoS tags in English with added Universal Dependencies morphosyntactic features.
MD5
a6829126a4ed99d95a6fed46d9a2495f
 Download file  Preview
 File Preview  
    • sloleksUD-en_v1.2.tbl435 MB

Show simple item record