dc.contributor.author | Krsnik, Luka |
dc.contributor.author | Robnik-Šikonja, Marko |
dc.contributor.author | Šef, Tomaž |
dc.contributor.author | Krek, Simon |
dc.date.accessioned | 2018-05-08T01:59:30Z |
dc.date.available | 2018-05-08T01:59:30Z |
dc.date.issued | 2018-05-08 |
dc.identifier.uri | http://hdl.handle.net/11356/1186 |
dc.description | This lexicon is an extended version of Sloleks 1.2, http://hdl.handle.net/11356/1039. It contains all the original data from Sloleks, with added information about the stress of each word form. Stress is recorded in two ways: stress location only, and stress location together with stress type. Stress was assigned automatically by algorithms based on deep neural networks, which correctly predicted accent location for 91.5% of the test data and the combination of accent type and location for 88.5%, so not all accents are correct. This updated version 1.1 of the lexicon contains stress assignments produced by an improved algorithm, which reduces the error by about 1% compared with the previous version 1.0. |
dc.language.iso | slv |
dc.publisher | Faculty of Computer and Information Science, University of Ljubljana |
dc.publisher | Centre for Language Resources and Technologies, University of Ljubljana |
dc.relation.isreferencedby | http://videolectures.net/jota_krsnik_napovedovanje_naglasa/ |
dc.relation.isreferencedby | https://repozitorij.uni-lj.si/IzpisGradiva.php?id=98276 |
dc.relation.replaces | http://hdl.handle.net/11356/1156 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://gitea.cjvt.si/lkrsnik/stress_asignment |
dc.subject | word stress |
dc.title | Automatically stress labelled morphological lexicon Sloleks 1.2, version 1.1 |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | computationalLexicon |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Luka Krsnik krsnik.luka92@gmail.com Faculty of Computer and Information Science, University of Ljubljana |
size.info | 2774745 words |
size.info | 100805 entries |
files.count | 2 |
files.size | 58624194 |
Files in this item
Download all files in the item (55.91 MB)
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)





- Name: accented_sloleks2.xml.zip
- Size: 37.22 MB
- Format: application/zip
- Description: Sloleks with accented words in LMF XML format (PoS tags in Slovenian).
- MD5: 7c6b102647fb1328677c23ab9d2dacae

- Name: accented_sloleks.zip
- Size: 18.69 MB
- Format: application/zip
- Description: Sloleks with accented words in tabular format (PoS tags in Slovenian).
- MD5: 2ba78fc7395631541f6a323b634869cb
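
The MD5 values listed above can be used to check that a download completed without corruption. The following is a minimal sketch, not part of the resource itself: it assumes both archives were saved under their listed names in one directory, and the helper names `md5_of_file` and `verify_downloads` are hypothetical, chosen here for illustration.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "accented_sloleks2.xml.zip": "7c6b102647fb1328677c23ab9d2dacae",
    "accented_sloleks.zip": "2ba78fc7395631541f6a323b634869cb",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    # Assumption: the archives sit in the current working directory.
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

A file reported as missing or mismatched should be downloaded again before it is unpacked.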