Files in this item

 Download all files in item (696.06 KB)
This item is publicly available and licensed under Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0).
Name: readme.txt
Size: 2.82 KB
Format: Text file
Description: readme file
MD5: d4b6fe03066bfdb7e48f146fe897d196
 File Preview  
Title: Slovene sentiment lexicon JOB 1.0
Author: Jože Bučar, Faculty of Information Studies Novo mesto (contact: joze.bucar@gmail.com)
Abstract: The JOB lexicon for sentiment analysis of Slovenian texts contains 25524 headwords from the List of Slovenian headwords 1.1 (http://hdl.handle.net/11356/1038), each extended with a sentiment rating based on the AFINN model: an integer between -5 (very negative) and +5 (very positive). The ratings are derived from the lemmatized version of the Manually sentiment annotated Slovenian (sentence-based) news corpus SentiNews 1.0 (http://hdl.handle.net/11356/1110).
Model: The original sentence-level annotations were based on the five-level Likert scale (an integer between one (very negative) and five (very positive)). Therefore, we used a linear transformation to map the average sentence scores from the Likert model to the AFINN model (score one within the Likert model transformed to minus five within the AFINN model, score five wit . . .
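The linear transformation described in the readme can be sketched as follows. This is a minimal illustration, not the author's code: the preview is truncated before the second anchor point, so the sketch assumes that Likert score five maps to AFINN plus five (matching the stated AFINN range), and the function name `likert_to_afinn` is hypothetical. The published avg_AFINN values may include further processing that this plain mapping does not reproduce.

```python
def likert_to_afinn(score: float) -> float:
    """Map a (possibly averaged) Likert score in [1, 5] linearly onto [-5, 5].

    Assumed anchor points: 1 -> -5 (stated in the readme) and 5 -> +5
    (an assumption; the readme preview is cut off at that point).
    """
    if not 1.0 <= score <= 5.0:
        raise ValueError(f"Likert score out of range: {score}")
    # The line through (1, -5) and (5, +5) has slope 10 / 4 = 2.5
    # and crosses zero at the Likert midpoint 3.
    return 2.5 * (score - 3.0)


if __name__ == "__main__":
    for likert in (1, 2, 3, 4, 5):
        print(likert, likert_to_afinn(likert))
```

Under this assumption the neutral Likert score three lands on zero, and each Likert step corresponds to 2.5 AFINN points.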
Name: Slovene_sentiment_lexicon_JOB.txt
Size: 693.23 KB
Format: Text file
Description: sentiment lexicon
MD5: 57bc2cc1c5b7349ec18cb256b2234c09
 File Preview  
Word AFINN freq avg_AFINN sd_AFINN
a 0 3415 -0.466 1.787
aa -1 57 -1.116 1.367
ab -1 6 -1.277 2.189
aba -1 5 -0.610 1.046
abančen 0 3 -0.443 1.443
abc 0 28 0.121 1.564
abe -1 1 -0.860 0.000
abeceda 0 1 0.390 0.000
abeceden 2 3 1.640 2.500
abecednik 2 1 1.640 0.000
abjar 4 1 4.140 0.000
abonent 2 3 2.057 0.722
abonentski 2 3 1.779 0.241
abonma 0 8 0.182 2.294
abonmajski 0 1 0.390 0.000
abraham 0 4 0.390 1.768
absoluten 1 36 0.598 2.069
absolutist -2 1 -2.110 0.000
absolutističen 0 1 0.390 0.000
absolvent 0 5 -0.339 2.215
absolventski 0 1 0.390 0.000
absorbirati 0 5 -0.235 1.250
absorpcijski 2 2 1.640 0.000
abstiniranje 3 2 2.890 0.000
absurd -1 10 -1.402 1.608
absurden -1 19 -1.215 1.395
absurdno 1 2 1.015 0.884
absurdnost -2 1 -2.110 0.000
abs. 3 1 2.890 0.000
ac 0 18 0.390 2.134
act 0 1 0.390 0.000
ad 0 15 -0.304 1.914
adam 1 6 1.223 2.041
adaptacija 2 4 1.640 0.000
adaptirati 0 1 0.390 0.000
adenski -2 1 -2.110 0.000
adidas 1 12 1.185 3.273
adida . . .
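A lexicon in this layout could be loaded and applied roughly as shown below. This is a sketch under assumptions: it treats the file as whitespace-separated with one header line (the actual delimiter is not visible in the preview), the names `Entry`, `parse_lexicon` and `score_lemmas` are hypothetical, and summing the integer AFINN ratings of known lemmas is a common AFINN-style scoring scheme rather than a method prescribed by this item. The sample rows are copied from the preview.

```python
from typing import Dict, Iterable, NamedTuple


class Entry(NamedTuple):
    word: str
    afinn: int        # integer rating, -5 .. +5
    freq: int         # value of the freq column
    avg_afinn: float  # value of the avg_AFINN column
    sd_afinn: float   # value of the sd_AFINN column


def parse_lexicon(text: str) -> Dict[str, Entry]:
    """Parse lexicon text with a header line and five columns per row.

    Assumes whitespace-separated columns; rows with a different number
    of fields (e.g. a truncated last line) are skipped.
    """
    entries: Dict[str, Entry] = {}
    for line in text.strip().splitlines()[1:]:  # skip the header row
        parts = line.split()
        if len(parts) != 5:
            continue
        word, afinn, freq, avg, sd = parts
        entries[word] = Entry(word, int(afinn), int(freq), float(avg), float(sd))
    return entries


def score_lemmas(lemmas: Iterable[str], lexicon: Dict[str, Entry]) -> int:
    """Sum the integer ratings of lemmas found in the lexicon; unknown lemmas count as 0."""
    return sum(lexicon[lemma].afinn for lemma in lemmas if lemma in lexicon)


# A few rows copied from the file preview.
SAMPLE = """Word AFINN freq avg_AFINN sd_AFINN
abeceda 0 1 0.390 0.000
abjar 4 1 4.140 0.000
absurd -1 10 -1.402 1.608
absurdnost -2 1 -2.110 0.000
"""

lexicon = parse_lexicon(SAMPLE)

if __name__ == "__main__":
    print(score_lemmas(["absurd", "absurdnost", "abeceda"], lexicon))
```

Because the ratings are attached to headwords, input text would need to be lemmatized before lookup. In the preview rows, the AFINN column is consistent with avg_AFINN rounded to the nearest integer, although the preview does not state the rounding rule.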