Files in this item
Download all files in item (696.06 KB)
This item is Publicly Available and licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Name: readme.txt
Size: 2.82 KB
Format: Text file
Description: readme file
MD5: d4b6fe03066bfdb7e48f146fe897d196
Title: Slovene sentiment lexicon JOB 1.0
Author: Jože Bučar, Faculty of Information Studies Novo mesto (contact: joze.bucar@gmail.com)
Abstract:
The JOB lexicon for sentiment analysis of Slovenian texts contains a list of 25524 headwords from the List of Slovenian headwords 1.1 (http://hdl.handle.net/11356/1038) extended with sentiment ratings based on the AFINN model with an integer between -5 (very negative) and +5 (very positive). The ratings are derived from the lemmatized version of the Manually sentiment annotated Slovenian (sentence-based) news corpus SentiNews 1.0 (http://hdl.handle.net/11356/1110).
Model:
The original sentence-level annotations were based on a five-level Likert scale (an integer between one (very negative) and five (very positive)). Therefore, we used a linear transformation to map the average sentence scores from the Likert scale to the AFINN scale (score one within the Likert model transformed to minus five within the AFINN model, score five wit . . .
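The linear mapping described in the Model paragraph can be sketched as a short function. This is only an illustration, not the author's code: the name `likert_to_afinn` is hypothetical, and the upper endpoint (Likert five mapping to AFINN plus five) is an assumption taken from the stated AFINN range of -5 to +5, because the readme preview is cut off at that point. Any further processing described in the truncated part of the readme is not reproduced here.

```python
def likert_to_afinn(score: float) -> float:
    """Linearly map a Likert score in [1, 5] to the AFINN range [-5, 5].

    Assumption: Likert 1 -> AFINN -5 (stated in the readme) and
    Likert 5 -> AFINN +5 (inferred from the AFINN range; the readme
    preview is truncated before it says so).
    """
    if not 1.0 <= score <= 5.0:
        raise ValueError(f"Likert score out of range: {score}")
    # Straight line through (1, -5) and (5, +5): slope 2.5, Likert 3 -> 0.
    return 2.5 * (score - 3.0)


if __name__ == "__main__":
    for likert in (1, 2, 3, 4, 5):
        print(likert, likert_to_afinn(likert))
```

Under this assumption the neutral Likert score of three lands on zero, and each Likert step corresponds to 2.5 AFINN points.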
Name: Slovene_sentiment_lexicon_JOB.txt
Size: 693.23 KB
Format: Text file
Description: sentiment lexicon
MD5: 57bc2cc1c5b7349ec18cb256b2234c09
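The MD5 values listed for the two files can be used to check a download. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `verify` are hypothetical, and the local file paths passed to them are assumed to be wherever the files were saved.

```python
import hashlib

# Checksums as listed on the item page.
EXPECTED_MD5 = {
    "readme.txt": "d4b6fe03066bfdb7e48f146fe897d196",
    "Slovene_sentiment_lexicon_JOB.txt": "57bc2cc1c5b7349ec18cb256b2234c09",
}


def md5_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected_md5: str) -> bool:
    """Compare a file's MD5 digest with the expected value."""
    return md5_of_file(path) == expected_md5.lower()
```

For example, `verify("Slovene_sentiment_lexicon_JOB.txt", EXPECTED_MD5["Slovene_sentiment_lexicon_JOB.txt"])` should return `True` for an intact copy saved in the current directory.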
Word AFINN freq avg_AFINN sd_AFINN
a 0 3415 -0.466 1.787
aa -1 57 -1.116 1.367
ab -1 6 -1.277 2.189
aba -1 5 -0.610 1.046
abančen 0 3 -0.443 1.443
abc 0 28 0.121 1.564
abe -1 1 -0.860 0.000
abeceda 0 1 0.390 0.000
abeceden 2 3 1.640 2.500
abecednik 2 1 1.640 0.000
abjar 4 1 4.140 0.000
abonent 2 3 2.057 0.722
abonentski 2 3 1.779 0.241
abonma 0 8 0.182 2.294
abonmajski 0 1 0.390 0.000
abraham 0 4 0.390 1.768
absoluten 1 36 0.598 2.069
absolutist -2 1 -2.110 0.000
absolutističen 0 1 0.390 0.000
absolvent 0 5 -0.339 2.215
absolventski 0 1 0.390 0.000
absorbirati 0 5 -0.235 1.250
absorpcijski 2 2 1.640 0.000
abstiniranje 3 2 2.890 0.000
absurd -1 10 -1.402 1.608
absurden -1 19 -1.215 1.395
absurdno 1 2 1.015 0.884
absurdnost -2 1 -2.110 0.000
abs. 3 1 2.890 0.000
ac 0 18 0.390 2.134
act 0 1 0.390 0.000
ad 0 15 -0.304 1.914
adam 1 6 1.223 2.041
adaptacija 2 4 1.640 0.000
adaptirati 0 1 0.390 0.000
adenski -2 1 -2.110 0.000
adidas 1 12 1.185 3.273
adida . . .
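The preview above suggests one entry per line with five whitespace-separated columns (Word, AFINN, freq, avg_AFINN, sd_AFINN). A minimal parsing sketch under that assumption follows; the names `LexiconEntry` and `parse_lexicon` are hypothetical, and the actual delimiter and encoding of the file (assumed here to be whitespace and UTF-8) should be checked against the full readme.

```python
from typing import Dict, Iterable, NamedTuple


class LexiconEntry(NamedTuple):
    afinn: int        # integer rating between -5 and +5
    freq: int         # frequency count from the freq column
    avg_afinn: float  # value of the avg_AFINN column
    sd_afinn: float   # value of the sd_AFINN column


def parse_lexicon(lines: Iterable[str]) -> Dict[str, LexiconEntry]:
    """Parse lexicon lines into a dict keyed by headword.

    Lines that do not have four numeric trailing fields (the header,
    blank lines, truncated lines) are skipped.
    """
    entries: Dict[str, LexiconEntry] = {}
    for raw in lines:
        # Split from the right so the headword keeps any internal spaces.
        parts = raw.strip().rsplit(None, 4)
        if len(parts) != 5:
            continue
        word, afinn, freq, avg, sd = parts
        try:
            entries[word] = LexiconEntry(int(afinn), int(freq), float(avg), float(sd))
        except ValueError:
            continue  # header row or malformed line
    return entries


if __name__ == "__main__":
    sample = [
        "Word AFINN freq avg_AFINN sd_AFINN",
        "a 0 3415 -0.466 1.787",
        "absurd -1 10 -1.402 1.608",
        "abjar 4 1 4.140 0.000",
    ]
    lexicon = parse_lexicon(sample)
    print(lexicon["absurd"].afinn)
```

To load the real file, the same function can be fed an open file object, for instance `parse_lexicon(open("Slovene_sentiment_lexicon_JOB.txt", encoding="utf-8"))`, assuming the file is in the current directory.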