Files in this item

 Download all files in item (696.06 KB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
readme.txt
Size
2.82 KB
Format
Text file
Description
readme file
MD5
d4b6fe03066bfdb7e48f146fe897d196
 Download file  Preview
 File Preview  
Title: Slovene sentiment lexicon JOB 1.0

Author: Jože Bučar, Faculty of Information Studies Novo mesto (contact: joze.bucar@gmail.com)

Abstract:
The JOB lexicon for sentiment analysis of Slovenian texts contains a list of 25524 headwords from the List of Slovenian headwords 1.1 (http://hdl.handle.net/11356/1038) extended with sentiment ratings based on the AFINN model with an integer between -5 (very negative) and +5 (very positive). The ratings are derived from the lemmatized version of the Manually sentiment annotated Slovenian (sentence-based) news corpus SentiNews 1.0 (http://hdl.handle.net/11356/1110).


Model:
The original sentence-level annotations were based on the five-level Lickert scale (integer between one (very negative) and five (very positive)). Therefore, we used linear transformation to transform average scores of sentences from the Lickert model to the AFINN (score one within the Lickert model transformed to minus five within the AFINN model, score five wit . . .
                                            
Icon
Name
Slovene_sentiment_lexicon_JOB.txt
Size
693.23 KB
Format
Text file
Description
sentiment lexicon
MD5
57bc2cc1c5b7349ec18cb256b2234c09
 Download file  Preview
 File Preview  
Word	AFINN	freq	avg_AFINN	sd_AFINN
a	0	3415	-0.466	1.787
aa	-1	57	-1.116	1.367
ab	-1	6	-1.277	2.189
aba	-1	5	-0.610	1.046
abančen	0	3	-0.443	1.443
abc	0	28	0.121	1.564
abe	-1	1	-0.860	0.000
abeceda	0	1	0.390	0.000
abeceden	2	3	1.640	2.500
abecednik	2	1	1.640	0.000
abjar	4	1	4.140	0.000
abonent	2	3	2.057	0.722
abonentski	2	3	1.779	0.241
abonma	0	8	0.182	2.294
abonmajski	0	1	0.390	0.000
abraham	0	4	0.390	1.768
absoluten	1	36	0.598	2.069
absolutist	-2	1	-2.110	0.000
absolutističen	0	1	0.390	0.000
absolvent	0	5	-0.339	2.215
absolventski	0	1	0.390	0.000
absorbirati	0	5	-0.235	1.250
absorpcijski	2	2	1.640	0.000
abstiniranje	3	2	2.890	0.000
absurd	-1	10	-1.402	1.608
absurden	-1	19	-1.215	1.395
absurdno	1	2	1.015	0.884
absurdnost	-2	1	-2.110	0.000
abs.	3	1	2.890	0.000
ac	0	18	0.390	2.134
act	0	1	0.390	0.000
ad	0	15	-0.304	1.914
adam	1	6	1.223	2.041
adaptacija	2	4	1.640	0.000
adaptirati	0	1	0.390	0.000
adenski	-2	1	-2.110	0.000
adidas	1	12	1.185	3.273
adida . . .