dc.contributor.author | Markov, Ilia |
dc.contributor.author | Hilte, Lisa |
dc.contributor.author | Ljubešić, Nikola |
dc.contributor.author | Fišer, Darja |
dc.contributor.author | Daelemans, Walter |
dc.date.accessioned | 2022-08-27T16:49:28Z |
dc.date.available | 2022-08-27T16:49:28Z |
dc.date.issued | 2022-08-24 |
dc.identifier.uri | http://hdl.handle.net/11356/1483 |
dc.description | The LiLaH-HAG dataset (HAG is short for hate-age-gender) consists of metadata on Facebook comments on Facebook posts of mainstream media in Great Britain, Flanders, Slovenia and Croatia. The metadata available in the dataset are the hatefulness of the comment (0 is acceptable, 1 is hateful), the age of the commenter (0-25, 26-35, 36-65, 65-), the gender of the commenter (M or F), and the language in which the comment was written (EN, NL, SL, HR). The hatefulness of each comment was assigned by multiple well-trained annotators, who read the comments in their order of appearance in a discussion thread, while the age and gender variables were estimated from the Facebook profile of the specific user by a single annotator. |
dc.language.iso | eng |
dc.language.iso | nld |
dc.language.iso | slv |
dc.language.iso | hrv |
dc.publisher | Jožef Stefan Institute |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://lilah.eu |
dc.subject | social media |
dc.subject | hate speech |
dc.subject | demographic variables |
dc.title | Facebook metadata dataset LiLaH-HAG |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute |
sponsor | ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds |
sponsor | FWO (Research Foundation - Flanders) G070619N LiLaH: Linguistic Landscape of Hate Speech nationalFunds |
sponsor | ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
size.info | 10356 entries |
files.count | 1 |
files.size | 131303 |
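The coded values listed in the description (hatefulness 0/1, gender M/F, language EN/NL/SL/HR) lend themselves to a quick sanity check after download. The sketch below is only an illustration: the column names `hate`, `gender` and `language` are hypothetical, since this record does not document the header of the TSV file, and the sample rows are invented for the demonstration.

```python
import csv
import io

# Allowed codes, taken from the dataset description.
# The column names (dictionary keys) are ASSUMPTIONS, not documented in this record.
ALLOWED = {
    "hate": {"0", "1"},                     # 0 = acceptable, 1 = hateful
    "gender": {"M", "F"},
    "language": {"EN", "NL", "SL", "HR"},
}


def find_invalid_rows(tsv_text, allowed=ALLOWED):
    """Return (row_number, column, value) for every value outside its allowed set."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    problems = []
    for row_number, row in enumerate(reader, start=1):
        for column, codes in allowed.items():
            value = row.get(column)  # None if the column is missing
            if value not in codes:
                problems.append((row_number, column, value))
    return problems


# Invented two-row sample; the second row carries an invalid gender code.
sample = "hate\tgender\tlanguage\n1\tF\tSL\n0\tX\tEN\n"
print(find_invalid_rows(sample))
```

To apply the check to the real file, read its contents into a string and adjust the keys of `ALLOWED` to the actual column headers.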
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

- Name: LiLaH-HAG.tsv
- Size: 128.23 KB
- Format: Unknown
- Description: TSV file
- MD5: 9a2c5893aaa7d22403a19c414ac4098c