dc.contributor.author |
Pahor de Maiti Tekavčič, Kristina |
dc.contributor.author |
Ljubešić, Nikola |
dc.contributor.author |
Fišer, Darja |
dc.date.accessioned |
2024-05-28T12:02:54Z |
dc.date.available |
2024-05-28T12:02:54Z |
dc.date.issued |
2024-05-27 |
dc.identifier.uri |
http://hdl.handle.net/11356/1947 |
dc.description |
The FRENK-fr dataset contains French socially unacceptable and acceptable comments posted in response to news articles that cover the topics of LGBT and migrants, and which were posted on Facebook by prominent French media outlets (20 minutes, Le Figaro and Le Monde). The original thread order of comments based on the time of publishing is preserved in the dataset.
These comments were manually annotated for the type and target of socially unacceptable comments. The creation process, including data collection, filtering, annotation schema and annotation procedure, was adopted from the FRENK 1.1 dataset (http://hdl.handle.net/11356/1462), which makes FRENK-fr fully comparable to the datasets of Croatian, English and Slovenian comments included in the FRENK 1.1.
Apart from manual annotation of the type and target of socially unacceptable discourse, the comments are accompanied with metadata, namely the topic of the news item (LGBT or migrants) that triggered the comment, the news item itself and the media outlet authoring it, an anonymised user ID, and information about the reply level in the thread.
The dataset consists of 10,239 Facebook comments posted under 66 news items. It includes 3,071 comments that were labelled as socially unacceptable, and 7,168 that were labelled as socially acceptable. |
dc.language.iso |
fra |
dc.publisher |
Faculty of Arts, University of Ljubljana |
dc.publisher |
Jožef Stefan Institute |
dc.publisher |
Institute of Contemporary History |
dc.rights |
CLARIN.SI Licence ACA ID-BY-NC-INF-NORED 1.0 |
dc.rights.uri |
https://clarin.si/repository/xmlui/page/licence-aca-id-by-nc-inf-nored-1.0 |
dc.rights.label |
ACA |
dc.source.uri |
http://nl.ijs.si/frenk/ |
dc.subject |
offensive language |
dc.subject |
hate speech |
dc.subject |
news comments |
dc.title |
Offensive language dataset of French comments FRENK-fr 1.0 |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
has.files |
yes |
branding |
CLARIN.SI data & tools |
contact.person |
Kristina Pahor de Maiti Tekavčič kristina.pahordemaiti@ff.uni-lj.si Faculty of Arts, University of Ljubljana |
sponsor |
ARRS (Slovenian Research Agency) J7-8280 FRENK: Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds |
sponsor |
Slovenian Research Agency (ARRS) P6-0436 Digital humanities: resources, tools and methods nationalFunds |
size.info |
10239 texts |
files.count |
1 |
files.size |
722671 |