Show simple item record

 
dc.contributor.author Pahor de Maiti Tekavčič, Kristina
dc.contributor.author Ljubešić, Nikola
dc.contributor.author Fišer, Darja
dc.date.accessioned 2024-05-28T12:02:54Z
dc.date.available 2024-05-28T12:02:54Z
dc.date.issued 2024-05-27
dc.identifier.uri http://hdl.handle.net/11356/1947
dc.description The FRENK-fr dataset contains French socially unacceptable and acceptable comments posted in response to news articles that cover the topics of LGBT and migrants, and which were posted on Facebook by prominent French media outlets (20 minutes, Le Figaro and Le Monde). The original thread order of comments based on the time of publishing is preserved in the dataset. These comments were manually annotated for the type and target of socially unacceptable comments. The creation process, including data collection, filtering, annotation schema and annotation procedure, was adopted from the FRENK 1.1 dataset (http://hdl.handle.net/11356/1462), which makes FRENK-fr fully comparable to the datasets of Croatian, English and Slovenian comments included in the FRENK 1.1. Apart from manual annotation of the type and target of socially unacceptable discourse, the comments are accompanied with metadata, namely the topic of the news item (LGBT or migrants) that triggered the comment, the news item itself and the media outlet authoring it, an anonymised user ID, and information about the reply level in the thread. The dataset consists of 10,239 Facebook comments posted under 66 news items. It includes 3,071 comments that were labelled as socially unacceptable, and 7,168 that were labelled as socially acceptable.
dc.language.iso fra
dc.publisher Faculty of Arts, University of Ljubljana
dc.publisher Jožef Stefan Institute
dc.publisher Institute of Contemporary History
dc.rights CLARIN.SI Licence ACA ID-BY-NC-INF-NORED 1.0
dc.rights.uri https://clarin.si/repository/xmlui/page/licence-aca-id-by-nc-inf-nored-1.0
dc.rights.label ACA
dc.source.uri http://nl.ijs.si/frenk/
dc.subject offensive language
dc.subject hate speech
dc.subject news comments
dc.title Offensive language dataset of French comments FRENK-fr 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Kristina Pahor de Maiti Tekavčič kristina.pahordemaiti@ff.uni-lj.si Faculty of Arts, University of Ljubljana
sponsor ARRS (Slovenian Research Agency) J7-8280 FRENK: Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society nationalFunds
sponsor ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds
sponsor Slovenian Research Agency (ARRS) P6-0436 Digital humanities: resources, tools and methods nationalFunds
size.info 10239 texts
files.count 1
files.size 722671


 Files in this item

This item is
Academic Use
and licensed under:
CLARIN.SI Licence ACA ID-BY-NC-INF-NORED 1.0
Inform Before Use Attribution Required Noncommercial
Icon
Name
frenk-fr.zip
Size
705.73 KB
Format
application/zip
Description
dataset
MD5
3dab7aa19d7119556251339861c24171
 Download file

Show simple item record