
 
dc.contributor.author Čibej, Jaka
dc.contributor.author Kos, Sara
dc.contributor.author Kastelic, Maja
dc.contributor.author Gabrovšek, Dejan
dc.contributor.author Trojar, Mitja
dc.contributor.author Ježovnik, Janoš
dc.contributor.author Bizjak Končar, Aleksandra
dc.contributor.author Krvina, Domen
dc.contributor.author Petric Žižić, Špela
dc.contributor.author Divjak Race, Duša
dc.contributor.author Vreš, Domen
dc.date.accessioned 2026-06-01T11:01:49Z
dc.date.available 2026-06-01T11:01:49Z
dc.date.issued 2026-06-01
dc.identifier.uri http://hdl.handle.net/11356/2218
dc.description GaMS-Instruct-SAFE is an instruction-following safety dataset designed to fine-tune Slovene large language models to provide safe responses (i.e., to train them to refuse to respond to prompts that could lead to physical, economic, or psychological harm). It consists of prompt-response pairs covering various safety topics (e.g., sexual harassment, terrorism, violent crime, drugs). The prompts were written by human annotators in Label Studio (Tkachenko et al. 2025) based on a provided set of criteria (such as topic, expected prompt length, language standardness, and different jailbreak strategies) to make the dataset as varied as possible (see Čibej 2024 for more details). In version 0.5, the responses to the prompts were generated using GaMS-27B-Instruct-Nemotron (https://huggingface.co/cjvt/GaMS-27B-Instruct-Nemotron). Only prompt-response pairs in which the model refused to cooperate were included. More responses will be added in future versions. The annotations for this dataset were created using Label Studio, open-source data labeling software developed by Heartex (Tkachenko et al. 2025). References: Čibej, Jaka, 2024: First steps toward the compilation of a safety dataset for Slovene large language models. Jezikovne tehnologije in digitalna humanistika. https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=slv&id=164271 Tkachenko, Maxim, Mikhail Malyuk, Andrey Holmanyuk, Nikolai Liubimov, 2025: Label Studio: Data labeling software. https://github.com/HumanSignal/label-studio
dc.language.iso slv
dc.publisher Faculty of Computer and Information Science, University of Ljubljana
dc.relation.isreferencedby https://zenodo.org/records/13912515
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri http://cjvt.si/povejmo
dc.subject instruction following dataset
dc.subject large language models
dc.subject safety
dc.title Slovene instruction-following safety dataset for large language models GaMS-Instruct-SAFE 0.5
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Jaka Čibej jaka.cibej@ff.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana
sponsor ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
size.info 501 units
files.count 1
files.size 154155


Files in this item

Name: GaMS-Instruct-SAFE_0.5.zip
Size: 150.54 KB
Format: application/zip
Description: GaMS-Instruct-SAFE 0.5 (ZIP)
MD5: 0743ffcbccd3dd55984b23e3076689ed
