| dc.contributor.author | Čibej, Jaka |
| dc.contributor.author | Kos, Sara |
| dc.contributor.author | Kastelic, Maja |
| dc.contributor.author | Gabrovšek, Dejan |
| dc.contributor.author | Trojar, Mitja |
| dc.contributor.author | Ježovnik, Janoš |
| dc.contributor.author | Bizjak Končar, Aleksandra |
| dc.contributor.author | Krvina, Domen |
| dc.contributor.author | Petric Žižić, Špela |
| dc.contributor.author | Divjak Race, Duša |
| dc.contributor.author | Vreš, Domen |
| dc.date.accessioned | 2026-06-01T11:01:49Z |
| dc.date.available | 2026-06-01T11:01:49Z |
| dc.date.issued | 2026-06-01 |
| dc.identifier.uri | http://hdl.handle.net/11356/2218 |
| dc.description | GaMS-Instruct-SAFE is a an instruction-following safety dataset designed to fine-tune Slovene large language models to provide safe responses (i.e. to train them to refuse responding to prompts that could lead to physical, economic or psychological harm). It consists of pairs of prompts and responses with various safety topics (e.g. sexual harassment, terrorism, violent crime, drugs). The prompts were written by human annotators using LabelStudio (Tkachenko et al. 2025) based on provided set of criteria (such as topic, expected prompt length, language standardness, different jailbreak strategies) to make the dataset as varied as possible (see Čibej 2024 for more details). In version 0.5, the responses to the prompts were generated using GaMS-27B-Instruct-Nemotron (https://huggingface.co/cjvt/GaMS-27B-Instruct-Nemotron). Only prompt-response pairs in which the model refused to cooperate were included. More responses will be added in future versions. The annotations for this dataset were created using Label Studio, open-source data labeling software developed by Heartex (Tkachenko et al. 2025). References: Čibej, Jaka, 2024: First steps toward the compilation of a safety dataset for Slovene large language models. Jezikovne tehnologije in digitalna humanistika. https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=slv&id=164271 Tkachenko, Maxim, Mikhail Malyuk, Andrey Holmanyuk, Nikolai Liubimov, 2025: Label Studio: Data labeling software. https://github.com/HumanSignal/label-studio |
| dc.language.iso | slv |
| dc.publisher | Faculty of Computer and Information Science, University of Ljubljana |
| dc.relation.isreferencedby | https://zenodo.org/records/13912515 |
| dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | http://cjvt.si/povejmo |
| dc.subject | instruction following dataset |
| dc.subject | large language models |
| dc.subject | safety |
| dc.title | Slovene instruction-following safety dataset for large language models GaMS-Instruct-SAFE 0.5 |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Jaka Čibej jaka.cibej@ff.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana |
| sponsor | ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds |
| sponsor | ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
| size.info | 501 units |
| files.count | 1 |
| files.size | 154155 |
Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- GaMS-Instruct-SAFE_0.5.zip
- Size
- 150.54 KB
- Format
- application/zip
- Description
- GaMS-Instruct-SAFE 0.5 (ZIP)
- MD5
- 0743ffcbccd3dd55984b23e3076689ed
- GaMS-Instruct-SAFE_0.5
- GaMS-Instruct-SAFE_0.5.json614 kB
- 00README.txt5 kB