<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href='static/style.xsl' type='text/xsl'?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-04T11:48:52Z</responseDate><request verb="GetRecord" identifier="oai:www.clarin.si:11356/2073" metadataPrefix="oai_dc">http://www.clarin.si/repository/oai/request</request><GetRecord><record><header><identifier>oai:www.clarin.si:11356/2073</identifier><datestamp>2025-12-04T10:14:03Z</datestamp><setSpec>hdl_11356_1023</setSpec><setSpec>hdl_11356_1024</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:title>Corpus of spoken Slovenian ROG-Dialog 1.0</dc:title>
<dc:creator>Verdonik, Darinka</dc:creator>
<dc:creator>Rupnik, Peter</dc:creator>
<dc:creator>Vidinić, Jasna</dc:creator>
<dc:creator>Ljubešić, Nikola</dc:creator>
<dc:subject>speech transcription</dc:subject>
<dc:subject>speech recordings</dc:subject>
<dc:subject>dialogue act</dc:subject>
<dc:subject>spoken corpus</dc:subject>
<dc:subject>spoken language</dc:subject>
<dc:subject>sentiment classification</dc:subject>
<dc:description>Corpus of spoken Slovenian ROG-Dialog consists of volunteered audio, recorded by students by asking their relatives or acquaintances to talk on record in their homes. The speakers were directed to use various styles of dialogue, including instructions, interviews, discussions, story telling, and chatting. Dialogue themes were freely chosen, most prevalent themes include travelling, health, childhood memories, work, technology, food, and entertainment. &#xd;
&#xd;
Recordings and metadata were uploaded to the Govorjena Slovenščina web portal (https://govorjena-slovenscina.um.si/), manually segmented and transcribed in both colloquial and standardized orthographic transcriptions, and annotated with dialogue acts and sentiment. &#xd;
&#xd;
The 25 speakers in this corpus cover all statistical regions of Slovenia with their ages ranging from 21 to 82 years. The corpus includes speakers from both rural and urban areas. Reflecting this geographic and social diversity, speech samples range from standard colloquial registers to local dialects, with some speakers employing distinct regional varieties.&#xd;
&#xd;
ROG-Dialog is distributed as:&#xd;
- EXMARaLDA format (.EXB files)  for viewing with Partitur Editor (https://www.exmaralda.org/)&#xd;
- .EXS files and Rog-Art.coma file for searching through the annotated corpus in the EXMARaLDA EXAKT concordancer (https://www.exmaralda.org/)&#xd;
- .TRS files for viewing the transcriptions without annotations with Transcriber (https://trans.sourceforge.net/en/presentation.php)&#xd;
- .TXT plain-text files&#xd;
&#xd;
ROG-dialog data were compiled to complement the ROG-Artur subcorpus of the ROG 1.0 training corpus of spoken Slovenian (http://hdl.handle.net/11356/1992). However, the two corpora differ in their annotation levels, and harmonising these remains a task for future merging.</dc:description>
<dc:date>2025-12-02</dc:date>
<dc:type>corpus</dc:type>
<dc:identifier>http://hdl.handle.net/11356/2073</dc:identifier>
<dc:language>slv</dc:language>
<dc:rights>Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)</dc:rights>
<dc:rights>PUB</dc:rights>
<dc:rights>https://creativecommons.org/licenses/by-sa/4.0/</dc:rights>
<dc:format>application/zip</dc:format>
<dc:format>application/zip</dc:format>
<dc:format>text/plain; charset=utf-8</dc:format>
<dc:format>downloadable_files_count: 2</dc:format>
<dc:publisher>Faculty of Electrical Engineering and Computer Science, University of Maribor</dc:publisher>
<dc:publisher>Jožef Stefan Institute</dc:publisher>
<dc:source>https://govorjena-slovenscina.um.si/</dc:source>
</oai_dc:dc>
</metadata></record></GetRecord></OAI-PMH>