
 
dc.contributor.author Krajnc Ivič, Mira
dc.contributor.author Mihailović, Larisa
dc.contributor.author Ivič, Dominik
dc.contributor.author Verdonik, Darinka
dc.date.accessioned 2025-11-25T10:15:15Z
dc.date.available 2025-11-25T10:15:15Z
dc.date.issued 2025-11-24
dc.identifier.uri http://hdl.handle.net/11356/2065
dc.description The KROHOT corpus consists of 10 audio recordings of private, spontaneous conversations between two or three speakers, with a total duration of 232 minutes. Most recordings were made between May and September 2025. The conversations include recollections about past events that triggered spontaneous humorous reactions among participants (conversational humour). Segments containing humour were manually annotated using a tagging scheme developed exclusively for this corpus. The scheme comprises five primary categories: vocabulary (lexical choice, including figurative use), relation (relationship between speakers), content (topical focus), attitude (speaker’s opinion toward the topic), and manner (purposefully humorous way of speaking). These categories are not mutually exclusive and can be combined. The corpus allows for the analysis of linguistic and communicative phenomena, including markers of humour and strategies used to achieve humorous effects (teasing, mocking, irony, or metaphorical language) in informal private spoken conversations. The corpus is available as WAV audio recordings, while the (aligned) transcriptions are given in the formats of the EXMARaLDA (https://exmaralda.org/en/) and Transcriber (https://trans.sourceforge.net/) tools, as well as in plain text.
dc.language.iso slv
dc.publisher Filozofska fakulteta, Univerza v Mariboru
dc.publisher Faculty of Electrical Engineering and Computer Science, University of Maribor
dc.relation.isreferencedby https://doi.org/10.18690/um.ff.4.2024.10
dc.rights Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.label PUB
dc.source.uri https://www.clarin.si/info/services/projects/#Corpus_of_conversational_humor_Krohot
dc.subject spoken corpus
dc.subject humour
dc.subject speech database
dc.subject speech recordings
dc.subject speech transcription
dc.subject free conversation
dc.title Corpus of conversational humor Krohot 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType audio
has.files yes
branding CLARIN.SI data & tools
contact.person Mira Krajnc Ivič mira.krajnc@um.si Filozofska fakulteta, Univerza v Mariboru
sponsor Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds
sponsor Public Agency for Scientific Research and Innovation of the Republic of Slovenia GC-0002 Large Language Models for Digital Humanities (LLM4DH) nationalFunds
size.info 232 minutes
size.info 10 texts
files.count 2
files.size 921527339


 Files in this item

 Download all files in item (878.84 MB)
Name README.md
Size 4.19 KB
Format Unknown
Description Description of the structure of the data
MD5 570bcacfc473b159f3312c8632b6a286

Name Krohot.zip
Size 878.83 MB
Format application/zip
Description Corpus and documentation
MD5 ce757716ad8457381f56159781cb881d
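
The MD5 values listed for the two files can be used to check a download for corruption. The following is a minimal sketch, assuming both files were saved under their listed names in a single local directory; the function names `md5_of` and `verify` are illustrative and not part of the record.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed in this item record.
EXPECTED_MD5 = {
    "README.md": "570bcacfc473b159f3312c8632b6a286",
    "Krohot.zip": "ce757716ad8457381f56159781cb881d",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory="."):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of(path) == expected
    return results


if __name__ == "__main__":
    # Assumption: the downloads are in the current working directory.
    for name, ok in verify().items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

Reading in chunks matters here mainly for Krohot.zip, which is close to 900 MB.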
 File Preview  
  • Krohot
    • DATA
      • WAV
        • Krohot-GSO-P0047.wav (101 MB)
        • Krohot-GSO-P0046.wav (129 MB)
        • Krohot-GSO-P0032.wav (101 MB)
        • Krohot-GSO-P0045.wav (145 MB)
        • Krohot-GSO-P0044.wav (127 MB)
        • Krohot-GSO-P0030.wav (151 MB)
        • Krohot-GSO-P0042.wav (101 MB)
        • Krohot-GSO-P0041.wav (101 MB)
        • Krohot-GSO-P0035.wav (105 MB)
        • Krohot-GSO-P0048.wav (109 MB)
      • EXS
        • Krohot-GSO-P0041_s.exs (1 MB)
        • Krohot-GSO-P0045_s.exs (1 MB)
        • Krohot-GSO-P0047_s.exs (1 MB)
        • Krohot-GSO-P0035_s.exs (1 MB)
        • Krohot-GSO-P0030_s.exs (1 MB)
        • Krohot-GSO-P0042_s.exs (1 MB)
        • Krohot-GSO-P0044_s.exs (1 MB)
        • Krohot-GSO-P0032_s.exs (928 kB)
        • Krohot-GSO-P0046_s.exs (1 MB)
        • Krohot-GSO-P0048_s.exs (856 kB)
      • Krohot.coma (53 kB)
      • TRS
        • Krohot-GSO-P0032-std.trs (76 kB)
        • Krohot-GSO-P0030-std.trs (87 kB)
        • Krohot-GSO-P0046-pog.trs (88 kB)
        • Krohot-GSO-P0044-pog.trs (83 kB)
        • Krohot-GSO-P0042-pog.trs (80 kB)
        • Krohot-GSO-P0047-std.trs (84 kB)
        • Krohot-GSO-P0041-pog.trs (72 kB)
        • Krohot-GSO-P0035-pog.trs (95 kB)
        • Krohot-GSO-P0044-std.trs (84 kB)
        • Krohot-GSO-P0042-std.trs (81 kB)
        • Krohot-GSO-P0030-pog.trs (87 kB)
        • Krohot-GSO-P0048-pog.trs (51 kB)
        • Krohot-GSO-P0047-pog.trs (84 kB)
        • Krohot-GSO-P0045-pog.trs (91 kB)
        • Krohot-GSO-P0048-std.trs (51 kB)
        • Krohot-GSO-P0046-std.trs (89 kB)
        • Krohot-GSO-P0045-std.trs (92 kB)
        • Krohot-GSO-P0032-pog.trs (73 kB)
        • Krohot-GSO-P0035-std.trs (97 kB)
        • Krohot-GSO-P0041-std.trs (74 kB)
      • TXT
        • Krohot-GSO-P0030-pog.txt (34 kB)
        • Krohot-GSO-P0032-pog.txt (24 kB)
        • Krohot-GSO-P0046-pog.txt (29 kB)
        • Krohot-GSO-P0048-pog.txt (22 kB)
        • Krohot-GSO-P0035-pog.txt (27 kB)
        • Krohot-GSO-P0044-pog.txt (27 kB)
        • Krohot-GSO-P0047-pog.txt (29 kB)
        • Krohot-GSO-P0042-pog.txt (29 kB)
        • Krohot-GSO-P0045-pog.txt (35 kB)
        • Krohot-GSO-P0041-pog.txt (26 kB)
      • EXB
        • Krohot-GSO-P0046.exb (190 kB)
        • Krohot-GSO-P0032.exb (169 kB)
        • Krohot-GSO-P0045.exb (220 kB)
        • Krohot-GSO-P0044.exb (197 kB)
        • Krohot-GSO-P0030.exb (212 kB)
        • Krohot-GSO-P0042.exb (193 kB)
        • Krohot-GSO-P0041.exb (165 kB)
        • Krohot-GSO-P0035.exb (209 kB)
        • Krohot-GSO-P0048.exb (140 kB)
        • Krohot-GSO-P0047.exb (192 kB)
    • PREBERIME.md (3 kB)
    • README.md (4 kB)
    • DOC
      • Krohot-DOC-en.pdf (79 kB)
      • Krohot-shema-sl.pdf (150 kB)
      • Krohot-meta-speeches.tsv (4 kB)
      • Krohot-meta-speakers.tsv (5 kB)
      • Krohot-schema-en.pdf (137 kB)
      • Krohot-DOC-sl.pdf (87 kB)
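
The file names in the DATA folder share a recording identifier (for example P0047) across the WAV, EXS, EXB, TRS and TXT files, so the files belonging to one recording can be collected by name alone. The sketch below is one possible way to do this, under the assumption that every per-recording file follows the pattern visible in the listing; the regular expression and the function names are illustrative. The record does not explain the `pog`, `std` and `s` suffixes, so the code only passes them through as an opaque variant label (the README.md inside the archive is the place to look for their meaning).

```python
import re
from collections import defaultdict

# Pattern inferred from the listing: Krohot-GSO-P<4 digits>[-variant|_variant].<ext>
NAME_PATTERN = re.compile(
    r"^Krohot-GSO-(?P<recording>P\d{4})"
    r"(?:[-_](?P<variant>[a-z]+))?"
    r"\.(?P<ext>[a-z]+)$"
)


def parse_name(filename):
    """Return (recording, variant, ext), or None if the name does not match."""
    match = NAME_PATTERN.match(filename)
    if match is None:
        return None
    return match.group("recording"), match.group("variant"), match.group("ext")


def group_by_recording(filenames):
    """Group (ext, variant) pairs by recording identifier, sorted within each group."""
    groups = defaultdict(list)
    for name in filenames:
        parsed = parse_name(name)
        if parsed is None:
            continue  # e.g. Krohot.coma, which is not tied to a single recording
        recording, variant, ext = parsed
        groups[recording].append((ext, variant))
    return {
        recording: sorted(items, key=lambda item: (item[0], item[1] or ""))
        for recording, items in groups.items()
    }


if __name__ == "__main__":
    sample = [
        "Krohot-GSO-P0047.wav",
        "Krohot-GSO-P0047_s.exs",
        "Krohot-GSO-P0047-std.trs",
        "Krohot-GSO-P0047-pog.trs",
        "Krohot-GSO-P0047-pog.txt",
        "Krohot-GSO-P0047.exb",
        "Krohot.coma",
    ]
    for recording, items in group_by_recording(sample).items():
        print(recording, items)
```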
