| dc.contributor.author | Krajnc Ivič, Mira |
| dc.contributor.author | Mihailović, Larisa |
| dc.contributor.author | Ivič, Dominik |
| dc.contributor.author | Verdonik, Darinka |
| dc.date.accessioned | 2025-11-25T10:15:15Z |
| dc.date.available | 2025-11-25T10:15:15Z |
| dc.date.issued | 2025-11-24 |
| dc.identifier.uri | http://hdl.handle.net/11356/2065 |
| dc.description | The KROHOT corpus consists of 10 audio recordings of private, spontaneous conversations between two or three speakers, with a total duration of 232 minutes. Most recordings were made between May and September 2025. The conversations include recollections about past events that triggered spontaneous humorous reactions among participants (conversational humour). Segments containing humour were manually annotated using a tagging scheme developed exclusively for this corpus. The scheme comprises five primary categories: vocabulary (lexical choice, including figurative use), relation (relationship between speakers), content (topical focus), attitude (speaker’s opinion toward the topic), and manner (purposefully humorous way of speaking). These categories are not mutually exclusive and can be combined. The corpus allows for the analysis of linguistic and communicative phenomena, including markers of humour and strategies used to achieve humorous effects (teasing, mocking, irony, or metaphorical language) in informal private spoken conversations. The corpus is available as WAV audio recordings, while the (aligned) transcriptions are given in the formats of the EXMARaLDA (https://exmaralda.org/en/) and Transcriber (https://trans.sourceforge.net/) tools, as well as in plain text. |
| dc.language.iso | slv |
| dc.publisher | Filozofska fakulteta, Univerza v Mariboru |
| dc.publisher | Faculty of Electrical Engineering and Computer Science, University of Maribor |
| dc.relation.isreferencedby | https://doi.org/10.18690/um.ff.4.2024.10 |
| dc.rights | Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | https://www.clarin.si/info/services/projects/#Corpus_of_conversational_humor_Krohot |
| dc.subject | spoken corpus |
| dc.subject | humour |
| dc.subject | speech database |
| dc.subject | speech recordings |
| dc.subject | speech transcription |
| dc.subject | free conversation |
| dc.title | Corpus of conversational humor Krohot 1.0 |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | audio |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Mira Krajnc Ivič mira.krajnc@um.si Filozofska fakulteta, Univerza v Mariboru |
| sponsor | Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds |
| sponsor | Public Agency for Scientific Research and Innovation of the Republic of Slovenia GC-0002 Large Language Models for Digital Humanities (LLM4DH) nationalFunds |
| size.info | 232 minutes |
| size.info | 10 texts |
| files.count | 2 |
| files.size | 921527339 |
Files in this item
Download all files in item (878.84 MB)This item is
Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
- Name
- README.md
- Size
- 4.19 KB
- Format
- Unknown
- Description
- Description of the structure of the data
- MD5
- 570bcacfc473b159f3312c8632b6a286
- Name
- Krohot.zip
- Size
- 878.83 MB
- Format
- application/zip
- Description
- Corpus and documentation
- MD5
- ce757716ad8457381f56159781cb881d
- Krohot
- DATA
- WAV
- Krohot-GSO-P0047.wav101 MB
- Krohot-GSO-P0046.wav129 MB
- Krohot-GSO-P0032.wav101 MB
- Krohot-GSO-P0045.wav145 MB
- Krohot-GSO-P0044.wav127 MB
- Krohot-GSO-P0030.wav151 MB
- Krohot-GSO-P0042.wav101 MB
- Krohot-GSO-P0041.wav101 MB
- Krohot-GSO-P0035.wav105 MB
- Krohot-GSO-P0048.wav109 MB
- EXS
- Krohot-GSO-P0041_s.exs1 MB
- Krohot-GSO-P0045_s.exs1 MB
- Krohot-GSO-P0047_s.exs1 MB
- Krohot-GSO-P0035_s.exs1 MB
- Krohot-GSO-P0030_s.exs1 MB
- Krohot-GSO-P0042_s.exs1 MB
- Krohot-GSO-P0044_s.exs1 MB
- Krohot-GSO-P0032_s.exs928 kB
- Krohot-GSO-P0046_s.exs1 MB
- Krohot-GSO-P0048_s.exs856 kB
- Krohot.coma53 kB
- TRS
- Krohot-GSO-P0032-std.trs76 kB
- Krohot-GSO-P0030-std.trs87 kB
- Krohot-GSO-P0046-pog.trs88 kB
- Krohot-GSO-P0044-pog.trs83 kB
- Krohot-GSO-P0042-pog.trs80 kB
- Krohot-GSO-P0047-std.trs84 kB
- Krohot-GSO-P0041-pog.trs72 kB
- Krohot-GSO-P0035-pog.trs95 kB
- Krohot-GSO-P0044-std.trs84 kB
- Krohot-GSO-P0042-std.trs81 kB
- Krohot-GSO-P0030-pog.trs87 kB
- Krohot-GSO-P0048-pog.trs51 kB
- Krohot-GSO-P0047-pog.trs84 kB
- Krohot-GSO-P0045-pog.trs91 kB
- Krohot-GSO-P0048-std.trs51 kB
- Krohot-GSO-P0046-std.trs89 kB
- Krohot-GSO-P0045-std.trs92 kB
- Krohot-GSO-P0032-pog.trs73 kB
- Krohot-GSO-P0035-std.trs97 kB
- Krohot-GSO-P0041-std.trs74 kB
- TXT
- Krohot-GSO-P0030-pog.txt34 kB
- Krohot-GSO-P0032-pog.txt24 kB
- Krohot-GSO-P0046-pog.txt29 kB
- Krohot-GSO-P0048-pog.txt22 kB
- Krohot-GSO-P0035-pog.txt27 kB
- Krohot-GSO-P0044-pog.txt27 kB
- Krohot-GSO-P0047-pog.txt29 kB
- Krohot-GSO-P0042-pog.txt29 kB
- Krohot-GSO-P0045-pog.txt35 kB
- Krohot-GSO-P0041-pog.txt26 kB
- EXB
- Krohot-GSO-P0046.exb190 kB
- Krohot-GSO-P0032.exb169 kB
- Krohot-GSO-P0045.exb220 kB
- Krohot-GSO-P0044.exb197 kB
- Krohot-GSO-P0030.exb212 kB
- Krohot-GSO-P0042.exb193 kB
- Krohot-GSO-P0041.exb165 kB
- Krohot-GSO-P0035.exb209 kB
- Krohot-GSO-P0048.exb140 kB
- Krohot-GSO-P0047.exb192 kB
- WAV
- PREBERIME.md3 kB
- README.md4 kB
- DOC
- Krohot-DOC-en.pdf79 kB
- Krohot-shema-sl.pdf150 kB
- Krohot-meta-speeches.tsv4 kB
- Krohot-meta-speakers.tsv5 kB
- Krohot-schema-en.pdf137 kB
- Krohot-DOC-sl.pdf87 kB
- DATA