What's New
corpus
Description:
This entry contains the first part of the audiobook "Pramatija ali Bučman" (Pramatija, or the Bogeyman) by author Leopold Suhodolčan (COBISS ID: 264527107, ISBN: 978-961-7194-44-9).
Television spotlights were shining ...
This item contains 9 files (81.11
MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "Rdeči lev" (The red lion) by author Leopold Suhodolčan (COBISS ID: 264850179, ISBN: 978-961-7194-48-7).
Blaž was faster and soon managed to escape them, but they still ...
This item contains 6 files (92.75
MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "Na večerji s krokodilom" (Dinner with a crocodile) by author Leopold Suhodolčan (COBISS ID: 264847619, ISBN: 978-961-7194-45-6).
This item contains 11 files (90.74
MB).
Publicly Available
Most Viewed Items
Top Last Week
corpus
Description:
The dataset of user comments provided for research purposes for the EMBEDDIA, a Horizon 2020 project, extracted from the database of user comments from the 24sata.hr news portal. The 24sata.hr is the largest-circulation ...
This item contains 3 files (1.89
GB).
Publicly Available
corpus
Description:
SloIE is a manually labelled dataset of Slovene idiomatic expressions. It contains 29,400 sentences with 75 different expressions that can occur with either a literal or an idiomatic meaning, with appropriate manual ...
This item contains 1 file (4.22
MB).
Publicly Available
corpus
Description:
The dataset represents the Twitter production in Slovenian in the period from 2018 until 2020. It consists of tweet IDs, retweet IDs, pseudo-anonymized user IDs, publication dates, and automatically assigned hate labels ...
This item contains 1 file (182.04
MB).
Publicly Available