What's New

corpus
Description:
This entry contains the first part of the audiobook "Rumena podmornica" (The yellow submarine) by author Leopold Suhodolčan (COBISS ID: 273740035, ISBN: 978-961-7194-61-6). When it grew dark, Žiga and Marko cautiously ...
 This item contains 5 files (128.31 MB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Share Alike.
corpus
Description:
This entry contains the first part of the audiobook "Velikan in pajac" (The giant and the clown) by author Leopold Suhodolčan (COBISS ID: 272964355, ISBN: 978-961-7194-58-6). Velikan in pajac (The giant and the clown) ...
 This item contains 5 files (148.12 MB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Share Alike.
corpus
Description:
This entry contains the first part of the audiobook "Stopinje po zraku" (Footsteps in the air) by author Leopold Suhodolčan (COBISS ID: 273202947, ISBN: 978-961-7194-59-3). The detective assignment that catches Naočnik ...
 This item contains 7 files (105.68 MB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Share Alike.

Most Viewed Items

Top Last Week
corpus
Description:
SloIE is a manually labelled dataset of Slovene idiomatic expressions. It contains 29,400 sentences with 75 different expressions that can occur with either a literal or an idiomatic meaning, with appropriate manual ...
 This item contains 1 file (4.22 MB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.
corpus
Description:
The dataset represents the Twitter production in Slovenian in the period from 2018 until 2020. It consists of tweet IDs, retweet IDs, pseudo-anonymized user IDs, publication dates, and automatically assigned hate labels ...
 This item contains 1 file (182.04 MB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Share Alike.
corpus
Description:
A dataset of user comments from the 24sata.hr news portal, extracted from the portal's comment database and provided for research purposes to EMBEDDIA, a Horizon 2020 project. 24sata.hr is the largest-circulation ...
 This item contains 3 files (1.89 GB).
 
Publicly available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.