What's New
corpus
Description:
This entry contains the first part of the audiobook "Izgubljeni Prešeren" (Lost Prešeren) by author Ivan Sivec (COBISS ID: 276089603, ISBN: 978-961-7143-64-5).
The story begins when Tina, as a university student, helps ...
This item contains 3 files (79.52
MB).
Publicly Available
corpus
Description:
This entry includes the first part of the e-book "Okupacija" (Occupation) by author Gal Prevoršek (COBISS.SI-ID 275187459, ISBN 978-961-7272-63-5).
This item contains 1 file (590.3
KB).
Publicly Available
corpus
Description:
This entry includes the first part of the e-book "Šlagerji" (Hits) by author Feri Lainšček (COBISS.SI-ID 275166467, ISBN 978-961-7272-62-8).
The book brings an extensive selection of Lainšček's poetic texts set to music, ...
This item contains 1 file (4.14
MB).
Publicly Available
Most Viewed Items
Top Last Week
corpus
Description:
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic ...
This item contains 7 files (3.83
MB).
Publicly Available
corpus
Description:
ParlaMint 5.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This item contains 31 files (5.94
GB).
Publicly Available
corpus
Description:
ParlaMint-en 3.0 comprises linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 3.0 (http://hdl.handle.net/11356/1488) which were machine translated to English and the translation ...
This item contains 26 files (38.68
GB).
Publicly Available