What's New
corpus
Description:
This entry contains the first part of the audiobook "Izgubljeni Prešeren" (Lost Prešeren) by author Ivan Sivec (COBISS ID: 276089603, ISBN: 978-961-7143-64-5).
The story begins when Tina, as a university student, helps ...
This entry contains 3 files (79.52 MB).
Publicly Available
corpus
Description:
This entry includes the first part of the e-book "Okupacija" (Occupation) by author Gal Prevoršek (COBISS.SI-ID 275187459, ISBN 978-961-7272-63-5).
This entry contains 1 file (590.3 KB).
Publicly Available
corpus
Description:
This entry includes the first part of the e-book "Šlagerji" (Hits) by author Feri Lainšček (COBISS.SI-ID 275166467, ISBN 978-961-7272-62-8).
The book brings an extensive selection of Lainšček's poetic texts set to music, ...
This entry contains 1 file (4.14 MB).
Publicly Available
Most Viewed
In the past week
corpus
Description:
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic ...
This entry contains 7 files (3.83 MB).
Publicly Available
corpus
Description:
ParlaMint 5.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This entry contains 31 files (5.94 GB).
Publicly Available
corpus
Description:
ParlaMint-en 3.0 comprises linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 3.0 (http://hdl.handle.net/11356/1488) which were machine translated to English and the translation ...
This entry contains 26 files (38.68 GB).
Publicly Available