What's New
corpus
Description:
SloPhonSeg 1.0 is a dataset of automatically generated phonetic segmentations and acoustic-phonetic measurements for selected recordings and transcriptions from the spoken corpus Gos 2.1 (http://hdl.handle.net/11356/1863).
The ...
This entry contains 3 files (477.6 MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "En korak, en utrip srca" (One step, one heartbeat) by author Leopold Suhodolčan (COBISS ID: 277539843, ISBN: 978-961-291-545-2).
Recreational marathon runner Samo ...
This entry contains 3 files (140.25 MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "Cesar Arnulf" (Emperor Arnulf) by author Leopold Suhodolčan (COBISS ID: 277489667, ISBN: 978-961-291-548-39).
This entry contains 8 files (150.29 MB).
Publicly Available
Most Viewed
In the past week
corpus
Description:
ParlaMint 5.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This entry contains 31 files (5.94 GB).
Publicly Available
corpus
Description:
SloIE is a manually labelled dataset of Slovene idiomatic expressions. It contains 29,400 sentences with 75 different expressions that can occur with either a literal or an idiomatic meaning, with appropriate manual ...
This entry contains 1 file (4.22 MB).
Publicly Available
corpus
Description:
This entry includes the first part of the e-book "Socialna omrežja" (Social Networks) by author Aleš Jelenko (COBISS.SI-ID 270071555, ISBN 978-961-7272-26-0).
What do we say when we speak in a digital language? And what ...
This entry contains 1 file (466.99 KB).
Publicly Available