What's New
toolService
Description:
This is a retrained Slovenian model for the Trankit v1.1.1 library for multilingual natural language processing (https://pypi.org/project/trankit/), trained on the concatenation of the SSJ UD treebank of written Slovenian ...
This item contains 1 file (145.55
MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "Sam bog naj jo bere" (Let only God read it) by author Alenka Čurin Janžekovič (COBISS ID: 277038339, ISBN: 978-961-291-543-8).
An extraordinary first-person account ...
This item contains 4 files (177.19
MB).
Publicly Available
corpus
Description:
This entry contains the first part of the audiobook "Gelika" (Gelika) by author Ema Golčer (COBISS ID: 277477635, ISBN: 978-961-291-546-9).
This item contains 7 files (227.37
MB).
Publicly Available
Most Viewed Items
Top Last Week
corpus
Description:
This entry includes the first part of the e-book "Socialna omrežja" (Social networks) by author Aleš Jelenko (COBISS.SI-ID 270071555, ISBN 978-961-7272-26-0).
What do we say when we speak in a digital language? And what ...
This item contains 1 file (466.99
KB).
Publicly Available
corpus
Description:
ParlaMint 5.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This item contains 31 files (5.94
GB).
Publicly Available
corpus
Description:
COLESLAW 1.0 is a large-scale collection of Slovenian legal texts compiled from authoritative public sources. The corpus covers legislative, judicial, and governmental legal documents and is designed to support research ...
This item contains 1 file (1.24
GB).
Publicly Available