What's New

 corpus 
corpus
Description:
The dataset contains social media posts from X and traditional media articles from online news sources related to the Slovenian commemorations of the Day of Resistance. We used two types of data: For the social media ...
 Ta vnos vsebuje 2 datotek(e) (2.5 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 toolService 
toolService
Description:
This is a retrained Slovenian model for the Trankit v1.1.1 library for multilingual natural language processing (https://pypi.org/project/trankit/), trained on the concatenation of the SSJ UD treebank of written Slovenian ...
 Ta vnos vsebuje 1 datoteko (145.55 MB).
 
Publicly Available
 corpus 
corpus
Description:
This entry contains the first part of the audiobook "Sam bog naj jo bere" (Let only God read it) by author Alenka Čurin Janžekovič (COBISS ID: 277038339, ISBN: 978-961-291-543-8). An extraordinary first-person account ...
 Ta vnos vsebuje 4 datotek(e) (177.19 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike

Največ ogledov

V preteklem tednu
 lexicalConceptualResource 
lexicalConceptualResource
Description:
Trilingual (EN-LT-NO) glossary of terms denoting phobia types extracted from the articles of English "The Guardian", Lithuanian "DELFI", and Norwegian "Dagbladet" news media sites.
 Ta vnos ne vsebuje datotek.
 corpus 
corpus
Description:
COLESLAW 1.0 is a large-scale collection of Slovenian legal texts compiled from authoritative public sources. The corpus covers legislative, judicial, and governmental legal documents and is designed to support research ...
 Ta vnos vsebuje 1 datoteko (1.24 GB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 corpus 
corpus
Author(s):
Description:
goo300k is a manually annotated reference corpus of historical Slovene. It contains 1,100 pages (about 300,000 tokens) sampled from 89 texts from the period 1584-1899. Each text contains extensive meta-data and per-page ...
 Ta vnos vsebuje 2 datotek(e) (8.9 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required