What's New
lexicalConceptualResource

Description:
SNES (Stalno naglašene enote iz Sloleksa; Constantly accentuated units from Sloleks) is a dataset containing Slovene final accentuated word parts (i.e., the ending part of an accentuated word from its last grapheme with ...
Ta vnos vsebuje 1 datoteko (525.54
KB).
Publicly Available



corpus

Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-07 covers the period from January 2019 to July 2025, complementing the Gigafida ...
Ta vnos ne vsebuje datotek.
corpus

Description:
The ParlaSpeech corpora are built from the transcripts of parliamentary proceedings of Croatian, Serbian, Polish, and Czech parliaments available in the ParlaMint 4.0 corpus (http://hdl.handle.net/11356/1859), and the ...
Ta vnos vsebuje 10 datotek(e) (10.16
GB).
Publicly Available



Največ ogledov
V preteklem tednu
toolService

Description:
The X-GENRE classifier is a text classification model that can be used for automatic genre identification. The model classifies texts to one of 9 genre labels: Information/Explanation, News, Instruction, Opinion/Argumentation, ...
Ta vnos vsebuje 1 datoteko (779.93
MB).
Publicly Available



corpus

Description:
Trilingual parallel corpus on general data protection regulation. The size of the corpus is 54,468 words in English, 42,566 words in Lithuanian, and 47,740 words in Danish.
Ta vnos ne vsebuje datotek.
lexicalConceptualResource

Description:
This dictionary has been prepared to support the Syrian Textbook prepared at the University of Vienna.
See also: https://hdl.handle.net/11022/0000-0007-C093-9
Ta vnos ne vsebuje datotek.