What's New

 corpus 
Description:
The SloREL corpus contains annotations for training relation extraction models on Slovene documents. It contains documents from Slovene Wikipedia with annotated entities and relations. We constructed the annotations using ...
 This item contains 1 file (39.74 MB).
 
Publicly Available. Distributed under Creative Commons: Attribution Required.
 corpus 
Description:
The developmental corpus Šolar consists of 5,485 texts written by students in Slovenian secondary schools (ages 15-19) and pupils in the 7th-9th grade of primary school (ages 13-15), with a small percentage also from the 6th ...
 This item contains 4 files (194.6 MB).
 
Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.
 lexicalConceptualResource 
Description:
The verbal Western South Slavic database (WeSoSlaV) contains the 3000 most frequent Slovenian and the 5300 most frequent BCS verbs, all coded for a number of properties ranging from their phonology and morphology to their ...
 This item contains 3 files (513.16 KB).
 
Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

Most Viewed Items

Top Last Week
 corpus 
Description:
The 24sata news portal consists of a portal with daily news and several smaller portals covering news from specific topics, such as automotive news, health, culinary content, and lifestyle advice. The dataset contains over ...
 This item contains 2 files (1.26 GB).
 
Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.