What's New

 corpus 
corpus
Description:
The SloREL corpus contains annotations for training relation extraction models on Slovene documents. It contains documents from Slovene Wikipedia with annotated entities and relations. We constructed the annotations using ...
 Ta vnos vsebuje 1 datoteko (38.71 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The terminological dictionary was compiled within the framework of the project Development of Slovene in the Digital Environment. It is an example collection of 413 terms from the field of artificial intelligence, especially ...
 Ta vnos vsebuje 1 datoteko (28.89 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The Glossary for academic integrity comprises 212 terms related to academic integrity presented in alphabetical order. Each term is accompanied by a definition, and if relevant, common synonyms. The Glossarywas developed ...
 Ta vnos vsebuje 1 datoteko (56.14 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required

Največ ogledov

V preteklem tednu
 corpus 
corpus
Author(s):
Description:
The corpus contains 256,567 documents from the Slovenian news portals 24ur, Dnevnik, Finance, Rtvslo, and Žurnal24. These portals contain political, business, economic and financial content. The submission contains 7 files: ...
 Ta vnos vsebuje 8 datotek(e) (616.88 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Description:
The 24sata news portal consists of a portal with daily news and several smaller portals covering news from specific topics, such as automotive news, health, culinary content, and lifestyle advice. The dataset contains over ...
 Ta vnos vsebuje 2 datotek(e) (1.26 GB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works
 toolService 
toolService
Description:
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI P5 XML formats and outputs .CSV files that ...
 Ta vnos vsebuje 1 datoteko (16.26 MB).
 
Publicly Available