What's New

 corpus 
corpus
Description:
The dataset consists of 7514 Slovenian news articles from the SentiNews 1.0 corpus by Bučar et al. 2017 (http://hdl.handle.net/11356/1110) which had available article keywords. We provide the train and test data splits ...
 This item contains 2 files (6.05 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Description:
The Trendi corpus is a monitor corpus of Slovene. It contains news from 107 different media websites, published by 48 different publishers. Trendi 2022-05 covers the period from January 2019 to May 2022, complementing the ...
 This item contains no files.
 lexicalConceptualResource 
lexicalConceptualResource
Description:
Algemeen Nederlands Woordenboek (ANW). The ANW is a corpus-based, digital dictionary that describes contemporary Dutch in the Netherlands, Flanders, Suriname, and the Caribbean as comprehensively as possible. The language ...
 This item contains no files.

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The 24sata news portal consists of a portal with daily news and several smaller portals covering news from specific topics, such as automotive news, health, culinary content, and lifestyle advice. The dataset contains over ...
 This item contains 2 files (1.26 GB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works