What's New

 lexicalConceptualResource 
lexicalConceptualResource
Description:
The resource contains several datasets containing domain-specific data in three languages, English, Slovenian and Croatian, which can be used for various knowledge extraction or knowledge modelling tasks. The resource ...
 This item contains 1 file (1.25 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 lexicalConceptualResource 
lexicalConceptualResource
Description:
2,060 recordings in mp3 format were made for the School Dictionary of the Slovenian Language based on the original recordings in wav format (48 kHZ, 24-bit). Around 600 recordings were made at the Institute of Ethnomusicology, ...
 This item contains 1 file (54.56 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 corpus 
corpus
Description:
The FRENK dataset consists of comments to Facebook posts (news articles) of mainstream media outlets from Croatia, Great Britain, and Slovenia, on the topics of migrants and LGBT. The dataset contains whole discussion ...
 This item contains 2 files (4.48 MB).
 
Academic Use Inform Before Use Attribution Required Noncommercial

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The novel "1984" by George Orwell is the central component of the MULTEXT-East corpus. This parallel and sentence aligned corpus contains the novel in the English original (about 100,000 words in length), and its translations ...
 This item contains 1 file (14.12 MB).
 
Academic Use Attribution Required Noncommercial
 lexicalConceptualResource 
lexicalConceptualResource
Author(s):
Description:
hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency, per-million frequency) 8-tuple. The (wordform, lemma, ...
 This item contains 1 file (51.95 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Author(s):
Description:
KOMET 1.0 is a hand-annotated corpus for metaphorical expressions which contains about 200,000 words from Slovene journalistic, fiction and on-line texts. To annotate metaphors in the corpus an adapted and modified ...
 This item contains 1 file (6.97 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike