Najnovejše

 lexicalConceptualResource 
lexicalConceptualResource
Opis:
The list of single-word occupations in Slovene is based on the Slovene Standard Classification of Occupations (https://www.uradni-list.si/glasilo-uradni-list-rs/vsebina?urlid=199728&stevilka=1641). The list includes 234 ...
 Ta vnos vsebuje 1 datoteko (5.94 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 corpus 
corpus
Opis:
The corpus consists of transcripts of audio-recorded biographical interviews with 19 participants. The interviews are about forms of address that speakers use in colloquial and in formal settings, and about their attitudes ...
 Ta vnos vsebuje 1 datoteko (2.39 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike
 corpus 
corpus
Opis:
Corpus of Slovenian school texts is a lemmatized and POS-tagged specialized corpus, which includes 428 short school texts written primarily by primary-school students from 1st to 5th grades from 2017 to 2020. The corpus ...
 Ta vnos vsebuje 1 datoteko (1.14 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required

Največ ogledov

V preteklem tednu
 lexicalConceptualResource 
lexicalConceptualResource
Opis:
A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages. The process and analysis of emoji sentiment ...
 Ta vnos vsebuje 3 datotek(e) (93.95 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Opis:
The dataset contains over 1.6 million tweets (tweet IDs), labeled with sentiment by human annotators. There are 15 Twitter corpora for the corresponding 15 European languages. The data can be used to train and evaluate ...
 Ta vnos vsebuje 16 datotek(e) (49.38 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Opis:
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starting at the end of 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in ...
 Ta vnos vsebuje 9 datotek(e) (5.12 GB).
 
Publicly Available Distributed under Creative Commons Attribution Required