What's New
corpus
Description:
This corpus is specialized, static (i.e., no future growth is planned), diachronic and covers the period from 2002 to 2022.
The SMS messages included in this corpus were obtained from voluntary donors (informants). Both ...
Ta vnos vsebuje 1 datoteko (1.69
MB).
Publicly Available
corpus
Description:
This is the third version of a spoken corpus of Albanian in Kosovo.
The data of the corpus is based on short life stories of 212 informants out of sample of 1800 speakers balanced across all regions of Kosovo and the ...
Ta vnos vsebuje 1 datoteko (1.76
MB).
Publicly Available
corpus
Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 74 publishers. Trendi 2024-06 covers the period from January 2019 to June 2024, complementing the Gigafida ...
Ta vnos ne vsebuje datotek.
Največ ogledov
V preteklem tednu
corpus
Description:
This corpus is specialized, static (i.e., no future growth is planned), diachronic and covers the period from 2002 to 2022.
The SMS messages included in this corpus were obtained from voluntary donors (informants). Both ...
Ta vnos vsebuje 1 datoteko (1.69
MB).
Publicly Available
corpus
Description:
ParlaMint 4.1 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
Ta vnos vsebuje 30 datotek(e) (5.87
GB).
Publicly Available
corpus
Description:
The dataset of user comments provided for research purposes for the EMBEDDIA, a Horizon 2020 project, extracted from the database of user comments from the 24sata.hr news portal. The 24sata.hr is the largest-circulation ...
Ta vnos vsebuje 3 datotek(e) (1.89
GB).
Publicly Available