What's New
corpus

Description:
The Ukrainian parliamentary corpus ParlaMint-UA 4.0.1 is an extended version of the ParlaMint-UA 4.0 corpus (available as a collection of plain texts along with TSV metadata of the speeches http://hdl.handle.net/11356/1859 ...
This item contains 4 files (3.84
GB).
Publicly Available


corpus

Description:
Šolar-Eval is a specialized dataset designed for the evaluation of Slovene spell- and grammar-checking tools and methodologies. It encompasses 109 essays authored by Slovene primary and secondary school students, featuring ...
This item contains 4 files (12.54
MB).
Publicly Available




lexicalConceptualResource

Description:
This resource contains 713,310 collocation candidates, which were automatically extracted from the Gigafida 2.0 corpus (http://hdl.handle.net/11356/1320) and annotated whether they are legitimate collocations or not. The ...
This item contains 1 file (9.7
MB).
Publicly Available



Most Viewed Items
Top Last Week
corpus

Description:
The Ukrainian parliamentary corpus ParlaMint-UA 4.0.1 is an extended version of the ParlaMint-UA 4.0 corpus (available as a collection of plain texts along with TSV metadata of the speeches http://hdl.handle.net/11356/1859 ...
This item contains 4 files (3.84
GB).
Publicly Available


corpus

Description:
ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This item contains 30 files (5.67
GB).
Publicly Available


corpus

Description:
The novel "1984" by George Orwell is the central component of the MULTEXT-East corpus. This parallel and sentence aligned corpus contains the novel in the English original (about 100,000 words in length), and its translations ...
This item contains 1 file (14.12
MB).
Academic Use

