What's New

 corpus 
corpus
Description:
The CVET corpus contains 230 texts (around 175 thousand words) of varying length, published in the religious journal "Cvetje z vertov sv. Frančiška" between 1887 and 1916, when the magazine was edited by the linguist Fr. ...
 Ta vnos vsebuje 4 datotek(e) (15.02 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 corpus 
corpus
Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 73 publishers. Trendi 2024-04 covers the period from January 2019 to April 2024, complementing the Gigafida ...
 Ta vnos ne vsebuje datotek.
 corpus 
corpus
Description:
The DIALECT-COPA datasets comprise Choice of Plausible Alternatives (COPA) datasets for three South Slavic dialects: (1) COPA-SL-CER for the Cerkno dialect of Slovenian, spoken in the Slovenian Littoral region, specifically ...
 Ta vnos vsebuje 6 datotek(e) (279.69 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike

Največ ogledov

V preteklem tednu
 corpus 
corpus
Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 70 publishers. Trendi 2024-02 covers the period from January 2019 to February 2024, complementing the ...
 Ta vnos ne vsebuje datotek.
 corpus 
corpus
Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 70 publishers. Trendi 2024-03 covers the period from January 2019 to March 2024, complementing the Gigafida ...
 Ta vnos ne vsebuje datotek.
 corpus 
corpus
Description:
Janes-Preklop is a corpus of Slovene tweets that is manually annotated for code-switching (the use of words from two or more languages within one sentence or utterance), according to the supplied typology. Words in the ...
 Ta vnos vsebuje 4 datotek(e) (1.28 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike