CLARIN.SI Repository

What's New

corpus

CLARIN.SI data & tools

"Choice of plausible alternatives" datasets in South Slavic dialects DIALECT-COPA

Author(s):

Ljubešić, Nikola ; Kuzman, Taja ; Rupnik, Peter ; Milosavljević, Stefan ; Galant, Nada ; Benčina, Sonja ; Čibej, Jaka

Description:

The DIALECT-COPA datasets comprise Choice of Plausible Alternatives (COPA) datasets for three South Slavic dialects: (1) COPA-SL-CER for the Cerkno dialect of Slovenian, spoken in the Slovenian Littoral region, specifically ...

This entry contains 6 files (279.69 KB).

Publicly Available. Distributed under Creative Commons.

languageDescription

CLARIN.SI data & tools

Overview of inflectional paradigms in Slovenian

Author(s):

Štarkl, Ema ; Mišmaš, Petra and Simonović, Marko

Description:

The purpose of the overview is to provide a comprehensive overview of the inflectional features associated with specific endings. Each ending has a dedicated row in the table and is exemplified by a word in the relevant ...

This entry contains 2 files (208.07 KB).

Publicly Available. Distributed under Creative Commons.

corpus

CLARIN.SI data & tools

The Sarajevo Corpus of SMS Messages in Bosnian

Author(s):

Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin

Description:

This corpus is specialized, static (i.e., no future growth is planned), diachronic and covers the period from 2002 to 2022. All messages included in this Corpus were obtained from voluntary donors (informants). Both senders ...

This entry contains 1 file (1.73 MB).

Publicly Available. Distributed under Creative Commons.

Most Viewed

In the Past Week

corpus

CLARIN.SI data & tools

Croatian corpus of non-professional written language by typical speakers and speakers with language disorders RAPUT 1.0

Author(s):

Kuvač Kraljević, Jelena ; Hržica, Gordana ; Štefanec, Vanja ; Kologranić Belić, Lana and Ljubešić, Nikola

Description:

The corpus consists of texts produced by nonprofessional typical speakers and speakers with different language disorders (developmental language disorder, dyslexia, traumatic brain injury, aphasia, other). Roughly half of ...

This entry contains 2 files (8.11 MB).

Publicly Available. Distributed under Creative Commons.

languageDescription

CLARIN.SI data & tools

Overview of inflectional paradigms in Slovenian

Author(s):

Štarkl, Ema ; Mišmaš, Petra and Simonović, Marko

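Description:

The purpose of the overview is to provide a comprehensive overview of the inflectional features associated with specific endings. Each ending has a dedicated row in the table and is exemplified by a word in the relevant ...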

This entry contains 2 files (208.07 KB).

Publicly Available. Distributed under Creative Commons.

corpus

CLARIN.SI data & tools