CLARIN.SI Repository

What's New

languageDescription

Overview of inflectional paradigms in Slovenian

Author(s):

Štarkl, Ema ; Mišmaš, Petra and Simonović, Marko

Description:

The overview aims to give a comprehensive account of the inflectional features associated with specific endings. Each ending has a dedicated row in the table and is exemplified by a word in the relevant ...

This entry contains 2 files (208.07 KB).

Publicly Available Distributed under Creative Commons

corpus

CLARIN.SI data & tools

The Sarajevo Corpus of SMS Messages in Bosnian

Author(s):

Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin

Description:

This corpus is specialized, static (i.e., no future growth is planned), and diachronic, covering the period from 2002 to 2022. All messages included in this corpus were obtained from voluntary donors (informants). Both senders ...

This entry contains 1 file (1.73 MB).

Publicly Available Distributed under Creative Commons

corpus

CLARIN.SI data & tools

Albanian Spoken Corpus in Kosovo 0.2

Author(s):

Wasserscheidt, Philipp ; Rugova, Bardh and Baftiu, Adelajda

Description:

This is the second version of a spoken corpus of Albanian in Kosovo. The corpus data is based on short life stories of 212 informants out of a sample of 1,800 speakers balanced across all regions of Kosovo and the ...

This entry contains 1 file (1.76 MB).

Publicly Available Distributed under Creative Commons

Most Viewed

In the Past Week

toolService

CLARIN.SI data & tools

Slovene Conformer CTC BPE E2E Automated Speech Recognition model RSDO-DS2-ASR-E2E 2.0

Author(s):

Lebar Bajec, Iztok ; Bajec, Marko ; Bajec, Žan and Rizvič, Mitja

Description:

This Conformer CTC BPE E2E Automated Speech Recognition model was trained following the NVIDIA NeMo Conformer-CTC recipe (for details see the official NVIDIA NeMo NMT documentation, https://docs.nvidia.com/deeplearning/n ...

This entry contains 1 file (430.61 MB).

Publicly Available

corpus

CLARIN.SI data & tools

Multilingual comparable corpora of parliamentary debates ParlaMint 4.0

Author(s):

Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja

Description:

ParlaMint 4.0 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...

This entry contains 30 files (5.67 GB).

Publicly Available Distributed under Creative Commons

corpus

CLARIN.SI data & tools

The Sarajevo Corpus of SMS Messages in Bosnian

Author(s):

Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin

Description:

This entry contains 1 file (1.73 MB).

Publicly Available Distributed under Creative Commons

Linguistic Data and NLP Tools

Find

Citation Support (with Persistent IDs)

Deposit Free and Safe

License of your Choice (Open licenses encouraged)

Easy to Find

Easy to Cite
