What's New

corpus
Description:
The dataset consists of mid-length sentences from the parliamentary proceedings of Bosnia and Herzegovina, Croatia, Czechia, Serbia, Slovakia, Slovenia, and the United Kingdom, annotated with a 6-level sentiment schema ...
 This item contains 8 files (7.43 MB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required, Share Alike).
toolService
Description:
The inflectional data lookup module serves as an optional component within the cordex library (https://github.com/clarinsi/cordex/) that significantly improves the quality of the results. The module consists of a pickled ...
 This item contains 1 file (31.44 MB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required, Noncommercial, Share Alike).
toolService
Description:
This is a collection of modular teaching and learning content created in the UPSKILLS project (UPgrading the SKIlls of Linguistics and Language Students) and downloaded from the Moodle platform in .mbz format. The learning ...
 This item contains 13 files (1.94 GB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required).

Most Viewed Items

Top Last Week
corpus
Author(s):
Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Fišer, Darja ; Pirker, Hannes ; Wissik, Tanja ; Schopper, Daniel ; Kirnbauer, Martin ; Mochtak, Michal ; Ljubešić, Nikola ; Rupnik, Peter ; Pol, Henk van der ; Depoorter, Griet ; de Does, Jesse ; Simov, Kiril ; Grigorova, Vladislava ; Grigorov, Ilko ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Mölder, Martin ; Kahusk, Neeme ; Vider, Kadri ; Bel, Nuria ; Antiba-Cartazo, Iván ; Pisani, Marilina ; Zevallos, Rodolfo ; Regueira, Xosé Luís ; Vladu, Adina Ioana ; Magariños, Carmen ; Bardanca, Daniel ; Barcala, Mario ; Garcia, Marcos ; Pérez Lago, María ; García Louzao, Pedro ; Vivel Couso, Ainhoa ; Vázquez Abuín, Marta ; García Díaz, Noelia ; Vidal Miguéns, Adrián ; Fernández Rei, Elisa ; Diwersy, Sascha ; Luxardo, Giancarlo ; Coole, Matthew ; Rayson, Paul ; Nwadukwe, Amanda ; Gkoumas, Dimitris ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Gavriilidou, Maria ; Piperidis, Stelios ; Ligeti-Nagy, Noémi ; Jelencsik-Mátyus, Kinga ; Varga, Zsófia ; Dodé, Réka ; Barkarson, Starkaður ; Agnoloni, Tommaso ; Bartolini, Roberto ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Darģis, Roberts ; van Heusden, Ruben ; Marx, Maarten ; Depuydt, Katrien ; Tungland, Lars Magne ; Rudolf, Michał ; Nitoń, Bartłomiej ; Aires, José ; Mendes, Amália ; Cardoso, Aida ; Pereira, Rui ; Yrjänäinen, Väinö ; Norén, Fredrik Mohammadi ; Magnusson, Måns ; Jarlbrink, Johan ; Meden, Katja ; Pančur, Andrej ; Ojsteršek, Mihael ; Çöltekin, Çağrı ; Kryvenko, Anna
Description:
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2022, with the individual corpora being between 9 and 125 million words in size. The ...
 This item contains 27 files (58.26 GB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required).
corpus
Author(s):
Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Fišer, Darja ; Pirker, Hannes ; Wissik, Tanja ; Schopper, Daniel ; Kirnbauer, Martin ; Ljubešić, Nikola ; Rupnik, Peter ; Mochtak, Michal ; Pol, Henk van der ; Depoorter, Griet ; Simov, Kiril ; Grigorova, Vladislava ; Grigorov, Ilko ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Mölder, Martin ; Kahusk, Neeme ; Vider, Kadri ; Bel, Nuria ; Antiba-Cartazo, Iván ; Pisani, Marilina ; Zevallos, Rodolfo ; Vladu, Adina Ioana ; Magariños, Carmen ; Bardanca, Daniel ; Barcala, Mario ; Garcia, Marcos ; Pérez Lago, María ; García Louzao, Pedro ; Vivel Couso, Ainhoa ; Vázquez Abuín, Marta ; García Díaz, Noelia ; Vidal Miguéns, Adrián ; Fernández Rei, Elisa ; Regueira, Xosé Luís ; Diwersy, Sascha ; Luxardo, Giancarlo ; Coole, Matthew ; Rayson, Paul ; Nwadukwe, Amanda ; Gkoumas, Dimitris ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Gavriilidou, Maria ; Piperidis, Stelios ; Ligeti-Nagy, Noémi ; Jelencsik-Mátyus, Kinga ; Varga, Zsófia ; Dodé, Réka ; Barkarson, Starkaður ; Agnoloni, Tommaso ; Bartolini, Roberto ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Darģis, Roberts ; van Heusden, Ruben ; Marx, Maarten ; Tungland, Lars Magne ; Rudolf, Michał ; Nitoń, Bartłomiej ; Aires, José ; Mendes, Amália ; Cardoso, Aida ; Pereira, Rui ; Yrjänäinen, Väinö ; Norén, Fredrik Mohammadi ; Magnusson, Måns ; Jarlbrink, Johan ; Meden, Katja ; Pančur, Andrej ; Ojsteršek, Mihael ; Çöltekin, Çağrı ; Kryvenko, Anna
Description:
ParlaMint 3.0 is a multilingual set of 26 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2022, with the individual corpora being between 9 and 125 million words in size. The ...
 This item contains 27 files (5.22 GB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required).
corpus
Description:
The dataset consists of mid-length sentences from the parliamentary proceedings of Bosnia and Herzegovina, Croatia, Czechia, Serbia, Slovakia, Slovenia, and the United Kingdom, annotated with a 6-level sentiment schema ...
 This item contains 8 files (7.43 MB).
 
Publicly Available. Distributed under Creative Commons (Attribution Required, Share Alike).