corpus
Author(s): Bañón, Marta; Chichirau, Malina; Esplà-Gomis, Miquel; Forcada, Mikel L.; Galiano-Jiménez, Aarón; García-Romero, Cristian; Kuzman, Taja; Ljubešić, Nikola; van Noord, Rik; Pla Sempere, Leopoldo; Ramírez-Sánchez, Gema; Rupnik, Peter; Suchomel, Vít; Toral, Antonio; Zaragoza-Bernabeu, Jaume
Description:
The Montenegrin web corpus MaCoCu-cnr 1.0 was built by crawling the ".me" internet top-level domain in 2021 and 2022, extending the crawl dynamically to other domains as well. The crawler is available at https://github.c ...
This item contains 2 files (500.14 MB).
Publicly Available
corpus
Author(s): Erjavec, Tomaž; Kopp, Matyáš; Ogrodniczuk, Maciej; Osenova, Petya; Agirrezabal, Manex; Agnoloni, Tommaso; Aires, José; Albini, Monica; Alkorta, Jon; Antiba-Cartazo, Iván; Arrieta, Ekain; Barcala, Mario; Bardanca, Daniel; Barkarson, Starkaður; Bartolini, Roberto; Battistoni, Roberto; Bel, Nuria; Bonet Ramos, Maria del Mar; Calzada Pérez, María; Cardoso, Aida; Çöltekin, Çağrı; Coole, Matthew; Darģis, Roberts; de Libano, Ruben; Depoorter, Griet; Diwersy, Sascha; Dodé, Réka; Fernandez, Kike; Fernández Rei, Elisa; Frontini, Francesca; Garcia, Marcos; García Díaz, Noelia; García Louzao, Pedro; Gavriilidou, Maria; Gkoumas, Dimitris; Grigorov, Ilko; Grigorova, Vladislava; Haltrup Hansen, Dorte; Iruskieta, Mikel; Jarlbrink, Johan; Jelencsik-Mátyus, Kinga; Jongejan, Bart; Kahusk, Neeme; Kirnbauer, Martin; Kryvenko, Anna; Ligeti-Nagy, Noémi; Ljubešić, Nikola; Luxardo, Giancarlo; Magariños, Carmen; Magnusson, Måns; Marchetti, Carlo; Marx, Maarten; Meden, Katja; Mendes, Amália; Mochtak, Michal; Mölder, Martin; Montemagni, Simonetta; Navarretta, Costanza; Nitoń, Bartłomiej; Norén, Fredrik Mohammadi; Nwadukwe, Amanda; Ojsteršek, Mihael; Pančur, Andrej; Papavassiliou, Vassilis; Pereira, Rui; Pérez Lago, María; Piperidis, Stelios; Pirker, Hannes; Pisani, Marilina; Pol, Henk van der; Prokopidis, Prokopis; Quochi, Valeria; Rayson, Paul; Regueira, Xosé Luís; Rii, Andriana; Rudolf, Michał; Ruisi, Manuela; Rupnik, Peter; Schopper, Daniel; Simov, Kiril; Sinikallio, Laura; Skubic, Jure; Tungland, Lars Magne; Tuominen, Jouni; van Heusden, Ruben; Varga, Zsófia; Vázquez Abuín, Marta; Venturi, Giulia; Vidal Miguéns, Adrián; Vider, Kadri; Vivel Couso, Ainhoa; Vladu, Adina Ioana; Wissik, Tanja; Yrjänäinen, Väinö; Zevallos, Rodolfo; Fišer, Darja
Description:
ParlaMint 4.1 is a set of comparable corpora containing transcriptions of parliamentary debates of 29 European countries and autonomous regions, mostly starting in 2015 and extending to mid-2022. The individual corpora ...
This item contains 30 files (5.87 GB).
Publicly Available
corpus
Author(s): Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya; Ljubešić, Nikola; Simov, Kiril; Grigorova, Vladislava; Rudolf, Michał; Pančur, Andrej; Kopp, Matyáš; Barkarson, Starkaður; Steingrímsson, Steinþór; van der Pol, Henk; Depoorter, Griet; de Does, Jesse; Jongejan, Bart; Haltrup Hansen, Dorte; Navarretta, Costanza; Calzada Pérez, María; de Macedo, Luciana D.; van Heusden, Ruben; Marx, Maarten; Çöltekin, Çağrı; Coole, Matthew; Agnoloni, Tommaso; Frontini, Francesca; Montemagni, Simonetta; Quochi, Valeria; Venturi, Giulia; Ruisi, Manuela; Marchetti, Carlo; Battistoni, Roberto; Sebők, Miklós; Ring, Orsolya; Darģis, Roberts; Utka, Andrius; Petkevičius, Mindaugas; Briedienė, Monika; Krilavičius, Tomas; Morkevičius, Vaidas; Diwersy, Sascha; Luxardo, Giancarlo; Rayson, Paul
Description:
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the ...
This item contains 18 files (2.17 GB).
Publicly Available