  • CLARIN.SI repository
Selected Filters
 Subject: TEI

Author  
    • Erjavec, Tomaž (64)
    • Ljubešić, Nikola (28)
    • Fišer, Darja (24)
    • Pančur, Andrej (12)
    • Simov, Kiril (11)
    • Kopp, Matyáš (10)
    • Agnoloni, Tommaso (9)
    • Arhar Holdt, Špela (9)
    • Barkarson, Starkaður (9)
    • Battistoni, Roberto (9)
    • Coole, Matthew (9)
    • Darģis, Roberts (9)
    • Depoorter, Griet (9)
    • Diwersy, Sascha (9)
    • Frontini, Francesca (9)
    • Grigorova, Vladislava (9)
    • Haltrup Hansen, Dorte (9)
    • Jongejan, Bart (9)
    • Krek, Simon (9)
    • Luxardo, Giancarlo (9)
Subject  
    • manual annotation (23)
    • named entities (17)
    • computer-mediated communication (16)
    • part-of-speech tagging (16)
    • tokenisation (16)
    • parliamentary debates (15)
    • Parla-CLARIN (14)
    • Slovenian Parliament (12)
    • word normalisation (12)
    • lemmatisation (10)
    • Belgian Parliament (9)
    • Bulgarian Parliament (9)
    • COVID-19 (9)
    • Croatian Parliament (9)
    • Czech Parliament (9)
    • Danish Parliament (9)
    • Dutch Parliament (9)
    • French Parliament (9)
    • Hungarian Parliament (9)
Rights  
    • PUB (66)
    • ACA (6)
Language (ISO)  
    • Slovenian (60)
    • English (13)
    • Serbian (11)
    • Croatian (10)
    • Bulgarian (8)
    • Czech (8)
    • Hungarian (8)
    • Polish (7)
    • Danish (6)
    • Dutch (6)
    • Estonian (6)
    • French (6)
    • German (6)
    • Icelandic (6)
    • Italian (6)
    • Latvian (6)
    • Russian (6)
    • Spanish (6)
    • Turkish (6)
    • Ukrainian (5)
Type  
    • text (71)
    • corpus (67)
    • lexicalConceptualResource (5)
    • audio (1)

Showing 1 through 72 out of 72 results


  • corpus
    CLARIN.SI data & tools
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Author(s):
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    Spoken corpus Gos VideoLectures 4.2 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-09-23)
    
    Author(s):
    Verdonik, Darinka ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Erjavec, Tomaž ; Majhenič, Simona ; Žgank, Andrej
     This item contains 3 files (21.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Corpus of term-annotated texts RSDO5 1.1
    (ZRC SAZU / 2021-12-07)
    
    Author(s):
    Jemec Tomazin, Mateja ; Trojar, Mitja ; Atelšek, Simon ; Fajfar, Tanja ; Erjavec, Tomaž ; Žagar Karer, Mojca
     This item contains 4 files (15.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    CMC training corpus Janes-Tag 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Author(s):
    Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 2 files (8.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    CMC training corpus Janes-Norm 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Author(s):
    Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (12.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Slovenian parliamentary corpus (1990-2022) siParl 4.0
    (Institute of Contemporary History / 2024-06-05)
    
    Author(s):
    Pančur, Andrej ; Meden, Katja ; Erjavec, Tomaž ; Ojsteršek, Mihael ; Šorn, Mojca ; Blaj Hribar, Neja
     This item contains 5 files (14.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Training corpus SUK 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-08-22)
    
    Author(s):
    Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (45.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Corpus of academic Slovene KAS 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 4 files (13.71 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    Corpus of 1968 Slovenian literature Maj68 3.0
    (ZRC SAZU / 2024-10-22)
    
    Author(s):
    Juvan, Marko ; Žejn, Andrejka ; Šorli, Mojca ; Mandić, Lucija ; Tomažin, Andrej ; Jež, Andraž ; Balžalorsky Antić, Varja ; Erjavec, Tomaž
     This item contains 6 files (1.33 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.54 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Serbian linguistic training corpus SETimes.SR 2.0
    (Regional Linguistic Data Initiative Centre ReLDI; Jožef Stefan Institute / 2023-06-13)
    
    Author(s):
    Batanović, Vuk ; Ljubešić, Nikola ; Samardžić, Tanja and Erjavec, Tomaž
     This item contains 4 files (9.4 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (65.97 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.87 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Rayson, Paul ; Vidler, John ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (53.36 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Spoken corpus Gos 2.1 (transcriptions)
    (Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU; Jožef Stefan Institute / 2023-08-28)
    
    Author(s):
    Verdonik, Darinka ; Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko ; Erjavec, Tomaž ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Majhenič, Simona ; Žgank, Andrej ; Bizjak, Andreja ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Trojar, Mitja ; Bernjak, Mitja ; Dretnik, Naum ; Strle, Gregor ; Dobrovoljc, Kaja ; Ljubešić, Nikola ; Rupnik, Peter
     This item contains 4 files (117.83 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Collection of Slovenian paremiological units Pregovori 1.1
    (ZRC SAZU; Jožef Stefan Institute / 2023-09-30)
    
    Author(s):
    Babič, Saša ; Miha, Peče ; Erjavec, Tomaž ; Ivančič Kutin, Barbara ; Šrimpf Vendramin, Katarina ; Kropej Telban, Monika ; Jakop, Nataša ; Stanonik, Marija
     This item contains 3 files (22.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.81 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Slovenian parliamentary corpus (1990-1992) SlovParl 2.0
    (Institute of Contemporary History / 2017-11-24)
    
    Author(s):
    Pančur, Andrej ; Šorn, Mojca and Erjavec, Tomaž
     This item contains 3 files (169.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (2.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    CMC training corpus Janes-Norm 1.2
    (Jožef Stefan Institute / 2016-12-30)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka and Arhar Holdt, Špela
     This item contains 4 files (4.01 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 7 files (5.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (23.37 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Training corpus ssj500k 2.3
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-07-07)
    
    Author(s):
    Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (42.85 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (61.05 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.67 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.0
    (CLARIN ERIC / 2023-11-14)
    
    Author(s):
    Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Rayson, Paul ; Vidler, John ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (67 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Slovenian parliamentary corpus (1990-2022) siParl 3.0
    (Institute of Contemporary History / 2022-12-06)
    
    Author(s):
    Pančur, Andrej ; Erjavec, Tomaž ; Meden, Katja ; Ojsteršek, Mihael ; Šorn, Mojca ; Blaj Hribar, Neja
     This item contains 2 files (5.63 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Training corpus SUK 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-12-05)
    
    Author(s):
    Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (43.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.1
    (Jožef Stefan Institute / 2019-07-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.51 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Spoken corpus Gos 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-09-23)
    
    Author(s):
    Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko and Erjavec, Tomaž
     This item contains 2 files (22.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    Training corpus ssj500k 2.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-01-26)
    
    Author(s):
    Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (40.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    MULTEXT-East "1984" document corpus 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Author(s):
    Erjavec, Tomaž ; Bruda, Ştefan ; Dimitrova, Ludmila ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Krstev, Cvetana ; Orav, Heili ; Oravecz, Csaba ; Paldre, Leho ; Petkevič, Vladimír ; Priest-Dorman, Greg ; Simov, Kiril ; Sinapova, Lydia ; Sokolovsky, Paul ; Sryvkin, Sergey ; Tufiş, Dan ; Utka, Andrius ; Villandi, Viire ; Vitas, Duško ; Vuković, Olga
     This item contains 1 file (4.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    MULTEXT-East "1984" annotated corpus 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Author(s):
    Erjavec, Tomaž ; Barbu, Ana-Maria ; Derzhanski, Ivan ; Dimitrova, Ludmila ; Garabík, Radovan ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Kotsyba, Natalia ; Krstev, Cvetana ; Oravecz, Csaba ; Petkevič, Vladimír ; Priest-Dorman, Greg ; QasemiZadeh, Behrang ; Radziszewski, Adam ; Simov, Kiril ; Tufiş, Dan ; Zdravkova, Katerina
     This item contains 1 file (14.12 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of questions and answers of the Terminologišče terminological counselling service
    (ZRC SAZU / 2022-11-29)
    
    Author(s):
    Atelšek, Simon ; Fajfar, Tanja ; Jemec Tomazin, Mateja ; Trojar, Mitja ; Sitar, Jera ; Žagar Karer, Mojca
     This item contains 1 file (528.98 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Japanese-Slovene learner's dictionary jaSlo 3.1
    (Faculty of Arts, University of Ljubljana / 2016-01-30)
    
    Author(s):
    Hmeljak, Kristina ; Erjavec, Tomaž and Srdanović, Irena
     This item contains 1 file (603.86 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Speech Database of Spoken Flight Information Enquiries SOFES 1.0
    (Faculty of Electrical Engineering, University of Ljubljana / 2017-06-17)
    
    Author(s):
    Dobrišek, Simon ; Žganec Gros, Jerneja ; Žibert, Janez ; Mihelič, France and Pavešić, Nikola
     This item contains 3 files (1.4 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The "Arcticae horulae" dictionary of German borrowings in Slovenian
    (CLARIN.SI / 2020-11-15)
    
    Author(s):
    Pirman, Alenka
     This item contains 3 files (20.21 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel corpus of idiomatic text ParaDiom 1.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2022-11-15)
    
    Author(s):
    Donaj, Gregor and Antloga, Špela
     This item contains 1 file (1.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Epigraphic corpus of Medieval and Early Modern inscriptions in Slovenia MEMIS 1.0
    (ZRC SAZU / 2020-12-06)
    
    Author(s):
    Pobežin, Gregor
     This item contains 1 file (156.41 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Lexicon of historical Slovene imp25k 1.1
    (Jožef Stefan Institute / 2014-09-13)
    
    Author(s):
    Erjavec, Tomaž
     This item contains 2 files (25.42 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Collection of Slovenian paremiological units Pregovori 1.0
    (ZRC SAZU; Jožef Stefan Institute / 2022-01-11)
    
    Author(s):
    Babič, Saša ; Peče, Miha ; Erjavec, Tomaž ; Ivančič Kutin, Barbara ; Šrimpf Vendramin, Katarina ; Kropej Telban, Monika ; Jakop, Nataša ; Stanonik, Marija
     This item contains 3 files (21.65 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Slovene-Japanese Learner's Dictionary sloJa 1.0
    (Faculty of Arts, University of Ljubljana / 2023-11-16)
    
    Author(s):
    Hmeljak, Kristina ; Barovič Božjak, Laura ; Bostič, Nadja ; Gerl, Katarina Hitomi ; Hrastnik, Jan ; Kališnik, Nina ; Kleč, Sara ; Kovač, Eva ; Sangawa Hmeljak, Nina ; Tomše, Jure ; Erjavec, Tomaž
     This item contains 2 files (1.27 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Japanese web corpus with difficulty levels jpWaC-L 1.0
    (Jožef Stefan Institute / 2008-11-14)
    
    Author(s):
    Erjavec, Tomaž ; Hmeljak Sangawa, Kristina and Kawamura, Yoshiko
     This item contains 6 files (1.6 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Syn 1.0
    (Jožef Stefan Institute / 2017-01-03)
    
    Author(s):
    Arhar Holdt, Špela ; Erjavec, Tomaž and Fišer, Darja
     This item contains 4 files (1.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC shortening corpus Janes-Kratko 1.0
    (Jožef Stefan Institute / 2017-01-20)
    
    Author(s):
    Goli, Teja ; Osrajnik, Eneja ; Fišer, Darja and Erjavec, Tomaž
     This item contains 4 files (1.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Tweet comma corpus Janes-Vejica 1.0
    (Jožef Stefan Institute / 2017-02-16)
    
    Author(s):
    Popič, Damjan ; Zupan, Katja ; Logar, Polona ; Kavčič, Teja ; Erjavec, Tomaž ; Fišer, Darja
     This item contains 4 files (1.82 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Forum corpus Janes-Forum 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola and Fišer, Darja
     This item contains 2 files (573.23 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    News comment corpus Janes-News 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola and Fišer, Darja
     This item contains 2 files (186.48 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Blog post and comment corpus Janes-Blog 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola and Fišer, Darja
     This item contains 2 files (411.31 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Wikipedia talk corpus Janes-Wiki 1.0
    (Jožef Stefan Institute / 2017-08-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (55.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus hr500k 1.0
    (Jožef Stefan Institute / 2018-04-13)
    
    Author(s):
    Ljubešić, Nikola ; Agić, Željko ; Klubička, Filip ; Batanović, Vuk and Erjavec, Tomaž
     This item contains 3 files (91.53 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus SETimes.SR 1.0
    (Regional Linguistic Data Initiative Centre ReLDI / 2018-08-20)
    
    Author(s):
    Batanović, Vuk ; Ljubešić, Nikola ; Samardžić, Tanja and Erjavec, Tomaž
     This item contains 3 files (10.91 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Tweet code-switching corpus Janes-Preklop 1.0
    (Jožef Stefan Institute / 2017-10-13)
    
    Author(s):
    Reher, Špela ; Erjavec, Tomaž and Fišer, Darja
     This item contains 4 files (1.28 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    ŠUSS archive of questions and answers about the Slovenian language (1998-2010)
    (CLARIN.SI / 2019-09-15)
    
    Author(s):
    Marušič, Franc Lanko ; Marvin, Tatjana ; Potrato, Tina ; Saksida, Amanda ; Tomažin, Petra ; Verovnik, Tina ; Žaucer, Rok ; Železnikar, Jaka ; Benčina, Barbara ; Vekjet, Ivana ; Jejčič, Irena ; Mišmaš, Petra ; Marc, Neva ; Leban, Ivana ; Kobal, Elena ; Halilović, Amra ; Krošelj, Sara ; Gaši, Elbasana ; Papler, Urša ; Koglot, Marina ; Žnidarčič, Mateja ; Bajc, Sara ; Brus, Karmen ; Adamlje, Sara ; Šušanj, Špela ; Vodopivec, Ana ; Erjavec, Tomaž
     This item contains 2 files (5.46 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Slovenian school texts SBSJ 1.0
    (ZRC SAZU / 2021-04-01)
    
    Author(s):
    Ahačič, Kozma ; Atelšek, Simon ; Erjavec, Tomaž ; Holozan, Peter ; Jakop, Nataša ; Jemec Tomazin, Mateja ; Ježovnik, Janoš ; Ledinek, Nina ; Perdih, Andrej ; Romih, Miro ; Trojar, Mitja
     This item contains 1 file (1.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 3.0
    (CLARIN ERIC / 2023-08-10)
    
    Author(s):
    Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Fišer, Darja ; Pirker, Hannes ; Wissik, Tanja ; Schopper, Daniel ; Kirnbauer, Martin ; Mochtak, Michal ; Rupnik, Peter ; Pol, Henk van der ; Depoorter, Griet ; de Does, Jesse ; Simov, Kiril ; Grigorova, Vladislava ; Grigorov, Ilko ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Mölder, Martin ; Kahusk, Neeme ; Vider, Kadri ; Bel, Nuria ; Antiba-Cartazo, Iván ; Pisani, Marilina ; Zevallos, Rodolfo ; Regueira, Xosé Luís ; Vladu, Adina Ioana ; Magariños, Carmen ; Bardanca, Daniel ; Barcala, Mario ; Garcia, Marcos ; Pérez Lago, María ; García Louzao, Pedro ; Vivel Couso, Ainhoa ; Vázquez Abuín, Marta ; García Díaz, Noelia ; Vidal Miguéns, Adrián ; Fernández Rei, Elisa ; Diwersy, Sascha ; Luxardo, Giancarlo ; Coole, Matthew ; Rayson, Paul ; Nwadukwe, Amanda ; Gkoumas, Dimitris ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Gavriilidou, Maria ; Piperidis, Stelios ; Ligeti-Nagy, Noémi ; Jelencsik-Mátyus, Kinga ; Varga, Zsófia ; Dodé, Réka ; Barkarson, Starkaður ; Agnoloni, Tommaso ; Bartolini, Roberto ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Darģis, Roberts ; van Heusden, Ruben ; Marx, Maarten ; Depuydt, Katrien ; Tungland, Lars Magne ; Rudolf, Michał ; Nitoń, Bartłomiej ; Aires, José ; Mendes, Amália ; Cardoso, Aida ; Pereira, Rui ; Yrjänäinen, Väinö ; Norén, Fredrik Mohammadi ; Magnusson, Måns ; Jarlbrink, Johan ; Meden, Katja ; Pančur, Andrej ; Ojsteršek, Mihael ; Çöltekin, Çağrı ; Kryvenko, Anna
     This item contains 26 files (38.68 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Parliamentary corpus of first Yugoslavia (1919-1939) yu1Parl 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2023-07-15)
    
    Author(s):
    Kavčič, Alenka ; Mundjar, Aleksander and Marolt, Matija
     This item contains 3 files (2.65 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Ukrainian parliamentary corpus ParlaMint-UA 4.0.1
    (CLARIN.SI / 2023-11-29)
    
    Author(s):
    Kopp, Matyáš ; Kryvenko, Anna and Rii, Andriana
     This item contains 4 files (3.84 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of academic Slovene KAS 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 6 files (42.11 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Academic Slovene (PhD theses) KAS-dr 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 3 files (2.52 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Academic Slovene (BSc/BA theses) KAS-dipl 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 5 files (27.63 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    English-Slovene term candidates KAS-biterm 1.0
    (Jožef Stefan Institute / 2020-05-05)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola and Fišer, Darja
     This item contains 1 file (50.74 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Carniolan Provincial Assembly corpus Kranjska 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2023-05-08)
    
    Author(s):
    Kavčič, Alenka ; Mundjar, Aleksander and Marolt, Matija
     This item contains 3 files (28.53 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Written corpus ccKres 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2013-09-30)
    
    Author(s):
    Logar, Nataša ; Erjavec, Tomaž ; Krek, Simon ; Grčar, Miha and Holozan, Peter
     This item contains 3 files (201.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Digital library and corpus of historical Slovene IMP 1.1
    (Jožef Stefan Institute / 2014-07-28)
    
    Author(s):
    Erjavec, Tomaž
     This item contains 4 files (338.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of texts by Hijacint Repič in "Cvetje z vertov sv. Frančiška" CVET 1.0
    (Science and Research Centre Koper / 2024-05-07)
    
    Author(s):
    Košir, Diana and Erjavec, Tomaž
     This item contains 4 files (15.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Reference corpus of historical Slovene goo300k 1.2
    (Jožef Stefan Institute / 2015-05-05)
    
    Author(s):
    Erjavec, Tomaž
     This item contains 2 files (8.9 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    The corpus of older Slovenian narrative prose PriLit 1.0
    (ZRC SAZU / 2021-03-18)
    
    Author(s):
    Žejn, Andrejka and Erjavec, Tomaž
     This item contains 4 files (55.07 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Written corpus ccGigafida 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2013-09-30)
    
    Author(s):
    Logar, Nataša ; Erjavec, Tomaž ; Krek, Simon ; Grčar, Miha and Holozan, Peter
     This item contains 3 files (1.89 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of longer narrative Slovenian prose KDSP 1.0
    (ZRC SAZU / 2023-07-07)
    
    Author(s):
    Mandić, Lucija and Erjavec, Tomaž
     This item contains 4 files (268.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Collection of Slovenian riddles Uganke 1.0
    (ZRC SAZU / 2025-04-25)
    
    Author(s):
    Babič, Saša ; Erjavec, Tomaž ; Farič, Ana and Peče, Miha
     This item contains 4 files (3.74 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia under the Programme of "Research Infrastructures".