• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Type : corpus     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Ljubešić, Nikola (78)
    • Erjavec, Tomaž (69)
    • Fišer, Darja (29)
    • Kuzman, Taja (22)
    • Rupnik, Peter (22)
    • Toral, Antonio (22)
    • Esplà-Gomis, Miquel (21)
    • Arhar Holdt, Špela (18)
    • Batanović, Vuk (16)
    • Bañón, Marta (16)
    • Forcada, Mikel L. (16)
    • García-Romero, Cristian (16)
    • Krek, Simon (16)
    • Pla Sempere, Leopoldo (16)
    • Ramírez-Sánchez, Gema (16)
    • Suchomel, Vít (16)
    • van der Werff, Tobias (16)
    • van Noord, Rik (16)
    • Zaragoza, Jaume (16)
    • Ferme, Marko (12)
    • ... View More
Subject  
    • TEI (48)
    • manual annotation (34)
    • parallel corpus (29)
    • web corpus (27)
    • computer-mediated communication (25)
    • multilingual (23)
    • news corpus (17)
    • named entities (16)
    • part-of-speech tagging (15)
    • lemmatisation (13)
    • tokenisation (13)
    • news comments (12)
    • Twitter (12)
    • word normalisation (12)
    • spoken corpus (11)
    • academic writing (9)
    • sentiment classification (8)
    • specialised corpus (8)
    • speech database (8)
    • terminology (8)
    • ... View More
Rights  
    • PUB (160)
    • ACA (20)
Language (ISO)  
    • Slovenian (112)
    • English (46)
    • Croatian (35)
    • Serbian (26)
    • Bulgarian (10)
    • Bosnian (9)
    • Lithuanian (9)
    • Estonian (8)
    • Spanish (8)
    • Hungarian (7)
    • Latvian (7)
    • Macedonian (7)
    • Montenegrin (7)
    • Russian (7)
    • Dutch (6)
    • Italian (6)
    • Czech (5)
    • Danish (5)
    • French (5)
    • Polish (5)
    • ... View More
Type  
    • text (183)
    • audio (8)
    • video (1)
Contain Files  
    • yes (180)
    • no (12)

Showing 1 through 10 out of 192 results

  • 1
  • 2
  • 3
  •  
  • 20
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.0 (audio)
    (VideoLectures.NET / 2019-03-26)
    
    Author(s):
    VideoLectures.NET
     This item contains 6 files (8.85 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (23.37 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (2.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-09-23)
    
    Author(s):
    Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko and Erjavec, Tomaž
     This item contains 2 files (22.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.2 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-09-23)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Erjavec, Tomaž ; Majhenič, Simona ; Žgank, Andrej
     This item contains 3 files (21.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.1
    (Jožef Stefan Institute / 2019-07-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.51 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of 1968 Slovenian literature Maj68 2.0
    (ZRC SAZU / 2022-03-12)
    
    Author(s):
    Juvan, Marko ; et al.show everyone Juvan, Marko ; Žejn, Andrejka ; Šorli, Mojca ; Mandić, Lucija ; Tomažin, Andrej ; Jež, Andraž ; Balžalorsky Antić, Varja ; Erjavec, Tomaž
     This item contains 5 files (1.33 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Author(s):
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian parliamentary corpus SlovParl 2.0
    (Institute of Contemporary History / 2017-11-24)
    
    Author(s):
    Pančur, Andrej ; Šorn, Mojca and Erjavec, Tomaž
     This item contains 3 files (169.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of comma placement Vejica 1.3
    (Amebis, d. o. o., Kamnik / 2018-04-15)
    
    Author(s):
    Holozan, Peter
     This item contains 2 files (3.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • 1
  • 2
  • 3
  •  
  • 20
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society
  • Trojina, Institute for Applied Slovene Studies

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".