CLARIN.SI repository

Selected Filters
Type: corpus; Language: Slovenian

Limit your search

Author  
    • Erjavec, Tomaž (37)
    • Fišer, Darja (18)
    • Ljubešić, Nikola (17)
    • Arhar Holdt, Špela (9)
    • Krek, Simon (9)
    • Stabej, Marko (5)
    • Borovič, Mladen (4)
    • Boškovič, Borko (4)
    • Ferme, Marko (4)
    • Goli, Teja (4)
    • Hrovat, Goran (4)
    • Klemenc, Bojan (4)
    • Kosem, Iztok (4)
    • Laskowski, Cyprian (4)
    • Ojsteršek, Milan (4)
    • Rozman, Tadeja (4)
    • Verdonik, Darinka (4)
    • Zupan, Katja (4)
    • Dobrovoljc, Kaja (3)
    • Grčar, Miha (3)
Subject  
    • TEI (30)
    • manual annotation (16)
    • computer-mediated communication (14)
    • named entities (8)
    • word normalisation (8)
    • spoken corpus (7)
    • multilingual (6)
    • terminology (6)
    • parallel corpus (5)
    • part-of-speech tagging (5)
    • Twitter (5)
    • academic writing (4)
    • developmental corpus (4)
    • error annotation (4)
    • historical language (4)
    • lemmatisation (4)
    • parliamentary debates (4)
    • sentiment classification (4)
    • Slovenian Parliament (4)
    • speech database (4)
Rights  
    • PUB (56)
    • ACA (6)
Language (ISO)  
    • English (7)
    • Bulgarian (5)
    • Hungarian (4)
    • Polish (4)
    • Serbian (4)
    • Croatian (3)
    • Czech (3)
    • Estonian (3)
    • German (3)
    • Romanian (3)
    • Slovak (3)
    • Spanish (3)
    • Lithuanian (2)
    • Portuguese (2)
    • Russian (2)
    • Swedish (2)
    • Albanian (1)
    • Bosnian (1)
    • Danish (1)

Showing 1 through 10 out of 62 results

  • Corpus of comma placement Vejica 1.3
    (Amebis, d. o. o., Kamnik / 2018-04-15)
    Author(s): Holozan, Peter
    This item contains 2 files (3.8 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • Slovenian parliamentary corpus SlovParl 2.0
    (Institute of Contemporary History / 2017-11-24)
    Author(s): Pančur, Andrej; Šorn, Mojca; Erjavec, Tomaž
    This item contains 3 files (169.71 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • Spoken corpus Gos VideoLectures 4.0 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-03-26)
    Author(s): Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam; Erjavec, Tomaž; Majhenič, Simona; Žgank, Andrej
    This item contains 3 files (18.43 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • Spoken corpus Gos VideoLectures 4.0 (audio)
    (VideoLectures.NET / 2019-03-26)
    Author(s): VideoLectures.NET
    This item contains 6 files (8.85 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    Author(s): Erjavec, Tomaž; Krek, Simon; Dobrovoljc, Kaja
    This item contains 4 files (108.6 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial.

  • CMC training corpus Janes-Norm 1.2
    (Jožef Stefan Institute / 2016-12-30)
    Author(s): Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka; Arhar Holdt, Špela
    This item contains 4 files (4.01 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    Author(s): Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka; Arhar Holdt, Špela; Ljubešić, Nikola; Zupan, Katja; Dobrovoljc, Kaja
    This item contains 7 files (5.68 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • Developmental corpus Šolar 2.0
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    Author(s): Kosem, Iztok; Arhar Holdt, Špela; Stritar Kučuk, Mojca; Krek, Simon; Krapš Vodopivec, Irena; Stabej, Marko; Pori, Eva; Goli, Teja; Lavrič, Polona; Laskowski, Cyprian; Kocjančič, Polonca; Klemenc, Bojan; Rozman, Tadeja
    This item contains 2 files (21.69 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • Developmental corpus (without language corrections) Šolar 2.0 Clear
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    Author(s): Kosem, Iztok; Arhar Holdt, Špela; Stritar Kučuk, Mojca; Krek, Simon; Krapš Vodopivec, Irena; Stabej, Marko; Kocjančič, Polonca; Laskowski, Cyprian; Klemenc, Bojan; Pori, Eva; Rozman, Tadeja
    This item contains 2 files (29.22 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • Training corpus ssj500k 2.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-01-26)
    Author(s): Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž; Može, Sara; Ledinek, Nina; Holz, Nanika; Zupan, Katja; Gantar, Polona; Kuzman, Taja; Čibej, Jaka; Arhar Holdt, Špela; Kavčič, Teja; Škrjanec, Iza; Marko, Dafne; Jezeršek, Lucija; Zajc, Anja
    This item contains 4 files (40.95 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Domestic Research Society
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society
  • Research Centre of the Slovenian Academy of Sciences and Arts
  • Trojina, Institute for Applied Slovene Studies
  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska

This platform runs on software developed for the LINDAT/CLARIN repository for linguistics, available on GitHub.

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".