  • CLARIN.SI repository
Selected Filters
Language: Slovenian
Limit your search

Author  
    • Erjavec, Tomaž (91)
    • Ljubešić, Nikola (78)
    • Dobrovoljc, Kaja (67)
    • Krek, Simon (65)
    • Arhar Holdt, Špela (64)
    • Čibej, Jaka (53)
    • Kosem, Iztok (45)
    • Fišer, Darja (39)
    • Robnik-Šikonja, Marko (35)
    • Gantar, Polona (27)
    • Laskowski, Cyprian (23)
    • Terčon, Luka (22)
    • Pori, Eva (21)
    • Krsnik, Luka (20)
    • Klemenc, Bojan (17)
    • Kuzman, Taja (17)
    • Rupnik, Peter (16)
    • Pollak, Senja (15)
    • Ledinek, Nina (14)
    • Ferme, Marko (13)
Subject  
    • TEI (62)
    • manual annotation (30)
    • lemmatisation (23)
    • language model (21)
    • part-of-speech tagging (21)
    • terminology (21)
    • computer-mediated communication (19)
    • dictionary (19)
    • spoken corpus (19)
    • multilingual (18)
    • lexicography (17)
    • tokenisation (16)
    • historical language (14)
    • parliamentary debates (14)
    • n-grams (13)
    • parsing (13)
    • collocations (12)
    • morphology (12)
    • named entities (12)
    • parallel corpus (12)
Rights  
    • PUB (313)
    • ACA (19)
    • RES (1)
Language (ISO)  
    • English (60)
    • Croatian (33)
    • Hungarian (23)
    • Bulgarian (21)
    • German (21)
    • Serbian (21)
    • Spanish (20)
    • Dutch (17)
    • Estonian (17)
    • Portuguese (17)
    • French (16)
    • Russian (16)
    • Czech (15)
    • Danish (15)
    • Italian (15)
    • Polish (15)
    • Bosnian (14)
    • Latvian (14)
    • Swedish (14)
Type  
    • text (297)
    • corpus (167)
    • lexicalConceptualResource (139)
    • toolService (47)
    • audio (10)
    • image (1)
    • languageDescription (1)
    • video (1)
Contain Files  
    • yes (333)
    • no (21)

Showing 1 through 40 out of 354 results

  • corpus
    CLARIN.SI data & tools
    MULTEXT-East "1984" document corpus 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Author(s):
    Erjavec, Tomaž ; Bruda, Ştefan ; Dimitrova, Ludmila ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Krstev, Cvetana ; Orav, Heili ; Oravecz, Csaba ; Paldre, Leho ; Petkevič, Vladimír ; Priest-Dorman, Greg ; Simov, Kiril ; Sinapova, Lydia ; Sokolovsky, Paul ; Sryvkin, Sergey ; Tufiş, Dan ; Utka, Andrius ; Villandi, Viire ; Vitas, Duško ; Vuković, Olga
     This item contains 1 file (4.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Multilingual Culture-Independent Word Analogy Datasets
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    
    Author(s):
    Ulčar, Matej ; Vaik, Kristiina ; Lindström, Jessica ; Linde, Dace ; Dailidėnaitė, Milda ; Šumakov, Andrei
     This item contains 3 files (6.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Post-edited and error annotated machine translation corpus PErr 1.0
    (Insight Centre for Data Analytics, National University of Ireland, Galway / 2016-05-24)
    
    Author(s):
    Popović, Maja and Arčan, Mihael
     This item contains 1 file (364.69 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    CroSloEngual BERT 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-09)
    
    Author(s):
    Ulčar, Matej and Robnik-Šikonja, Marko
     This item contains 3 files (476.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    SimLex-999 Slovenian translation SimLex-999-sl 1.0
    (University of Ljubljana / 2020-05-15)
    
    Author(s):
    Pollak, Senja ; Vulić, Ivan ; Pelicon, Andraž ; Repar, Andraž ; Armendariz, Carlos ; Matthew, Purver ; Ljubešić, Nikola
     This item contains 3 files (37.3 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Reference List of Slovene Frequent Common Words
    (Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana / 2020-09-10)
    
    Author(s):
    Pollak, Senja ; Arhar Holdt, Špela ; Krek, Simon and Robnik-Šikonja, Marko
     This item contains 1 file (69.51 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Dataset of Slovene idiomatic expressions SloIE
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-27)
    
    Author(s):
    Škvorc, Tadej ; Gantar, Polona and Robnik-Šikonja, Marko
     This item contains 1 file (4.22 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    List of single-word male and female occupations in Slovenian
    (Jožef Stefan Institute; Faculty of Computer and Information Science, University of Ljubljana / 2020-09-24)
    
    Author(s):
    Supej, Anka ; Ulčar, Matej ; Robnik-Šikonja, Marko and Pollak, Senja
     This item contains 1 file (5.94 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Slovenian keyword extraction dataset from SentiNews 1.0
    (Jožef Stefan Institute / 2022-03-28)
    
    Author(s):
    Koloski, Boshko ; Martinc, Matej ; Tavchioski, Ilija ; Škrlj, Blaž and Pollak, Senja
     This item contains 2 files (6.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    Slovenian RoBERTa contextual embeddings model: SloBERTa 2.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2021-01-17)
    
    Author(s):
    Ulčar, Matej and Robnik-Šikonja, Marko
     This item contains 2 files (1.29 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    MULTEXT-East "1984" annotated corpus 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Author(s):
    Erjavec, Tomaž ; Barbu, Ana-Maria ; Derzhanski, Ivan ; Dimitrova, Ludmila ; Garabík, Radovan ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Kotsyba, Natalia ; Krstev, Cvetana ; Oravecz, Csaba ; Petkevič, Vladimír ; Priest-Dorman, Greg ; QasemiZadeh, Behrang ; Radziszewski, Adam ; Simov, Kiril ; Tufiş, Dan ; Zdravkova, Katerina
     This item contains 1 file (14.12 MB).
     
    Academic Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Morphological lexicon Sloleks 3.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-12-05)
    
    Author(s):
    Čibej, Jaka ; Gantar, Kaja ; Dobrovoljc, Kaja ; Krek, Simon ; Holozan, Peter ; Erjavec, Tomaž ; Romih, Miro ; Arhar Holdt, Špela ; Krsnik, Luka ; Robnik-Šikonja, Marko
     This item contains 1 file (239.75 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    Slovene Text Denormalizator RSDO-DS2-DENORM 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2022-12-01)
    
    Author(s):
    Jelovšek, Tjaša ; Lebar Bajec, Iztok ; Bajec, Marko ; Bajec, Žan and Cvek, Jernej
     This item contains 1 file (8.92 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    Corpus of questions and answers of the Terminologišče terminological counselling service
    (ZRC SAZU / 2022-11-29)
    
    Author(s):
    Atelšek, Simon ; Fajfar, Tanja ; Jemec Tomazin, Mateja ; Trojar, Mitja ; Sitar, Jera ; Žagar Karer, Mojca
     This item contains 1 file (528.98 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Dictionary of the Slovenian Language in the Works of Janez Svetokriški
    (Slovenian Academy of Sciences and Arts; dr. Bruno Breschi Foundation; ZRC SAZU / 2006)
    
    Author(s):
    Snoj, Marko
     This item contains 1 file (1.7 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    Manually sentiment annotated Slovenian news corpus SentiNews 1.0
    (Faculty of Information Studies Novo mesto / 2017-04-29)
    
    Author(s):
    Bučar, Jože
     This item contains 5 files (72.04 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Terminological dictionary of tax terminology
    (ZRC SAZU; Faculty of Public Administration, University of Ljubljana / 2022-06-29)
    
    Author(s):
    Hudej, Nika ; Jemec Tomazin, Mateja ; Klun, Maja ; Kostelec, Andreja ; Kovač, Polonca ; Podlipnik, Jernej
     This item contains 1 file (94.49 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Terminology identification dataset KAS-term 1.0
    (Jožef Stefan Institute / 2018-08-18)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Arhar Holdt, Špela ; Bren, Urban ; Robnik-Šikonja, Marko ; Udovič, Boštjan
     This item contains 4 files (17.26 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Slovene coreference resolution corpus coref149
    (Faculty of Computer and Information Science, University of Ljubljana / 2018-03-19)
    
    Author(s):
    Žitnik, Slavko
     This item contains 1 file (452.84 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Frequency lists of word-level n-grams from the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (21.29 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Slovene sentiment lexicon JOB 1.0
    (Faculty of Information Studies Novo mesto / 2017-05-09)
    
    Author(s):
    Bučar, Jože
     This item contains 2 files (696.06 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Twitter sentiment for 15 European languages
    (Jožef Stefan Institute / 2016-02-23)
    
    Author(s):
    Mozetič, Igor ; Grčar, Miha and Smailović, Jasmina
     This item contains 16 files (49.38 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    School dictionary of Slovenian language (human audio recordings)
    (ZRC SAZU / 2021-11-16)
    
    Author(s):
    Snoj, Marko ; Mirtič, Tanja and Vendramin, Peter
     This item contains 1 file (54.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    PyTorch model for Slovenian Named Entity Recognition SloNER 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2023-01-27)
    
    Author(s):
    Prelevikj, Marko ; Žitnik, Slavko and Knez, Timotej
     This item contains 1 file (387.44 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    Parallel corpus EN-SL RSDO4 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-10-28)
    
    Author(s):
    Repar, Andraž and Lebar Bajec, Iztok
     This item contains 1 file (189.06 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Core vocabulary for Slovenian as L2 1.0
    (Centre for Slovene as a Second and Foreign Language, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana / 2022-11-11)
    
    Author(s):
    Klemen, Matej ; Arhar Holdt, Špela and Pollak, Senja
     This item contains 1 file (141.31 KB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    Slovenian Twitter dataset 2018-2020 1.0
    (Jožef Stefan Institute / 2021-07-20)
    
    Author(s):
    Evkoski, Bojan ; Pelicon, Andraž ; Mozetič, Igor ; Ljubešić, Nikola and Kralj Novak, Petra
     This item contains 1 file (182.04 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Glossary of equestrianism
    (Amebis, d. o. o., Kamnik / 2019-09-01)
    
    Author(s):
    Marič, Sintia
     This item contains 1 file (5 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    ZRCola 2
    (ZRC SAZU / 2016-10-19)
    
    Author(s):
    Ježovnik, Janoš ; Weiss, Peter and Amebis, d.o.o.
     This item contains 1 file (158.44 KB).
     
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Emoji Sentiment Ranking 1.0
    (Jožef Stefan Institute / 2015-09-14)
    
    Author(s):
    Kralj Novak, Petra ; Smailović, Jasmina ; Sluban, Borut and Mozetič, Igor
     This item contains 3 files (93.95 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    The Dictionary of the Clothing Terminology of the Zilja Dialect in Canale Valley (Kanalska dolina – Val Canale – Kanaltal – Valcjanâl): photographs
    (Slovensko kulturno središče Planika Kanalska dolina / Centro culturale Sloveno Stella Alpina Val Canale / 2019-03-25)
    
    Author(s):
    Gliha Komac, Nataša ; Kandutsch, Elisa ; Bartaloth, Rudi and Smole, Matevž
     This item contains 1 file (115.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Dictionary of New Slovenian Words - SNB (ELEXIS)
    (ZRC SAZU / 2020-06-23)
    
    Author(s):
    Bizjak Končar, Aleksandra ; Snoj, Marko ; Gložančev, Alenka ; Kern, Boris ; Kostanjevec, Polona ; Krvina, Domen ; Ledinek, Nina ; Michelizza, Mija ; Perdih, Andrej ; Petric, Špela ; Šircelj-Žnidaršič, Ivanka ; Žele, Andreja ; Mirtič, Tanja ; Gliha Komac, Nataša ; Klemenčič, Simona
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Dictionary of Lesser Used Slovenian Words (ELEXIS)
    (ZRC SAZU / 2020-06-23)
    
    Author(s):
    Šircelj-Žnidaršič, Ivanka ; Hajnšek-Holz, Milena ; Kostanjevec, Polona ; Žele, Andreja ; Humar, Marjeta ; Nartnik, Vlado ; Keber, Janez ; Košmrlj-Levačič, Borislava ; Jakopin, Primož
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Growing Dictionary of the Slovenian Language
    (ZRC SAZU / 2014)
    
    Author(s):
    Krvina, Domen
     This item contains 1 file (72.23 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    Slovene text simplification dataset SloTS
    (Faculty of Computer and Information Science, University of Ljubljana / 2022-11-23)
    
    Author(s):
    Gorenc, Sabina and Robnik-Šikonja, Marko
     This item contains 1 file (181.89 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Frequency lists of word parts from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (33.41 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Frequency lists of words from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (4.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Dictionary of Twitterese Janes-Dict 1.0
    (Faculty of Arts, University of Ljubljana / 2018-01-17)
    
    Author(s):
    Gantar, Polona ; Škrjanec, Iza ; Fišer, Darja and Erjavec, Tomaž
     This item contains 1 file (106.54 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Lemma list of the Beseda Corpus Lemmatisation Lexicon (ELEXIS)
    (ZRC SAZU / 2020-06-23)
    
    Author(s):
    Jakopin, Primož
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Valency lexicon extracted from the Gigafida 2.1 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-03-16)
    
    Author(s):
    Krek, Simon ; Gantar, Polona ; Krsnik, Luka ; Laskowski, Cyprian ; Dobrovoljc, Kaja ; Arhar Holdt, Špela ; Čibej, Jaka ; Kosem, Iztok ; Klemenc, Bojan ; Robnik-Šikonja, Marko ; Gorjanc, Vojko
     This item contains 2 files (357.92 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".