• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Language : Slovenian      Author : Dobrovoljc, Kaja     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Krek, Simon (25)
    • Čibej, Jaka (23)
    • Arhar Holdt, Špela (22)
    • Erjavec, Tomaž (9)
    • Gantar, Polona (8)
    • Kosem, Iztok (8)
    • Laskowski, Cyprian (7)
    • Ljubešić, Nikola (7)
    • Gorjanc, Vojko (6)
    • Klemenc, Bojan (6)
    • Krsnik, Luka (6)
    • Robnik-Šikonja, Marko (6)
    • Holozan, Peter (3)
    • Zupan, Katja (3)
    • Diaci, Ajda (2)
    • Holz, Nanika (2)
    • Jezeršek, Lucija (2)
    • Kavčič, Teja (2)
    • Kuzman, Taja (2)
    • ... View More
Subject  
    • n-grams (11)
    • multiword expressions (8)
    • frequency list (6)
    • manual annotation (6)
    • lemmatisation (5)
    • morphology (5)
    • spoken corpus (5)
    • collocations (4)
    • morphosyntactic tags (4)
    • part-of-speech tagging (4)
    • standard language (4)
    • TEI (4)
    • wordlist (4)
    • words (4)
    • characters (3)
    • CONLL-U (3)
    • lemmas (3)
    • named entities (3)
    • semantic role labelling (3)
    • syntactic structures (3)
    • ... View More
Type  
    • text (31)
    • lexicalConceptualResource (25)
    • corpus (6)
    • toolService (2)
Contain Files  
    • yes (31)
    • no (2)

Showing 1 through 33 out of 33 results

  • 1
  •    
    • Sort items by
    • Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    •  40
    • 60
    • 80
    • 100

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Consonant-vowel structures in the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-02-13)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 5 files (141.75 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word-level n-grams from the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (21.29 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus ssj500k 2.3
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-07-07)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (42.85 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Gos corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Author(s):
    Dobrovoljc, Kaja
     This item contains 3 files (21.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word parts from the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 30 files (1.97 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Thesaurus of Modern Slovene 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-03-25)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Laskowski, Cyprian ; Robnik-Šikonja, Marko ; Kosem, Iztok ; Arhar Holdt, Špela ; Gantar, Polona ; Čibej, Jaka ; Gorjanc, Vojko ; Klemenc, Bojan ; Dobrovoljc, Kaja
     This item contains 1 file (5.42 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon slMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola ; Krek, Simon ; Dobrovoljc, Kaja and Erjavec, Tomaž
     This item contains 1 file (73.96 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Valency lexicon extracted from the Gigafida 2.1 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-03-16)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Gantar, Polona ; Krsnik, Luka ; Laskowski, Cyprian ; Dobrovoljc, Kaja ; Arhar Holdt, Špela ; Čibej, Jaka ; Kosem, Iztok ; Klemenc, Bojan ; Robnik-Šikonja, Marko ; Gorjanc, Vojko
     This item contains 2 files (357.92 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    List of formulaic sequences in spoken Slovenian
    (Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana / 2020-01-06)
    
    Author(s):
    Dobrovoljc, Kaja ; Roblek, Rebeka ; Vianello, Chiara ; Diaci, Ajda and Vuga, Zala
     This item contains 1 file (362.47 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Slovene ontology of semantic types for nouns SLONEST-noun 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Pori, Eva ; Gantar, Polona ; Logar, Nataša ; Krek, Simon ; Laskowski, Cyprian ; Arhar Holdt, Špela ; Čibej, Jaka ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Klemenc, Bojan ; Ljubešić, Nikola
     This item contains 1 file (58.7 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Multiword Expressions lexicon extracted from the Gigafida 2.1 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-03-25)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Gantar, Apolonija ; Laskowski, Cyprian ; Krsnik, Luka ; Kosem, Iztok ; Brank, Janez ; Dobrovoljc, Kaja ; Arhar Holdt, Špela ; Čibej, Jaka ; Robnik-Šikonja, Marko ; Klemenc, Bojan ; Gorjanc, Vojko
     This item contains 1 file (1.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of collocations from the Gigafida 2.1 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-03-09)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Gantar, Polona ; Kosem, Iztok ; Dobrovoljc, Kaja ; Arhar Holdt, Špela ; Čibej, Jaka ; Laskowski, Cyprian ; Klemenc, Bojan ; Krsnik, Luka
     This item contains 1 file (139.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Written Standard Slovene Gigafida 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-06-13)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Erjavec, Tomaž ; Repar, Andraž ; Čibej, Jaka ; Arhar Holdt, Špela ; Gantar, Polona ; Kosem, Iztok ; Robnik-Šikonja, Marko ; Ljubešić, Nikola ; Dobrovoljc, Kaja ; Laskowski, Cyprian ; Grčar, Miha ; Holozan, Peter ; Šuster, Simon ; Gorjanc, Vojko ; Stabej, Marko ; Logar, Nataša
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Morphological lexicon Sloleks 1.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2015-06-14)
    
    Author(s):
    Dobrovoljc, Kaja ; Krek, Simon ; Holozan, Peter ; Erjavec, Tomaž and Romih, Miro
     This item contains 5 files (79.77 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    IMP corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Author(s):
    Dobrovoljc, Kaja
     This item contains 3 files (326.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Author(s):
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of character-level n-grams from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (2.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of character-level n-grams from the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 2 files (74.36 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of words from the Gigafida 2.0 corpus
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 13 files (270.42 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 7 files (5.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word-level n-grams from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 3 files (287.52 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of words from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (4.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Kres corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Author(s):
    Dobrovoljc, Kaja
     This item contains 3 files (2.34 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    List of formulaic sequences in standard written Slovenian
    (Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana / 2020-01-06)
    
    Author(s):
    Dobrovoljc, Kaja ; Roblek, Rebeka ; Vianello, Chiara ; Diaci, Ajda and Vuga, Zala
     This item contains 1 file (296.85 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Consonant-vowel structures in the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 7 files (3.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Morphological lexicon Sloleks 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-03-26)
    
    Author(s):
    Dobrovoljc, Kaja ; et al.show everyone Dobrovoljc, Kaja ; Krek, Simon ; Holozan, Peter ; Erjavec, Tomaž ; Romih, Miro ; Arhar Holdt, Špela ; Čibej, Jaka ; Krsnik, Luka ; Robnik-Šikonja, Marko
     This item contains 2 files (85.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Collocations Dictionary of Modern Slovene KSSS 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-09-20)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Gantar, Polona ; Krek, Simon ; Arhar Holdt, Špela ; Čibej, Jaka ; Laskowski, Cyprian ; Pori, Eva ; Klemenc, Bojan ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Ljubešić, Nikola
     This item contains 1 file (311.31 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word parts from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Author(s):
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 1 file (33.41 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Corpus extraction tool LIST 1.2
    (Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Jožef Stefan Institute / 2019-11-18)
    
    Author(s):
    Krsnik, Luka ; et al.show everyone Krsnik, Luka ; Arhar Holdt, Špela ; Čibej, Jaka ; Dobrovoljc, Kaja ; Ključevšek, Aleksander ; Krek, Simon ; Robnik-Šikonja, Marko
     This item contains 1 file (16.26 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus ssj500k 2.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-01-26)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (40.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Janes corpus n-grams 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-01)
    
    Author(s):
    Dobrovoljc, Kaja
     This item contains 3 files (3.75 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The classla-stanza model for semantic role labeling of standard Slovenian
    (Jožef Stefan Institute / 2022-06-10)
    
    Author(s):
    Krsnik, Luka ; Ljubešić, Nikola and Dobrovoljc, Kaja
     This item contains 1 file (53.87 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Monitor corpus of Slovene Trendi 2022-05
    (Jožef Stefan Institute / 2022-06-23)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Čibej, Jaka ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Ljubešić, Nikola ; Ponikvar, Primož ; Šinkec, Mihael ; Krek, Simon
     This item contains no files.

  • 1
  •    
    • Sort items by
    • Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    •  40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society
  • Trojina, Institute for Applied Slovene Studies

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".