• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Author : Ljubešić, Nikola     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Rupnik, Peter (73)
    • Kuzman, Taja (64)
    • Erjavec, Tomaž (57)
    • Toral, Antonio (50)
    • Esplà-Gomis, Miquel (49)
    • Bañón, Marta (44)
    • Forcada, Mikel L. (44)
    • García-Romero, Cristian (44)
    • Pla Sempere, Leopoldo (44)
    • Ramírez-Sánchez, Gema (44)
    • Suchomel, Vít (44)
    • van Noord, Rik (44)
    • Fišer, Darja (36)
    • Terčon, Luka (31)
    • Chichirau, Malina (28)
    • Galiano-Jiménez, Aarón (28)
    • Zaragoza-Bernabeu, Jaume (28)
    • Dobrovoljc, Kaja (19)
    • Osenova, Petya (16)
    • ... View More
Subject  
    • web corpus (63)
    • language model (38)
    • multilingual (31)
    • TEI (31)
    • parallel corpus (30)
    • part-of-speech tagging (26)
    • computer-mediated communication (25)
    • lemmatisation (23)
    • manual annotation (20)
    • parliamentary debates (18)
    • named entities (15)
    • Croatian Parliament (13)
    • Czech Parliament (13)
    • Slovenian Parliament (13)
    • Belgian Parliament (12)
    • Bulgarian Parliament (12)
    • COVID-19 (12)
    • Danish Parliament (12)
    • Dutch Parliament (12)
    • French Parliament (12)
    • ... View More
Rights  
    • PUB (191)
    • ACA (16)
Language (ISO)  
    • Slovenian (78)
    • Croatian (59)
    • English (58)
    • Serbian (43)
    • Bulgarian (21)
    • Bosnian (18)
    • Macedonian (16)
    • Icelandic (15)
    • Turkish (15)
    • Dutch (13)
    • Modern Greek (1453-) (13)
    • Catalan (12)
    • Czech (12)
    • Finnish (11)
    • French (11)
    • Polish (11)
    • Spanish (11)
    • Ukrainian (11)
    • Danish (10)
    • Hungarian (10)
    • ... View More
Type  
    • text (155)
    • corpus (141)
    • toolService (47)
    • lexicalConceptualResource (21)
    • audio (7)
Contain Files  
    • yes (207)
    • no (2)

Showing 1 through 80 out of 209 results

  • 1
  • 2
  • 3
  •  
  •    
    • Sort items by
    • Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    • 40
    • 60
    •  80
    • 100

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    SimLex-999 Slovenian translation SimLex-999-sl 1.0
    (University of Ljubljana / 2020-05-15)
    
    Author(s):
    Pollak, Senja ; et al.show everyone Pollak, Senja ; Vulić, Ivan ; Pelicon, Andraž ; Repar, Andraž ; Armendariz, Carlos ; Matthew, Purver ; Ljubešić, Nikola
     This item contains 3 files (37.3 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.34 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Terminology identification dataset KAS-term 1.0
    (Jožef Stefan Institute / 2018-08-18)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Arhar Holdt, Špela ; Bren, Urban ; Robnik-Šikonja, Marko ; Udovič, Boštjan
     This item contains 4 files (17.26 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian corpus of non-professional written language by typical speakers and speakers with language disorders RAPUT 1.0
    (Jožef Stefan Institute; Faculty of Education and Rehabilitation, University of Zagreb / 2021-06-15)
    
    Author(s):
    Kuvač Kraljević, Jelena ; Hržica, Gordana ; Štefanec, Vanja ; Kologranić Belić, Lana and Ljubešić, Nikola
     This item contains 2 files (8.11 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian Twitter dataset 2018-2020 1.0
    (Jožef Stefan Institute / 2021-07-20)
    
    Author(s):
    Evkoski, Bojan ; Pelicon, Andraž ; Mozetič, Igor ; Ljubešić, Nikola and Kralj Novak, Petra
     This item contains 1 file (182.04 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Finnish web corpus fiWaC 1.0
    (Jožef Stefan Institute / 2016-09-20)
    
    Author(s):
    Ljubešić, Nikola ; Pirinen, Tommi and Toral, Antonio
     This item contains 38 files (15.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
    (Jožef Stefan Institute / 2018-10-27)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (7.62 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Text collection for training the BERTić transformer model BERTić-data
    (Jožef Stefan Institute / 2021-05-05)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 10 files (21.14 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Bulgarian 1.0
    (Jožef Stefan Institute; IICT-BAS / 2020-07-07)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 2 files (107.32 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon srLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (54.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Text classification model SloBERTa-Trendi-Topics 1.0
    (Jožef Stefan Institute / 2022-10-28)
    
    Author(s):
    Čibej, Jaka ; et al.show everyone Čibej, Jaka ; Kuzman, Taja ; Ljubešić, Nikola ; Kosem, Iztok ; Ponikvar, Primož ; Dobrovoljc, Kaja ; Krek, Simon
     This item contains 1 file (389.15 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    DSI-enriched ParaCrawl 9 en-es corpus
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-25)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 4 files (176.82 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    The news dataset for discriminating between Bosnian, Croatian and Serbian SETimes.HBS 1.0
    (Jožef Stefan Institute / 2022-01-26)
    
    Author(s):
    Ljubešić, Nikola and Rupnik, Peter
     This item contains 1 file (20.15 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    English YouTube Hate Speech Corpus
    (Jožef Stefan Institute / 2021-10-14)
    
    Author(s):
    Ljubešić, Nikola ; Mozetič, Igor ; Cinelli, Matteo and Kralj Novak, Petra
     This item contains 3 files (30.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Text classification model fastText-Trendi-Topics 1.0
    (Jožef Stefan Institute / 2022-10-28)
    
    Author(s):
    Kuzman, Taja ; et al.show everyone Kuzman, Taja ; Čibej, Jaka ; Ljubešić, Nikola ; Kosem, Iztok ; Ponikvar, Primož ; Dobrovoljc, Kaja ; Krek, Simon
     This item contains 1 file (890.16 MB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka ; Čibej, Jaka and Ljubešić, Nikola
     This item contains 1 file (2.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Concreteness and imageability lexicon MEGA.HR-Crossling
    (Jožef Stefan Institute; Faculty of Humanities and Social Sciences, University of Zagreb / 2018-05-28)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (164.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Ljubešić, Nikola ; Terčon, Luka and Čibej, Jaka
     This item contains 2 files (509.87 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja ; Erjavec, Tomaž and Krsnik, Luka
     This item contains 2 files (146.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus hrenWaC 2.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (186.46 MB).
     
    Academic Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon hrMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (152.39 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian Twitter hate speech dataset IMSyPP-sl
    (Jožef Stefan Institute / 2021-02-17)
    
    Author(s):
    Kralj Novak, Petra ; Mozetič, Igor and Ljubešić, Nikola
     This item contains 4 files (5.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Slovene ontology of semantic types for nouns SLONEST-noun 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Pori, Eva ; Gantar, Polona ; Logar, Nataša ; Krek, Simon ; Laskowski, Cyprian ; Arhar Holdt, Špela ; Čibej, Jaka ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Klemenc, Bojan ; Ljubešić, Nikola
     This item contains 1 file (58.7 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Serbian 1.1
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (90.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (178.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The Orange workflow for observing collocation trends ColTrend 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Krek, Simon ; Čibej, Jaka ; Gantar, Polona ; Arhar Holdt, Špela ; Logar, Nataša ; Laskowski, Cyprian ; Klemenc, Bojan ; Ljubešić, Nikola ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Pori, Eva
     This item contains 1 file (70.03 MB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Serbian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.15 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Serbian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Slovenian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Tourism English-Croatian Parallel Corpus 2.0
    (Abu-MaTran project / 2016-01-28)
    
    Author(s):
    Toral, Antonio ; et al.show everyone Toral, Antonio ; Esplà-Gomis, Miquel ; Klubička, Filip ; Ljubešić, Nikola ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Rubino, Raphael ; Way, Andy
     This item contains 1 file (69.36 MB).
     
    Academic Use Attribution Required Noncommercial

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Croatian 1.1
    (Jožef Stefan Institute / 2020-07-17)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (89.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (160.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    DSI-enriched ParaCrawl 9 en-nl corpus
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-28)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 2 files (55.54 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.0
    (Jožef Stefan Institute / 2021-05-28)
    
    Author(s):
    Ljubešić, Nikola ; Fišer, Darja and Erjavec, Tomaž
     This item contains 1 file (4.17 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Choice of plausible alternatives dataset in Croatian COPA-HR
    (Jožef Stefan Institute / 2021-02-24)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 3 files (194.2 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian-English parallel corpus srenWaC 1.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (70.94 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene-English parallel corpus slenWaC 1.0
    (Jožef Stefan Institute / 2016-03-10)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (94.44 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Facebook metadata dataset LiLaH-HAG
    (Jožef Stefan Institute / 2022-08-24)
    
    Author(s):
    Markov, Ilia ; Hilte, Lisa ; Ljubešić, Nikola ; Fišer, Darja and Daelemans, Walter
     This item contains 1 file (128.23 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for JOS dependency parsing of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (176.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The sentiment corpus of parliamentary debates ParlaSent-BCS v1.0
    (Jožef Stefan Institute / 2022-06-08)
    
    Author(s):
    Mochtak, Michal ; Rupnik, Peter and Ljubešić, Nikola
     This item contains 1 file (1.13 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for semantic role labeling of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (58.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon hrLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (51.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The Orange workflow for observing collocation clusters ColEmbed 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Čibej, Jaka ; Ljubešić, Nikola ; Krek, Simon ; Gantar, Polona ; Arhar Holdt, Špela ; Logar, Nataša ; Laskowski, Cyprian ; Klemenc, Bojan ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Pori, Eva
     This item contains 1 file (86.32 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Semantic hypergraph corpus SemCRO 1.0
    (University of Mostar; University of Split; Jožef Stefan Institute / 2020-11-20)
    
    Author(s):
    Vasić, Daniel ; et al.show everyone Vasić, Daniel ; Žitko, Branko ; Gašpar, Angelina ; Ljubešić, Nikola ; Štrkalj Despot, Kristina ; Merkler, Danijela
     This item contains 1 file (21.66 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon slMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola ; Krek, Simon ; Dobrovoljc, Kaja and Erjavec, Tomaž
     This item contains 1 file (73.96 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Bilingual terminology extraction dataset KAS-biterm 1.0
    (Jožef Stefan Institute / 2018-08-18)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola and Bitenc, Maja
     This item contains 2 files (1.83 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Finnish-English parallel corpus fienWaC 1.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (283.67 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    The Twitter user dataset for discriminating between Bosnian, Croatian, Montenegrin and Serbian Twitter-HBS 1.0
    (Jožef Stefan Institute / 2022-01-26)
    
    Author(s):
    Ljubešić, Nikola and Rupnik, Peter
     This item contains 1 file (12.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (504.03 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (2.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset and baseline model of moderated content FRENK-MMC-RTV 1.0
    (Jožef Stefan Institute / 2018-10-27)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (4.65 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sr 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (3.41 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Word embeddings CLARIN.SI-embed.mk 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (1.71 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sr 1.0
    (Jožef Stefan Institute / 2018-12-10)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 4 files (3.36 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    Word embeddings CLARIN.SI-embed.mk 0.1
    (Jožef Stefan Institute / 2020-10-13)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (1.23 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (12.9 GB).
     
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sl 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 2 files (4.22 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.hr 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (4.16 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.bg 1.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (3.49 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sl 1.0
    (Jožef Stefan Institute / 2018-11-26)
    
    Author(s):
    Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 4 files (6.41 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.hr 1.0
    (Jožef Stefan Institute / 2018-12-10)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 4 files (4.88 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-19)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (5.57 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.79 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (12.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-28)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (4.09 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (28.22 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish web corpus MaCoCu-tr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (31.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (2.65 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese-English parallel corpus MaCoCu-mt-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (1.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.12 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bosnian web corpus MaCoCu-bs 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Runić, Marija ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (2.21 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Montenegrin web corpus MaCoCu-cnr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (500.14 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian web corpus MaCoCu-sr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.62 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (16.72 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian-English parallel corpus MaCoCu-mk-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (442.99 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus MaCoCu-hr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (2.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish-English parallel corpus MaCoCu-tr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (3.03 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Albanian-English parallel corpus MaCoCu-sq-en 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (590.81 MB).
     
    Publicly Available

  • 1
  • 2
  • 3
  •  
  •    
    • Sort items by
    • Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    • 40
    • 60
    •  80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".