CLARIN.SI repository

Selected Filters
Language: Croatian
Showing 1 through 87 out of 87 results


  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Multilingual Culture-Independent Word Analogy Datasets
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    
    Author(s):
    Ulčar, Matej ; Vaik, Kristiina ; Lindström, Jessica ; Linde, Dace ; Dailidėnaitė, Milda ; Šumakov, Andrei
     This item contains 3 files (6.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    CroSloEngual BERT 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-09)
    
    Author(s):
    Ulčar, Matej and Robnik-Šikonja, Marko
     This item contains 3 files (476.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Sentiment Annotated Dataset of Croatian News
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Pelicon, Andraž ; Pranjić, Marko ; Miljković, Dragana ; Škrlj, Blaž and Pollak, Senja
     This item contains 1 file (85.6 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
    (Ekspress Meedia Group; Styria Media Group / 2022-02-10)
    
    Author(s):
    Freienthal, Linda ; Pelicon, Andraž ; Martinc, Matej ; Škrlj, Blaž ; Krustok, Ivar ; Pranjić, Marko ; Cabrera-Diego, Luis Adrián ; Purver, Matthew ; Pollak, Senja ; Kuulmets, Hele-Andra ; Shekhar, Ravi ; Koloski, Boshko
     This item contains 1 file (434.28 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The Croatian web dictionary Mrežnik (A-F) 1.0
    (Institute of Croatian Language and Linguistics / 2021-12-23)
    
    Author(s):
    Hudeček, Lana ; Mihaljević, Milica ; Blagus Bartolec, Goranka ; Brač, Ivana ; Horvat, Joža ; Ivšić Majić, Dubravka ; Lewis, Kristian ; Kovačević, Barbara ; Kramarić, Martina ; Lazić, Daria ; Matas Ivanković, Ivana ; Matijević, Maja ; Mihaljević, Josip ; Pasini, Dinka ; Sučević-Međeral, Krešimir ; Vidović, Domagoj
     This item contains 2 files (7.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Annotated corpus of Croatian language-related news articles MetaLangNEWS-Hr
    (ZRC SAZU; Regional Linguistic Data Initiative Centre ReLDI / 2020-10-30)
    
    Author(s):
    Bogetić, Ksenija and Batanović, Vuk
     This item contains 3 files (11.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.34 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Terminological dictionary of tax terminology
    (ZRC SAZU; Faculty of Public Administration, University of Ljubljana / 2022-06-29)
    
    Author(s):
    Hudej, Nika ; Jemec Tomazin, Mateja ; Klun, Maja ; Kostelec, Andreja ; Kovač, Polonca ; Podlipnik, Jernej
     This item contains 1 file (94.49 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Croatian Language Resources for NooJ (ELEXIS)
    (Faculty of Humanities and Social Sciences, University of Zagreb / 2020-12-17)
    
    Author(s):
    Vučković, Kristina ; Tadić, Marko ; Bekavac, Božo ; Kocijan, Kristina ; Kurolt, Silvia ; Mijić, Linda ; Kocijan, Kristina ; Košković, Lucija ; Bajac, Petra
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian corpus of non-professional written language by typical speakers and speakers with language disorders RAPUT 1.0
    (Jožef Stefan Institute; Faculty of Education and Rehabilitation, University of Zagreb / 2021-06-15)
    
    Author(s):
    Kuvač Kraljević, Jelena ; Hržica, Gordana ; Štefanec, Vanja ; Kologranić Belić, Lana and Ljubešić, Nikola
     This item contains 2 files (8.11 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Twitter sentiment for 15 European languages
    (Jožef Stefan Institute / 2016-02-23)
    
    Author(s):
    Mozetič, Igor ; Grčar, Miha and Smailović, Jasmina
     This item contains 16 files (49.38 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
    (Jožef Stefan Institute / 2018-10-27)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (7.62 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Text collection for training the BERTić transformer model BERTić-data
    (Jožef Stefan Institute / 2021-05-05)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 10 files (21.14 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The news dataset for discriminating between Bosnian, Croatian and Serbian SETimes.HBS 1.0
    (Jožef Stefan Institute / 2022-01-26)
    
    Author(s):
    Ljubešić, Nikola and Rupnik, Peter
     This item contains 1 file (20.15 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Concreteness and imageability lexicon MEGA.HR-Crossling
    (Jožef Stefan Institute; Faculty of Humanities and Social Sciences, University of Zagreb / 2018-05-28)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (164.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus hrenWaC 2.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (186.46 MB).
     
    Academic Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon hrMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (152.39 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Dictionary of Croatian Idioms (ELEXIS)
    (Croatian Academy of Sciences and Arts / 2022-05-03)
    
    Author(s):
    Filipović Petrović, Ivana and Parizoska, Jelena
     This item contains no files.

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (178.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Annotated corpus of Croatian language-related news comments MetaLangNEWS-COMMENTS-Hr
    (ZRC SAZU; Regional Linguistic Data Initiative Centre ReLDI / 2020-10-30)
    
    Author(s):
    Bogetić, Ksenija and Batanović, Vuk
     This item contains 3 files (14.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Treq Translation Equivalents (ELEXIS)
    (Institute of the Czech National Corpus / 2020-06-25)
    
    Author(s):
    Rosen, Alexandr ; Vavřín, Martin and Zasina, Adrian
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Tourism English-Croatian Parallel Corpus 2.0
    (Abu-MaTran project / 2016-01-28)
    
    Author(s):
    Toral, Antonio ; Esplà-Gomis, Miquel ; Klubička, Filip ; Ljubešić, Nikola ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Rubino, Raphael ; Way, Andy
     This item contains 1 file (69.36 MB).
     
    Academic Use Attribution Required Noncommercial

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Croatian 1.1
    (Jožef Stefan Institute / 2020-07-17)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (89.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.0
    (Jožef Stefan Institute / 2021-05-28)
    
    Author(s):
    Ljubešić, Nikola ; Fišer, Darja and Erjavec, Tomaž
     This item contains 1 file (4.17 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Choice of plausible alternatives dataset in Croatian COPA-HR
    (Jožef Stefan Institute / 2021-02-24)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 3 files (194.2 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Facebook metadata dataset LiLaH-HAG
    (Jožef Stefan Institute / 2022-08-24)
    
    Author(s):
    Markov, Ilia ; Hilte, Lisa ; Ljubešić, Nikola ; Fišer, Darja and Daelemans, Walter
     This item contains 1 file (128.23 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The sentiment corpus of parliamentary debates ParlaSent-BCS v1.0
    (Jožef Stefan Institute / 2022-06-08)
    
    Author(s):
    Mochtak, Michal ; Rupnik, Peter and Ljubešić, Nikola
     This item contains 1 file (1.13 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The Croatian Web Dictionary Mrežnik (ELEXIS)
    (Institute of Croatian Language and Linguistics / 2021-12-23)
    
    Author(s):
    Hudeček, Lana ; Mihaljević, Milica ; Blagus Bartolec, Goranka ; Brač, Ivana ; Horvat, Joža ; Ivšić Majić, Dubravka ; Lewis, Kristian ; Kovačević, Barbara ; Kramarić, Martina ; Lazić, Daria ; Matas Ivanković, Ivana ; Matijević, Maja ; Mihaljević, Josip ; Pasini, Dinka ; Sučević-Međeral, Krešimir ; Vidović, Domagoj
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon hrLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (51.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    TermFrame: Terms, definitions and semantic annotations for karstology
    (Faculty of Arts, University of Ljubljana; Jožef Stefan Institute / 2021-11-18)
    
    Author(s):
    Vintar, Špela ; Pollak, Senja ; Saksida, Amanda ; Stepišnik, Uroš ; Pintar, Kristian ; Grčić Simeunović, Larisa ; Hadalin, Teja ; Podpečan, Vid ; Martinc, Matej ; Repar, Andraž ; Vrtovec, Katarina
     This item contains 1 file (1.25 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    PASSWORD English Multilingual Dictionary - KEMD (ELEXIS)
    (K Dictionaries Ltd / 2021-03-04)
    
    Author(s):
    K Dictionaries Ltd
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    School Dictionary of the Croatian Language (ELEXIS)
    (Institute of Croatian Language and Linguistics / 2020-12-16)
    
    Author(s):
    Birtić, Matea ; Blagus Bartolec, Goranka ; Hudeček, Lana ; Jojić, Ljiljana ; Kovačević, Barbara ; Lewis, Kristian ; Matas Ivanković, Ivana ; Mihaljević, Milica ; Miloš, Irena ; Ramadanović, Ermina ; Vidović, Domagoj
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Bosnia and Herzegovina language-related news comments MetaLangNEWS-COMMENTS-Bs
    (ZRC SAZU; Regional Linguistic Data Initiative Centre ReLDI / 2022-09-30)
    
    Author(s):
    Bogetić, Ksenija ; Milinković, Michael and Batanović, Vuk
     This item contains 2 files (1.88 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Semantic hypergraph corpus SemCRO 1.0
    (University of Mostar; University of Split; Jožef Stefan Institute / 2020-11-20)
    
    Author(s):
    Vasić, Daniel ; Žitko, Branko ; Gašpar, Angelina ; Ljubešić, Nikola ; Štrkalj Despot, Kristina ; Merkler, Danijela
     This item contains 1 file (21.66 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Bosnia and Herzegovina language-related news articles MetaLangNEWS-Bs
    (ZRC SAZU; Regional Linguistic Data Initiative Centre ReLDI / 2022-09-30)
    
    Author(s):
    Bogetić, Ksenija ; Milinković, Michael and Batanović, Vuk
     This item contains 2 files (2.03 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Croatian SenseGraph 1.0
    (Faculty of Electrical Engineering and Computing, University of Zagreb / 2018-12-14)
    
    Author(s):
    Šnajder, Jan and Alagić, Domagoj
     This item contains 1 file (955.93 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The Twitter user dataset for discriminating between Bosnian, Croatian, Montenegrin and Serbian Twitter-HBS 1.0
    (Jožef Stefan Institute / 2022-01-26)
    
    Author(s):
    Ljubešić, Nikola and Rupnik, Peter
     This item contains 1 file (12.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.hr 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (4.16 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.hr 1.0
    (Jožef Stefan Institute / 2018-12-10)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 4 files (4.88 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.12 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (16.72 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus MaCoCu-hr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (2.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus MaCoCu-hr-en 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-28)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 2 files (1.15 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (177.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (191.81 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (98.13 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for UD dependency parsing of standard Croatian
    (Jožef Stefan Institute / 2019-10-11)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (1.13 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 2 files (179.88 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    ELMo embeddings models for seven languages
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    
    Author(s):
    Ulčar, Matej
     This item contains 7 files (1.35 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (98.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Database of the Western South Slavic Verb HyperVerb -- Derivation
    (University of Nova Gorica; University of Graz / 2023-07-04)
    
    Author(s):
    Milosavljević, Stefan ; Mišmaš, Petra ; Simonović, Marko ; Arsenijević, Boban ; Gomboc Čeh, Katarina ; Marušič, Franc Lanko ; Simić, Jelena ; Žaucer, Rok
     This item contains 3 files (719.76 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The multilingual sentiment dataset of parliamentary debates ParlaSent 1.0
    (Jožef Stefan Institute / 2023-09-18)
    
    Author(s):
    Mochtak, Michal ; Rupnik, Peter ; Meden, Katja and Ljubešić, Nikola
     This item contains 8 files (7.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    A Resource for Evaluating Graded Word Similarity in Context: CoSimLex
    (Queen Mary University / 2020)
    
    Author(s):
    Armendariz, Carlos ; Purver, Matthew ; Ulčar, Matej ; Pollak, Senja ; Ljubešić, Nikola ; Robnik-Šikonja, Marko ; Granroth-Wilding, Mark ; Vaik, Kristiina
     This item contains 6 files (750.07 KB).
     
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Portal of Contemporary Croatian Personal Names (ELEXIS)
    (Institute of Croatian Language and Linguistics / 2021-02-03)
    
    Author(s):
    Čilaš Šimpraga, Ankica ; Ivšić, Dubravka and Vidović, Domagoj
     This item contains no files.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The LiLaH Emotion Lexicon of Croatian, Dutch and Slovene
    (Jožef Stefan Institute; Centre for Computational Linguistics and Psycholinguistics (CLiPS) / 2020-06-04)
    
    Author(s):
    Daelemans, Walter ; Fišer, Darja ; Franza, Jasmin ; Kranjčić, Denis ; Lemmens, Jens ; Ljubešić, Nikola ; Markov, Ilia ; Popič, Damjan
     This item contains 1 file (199.85 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (2.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus hrWaC 2.1
    (Jožef Stefan Institute / 2016-05-12)
    
    Author(s):
    Ljubešić, Nikola and Klubička, Filip
     This item contains 15 files (9.21 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    JRC EU DGT Translation Memory Parsebank DGT-UD 1.0
    (Jožef Stefan Institute / 2018-08-15)
    
    Author(s):
    Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 24 files (24.42 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus hr500k 1.0
    (Jožef Stefan Institute / 2018-04-13)
    
    Author(s):
    Ljubešić, Nikola ; Agić, Željko ; Klubička, Filip ; Batanović, Vuk and Erjavec, Tomaž
     This item contains 3 files (91.53 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian language corpus Riznica 0.1
    (Institute of Croatian Language and Linguistics / 2018-03-07)
    
    Author(s):
    Brozović Rončević, Dunja ; Ćavar, Damir ; Ćavar, Małgorzata ; Stojanov, Tomislav ; Štrkalj Despot, Kristina ; Ljubešić, Nikola ; Erjavec, Tomaž
     This item contains 1 file (457.73 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (23.37 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Comparable corpora of South-Slavic Wikipedias CLASSLA-Wikipedia 1.0
    (Jožef Stefan Institute / 2021-05-05)
    
    Author(s):
    Ljubešić, Nikola ; Markoski, Filip ; Markoska, Elena and Erjavec, Tomaž
     This item contains 7 files (5.04 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Croatian news portals ENGRI (2014-2018)
    (University of Rijeka, Faculty of Maritime Studies / 2021-03-14)
    
    Author(s):
    Bogunović, Irena ; Kučić, Mario ; Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 12 files (8.48 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    Parliamentary corpus of first Yugoslavia (1919-1939) yu1Parl 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2023-07-15)
    
    Author(s):
    Kavčič, Alenka ; Mundjar, Aleksander and Marolt, Matija
     This item contains 3 files (2.65 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • corpus
    CLARIN.SI data & tools
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
    (Jožef Stefan Institute / 2021-11-17)
    
    Author(s):
    Ljubešić, Nikola ; Fišer, Darja ; Erjavec, Tomaž and Šulc, Ajda
     This item contains 2 files (4.48 MB).
     
    Academic Use, Inform Before Use, Attribution Required, Noncommercial

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (61.05 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.67 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • corpus
    CLARIN.SI data & tools
    "Choice of plausible alternatives" datasets in South Slavic dialects DIALECT-COPA
    (Jožef Stefan Institute / 2024-04-26)
    
    Author(s):
    Ljubešić, Nikola ; Kuzman, Taja ; Rupnik, Peter ; Milosavljević, Stefan ; Galant, Nada ; Benčina, Sonja ; Čibej, Jaka
     This item contains 6 files (279.69 KB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • toolService
    CLARIN.SI data & tools
    Multilingual text genre classification model X-GENRE
    (Jožef Stefan Institute / 2024-09-25)
    
    Author(s):
    Kuzman, Taja and Ljubešić, Nikola
     This item contains 1 file (779.93 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Genre-enriched web corpora MaCoCu-Genre
    (Jožef Stefan Institute / 2024-10-07)
    
    Author(s):
    Kuzman, Taja and Ljubešić, Nikola
     This item contains 14 files (101.43 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    Croatian web corpus CLASSLA-web.hr 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (20.31 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    Croatian linguistic training corpus hr500k 2.0
    (Jožef Stefan Institute / 2023-04-13)
    
    Author(s):
    Ljubešić, Nikola and Samardžić, Tanja
     This item contains 7 files (49.59 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.54 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.56 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Map task corpus of heritage BCMS 1.0
    (Department of Slavonic Languages and Literatures (Slavisches Seminar), University of Zurich / 2023-03-11)
    
    Author(s):
    Lemmenmeier-Batinić, Dolores
     This item contains 2 files (751.91 KB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Noncommercial, Share Alike

  • corpus
    CLARIN.SI data & tools
    Keyword extraction datasets for Croatian, Estonian, Latvian and Russian 1.0
    (Ekspress Meedia Group; Styria Media Group / 2021-06-04)
    
    Author(s):
    Koloski, Boshko ; Pollak, Senja ; Škrlj, Blaž and Martinc, Matej
     This item contains 1 file (224.84 KB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Noncommercial, No Derivative Works

  • corpus
    CLARIN.SI data & tools
    24sata news article archive 1.0
    (Styria Media Group / 2021-04-19)
    
    Author(s):
    Purver, Matthew ; Shekhar, Ravi ; Pranjić, Marko ; Pollak, Senja and Martinc, Matej
     This item contains 2 files (1.26 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Noncommercial, No Derivative Works

  • corpus
    CLARIN.SI data & tools
    24sata news comment dataset 1.0
    (Styria Media Group / 2021-04-19)
    
    Author(s):
    Shekhar, Ravi ; Pranjić, Marko ; Pollak, Senja ; Pelicon, Andraž and Purver, Matthew
     This item contains 3 files (1.89 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Noncommercial, No Derivative Works

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Database of the Western South Slavic Verb HyperVerb 2.0 -- WeSoSlav
    (University of Graz; University of Nova Gorica / 2024-12-10)
    
    Author(s):
    Arsenijević, Boban ; Gomboc Čeh, Katarina ; Marušič, Franc Lanko ; Milosavljević, Stefan ; Mišmaš, Petra ; Simić, Jelena ; Simonović, Marko ; Žaucer, Rok
     This item contains 3 files (11.43 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Database of the Western South Slavic Verb HyperVerb 1.0
    (University of Graz; University of Nova Gorica / 2022-09-01)
    
    Author(s):
    Marušič, Franc Lanko ; Žaucer, Rok ; Mišmaš, Petra ; Arsenijević, Boban ; Simonović, Marko ; Milosavljević, Stefan ; Gomboc Čeh, Katarina ; Simić, Jelena
     This item contains 3 files (513.16 KB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (65.97 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.87 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required

  • corpus
    CLARIN.SI data & tools
    Parliamentary spoken corpus of Croatian ParlaSpeech-HR 2.0
    (Jožef Stefan Institute / 2024-01-25)
    
    Author(s):
    Ljubešić, Nikola ; Koržinek, Danijel and Rupnik, Peter
     This item contains 8 files (207.33 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    Heritage Bosnian, Croatian, and Serbian spoken by Second Generation Speakers in Germany He-BCS-Ge
    (University of Regensburg; University of Zurich / 2024-11-10)
    
    Author(s):
    Romić, Daniel
     This item contains 1 file (109.1 KB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Noncommercial

  • corpus
    CLARIN.SI data & tools
    Multilingual IPTC Media Topic dataset EMMediaTopic 1.0
    (Jožef Stefan Institute / 2024-12-02)
    
    Author(s):
    Kuzman, Taja and Ljubešić, Nikola
     This item contains 1 file (71.3 MB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike

  • corpus
    CLARIN.SI data & tools
    The "Mići Princ" text and speech dataset of Chakavian micro-dialects
    (Jožef Stefan Institute / 2024-03-05)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Perinčić, Tea
     This item contains 6 files (1.04 GB).
     
    Publicly Available, Distributed under Creative Commons, Attribution Required, Share Alike


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs on the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub.

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia under the Programme of "Research Infrastructures".