  • CLARIN.SI repository
 
Selected Filters
 Language: Croatian

Author  
    • Ljubešić, Nikola (36)
    • Erjavec, Tomaž (11)
    • Pollak, Senja (7)
    • Batanović, Vuk (5)
    • Rupnik, Peter (5)
    • Esplà-Gomis, Miquel (4)
    • Fišer, Darja (4)
    • Klubička, Filip (4)
    • Martinc, Matej (4)
    • Toral, Antonio (4)
    • Ulčar, Matej (4)
    • Pelicon, Andraž (3)
    • Pranjić, Marko (3)
    • Purver, Matthew (3)
    • Shekhar, Ravi (3)
    • Škrlj, Blaž (3)
    • Štefanec, Vanja (3)
    • Agnoloni, Tommaso (2)
    • Barkarson, Starkaður (2)
    • Battistoni, Roberto (2)
    • ...
Subject  
    • language model (9)
    • multilingual (9)
    • computer-mediated communication (6)
    • news corpus (6)
    • news comments (5)
    • part-of-speech tagging (5)
    • web corpus (5)
    • lemmatisation (4)
    • manual annotation (4)
    • parallel corpus (4)
    • TEI (4)
    • contextual embeddings (3)
    • named entity recognition (3)
    • offensive language (3)
    • parliamentary debates (3)
    • sentiment classification (3)
    • word embeddings (3)
    • Belgian Parliament (2)
    • Bulgarian Parliament (2)
    • closely related languages (2)
    • ...
Rights  
    • PUB (46)
    • ACA (4)
Language (ISO)  
    • English (15)
    • Slovenian (15)
    • Latvian (9)
    • Estonian (7)
    • Lithuanian (7)
    • Serbian (7)
    • Bosnian (6)
    • Bulgarian (6)
    • Dutch (6)
    • Finnish (6)
    • Hungarian (6)
    • Polish (6)
    • Spanish (6)
    • Swedish (6)
    • Czech (5)
    • Danish (5)
    • French (5)
    • Italian (5)
    • Russian (5)
    • ...
Type  
    • text (43)
    • corpus (31)
    • lexicalConceptualResource (12)
    • toolService (8)
    • audio (1)
    • languageDescription (1)
Contain Files  
    • yes (50)
    • no (2)

Showing 1 through 20 out of 52 results


  • lexicalConceptualResource
    CLARIN.SI data & tools
    Multilingual Culture-Independent Word Analogy Datasets
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    Author(s):
    Ulčar, Matej ; Vaik, Kristiina ; Lindström, Jessica ; Linde, Dace ; Dailidėnaitė, Milda ; Šumakov, Andrei
    This item contains 3 files (6.08 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • toolService
    CLARIN.SI data & tools
    CroSloEngual BERT 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-09)
    Author(s):
    Ulčar, Matej and Robnik-Šikonja, Marko
    This item contains 3 files (476.35 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • corpus
    CLARIN.SI data & tools
    Sentiment Annotated Dataset of Croatian News
    (Jožef Stefan Institute / 2020-09-15)
    Author(s):
    Pelicon, Andraž ; Pranjić, Marko ; Miljković, Dragana ; Škrlj, Blaž and Pollak, Senja
    This item contains 1 file (85.6 KB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • corpus
    CLARIN.SI data & tools
    Keyword extraction datasets for Croatian, Estonian, Latvian and Russian 1.0
    (Ekspress Meedia Group; Styria Media Group / 2021-06-04)
    Author(s):
    Koloski, Boshko ; Pollak, Senja ; Škrlj, Blaž and Martinc, Matej
    This item contains 1 file (224.84 KB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • corpus
    CLARIN.SI data & tools
    24sata news comment dataset 1.0
    (Styria Media Group / 2021-04-19)
    Author(s):
    Shekhar, Ravi ; Pranjic, Marko ; Pollak, Senja ; Pelicon, Andraž and Purver, Matthew
    This item contains 3 files (1.89 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • corpus
    CLARIN.SI data & tools
    24sata news article archive 1.0
    (Styria Media Group / 2021-04-19)
    Author(s):
    Purver, Matthew ; Shekhar, Ravi ; Pranjić, Marko ; Pollak, Senja and Martinc, Matej
    This item contains 2 files (1.26 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • languageDescription
    CLARIN.SI data & tools
    ELMo embeddings models for seven languages
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    Author(s):
    Ulčar, Matej
    This item contains 7 files (1.35 GB).
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    A Resource for Evaluating Graded Word Similarity in Context: CoSimLex
    (Queen Mary University / 2020)
    Author(s):
    Armendariz, Carlos ; Matthew, Purver ; Ulčar, Matej ; Pollak, Senja ; Ljubešić, Nikola ; Robnik-Šikonja, Marko ; Granroth-Wilding, Mark ; Vaik, Kristiina
    This item contains 5 files (486.73 KB).
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
    (Ekspress Meedia Group; Styria Media Group / 2022-02-10)
    Author(s):
    Freienthal, Linda ; Pelicon, Andraž ; Martinc, Matej ; Škrlj, Blaž ; Krustok, Ivar ; Pranjić, Marko ; Cabrera-Diego, Luis Adrián ; Purver, Matthew ; Pollak, Senja ; Kuulmets, Hele-Andra ; Shekhar, Ravi ; Koloski, Boshko
    This item contains 1 file (434.28 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, No Derivative Works.

  • corpus
    CLARIN.SI data & tools
    Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
    (Jožef Stefan Institute / 2018-10-27)
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
    This item contains 2 files (7.62 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • corpus
    CLARIN.SI data & tools
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
    This item contains 18 files (23.37 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • corpus
    CLARIN.SI data & tools
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
    This item contains 18 files (2.17 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    The LiLaH Emotion Lexicon of Croatian, Dutch and Slovene
    (Jožef Stefan Institute; Centre for Computational Linguistics and Psycholinguistics (CLiPS) / 2020-06-04)
    Author(s):
    Daelemans, Walter ; Fišer, Darja ; Franza, Jasmin ; Kranjčić, Denis ; Lemmens, Jens ; Ljubešić, Nikola ; Markov, Ilia ; Popič, Damjan
    This item contains 1 file (199.85 KB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • corpus
    CLARIN.SI data & tools
    Croatian language corpus Riznica 0.1
    (Institute of Croatian Language and Linguistics / 2018-03-07)
    Author(s):
    Brozović Rončević, Dunja ; Ćavar, Damir ; Ćavar, Małgorzata ; Stojanov, Tomislav ; Štrkalj Despot, Kristina ; Ljubešić, Nikola ; Erjavec, Tomaž
    This item contains 1 file (457.73 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Automatically constructed multiword lexicon hrMWELex v0.5
    (Jožef Stefan Institute / 2015)
    Author(s):
    Ljubešić, Nikola
    This item contains 1 file (152.39 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required.

  • corpus
    CLARIN.SI data & tools
    Corpus of Croatian news portals ENGRI (2014-2018)
    (University of Rijeka, Faculty of Maritime Studies / 2021-03-14)
    Author(s):
    Bogunović, Irena ; Kučić, Mario ; Ljubešić, Nikola and Erjavec, Tomaž
    This item contains 12 files (8.48 GB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike.

  • corpus
    CLARIN.SI data & tools
    Tourism English-Croatian Parallel Corpus 2.0
    (Abu-MaTran project / 2016-01-28)
    Author(s):
    Toral, Antonio ; Esplà-Gomis, Miquel ; Klubička, Filip ; Ljubešić, Nikola ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Rubino, Raphael ; Way, Andy
    This item contains 1 file (69.36 MB).
    Academic Use. Attribution Required, Noncommercial.

  • toolService
    CLARIN.SI data & tools
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    Author(s):
    Ljubešić, Nikola
    This item contains 1 file (46.14 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • toolService
    CLARIN.SI data & tools
    The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    Author(s):
    Ljubešić, Nikola
    This item contains 2 files (106.34 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.

  • lexicalConceptualResource
    CLARIN.SI data & tools
    Concreteness and imageability lexicon MEGA.HR-Crossling
    (Jožef Stefan Institute; Faculty of Humanities and Social Sciences, University of Zagreb / 2018-05-28)
    Author(s):
    Ljubešić, Nikola
    This item contains 1 file (164.76 MB).
    Publicly Available. Distributed under Creative Commons: Attribution Required, Share Alike.


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society
  • Trojina, Institute for Applied Slovene Studies
  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs on the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub.

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia under the Programme of "Research Infrastructures".