• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Publisher : Centre for Language Resources and Technologies, University of Ljubljana      Type : corpus     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Erjavec, Tomaž (31)
    • Ljubešić, Nikola (21)
    • Arhar Holdt, Špela (20)
    • Krek, Simon (17)
    • Robnik-Šikonja, Marko (16)
    • Batanović, Vuk (13)
    • Verdonik, Darinka (13)
    • Ferme, Marko (12)
    • Kosem, Iztok (12)
    • Žagar, Aleš (12)
    • Bogetić, Ksenija (11)
    • Ojsteršek, Milan (11)
    • Borovič, Mladen (10)
    • Dobrovoljc, Kaja (10)
    • Fišer, Darja (10)
    • Čibej, Jaka (10)
    • Stabej, Marko (9)
    • Žitnik, Slavko (9)
    • Antloga, Špela (8)
    • Boškovič, Borko (8)
    • ... View More
Subject  
    • TEI (23)
    • spoken corpus (19)
    • academic writing (10)
    • manual annotation (10)
    • parallel corpus (9)
    • scientific texts (9)
    • specialised corpus (9)
    • news corpus (8)
    • part-of-speech tagging (8)
    • coreference resolution (7)
    • dependency treebank (7)
    • news discourse (7)
    • speech transcription (7)
    • spoken language (7)
    • student writing (7)
    • BSc/BA theses (6)
    • developmental corpus (6)
    • MSc/MA theses (6)
    • multilingual corpus (6)
    • named entities (6)
    • ... View More
Rights  
    • PUB (93)
    • ACA (13)
    • RES (2)
Language (ISO)  
    • Slovenian (82)
    • English (18)
    • Serbian (18)
    • Croatian (11)
    • Bosnian (5)
    • Lithuanian (5)
    • French (4)
    • German (2)
    • Hebrew (2)
    • Macedonian (2)
    • Montenegrin (2)
    • Albanian (1)
    • Bulgarian (1)
    • Danish (1)
    • GhegAlbanian (1)
    • Romanian (1)
    • Serbo-Croatian (1)
    • Spanish (1)
Type  
    • text (107)
    • audio (10)
    • video (1)
Contain Files  
    • yes (108)
    • no (10)

Showing 1 through 10 out of 118 results

  • 1
  • 2
  • 3
  •  
  • 12
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel corpus EN-SL RSDO4 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-10-28)
    
    Author(s):
    Repar, Andraž and Lebar Bajec, Iztok
     This item contains 1 file (189.06 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (transcriptions)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU / 2023-02-22)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Bizjak, Andreja ; Sepesy Maučec, Mirjam ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Trojar, Mitja ; Erjavec, Tomaž ; Bernjak, Mitja ; Žganec Gros, Jerneja ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Pavlič, Jani ; Zelenik, Marijana ; Ivanovska, Marija ; Grm, Klemen ; Longyka, Jure ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum
     This item contains 1 file (48.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene corpus for general relation extraction SloREL 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2022-09-15)
    
    Author(s):
    Štravs, Miha ; Knez, Timotej and Žitnik, Slavko
     This item contains 1 file (38.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Annotated Corpus of Pre-Standardized Balkan Slavic Literature 1.1
    (Slavic Seminary, University of Zurich / 2021-07-02)
    
    Author(s):
    Šimko, Ivan
     This item contains 5 files (3.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of textbooks for learning Slovenian as L2 KUUS 2.0
    (Centre for Slovene as a Second and Foreign Language, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana / 2023-10-19)
    
    Author(s):
    Klemen, Matej ; et al.show everyone Klemen, Matej ; Kosem, Iztok ; Arhar Holdt, Špela ; Pollak, Senja ; Huber, Damjan ; Lutar, Mateja
     This item contains 3 files (38.95 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (audio)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Alpineon d.o.o.; STA / 2023-02-27)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Bizjak, Andreja ; Žgank, Andrej ; Bernjak, Mitja ; Antloga, Špela ; Majhenič, Simona ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Zelenik, Marijana ; Pavlič, Jani ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Ivanovska, Marija ; Grm, Klemen ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Longyka, Jure ; Trojar, Mitja ; Žganec Gros, Jerneja ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum ; Bordon, David
     This item contains 39 files (324.53 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.2 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-09-23)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Erjavec, Tomaž ; Majhenič, Simona ; Žgank, Andrej
     This item contains 3 files (21.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Serbian Forms of Address 1.1
    (Department of Slavonic Languages and Literatures (Slavisches Seminar), University of Zurich / 2023-05-01)
    
    Author(s):
    Lemmenmeier-Batinić, Dolores
     This item contains 3 files (6.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The Sarajevo Corpus of SMS Messages in Bosnian 1.1
    (University of Sarajevo – Faculty of Philosophy / 2024-07-16)
    
    Author(s):
    Wasserscheidt, Philipp ; et al.show everyone Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin
     This item contains 1 file (1.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Albanian Spoken Corpus in Kosovo 1.0
    (University of Prishtina "Hasan Prishtina" / 2024-07-08)
    
    Author(s):
    Wasserscheidt, Philipp ; Rugova, Bardh and Baftiu, Adelajda
     This item contains 1 file (1.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • 1
  • 2
  • 3
  •  
  • 12
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".