• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   What can you do?
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Subject : word normalisation      Language : Slovenian     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Ljubešić, Nikola (7)
    • Arhar Holdt, Špela (2)
    • Zupan, Katja (2)
    • Čibej, Jaka (2)
    • Dobrovoljc, Kaja (1)
Subject  
    • named entities (6)
    • TEI (6)
    • manual annotation (3)
    • tokenisation (2)
    • blogs (1)
    • experimental data (1)
    • forums (1)
    • historical language (1)
    • lemmatisation (1)
    • news comments (1)
    • part-of-speech tagging (1)
    • Twitter (1)
    • Wikipedia (1)

Showing 1 through 8 out of 8 results

  • 1
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Norm 1.2
    (Jožef Stefan Institute / 2016-12-30)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela
     This item contains 4 files (4.01 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 7 files (5.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset of normalised Slovene text KonvNormSl 1.0
    (Jožef Stefan Institute / 2016-09-19)
    
    Author(s):
    Ljubešić, Nikola ; Zupan, Katja ; Fišer, Darja ; Erjavec, Tomaž
     This item contains 1 file (4.57 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Twitter corpus Janes-Tweet 1.0
    (Jožef Stefan Institute / 2017-09-05)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Fišer, Darja
     This item contains 2 files (1.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    News comment corpus Janes-News 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola ; Fišer, Darja
     This item contains 2 files (186.48 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Forum corpus Janes-Forum 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola ; Fišer, Darja
     This item contains 2 files (573.23 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Blog post and comment corpus Janes-Blog 1.0
    (Jožef Stefan Institute / 2017-08-17)
    
    Author(s):
    Erjavec, Tomaž ; Ljubešić, Nikola ; Fišer, Darja
     This item contains 2 files (411.31 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Wikipedia talk corpus Janes-Wiki 1.0
    (Jožef Stefan Institute / 2017-08-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Fišer, Darja
     This item contains 2 files (55.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • 1
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Domestic Research Society
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society

Partners

  • Research Centre of the Slovenian Academy of Sciences and Arts
  • Trojina, Institute for Applied Slovene Studies
  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIN repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".