  • CLARIN.SI repository
Selected Filters
 Title: "CLASSLA-web"

Author  
    • Ljubešić, Nikola (76)
    • Kuzman, Taja (30)
    • Rupnik, Peter (29)
    • Terčon, Luka (26)
    • Toral, Antonio (22)
    • Bañón, Marta (21)
    • Esplà-Gomis, Miquel (21)
    • Forcada, Mikel L. (21)
    • García-Romero, Cristian (21)
    • Pla Sempere, Leopoldo (21)
    • Ramírez-Sánchez, Gema (21)
    • Suchomel, Vít (21)
    • van Noord, Rik (21)
    • Chichirau, Malina (14)
    • Galiano-Jiménez, Aarón (14)
    • Zaragoza-Bernabeu, Jaume (14)
    • van der Werff, Tobias (7)
    • Zaragoza, Jaume (7)
    • Erjavec, Tomaž (6)
    • Štefanec, Vanja (6)
Subject  
    • language model (37)
    • web corpus (36)
    • part-of-speech tagging (13)
    • lemmatisation (12)
    • computer-mediated communication (11)
    • automatic genre identification (9)
    • genre corpus (8)
    • parsing (8)
    • named entity recognition (7)
    • born-digital dictionary (1)
    • children's dictionary (1)
    • comparable corpus (1)
    • dictionary for foreigners (1)
    • difficulty level (1)
    • general dictionary (1)
    • general monolingual explanatory dictionary (1)
    • genre (1)
    • genre classification (1)
    • lexicographic resource (1)
    • manual annotation (1)
Language (ISO)  
    • Slovenian (20)
    • Croatian (18)
    • Serbian (17)
    • Bulgarian (8)
    • Macedonian (8)
    • Bosnian (5)
    • Montenegrin (5)
    • Icelandic (3)
    • Turkish (3)
    • Albanian (2)
    • Catalan (2)
    • Maltese (2)
    • Modern Greek (1453-) (2)
    • Ukrainian (2)
    • English (1)
    • Finnish (1)
    • Japanese (1)
    • Serbo-Croatian (1)
Type  
    • text (41)
    • toolService (41)
    • corpus (39)
    • lexicalConceptualResource (2)
Contain Files  
    • yes (81)
    • no (1)

Showing 1 through 80 out of 82 results


  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for named entity recognition of standard Slovenian 2.2
    (Jožef Stefan Institute / 2025-02-07)
    
    Author(s):
    Terčon, Luka ; Dobrovoljc, Kaja and Ljubešić, Nikola
     This item contains 2 files (146.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Slovenian 2.2
    (Jožef Stefan Institute / 2025-02-07)
    
    Author(s):
    Terčon, Luka ; Dobrovoljc, Kaja and Ljubešić, Nikola
     This item contains 2 files (192.22 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of spoken Slovenian 2.2
    (Jožef Stefan Institute / 2025-02-07)
    
    Author(s):
    Terčon, Luka ; Dobrovoljc, Kaja and Ljubešić, Nikola
     This item contains 2 files (514.74 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of spoken Slovenian 2.2
    (Jožef Stefan Institute / 2025-02-07)
    
    Author(s):
    Terčon, Luka ; Dobrovoljc, Kaja and Ljubešić, Nikola
     This item contains 1 file (2.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of spoken Slovenian 2.2
    (Jožef Stefan Institute / 2025-02-07)
    
    Author(s):
    Terčon, Luka ; Dobrovoljc, Kaja and Ljubešić, Nikola
     This item contains 2 files (195.49 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Genre-enriched web corpora MaCoCu-Genre
    (Jožef Stefan Institute / 2024-10-07)
    
    Author(s):
    Kuzman, Taja and Ljubešić, Nikola
     This item contains 14 files (101.43 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Montenegrin web corpus CLASSLA-web.cnr 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (1.4 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus CLASSLA-web.bg 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (32.1 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian web corpus CLASSLA-web.sr 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (21.58 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus CLASSLA-web.hr 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (20.31 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bosnian web corpus CLASSLA-web.bs 1.0
    (Jožef Stefan Institute / 2024-03-26)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (6.36 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus CLASSLA-web.mk 1.0
    (Jožef Stefan Institute / 2024-03-25)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (4.48 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian web corpus CLASSLA-web.sl 1.0
    (Jožef Stefan Institute / 2024-03-22)
    
    Author(s):
    Ljubešić, Nikola ; Rupnik, Peter and Kuzman, Taja
     This item contains 2 files (16.36 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1
    (Jožef Stefan Institute / 2023-06-27)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola ; Zdravkova, Katerina and Erjavec, Tomaž
     This item contains 1 file (2.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya ; Simov, Kiril and Krsnik, Luka
     This item contains 2 files (163.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 2 files (190.67 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Macedonian 2.1
    (Jožef Stefan Institute / 2023-06-27)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja and Erjavec, Tomaž
     This item contains 2 files (147.17 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 1 file (52.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Greek web corpus MaCoCu-el 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-24)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (16.23 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Catalan web corpus MaCoCu-ca 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-24)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (4.72 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Ukrainian web corpus MaCoCu-uk 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-24)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (24.58 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic web corpus MaCoCu-is 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-19)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (2.48 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (179.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (177.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (191.81 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (98.13 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 2 files (172.92 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 2 files (179.88 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (104.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (104.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (189.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Author(s):
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (98.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.79 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (12.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.12 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bosnian web corpus MaCoCu-bs 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Runić, Marija ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (2.21 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Montenegrin web corpus MaCoCu-cnr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (500.14 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian web corpus MaCoCu-sr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.62 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Albanian web corpus MaCoCu-sq 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.63 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish web corpus MaCoCu-tr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (15.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-19)
    
    Author(s):
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (5.57 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (504.03 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (2.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka ; Čibej, Jaka and Ljubešić, Nikola
     This item contains 1 file (2.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Ljubešić, Nikola ; Terčon, Luka and Čibej, Jaka
     This item contains 2 files (509.87 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for JOS dependency parsing of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 2 files (176.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for semantic role labeling of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Author(s):
    Terčon, Luka and Ljubešić, Nikola
     This item contains 1 file (58.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian Web Corpus PDRS 1.0
    (Institute for Serbian Language SANU / 2023-01-22)
    
    Author(s):
    Wasserscheidt, Philipp
     This item contains 7 files (6.46 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (12.9 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (28.22 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish web corpus MaCoCu-tr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (31.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (2.65 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (16.72 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic web corpus MaCoCu-is 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-29)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (4.55 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2022-04-28)
    
    Author(s):
    Bañón, Marta ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; van der Werff, Tobias ; Zaragoza, Jaume
     This item contains 3 files (4.09 GB).
     
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The Croatian web dictionary Mrežnik (A-F) 1.0
    (Institute for Croatian Language and Linguistics / 2021-12-23)
    
    Author(s):
    Hudeček, Lana ; Mihaljević, Milica ; Blagus Bartolec, Goranka ; Brač, Ivana ; Horvat, Joža ; Ivšić Majić, Dubravka ; Lewis, Kristian ; Kovačević, Barbara ; Kramarić, Martina ; Lazić, Daria ; Matas Ivanković, Ivana ; Matijević, Maja ; Mihaljević, Josip ; Pasini, Dinka ; Sučević-Međeral, Krešimir ; Vidović, Domagoj
     This item contains 2 files (7.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The Croatian Web Dictionary Mrežnik (ELEXIS)
    (Institute of Croatian Language and Linguistics / 2021-12-23)
    
    Author(s):
    Hudeček, Lana ; Mihaljević, Milica ; Blagus Bartolec, Goranka ; Brač, Ivana ; Horvat, Joža ; Ivšić Majić, Dubravka ; Lewis, Kristian ; Kovačević, Barbara ; Kramarić, Martina ; Lazić, Daria ; Matas Ivanković, Ivana ; Matijević, Maja ; Mihaljević, Josip ; Pasini, Dinka ; Sučević-Međeral, Krešimir ; Vidović, Domagoj
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene Web genre identification corpus GINCO 1.0
    (Jožef Stefan Institute / 2021-12-02)
    
    Author(s):
    Kuzman, Taja ; Brglez, Mojca ; Rupnik, Peter and Ljubešić, Nikola
     This item contains 2 files (1.77 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Montenegrin web corpus meWaC 1.0
    (Jožef Stefan Institute / 2021-05-13)
    
    Author(s):
    Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 2 files (2.47 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Comparable corpora of South-Slavic Wikipedias CLASSLA-Wikipedia 1.0
    (Jožef Stefan Institute / 2021-05-05)
    
    Author(s):
    Ljubešić, Nikola ; Markoski, Filip ; Markoska, Elena and Erjavec, Tomaž
     This item contains 7 files (5.04 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja ; Erjavec, Tomaž and Krsnik, Luka
     This item contains 2 files (146.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (178.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (160.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Serbian 1.1
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (90.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Serbian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.15 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Slovenian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Croatian 1.1
    (Jožef Stefan Institute / 2020-07-17)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (89.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Bulgarian 1.0
    (Jožef Stefan Institute; IICT-BAS / 2020-07-07)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 2 files (107.32 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.34 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Serbian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for UD dependency parsing of standard Croatian
    (Jožef Stefan Institute / 2019-10-11)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (1.13 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for UD dependency parsing of standard Serbian
    (Jožef Stefan Institute / 2019-10-11)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (624 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian
    (Jožef Stefan Institute / 2019-10-10)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (549.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Opinion corpus of Slovene web commentaries KKS 1.001
    (Faculty of Computer and Information Science, University of Ljubljana / 2017-05-28)
    
    Author(s):
    Kadunc, Klemen and Robnik-Šikonja, Marko
     This item contains 4 files (5.72 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    R crawlers for five Slovenian web media 1.0
    (Faculty of Information Studies Novo mesto / 2017-04-23)
    
    Author(s):
    Bučar, Jože
     This item contains 6 files (213.86 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Finnish web corpus fiWaC 1.0
    (Jožef Stefan Institute / 2016-09-20)
    
    Author(s):
    Ljubešić, Nikola ; Pirinen, Tommi and Toral, Antonio
     This item contains 38 files (15.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian web corpus srWaC 1.1
    (Jožef Stefan Institute / 2016-05-12)
    
    Author(s):
    Ljubešić, Nikola and Klubička, Filip
     This item contains 6 files (3.51 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus hrWaC 2.1
    (Jožef Stefan Institute / 2016-05-12)
    
    Author(s):
    Ljubešić, Nikola and Klubička, Filip
     This item contains 15 files (9.21 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia under the Programme of "Research Infrastructures".