Prikaži enostavni zapis vnosa

 
dc.contributor.author Terčon, Luka
dc.contributor.author Dobrovoljc, Kaja
dc.contributor.author Ljubešić, Nikola
dc.date.accessioned 2025-02-09T11:43:49Z
dc.date.available 2025-02-09T11:43:49Z
dc.date.issued 2025-02-07
dc.identifier.uri http://hdl.handle.net/11356/2014
dc.description This model for named entity recognition of standard Slovenian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the SUK training corpus (http://hdl.handle.net/11356/1959) and using the CLARIN.SI-embed.sl 2.0 word embeddings (http://hdl.handle.net/11356/1791). The difference to the previous version of the model is that the model was trained using the SUK training corpus and uses new embeddings.
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://dx.doi.org/10.18653/v1/W19-3704
dc.relation.replaces http://hdl.handle.net/11356/1321
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://github.com/clarinsi/classla
dc.subject named entity recognition
dc.subject language model
dc.title The CLASSLA-Stanza model for named entity recognition of standard Slovenian 2.2
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding CLARIN.SI data & tools
contact.person Luka Terčon luka.tercon@fri.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor ARRS (Slovenian Research Agency) J7-8280 FRENK: Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society nationalFunds
sponsor ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds
sponsor ARRS (Slovenian Research Agency) J7-4642 MEZZANINE nationalFunds
sponsor ARRS (Slovenian Research Agency) Z6-4617 Treebank-Driven Approach to the Study of Spoken Slovenian nationalFunds
files.count 2
files.size 153815846


 Datoteke v tem vnosu

 Prenesi vse datoteke v vnosu (146.69 MB)
Icon
Ime
ner.zip
Velikost
41.42 MB
Format
application/zip
Opis
Language model
MD5
35e6e10542e1c660a4b6972fd4e34b90
 Prenesi datoteko  Predogled
 Predogled datoteke  
    • ner-1 B
Icon
Ime
sl_ssj.pretrain.zip
Velikost
105.27 MB
Format
application/zip
Opis
Pretrained word embeddings
MD5
653cfb0ad1eb2accb2f50ae22908b474
 Prenesi datoteko  Predogled
 Predogled datoteke  
    • sl_ssj.pretrain.pt-1 B

Prikaži enostavni zapis vnosa