Show simple item record

 
dc.contributor.author Ljubešić, Nikola
dc.date.accessioned 2020-06-20T10:49:44Z
dc.date.available 2020-06-20T10:49:44Z
dc.date.issued 2020-06-19
dc.identifier.uri http://hdl.handle.net/11356/1321
dc.description This model for named entity recognition of standard Slovenian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the ssj500k training corpus (http://hdl.handle.net/11356/1210) and using the CLARIN.SI-embed.sl word embeddings (http://hdl.handle.net/11356/1204).
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://dx.doi.org/10.18653/v1/W19-3704
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://github.com/clarinsi/classla-stanfordnlp
dc.subject named entity recognition
dc.subject language model
dc.title The CLASSLA-StanfordNLP model for named entity recognition of standard Slovenian 1.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding CLARIN.SI data & tools
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor ARRS (Slovenian Research Agency) J7-8280 FRENK: Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society nationalFunds
sponsor ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds
files.count 2
files.size 111272344


 Files in this item

 Download all files in item (106.12 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
ssj500k
Size
46.1 MB
Format
Unknown
Description
Language model
MD5
3f5003e050962523985c2f90ef5cc3c8
 Download file
Icon
Name
ssj500k.pretrain.pt
Size
60.02 MB
Format
Unknown
Description
Pretrained word embeddings
MD5
b58fddac0f0e9befc32c57fa20fe8c59
 Download file

Show simple item record