Show simple item record

 
dc.contributor.author Ljubešić, Nikola
dc.date.accessioned 2020-06-20T10:48:05Z
dc.date.available 2020-06-20T10:48:05Z
dc.date.issued 2020-06-19
dc.identifier.uri http://hdl.handle.net/11356/1322
dc.description This model for named entity recognition of standard Croatian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the hr500k training corpus (http://hdl.handle.net/11356/1183) and using the CLARIN.SI-embed.hr word embeddings (http://hdl.handle.net/11356/1205).
dc.language.iso hrv
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://dx.doi.org/10.18653/v1/W19-3704
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://github.com/clarinsi/classla-stanfordnlp
dc.subject named entity recognition
dc.subject language model
dc.title The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding CLARIN.SI data & tools
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor ARRS (Slovenian Research Agency) J7-8280 FRENK: Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society nationalFunds
sponsor ARRS (Slovenian Research Agency) N6-0099 LiLaH: Linguistic Landscape of Hate Speech nationalFunds
files.count 2
files.size 111507298


 Files in this item

 Download all files in item (106.34 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
hr500k
Size
46.11 MB
Format
Unknown
Description
Language model
MD5
82b21451ad9ab2b04787b6989824826a
 Download file
Icon
Name
hr500k.pretrain.pt
Size
60.23 MB
Format
Unknown
Description
Pretrained word embeddings
MD5
0532f53e5488441732fc2ebd283d8654
 Download file

Show simple item record