dc.contributor.author Ljubešić, Nikola
dc.contributor.author Zdravkova, Katerina
dc.contributor.author Stojanoska, Sanja
dc.contributor.author Erjavec, Tomaž
dc.date.accessioned 2020-11-05T17:40:57Z
dc.date.available 2020-11-05T17:40:57Z
dc.date.issued 2020-11-05
dc.identifier.uri http://hdl.handle.net/11356/1373
dc.description This model for morphosyntactic annotation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus (to be published) and using the Macedonian CLARIN.SI word embeddings (http://hdl.handle.net/11356/1359). The model simultaneously produces UPOS, FEATS and XPOS (MULTEXT-East) labels. The estimated F1 score of the XPOS annotations is ~97.6.
dc.language.iso mkd
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://dx.doi.org/10.18653/v1/W19-3704
dc.relation.isreplacedby http://hdl.handle.net/11356/1395
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://github.com/clarinsi/classla-stanfordnlp
dc.subject part-of-speech tagging
dc.subject language model
dc.title The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
hidden hidden
has.files yes
branding CLARIN.SI data & tools
contact.person Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds
files.count 2
files.size 245556532


 Files in this item

This item is publicly available and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Total size of all files: 234.18 MB

Name: mk
Size: 44.37 MB
Format: Unknown
Description: Language model
MD5: 10e5032f17ad7f68ccd6b6fce72ba036

Name: mk.pretrain.pt.zip
Size: 189.81 MB
Format: application/zip
Description: Pretrained word embeddings
MD5: 265a4eceebe5006cbbceaaa5531f6858
Contents: mk.pretrain.pt (273 MB)
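
The MD5 values listed for the two files can be used to check that a download completed without corruption. The following is a minimal sketch, assuming both files were saved under their listed names in a single directory; the helper names `md5_of_file` and `verify_downloads` are hypothetical and not part of the CLASSLA-StanfordNLP tool.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "mk": "10e5032f17ad7f68ccd6b6fce72ba036",
    "mk.pretrain.pt.zip": "265a4eceebe5006cbbceaaa5531f6858",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    # Assumes the files were downloaded into the current directory.
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

Reading in chunks keeps memory use low, which matters for the 189.81 MB embeddings archive.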
