Prikaži enostavni zapis vnosa
dc.contributor.author |
Ljubešić, Nikola |
dc.contributor.author |
Perovšek, Matic |
dc.contributor.author |
Erjavec, Tomaž |
dc.date.accessioned |
2017-11-27T15:14:31Z |
dc.date.available |
2017-11-27T15:14:31Z |
dc.date.issued |
2017-11-27 |
dc.identifier.uri |
http://hdl.handle.net/11356/1169 |
dc.description |
Word standardisation of non-standard language as found in user-generated content, using cSMTiser (https://github.com/clarinsi/csmtiser), a tool for text normalisation via character-level machine translation. The tool has been trained on the Janes-Norm dataset (http://hdl.handle.net/11356/1084) and background resources. |
dc.language.iso |
slv |
dc.publisher |
Jožef Stefan Institute |
dc.source.uri |
https://github.com/clarinsi/csmtiser |
dc.subject |
word normalisation |
dc.title |
cSMTiser: word standardisation |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
service |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
hidden |
hidden |
hasMetadata |
true |
has.files |
no |
branding |
CLARIN.SI data & tools |
contact.person |
Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute |
sponsor |
ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds |
sponsor |
Swiss National Science Foundation 160501 ReLDI Other |
sponsor |
ARRS (Slovenian Research Agency) P2-103 Knowledge Technologies nationalFunds |
files.count |
0 |
files.size |
0 |
Prikaži enostavni zapis vnosa