Prikaži enostavni zapis vnosa
| dc.contributor.author |
Ljubešić, Nikola |
| dc.contributor.author |
Perovšek, Matic |
| dc.contributor.author |
Erjavec, Tomaž |
| dc.date.accessioned |
2017-11-27T15:14:31Z |
| dc.date.available |
2017-11-27T15:14:31Z |
| dc.date.issued |
2017-11-27 |
| dc.identifier.uri |
http://hdl.handle.net/11356/1169 |
| dc.description |
Word standardisation of non-standard language as found in user-generated content, using cSMTiser (https://github.com/clarinsi/csmtiser), a tool for text normalisation via character-level machine translation. The tool has been trained on the Janes-Norm dataset (http://hdl.handle.net/11356/1084) and background resources. |
| dc.language.iso |
slv |
| dc.publisher |
Jožef Stefan Institute |
| dc.source.uri |
https://github.com/clarinsi/csmtiser |
| dc.subject |
word normalisation |
| dc.title |
cSMTiser: word standardisation |
| dc.type |
toolService |
| metashare.ResourceInfo#ContentInfo.detailedType |
service |
| metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
| hidden |
hidden |
| hasMetadata |
true |
| has.files |
no |
| branding |
CLARIN.SI data & tools |
| contact.person |
Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute |
| sponsor |
ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds |
| sponsor |
Swiss National Science Foundation 160501 ReLDI Other |
| sponsor |
ARRS (Slovenian Research Agency) P2-103 Knowledge Technologies nationalFunds |
| files.count |
0 |
| files.size |
0 |
Prikaži enostavni zapis vnosa