| dc.contributor.author | Ljubešić, Nikola |
| dc.contributor.author | Perovšek, Matic |
| dc.contributor.author | Erjavec, Tomaž |
| dc.date.accessioned | 2017-11-27T15:14:31Z |
| dc.date.available | 2017-11-27T15:14:31Z |
| dc.date.issued | 2017-11-27 |
| dc.identifier.uri | http://hdl.handle.net/11356/1169 |
| dc.description | Word standardisation of non-standard language as found in user-generated content, using cSMTiser (https://github.com/clarinsi/csmtiser), a tool for text normalisation via character-level machine translation. The tool has been trained on the Janes-Norm dataset (http://hdl.handle.net/11356/1084) and background resources. |
| dc.language.iso | slv |
| dc.publisher | Jožef Stefan Institute |
| dc.source.uri | https://github.com/clarinsi/csmtiser |
| dc.subject | word normalisation |
| dc.title | cSMTiser: word standardisation |
| dc.type | toolService |
| metashare.ResourceInfo#ContentInfo.detailedType | service |
| metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
| hidden | hidden |
| hasMetadata | true |
| has.files | no |
| branding | CLARIN.SI data & tools |
| contact.person | Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute |
| sponsor | ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds |
| sponsor | Swiss National Science Foundation 160501 ReLDI Other |
| sponsor | ARRS (Slovenian Research Agency) P2-103 Knowledge Technologies nationalFunds |
| files.count | 0 |
| files.size | 0 |