Show simple item record

 
dc.contributor.author Ljubešić, Nikola
dc.contributor.author Perovšek, Matic
dc.contributor.author Erjavec, Tomaž
dc.date.accessioned 2017-11-27T15:14:31Z
dc.date.available 2017-11-27T15:14:31Z
dc.date.issued 2017-11-27
dc.identifier.uri http://hdl.handle.net/11356/1169
dc.description Word standardisation of non-standard language as found in user-generated content, using cSMTiser (https://github.com/clarinsi/csmtiser), a tool for text normalisation via character-level machine translation. The tool has been trained on the Janes-Norm dataset (http://hdl.handle.net/11356/1084) and background resources.
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.source.uri https://github.com/clarinsi/csmtiser
dc.subject word normalisation
dc.title cSMTiser: word standardisation
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType service
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
hidden hidden
hasMetadata true
has.files no
branding CLARIN.SI data & tools
contact.person Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds
sponsor Swiss National Science Foundation 160501 ReLDI Other
sponsor ARRS (Slovenian Research Agency) P2-103 Knowledge Technologies nationalFunds
files.count 0
files.size 0


Show simple item record