dc.contributor.author Reher, Špela
dc.contributor.author Erjavec, Tomaž
dc.contributor.author Fišer, Darja
dc.date.accessioned 2017-10-18T18:28:09Z
dc.date.available 2017-10-18T18:28:09Z
dc.date.issued 2017-10-13
dc.identifier.uri http://hdl.handle.net/11356/1154
dc.description Janes-Preklop is a corpus of Slovene tweets that is manually annotated for code-switching (the use of words from two or more languages within one sentence or utterance), according to the supplied typology. Words in the corpus are also automatically tagged with MSDs and lemmas.
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://nl.ijs.si/janes/wp-content/uploads/2017/09/Magistrsko-delo_%C5%A0pela-Reher_final.pdf
dc.relation.isreferencedby http://nl.ijs.si/janes/viri/rocno-oznaceni-korpusi/#Janes-Preklop
dc.relation.isreferencedby https://doi.org/10.1007/s10579-018-9425-z
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri http://nl.ijs.si/janes/
dc.subject computer-mediated communication
dc.subject Twitter
dc.subject code-switching
dc.subject TEI
dc.subject manual annotation
dc.title Tweet code-switching corpus Janes-Preklop 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN.SI data & tools
contact.person Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute
sponsor ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds
size.info 1104 texts
size.info 19769 tokens
files.count 4
files.size 1337833
featuredService.kontext Search|https://www.clarin.si/kontext/first_form?corpname=janes_preklop
featuredService.noske Search|https://www.clarin.si/noske/run.cgi/corp_info?corpname=janes_preklop


 Files in this item

 Download all files in item (1.28 MB)
This item is publicly available and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Name Janes-preklop-smernice.pdf
Size 314.32 KB
Format PDF
Description Annotation guidelines (in Slovene)
MD5 66b25e4cfda001deb10893fcdd196ac4

Name Janes-Preklop.TEI.zip
Size 609.2 KB
Format application/zip
Description Corpus in TEI format
MD5 2764d76c96408efc99bbf84a922ffcda
File Preview
  • Janes-Preklop.TEI
    • janes.preklop.back.xml (531 kB)
    • schema
      • tei_janes_doc.html (2 MB)
      • tei_janes.rng (399 kB)
      • tei_janes_schema.xml (2 kB)
      • tei_janes.zip (44 kB)
      • tei_janes.rnc (188 kB)
    • janes.preklop.xml (19 kB)
    • janes.preklop.body.xml (2 MB)
    • 00README.txt (208 B)

Name Janes-Preklop.vert.zip
Size 183.27 KB
Format application/zip
Description Corpus in vertical format
MD5 6b74c85e568a2720188b24dff848e855

Name Janes-Preklop-Lexicon.zip
Size 199.7 KB
Format application/zip
Description Lexicon of code-switched segments
MD5 801f928a20ebd4c3505d9847b7f43461
File Preview
    • janes.preklop.lexicon.tbl (150 kB)
    • janes.preklop.lexicon.xlsx (168 kB)
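
The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch, assuming the four files have been saved under their listed names in one local directory; the helper names `md5_of` and `verify` are illustrative and not part of the repository.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed for each file in this record.
EXPECTED_MD5 = {
    "Janes-preklop-smernice.pdf": "66b25e4cfda001deb10893fcdd196ac4",
    "Janes-Preklop.TEI.zip": "2764d76c96408efc99bbf84a922ffcda",
    "Janes-Preklop.vert.zip": "6b74c85e568a2720188b24dff848e855",
    "Janes-Preklop-Lexicon.zip": "801f928a20ebd4c3505d9847b7f43461",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory="."):
    """Map each listed file name to True (match), False (mismatch) or None (not found)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name  # assumed local location of the download
        results[name] = (md5_of(path) == expected) if path.is_file() else None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "not found"}
    for name, ok in verify().items():
        print(f"{name}: {labels[ok]}")
```

Run from the download directory, the script prints one status line per listed file. A mismatch suggests a corrupted or incomplete download, in which case the file should be fetched again.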
