dc.contributor.author | Reher, Špela |
dc.contributor.author | Erjavec, Tomaž |
dc.contributor.author | Fišer, Darja |
dc.date.accessioned | 2017-10-18T18:28:09Z |
dc.date.available | 2017-10-18T18:28:09Z |
dc.date.issued | 2017-10-13 |
dc.identifier.uri | http://hdl.handle.net/11356/1154 |
dc.description | Janes-Preklop is a corpus of Slovene tweets that is manually annotated for code-switching (the use of words from two or more languages within one sentence or utterance), according to the supplied typology. Words in the corpus are also automatically tagged with MSDs and lemmas. |
dc.language.iso | slv |
dc.publisher | Jožef Stefan Institute |
dc.relation.isreferencedby | http://nl.ijs.si/janes/wp-content/uploads/2017/09/Magistrsko-delo_%C5%A0pela-Reher_final.pdf |
dc.relation.isreferencedby | http://nl.ijs.si/janes/viri/rocno-oznaceni-korpusi/#Janes-Preklop |
dc.relation.isreferencedby | https://doi.org/10.1007/s10579-018-9425-z |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | http://nl.ijs.si/janes/ |
dc.subject | computer-mediated communication |
dc.subject | |
dc.subject | code-switching |
dc.subject | TEI |
dc.subject | manual annotation |
dc.title | Tweet code-switching corpus Janes-Preklop 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
hidden | false |
hasMetadata | false |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Tomaž Erjavec tomaz.erjavec@ijs.si Jožef Stefan Institute |
sponsor | ARRS (Slovenian Research Agency) J6-6842 JANES: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene nationalFunds |
size.info | 1104 texts |
size.info | 19769 tokens |
files.count | 4 |
files.size | 1337833 |
featuredService.kontext | Search|https://www.clarin.si/kontext/first_form?corpname=janes_preklop |
featuredService.noske | Search|https://www.clarin.si/ske/#dashboard?corpname=janes_preklop |
Files in this item
Download all files in item (1.28 MB)
This item is Publicly Available and licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- Janes-preklop-smernice.pdf
- Size
- 314.32 KB
- Format
- Description
- Annotation guidelines (in Slovene)
- MD5
- 66b25e4cfda001deb10893fcdd196ac4
- Name
- Janes-Preklop.TEI.zip
- Size
- 609.2 KB
- Format
- application/zip
- Description
- Corpus in TEI format
- MD5
- 2764d76c96408efc99bbf84a922ffcda
- Janes-Preklop.TEI
- janes.preklop.back.xml (531 kB)
- schema
- tei_janes_doc.html (2 MB)
- tei_janes.rng (399 kB)
- tei_janes_schema.xml (2 kB)
- tei_janes.zip (44 kB)
- tei_janes.rnc (188 kB)
- janes.preklop.xml (19 kB)
- janes.preklop.body.xml (2 MB)
- 00README.txt (208 B)
- Name
- Janes-Preklop.vert.zip
- Size
- 183.27 KB
- Format
- application/zip
- Description
- Corpus in vertical format
- MD5
- 6b74c85e568a2720188b24dff848e855
- Janes-Preklop.vert
- janes_preklop.vert (1 MB)
- janes_preklop.regi (3 kB)
- 00README.txt (208 B)
- Name
- Janes-Preklop-Lexicon.zip
- Size
- 199.7 KB
- Format
- application/zip
- Description
- Lexicon of code-switched segments
- MD5
- 801f928a20ebd4c3505d9847b7f43461
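The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch, assuming the four files were saved under their listed names in a single local directory; the helper names `md5_of` and `verify` and the directory argument are illustrative and not part of the repository.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file listing of this item.
EXPECTED_MD5 = {
    "Janes-preklop-smernice.pdf": "66b25e4cfda001deb10893fcdd196ac4",
    "Janes-Preklop.TEI.zip": "2764d76c96408efc99bbf84a922ffcda",
    "Janes-Preklop.vert.zip": "6b74c85e568a2720188b24dff848e855",
    "Janes-Preklop-Lexicon.zip": "801f928a20ebd4c3505d9847b7f43461",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(download_dir, expected=EXPECTED_MD5):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, checksum in expected.items():
        path = Path(download_dir) / name
        results[name] = path.is_file() and md5_of(path) == checksum
    return results


if __name__ == "__main__":
    # Assumption: the downloads are in the current working directory.
    for name, ok in verify(".").items():
        print(f"{name}: {'OK' if ok else 'missing or checksum mismatch'}")
```

A file reported as missing or mismatched is most likely an incomplete download and should be fetched again.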