Files in this item
Download all files in item (42.85 MB)This item is
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name
- ssj500k.conllu.zip
- Size
- 8.55 MB
- Format
- application/zip
- Description
- Corpus in CONLL-U format: complete corpus with UD morphology and separately the UD syntactically annotated part split into train/dev/test
- MD5
- ebfd53684457bb4651c13b9bebb45423
- ssj500k.conllu
- sl_ssj-ud-dev.conllu1 MB
- ssj500k-morpho.conllu42 MB
- sl_ssj-ud-test.conllu1 MB
- 00README.txt148 B
- sl_ssj-ud-train.conllu9 MB
- Name
- ssj500k-en.TEI.zip
- Size
- 12.42 MB
- Format
- application/zip
- Description
- Corpus encoded in TEI format with annotations in English
- MD5
- 735165b029f6f739082c15e74ee7a7da
- ssj500k-en.TEI
- ssj500k.back.xml500 kB
- ssj500k-en.xml50 kB
- schema
- tei_clarin_doc.xml7 MB
- tei_clarin.zip87 kB
- tei_clarin_example.xml32 kB
- tei_clarin.rnc282 kB
- tei_clarin_schema.xml3 kB
- tei_clarin.dtd229 kB
- tei_clarin_doc.html7 MB
- tei_clarin.rng579 kB
- 00README.txt148 B
- ssj500k-en.body.xml101 MB
- Name
- ssj500k-sl.TEI.zip
- Size
- 12.42 MB
- Format
- application/zip
- Description
- Corpus encoded in TEI format with annotations in Slovene
- MD5
- 8d15e43fba438b2dcf328e7453efcd0a
- ssj500k-sl.TEI
- ssj500k-sl.xml50 kB
- ssj500k-sl.body.xml101 MB
- ssj500k.back.xml500 kB
- schema
- tei_clarin_doc.xml7 MB
- tei_clarin.zip87 kB
- tei_clarin_example.xml32 kB
- tei_clarin.rnc282 kB
- tei_clarin_schema.xml3 kB
- tei_clarin.dtd229 kB
- tei_clarin_doc.html7 MB
- tei_clarin.rng579 kB
- 00README.txt148 B
- Name
- ssj500k.vert.zip
- Size
- 9.46 MB
- Format
- application/zip
- Description
- Corpus in derived vertical (Sketch Engine / CQP) format
- MD5
- 8e7c003641c19be0c107b6043eb7f81a
- ssj500k.vert
- ssj500k23.vert89 MB
- ssj500k23.regi4 kB
- 00README.txt148 B