dc.contributor.author | Arhar Holdt, Špela |
dc.contributor.author | Goli, Teja |
dc.contributor.author | Lavrič, Polona |
dc.contributor.author | Laskowski, Cyprian |
dc.contributor.author | Klemenc, Bojan |
dc.contributor.author | Rozman, Tadeja |
dc.contributor.author | Stritar Kučuk, Mojca |
dc.contributor.author | Krek, Simon |
dc.contributor.author | Krapš Vodopivec, Irena |
dc.contributor.author | Stabej, Marko |
dc.contributor.author | Kosem, Iztok |
dc.date.accessioned | 2019-11-08T07:58:22Z |
dc.date.available | 2019-11-08T07:58:22Z |
dc.date.issued | 2019-07-08 |
dc.identifier.uri | http://hdl.handle.net/11356/1231 |
dc.description | The corpus contains 2094 texts from the corpus Šolar 2.0 (http://hdl.handle.net/11356/1214), i.e. only those in which error annotations can be found. For each text, the information on school (elementary or secondary), subject, level (grade or year), type of text, region and date of production is provided. The original error annotations from Šolar 1.0 have been re-categorized according to a new system (the specifications in Slovene are attached). There are 36,671 error annotations in total, which also include corrections made by teachers. The corpus consists of 756,130 words from student texts (this word count does not include teacher corrections). |
dc.language.iso | slv |
dc.publisher | Trojina, Institute for Applied Slovene Studies |
dc.publisher | Centre for Language Resources and Technologies, University of Ljubljana |
dc.relation.replaces | http://hdl.handle.net/11356/1036 |
dc.relation.isreplacedby | http://hdl.handle.net/11356/1589 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://www.cjvt.si/raziskovalno-delo/projekti-cjvt/korpus-solar/ |
dc.subject | developmental corpus |
dc.subject | student writing |
dc.subject | error annotation |
dc.title | Error-annotated developmental corpus Šolar 2.0 Error |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Iztok Kosem iztok.kosem@ff.uni-lj.si Centre for Language Resources and Technologies, University of Ljubljana |
sponsor | Ministry of Culture 3340-15-141006 Upgrade of Šolar Corpus nationalFunds |
sponsor | ARRS (Slovenian Research Agency) I0-0051 Centre for Applied Linguistics (CUJ) nationalFunds |
sponsor | University of Ljubljana I0-0022 Network of Research Infrastructure Centres (MRIC) nationalFunds |
size.info | 2094 texts |
size.info | 756130 words |
size.info | 881170 tokens |
files.count | 2 |
files.size | 11289280 |
Files in this item
Download all files in item (10.77 MB)This item is
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)





- Name
- Solar2.0-Error.zip
- Size
- 10.07 MB
- Format
- application/zip
- Description
- Corpus in XML format
- MD5
- f098c51558bde82e21514a7685bc0f47
- Solar2.0-Error
- solar2-error.xml80 MB
- schema
- solar.xsd30 kB
- 00README.txt222 B

- Name
- Smernice za označevanje korpusa Šolar 2.0 (v1.0).pdf
- Size
- 714.09 KB
- Format
- Description
- Guidelines for error annotation (in Slovenian)
- MD5
- df44421fe80ec4efad1e8741fd5905e1