Show simple item record

 
dc.contributor.author Arhar Holdt, Špela
dc.contributor.author Goli, Teja
dc.contributor.author Lavrič, Polona
dc.contributor.author Laskowski, Cyprian
dc.contributor.author Klemenc, Bojan
dc.contributor.author Rozman, Tadeja
dc.contributor.author Stritar Kučuk, Mojca
dc.contributor.author Krek, Simon
dc.contributor.author Krapš Vodopivec, Irena
dc.contributor.author Stabej, Marko
dc.contributor.author Kosem, Iztok
dc.date.accessioned 2019-11-08T07:58:22Z
dc.date.available 2019-11-08T07:58:22Z
dc.date.issued 2019-07-08
dc.identifier.uri http://hdl.handle.net/11356/1231
dc.description The corpus contains the 2,094 texts from the Šolar 2.0 corpus (http://hdl.handle.net/11356/1214) in which error annotations can be found. For each text, information on school (elementary or secondary), subject, level (grade or year), type of text, region, and date of production is provided. The original error annotations from Šolar 1.0 have been re-categorized according to a new system (the specifications, in Slovene, are attached). There are 36,671 error annotations in total, including corrections made by teachers. The corpus consists of 756,130 words from student texts (this word count does not include teacher corrections).
dc.language.iso slv
dc.publisher Trojina, Institute for Applied Slovene Studies
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.relation.replaces http://hdl.handle.net/11356/1036
dc.relation.isreplacedby http://hdl.handle.net/11356/1589
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label PUB
dc.source.uri https://www.cjvt.si/raziskovalno-delo/projekti-cjvt/korpus-solar/
dc.subject developmental corpus
dc.subject student writing
dc.subject error annotation
dc.title Error-annotated developmental corpus Šolar 2.0 Error
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Iztok Kosem iztok.kosem@ff.uni-lj.si Centre for Language Resources and Technologies, University of Ljubljana
sponsor Ministry of Culture 3340-15-141006 Upgrade of Šolar Corpus nationalFunds
sponsor ARRS (Slovenian Research Agency) I0-0051 Centre for Applied Linguistics (CUJ) nationalFunds
sponsor University of Ljubljana I0-0022 Network of Research Infrastructure Centres (MRIC) nationalFunds
size.info 2094 texts
size.info 756130 words
size.info 881170 tokens
files.count 2
files.size 11289280


Files in this item (10.77 MB in total)

Name: Solar2.0-Error.zip
Size: 10.07 MB
Format: application/zip
Description: Corpus in XML format
MD5: f098c51558bde82e21514a7685bc0f47

Name: Smernice za označevanje korpusa Šolar 2.0 (v1.0).pdf
Size: 714.09 KB
Format: PDF
Description: Guidelines for error annotation (in Slovenian)
MD5: df44421fe80ec4efad1e8741fd5905e1
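
The MD5 values listed for the two files can be used to check that a download arrived intact. The following is a minimal sketch, not part of the record itself: it assumes the files were saved locally under the names given in the listing, and the helper names `md5_of_file` and `verify_download` are hypothetical.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing in this record.
EXPECTED_MD5 = {
    "Solar2.0-Error.zip": "f098c51558bde82e21514a7685bc0f47",
    "Smernice za označevanje korpusa Šolar 2.0 (v1.0).pdf": "df44421fe80ec4efad1e8741fd5905e1",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Hypothetical helper: True if the file's MD5 matches the digest listed for its name.

    Assumes the local file keeps the name shown in the record; raises KeyError otherwise.
    """
    expected = EXPECTED_MD5[Path(path).name]
    return md5_of_file(path) == expected
```

For example, `verify_download("Solar2.0-Error.zip")` would return `True` only if the local copy hashes to the digest shown in the listing.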
