Show simple item record

 
dc.contributor.author Kosem, Iztok
dc.contributor.author Rozman, Tadeja
dc.contributor.author Pori, Eva
dc.contributor.author Arhar Holdt, Špela
dc.contributor.author Kocjančič, Polonca
dc.contributor.author Laskowski, Cyprian
dc.contributor.author Klemenc, Bojan
dc.date.accessioned 2019-07-16T11:14:43Z
dc.date.available 2019-07-16T11:14:43Z
dc.date.issued 2019-07-16
dc.identifier.uri http://hdl.handle.net/11356/1224
dc.description The ccŠolar corpus contains 1693 texts collected during 2016-2018, as part of the upgrade of the corpus Šolar project. The project aims were to increase the size of the Šolar 1.0 corpus and to improve text balance across regions and education level. For each text, the information on school (elementary or secondary), subject, level (grade or year), type of text, region and date of production is provided. The ccŠolar 1.0 corpus is offered separately because the new texts were collected under CC BY 4.0 licence, a more open licence than the earlier texts.
dc.language.iso slv
dc.publisher Trojina, Institute for Applied Slovene Studies
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://www.cjvt.si/raziskovalno-delo/projekti-cjvt/korpus-solar/
dc.subject developmental corpus
dc.subject student writing
dc.title Developmental corpus ccŠolar 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Iztok Kosem iztok.kosem@ff.uni-lj.si Centre for Language Resources and Technologies, University of Ljubljana
sponsor Ministry of Culture 3340-15-141006 Upgrade of Šolar Corpus nationalFunds
sponsor ARRS (Slovenian Research Agency) I0-0051 Centre for Applied Linguistics (CUJ) nationalFunds
sponsor University of Ljubljana I0-0022 Network of Research Infrastructure Centres (MRIC) nationalFunds
size.info 1693 texts
size.info 468821 words
size.info 540868 tokens
files.count 1
files.size 6111742


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Distributed under Creative Commons Attribution Required
Icon
Name
ccSolar1.0.zip
Size
5.83 MB
Format
application/zip
Description
Corpus in TEI format
MD5
03390cae483db47a1ad69e99611451b9
 Download file  Preview
 File Preview  
  • ccSolar1.0
    • ccSolar.xml43 MB
    • schema
      • tei_clarin.zip87 kB
      • tei_clarin.rnc291 kB
      • tei_clarin.dtd233 kB
      • tei_clarin.rng592 kB
    • 00README.txt214 B

Show simple item record