The ccŠolar corpus contains 1693 texts collected during 2016-2018, as part of the upgrade of the corpus Šolar project. The project aims were to increase the size of the Šolar 1.0 corpus and to improve text balance across regions and education level. For each text, the information on school (elementary or secondary), subject, level (grade or year), type of text, region and date of production is provided.
The ccŠolar 1.0 corpus is offered separately because the new texts were collected under CC BY 4.0 licence, a more open licence than the earlier texts.
Ministry of Culture3340-15-141006"Upgrade of Šolar Corpus"ARRS (Slovenian Research Agency)I0-0051"Centre for Applied Linguistics (CUJ)"University of LjubljanaI0-0022"Network of Research Infrastructure Centres (MRIC)"