dc.contributor.author |
Klemen, Matej |
dc.contributor.author |
Arhar Holdt, Špela |
dc.contributor.author |
Pollak, Senja |
dc.date.accessioned |
2022-11-14T09:54:38Z |
dc.date.available |
2022-11-14T09:54:38Z |
dc.date.issued |
2022-11-11 |
dc.identifier.uri |
http://hdl.handle.net/11356/1697 |
dc.description |
The Core vocabulary for Slovenian as L2 is based on an analysis of the vocabulary in the KUUS corpus (http://hdl.handle.net/11356/1696), which consists of textbooks for Slovenian as a second and foreign language. A list of 5273 words was compiled by exporting the lemmas from the corpus, comparing them with the Reference List of Slovene Frequent Common Words (Pollak et al. 2020, http://hdl.handle.net/11356/1346), and reviewing the result manually. The lemmas were classified into the first three CEFR levels: 350 words are labelled A1-core, 864 A1-larger, 1451 A2, and 2608 B1. The file is in tab-separated format and contains, for each entry, the lemma, its part of speech (following the MULTEXT-East tagset for Slovenian), whether the lemma appears in the Reference List of Slovene Frequent Common Words, and its relative average frequency.
The word lists are presented in more detail in: KLEMEN, Matej, ARHAR HOLDT, Špela, POLLAK, Senja, KOSEM, Iztok, HUBER, Damjan, LUTAR, Mateja, 2022: Korpus učbenikov za učenje slovenščine kot drugega in tujega jezika. Nataša Pirih Svetina, Ina Ferbežar (eds.): Na stičišču svetov: slovenščina kot drugi in tuji jezik. Obdobja 41. Ljubljana: Založba Univerze v Ljubljani. 165–174. DOI: https://doi.org/10.4312/Obdobja.41.2784-7152. |
dc.language.iso |
slv |
dc.publisher |
Centre for Slovene as a Second and Foreign Language, University of Ljubljana |
dc.publisher |
Centre for Language Resources and Technologies, University of Ljubljana |
dc.relation.isreferencedby |
https://doi.org/10.4312/Obdobja.41.2784-7152 |
dc.rights |
CLARIN.SI Licence ACA ID-BY-NC-INF-NORED 1.0 |
dc.rights.uri |
https://clarin.si/repository/xmlui/page/licence-aca-id-by-nc-inf-nored-1.0 |
dc.rights.label |
ACA |
dc.source.uri |
https://centerslo.si/KUUS |
dc.subject |
Slovenian as L2 |
dc.subject |
CEFR |
dc.subject |
vocabulary |
dc.title |
Core vocabulary for Slovenian as L2 1.0 |
dc.type |
lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType |
wordList |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
has.files |
yes |
branding |
CLARIN.SI data & tools |
contact.person |
Matej Klemen matej.klemen@ff.uni-lj.si Centre for Slovene as a Second and Foreign Language, University of Ljubljana |
sponsor |
Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds |
sponsor |
ARRS J7-3159 Empirical foundations for digitally-supported development of writing skills nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) P2-103 Knowledge Technologies nationalFunds |
sponsor |
Centre for Slovene as a Second and Foreign Language, University of Ljubljana - KUUS ownFunds |
size.info |
5273 entries |
files.count |
1 |
files.size |
144705 |
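
The tab-separated layout described in dc.description could be read along the following lines. This is only a sketch under stated assumptions: the column order (lemma, part of speech, reference-list flag, relative average frequency) is taken from the order in which the description lists them, the name `read_entries` and the `Entry` fields are hypothetical, and the sample rows are invented for illustration and are not taken from the resource. The encoding of the reference-list flag and the presence of a header row or further columns are not documented in this record, so the flag is kept as raw text, any further columns are preserved untouched, and rows whose fourth field is not numeric are skipped.

```python
import csv
import io
from typing import Iterator, NamedTuple, TextIO


class Entry(NamedTuple):
    lemma: str
    pos: str                # MULTEXT-East tag, kept as-is
    in_reference_list: str  # raw text; the flag's encoding is an unknown here
    rel_frequency: float
    extra: tuple            # any further columns, preserved untouched


def read_entries(handle: TextIO) -> Iterator[Entry]:
    """Yield one Entry per well-formed row of a tab-separated word list."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 4:
            continue  # blank or truncated line
        try:
            # Assumption: tolerate a decimal comma as well as a decimal point.
            freq = float(row[3].replace(",", "."))
        except ValueError:
            continue  # e.g. a header row, if the file has one
        yield Entry(row[0], row[1], row[2], freq, tuple(row[4:]))


# Invented sample rows, for illustration only.
SAMPLE = "hiša\tNcf\tyes\t120.0\ndelati\tVm\tno\t45,5\n"

entries = list(read_entries(io.StringIO(SAMPLE)))
```

For the real file, an open text handle (for example one opened with `encoding="utf-8"`) would be passed in place of the `io.StringIO` sample; the actual file name is not given in this record.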